multiple-gene biological processes: Topics by WorldWideScience.org

Sample records for multiple-gene biological processes

Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology.

Science.gov (United States)

Soltis, P S; Soltis, D E; Chase, M W

1999-11-25

Comparative biology requires a firm phylogenetic foundation to uncover and understand patterns of diversification and evaluate hypotheses of the processes responsible for these patterns. In the angiosperms, studies of diversification in floral form, stamen organization, reproductive biology, photosynthetic pathway, nitrogen-fixing symbioses and life histories have relied on either explicit or implied phylogenetic trees. Furthermore, to understand the evolution of specific genes and gene families, evaluate the extent of conservation of plant genomes and make proper sense of the huge volume of molecular genetic data available for model organisms such as Arabidopsis, Antirrhinum, maize, rice and wheat, a phylogenetic perspective is necessary. Here we report the results of parsimony analyses of DNA sequences of the plastid genes rbcL and atpB and the nuclear 18S rDNA for 560 species of angiosperms and seven non-flowering seed plants and show a well-resolved and well-supported phylogenetic tree for the angiosperms for use in comparative biology.
Finding biological process modifications in cancer tissues by mining gene expression correlations

Directory of Open Access Journals (Sweden)

Storari Sergio

2006-01-01

Background: Through the use of DNA microarrays it is now possible to obtain quantitative measurements of the expression of thousands of genes from a biological sample. This technology yields a global view of gene expression that can be used in several ways. Functional insight into expression profiles is routinely obtained by using Gene Ontology terms associated with the cellular genes. In this paper, we deal with functional data mining from expression profiles, proposing a novel approach that studies the correlations between genes and their relations to Gene Ontology (GO). By using this "functional correlations comparison" we explore all possible pairs of genes, identifying the affected biological processes by analyzing gene expression patterns in a pair-wise manner and linking correlated pairs with Gene Ontology terms. Results: We apply this "functional correlations comparison" approach to identify the existing correlations in hepatocarcinoma (161 microarray experiments) and to reveal functional differences between normal liver and cancer tissues. The number of well-correlated pairs in each GO term highlights several differences in genetic interactions between cancer and normal tissues. We performed a bootstrap analysis in order to compute false detection rates (FDR) and confidence limits. Conclusion: Experimental results show the main advantage of the applied method: it picks up both general and specific GO terms (in particular, it shows a fine resolution in the specific GO terms). The results obtained by this novel method are highly coherent with the ones proposed by other cancer biology studies. Additionally, they highlight the most specific and interesting GO terms, helping the biologist to focus his/her studies on the most relevant biological processes.
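The pair-wise idea in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the gene names, expression values, term labels and the 0.8 correlation cutoff are all made-up assumptions, and Pearson correlation is used only as one plausible choice of correlation measure.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def correlated_pairs_per_term(expr, annotations, threshold=0.8):
    """Count, for each annotation term, the gene pairs that share the term
    and whose absolute correlation reaches the (assumed) threshold."""
    counts = {}
    for g1, g2 in combinations(sorted(expr), 2):
        if abs(pearson(expr[g1], expr[g2])) < threshold:
            continue
        shared = annotations.get(g1, set()) & annotations.get(g2, set())
        for term in shared:
            counts[term] = counts.get(term, 0) + 1
    return counts


# Hypothetical toy data: three genes measured in four samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 5.9, 8.0],
    "geneC": [4.0, 1.0, 3.5, 2.0],
}
# Hypothetical term labels standing in for GO annotations.
annotations = {
    "geneA": {"TERM_1", "TERM_2"},
    "geneB": {"TERM_1"},
    "geneC": {"TERM_1", "TERM_2"},
}
counts = correlated_pairs_per_term(expr, annotations)
print(counts)
```

Running the same count separately on normal and tumour samples and comparing the per-term totals would mirror the comparison the abstract describes; the bootstrap step for FDR estimation is not shown here.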
Statistical approach for selection of biologically informative genes.

Science.gov (United States)

Das, Samarendra; Rai, Anil; Mishra, D C; Rai, Shesh N

2018-05-20

Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques proposed so far are based on either relevancy or redundancy measures. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biologically relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R package, BootMRMR, has been developed and is available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes
Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information

Directory of Open Access Journals (Sweden)

Lemke Ney

2009-09-01

Background: The identification of essential genes is important for the understanding of the minimal requirements for cellular life and for practical purposes, such as drug design. However, the experimental techniques for essential genes discovery are labor-intensive and time-consuming. Considering these experimental constraints, a computational approach capable of accurately predicting essential genes would be of great value. We therefore present here a machine learning-based computational approach relying on network topological features, cellular localization and biological process information for prediction of essential genes. Results: We constructed a decision tree-based meta-classifier and trained it on datasets with individual and grouped attributes (network topological features, cellular compartments and biological processes) to generate various predictors of essential genes. We showed that the predictors with better performances are those generated by datasets with integrated attributes. Using the predictor with all attributes, i.e., network topological features, cellular compartments and biological processes, we obtained the best predictor of essential genes that was then used to classify yeast genes with unknown essentiality status. Finally, we generated decision trees by training the J48 algorithm on datasets with all network topological features, cellular localization and biological process information to discover cellular rules for essentiality. We found that the number of protein physical interactions, the nuclear localization of proteins and the number of regulating transcription factors are the most important factors determining gene essentiality. Conclusion: We were able to demonstrate that network topological features, cellular localization and biological process information are reliable predictors of essential genes. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

Directory of Open Access Journals (Sweden)

Paules Richard S

2007-11-01

biological processes affected by IR- and/or UV-induced DNA damage. Conclusion: EPIG competed with CLICK and performed better than CAST in extracting patterns from simulated data. EPIG extracted more biologically informative patterns and co-expressed genes from both C. elegans and IR/UV-treated human fibroblasts. Using Gene Ontology analysis of the genes in the patterns extracted by EPIG, several key biological categories related to p53-dependent cell cycle control were revealed from the IR/UV data. Among them were mitotic cell cycle, DNA replication, DNA repair, cell cycle checkpoint, and G0-like status transition. EPIG can be applied to data sets from a variety of experimental designs.
Identifying novel genes and biological processes relevant to the development of cancer therapy-induced mucositis: An informative gene network analysis.

Science.gov (United States)

Reyes-Gibby, Cielito C; Melkonian, Stephanie C; Wang, Jian; Yu, Robert K; Shelburne, Samuel A; Lu, Charles; Gunn, Gary Brandon; Chambers, Mark S; Hanna, Ehab Y; Yeung, Sai-Ching J; Shete, Sanjay

2017-01-01

Mucositis is a complex, dose-limiting toxicity of chemotherapy or radiotherapy that leads to painful mouth ulcers, difficulty eating or swallowing, gastrointestinal distress, and reduced quality of life for patients with cancer. Mucositis is most common for those undergoing high-dose chemotherapy and hematopoietic stem cell transplantation and for those being treated for malignancies of the head and neck. Treatment and management of mucositis remain challenging. It is expected that multiple genes are involved in the formation, severity, and persistence of mucositis. We used Ingenuity Pathway Analysis (IPA), a novel network-based approach that integrates complex intracellular and intercellular interactions involved in diseases, to systematically explore the molecular complexity of mucositis. As a first step, we searched the literature to identify genes that harbor or are close to the genetic variants significantly associated with mucositis. Our literature review identified 27 candidate genes, of which ERCC1, XRCC1, and MTHFR were the most frequently studied for mucositis. On the basis of this 27-gene list, we used IPA to generate gene networks for mucositis. The most biologically significant novel molecules identified through IPA analyses included TP53, CTNNB1, MYC, RB1, P38 MAPK, and EP300. Additionally, uracil degradation II (reductive) and thymine degradation pathways (p = 1.06-08) were most significant. Finally, utilizing 66 SNPs within the 8 most connected IPA-derived candidate molecules, we conducted a genetic association study for oral mucositis in the head and neck cancer patients who were treated using chemotherapy and/or radiation therapy (186 head and neck cancer patients with oral mucositis vs. 699 head and neck cancer patients without oral mucositis). The top ranked gene identified through this association analysis was RB1 (rs2227311, p-value = 0.034, odds ratio = 0.67). In conclusion, gene network analysis identified novel molecules and biological
Evaluation of gene association methods for coexpression network construction and biological knowledge discovery.

Directory of Open Access Journals (Sweden)

Sapna Kumari

BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods (Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson) and focused on their true knowledge discovery rates in associating pathway genes and constructing coordination networks of regulatory genes. We also examined the behaviors of different methods on microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-Sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some of our analyses clearly show that the Pearson and Distance Covariance methods behave distinctly from the other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates a pre-analysis to identify the best performing method for gene association and coexpression network construction.
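To make the contrast between association measures concrete, the following sketch compares just two of the eight methods named above, Pearson and Spearman, on a monotone but non-linear relationship. The data are invented for illustration, and the helper functions are a plain-Python sketch rather than the code used in the study.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def ranks(values):
    """1-based ranks; tied values receive the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical expression levels of two genes across six samples;
# gene_y is a cubic (monotone, non-linear) function of gene_x.
gene_x = [1, 2, 3, 4, 5, 6]
gene_y = [v ** 3 for v in gene_x]

print("Pearson :", round(pearson(gene_x, gene_y), 3))
print("Spearman:", round(spearman(gene_x, gene_y), 3))
```

Because Spearman depends only on rank order, it reports a perfect association here while Pearson does not, which is one simple way a rank-based measure can differ from Pearson on expression data.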
Mining gene expression data of multiple sclerosis.

Directory of Open Access Journals (Sweden)

Pi Guo

Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example. Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control) were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associated with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined. An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score. The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.
Do two machine-learning based prognostic signatures for breast cancer capture the same biological processes?

Science.gov (United States)

Drier, Yotam; Domany, Eytan

2011-03-14

The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, and the biological relevance and meaning of the prognostic gene lists was questioned. Subsequently, proponents of the prognostic gene lists claimed that different lists do capture similar underlying biological processes and pathways. The present study places under scrutiny the validity of this claim, for two important gene lists that are at the focus of current large-scale validation efforts. We performed careful enrichment analysis, controlling the effects of multiple testing in a manner which takes into account the nested dependent structure of gene ontologies. In contradiction to several previous publications, we find that the only biological process or pathway for which statistically significant concordance can be claimed is cell proliferation, a process whose relevance and prognostic value was well known long before gene expression profiling. We found that the claims reported by others, of wider concordance between the biological processes captured by the two prognostic signatures studied, were found either to be lacking statistical rigor or were in fact based on addressing some other question.
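For readers unfamiliar with enrichment analysis, the sketch below shows the basic over-representation test on which such analyses are usually built: a one-sided hypergeometric p-value followed by a Bonferroni correction. It is only an illustration under stated assumptions. The counts are invented, and plain Bonferroni does not account for the nested, dependent structure of gene ontologies that the study above says it handles.

```python
from math import comb


def hypergeom_pvalue(k, n, K, N):
    """P(X >= k) when n genes are drawn without replacement from N genes,
    K of which carry the annotation of interest."""
    upper = min(n, K)
    total = comb(N, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


def bonferroni(p_values):
    """Bonferroni-adjusted p-values, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical example: a 4-gene signature drawn from a 20-gene universe
# in which 5 genes are annotated to the process; 2 signature genes hit it.
p = hypergeom_pvalue(k=2, n=4, K=5, N=20)
print(round(p, 4))

# Adjusting three hypothetical raw p-values for multiple testing.
print(bonferroni([0.01, 0.04, 0.5]))
```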
Gene-ontology enrichment analysis in two independent family-based samples highlights biologically plausible processes for autism spectrum disorders.

LENUS (Irish Health Repository)

Anney, Richard J L

2012-02-01

Recent genome-wide association studies (GWAS) have implicated a range of genes from discrete biological pathways in the aetiology of autism. However, despite the strong influence of genetic factors, association studies have yet to identify statistically robust, replicated major effect genes or SNPs. We apply the principle of the SNP ratio test methodology described by O'Dushlaine et al. to over 2100 families from the Autism Genome Project (AGP). Using a two-stage design we examine association enrichment in 5955 unique gene-ontology classifications across four groupings based on two phenotypic and two ancestral classifications. Based on estimates from simulation we identify an excess of association enrichment across all analyses. We observe enrichment in association for sets of genes involved in diverse biological processes, including pyruvate metabolism, transcription factor activation, cell-signalling and cell-cycle regulation. Both genes and processes that show enrichment have previously been examined in autistic disorders and offer biological plausibility to these findings.
Multiple levels of epigenetic control for bone biology and pathology.

Science.gov (United States)

Montecino, Martin; Stein, Gary; Stein, Janet; Zaidi, Kaleem; Aguilar, Rodrigo

2015-12-01

Multiple dimensions of epigenetic control contribute to regulation of gene expression that governs bone biology and pathology. Once confined to DNA methylation and a limited number of post-translational modifications of histone proteins, the definition of epigenetic mechanisms is expanding to include contributions of non-coding RNAs and mitotic bookmarking, a mechanism for retaining phenotype identity during cell proliferation. Together these different levels of epigenetic control of physiological processes and their perturbations that are associated with compromised gene expression during the onset and progression of disease, have contributed to an unprecedented understanding of the activities (operation) of the genomic landscape. Here, we address general concepts that explain the contribution of epigenetic control to the dynamic regulation of gene expression during eukaryotic transcription. This article is part of a Special Issue entitled Epigenetics and Bone. Copyright © 2015 Elsevier Inc. All rights reserved.
Biclustering with Flexible Plaid Models to Unravel Interactions between Biological Processes.

Science.gov (United States)

Henriques, Rui; Madeira, Sara C

2015-01-01

Genes can participate in multiple biological processes at a time and thus their expression can be seen as a composition of the contributions from the active processes. Biclustering under a plaid assumption allows the modeling of interactions between transcriptional modules or biclusters (subsets of genes with coherence across subsets of conditions) by assuming an additive composition of contributions in their overlapping areas. Despite the biological interest of plaid models, few biclustering algorithms consider plaid effects and, when they do, they place restrictions on the allowed types and structures of biclusters, and suffer from robustness problems by seizing exact additive matchings. We propose BiP (Biclustering using Plaid models), a biclustering algorithm with relaxations to allow expression levels to change in overlapping areas according to biologically meaningful assumptions (weighted and noise-tolerant composition of contributions). BiP can be used over existing biclustering solutions (seizing their benefits) as it is able to recover excluded areas due to unaccounted plaid effects and detect noisy areas non-explained by a plaid assumption, thus producing an explanatory model of overlapping transcriptional activity. Experiments on synthetic data support BiP's efficiency and effectiveness. The learned models from expression data unravel meaningful and non-trivial functional interactions between biological processes associated with putative regulatory modules.
AffyMiner: mining differentially expressed genes and biological knowledge in GeneChip microarray data

Directory of Open Access Journals (Sweden)

Xia Yuannan

2006-12-01

Background: DNA microarrays are a powerful tool for monitoring the expression of tens of thousands of genes simultaneously. With the advance of microarray technology, the challenge becomes how to analyze a large amount of microarray data and make biological sense of them. Affymetrix GeneChips are widely used microarrays, where a variety of statistical algorithms have been explored and used for detecting significant genes in the experiment. These methods rely solely on the quantitative data, i.e., signal intensity; however, qualitative data are also important parameters in detecting differentially expressed genes. Results: AffyMiner is a tool developed for detecting differentially expressed genes in Affymetrix GeneChip microarray data and for associating gene annotation and gene ontology information with the genes detected. AffyMiner consists of the functional modules GeneFinder, for detecting significant genes in a treatment versus control experiment, and GOTree, for mapping genes of interest onto the Gene Ontology (GO) space, together with interfaces to run Cluster, a program for clustering analysis, and GenMAPP, a program for pathway analysis. AffyMiner has been used for analyzing the GeneChip data and the results were presented in several publications. Conclusion: AffyMiner fills an important gap in finding differentially expressed genes in Affymetrix GeneChip microarray data. AffyMiner effectively deals with multiple replicates in the experiment and takes into account both quantitative and qualitative data in identifying significant genes. AffyMiner reduces the time and effort needed to compare data from multiple arrays and to interpret the possible biological implications associated with significant changes in a gene's expression.
Polyuridylylation and processing of transcripts from multiple gene minicircles in chloroplasts of the dinoflagellate Amphidinium carterae

KAUST Repository

Barbrook, Adrian C.; Dorrell, Richard G.; Burrows, Jennifer; Plenderleith, Lindsey J.; Nisbet, R. Ellen R.; Howe, Christopher J.

2012-01-01

-PCR to study transcription and transcript processing in the chloroplasts of Amphidinium carterae, a model peridinin-containing dinoflagellate. These organisms have a highly unusual chloroplast genome, with genes located on multiple small 'minicircle' elements
Piecewise deterministic processes in biological models

CERN Document Server

Rudnicki, Ryszard

2017-01-01

This book presents a concise introduction to piecewise deterministic Markov processes (PDMPs), with particular emphasis on their applications to biological models. Further, it presents examples of biological phenomena, such as gene activity and population growth, where different types of PDMPs appear: continuous time Markov chains, deterministic processes with jumps, processes with switching dynamics, and point processes. Subsequent chapters present the necessary tools from the theory of stochastic processes and semigroups of linear operators, as well as theoretical results concerning the long-time behaviour of stochastic semigroups induced by PDMPs and their applications to biological models. As such, the book offers a valuable resource for mathematicians and biologists alike. The first group will find new biological models that lead to interesting and often new mathematical questions, while the second can observe how to include seemingly disparate biological processes into a unified mathematical theory, and...
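A small simulation may help make the "processes with switching dynamics" mentioned above concrete. The sketch below is an illustrative assumption rather than a model taken from the book: a gene switches between inactive and active states at exponentially distributed random times, and between switches the protein level x follows the deterministic flow dx/dt = a*s - b*x, where s is 1 when the gene is active and 0 otherwise. All rate values are arbitrary.

```python
import math
import random


def simulate_gene_switch(t_end, k_on=1.0, k_off=1.0, a=2.0, b=1.0,
                         x0=0.0, seed=0):
    """Simulate a two-state gene as a piecewise deterministic process.

    Returns a list of (time, protein_level, gene_active) tuples recorded
    at every switching time and at t_end.
    """
    rng = random.Random(seed)
    t, x, active = 0.0, x0, False
    path = [(t, x, active)]
    while t < t_end:
        # Random part: exponential waiting time until the next switch.
        rate = k_off if active else k_on
        wait = rng.expovariate(rate)
        jumps = t + wait < t_end
        dt = wait if jumps else t_end - t
        # Deterministic part: exact solution of dx/dt = a*s - b*x over dt.
        target = a / b if active else 0.0
        x = target + (x - target) * math.exp(-b * dt)
        t = t + dt if jumps else t_end
        if jumps:
            active = not active
        path.append((t, x, active))
    return path


path = simulate_gene_switch(t_end=10.0)
for time, level, state in path[:5]:
    print(f"t={time:.3f}  x={level:.3f}  active={state}")
```

The protein level never leaves the interval [0, a/b], since each deterministic segment relaxes toward either 0 or a/b; long-time behaviour of this kind is what the stochastic-semigroup results described in the abstract address in general.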
Multiple Gene-Environment Interactions on the Angiogenesis Gene-Pathway Impact Rectal Cancer Risk and Survival

Directory of Open Access Journals (Sweden)

Noha Sharafeldin

2017-09-01

Characterization of gene-environment interactions (GEIs) in cancer is limited. We aimed at identifying GEIs in rectal cancer focusing on a relevant biologic process involving the angiogenesis pathway and relevant environmental exposures: cigarette smoking, alcohol consumption, and animal protein intake. We analyzed data from 747 rectal cancer cases and 956 controls from the Diet, Activity and Lifestyle as a Risk Factor for Rectal Cancer study. We applied a 3-step analysis approach: first, we searched for interactions among single nucleotide polymorphisms on the pathway genes; second, we searched for interactions among the genes, both steps using Logic regression; third, we examined the GEIs significant at the 5% level using logistic regression for cancer risk and Cox proportional hazards models for survival. A permutation-based test was used for multiple testing adjustment. We identified 8 significant GEIs associated with risk among 6 genes adjusting for multiple testing: TNF (OR = 1.85, 95% CI: 1.10, 3.11), TLR4 (OR = 2.34, 95% CI: 1.38, 3.98), and EGR2 (OR = 2.23, 95% CI: 1.04, 4.78) with smoking; IGF1R (OR = 1.69, 95% CI: 1.04, 2.72), TLR4 (OR = 2.10, 95% CI: 1.22, 3.60) and EGR2 (OR = 2.12, 95% CI: 1.01, 4.46) with alcohol; and PDGFB (OR = 1.75, 95% CI: 1.04, 2.92) and MMP1 (OR = 2.44, 95% CI: 1.24, 4.81) with protein. Five GEIs were associated with survival at the 5% significance level but not after multiple testing adjustment: CXCR1 (HR = 2.06, 95% CI: 1.13, 3.75) with smoking; and KDR (HR = 4.36, 95% CI: 1.62, 11.73), TLR2 (HR = 9.06, 95% CI: 1.14, 72.11), EGR2 (HR = 2.45, 95% CI: 1.42, 4.22), and EGFR (HR = 6.33, 95% CI: 1.95, 20.54) with protein. GEIs between angiogenesis genes and smoking, alcohol, and animal protein impact rectal cancer risk. Our results support the importance of considering the biologic hypothesis to characterize GEIs associated with cancer outcomes.
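As a reading aid for the "OR = ..., 95% CI: ..." figures above, the sketch below computes an unadjusted odds ratio and its Wald 95% confidence interval from a 2x2 case-control table. This is a simplification: the study reports estimates from logistic regression with interaction terms, not from raw 2x2 tables, and the counts used here are invented for illustration.

```python
from math import exp, log, sqrt


def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    z = 1.96 corresponds to an approximate 95% interval.
    Returns (odds_ratio, lower_bound, upper_bound).
    """
    a, b = exposed_cases, exposed_controls
    c, d = unexposed_cases, unexposed_controls
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the study.
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_:.2f}, 95% CI: {lo:.2f}, {hi:.2f}")
```

An interval whose lower bound stays above 1, as in every risk estimate quoted in the abstract, indicates an association that is significant at the 5% level before any multiple testing adjustment.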
A Systems’ Biology Approach to Study MicroRNA-Mediated Gene Regulatory Networks

Directory of Open Access Journals (Sweden)

Xin Lai

2013-01-01

MicroRNAs (miRNAs) are potent effectors in gene regulatory networks where aberrant miRNA expression can contribute to human diseases such as cancer. For a better understanding of the regulatory role of miRNAs in coordinating gene expression, we here present a systems biology approach combining data-driven modeling and model-driven experiments. Such an approach is characterized by an iterative process, including biological data acquisition and integration, network construction, mathematical modeling and experimental validation. To demonstrate the application of this approach, we adopt it to investigate mechanisms of collective repression on p21 by multiple miRNAs. We first construct a p21 regulatory network based on data from the literature and further expand it using algorithms that predict molecular interactions. Based on the network structure, a detailed mechanistic model is established and its parameter values are determined using data. Finally, the calibrated model is used to study the effect of different miRNA expression profiles and cooperative target regulation on p21 expression levels in different biological contexts.
Stochastic processes in cell biology

CERN Document Server

Bressloff, Paul C

2014-01-01

This book develops the theory of continuous and discrete stochastic processes within the context of cell biology. A wide range of biological topics are covered including normal and anomalous diffusion in complex cellular environments, stochastic ion channels and excitable systems, stochastic calcium signaling, molecular motors, intracellular transport, signal transduction, bacterial chemotaxis, robustness in gene networks, genetic switches and oscillators, cell polarization, polymerization, cellular length control, and branching processes. The book also provides a pedagogical introduction to the theory of stochastic process – Fokker Planck equations, stochastic differential equations, master equations and jump Markov processes, diffusion approximations and the system size expansion, first passage time problems, stochastic hybrid systems, reaction-diffusion equations, exclusion processes, WKB methods, martingales and branching processes, stochastic calculus, and numerical methods. This text is primarily...

Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data

Directory of Open Access Journals (Sweden)

de los Reyes Benildo G

2008-04-01

Full Text Available Abstract Background Integrating data from multiple global assays and curated databases is essential to understand the spatio-temporal interactions within cells. Different experiments measure cellular processes at various widths and depths, while databases contain biological information based on established facts or published data. Integrating these complementary datasets helps infer a mutually consistent transcriptional regulatory network (TRN) with strong similarity to the structure of the underlying genetic regulatory modules. Decomposing the TRN into a small set of recurring regulatory patterns, called network motifs (NM), facilitates the inference. Identifying NMs defined by specific transcription factors (TF) establishes the framework structure of a TRN and allows the inference of TF-target gene relationship. This paper introduces a computational framework for utilizing data from multiple sources to infer TF-target gene relationships on the basis of NMs. The data include time course gene expression profiles, genome-wide location analysis data, binding sequence data, and gene ontology (GO) information. Results The proposed computational framework was tested using gene expression data associated with cell cycle progression in yeast. Among 800 cell cycle related genes, 85 were identified as candidate TFs and classified into four previously defined NMs. The NMs for a subset of TFs are obtained from literature. Support vector machine (SVM) classifiers were used to estimate NMs for the remaining TFs. The potential downstream target genes for the TFs were clustered into 34 biologically significant groups. The relationships between TFs and potential target gene clusters were examined by training recurrent neural networks whose topologies mimic the NMs to which the TFs are classified. The identified relationships between TFs and gene clusters were evaluated using the following biological validation and statistical analyses: (1) Gene set enrichment
Genome-Wide Detection and Analysis of Multifunctional Genes

Science.gov (United States)

Pritykin, Yuri; Ghersi, Dario; Singh, Mona

2015-01-01

Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655
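The selection step described in this abstract (keeping genes annotated to at least two distinct biological processes) can be illustrated with a minimal Python sketch. This is not the authors' procedure: here two terms are simply treated as distinct when neither is an ancestor of the other in the ontology, and the function names, the `annotations` mapping (gene to set of terms) and the `parents` mapping (term to set of parent terms) are hypothetical structures chosen for the example.

```python
def ancestors(term, parents):
    """Return every ancestor of `term` in a DAG given as term -> set of parent terms."""
    seen = set()
    stack = [term]
    while stack:
        for parent in parents.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def multifunctional_genes(annotations, parents):
    """Return genes annotated to at least two terms that are unrelated by ancestry.

    Assumption for this sketch: "distinct" means neither term is an
    ancestor of the other; a real analysis may use a stricter criterion.
    """
    selected = set()
    for gene, terms in annotations.items():
        unique_terms = sorted(set(terms))
        term_ancestors = {t: ancestors(t, parents) for t in unique_terms}
        for i, first in enumerate(unique_terms):
            if any(
                second not in term_ancestors[first]
                and first not in term_ancestors[second]
                for second in unique_terms[i + 1:]
            ):
                selected.add(gene)
                break
    return selected


# Toy, made-up ontology fragment and annotations (illustrative only).
toy_parents = {"mitotic cell cycle": {"cell cycle"}}
toy_annotations = {
    "geneA": {"cell cycle", "mitotic cell cycle"},  # parent and child: one process
    "geneB": {"cell cycle", "DNA repair"},          # two unrelated processes
    "geneC": {"DNA repair"},
}
print(multifunctional_genes(toy_annotations, toy_parents))
```

Filtering out parent/child pairs keeps a gene from being counted as multifunctional merely because it carries both a general term and one of its specializations.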
Variation across mitochondrial gene trees provides evidence for systematic error: How much gene tree variation is biological?

Science.gov (United States)

Richards, Emilie J; Brown, Jeremy M; Barley, Anthony J; Chong, Rebecca A; Thomson, Robert C

2018-02-19

The use of large genomic datasets in phylogenetics has highlighted extensive topological variation across genes. Much of this discordance is assumed to result from biological processes. However, variation among gene trees can also be a consequence of systematic error driven by poor model fit, and the relative importance of biological versus methodological factors in explaining gene tree variation is a major unresolved question. Using mitochondrial genomes to control for biological causes of gene tree variation, we estimate the extent of gene tree discordance driven by systematic error and employ posterior prediction to highlight the role of model fit in producing this discordance. We find that the amount of discordance among mitochondrial gene trees is similar to the amount of discordance found in other studies that assume only biological causes of variation. This similarity suggests that the role of systematic error in generating gene tree variation is underappreciated and critical evaluation of fit between assumed models and the data used for inference is important for the resolution of unresolved phylogenetic questions.
Normalization and gene p-value estimation: issues in microarray data processing.

Science.gov (United States)

Fundel, Katrin; Küffner, Robert; Aigner, Thomas; Zimmer, Ralf

2008-05-28

Numerous methods exist for basic processing, e.g. normalization, of microarray gene expression data. These methods have an important effect on the final analysis outcome. Therefore, it is crucial to select methods appropriate for a given dataset in order to assure the validity and reliability of expression data analysis. Furthermore, biological interpretation requires expression values for genes, which are often represented by several spots or probe sets on a microarray. How to best integrate spot/probe set values into gene values has so far been a somewhat neglected problem. We present a case study comparing different between-array normalization methods with respect to the identification of differentially expressed genes. Our results show that it is feasible and necessary to use prior knowledge on gene expression measurements to select an adequate normalization method for the given data. Furthermore, we provide evidence that combining spot/probe set p-values into gene p-values for detecting differentially expressed genes has advantages compared to combining expression values for spots/probe sets into gene expression values. The comparison of different methods suggests using Stouffer's method for this purpose. The study has been conducted on gene expression experiments investigating human joint cartilage samples of osteoarthritis related groups: a cDNA microarray (83 samples, four groups) and an Affymetrix (26 samples, two groups) data set. The apparently straightforward steps of gene expression data analysis, e.g. between-array normalization and detection of differentially regulated genes, can be accomplished by numerous different methods. We analyzed multiple methods and the possible effects and thereby demonstrate the importance of the individual decisions taken during data processing. We give guidelines for evaluating normalization outcomes. An overview of these effects via appropriate measures and plots compared to prior knowledge is essential for the biological
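The combination of spot/probe set p-values into a gene-level p-value mentioned in this abstract can be illustrated with a minimal Python sketch of Stouffer's Z method. It is not the authors' code: the function name `stouffer_combine`, the optional weights and the clipping constant are assumptions made for this example, and the inputs are assumed to be one-sided p-values from independent tests.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combine(p_values, weights=None):
    """Combine one-sided p-values into a single p-value with Stouffer's Z method."""
    if not p_values:
        raise ValueError("need at least one p-value")
    if weights is None:
        weights = [1.0] * len(p_values)
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    std_normal = NormalDist()
    eps = 1e-12  # assumption: clip to avoid infinite z-scores at exactly 0 or 1
    z_scores = [
        std_normal.inv_cdf(1.0 - min(max(p, eps), 1.0 - eps)) for p in p_values
    ]
    # Weighted sum of z-scores, rescaled so the result is again standard normal
    # under the null hypothesis.
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    return 1.0 - std_normal.cdf(combined_z)


# Made-up p-values for three probe sets assumed to map to the same gene.
print(stouffer_combine([0.04, 0.10, 0.03]))
```

Several moderately small p-values pointing in the same direction yield a combined p-value smaller than any single one, which is the behavior that makes this kind of combination attractive for genes represented by multiple probe sets.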
Digital signal processing reveals circadian baseline oscillation in majority of mammalian genes.

Directory of Open Access Journals (Sweden)

Andrey A Ptitsyn

2007-06-01

Full Text Available In mammals, circadian periodicity has been described for gene expression in the hypothalamus and multiple peripheral tissues. It is accepted that 10%-15% of all genes oscillate in a daily rhythm, regulated by an intrinsic molecular clock. Statistical analyses of periodicity are limited by the small size of datasets and high levels of stochastic noise. Here, we propose a new approach applying digital signal processing algorithms separately to each group of genes oscillating in the same phase. Combined with the statistical tests for periodicity, this method identifies circadian baseline oscillation in almost 100% of all expressed genes. Consequently, circadian oscillation in gene expression should be evaluated in any study related to biological pathways. Changes in gene expression caused by mutations or regulation of environmental factors (such as photic stimuli or feeding) should be considered in the context of changes in the amplitude and phase of genetic oscillations.
A BAC-bacterial recombination method to generate physically linked multiple gene reporter DNA constructs

Directory of Open Access Journals (Sweden)

Gong Shiaochin

2009-03-01

Full Text Available Abstract Background Reporter gene mice are valuable animal models for biological research providing a gene expression readout that can contribute to cellular characterization within the context of a developmental process. With the advancement of bacterial recombination techniques to engineer reporter gene constructs from BAC genomic clones and the generation of optically distinguishable fluorescent protein reporter genes, there is an unprecedented capability to engineer more informative transgenic reporter mouse models relative to what has been traditionally available. Results We demonstrate here our first effort on the development of a three stage bacterial recombination strategy to physically link multiple genes together with their respective fluorescent protein (FP) reporters in one DNA fragment. This strategy uses bacterial recombination techniques to: (1) subclone genes of interest into BAC linking vectors, (2) insert desired reporter genes into respective genes and (3) link different gene-reporters together. As proof of concept, we have generated a single DNA fragment containing the genes Trap, Dmp1, and Ibsp driving the expression of ECFP, mCherry, and Topaz FP reporter genes, respectively. Using this DNA construct, we have successfully generated transgenic reporter mice that retain two to three gene readouts. Conclusion The three stage methodology to link multiple genes with their respective fluorescent protein reporter works with reasonable efficiency. Moreover, gene linkage allows for their common chromosomal integration into a single locus. However, the testing of this multi-reporter DNA construct by transgenesis does suggest that the linkage of two different genes together, despite their large size, can still create a positional effect. We believe that gene choice, genomic DNA fragment size and the presence of endogenous insulator elements are critical variables.
A BAC-bacterial recombination method to generate physically linked multiple gene reporter DNA constructs.

Science.gov (United States)

Maye, Peter; Stover, Mary Louise; Liu, Yaling; Rowe, David W; Gong, Shiaochin; Lichtler, Alexander C

2009-03-13

Reporter gene mice are valuable animal models for biological research providing a gene expression readout that can contribute to cellular characterization within the context of a developmental process. With the advancement of bacterial recombination techniques to engineer reporter gene constructs from BAC genomic clones and the generation of optically distinguishable fluorescent protein reporter genes, there is an unprecedented capability to engineer more informative transgenic reporter mouse models relative to what has been traditionally available. We demonstrate here our first effort on the development of a three stage bacterial recombination strategy to physically link multiple genes together with their respective fluorescent protein (FP) reporters in one DNA fragment. This strategy uses bacterial recombination techniques to: (1) subclone genes of interest into BAC linking vectors, (2) insert desired reporter genes into respective genes and (3) link different gene-reporters together. As proof of concept, we have generated a single DNA fragment containing the genes Trap, Dmp1, and Ibsp driving the expression of ECFP, mCherry, and Topaz FP reporter genes, respectively. Using this DNA construct, we have successfully generated transgenic reporter mice that retain two to three gene readouts. The three stage methodology to link multiple genes with their respective fluorescent protein reporter works with reasonable efficiency. Moreover, gene linkage allows for their common chromosomal integration into a single locus. However, the testing of this multi-reporter DNA construct by transgenesis does suggest that the linkage of two different genes together, despite their large size, can still create a positional effect. We believe that gene choice, genomic DNA fragment size and the presence of endogenous insulator elements are critical variables.
Multiple Realizability and Biological Laws

NARCIS (Netherlands)

Raerinne, Jani P.; Eronen, Markus I.

2012-01-01

We critically analyze Alexander Rosenberg's argument based on the multiple realizability of biological properties that there are no biological laws. The argument is intuitive and suggestive. Nevertheless, a closer analysis reveals that the argument rests on dubious assumptions about the nature of
Molecular biology III - Oncogenes and tumor suppressor genes

International Nuclear Information System (INIS)

Giaccia, Amato J.

1996-01-01

Purpose: The purpose of this course is to introduce to radiation oncologists the basic concepts of tumorigenesis, building on the information that will be presented in the first and second part of this series of lectures. Objective: Our objective is to increase radiation oncologists' understanding of the process of tumorigenesis, especially focusing on genes that are altered in many tumor types and that are potential candidates for novel molecular strategies. As strategies to treat cancer are becoming more sophisticated, it will be important for both the practitioner and academician to develop a basic understanding of the function of cancer 'genes'. This will be the third in a series of refresher courses that are meant to address recent advances in Cancer Biology in a way that both clinicians without previous knowledge of molecular biology and experienced researchers will find interesting. The lecture will begin with a basic overview of tumorigenesis, methods of detecting chromosome/DNA alterations, approaches used to isolate oncogenes and tumor suppressor genes, and their role in cell killing by apoptosis. Special attention will be given to oncogenes and tumor suppressor genes that are modulated by ionizing radiation and the tumor microenvironment. We will relate the biology of oncogenes and tumor suppressor genes to basic aspects of radiation biology that would be important in clinical practice. Finally, we will review recent studies on the prognostic significance of p53 mutations and apoptosis in tumor specimens. The main point of this lecture is to convey to both researcher and clinician the therapeutic ramifications of oncogene and tumor suppressor gene mutations found in human neoplasia
An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

Science.gov (United States)

Booma, P M; Prabhakaran, S; Dhanalakshmi, R

2014-01-01

Microarray gene expression datasets have attracted great attention among molecular biologists, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A heuristic search for a standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rates of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using a proximity measure based on an improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify associations between the clusters. Moreover, GL-PCPHC applies a pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for the GL-PCPHC model with standard benchmark gene expression datasets extracted from the UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.
Systems-based biological concordance and predictive reproducibility of gene set discovery methods in cardiovascular disease.

Science.gov (United States)

Azuaje, Francisco; Zheng, Huiru; Camargo, Anyela; Wang, Haiying

2011-08-01

The discovery of novel disease biomarkers is a crucial challenge for translational bioinformatics. Demonstration of both their classification power and reproducibility across independent datasets are essential requirements to assess their potential clinical relevance. Small datasets and multiplicity of putative biomarker sets may explain lack of predictive reproducibility. Studies based on pathway-driven discovery approaches have suggested that, despite such discrepancies, the resulting putative biomarkers tend to be implicated in common biological processes. Investigations of this problem have been mainly focused on datasets derived from cancer research. We investigated the predictive and functional concordance of five methods for discovering putative biomarkers in four independently-generated datasets from the cardiovascular disease domain. A diversity of biosignatures was identified by the different methods. However, we found strong biological process concordance between them, especially in the case of methods based on gene set analysis. With a few exceptions, we observed lack of classification reproducibility using independent datasets. Partial overlaps between our putative sets of biomarkers and the primary studies exist. Despite the observed limitations, pathway-driven or gene set analysis can predict potentially novel biomarkers and can jointly point to biomedically-relevant underlying molecular mechanisms. Copyright © 2011 Elsevier Inc. All rights reserved.
Revealing complex function, process and pathway interactions with high-throughput expression and biological annotation data.

Science.gov (United States)

Singh, Nitesh Kumar; Ernst, Mathias; Liebscher, Volkmar; Fuellen, Georg; Taher, Leila

2016-10-20

The biological relationships both between and within the functions, processes and pathways that operate within complex biological systems are only poorly characterized, making the interpretation of large scale gene expression datasets extremely challenging. Here, we present an approach that integrates gene expression and biological annotation data to identify and describe the interactions between biological functions, processes and pathways that govern a phenotype of interest. The product is a global, interconnected network, not of genes but of functions, processes and pathways, that represents the biological relationships within the system. We validated our approach on two high-throughput expression datasets describing organismal and organ development. Our findings are well supported by the available literature, confirming that developmental processes and apoptosis play key roles in cell differentiation. Furthermore, our results suggest that processes related to pluripotency and lineage commitment, which are known to be critical for development, interact mainly indirectly, through genes implicated in more general biological processes. Moreover, we provide evidence that supports the relevance of cell spatial organization in the developing liver for proper liver function. Our strategy can be viewed as an abstraction that is useful to interpret high-throughput data and devise further experiments.
Knowledge Enrichment Analysis for Human Tissue-Specific Genes Uncover New Biological Insights

Directory of Open Access Journals (Sweden)

Gong Xiu-Jun

2012-06-01

Full Text Available The expression and regulation of genes in different tissues are fundamental questions to be answered in biology. Knowledge enrichment analysis for tissue specific (TS) and housekeeping (HK) genes may help identify their roles in biological processes or diseases and gain new biological insights. In this paper, we performed the knowledge enrichment analysis for 17,343 genes in 84 human tissues using Gene Set Enrichment Analysis (GSEA) and Hypergeometric Analysis (HA) against three biological ontologies: Gene Ontology (GO), KEGG pathways and Disease Ontology (DO), respectively. The analysis results demonstrated that the functions of most gene groups are consistent with their tissue origins. Meanwhile, three interesting new associations for HK genes and the skeletal muscle tissue genes are found. Firstly, Hypergeometric analysis against the KEGG database for HK genes disclosed that three disease terms (Parkinson’s disease, Huntington’s disease, Alzheimer’s disease) are intensively enriched. Secondly, Hypergeometric analysis against the KEGG database for Skeletal Muscle tissue genes shows that two cardiac diseases, “Hypertrophic cardiomyopathy (HCM)” and “Arrhythmogenic right ventricular cardiomyopathy (ARVC)”, are heavily enriched, although they are considered to have no relationship with skeletal functions. Thirdly, “Prostate cancer” is intensively enriched in Hypergeometric analysis against the disease ontology (DO) for the Skeletal Muscle tissue genes, which is a highly unexpected phenomenon.
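The hypergeometric analysis referred to in this abstract can be illustrated with a minimal Python sketch of the standard over-representation test, which gives the probability of seeing at least the observed overlap between a gene group and a term by chance. This is a generic sketch, not the authors' pipeline; the function name `hypergeom_enrichment_p`, its parameter names and the example counts are assumptions made for illustration.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, overlap):
    """Upper-tail hypergeometric p-value, P(X >= overlap).

    population: total number of genes in the background
    annotated:  background genes carrying the term of interest
    sample:     size of the gene group being tested
    overlap:    genes in the group that carry the term
    """
    if not (0 <= annotated <= population and 0 <= sample <= population):
        raise ValueError("annotated and sample must lie between 0 and population")
    upper = min(annotated, sample)
    total = comb(population, sample)
    # Count the draws that contain k annotated genes, for every k >= overlap.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(max(overlap, 0), upper + 1)
    )
    return tail / total


# Made-up counts: 17,343 background genes (the figure quoted above),
# 150 of them annotated to a term, and a tissue-specific group of 200
# genes of which 12 carry the term.
print(hypergeom_enrichment_p(17343, 150, 200, 12))
```

In practice one such p-value is computed per term and the resulting set is corrected for multiple testing before any term is called enriched.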
Multiple coupled landscapes and non-adiabatic dynamics with applications to self-activating genes.

Science.gov (United States)

Chen, Cong; Zhang, Kun; Feng, Haidong; Sasai, Masaki; Wang, Jin

2015-11-21

Many physical, chemical and biochemical systems (e.g. electronic dynamics and gene regulatory networks) are governed by continuous stochastic processes (e.g. electron dynamics on a particular electronic energy surface and protein (gene product) synthesis) coupled with discrete processes (e.g. hopping among different electronic energy surfaces and on and off switching of genes). One can also think of the underlying dynamics as the continuous motion on a particular landscape and discrete hoppings among different landscapes. The main difference of such systems from the intra-landscape dynamics alone is the emergence of the timescale involved in transitions among different landscapes in addition to the timescale involved in a particular landscape. The adiabatic limit when inter-landscape hoppings are fast compared to continuous intra-landscape dynamics has been studied both analytically and numerically, but the analytical treatment of the non-adiabatic regime where the inter-landscape hoppings are slow or comparable to continuous intra-landscape dynamics remains challenging. In this study, we show that there exists a mathematical mapping of the dynamics on 2^N discretely coupled N-dimensional continuous landscapes onto one single landscape in a 2N-dimensional extended continuous space. On this 2N-dimensional landscape, eddy current emerges as a sign of non-equilibrium non-adiabatic dynamics and plays an important role in system evolution. Many interesting physical effects such as the enhancement of fluctuations, irreversibility, dissipation and optimal kinetics emerge due to non-adiabaticity manifested by the eddy current illustrated for an N = 1 self-activator. We further generalize our theory to the N-gene network with multiple binding sites and multiple synthesis rates for discretely coupled non-equilibrium stochastic physical and biological systems.
A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori.

Science.gov (United States)

Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

2016-03-25

Hox genes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hox genes can also function in terminally differentiated tissue of the lepidopteran Bombyx mori. In this species, Antennapedia (Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antp can regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antp in the posterior silk gland induced ectopic expression of major silk protein genes such as sericin-3, fhxh4, and fhxh5. These genes are normally expressed specifically in the middle silk gland, as is Antp. Therefore, the evidence strongly suggests that Antp activates these silk protein genes in the middle silk gland. The putative sericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antp directly activates their expression. We also found that the pattern of gene expression was well conserved between B. mori and the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori. We suggest that Hox genes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Building for Biology: A Gene Therapy Trial Infrastructure

Directory of Open Access Journals (Sweden)

Samuel Taylor-Alexander

2017-06-01

Full Text Available In this article, we examine the construction of the infrastructure for a Phase II gene therapy trial for Cystic Fibrosis (CF). Tracing the development of the material technologies and physical spaces used in the trial, we show how the trial infrastructure took form at the uncertain intersection of scientific norms, built environments, regulatory negotiations, patienthood, and the biologies of both disease and therapy. We define infrastructures as material and immaterial (including symbols and affect) composites that serve a selective distributive purpose and facilitate projects of making and doing. There is a politics to this distributive action, which is itself twofold, because whilst infrastructures enable and delimit the movement of matter, they also mediate the very activity for which they provide the grounds. An infrastructural focus allows us to show how purposeful connections are made in a context of epistemic and regulatory uncertainty. The gene therapy researchers were working in a context of multiple uncertainties, regarding not only how to do gene therapy, but also how to anticipate and enact ambiguous regulatory requirements in a context of limited resources (technical, spatial, and financial). At the same time, the trial infrastructure had to accommodate Cystic Fibrosis biology by bridging the gap between pathology and therapy. The consortium’s approach to treating CF required that they address concerns about contamination and safety while finding a way of getting a modified gene product into the lungs of the trial participants.
Biological data warehousing system for identifying transcriptional regulatory sites from gene expressions of microarray data.

Science.gov (United States)

Tsou, Ann-Ping; Sun, Yi-Ming; Liu, Chia-Lin; Huang, Hsien-Da; Horng, Jorng-Tzong; Tsai, Meng-Feng; Liu, Baw-Juine

2006-07-01

Identification of transcriptional regulatory sites plays an important role in the investigation of gene regulation. For this purpose, we designed and implemented a data warehouse to integrate multiple heterogeneous biological data sources with data types such as text-file, XML, image, MySQL database model, and Oracle database model. The utility of the biological data warehouse in predicting transcriptional regulatory sites of coregulated genes was explored using a synexpression group derived from a microarray study. Both the binding sites of known transcription factors and predicted over-represented (OR) oligonucleotides were demonstrated for the gene group. The potential biological roles of both known nucleotides and one OR nucleotide were demonstrated using bioassays. Therefore, the results from the wet-lab experiments reinforce the power and utility of the data warehouse as an approach to the genome-wide search for important transcription regulatory elements that are the key to many complex biological systems.
Profile of science process skills of Preservice Biology Teacher in General Biology Course

Science.gov (United States)

Susanti, R.; Anwar, Y.; Ermayanti

2018-04-01

This study aims to profile the science process skills of preservice biology teachers. The research took place at Sriwijaya University and involved 41 participants. To collect the data, the study used a multiple-choice test comprising 40 items to measure mastery of science process skills. The data were then analyzed descriptively. The results showed that communication was the strongest skill, at 81%, while the weakest were identifying variables and predicting (59%). In addition, mastery of basic science process skills was 72%, whereas that of integrated skills was slightly lower, at 67%. In general, the capability to apply science process skills varies among preservice biology teachers.
Immediate Early Genes Anchor a Biological Pathway of Proteins Required for Memory Formation, Long-Term Depression and Risk for Schizophrenia

Directory of Open Access Journals (Sweden)

Ketan K. Marballi

2018-02-01

result in insufficient electrophysiologic, immunologic, and neuroprotective, processes that these genes normally mediate. Continued adverse environmental experiences, over time, may thereby result in neuropathology that gives rise to the symptoms of schizophrenia. By combining multiple genes associated with schizophrenia susceptibility, in a functional cascade triggered by neuronal activity, the proposed biological pathway provides an explanation for both the polygenic and environmental influences that determine the complex etiology of this mental illness.
Immediate Early Genes Anchor a Biological Pathway of Proteins Required for Memory Formation, Long-Term Depression and Risk for Schizophrenia

Science.gov (United States)

Marballi, Ketan K.; Gallitano, Amelia L.

2018-01-01

insufficient electrophysiologic, immunologic, and neuroprotective, processes that these genes normally mediate. Continued adverse environmental experiences, over time, may thereby result in neuropathology that gives rise to the symptoms of schizophrenia. By combining multiple genes associated with schizophrenia susceptibility, in a functional cascade triggered by neuronal activity, the proposed biological pathway provides an explanation for both the polygenic and environmental influences that determine the complex etiology of this mental illness. PMID:29520222

Simultaneous inference of phenotype-associated genes and relevant tissues from GWAS data via Bayesian integration of multiple tissue-specific gene networks.

Science.gov (United States)

Wu, Mengmeng; Lin, Zhixiang; Ma, Shining; Chen, Ting; Jiang, Rui; Wong, Wing Hung

2017-12-01

Although genome-wide association studies (GWAS) have successfully identified thousands of genomic loci associated with hundreds of complex traits in the past decade, the debate about such problems as missing heritability and weak interpretability has been appealing for effective computational methods to facilitate the advanced analysis of the vast volume of existing and anticipated genetic data. Towards this goal, gene-level integrative GWAS analysis with the assumption that genes associated with a phenotype tend to be enriched in biological gene sets or gene networks has recently attracted much attention, due to such advantages as straightforward interpretation, less multiple testing burdens, and robustness across studies. However, existing methods in this category usually exploit non-tissue-specific gene networks and thus lack the ability to utilize informative tissue-specific characteristics. To overcome this limitation, we proposed a Bayesian approach called SIGNET (Simultaneously Inference of GeNEs and Tissues) to integrate GWAS data and multiple tissue-specific gene networks for the simultaneous inference of phenotype-associated genes and relevant tissues. Through extensive simulation studies, we showed the effectiveness of our method in finding both associated genes and relevant tissues for a phenotype. In applications to real GWAS data of 14 complex phenotypes, we demonstrated the power of our method in both deciphering genetic basis and discovering biological insights of a phenotype. With this understanding, we expect to see SIGNET as a valuable tool for integrative GWAS analysis, thereby boosting the prevention, diagnosis, and treatment of human inherited diseases and eventually facilitating precision medicine.
An integrative approach to inferring biologically meaningful gene modules

Directory of Open Access Journals (Sweden)

Wang Kai

2011-07-01

Full Text Available Abstract Background The ability to construct biologically meaningful gene networks and modules is critical for contemporary systems biology. Though recent studies have demonstrated the power of using gene modules to shed light on the functioning of complex biological systems, most modules in these networks have shown little association with meaningful biological function. We have devised a method which directly incorporates gene ontology (GO) annotation in the construction of gene modules in order to gain better functional association. Results We have devised a method, Semantic Similarity-Integrated approach for Modularization (SSIM), that integrates various gene-gene pairwise similarity values, including information obtained from gene expression, protein-protein interactions and GO annotations, in the construction of modules using affinity propagation clustering. We demonstrated the performance of the proposed method using data from two complex biological responses: 1. the osmotic shock response in Saccharomyces cerevisiae, and 2. the prion-induced pathogenic mouse model. In comparison with two previously reported algorithms, modules identified by SSIM showed significantly stronger association with biological functions. Conclusions The incorporation of semantic similarity based on GO annotation with gene expression and protein-protein interaction data can greatly enhance the functional relevance of inferred gene modules. In addition, the SSIM approach can also reveal the hierarchical structure of gene modules to gain a broader functional view of the biological system. Hence, the proposed method can facilitate comprehensive and in-depth analysis of high throughput experimental data at the gene network level.
Genes2WordCloud: a quick way to identify biological themes from gene lists and free text.

Science.gov (United States)

Baroukh, Caroline; Jenkins, Sherry L; Dannenfelser, Ruth; Ma'ayan, Avi

2011-10-13

Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.
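The abstract above says that terms are weighted by word frequency before the cloud is drawn. As a rough illustration of that one step only, here is a minimal Python sketch. It is not the Genes2WordCloud implementation (which the abstract describes as built on Java, Processing, AJAX, mySQL and PHP); the function name `term_weights`, the tiny stop-word list and the sample sentence are all assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; a real tool would use a larger one.
STOPWORDS = {"the", "and", "of", "a", "an", "in", "to", "is", "for", "with"}

def term_weights(text, top_n=5):
    """Return {term: weight} for the top_n most frequent non-stop-word terms.

    Each weight is the term's count divided by the highest count,
    so the most frequent term always has weight 1.0.
    """
    words = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    if not counts:
        return {}
    top = counts.most_common(top_n)
    max_count = top[0][1]
    return {word: count / max_count for word, count in top}

# Invented sample text, used only to exercise the function.
sample = "Kinase signaling regulates the kinase cascade; kinase activity and signaling."
weights = term_weights(sample)
```

A renderer could then map each weight to a font size; how the real tool scales or colors terms is not stated in the abstract.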
Networks in biological systems: An investigation of the Gene Ontology as an evolving network

International Nuclear Information System (INIS)

Coronnello, C; Tumminello, M; Micciche, S; Mantegna, R.N.

2009-01-01

Many biological systems can be described as networks where different elements interact, in order to perform biological processes. We introduce a network associated with the Gene Ontology. Specifically, we construct a correlation-based network where the vertices are the terms of the Gene Ontology and the link between each two terms is weighted on the basis of the number of genes that they have in common. We analyze a filtered network obtained from the correlation-based network and we characterize its evolution over different releases of the Gene Ontology.
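The network construction described above (vertices are Gene Ontology terms, links weighted by the number of genes two terms share) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the raw shared-gene count is used directly as the weight, whereas the authors' exact correlation measure and filtering procedure are not given in the abstract. The function name `shared_gene_network` and the toy annotations are hypothetical.

```python
from itertools import combinations

def shared_gene_network(term_to_genes):
    """Build a weighted edge list between annotation terms.

    term_to_genes maps a term identifier to the set of genes annotated with it.
    Returns {(term_a, term_b): number_of_shared_genes}, keeping only term
    pairs that share at least one gene.
    """
    edges = {}
    for term_a, term_b in combinations(sorted(term_to_genes), 2):
        shared = len(term_to_genes[term_a] & term_to_genes[term_b])
        if shared > 0:
            edges[(term_a, term_b)] = shared
    return edges

# Toy annotations with placeholder identifiers (not real GO terms or genes).
annotations = {
    "GO:A": {"g1", "g2", "g3"},
    "GO:B": {"g2", "g3", "g4"},
    "GO:C": {"g5"},
}
network = shared_gene_network(annotations)
```

Running the same function on annotation snapshots from different releases would give one network per release, which is the kind of comparison the abstract refers to as tracking the network's evolution.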
Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

Science.gov (United States)

Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

2013-03-21

Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group
Cell cycle gene expression networks discovered using systems biology: Significance in carcinogenesis

Science.gov (United States)

Scott, RE; Ghule, PN; Stein, JL; Stein, GS

2015-01-01

The early stages of carcinogenesis are linked to defects in the cell cycle. A series of cell cycle checkpoints are involved in this process. The G1/S checkpoint that serves to integrate the control of cell proliferation and differentiation is linked to carcinogenesis and the mitotic spindle checkpoint with the development of chromosomal instability. This paper presents the outcome of systems biology studies designed to evaluate if networks of covariate cell cycle gene transcripts exist in proliferative mammalian tissues including mice, rats and humans. The GeneNetwork website that contains numerous gene expression datasets from different species, sexes and tissues represents the foundational resource for these studies (www.genenetwork.org). In addition, WebGestalt, a gene ontology tool, facilitated the identification of expression networks of genes that co-vary with key cell cycle targets, especially Cdc20 and Plk1 (www.bioinfo.vanderbilt.edu/webgestalt). Cell cycle expression networks of such covariate mRNAs exist in multiple proliferative tissues including liver, lung, pituitary, adipose and lymphoid tissues among others but not in brain or retina that have low proliferative potential. Sixty-three covariate cell cycle gene transcripts (mRNAs) compose the average cell cycle network with p = e−13 to e−36. Cell cycle expression networks show species, sex and tissue variability and they are enriched in mRNA transcripts associated with mitosis many of which are associated with chromosomal instability. PMID:25808367
Multiple genetic interaction experiments provide complementary information useful for gene function prediction.

Directory of Open Access Journals (Sweden)

Magali Michaut

Full Text Available Genetic interactions help map biological processes and their functional relationships. A genetic interaction is defined as a deviation from the expected phenotype when combining multiple genetic mutations. In Saccharomyces cerevisiae, most genetic interactions are measured under a single phenotype - growth rate in standard laboratory conditions. Recently genetic interactions have been collected under different phenotypic readouts and experimental conditions. How different are these networks and what can we learn from their differences? We conducted a systematic analysis of quantitative genetic interaction networks in yeast performed under different experimental conditions. We find that networks obtained using different phenotypic readouts, in different conditions and from different laboratories overlap less than expected and provide significant unique information. To exploit this information, we develop a novel method to combine individual genetic interaction data sets and show that the resulting network improves gene function prediction performance, demonstrating that individual networks provide complementary information. Our results support the notion that using diverse phenotypic readouts and experimental conditions will substantially increase the amount of gene function information produced by genetic interaction screens.
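The abstract defines a genetic interaction as a deviation from the expected double-mutant phenotype but does not say how the expectation is computed. One commonly used choice for growth-rate phenotypes is the multiplicative model, shown here only as an assumed illustration; the symbols are introduced for this example and are not taken from the paper.

```latex
% Assumed multiplicative null model (not stated in the abstract):
% f_a, f_b  : fitness of the single mutants a and b relative to wild type
% f_{ab}    : measured fitness of the double mutant
\[
  \varepsilon_{ab} = f_{ab} - f_a \, f_b
\]
% \varepsilon_{ab} < 0 : negative (aggravating) interaction
% \varepsilon_{ab} > 0 : positive (alleviating) interaction
% \varepsilon_{ab} \approx 0 : no interaction under this model
```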
Genes2WordCloud: a quick way to identify biological themes from gene lists and free text

Directory of Open Access Journals (Sweden)

Ma'ayan Avi

2011-10-01

Full Text Available Abstract Background Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Results Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Methods Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Conclusions Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.
Gene-environment interaction in Major Depression: focus on experience-dependent biological systems

Directory of Open Access Journals (Sweden)

Nicola eLopizzo

2015-05-01

Full Text Available Major Depressive Disorder (MDD) is a multifactorial and polygenic disorder, where multiple and partially overlapping sets of susceptibility genes interact with each other and with the environment, predisposing individuals to the development of the illness. Thus, MDD results from a complex interplay of vulnerability genes and environmental factors that act cumulatively throughout an individual's lifetime. Among these environmental factors, stressful life experiences, especially those occurring early in life, have been suggested to exert a crucial impact on brain development, leading to permanent functional changes that may contribute to lifelong risk for mental health outcomes. In this review we will discuss how genetic variants (polymorphisms, SNPs) within genes operating in neurobiological systems that mediate stress response and synaptic plasticity can impact, by themselves, the vulnerability risk for MDD; we will also consider how this MDD risk can be further modulated when gene × environment interaction is taken into account. Finally, we will discuss the role of epigenetic mechanisms, and in particular of DNA methylation and miRNA expression changes, in mediating the effect of stress on the vulnerability risk to develop MDD. Taken together, in this review we aim to underline the role of genetic and epigenetic processes involved in stress and neuroplasticity related biological systems in the development of MDD after exposure to early life stress, thereby building the basis for future research and clinical interventions.
Profiling of Candida albicans Gene Expression During Intra-abdominal Candidiasis Identifies Biologic Processes Involved in Pathogenesis

Science.gov (United States)

Cheng, Shaoji; Clancy, Cornelius J.; Xu, Wenjie; Schneider, Frank; Hao, Binghua; Mitchell, Aaron P.; Nguyen, M. Hong

2013-01-01

Background. The pathogenesis of intra-abdominal candidiasis is poorly understood. Methods. Mice were intraperitoneally infected with Candida albicans (1 × 10^6 colony-forming units) and sterile stool. nanoString assays were used to quantitate messenger RNA for 145 C. albicans genes within the peritoneal cavity at 48 hours. Results. Within 6 hours after infection, mice developed peritonitis, characterized by high yeast burdens, neutrophil influx, and a pH of 7.9 within peritoneal fluid. Organ invasion by hyphae and early abscess formation were evident 6 and 24 hours after infection, respectively; abscesses resolved by day 14. nanoString assays revealed adhesion and responses to alkaline pH, osmolarity, and stress as biologic processes activated in the peritoneal cavity. Disruption of the highly-expressed gene RIM101, which encodes an alkaline-regulated transcription factor, did not impact cellular morphology but reduced both C. albicans burden during early peritonitis and C. albicans persistence within abscesses. RIM101 influenced expression of 49 genes during intra-abdominal candidiasis, including previously unidentified Rim101 targets. Overexpression of the RIM101-dependent gene SAP5, which encodes a secreted protease, restored the ability of a rim101 mutant to persist within abscesses. Conclusions. A mouse model of intra-abdominal candidiasis is valuable for studying pathogenesis and C. albicans gene expression. RIM101 contributes to persistence within intra-abdominal abscesses, at least in part through activation of SAP5. PMID:24006479
Pediatric Multiple Sclerosis: Genes, Environment, and a Comprehensive Therapeutic Approach.

Science.gov (United States)

Cappa, Ryan; Theroux, Liana; Brenton, J Nicholas

2017-10-01

Pediatric multiple sclerosis is an increasingly recognized and studied disorder that accounts for 3% to 10% of all patients with multiple sclerosis. The risk for pediatric multiple sclerosis is thought to reflect a complex interplay between environmental and genetic risk factors. Environmental exposures, including sunlight (ultraviolet radiation, vitamin D levels), infections (Epstein-Barr virus), passive smoking, and obesity, have been identified as potential risk factors in youth. Genetic predisposition contributes to the risk of multiple sclerosis, and the major histocompatibility complex on chromosome 6 makes the single largest contribution to susceptibility to multiple sclerosis. With the use of large-scale genome-wide association studies, other non-major histocompatibility complex alleles have been identified as independent risk factors for the disease. The bridge between environment and genes likely lies in the study of epigenetic processes, which are environmentally-influenced mechanisms through which gene expression may be modified. This article will review these topics to provide a framework for discussion of a comprehensive approach to counseling and ultimately treating the pediatric patient with multiple sclerosis. Copyright © 2017 Elsevier Inc. All rights reserved.
Gene expression profiles of prostate cancer reveal involvement of multiple molecular pathways in the metastatic process

International Nuclear Information System (INIS)

Chandran, Uma R; Ma, Changqing; Dhir, Rajiv; Bisceglia, Michelle; Lyons-Weiler, Maureen; Liang, Wenjing; Michalopoulos, George; Becich, Michael; Monzon, Federico A

2007-01-01

Prostate cancer is characterized by heterogeneity in the clinical course that often does not correlate with morphologic features of the tumor. Metastasis reflects the most adverse outcome of prostate cancer, and to date there are no reliable morphologic features or serum biomarkers that can reliably predict which patients are at higher risk of developing metastatic disease. Understanding the differences in the biology of metastatic and organ confined primary tumors is essential for developing new prognostic markers and therapeutic targets. Using Affymetrix oligonucleotide arrays, we analyzed gene expression profiles of 24 androgen-ablation resistant metastatic samples obtained from 4 patients and a previously published dataset of 64 primary prostate tumor samples. Differential gene expression was analyzed after removing potentially uninformative stromal genes, addressing the differences in cellular content between primary and metastatic tumors. The metastatic samples are highly heterogenous in expression; however, differential expression analysis shows that 415 genes are upregulated and 364 genes are downregulated at least 2 fold in every patient with metastasis. The expression profile of metastatic samples reveals changes in expression of a unique set of genes representing both the androgen ablation related pathways and other metastasis related gene networks such as cell adhesion, bone remodelling and cell cycle. The differentially expressed genes include metabolic enzymes, transcription factors such as Forkhead Box M1 (FoxM1) and cell adhesion molecules such as Osteopontin (SPP1). We hypothesize that these genes have a role in the biology of metastatic disease and that they represent potential therapeutic targets for prostate cancer
Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

Directory of Open Access Journals (Sweden)

Bissell MJ

2006-03-01

Full Text Available Abstract Background Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and utilizes ideas from statistical information retrieval to discover and characterize properties shared by subsets of the list. The practical utility of this method is demonstrated by employing it in a retrospective study of two non-overlapping sets of genes defined by a published investigation as markers for normal human breast luminal epithelial cells and myoepithelial cells. Results Each genetic locus was characterized using a finite set of biological properties and represented as a vector of features indicating attributes associated with the locus (a gene attribute profile). In this study, the vector space models for a pre-defined list of genes were constructed from the Gene Ontology (GO) terms and the Conserved Domain Database (CDD) protein domain terms assigned to the loci by the gene-centered corpus LocusLink. This data set of GO- and CDD-based gene attribute profiles, vectors of binary random variables, was used to estimate multiple finite mixture models, and each ensuing model was used to partition the profiles into clusters. The resultant partitionings were combined using a unanimous voting scheme to produce consensus clusters, sets of profiles that co-occurred consistently in the same cluster. Attributes that were important in defining the genes assigned to a consensus cluster were identified. The clusters and their attributes were inspected to ascertain the GO and CDD terms most associated with subsets of genes and, in conjunction with external knowledge such as chromosomal location, used to gain functional insights into human breast biology. The 52 luminal epithelial cell markers and 89 myoepithelial cell markers are disjoint sets of genes. Ensemble attribute profile clustering-based analysis indicated that both lists
Cross-species multiple environmental stress responses: An integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and related model species.

Directory of Open Access Journals (Sweden)

Adugna Abdi Woldesemayat

Full Text Available Crop response to the changing climate and unpredictable effects of global warming with adverse conditions such as drought stress has brought concerns about food security to the fore; crop yield loss is a major cause of concern in this regard. Identification of genes with multiple responses across environmental stresses is the genetic foundation that leads to crop adaptation to environmental perturbations. In this paper, we introduce an integrated approach to assess candidate genes for multiple stress responses across species. The approach combines ontology based semantic data integration with expression profiling, comparative genomics, phylogenomics, functional gene enrichment and gene enrichment network analysis to identify genes associated with plant stress phenotypes. Five different ontologies, viz., Gene Ontology (GO), Trait Ontology (TO), Plant Ontology (PO), Growth Ontology (GRO) and Environment Ontology (EO), were used to semantically integrate drought related information. Target genes linked to Quantitative Trait Loci (QTLs) controlling yield and stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and closely related species were identified. Based on the enriched GO terms of the biological processes, 1116 sorghum genes with potential responses to 5 different stresses, namely drought (18%), salt (32%), cold (20%), heat (8%) and oxidative stress (25%), were identified to be over-expressed. Out of 169 sorghum drought responsive QTL-associated genes that were identified based on expression datasets, 56% were shown to have multiple stress responses. On the other hand, out of 168 additional genes that have been evaluated for orthologous pairs, 90% were conserved across species for drought tolerance. Over 50% of identified maize and rice genes were responsive to drought and salt stresses and were co-located within multifunctional QTLs. Among the total identified multi-stress responsive genes, 272 targets were shown to be co-localized within QTLs
Butyrate induces profound changes in gene expression related to multiple signal pathways in bovine kidney epithelial cells

Directory of Open Access Journals (Sweden)

Li CongJun

2006-09-01

Full Text Available Abstract Background Global gene expression profiles of bovine kidney epithelial cells regulated by sodium butyrate were investigated with high-density oligonucleotide microarrays. The bovine microarray with 86,191 distinct 60mer oligonucleotides, each with 4 replicates, was designed and produced with Maskless Array Synthesizer technology. These oligonucleotides represent approximately 45,383 unique cattle sequences. Results 450 genes significantly regulated by butyrate with a median False Discovery Rate (FDR) = 0% were identified. The majority of these genes were repressed by butyrate and associated with cell cycle control. The expression levels of 30 selected genes identified by the microarray were confirmed using real-time PCR. The results from real-time PCR positively correlated (R = 0.867) with the results from the microarray. Conclusion This study presented the genes related to multiple signal pathways such as cell cycle control and apoptosis. The profound changes in gene expression elucidate the molecular basis for the pleiotropic effects of butyrate on biological processes. These findings enable better recognition of the full range of beneficial roles butyrate may play during cattle energy metabolism, cell growth and proliferation, and possibly in fighting gastrointestinal pathogens.
Conserving forest biological diversity: How the Montreal Process helps achieve sustainability

Science.gov (United States)

Mark Nelson; Guy Robertson; Kurt Riitters

2015-01-01

Forests support a variety of ecosystems, species and genes collectively referred to as biological diversity along with important processes that tie these all together. With the growing recognition that biological diversity contributes to human welfare in a number of important ways such as providing food, medicine and fiber (provisioning services...
Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

Science.gov (United States)

Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

2016-01-15

As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor differences in stability rankings among the three methods. The use of different single reference genes might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest the necessity of selecting and testing reference genes for different tissues under varying environmental stresses. The results obtained here are expected to help reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.
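Of the three stability programs named above, the comparative delta Ct method is simple enough to sketch. In the usual formulation, each candidate gene is paired with every other candidate, the per-sample Ct difference is computed for each pair, and a gene's score is the mean standard deviation of those differences (lower means more stable). The Python below is a minimal sketch of that idea only, not the authors' pipeline; the function name `delta_ct_stability`, the gene labels and the Ct values are invented for illustration.

```python
from itertools import combinations
from statistics import mean, stdev

def delta_ct_stability(ct):
    """Score candidate reference genes by the comparative delta-Ct idea.

    ct maps gene name -> list of Ct values, one per sample, with the same
    sample order for every gene. Needs at least two genes and two samples.
    Returns {gene: mean SD of its pairwise delta-Ct values}; lower = more stable.
    """
    pair_sds = {gene: [] for gene in ct}
    for gene_a, gene_b in combinations(ct, 2):
        # Per-sample Ct difference between the two genes.
        diffs = [a - b for a, b in zip(ct[gene_a], ct[gene_b])]
        sd = stdev(diffs)
        pair_sds[gene_a].append(sd)
        pair_sds[gene_b].append(sd)
    return {gene: mean(sds) for gene, sds in pair_sds.items()}

# Invented Ct values for three placeholder genes across three samples.
ct_values = {
    "geneA": [20.0, 21.0, 22.0],
    "geneB": [18.0, 19.0, 20.0],
    "geneC": [25.0, 23.0, 27.0],
}
scores = delta_ct_stability(ct_values)
ranking = sorted(scores, key=scores.get)  # most stable first
```

In this toy data geneA and geneB move together across samples, so their pairwise difference is constant, while geneC varies independently and ranks last.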
Argudas: lessons for argumentation in biology based on a gene expression use case.

Science.gov (United States)

McLeod, Kenneth; Ferguson, Gus; Burger, Albert

2012-01-25

In situ hybridisation gene expression information helps biologists identify where a gene is expressed. However, the databases that republish the experimental information online are often both incomplete and inconsistent. Non-monotonic reasoning can help resolve such difficulties - one such form of reasoning is computational argumentation. Essentially this involves asking a computer to debate (i.e. reason about) the validity of a particular statement. Arguments are produced for both sides - the statement is true and, the statement is false - then the most powerful argument is used. In this work the computer is asked to debate whether or not a gene is expressed in a particular mouse anatomical structure. The information generated during the debate can be passed to the biological end-user, enabling their own decision-making process. This paper examines the evolution of a system, Argudas, which tests using computational argumentation in an in situ gene hybridisation gene expression use case. Argudas reasons using information extracted from several different online resources that publish gene expression information for the mouse. The development and evaluation of two prototypes is discussed. Throughout a number of issues shall be raised including the appropriateness of computational argumentation in biology and the challenges faced when integrating apparently similar online biological databases. From the work described in this paper it is clear that for argumentation to be effective in the biological domain the argumentation community need to develop further the tools and resources they provide. Additionally, the biological community must tackle the incongruity between overlapping and adjacent resources, thus facilitating the integration and modelling of biological information. Finally, this work highlights both the importance of, and difficulty in creating, a good model of the domain.
Examination of Signatures of Recent Positive Selection on Genes Involved in Human Sialic Acid Biology.

Science.gov (United States)

Moon, Jiyun M; Aronoff, David M; Capra, John A; Abbot, Patrick; Rokas, Antonis

2018-03-28

Sialic acids are nine carbon sugars ubiquitously found on the surfaces of vertebrate cells and are involved in various immune response-related processes. In humans, at least 58 genes spanning diverse functions, from biosynthesis and activation to recycling and degradation, are involved in sialic acid biology. Because of their role in immunity, sialic acid biology genes have been hypothesized to exhibit elevated rates of evolutionary change. Consistent with this hypothesis, several genes involved in sialic acid biology have experienced higher rates of non-synonymous substitutions in the human lineage than their counterparts in other great apes, perhaps in response to ancient pathogens that infected hominins millions of years ago (paleopathogens). To test whether sialic acid biology genes have also experienced more recent positive selection during the evolution of the modern human lineage, reflecting adaptation to contemporary cosmopolitan or geographically-restricted pathogens, we examined whether their protein-coding regions showed evidence of recent hard and soft selective sweeps. This examination involved the calculation of four measures that quantify changes in allele frequency spectra, extent of population differentiation, and haplotype homozygosity caused by recent hard and soft selective sweeps for 55 sialic acid biology genes using publicly available whole genome sequencing data from 1,668 humans from three ethnic groups. To disentangle evidence for selection from confounding demographic effects, we compared the observed patterns in sialic acid biology genes to simulated sequences of the same length under a model of neutral evolution that takes into account human demographic history. We found that the patterns of genetic variation of most sialic acid biology genes did not significantly deviate from neutral expectations and were not significantly different among genes belonging to different functional categories. Those few sialic acid biology genes that
Simulation and Analysis of Complex Biological Processes: an Organisation Modelling Perspective

NARCIS (Netherlands)

Bosse, T.; Jonker, C.M.; Treur, J.

2005-01-01

This paper explores how the dynamics of complex biological processes can be modelled and simulated as an organisation of multiple agents. This modelling perspective identifies organisational structure occurring in complex decentralised processes and handles complexity of the analysis of the dynamics

Bayesian models and meta analysis for multiple tissue gene expression data following corticosteroid administration

Directory of Open Access Journals (Sweden)

Kelemen Arpad

2008-08-01

Full Text Available Abstract Background This paper addresses key biological problems and statistical issues in the analysis of large gene expression data sets that describe systemic temporal response cascades to therapeutic doses in multiple tissues such as liver, skeletal muscle, and kidney from the same animals. Affymetrix time course gene expression data U34A are obtained from three different tissues including kidney, liver and muscle. Our goal is not only to find the concordance of genes in different tissues and identify the common differentially expressed genes over time, but also to examine the reproducibility of the findings by integrating the results from multiple tissues through meta-analysis, in order to gain a significant increase in the power of detecting differentially expressed genes over time and to find the differences among the three tissues in their response to the drug. Results and conclusion A Bayesian categorical model for estimating the proportion of the 'call' is used for pre-screening genes. A hierarchical Bayesian mixture model is further developed for the identification of differentially expressed genes across time and of dynamic clusters. The deviance information criterion is applied to determine the number of components for model comparison and selection. The Bayesian mixture model produces the gene-specific posterior probability of differential/non-differential expression and the 95% credible interval, which is the basis for our further Bayesian meta-inference. Meta-analysis is performed in order to identify commonly expressed genes from multiple tissues that may serve as ideal targets for novel treatment strategies and to integrate the results across separate studies. We found the commonly expressed genes in the three tissues. However, the up/down/no regulation of these common genes differs at different time points. Moreover, the most differentially expressed genes were found in the liver, then in kidney, and then in muscle.
Branching processes in biology

CERN Document Server

Kimmel, Marek

2015-01-01

This book provides a theoretical background of branching processes and discusses their biological applications. Branching processes are a well-developed and powerful set of tools in the field of applied probability. The range of applications considered includes molecular biology, cellular biology, human evolution and medicine. The branching processes discussed include Galton-Watson, Markov, Bellman-Harris, Multitype, and General Processes. As an aid to understanding specific examples, two introductory chapters, and two glossaries are included that provide background material in mathematics and in biology. The book will be of interest to scientists who work in quantitative modeling of biological systems, particularly probabilists, mathematical biologists, biostatisticians, cell biologists, molecular biologists, and bioinformaticians. The authors are a mathematician and cell biologist who have collaborated for more than a decade in the field of branching processes in biology for this new edition. This second ex...
Fractional populations in multiple gene inheritance.

Science.gov (United States)

Chung, Myung-Hoon; Kim, Chul Koo; Nahm, Kyun

2003-01-22

With complete knowledge of the human genome sequence, one of the most interesting tasks remaining is to understand the functions of individual genes and how they communicate. Using the information about genes (locus, allele, mutation rate, fitness, etc.), we attempt to explain population demographic data. This population evolution study could complement and enhance biologists' understanding about genes. We present a general approach to study population genetics in complex situations. In the present approach, multiple allele inheritance, multiple loci inheritance, natural selection and mutations are allowed simultaneously in order to consider a more realistic situation. A simulation program is presented so that readers can readily carry out studies with their own parameters. It is shown that the multiplicity of the loci greatly affects the demographic results of fractional population ratios. Furthermore, the study indicates that some high infant mortality rates due to congenital anomalies can be attributed to multiple loci inheritance. The simulation program can be downloaded from http://won.hongik.ac.kr/~mhchung/index_files/yapop.htm. In order to run this program, one needs Visual Studio.NET platform, which can be downloaded from http://msdn.microsoft.com/netframework/downloads/default.asp.
NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

Directory of Open Access Journals (Sweden)

Joeri Ruyssinck

Full Text Available One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network, in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene, and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As a second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made
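The rank-wise averaging step mentioned in the abstract above can be illustrated with a minimal sketch. It is not the NIMEFI implementation: the function names, the edge representation and the tie-handling rule are assumptions made for illustration only. Each method supplies an importance score per candidate regulatory edge, the scores are converted to ranks, and the ranks are averaged across methods.

```python
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # hypothetical representation: (regulator, target)


def ranks_from_scores(scores: Dict[Edge, float]) -> Dict[Edge, float]:
    """Convert importance scores to ranks (1 = most important).

    Tied scores share the average of the ranks they span (an assumption).
    """
    ordered = sorted(scores.items(), key=lambda kv: -kv[1])
    ranks: Dict[Edge, float] = {}
    i = 0
    while i < len(ordered):
        j = i
        # Extend j over the run of edges that share the same score.
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        avg_rank = ((i + 1) + (j + 1)) / 2.0
        for k in range(i, j + 1):
            ranks[ordered[k][0]] = avg_rank
        i = j + 1
    return ranks


def rank_average(methods: List[Dict[Edge, float]]) -> List[Tuple[Edge, float]]:
    """Average per-method ranks for each edge; return edges sorted best first."""
    per_method = [ranks_from_scores(m) for m in methods]
    edges = set(per_method[0])
    for r in per_method[1:]:
        edges &= set(r)  # keep only edges scored by every method
    averaged = {e: sum(r[e] for r in per_method) / len(per_method) for e in edges}
    return sorted(averaged.items(), key=lambda kv: (kv[1], kv[0]))
```

Averaging ranks rather than raw scores sidesteps the fact that importance values from different regressors are not on a common scale.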
Lysophosphatidic acid as a lipid mediator with multiple biological actions.

Science.gov (United States)

Aikawa, Shizu; Hashimoto, Takafumi; Kano, Kuniyuki; Aoki, Junken

2015-02-01

Lysophosphatidic acid (LPA) is one of the simplest glycerophospholipids with one fatty acid chain and a phosphate group as a polar head. Although LPA had been viewed just as a metabolic intermediate in de novo lipid synthetic pathways, it has recently attracted much attention as a lipid mediator. LPA drives many kinds of cellular processes, such as cell proliferation and smooth muscle contraction, through cognate G protein-coupled receptors. Because lipids are not coded by the genome directly, it is difficult to know their patho- and physiological roles. However, recent studies have identified several key factors mediating the biological roles of LPA, such as receptors and producing enzymes. In addition, studies of transgenic and gene knockout animals for these LPA-related genes have revealed the biological significance of LPA. In this review we will summarize recent advances in the studies of LPA production and its roles in both physiological and pathological conditions. © The Authors 2014. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

Science.gov (United States)

Chasman, Daniel I; Fuchsberger, Christian; Pattaro, Cristian; Teumer, Alexander; Böger, Carsten A; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; O'Seaghdha, Conall M; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D; Gierman, Hinco J; Feitosa, Mary F; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Lambert, Jean-Charles; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Coassin, Stefan; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; de Andrade, Mariza; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Meisinger, Christa; Gieger, Christian; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, 
Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S; van Duijn, Cornelia M; Borecki, Ingrid B; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Parsa, Afshin; Bochud, Murielle; Heid, Iris M; Kao, W H Linda; Fox, Caroline S; Köttgen, Anna

2012-12-15

In conducting genome-wide association studies (GWAS), analytical approaches leveraging biological information may further understanding of the pathophysiology of clinical traits. To discover novel associations with estimated glomerular filtration rate (eGFR), a measure of kidney function, we developed a strategy for integrating prior biological knowledge into the existing GWAS data for eGFR from the CKDGen Consortium. Our strategy focuses on single nucleotide polymorphisms (SNPs) in genes that are connected by functional evidence, determined by literature mining and gene ontology (GO) hierarchies, to genes near previously validated eGFR associations. It then requires association thresholds consistent with multiple testing, and finally evaluates novel candidates by independent replication. Among the samples of European ancestry, we identified a genome-wide significant SNP in FBXL20 (P = 5.6 × 10^-9) in meta-analysis of all available data, and additional SNPs at the INHBC, LRP2, PLEKHA1, SLC3A2 and SLC7A6 genes meeting multiple-testing corrected significance for replication and overall P-values of 4.5 × 10^-4 to 2.2 × 10^-7. Neither the novel PLEKHA1 nor FBXL20 associations, both further supported by association with eGFR among African Americans and with transcript abundance, would have been implicated by eGFR candidate gene approaches. LRP2, encoding the megalin receptor, was identified through connection with the previously known eGFR gene DAB2 and extends understanding of the megalin system in kidney function. These findings highlight integration of existing genome-wide association data with independent biological knowledge to uncover novel candidate eGFR associations, including candidates lacking known connections to kidney-specific pathways. The strategy may also be applicable to other clinical phenotypes, although more testing will be needed to assess its potential for discovery in general.
MADIBA: A web server toolkit for biological interpretation of Plasmodium and plant gene clusters

Directory of Open Access Journals (Sweden)

Louw Abraham I

2008-02-01

Full Text Available Abstract Background Microarray technology makes it possible to identify changes in gene expression of an organism under various conditions. Data mining is thus essential for deducing significant biological information such as the identification of new biological mechanisms or putative drug targets. While many algorithms and software packages have been developed for analysing gene expression, the extraction of relevant information from experimental data is still a substantial challenge, requiring significant time and skill. Description MADIBA (MicroArray Data Interface for Biological Annotation) facilitates the assignment of biological meaning to gene expression clusters by automating the post-processing stage. A relational database has been designed to store the data from gene to pathway for Plasmodium, rice and Arabidopsis. Tools within the web interface allow rapid analyses for the identification of the Gene Ontology terms relevant to each cluster; visualising the metabolic pathways where the genes are implicated, their genomic localisations, putative common transcriptional regulatory elements in the upstream sequences, and an analysis specific to the organism being studied. Conclusion MADIBA is an integrated, online tool that will assist researchers in interpreting their results and understanding the meaning of the co-expression of a cluster of genes. Functionality of MADIBA was validated by analysing a number of gene clusters from several published experiments: expression profiling of the Plasmodium life cycle, and salt stress treatments of Arabidopsis and rice. In most of the cases, the same conclusions found by the authors were quickly and easily obtained after analysing the gene clusters with MADIBA.
Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data

Directory of Open Access Journals (Sweden)

Gao Shouguo

2011-08-01

Full Text Available Abstract Background Bayesian Network (BN) is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data by itself suffers from high noise and lack of power. Incorporating prior biological knowledge can improve the performance. As each type of prior knowledge on its own may be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to utilize their consensus is desirable. Results We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses the Naïve Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included cocitation in PubMed and schematic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation the Markov Chain Monte Carlo sampling algorithm is adopted, sampling from this reservoir at each iteration to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data including that from a yeast cell cycle and a mouse pancreas development/growth study. Incorporating prior knowledge led to a ~2 fold increase in the number of known transcription regulations recovered, without significant change in false positive rate. In contrast, without the prior knowledge BN modeling is not always better than a random selection, demonstrating the necessity in network modeling to supplement the gene expression data with additional information. Conclusion Our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves the performance.
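The edge reservoir described in the abstract above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' code: the names `build_reservoir` and `propose_edge`, the `scale` parameter and the gene labels are all hypothetical, and the full MCMC acceptance step is omitted. The point is that drawing uniformly from a list in which each edge is repeated in proportion to its prior likelihood proposes well-supported edges more often.

```python
import random
from typing import Dict, List, Tuple

Edge = Tuple[str, str]  # hypothetical representation: (gene_a, gene_b)


def build_reservoir(likelihood: Dict[Edge, float], scale: int = 100) -> List[Edge]:
    """Return a list in which each edge appears round(likelihood * scale) times.

    `scale` is an illustrative knob controlling the copy-number resolution.
    """
    reservoir: List[Edge] = []
    for edge, p in sorted(likelihood.items()):
        reservoir.extend([edge] * int(round(p * scale)))
    return reservoir


def propose_edge(reservoir: List[Edge], rng: random.Random) -> Edge:
    """Draw one candidate edge uniformly from the reservoir.

    Edges with more copies are proposed proportionally more often.
    """
    return rng.choice(reservoir)


if __name__ == "__main__":
    prior = {("g1", "g2"): 0.9, ("g1", "g3"): 0.1}  # made-up likelihoods
    pool = build_reservoir(prior)
    rng = random.Random(0)
    print(propose_edge(pool, rng))
```

In a full sampler, each proposed edge would be added to or removed from the current network and the move accepted or rejected according to the network score.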
Systems Biology Graphical Notation: Process Description language Level 1 Version 1.3.

Science.gov (United States)

Moodie, Stuart; Le Novère, Nicolas; Demir, Emek; Mi, Huaiyu; Villéger, Alice

2015-09-04

The Systems Biology Graphical Notation (SBGN) is an international community effort for standardized graphical representations of biological pathways and networks. The goal of SBGN is to provide unambiguous pathway and network maps for readers with different scientific backgrounds as well as to support efficient and accurate exchange of biological knowledge between different research communities, industry, and other players in systems biology. Three SBGN languages, Process Description (PD), Entity Relationship (ER) and Activity Flow (AF), allow for the representation of different aspects of biological and biochemical systems at different levels of detail. The SBGN Process Description language represents biological entities and processes between these entities within a network. SBGN PD focuses on the mechanistic description and temporal dependencies of biological interactions and transformations. The nodes (elements) are split into entity nodes describing, e.g., metabolites, proteins, genes and complexes, and process nodes describing, e.g., reactions and associations. The edges (connections) provide descriptions of relationships (or influences) between the nodes, such as consumption, production, stimulation and inhibition. Among all three languages of SBGN, PD is the closest to metabolic and regulatory pathways in biological literature and textbooks, but its well-defined semantics offer a superior precision in expressing biological knowledge.
Chemistry and biology by new multiple choice

International Nuclear Information System (INIS)

Seo, Hyeong Seok; Kim, Seong Hwan

2003-02-01

This book is divided into two parts. The first part covers chemistry: the science of materials, atomic structure and the periodic law, chemical bonding and intermolecular forces, states of matter and solutions, chemical reactions, and organic compounds. The second part covers biology: molecules and cells, energy in cells and chemical synthesis, molecular biology and heredity, animal function, plant function, and evolution and ecology. The book explains chemistry and biology through new multiple-choice questions.
100 years after Smoluchowski: stochastic processes in cell biology

International Nuclear Information System (INIS)

Holcman, D; Schuss, Z

2017-01-01

100 years after Smoluchowski introduced his approach to stochastic processes, they are now at the basis of mathematical and physical modeling in cellular biology: they are used, for example, to analyse and extract features from a large number (tens of thousands) of single molecular trajectories or to study the diffusive motion of molecules, proteins or receptors. Stochastic modeling is a new step in large data analysis that serves to extract cell biology concepts. We review here Smoluchowski’s approach to stochastic processes and provide several applications for coarse-graining diffusion and for studying polymer models to understand nuclear organization. Finally, we discuss the stochastic jump dynamics of telomeres across cell division and stochastic gene regulation. (topical review)
Modelling the Dynamics of Intracellular Processes as an Organisation of Multiple Agents

NARCIS (Netherlands)

Bosse, T.; Jonker, C.M.; Treur, J.; Armano, G.; Merelli, E.; Denzinger, J.; Martin, A.; Miles, S.; Tianfield, H.; Unland, R.

2005-01-01

This paper explores how the dynamics of complex biological processes can be modeled as an organisation of multiple agents. This modelling perspective identifies organisational structure occurring in complex decentralised processes and handles complexity of the analysis of the dynamics by structuring
Multifractal detrended fluctuation analysis of analog random multiplicative processes

Energy Technology Data Exchange (ETDEWEB)

Silva, L.B.M.; Vermelho, M.V.D. [Instituto de Fisica, Universidade Federal de Alagoas, Maceio - AL, 57072-970 (Brazil); Lyra, M.L. [Instituto de Fisica, Universidade Federal de Alagoas, Maceio - AL, 57072-970 (Brazil)], E-mail: marcelo@if.ufal.br; Viswanathan, G.M. [Instituto de Fisica, Universidade Federal de Alagoas, Maceio - AL, 57072-970 (Brazil)

2009-09-15

We investigate non-Gaussian statistical properties of stationary stochastic signals generated by an analog circuit that simulates a random multiplicative process with weak additive noise. The random noise originates from thermal shot noise and avalanche processes, while the multiplicative process is generated by a fully analog circuit. The resulting signal describes stochastic time series of current interest in several areas such as turbulence, finance, biology and environment, which exhibit power-law distributions. Specifically, we study the correlation properties of the signal by employing a detrended fluctuation analysis and explore its multifractal nature. The singularity spectrum is obtained and analyzed as a function of the control circuit parameter that tunes the asymptotic power-law form of the probability distribution function.
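A random multiplicative process with weak additive noise can be mimicked numerically with a discrete-time recursion of the form x[t+1] = a[t] * x[t] + n[t]. The sketch below is a software stand-in for illustration, not a model of the analog circuit in the abstract: the log-normal multiplier, the Gaussian noise and every parameter value are assumptions. The mean of log a[t] is chosen negative so that the series stays stationary.

```python
import math
import random
from typing import List


def simulate_rmp(n_steps: int, mu: float = -0.1, sigma: float = 0.5,
                 noise_std: float = 0.01, seed: int = 0) -> List[float]:
    """Simulate x[t+1] = a[t] * x[t] + n[t].

    a[t] = exp(Gaussian(mu, sigma)) is a random multiplier (assumed log-normal);
    n[t] is weak additive Gaussian noise with standard deviation noise_std.
    """
    rng = random.Random(seed)
    x = 0.0
    series: List[float] = []
    for _ in range(n_steps):
        multiplier = math.exp(rng.gauss(mu, sigma))
        x = multiplier * x + rng.gauss(0.0, noise_std)
        series.append(x)
    return series


if __name__ == "__main__":
    signal = simulate_rmp(10_000)
    print(max(abs(v) for v in signal))
```

A series generated this way could then be passed to a detrended fluctuation analysis routine; that step is left out here.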
Excessive biologic response to IFNβ is associated with poor treatment response in patients with multiple sclerosis.

Directory of Open Access Journals (Sweden)

Richard A Rudick

Full Text Available BACKGROUND: Interferon-beta (IFNβ) is used to inhibit disease activity in multiple sclerosis (MS), but its mechanisms of action are incompletely understood, individual treatment response varies, and biological markers predicting response to treatment have yet to be identified. METHODS: The relationship between the molecular response to IFNβ and treatment response was determined in 85 patients using a longitudinal design in which treatment effect was categorized by brain magnetic resonance imaging as good (n = 70) or poor response (n = 15). Molecular response was quantified using a customized cDNA macroarray assay for 166 IFN-regulated genes (IRGs). RESULTS: The molecular response to IFNβ differed significantly between patients in the pattern and number of regulated genes. The molecular response was strikingly stable for individuals for as long as 24 months, however, suggesting an individual 'IFN response fingerprint'. Unexpectedly, patients with poor response showed an exaggerated molecular response. IRG induction ratios demonstrated an exaggerated molecular response at both the first and 6-month IFNβ injections. CONCLUSION: MS patients exhibit individually unique but temporally stable biological responses to IFNβ. Poor treatment response is not explained by the duration of biological effects or the specific genes induced. Rather, individuals with poor treatment response have a generally exaggerated biological response to type I IFN injections. We hypothesize that the molecular response to type I IFN identifies a pathogenetically distinct subset of MS patients whose disease is driven in part by innate immunity. The findings suggest a strategy for biologically based, rational use of IFNβ for individual MS patients.
Mining biological databases for candidate disease genes

Science.gov (United States)

Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

2001-07-01

The publicly funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome in a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains: to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome with genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may be relevant to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high-speed cluster of Linux workstations is used to analyze sequences and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedl Syndrome (BBS).
Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

Directory of Open Access Journals (Sweden)

Mario Fruzangohar

Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO), which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and load it into the database automatically. We developed a PHP web application based on Model-View-Control architecture, and used a specific data structure as well as current and novel algorithms to estimate GO graph parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria) from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways and mechanism(s) of infection. It can also aid in the discovery of genes associated with specific function(s) for investigation as novel vaccine or therapeutic targets. http://turing.ersa.edu.au/BacteriaGO.
The sociobiology of genes: the gene's eye view as a unifying behavioural-ecological framework for biological evolution.

Science.gov (United States)

De Tiège, Alexis; Van de Peer, Yves; Braeckman, Johan; Tanghe, Koen B

2017-11-22

Although classical evolutionary theory, i.e., population genetics and the Modern Synthesis, was already implicitly 'gene-centred', the organism was, in practice, still generally regarded as the individual unit of which a population is composed. The gene-centred approach to evolution only reached a logical conclusion with the advent of the gene-selectionist or gene's eye view in the 1960s and 1970s. Whereas classical evolutionary theory can only work with (genotypically represented) fitness differences between individual organisms, gene-selectionism is capable of working with fitness differences among genes within the same organism and genome. Here, we explore the explanatory potential of 'intra-organismic' and 'intra-genomic' gene-selectionism, i.e., of a behavioural-ecological 'gene's eye view' on genetic, genomic and organismal evolution. First, we give a general outline of the framework and how it complements the (to some extent) still 'organism-centred' approach of classical evolutionary theory. Secondly, we give a more in-depth assessment of its explanatory potential for biological evolution, i.e., for Darwin's 'common descent with modification' or, more specifically, for 'historical continuity or homology with modular evolutionary change' as it has been studied by evolutionary developmental biology (evo-devo) during the last few decades. In contrast with classical evolutionary theory, evo-devo focuses on 'within-organism' developmental processes. Given the capacity of gene-selectionism to adopt an intra-organismal gene's eye view, we outline the relevance of the latter model for evo-devo. Overall, we aim for the conceptual integration between the gene's eye view on the one hand, and more organism-centred evolutionary models (both classical evolutionary theory and evo-devo) on the other.
Review paper Personality and genes: remarks from a biological perspective

Directory of Open Access Journals (Sweden)

Grzegorz Węgrzyn

2014-10-01

Full Text Available Although there is no doubt that genes’ functions influence human personality, years of studies provided no clear picture on regulation of particular traits by specific genes. In this article, an overview of the complexity of the system of genetic control of personality is presented, and the level of complications of biological processes operating in this system is underlined. The methodology of studies devoted to determine effects of genes on personality traits is discussed, and limitations of various methods in such studies are indicated. Finally, suggestions for further research are listed and commented on. It is likely that to increase the level of our understanding of genetic mechanisms that modulate human personality, researchers conducting further studies will have to focus on using large sample sizes, performing independent replications, considering experiments on animal models, integrating cross-cultural data and epigenetic measures, and performing interdisciplinary experiments which combine methods of various disciplines, such as biology and psychology.
A gene pathway analysis highlights the role of cellular adhesion molecules in multiple sclerosis susceptibility

DEFF Research Database (Denmark)

Damotte, V; Guillot-Noel, L; Patsopoulos, N A

2014-01-01

adhesion molecule (CAMs) biological pathway using Cytoscape software. This network is a strong candidate, as it is involved in the crossing of the blood-brain barrier by the T cells, an early event in MS pathophysiology, and is used as an efficient therapeutic target. We drew up a list of 76 genes...... in interaction with other genes as a group. Pathway analysis is an alternative way to highlight such group of genes. Using SNP association P-values from eight multiple sclerosis (MS) GWAS data sets, we performed a candidate pathway analysis for MS susceptibility by considering genes interacting in the cell...... belonging to the CAM network. We highlighted 64 networks enriched with CAM genes with low P-values. Filtering by a percentage of CAM genes up to 50% and rejecting enriched signals mainly driven by transcription factors, we highlighted five networks associated with MS susceptibility. One of them, constituted...
Development of unidentified DNA-specific HIF 1α gene of lizard (Hemidactylus platyurus) which plays a role in tissue regeneration process

Science.gov (United States)

Novianti, T.; Sadikin, M.; Widia, S.; Juniantito, V.; Arida, E. A.

2018-03-01

Characterizing a previously unidentified gene is essential for analyzing its role in a biological process. Identifying the uncharacterized DNA sequence of the HIF 1α gene is important for analyzing its contribution to tissue regeneration in the lizard tail (Hemidactylus platyurus). Bioinformatics and PCR techniques are a relatively easy way to identify an unidentified gene. The most widely used method is BLAST (Basic Local Alignment Search Tool), which aligns the query against sequences from other organisms. BLAST is online software, available at https://blast.ncbi.nlm.nih.gov/Blast.cgi, that is capable of retrieving similar sequences from the closest to the most distant relatives. Gecko japonicus is the species most closely related to H. platyurus. The HIF 1α gene sequence of G. japonicus was compared with those of other species using multiple alignment methods in Mega7 software. Conserved base regions were identified using the Clustal IX method. Primers for the HIF 1α gene were designed with Primer3 software. The HIF 1α gene of the lizard (H. platyurus) was successfully amplified on a real-time PCR machine using the primers that we had designed from Gecko japonicus. The previously unidentified lizard HIF 1α gene was thus successfully identified with the multiple alignment method. The study analyzed tail growth on days 1, 3, 5, 7, 10, 13 and 17 after autotomy. Amplification of the HIF 1α gene was described by the CT value from the real-time PCR machine. HIF 1α gene expression was quantified with the Livak formula. The chi-square test result is 0.000, which means that HIF 1α gene expression differs between growth days.
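The Livak formula mentioned in the abstract above is commonly written as a fold change of 2^(-ΔΔCt). The following is a minimal sketch of that calculation, assuming a single reference (housekeeping) gene, a calibrator sample, and roughly 100% amplification efficiency. The function name, argument names and Ct values are hypothetical and are not taken from the study.

```python
def livak_fold_change(ct_target_sample, ct_ref_sample,
                      ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the 2^(-ddCt) (Livak) method.

    All arguments are threshold-cycle (Ct) values from real-time PCR.
    Assumes near-100% amplification efficiency for both genes.
    """
    # Normalize the target gene to the reference gene in each condition.
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    # Compare the sample with the calibrator (e.g. an untreated control).
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Made-up illustrative Ct values (not data from the abstract):
# the target crosses threshold two cycles earlier in the sample than in
# the calibrator, so expression is about four times higher.
fold = livak_fold_change(24.0, 18.0, 26.0, 18.0)
print(fold)
```

A lower Ct means more starting template, which is why the exponent carries a minus sign.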

Dovetailing biology and chemistry: integrating the Gene Ontology with the ChEBI chemical ontology

Science.gov (United States)

2013-01-01

Background The Gene Ontology (GO) facilitates the description of the action of gene products in a biological context. Many GO terms refer to chemical entities that participate in biological processes. To facilitate accurate and consistent systems-wide biological representation, it is necessary to integrate the chemical view of these entities with the biological view of GO functions and processes. We describe a collaborative effort between the GO and the Chemical Entities of Biological Interest (ChEBI) ontology developers to ensure that the representation of chemicals in the GO is both internally consistent and in alignment with the chemical expertise captured in ChEBI. Results We have examined and integrated the ChEBI structural hierarchy into the GO resource through computationally-assisted manual curation of both GO and ChEBI. Our work has resulted in the creation of computable definitions of GO terms that contain fully defined semantic relationships to corresponding chemical terms in ChEBI. Conclusions The set of logical definitions using both the GO and ChEBI has already been used to automate aspects of GO development and has the potential to allow the integration of data across the domains of biology and chemistry. These logical definitions are available as an extended version of the ontology from http://purl.obolibrary.org/obo/go/extensions/go-plus.owl. PMID:23895341
The leukemia-specific fusion gene ETV6/RUNX1 perturbs distinct key biological functions primarily by gene repression.

Directory of Open Access Journals (Sweden)

Gerhard Fuka

BACKGROUND: ETV6/RUNX1 (E/R; also known as TEL/AML1) is the most frequent gene fusion in childhood acute lymphoblastic leukemia (ALL) and also most likely the crucial factor for disease initiation; its role in leukemia propagation and maintenance, however, remains largely elusive. To address this issue we performed a shRNA-mediated knock-down (KD) of the E/R fusion gene and investigated the ensuing consequences on genome-wide gene expression patterns and deducible regulatory functions in two E/R-positive leukemic cell lines. FINDINGS: Microarray analyses identified 777 genes whose expression was substantially altered. Although approximately equal proportions were either up- (KD-UP) or down-regulated (KD-DOWN), the effects on biological processes and pathways differed considerably. The E/R KD-UP set was significantly enriched for genes included in the "cell activation", "immune response", "apoptosis", "signal transduction" and "development and differentiation" categories, whereas in the E/R KD-DOWN set only the "PI3K/AKT/mTOR signaling" and "hematopoietic stem cells" categories became evident. Comparable expression signatures obtained from primary E/R-positive ALL samples underline the relevance of these pathways and molecular functions. We also validated six differentially expressed genes representing the categories "stem cell properties", "B-cell differentiation", "immune response", "cell adhesion" and "DNA damage" with RT-qPCR. CONCLUSION: Our analyses provide the first preliminary evidence that the continuous expression of the E/R fusion gene interferes with key regulatory functions that shape the biology of this leukemia subtype. E/R may thus indeed constitute the essential driving force for the propagation and maintenance of the leukemic process irrespective of potential consequences of associated secondary changes. Finally, these findings may also provide a valuable source of potentially attractive therapeutic targets.
Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

Science.gov (United States)

Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

2014-05-01

Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
System and process for pulsed multiple reaction monitoring

Science.gov (United States)

Belov, Mikhail E

2013-05-17

A new pulsed multiple reaction monitoring process and system are disclosed that uses a pulsed ion injection mode for use in conjunction with triple-quadrupole instruments. The pulsed injection mode approach reduces background ion noise at the detector, increases amplitude of the ion signal, and includes a unity duty cycle that provides a significant sensitivity increase for reliable quantitation of proteins/peptides present at attomole levels in highly complex biological mixtures.
Transcriptional regulation of receptor-like protein genes by environmental stresses and hormones and their overexpression activities in Arabidopsis thaliana.

Science.gov (United States)

Wu, Jinbin; Liu, Zhijun; Zhang, Zhao; Lv, Yanting; Yang, Nan; Zhang, Guohua; Wu, Menyao; Lv, Shuo; Pan, Lixia; Joosten, Matthieu H A J; Wang, Guodong

2016-05-01

Receptor-like proteins (RLPs) have been implicated in multiple biological processes, including plant development and immunity to microbial infection. Fifty-seven AtRLP genes have been identified in Arabidopsis, but only a few have been functionally characterized. This is due to the lack of suitable physiological screening conditions and the high degree of functional redundancy among AtRLP genes. To overcome the functional redundancy and further understand the role of AtRLP genes, we studied the evolution of AtRLP genes and compiled a comprehensive profile of the transcriptional regulation of AtRLP genes upon exposure to a range of environmental stresses and different hormones. These results indicate that the majority of AtRLP genes are differentially expressed under the various conditions that were tested, an observation that will help to select AtRLP genes involved in a specific biological process for further experimental studies to eventually dissect their function. A large number of AtRLP genes were found to respond to more than one treatment, suggesting that a single AtRLP gene may be involved in multiple physiological processes. In addition, we performed a genome-wide cloning of the AtRLP genes, and generated and characterized transgenic Arabidopsis plants overexpressing the individual AtRLP genes, presenting new insight into the roles of AtRLP genes, as exemplified by AtRLP3, AtRLP11 and AtRLP28. Our study provides an overview of biological processes in which AtRLP genes may be involved, and presents valuable resources for future investigations into the function of these genes. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Polyuridylylation and processing of transcripts from multiple gene minicircles in chloroplasts of the dinoflagellate Amphidinium carterae

KAUST Repository

Barbrook, Adrian C.

2012-05-05

Although transcription and transcript processing in the chloroplasts of plants have been extensively characterised, the RNA metabolism of other chloroplast lineages across the eukaryotes remains poorly understood. In this paper, we use RT-PCR to study transcription and transcript processing in the chloroplasts of Amphidinium carterae, a model peridinin-containing dinoflagellate. These organisms have a highly unusual chloroplast genome, with genes located on multiple small 'minicircle' elements, and a number of idiosyncratic features of RNA metabolism including transcription via a rolling circle mechanism, and 3′ terminal polyuridylylation of transcripts. We demonstrate that transcription occurs in A. carterae via a rolling circle mechanism, as previously shown in the dinoflagellate Heterocapsa, and present evidence for the production of both polycistronic and monocistronic transcripts from A. carterae minicircles, including several regions containing ORFs previously not known to be expressed. We demonstrate the presence of both polyuridylylated and non-polyuridylylated transcripts in A. carterae, and show that polycistronic transcripts can be terminally polyuridylylated. We present a model for RNA metabolism in dinoflagellate chloroplasts where long polycistronic precursors are processed to form mature transcripts. Terminal polyuridylylation may mark transcripts with the correct 3′ end. © 2012 Springer Science+Business Media B.V.
Differentially expressed genes distributed over chromosomes and implicated in certain biological processes for site insertion genetically modified rice Kemingdao.

Science.gov (United States)

Liu, Zhi; Li, Yunhe; Zhao, Jie; Chen, Xiuping; Jian, Guiliang; Peng, Yufa; Qi, Fangjun

2012-01-01

The release of genetically modified (GM) plants has sparked intensive debate worldwide, partly because of concerns about potential unintended adverse effects of GM plants on the agro system and on the safety of foods. In this study, to reveal the molecular basis for the unintended effects of a single-site-insertion GM rice, Kemingdao (KMD), transformed with a synthetic cry1Ab gene, and to link those effects to differentially expressed genes, comparative transcriptome analyses were performed on GM KMD rice and its parent rice Xiushui11 (XS11). The results showed that 680 differentially expressed transcripts were identified in 30-day-old seedlings of GM KMD rice. The vast majority of these transcripts were dispersed across all rice chromosomes and were physically distant from the insertion site, while only two transcripts were found to be differentially expressed among the 21 genes located within 100 kb upstream and downstream of the insertion site. Pathway and biological function analyses further revealed that the differentially expressed transcripts of KMD rice were involved in certain biological processes and mainly implicated in two types of pathways. One type comprised pathways implicated in plant stress/defense responses, which is consistent with the reported unintended effect that KMD rice is more susceptible to rice diseases than its parent rice XS11; the other type comprised pathways associated with amino acid metabolism. Following this clue, new unintended effects, namely changes in amino acid synthesis in KMD rice leaves, were successfully revealed. This provides a first actual case of identifying unintended effects in GM plants by comparative transcriptome analysis.
Integrated assessment by multiple gene expression analysis of quercetin bioactivity on anticancer-related mechanisms in colon cancer cells in vitro

NARCIS (Netherlands)

Erk, van M.J.; Roepman, P.; Lende, van der T.R.; Stierum, R.H.; Aarts, J.M.M.J.G.; Bladeren, van P.J.; Ommen, van B.

2005-01-01

Background Many different mechanisms are involved in nutrient-related prevention of colon cancer. In this study, a comprehensive assessment of the spectrum of possible biological actions of the bioactive compound quercetin is made using multiple gene expression analysis. Quercetin is a flavonoid
Prosecutor: parameter-free inference of gene function for prokaryotes using DNA microarray data, genomic context and multiple gene annotation sources

Directory of Open Access Journals (Sweden)

van Hijum Sacha AFT

2008-10-01

Background: Despite a plethora of functional genomic efforts, the function of many genes in sequenced genomes remains unknown. The increasing amount of microarray data for many species allows employing the guilt-by-association principle to predict function on a large scale: genes exhibiting similar expression patterns are more likely to participate in shared biological processes. Results: We developed Prosecutor, an application that enables researchers to rapidly infer gene function based on available gene expression data and functional annotations. Our parameter-free functional prediction method uses a sensitive algorithm to achieve a high association rate of linking genes with unknown function to annotated genes. Furthermore, Prosecutor utilizes additional biological information such as genomic context and known regulatory mechanisms that are specific for prokaryotes. We analyzed publicly available transcriptome data sets and used literature sources to validate putative functions suggested by Prosecutor. We supply the complete results of our analysis for 11 prokaryotic organisms on a dedicated website. Conclusion: The Prosecutor software and supplementary datasets available at http://www.prosecutor.nl allow researchers working on any of the analyzed organisms to quickly identify the putative functions of their genes of interest. A de novo analysis allows new organisms to be studied.
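The guilt-by-association principle described above can be illustrated with a small sketch: an unannotated gene borrows the annotation of the annotated gene whose expression profile correlates best with its own. This is not Prosecutor's algorithm (which the abstract describes as parameter-free); the Pearson measure, the `min_r` cutoff, the gene names and the expression values are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)  # undefined for constant profiles


def predict_function(query, expression, annotations, min_r=0.8):
    """Return the annotation of the best-correlated annotated gene.

    `min_r` is a hypothetical cutoff; returns None when no annotated
    gene reaches it.
    """
    best_gene, best_r = None, min_r
    for gene in annotations:
        r = pearson(expression[query], expression[gene])
        if r >= best_r:
            best_gene, best_r = gene, r
    return annotations[best_gene] if best_gene is not None else None


# Toy profiles across four conditions (invented values).
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [4.0, 3.0, 2.0, 1.0],
    "geneX": [2.0, 4.1, 5.9, 8.0],   # unannotated, tracks geneA
}
annotations = {"geneA": "sugar transport", "geneB": "sporulation"}

print(predict_function("geneX", expression, annotations))
```

In practice a single best match is fragile; pooling annotations over several correlated neighbours, or adding genomic context as the abstract mentions, would be more robust.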
The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Science.gov (United States)

Liberzon, Arthur; Birger, Chet; Thorvaldsdóttir, Helga; Ghandi, Mahmoud; Mesirov, Jill P; Tamayo, Pablo

2015-12-23

The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.
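The redundancy across gene sets that motivates the hallmark collection can be made concrete with a simple overlap measure. The sketch below flags pairs of gene sets whose Jaccard index exceeds a cutoff. It is only an illustration of what "redundant" can mean, not the procedure used to build the hallmarks; the set names, gene identifiers and the 0.5 cutoff are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index: shared genes divided by all genes in either set."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def redundant_pairs(gene_sets, threshold=0.5):
    """List (name1, name2, score) for set pairs overlapping >= threshold."""
    pairs = []
    for name1, name2 in combinations(sorted(gene_sets), 2):
        score = jaccard(gene_sets[name1], gene_sets[name2])
        if score >= threshold:
            pairs.append((name1, name2, score))
    return pairs


# Invented gene sets for illustration.
gene_sets = {
    "SET_A": {"G1", "G2", "G3", "G4"},
    "SET_B": {"G2", "G3", "G4", "G5"},
    "SET_C": {"G9", "G10"},
}

print(redundant_pairs(gene_sets))
```

Highly overlapping pairs such as these would be natural candidates for merging into one refined set.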
Experimental strategies to assess the biological ramifications of multiple drivers of global ocean change-A review.

Science.gov (United States)

Boyd, Philip W; Collins, Sinead; Dupont, Sam; Fabricius, Katharina; Gattuso, Jean-Pierre; Havenhand, Jonathan; Hutchins, David A; Riebesell, Ulf; Rintoul, Max S; Vichi, Marcello; Biswas, Haimanti; Ciotti, Aurea; Gao, Kunshan; Gehlen, Marion; Hurd, Catriona L; Kurihara, Haruko; McGraw, Christina M; Navarro, Jorge M; Nilsson, Göran E; Passow, Uta; Pörtner, Hans-Otto

2018-06-01

Marine life is controlled by multiple physical and chemical drivers and by diverse ecological processes. Many of these oceanic properties are being altered by climate change and other anthropogenic pressures. Hence, identifying the influences of multifaceted ocean change, from local to global scales, is a complex task. To guide policy-making and make projections of the future of the marine biosphere, it is essential to understand biological responses at physiological, evolutionary and ecological levels. Here, we contrast and compare different approaches to multiple driver experiments that aim to elucidate biological responses to a complex matrix of ocean global change. We present the benefits and the challenges of each approach with a focus on marine research, and guidelines to navigate through these different categories to help identify strategies that might best address research questions in fundamental physiology, experimental evolutionary biology and community ecology. Our review reveals that the field of multiple driver research is being pulled in complementary directions: the need for reductionist approaches to obtain process-oriented, mechanistic understanding and a requirement to quantify responses to projected future scenarios of ocean change. We conclude the review with recommendations on how best to align different experimental approaches to contribute fundamental information needed for science-based policy formulation. © 2018 John Wiley & Sons Ltd.
A Bayesian Hierarchical Model for Relating Multiple SNPs within Multiple Genes to Disease Risk

Directory of Open Access Journals (Sweden)

Lewei Duan

2013-01-01

A variety of methods have been proposed for studying the association of multiple genes thought to be involved in a common pathway for a particular disease. Here, we present an extension of a Bayesian hierarchical modeling strategy that allows for multiple SNPs within each gene, with external prior information at either the SNP or gene level. The model involves variable selection at the SNP level through latent indicator variables and Bayesian shrinkage at the gene level towards a prior mean vector and covariance matrix that depend on external information. The entire model is fitted using Markov chain Monte Carlo methods. Simulation studies show that the approach is capable of recovering many of the truly causal SNPs and genes, depending upon their frequency and size of their effects. The method is applied to data on 504 SNPs in 38 candidate genes involved in DNA damage response in the WECARE study of second breast cancers in relation to radiotherapy exposure.
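One way a model of the kind described above could be written down is sketched below. The notation is assumed for illustration only and is not taken from the paper: $X_{igs}$ is the genotype of subject $i$ at SNP $s$ of gene $g$, $\gamma_{gs}$ is the latent inclusion indicator, $b_g$ is a gene-level effect, and $Z$ and $A$ stand for external prior information entering the prior mean and covariance. The logistic link and the specific distributions are likewise assumptions.

```latex
\begin{align}
\operatorname{logit} \Pr(Y_i = 1)
  &= \alpha + \sum_{g=1}^{G} \sum_{s=1}^{S_g} \gamma_{gs}\, \beta_{gs}\, X_{igs}
  && \text{(disease risk)} \\
\gamma_{gs} &\sim \operatorname{Bernoulli}(\pi_{gs})
  && \text{(SNP-level variable selection)} \\
\beta_{gs} \mid b_g &\sim \mathcal{N}\!\left(b_g,\ \sigma^2\right)
  && \text{(SNP effects centred on the gene effect)} \\
\mathbf{b} = (b_1, \dots, b_G)^{\top}
  &\sim \mathcal{N}\!\left(Z\boldsymbol{\eta},\ \tau^2\, \Sigma(A)\right)
  && \text{(gene-level shrinkage)}
\end{align}
```

Under this reading, a SNP contributes to risk only when $\gamma_{gs} = 1$, and genes that the external information marks as similar are shrunk towards similar effects.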
Nothing in Evolution Makes Sense Except in the Light of Genomics: Read–Write Genome Evolution as an Active Biological Process

Directory of Open Access Journals (Sweden)

James A. Shapiro

2016-06-01

The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
A canonical correlation analysis-based dynamic bayesian network prior to infer gene regulatory networks from multiple types of biological data.

Science.gov (United States)

Baur, Brittany; Bozdag, Serdar

2015-04-01

One of the challenging and important computational problems in systems biology is to infer gene regulatory networks (GRNs) of biological systems. Several methods that exploit gene expression data have been developed to tackle this problem. In this study, we propose the use of copy number and DNA methylation data to infer GRNs. We developed an algorithm that scores regulatory interactions between genes based on canonical correlation analysis. In this algorithm, copy number or DNA methylation variables are treated as potential regulator variables, and expression variables are treated as potential target variables. We first validated that the canonical correlation analysis method is able to infer true interactions with high accuracy. We showed that the use of DNA methylation or copy number datasets leads to improved inference over steady-state expression. Our results also showed that epigenetic and structural information could be used to infer directionality of regulatory interactions. Additional improvements in GRN inference can be gleaned from incorporating the result in an informative prior in a dynamic Bayesian algorithm. This is the first study that incorporates copy number and DNA methylation into an informative prior in a dynamic Bayesian framework. By closely examining top-scoring interactions with different sources of epigenetic or structural information, we also identified potential novel regulatory interactions.
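The idea of scoring a regulator-target pair by canonical correlation can be sketched for the simplest case: two regulator variables (here a copy number and a methylation profile of one candidate regulator) against a single target expression profile. With a single target variable, the first canonical correlation reduces to the multiple correlation coefficient, which has a closed form. This is an illustrative simplification, not the authors' algorithm; the function names and the toy profiles are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def canonical_score(copy_number, methylation, expression):
    """Canonical correlation of (copy_number, methylation) vs expression.

    For one target variable this equals the multiple correlation R.
    Undefined when the two regulator variables are perfectly collinear.
    """
    r_ce = pearson(copy_number, expression)
    r_me = pearson(methylation, expression)
    r_cm = pearson(copy_number, methylation)
    r_squared = (r_ce ** 2 + r_me ** 2 - 2 * r_ce * r_me * r_cm) / (1 - r_cm ** 2)
    # Clamp tiny floating-point overshoots before taking the root.
    return sqrt(max(0.0, min(1.0, r_squared)))


# Toy profiles over four samples (invented values).
copy_number = [1.0, -1.0, 1.0, -1.0]
methylation = [1.0, 1.0, -1.0, -1.0]
target_driven = [2.0, 0.0, 0.0, -2.0]     # sum of the two regulator variables
target_unrelated = [1.0, -1.0, -1.0, 1.0]  # uncorrelated with both

print(canonical_score(copy_number, methylation, target_driven))
print(canonical_score(copy_number, methylation, target_unrelated))
```

Scores near 1 would rank a candidate interaction highly, and scores near 0 would rank it low; a full analysis with several target variables would need a proper CCA implementation.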
Significant Down-Regulation of “Biological Adhesion” Genes in Porcine Oocytes after IVM

Directory of Open Access Journals (Sweden)

Joanna Budna

2017-12-01

Proper maturation of the mammalian oocyte is a compound process determining successful monospermic fertilization; however, the number of fully mature porcine oocytes is still unsatisfactory. Since oocyte maturation and fertilization involve cellular adhesion and membranous contact, the aim was to investigate the cell adhesion ontology group in porcine oocytes. The oocytes were collected from ovaries of 45 pubertal crossbred Landrace gilts and subjected to two BCB tests. After the first test, only granulosa cell-free BCB+ oocytes were either directly exposed to microarray assays and RT-qPCR (“before IVM” group), or first in vitro matured and then, if classified as BCB+, passed to molecular analyses (“after IVM” group). As a result, we have discovered substantial down-regulation of genes involved in adhesion processes, such as: organization of actin cytoskeleton, migration, proliferation, differentiation, apoptosis, survival or angiogenesis in porcine oocytes after IVM, compared to oocytes analyzed before IVM. In conclusion, we found that biological adhesion may be recognized as the process involved in porcine oocytes’ successful IVM. Down-regulation of genes included in this ontology group in immature oocytes after IVM points to their unique function in the oocyte’s achievement of fully mature stages. Thus, results indicated new molecular markers involved in porcine oocyte IVM, displaying essential roles in biological adhesion processes.
TF-finder: A software package for identifying transcription factors involved in biological processes using microarray data and existing knowledge base

Directory of Open Access Journals (Sweden)

Cui Xiaoqi

2010-08-01

Background: Identification of transcription factors (TFs) involved in a biological process is the first step towards a better understanding of the underlying regulatory mechanisms. However, due to the involvement of a large number of genes and complicated interactions in a gene regulatory network (GRN), identification of the TFs involved in a biological process remains very challenging. In reality, the recognition of TFs for a given biological process can be further complicated by the fact that most eukaryotic genomes encode thousands of TFs, which are organized in gene families of various sizes and in many cases with poor sequence conservation except for small conserved domains. This poses a significant challenge for identification of the exact TFs involved or ranking the importance of a set of TFs to a process of interest. Therefore, new methods for recognizing novel TFs are desperately needed. Although a plethora of methods have been developed to infer regulatory genes using microarray data, it is still rare to find methods that use an existing knowledge base, in particular the validated genes known to be involved in a process, to bait/guide discovery of novel TFs. Such methods can replace the sometimes-arbitrary process of selection of candidate genes for experimental validation and significantly advance our knowledge and understanding of the regulation of a process. Results: We developed an automated software package called TF-finder for recognizing TFs involved in a biological process using microarray data and an existing knowledge base. TF-finder contains two components, adaptive sparse canonical correlation analysis (ASCCA) and an enrichment test, for TF recognition. ASCCA uses positive target genes to bait TFs from gene expression data while the enrichment test examines the presence of positive TFs in the outcomes from ASCCA. Using microarray data from salt and water stress experiments, we showed TF-finder is very efficient in recognizing
Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression

Directory of Open Access Journals (Sweden)

Calvo-Dmgz D.

2012-12-01

DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. This large amount of gene expression data has been used by researchers seeking diagnosis of diseases like cancer using machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the last decade. This work integrates explicit biological knowledge, provided as gene sets, into the classification process by means of Variable Precision Rough Set Theory (VPRS). The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification which is able to incorporate prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then we apply rough set theory to select the most promising supergenes and to derive a set of easily interpretable classification rules. The proposed model is evaluated over three breast cancer microarray datasets, obtaining successful results compared to classical classification techniques. The experimental results show that there are no significant differences between our model and classical techniques, but it is able to provide a biologically interpretable explanation of how it classifies new samples.
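The core relaxation that variable precision rough sets add to classical rough sets can be sketched briefly: samples with identical (discretized) attribute values form a block, and a block counts as belonging to a class when at least a fraction β of its members carry that class label, rather than all of them. The sketch below computes this β-lower approximation. It is not the paper's pipeline; the discretized "supergene" levels, the labels and the β value are assumptions for illustration.

```python
from collections import defaultdict


def beta_lower_approximation(samples, target_label, beta=0.8):
    """Indices of samples in the beta-lower approximation of a class.

    `samples` is a list of (conditions, label) pairs, where `conditions`
    is a hashable tuple of discretized attribute values. A block of
    indiscernible samples is included when the fraction of its members
    labelled `target_label` is at least `beta` (0.5 < beta <= 1).
    """
    blocks = defaultdict(list)
    for index, (conditions, _label) in enumerate(samples):
        blocks[conditions].append(index)

    approximation = set()
    for members in blocks.values():
        hits = sum(1 for i in members if samples[i][1] == target_label)
        if hits / len(members) >= beta:
            approximation.update(members)
    return approximation


# Invented samples: two discretized supergene levels and a class label.
samples = [
    (("high", "low"), "tumor"),
    (("high", "low"), "tumor"),
    (("high", "low"), "tumor"),
    (("high", "low"), "tumor"),
    (("high", "low"), "normal"),   # one inconsistent sample in the block
    (("low", "low"), "normal"),
    (("low", "high"), "tumor"),
    (("low", "high"), "normal"),
]

print(sorted(beta_lower_approximation(samples, "tumor", beta=0.8)))
print(sorted(beta_lower_approximation(samples, "tumor", beta=1.0)))
```

With β = 1.0 the definition falls back to the classical lower approximation, where a single inconsistent sample excludes its whole block; lowering β tolerates such noise, which is useful for microarray data.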
VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

Science.gov (United States)

Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

2013-12-16

Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis
A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

Science.gov (United States)

Seok, Junhee; Davis, Ronald W; Xiao, Wenzhong

2015-01-01

Accumulated biological knowledge is often encoded as gene sets, collections of genes associated with similar biological functions or pathways. The use of gene sets in the analyses of high-throughput gene expression data has been intensively studied and applied in clinical research. However, the main interest remains in finding modules of biological knowledge, or corresponding gene sets, significantly associated with disease conditions. Risk prediction from censored survival times using gene sets hasn't been well studied. In this work, we propose a hybrid method that uses both single gene and gene set information together to predict patient survival risks from gene expression profiles. In the proposed method, gene sets provide context-level information that is poorly reflected by single genes. Complementarily, single genes help to supplement incomplete information of gene sets due to our imperfect biomedical knowledge. Through the tests over multiple data sets of cancer and trauma injury, the proposed method showed robust and improved performance compared with the conventional approaches with only single genes or gene sets solely. Additionally, we examined the prediction result in the trauma injury data, and showed that the modules of biological knowledge used in the prediction by the proposed method were highly interpretable in biology. A wide range of survival prediction problems in clinical genomics is expected to benefit from the use of biological knowledge.
Improved elucidation of biological processes linked to diabetic nephropathy by single probe-based microarray data analysis.

Directory of Open Access Journals (Sweden)

Clemens D Cohen

Full Text Available BACKGROUND: Diabetic nephropathy (DN) is a complex and chronic metabolic disease that evolves into a progressive fibrosing renal disorder. Effective transcriptomic profiling of slowly evolving disease processes such as DN can be problematic. The changes that occur are often subtle and can escape detection by conventional oligonucleotide DNA array analyses. METHODOLOGY/PRINCIPAL FINDINGS: We examined microdissected human renal tissue with or without DN using Affymetrix oligonucleotide microarrays (HG-U133A) by standard Robust Multi-array Analysis (RMA). Subsequent gene ontology analysis by Database for Annotation, Visualization and Integrated Discovery (DAVID) showed limited detection of biological processes previously identified as central mechanisms in the development of DN (e.g. inflammation and angiogenesis). This apparent lack of sensitivity may be associated with the gene-oriented averaging of oligonucleotide probe signals, as this includes signals from cross-hybridizing probes and gene annotation that is based on out of date genomic data. We then examined the same CEL file data using a different methodology to determine how well it could correlate transcriptomic data with observed biology. ChipInspector (CI) is based on single probe analysis and de novo gene annotation that bypasses probe set definitions. Both methods, RMA and CI, used at default settings yielded comparable numbers of differentially regulated genes. However, when verified by RT-PCR, the single probe based analysis demonstrated reduced background noise with enhanced sensitivity and fewer false positives. CONCLUSIONS/SIGNIFICANCE: Using a single probe based analysis approach with de novo gene annotation allowed an improved representation of the biological processes linked to the development and progression of DN. The improved analysis was exemplified by the detection of Wnt signaling pathway activation in DN, a process not previously reported to be involved in this disease.

A Genome-Wide Identification of the WRKY Family Genes and a Survey of Potential WRKY Target Genes in Dendrobium officinale.

Science.gov (United States)

He, Chunmei; Teixeira da Silva, Jaime A; Tan, Jianwen; Zhang, Jianxia; Pan, Xiaoping; Li, Mingzhi; Luo, Jianping; Duan, Jun

2017-08-23

The WRKY family, one of the largest families of transcription factors, plays important roles in the regulation of various biological processes, including growth, development and stress responses in plants. In the present study, 63 DoWRKY genes were identified from the Dendrobium officinale genome. These were classified into groups I, II, III and a non-group, each with 14, 28, 10 and 11 members, respectively. ABA-responsive, sulfur-responsive and low temperature-responsive elements were identified in the 1-k upstream regulatory region of DoWRKY genes. Subsequently, the expression of the 63 DoWRKY genes under cold stress was assessed, and the expression profiles of a large number of these genes were regulated by low temperature in roots and stems. To further understand the regulatory mechanism of DoWRKY genes in biological processes, potential WRKY target genes were investigated. Among them, most stress-related genes contained multiple W-box elements in their promoters. In addition, the genes involved in polysaccharide synthesis and hydrolysis contained W-box elements in their 1-k upstream regulatory regions, suggesting that DoWRKY genes may play a role in polysaccharide metabolism. These results provide a basis for investigating the function of WRKY genes and help to understand the downstream regulation network in plants within the Orchidaceae.
Mathematical modeling of biological processes

CERN Document Server

Friedman, Avner

2014-01-01

This book on mathematical modeling of biological processes includes a wide selection of biological topics that demonstrate the power of mathematics and computational codes in setting up biological processes with a rigorous and predictive framework. Topics include: enzyme dynamics, spread of disease, harvesting bacteria, competition among live species, neuronal oscillations, transport of neurofilaments in axon, cancer and cancer therapy, and granulomas. Complete with a description of the biological background and biological question that requires the use of mathematics, this book is developed for graduate students and advanced undergraduate students with only basic knowledge of ordinary differential equations and partial differential equations; background in biology is not required. Students will gain knowledge on how to program with MATLAB without previous programming experience and how to use codes in order to test biological hypotheses.
Combining multiple hypothesis testing and affinity propagation clustering leads to accurate, robust and sample size independent classification on gene expression data

Directory of Open Access Journals (Sweden)

Sakellariou Argiris

2012-10-01

Full Text Available Abstract Background A feature selection method in microarray gene expression data should be independent of platform, disease and dataset size. Our hypothesis is that among the statistically significant ranked genes in a gene list, there should be clusters of genes that share similar biological functions related to the investigated disease. Thus, instead of keeping N top ranked genes, it would be more appropriate to define and keep a number of gene cluster exemplars. Results We propose a hybrid FS method (mAP-KL), which combines multiple hypothesis testing and the affinity propagation (AP) clustering algorithm along with the Krzanowski & Lai cluster quality index, to select a small yet informative subset of genes. We applied mAP-KL on real microarray data, as well as on simulated data, and compared its performance against 13 other feature selection approaches. Across a variety of diseases and number of samples, mAP-KL presents competitive classification results, particularly in neuromuscular diseases, where its overall AUC score was 0.91. Furthermore, mAP-KL generates concise yet biologically relevant and informative N-gene expression signatures, which can serve as a valuable tool for diagnostic and prognostic purposes, as well as a source of potential disease biomarkers in a broad range of diseases. Conclusions mAP-KL is a data-driven and classifier-independent hybrid feature selection method, which applies to any disease classification problem based on microarray data, regardless of the available samples. Combining multiple hypothesis testing and AP leads to subsets of genes that classify unknown samples from both small and large patient cohorts with high accuracy.
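The multiple hypothesis testing step mentioned above can be illustrated with a minimal sketch of the Benjamini-Hochberg step-up procedure, a widely used false discovery rate correction. This is an assumption chosen for illustration: the abstract does not say which correction mAP-KL applies, and the p-values below are hypothetical. The affinity propagation step is not shown.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return the indices of hypotheses rejected at FDR level alpha.

    Step-up rule: find the largest rank r (1-based, ascending p-values)
    with p_(r) <= alpha * r / m, then reject the r smallest p-values.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= alpha * rank / m:
            cutoff_rank = rank  # keep the largest rank that passes
    return sorted(order[:cutoff_rank])


# Hypothetical per-gene p-values from a differential expression test.
gene_p_values = [0.041, 0.001, 0.20, 0.008]
selected = benjamini_hochberg(gene_p_values, alpha=0.05)
print(selected)
```

Genes that survive the correction would then form the ranked list that a clustering step reduces to a small set of exemplars.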
Capitalization of multiple intelligence types during the biology disciplines

Directory of Open Access Journals (Sweden)

Mariana DUMITRU

2010-05-01

Full Text Available The study was conducted on a sample of children at the Lăpuş School (classes I-VIII), focusing on the teaching/learning process in the biology disciplines. A key element in applying the theory of Multiple Intelligences in a classroom is knowing the intelligence profile of the children. A differentiated teaching approach was designed based on the predominant types of intelligence. For this purpose we used various methods: questionnaires, observation of children as they were given various tasks, interviews, development of projects, role play, the biographical method (personal history of the child), analysis of the results of activities (compositions, drawings, collages, portfolios), debates in pair-groups, and case studies. In a child's profile, the types of intelligence become qualities that we capitalize on in training, designing different teaching approaches depending on the predominant types of intelligence. The results appeared without delay. After a school year in which we worked differently with the children, they improved their school performance and became more interested in the study of the biological disciplines, which aroused their curiosity and respect towards life.
Gene regulation and noise reduction by coupling of stochastic processes

Science.gov (United States)

Ramos, Alexandre F.; Hornos, José Eduardo M.; Reinitz, John

2015-02-01

Here we characterize the low-noise regime of a stochastic model for a negative self-regulating binary gene. The model has two stochastic variables, the protein number and the state of the gene. Each state of the gene behaves as a protein source governed by a Poisson process. The coupling between the two gene states depends on protein number. This fact has a very important implication: There exist protein production regimes characterized by sub-Poissonian noise because of negative covariance between the two stochastic variables of the model. Hence the protein numbers obey a probability distribution that has a peak that is sharper than those of the two coupled Poisson processes that are combined to produce it. Biochemically, the noise reduction in protein number occurs when the switching of the genetic state is more rapid than protein synthesis or degradation. We consider the chemical reaction rates necessary for Poisson and sub-Poisson processes in prokaryotes and eucaryotes. Our results suggest that the coupling of multiple stochastic processes in a negative covariance regime might be a widespread mechanism for noise reduction.
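The kind of model described above can be explored numerically. The sketch below is a minimal Gillespie simulation of a two-state self-repressing gene, written under stated assumptions rather than taken from the paper: the parameter names (k_on, k_off, rho, h, f), the rate laws (switch-off rate h*n, switch-on rate f, degradation rate rho*n) and all numeric values are hypothetical. It returns the time-averaged protein number and the Fano factor (variance divided by mean), which equals 1 for a Poisson distribution and falls below 1 for sub-Poissonian noise.

```python
import random


def simulate_binary_gene(k_on, k_off, rho, h, f, t_end, seed=0):
    """Gillespie simulation; returns (mean protein number, Fano factor).

    k_on, k_off: protein synthesis rates in the active / repressed state
    rho: per-protein degradation rate
    h: per-protein rate of switching active -> repressed
    f: rate of switching repressed -> active
    """
    rng = random.Random(seed)
    t, n, gene_on = 0.0, 0, True
    w_sum = n_sum = n2_sum = 0.0  # time-weighted accumulators
    while t < t_end:
        production = k_on if gene_on else k_off
        degradation = rho * n
        switching = h * n if gene_on else f
        total = production + degradation + switching
        # Waiting time to the next reaction, truncated at t_end.
        dt = min(rng.expovariate(total), t_end - t)
        w_sum += dt
        n_sum += n * dt
        n2_sum += n * n * dt
        t += dt
        if t >= t_end:
            break
        # Choose which reaction fires, proportionally to its rate.
        u = rng.random() * total
        if u < production:
            n += 1
        elif u < production + degradation:
            n -= 1
        else:
            gene_on = not gene_on
    mean = n_sum / w_sum
    variance = n2_sum / w_sum - mean * mean
    return mean, variance / mean


# Self-repressing gene with fast switching (hypothetical rates).
mean_fb, fano_fb = simulate_binary_gene(40.0, 5.0, 1.0, 2.0, 50.0, t_end=200.0)
# Constitutive control: h = 0, so the gene never leaves the active state.
mean_c, fano_c = simulate_binary_gene(20.0, 0.0, 1.0, 0.0, 1.0, t_end=1000.0)
print(mean_fb, fano_fb, mean_c, fano_c)
```

The constitutive control is a plain birth-death process, so its Fano factor should come out close to 1; comparing it with the feedback case gives a quick check of whether a chosen parameter set lies in a noise-reducing regime.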
Network-Guided Key Gene Discovery for a Given Cellular Process

DEFF Research Database (Denmark)

He, Feng Q; Ollert, Markus

2018-01-01

Identification of key genes for a given physiological or pathological process is an essential but still very challenging task for the entire biomedical research community. Statistics-based approaches, such as genome-wide association study (GWAS)- or quantitative trait locus (QTL)-related analysis...... have already made enormous contributions to identifying key genes associated with a given disease or phenotype, the success of which is however very much dependent on a huge number of samples. Recent advances in network biology, especially network inference directly from genome-scale data...
Unlocking the treasure trove: from genes to schizophrenia biology.

Science.gov (United States)

McCarthy, Shane E; McCombie, W Richard; Corvin, Aiden

2014-05-01

Significant progress is being made in defining the genetic etiology of schizophrenia. As the list of implicated genes grows, parallel developments in gene editing technology provide new methods to investigate gene function in model systems. The confluence of these two research fields--gene discovery and functional biology--may offer novel insights into schizophrenia etiology. We review recent advances in these fields, consider the likely obstacles to progress, and consider strategies as to how these can be overcome.
Improving functional modules discovery by enriching interaction networks with gene profiles

KAUST Repository

Salem, Saeed

2013-05-01

Recent advances in proteomic and transcriptomic technologies have resulted in the accumulation of vast amounts of high-throughput data that span multiple biological processes and characteristics in different organisms. Much of the data come in the form of interaction networks and mRNA expression arrays. An important task in systems biology is functional module discovery, where the goal is to uncover well-connected sub-networks (modules). These discovered modules help to unravel the underlying mechanisms of the observed biological processes. While most of the existing module discovery methods use only the interaction data, in this work we propose CLARM, which discovers biological modules by incorporating gene profile data with protein-protein interaction networks. We demonstrate the effectiveness of CLARM on Yeast and Human interaction datasets, and gene expression and molecular function profiles. Experiments on these real datasets show that the CLARM approach is competitive with well-established functional module discovery methods.
Gene Module Identification from Microarray Data Using Nonnegative Independent Component Analysis

Directory of Open Access Journals (Sweden)

Ting Gong

2007-01-01

Full Text Available Genes mostly interact with each other to form transcriptional modules for performing single or multiple functions. It is important to unravel such transcriptional modules and to determine how disturbances in them may lead to disease. Here, we propose a non-negative independent component analysis (nICA) approach for transcriptional module discovery. The nICA method utilizes the non-negativity constraint to enforce the independence of biological processes within the participating genes. As such, nICA decomposes the observed gene expression into positive independent components, which fits better to the reality of corresponding putative biological processes. In conjunction with nICA modeling, visual statistical data analyzer (VISDA) is applied to group genes into modules in latent variable space. We demonstrate the usefulness of the approach through the identification of composite modules from yeast data and the discovery of pathway modules in muscle regeneration.
Aberrant gene promoter methylation associated with sporadic multiple colorectal cancer.

Directory of Open Access Journals (Sweden)

Victoria Gonzalo

Full Text Available BACKGROUND: Colorectal cancer (CRC) multiplicity has been mainly related to polyposis and non-polyposis hereditary syndromes. In sporadic CRC, aberrant gene promoter methylation has been shown to play a key role in carcinogenesis, although little is known about its involvement in multiplicity. To assess the effect of methylation in tumor multiplicity in sporadic CRC, hypermethylation of key tumor suppressor genes was evaluated in patients with both multiple and solitary tumors, as a proof-of-concept of an underlying epigenetic defect. METHODOLOGY/PRINCIPAL FINDINGS: We examined a total of 47 synchronous/metachronous primary CRC from 41 patients, and 41 gender, age (5-year intervals) and tumor location-paired patients with solitary tumors. Exclusion criteria were polyposis syndromes, Lynch syndrome and inflammatory bowel disease. DNA methylation at the promoter region of the MGMT, CDKN2A, SFRP1, TMEFF2, HS3ST2 (3OST2), RASSF1A and GATA4 genes was evaluated by quantitative methylation specific PCR in both tumor and corresponding normal appearing colorectal mucosa samples. Overall, patients with multiple lesions exhibited a higher degree of methylation in tumor samples than those with solitary tumors regarding all evaluated genes. After adjusting for age and gender, binomial logistic regression analysis identified methylation of MGMT2 (OR, 1.48; 95% CI, 1.10 to 1.97; p = 0.008) and RASSF1A (OR, 2.04; 95% CI, 1.01 to 4.13; p = 0.047) as variables independently associated with tumor multiplicity, being the risk related to methylation of any of these two genes 4.57 (95% CI, 1.53 to 13.61; p = 0.006). Moreover, in six patients in whom both tumors were available, we found a correlation in the methylation levels of MGMT2 (r = 0.64, p = 0.17), SFRP1 (r = 0.83, p = 0.06), HPP1 (r = 0.64, p = 0.17), 3OST2 (r = 0.83, p = 0.06) and GATA4 (r = 0.6, p = 0.24). Methylation in normal appearing colorectal mucosa from patients with multiple and solitary CRC showed no relevant
Spatiotemporal network motif reveals the biological traits of developmental gene regulatory networks in Drosophila melanogaster

Directory of Open Access Journals (Sweden)

Kim Man-Sun

2012-05-01

Full Text Available Abstract Background Network motifs provided a “conceptual tool” for understanding the functional principles of biological networks, but such motifs have primarily been used to consider static network structures. Static networks, however, cannot be used to reveal time- and region-specific traits of biological systems. To overcome this limitation, we proposed the concept of a “spatiotemporal network motif,” a spatiotemporal sequence of network motifs of sub-networks which are active only at specific time points and body parts. Results On the basis of this concept, we analyzed the developmental gene regulatory network of the Drosophila melanogaster embryo. We identified spatiotemporal network motifs and investigated their distribution pattern in time and space. As a result, we found how key developmental processes are temporally and spatially regulated by the gene network. In particular, we found that nested feedback loops appeared frequently throughout the entire developmental process. From mathematical simulations, we found that mutual inhibition in the nested feedback loops contributes to the formation of spatial expression patterns. Conclusions Taken together, the proposed concept and the simulations can be used to unravel the design principle of developmental gene regulatory networks.
Diversity from genes to ecosystems: A unifying framework to study variation across biological metrics and scales

Science.gov (United States)

Biological diversity is a key concept in the life sciences and plays a fundamental role in many ecological and evolutionary processes. Although biodiversity is inherently a hierarchical concept covering different levels of organization (genes, population, species, ecological communities and ecosyst...
miRNA-Processing Gene Methylation and Cancer Risk.

Science.gov (United States)

Joyce, Brian T; Zheng, Yinan; Zhang, Zhou; Liu, Lei; Kocherginsky, Masha; Murphy, Robert; Achenbach, Chad J; Musa, Jonah; Wehbe, Firas; Just, Allan; Shen, Jincheng; Vokonas, Pantel; Schwartz, Joel; Baccarelli, Andrea A; Hou, Lifang

2018-05-01

Background: Dysregulation of miRNA and methylation levels are epigenetic hallmarks of cancer, potentially linked via miRNA-processing genes. Studies have found genetic alterations to miRNA-processing genes in cancer cells and human population studies. Our objective was to prospectively examine changes in DNA methylation of miRNA-processing genes and their associations with cancer risk. Methods: We examined cohort data from the Department of Veterans' Affairs Normative Aging Study. Participants were assessed every 3 to 5 years starting in 1999 through 2013 including questionnaires, medical record review, and blood collection. Blood from 686 consenting participants was analyzed using the Illumina 450K BeadChip array to measure methylation at CpG sites throughout the genome. We selected 19 genes based on a literature review, with 519 corresponding CpG sites. We then used Cox proportional hazards models to examine associations with cancer incidence, and generalized estimating equations to examine associations with cancer prevalence. Associations at false discovery rate time to cancer development (positively for cg06751583, inversely for cg23230564 and cg21034183), whereas methylation of one CpG site (DROSHA: cg16131300) was positively associated with cancer prevalence. Conclusions: DNA methylation of DROSHA, a key miRNA-processing gene, and TNRC6B may play a role in early carcinogenesis. Impact: Changes in miRNA processing may exert multiple effects on cancer development, including protecting against it via altered global miRNAs, and may be a useful early detection biomarker of cancer. Cancer Epidemiol Biomarkers Prev; 27(5); 550-7. ©2018 AACR. ©2018 American Association for Cancer Research.
PhysioSpace: relating gene expression experiments from heterogeneous sources using shared physiological processes.

Directory of Open Access Journals (Sweden)

Michael Lenz

Full Text Available Relating expression signatures from different sources such as cell lines, in vitro cultures from primary cells and biopsy material is an important task in drug development and translational medicine as well as for tracking of cell fate and disease progression. Especially the comparison of large scale gene expression changes to tissue or cell type specific signatures is of high interest for the tracking of cell fate in (trans-)differentiation experiments and for cancer research, which increasingly focuses on shared processes and the involvement of the microenvironment. These signature relation approaches require robust statistical methods to account for the high biological heterogeneity in clinical data and must cope with small sample sizes in lab experiments and common patterns of co-expression in ubiquitous cellular processes. We describe a novel method, called PhysioSpace, to position dynamics of time series data derived from cellular differentiation and disease progression in a genome-wide expression space. The PhysioSpace is defined by a compendium of publicly available gene expression signatures representing a large set of biological phenotypes. The mapping of gene expression changes onto the PhysioSpace leads to a robust ranking of physiologically relevant signatures, as rigorously evaluated via sample-label permutations. A spherical transformation of the data improves the performance, leading to stable results even in case of small sample sizes. Using PhysioSpace with clinical cancer datasets reveals that such data exhibits large heterogeneity in the number of significant signature associations. This behavior was closely associated with the classification endpoint and cancer type under consideration, indicating shared biological functionalities in disease associated processes. Even though the time series data of cell line differentiation exhibited responses in larger clusters covering several biologically related patterns, top scoring
GeneLab: A Systems Biology Platform for Spaceflight Omics Data

Science.gov (United States)

Reinsch, Sigrid S.; Lai, San-Huei; Chen, Rick; Thompson, Terri; Berrios, Daniel; Fogle, Homer; Marcu, Oana; Timucin, Linda; Chakravarty, Kaushik; Coughlan, Joseph

2015-01-01

NASA's mission includes expanding our understanding of biological systems to improve life on Earth and to enable long-duration human exploration of space. Resources to support large numbers of spaceflight investigations are limited. NASA's GeneLab project is maximizing the science output from these experiments by: (1) developing a unique public bioinformatics database that includes space bioscience relevant "omics" data (genomics, transcriptomics, proteomics, and metabolomics) and experimental metadata; (2) partnering with NASA-funded flight experiments through bio-sample sharing or sample augmentation to expedite omics data input to the GeneLab database; and (3) developing community-driven reference flight experiments. The first database, GeneLab Data System Version 1.0, went online in April 2015. V1.0 contains numerous flight datasets and has search and download capabilities. Version 2.0 will be released in 2016 and will link to analytic tools. In 2015 GeneLab partnered with two Biological Research in Canisters experiments (BRIC-19 and BRIC-20) which examine responses of Arabidopsis thaliana to spaceflight. GeneLab also partnered with Rodent Research-1 (RR1), the maiden flight to test the newly developed rodent habitat. GeneLab developed protocols for maximum yield of RNA, DNA and protein from precious RR-1 tissues harvested and preserved during the SpaceX-4 mission, as well as from tissues from mice that were frozen intact during spaceflight and later dissected. GeneLab is establishing partnerships with at least three planned flights for 2016. Organism-specific nationwide Science Definition Teams (SDTs) will define future GeneLab dedicated missions and ensure the broader scientific impact of the GeneLab missions. GeneLab ensures prompt release and open access to all high-throughput omics data from spaceflight and ground-based simulations of microgravity and radiation. Overall, GeneLab will facilitate the generation and query of parallel multi-omics data, and
Using Osteoclast Differentiation as a Model for Gene Discovery in an Undergraduate Cell Biology Laboratory

Science.gov (United States)

Birnbaum, Mark J.; Picco, Jenna; Clements, Meghan; Witwicka, Hanna; Yang, Meiheng; Hoey, Margaret T.; Odgren, Paul R.

2010-01-01

A key goal of molecular/cell biology/biotechnology is to identify essential genes in virtually every physiological process to uncover basic mechanisms of cell function and to establish potential targets of drug therapy combating human disease. This article describes a semester-long, project-oriented molecular/cellular/biotechnology laboratory…
Gene dosage, expression, and ontology analysis identifies driver genes in the carcinogenesis and chemoradioresistance of cervical cancer.

Directory of Open Access Journals (Sweden)

Malin Lando

2009-11-01

Full Text Available Integrative analysis of gene dosage, expression, and ontology (GO) data was performed to discover driver genes in the carcinogenesis and chemoradioresistance of cervical cancers. Gene dosage and expression profiles of 102 locally advanced cervical cancers were generated by microarray techniques. Fifty-two of these patients were also analyzed with the Illumina expression method to confirm the gene expression results. An independent cohort of 41 patients was used for validation of gene expressions associated with clinical outcome. Statistical analysis identified 29 recurrent gains and losses and 3 losses (on 3p, 13q, 21q) associated with poor outcome after chemoradiotherapy. The intratumor heterogeneity, assessed from the gene dosage profiles, was low for these alterations, showing that they had emerged prior to many other alterations and probably were early events in carcinogenesis. Integration of the alterations with gene expression and GO data identified genes that were regulated by the alterations and revealed five biological processes that were significantly overrepresented among the affected genes: apoptosis, metabolism, macromolecule localization, translation, and transcription. Four genes on 3p (RYBP, GBE1) and 13q (FAM48A, MED4) correlated with outcome at both the gene dosage and expression level and were satisfactorily validated in the independent cohort. These integrated analyses yielded 57 candidate drivers of 24 genetic events, including novel loci responsible for chemoradioresistance. Further mapping of the connections among genetic events, drivers, and biological processes suggested that each individual event stimulates specific processes in carcinogenesis through the coordinated control of multiple genes. The present results may provide novel therapeutic opportunities of both early and advanced stage cervical cancers.
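Overrepresentation of a GO biological process among a set of affected genes is commonly assessed with a one-sided hypergeometric test. The sketch below shows that calculation under stated assumptions: the abstract does not specify which test the authors used, and the function name and all counts in the usage example are hypothetical.

```python
from math import comb


def enrichment_p_value(population, annotated, selected, overlap):
    """One-sided hypergeometric p-value P(X >= overlap).

    population: number of genes in the background
    annotated: background genes annotated with the GO term
    selected: number of affected genes
    overlap: affected genes annotated with the GO term
    """
    total = comb(population, selected)
    tail = sum(
        comb(annotated, i) * comb(population - annotated, selected - i)
        for i in range(overlap, min(annotated, selected) + 1)
    )
    return tail / total


# Hypothetical counts: 20000 background genes, 300 annotated with the term,
# 60 affected genes, 6 of which carry the annotation.
p = enrichment_p_value(20000, 300, 60, 6)
print(p)
```

When many GO terms are tested at once, the resulting p-values would normally be corrected for multiple testing before a term is called overrepresented.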
Functional genes to assess nitrogen cycling and aromatic hydrocarbon degradation: primers and processing matter

Directory of Open Access Journals (Sweden)

Christopher Ryan Penton

2013-09-01

Full Text Available Targeting sequencing to genes involved in key environmental processes, i.e. ecofunctional genes, provides an opportunity to sample nature’s gene guilds to greater depth and help link community structure to process-level outcomes. Vastly different approaches have been implemented for sequence processing and, ultimately, for taxonomic placement of these gene reads. The overall quality of next generation sequence analysis of functional genes is dependent on multiple steps and assumptions of unknown diversity. To illustrate current issues surrounding amplicon read processing we provide examples for three ecofunctional gene groups. A combination of in-silico, environmental and cultured strain sequences was used to test new primers targeting the dioxin and dibenzofuran degrading genes dxnA1, dbfA1, and carAa. The majority of obtained environmental sequences were classified into novel sequence clusters, illustrating the discovery value of the approach. For the nitrite reductase step in denitrification, the well-known nirK primers exhibited deficiencies in reference database coverage, illustrating the need to refine primer-binding sites and/or to design multiple primers, while nirS primers exhibited bias against five phyla. Amino acid-based OTU clustering of these two N-cycle genes from soil samples yielded only 114 unique nirK and 45 unique nirS genus-level groupings, likely a reflection of constricted primer coverage. Finally, supervised and non-supervised OTU analysis methods were compared using the nifH gene of nitrogen fixation, with generally similar outcomes, but the clustering (non-supervised method yielded higher diversity estimates and stronger site-based differences. High throughput amplicon sequencing can provide inexpensive and rapid access to nature’s related sequences by circumventing the culturing barrier, but each unique gene requires individual considerations in terms of primer design and sequence processing and classification.
Adaptive Horizontal Gene Transfers between Multiple Cheese-Associated Fungi.

Science.gov (United States)

Ropars, Jeanne; Rodríguez de la Vega, Ricardo C; López-Villavicencio, Manuela; Gouzy, Jérôme; Sallet, Erika; Dumas, Émilie; Lacoste, Sandrine; Debuchy, Robert; Dupont, Joëlle; Branca, Antoine; Giraud, Tatiana

2015-10-05

Domestication is an excellent model for studies of adaptation because it involves recent and strong selection on a few, identified traits [1-5]. Few studies have focused on the domestication of fungi, with notable exceptions [6-11], despite their importance to bioindustry [12] and to a general understanding of adaptation in eukaryotes [5]. Penicillium fungi are ubiquitous molds among which two distantly related species have been independently selected for cheese making: P. roqueforti for blue cheeses like Roquefort and P. camemberti for soft cheeses like Camembert. The selected traits include morphology, aromatic profile, lipolytic and proteolytic activities, and ability to grow at low temperatures, in a matrix containing bacterial and fungal competitors [13-15]. By comparing the genomes of ten Penicillium species, we show that adaptation to cheese was associated with multiple recent horizontal transfers of large genomic regions carrying crucial metabolic genes. We identified seven horizontally transferred regions (HTRs) spanning more than 10 kb each, flanked by specific transposable elements, and displaying nearly 100% identity between distant Penicillium species. Two HTRs carried genes with functions involved in the utilization of cheese nutrients or competition and were found nearly identical in multiple strains and species of cheese-associated Penicillium fungi, indicating recent selective sweeps; they were experimentally associated with faster growth and greater competitiveness on cheese and contained genes highly expressed in the early stage of cheese maturation. These findings have industrial and food safety implications and improve our understanding of the processes of adaptation to rapid environmental changes. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.

A strategy for full interrogation of prognostic gene expression patterns: exploring the biology of diffuse large B cell lymphoma.

Directory of Open Access Journals (Sweden)

Lisa M Rimsza

Full Text Available Gene expression profiling yields quantitative data on gene expression used to create prognostic models that accurately predict patient outcome in diffuse large B cell lymphoma (DLBCL). Often, data are analyzed with genes classified by whether they fall above or below the median expression level. We sought to determine whether examining multiple cut-points might be a more powerful technique to investigate the association of gene expression with outcome. We explored gene expression profiling data using variable cut-point analysis for 36 genes with reported prognostic value in DLBCL. We plotted two-group survival logrank test statistics against corresponding cut-points of the gene expression levels and smooth estimates of the hazard ratio of death versus gene expression levels. To facilitate comparisons we also standardized the expression of each of the genes by the fraction of patients that would be identified by any cut-point. A multiple comparison adjusted permutation p-value identified 3 different patterns of significance: (1) genes with significant cut-points below the median, whose loss is associated with poor outcome (e.g. HLA-DR); (2) genes with significant cut-points above the median, whose over-expression is associated with poor outcome (e.g. CCND2); and (3) genes with significant cut-points on either side of the median (e.g. extracellular molecules such as FN1). Variable cut-point analysis with permutation p-value calculation can be used to identify significant genes that would not otherwise be identified with median cut-points and may suggest biological patterns of gene effects.
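The variable cut-point idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`logrank_chi2`, `scan_cutpoints`, `max_stat_permutation_p`), the `min_frac` threshold and the toy data are all assumptions made for illustration. The sketch computes a two-group logrank statistic at every candidate cut-point, records the fraction of patients above the cut-point, and estimates a permutation p-value for the maximum statistic so that the scan over many cut-points is accounted for.

```python
import random

def logrank_chi2(times, events, in_high):
    """Two-group logrank chi-square statistic (1 degree of freedom)."""
    event_times = sorted({t for t, e in zip(times, events) if e})
    o_minus_e, var = 0.0, 0.0
    for t in event_times:
        n = sum(1 for ti in times if ti >= t)                      # at risk, both groups
        n1 = sum(1 for ti, g in zip(times, in_high) if ti >= t and g)  # at risk, high group
        d = sum(1 for ti, e in zip(times, events) if ti == t and e)    # deaths at t
        d1 = sum(1 for ti, e, g in zip(times, events, in_high)
                 if ti == t and e and g)                           # deaths at t, high group
        o_minus_e += d1 - d * n1 / n
        if n > 1:
            var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return o_minus_e ** 2 / var if var > 0 else 0.0

def scan_cutpoints(expr, times, events, min_frac=0.1):
    """Return (cut-point, fraction of patients above it, logrank statistic)."""
    results = []
    for c in sorted(set(expr)):
        in_high = [x > c for x in expr]
        frac = sum(in_high) / len(in_high)
        if min_frac <= frac <= 1 - min_frac:   # skip very unbalanced splits
            results.append((c, frac, logrank_chi2(times, events, in_high)))
    return results

def max_stat_permutation_p(expr, times, events, n_perm=200, min_frac=0.1, seed=0):
    """Permutation p-value for the maximum statistic over all cut-points."""
    rng = random.Random(seed)
    def max_stat(t, e):
        return max((s for _, _, s in scan_cutpoints(expr, t, e, min_frac)), default=0.0)
    observed = max_stat(times, events)
    outcomes = list(zip(times, events))
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(outcomes)                  # break the expression-outcome link
        t, e = zip(*outcomes)
        if max_stat(t, e) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Hypothetical toy data: higher expression goes with earlier death.
expr = [1, 2, 3, 4, 5, 6, 7, 8]
times = [10, 9, 8, 7, 4, 3, 2, 1]
events = [1] * 8
scan = scan_cutpoints(expr, times, events, min_frac=0.25)
p_value = max_stat_permutation_p(expr, times, events, n_perm=50, min_frac=0.25)
```

Plotting the third element of each tuple in `scan` against the second gives the standardized view the abstract mentions, in which genes are compared by the fraction of patients a cut-point would identify.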
Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes

Science.gov (United States)

2010-01-01

Background Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. The discovery of gene conversion between co-resident foreign and native mitochondrial copies suggests
Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes

Directory of Open Access Journals (Sweden)

Hao Weilong

2010-12-01

Full Text Available Abstract Background Horizontal gene transfer (HGT) is relatively common in plant mitochondrial genomes but the mechanisms, extent and consequences of transfer remain largely unknown. Previous results indicate that parasitic plants are often involved as either transfer donors or recipients, suggesting that direct contact between parasite and host facilitates genetic transfer among plants. Results In order to uncover the mechanistic details of plant-to-plant HGT, the extent and evolutionary fate of transfer was investigated between two groups: the parasitic genus Cuscuta and a small clade of Plantago species. A broad polymerase chain reaction (PCR) survey of mitochondrial genes revealed that at least three genes (atp1, atp6 and matR) were recently transferred from Cuscuta to Plantago. Quantitative PCR assays show that these three genes have a mitochondrial location in the one species line of Plantago examined. Patterns of sequence evolution suggest that these foreign genes degraded into pseudogenes shortly after transfer and reverse transcription (RT)-PCR analyses demonstrate that none are detectably transcribed. Three cases of gene conversion were detected between native and foreign copies of the atp1 gene. The identical phylogenetic distribution of the three foreign genes within Plantago and the retention of cytidines at ancestral positions of RNA editing indicate that these genes were probably acquired via a single, DNA-mediated transfer event. However, samplings of multiple individuals from two of the three species in the recipient Plantago clade revealed complex and perplexing phylogenetic discrepancies and patterns of sequence divergence for all three of the foreign genes. Conclusions This study reports the best evidence to date that multiple mitochondrial genes can be transferred via a single HGT event and that transfer occurred via a strictly DNA-level intermediate. The discovery of gene conversion between co-resident foreign and native
A swarm intelligence framework for reconstructing gene networks: searching for biologically plausible architectures.

Science.gov (United States)

Kentzoglanakis, Kyriakos; Poole, Matthew

2012-01-01

In this paper, we investigate the problem of reverse engineering the topology of gene regulatory networks from temporal gene expression data. We adopt a computational intelligence approach comprising swarm intelligence techniques, namely particle swarm optimization (PSO) and ant colony optimization (ACO). In addition, the recurrent neural network (RNN) formalism is employed for modeling the dynamical behavior of gene regulatory systems. More specifically, ACO is used for searching the discrete space of network architectures and PSO for searching the corresponding continuous space of RNN model parameters. We propose a novel solution construction process in the context of ACO for generating biologically plausible candidate architectures. The objective is to concentrate the search effort into areas of the structure space that contain architectures which are feasible in terms of their topological resemblance to real-world networks. The proposed framework is initially applied to the reconstruction of a small artificial network that has previously been studied in the context of gene network reverse engineering. Subsequently, we consider an artificial data set with added noise for reconstructing a subnetwork of the genetic interaction network of S. cerevisiae (yeast). Finally, the framework is applied to a real-world data set for reverse engineering the SOS response system of the bacterium Escherichia coli. Results demonstrate the relative advantage of utilizing problem-specific knowledge regarding biologically plausible structural properties of gene networks over conducting a problem-agnostic search in the vast space of network architectures.
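As a rough illustration of the continuous-parameter search mentioned in this abstract, the following is a generic particle swarm optimization (PSO) sketch. It is not the authors' framework: it omits the ant colony search over architectures and the recurrent neural network model entirely, and the function name `pso_minimize`, the inertia and acceleration values, and the toy objective are assumptions chosen for illustration.

```python
import random

def pso_minimize(objective, dim, n_particles=30, n_iter=200,
                 bounds=(-5.0, 5.0), w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` over a box with a basic global-best PSO."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                    # best position seen by each particle
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]   # best position seen by the swarm

    for _ in range(n_iter):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # inertia + pull toward personal best + pull toward global best
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val

# Toy objective standing in for a model-fit error (hypothetical).
best_params, best_error = pso_minimize(lambda x: sum(v * v for v in x), dim=2)
```

In a setting like the one the abstract describes, the objective would instead measure how well a model with the candidate parameters reproduces the observed expression time series.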
Gene-environment interaction and biological monitoring of occupational exposures

International Nuclear Information System (INIS)

Hirvonen, Ari

2005-01-01

Biological monitoring methods and biological limit values applied in occupational and environmental medicine have been traditionally developed on the assumption that individuals do not differ significantly in their biotransformation capacities. It has become clear, however, that this is not the case: wide inter-individual differences exist in the metabolism of chemicals. Integration of the data on individual metabolic capacity in biological monitoring studies is therefore anticipated to represent a significant refinement of the currently used methods. We have recently conducted several biological monitoring studies on occupationally exposed subjects, which have included the determination of the workers' genotypes for the metabolic genes of potential importance for a given chemical exposure. The exposure levels have been measured by urine metabolites, adducts in blood macromolecules, and cytogenetic alterations in lymphocytes. Our studies indicate that genetic polymorphisms in metabolic genes may indeed be important modifiers of individual biological monitoring results of, e.g., carbon disulphide and styrene. The information is anticipated to be useful in ensuring that the workplace is safe for everyone, including the most sensitive individuals. This knowledge could also be useful to occupational physicians, industrial hygienists, and regulatory bodies in charge of defining acceptable exposure limits for environmental and/or occupational pollutants.
An Approach for Predicting Essential Genes Using Multiple Homology Mapping and Machine Learning Algorithms.

Science.gov (United States)

Hua, Hong-Li; Zhang, Fa-Zhan; Labena, Abraham Alemayehu; Dong, Chuan; Jin, Yan-Ting; Guo, Feng-Biao

Investigating essential genes is important for understanding the minimal gene set of a cell and for discovering potential drug targets. In this study, a novel approach based on multiple homology mapping and machine learning was introduced to predict essential genes. We focused on 25 bacteria with characterized essential genes. The predictions yielded a highest area under the receiver operating characteristic (ROC) curve (AUC) of 0.9716 in a tenfold cross-validation test. Proper features were utilized to construct models to make predictions in distantly related bacteria. The accuracy of predictions was evaluated via the consistency between the predictions and the known essential genes of the target species. The highest AUC of 0.9552 and an average AUC of 0.8314 were achieved when making predictions across organisms. An independent, recently released dataset from Synechococcus elongatus was obtained for further assessment of the performance of our model. The AUC score of the predictions is 0.7855, which is higher than that of other methods. This research shows that features obtained by homology mapping alone can achieve results as good as, or even better than, those obtained with integrated features. Meanwhile, the work indicates that a machine learning-based method can assign more effective weight coefficients than an empirical formula based on biological knowledge.
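The AUC figures reported in this abstract can be read as the probability that a randomly chosen essential gene receives a higher score than a randomly chosen non-essential gene. A minimal sketch of that calculation follows; the function name `roc_auc` and the toy labels and scores are assumptions for illustration and do not reproduce the study's data or code.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for l, s in zip(labels, scores) if l]
    neg = [s for l, s in zip(labels, scores) if not l]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical example: 1 = essential gene, 0 = non-essential gene.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
auc = roc_auc(labels, scores)  # 3 of the 4 pairs are ordered correctly
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives context for values such as 0.7855 on an independent dataset.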
Discovering biological progression underlying microarray samples.

Directory of Open Access Journals (Sweden)

Peng Qiu

2011-04-01

Full Text Available In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the
Quantum selfish gene (biological evolution in terms of quantum mechanics)

OpenAIRE

Ozhigov, Yuri I.

2013-01-01

I propose to treat the biological evolution of genomes by means of quantum mechanical tools. We start with the concept of the meta-gene, which specifies the "selfish gene" of R. Dawkins. A meta-gene encodes an abstract living unit, which can live relatively independently of the others and can contain a few real creatures. We treat each population of living creatures as a wave function on meta-genes, whose modulus squared is the total number of creatures with the given meta-gene, and the phase ...
Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

Science.gov (United States)

Hassani-Pak, Keywan; Rawlings, Christopher

2017-06-13

Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.
Modelling biological processes in WWTP; Modelado de procesos biologicos en las EDAR

Energy Technology Data Exchange (ETDEWEB)

Carpes, G.

2009-07-01

Activated sludge processes are the most widely used biological technologies in wastewater treatment. Multiple variables affect the characterization of this process, such as wastewater treatment plant (WWTP) design, sludge characteristics and concentration, dissolved oxygen concentration, and characteristics of the wastewater, including temperature and nutrients. Mathematical formulas applied to WWTP modelling are presented for designing plant operation and for testing the most important parameters. Optimizing the process in WWTPs is necessary. (Author) 19 refs.
The multiple roles of hypothetical gene BPSS1356 in Burkholderia pseudomallei.

Directory of Open Access Journals (Sweden)

Hokchai Yam

Full Text Available Burkholderia pseudomallei is an opportunistic pathogen and the causative agent of melioidosis. It is able to adapt to harsh environments and can live intracellularly in its infected hosts. In this study, identification of transcriptional factors that associate with the β' subunit (RpoC) of RNA polymerase was performed. The N-terminal region of this subunit is known to trigger promoter melting when associated with a sigma factor. A pull-down assay using the histidine-tagged B. pseudomallei RpoC N-terminal region as bait showed that a hypothetical protein, BPSS1356, was one of the proteins bound. This hypothetical protein is conserved in all B. pseudomallei strains and present only in the Burkholderia genus. A BPSS1356 deletion mutant was generated to investigate its biological function. The mutant strain exhibited reduced biofilm formation and a lower cell density during the stationary phase of growth in LB medium. Electron microscopic analysis revealed that the ΔBPSS1356 mutant cells had a shrunken cytoplasm indicative of cell plasmolysis and a rougher surface when compared to the wild type. An RNA microarray result showed that a total of 63 genes were transcriptionally affected by the BPSS1356 deletion with fold change values higher than 4. The expression of a group of genes encoding membrane-located transporters was concurrently down-regulated in the ΔBPSS1356 mutant. Amongst the affected genes, the putative ion transportation genes were the most severely suppressed. Deprivation of BPSS1356 also down-regulated the transcription of genes for the arginine deiminase system, glycerol metabolism, type III secretion system cluster 2, cytochrome bd oxidase and arsenic resistance. It is therefore evident that BPSS1356 plays multiple regulatory roles on many genes.
Functional knowledge transfer for high-accuracy prediction of under-studied biological processes.

Directory of Open Access Journals (Sweden)

Christopher Y Park

Full Text Available A key challenge in genetics is identifying the functional roles of genes in pathways. Numerous functional genomics techniques (e.g. machine learning that predicts protein function) have been developed to address this question. These methods generally build from existing annotations of genes to pathways and thus are often unable to identify additional genes participating in processes that are not already well studied. Many of these processes are well studied in some organism, but not necessarily in an investigator's organism of interest. Sequence-based search methods (e.g. BLAST) have been used to transfer such annotation information between organisms. We demonstrate that functional genomics can complement traditional sequence similarity to improve the transfer of gene annotations between organisms. Our method transfers annotations only when functionally appropriate as determined by genomic data and can be used with any prediction algorithm to combine transferred gene function knowledge with organism-specific high-throughput data to enable accurate function prediction. We show that diverse state-of-the-art machine learning algorithms leveraging functional knowledge transfer (FKT) dramatically improve their accuracy in predicting gene-pathway membership, particularly for processes with little experimental knowledge in an organism. We also show that our method compares favorably to annotation transfer by sequence similarity. Next, we deploy FKT with a state-of-the-art SVM classifier to predict novel genes to 11,000 biological processes across six diverse organisms and expand the coverage of accurate function predictions to processes that are often ignored because of a dearth of annotated genes in an organism. Finally, we perform in vivo experimental investigation in Danio rerio and confirm the regulatory role of our top predicted novel gene, wnt5b, in leftward cell migration during heart development. FKT is immediately applicable to many bioinformatics
Towards Integration of Biological and Physiological Functions at Multiple Levels

Directory of Open Access Journals (Sweden)

Taishin Nomura

2010-12-01

Full Text Available An aim of systems physiology today can be stated as establishing logical and quantitative bridges between phenomenological attributes of physiological entities such as cells and organs and physical attributes of biological entities, i.e., biological molecules, allowing us to describe and better understand physiological functions in terms of underlying biological functions. This article illustrates possible schemas that can be used for promoting systems physiology by integrating quantitative knowledge of biological and physiological functions at multiple levels of time and space with the use of information technology infrastructure. Emphasis is placed on systematic, modular, hierarchical, and standardized descriptions of mathematical models of the functions and on the advantages of using them.
Tunable promoters in synthetic and systems biology

DEFF Research Database (Denmark)

Dehli, Tore; Solem, Christian; Jensen, Peter Ruhdal

2012-01-01

Synthetic and systems biologists need standardized, modular and orthogonal tools yielding predictable functions in vivo. In systems biology such tools are needed to quantitatively analyze the behavior of biological systems while the efficient engineering of artificial gene networks is central in synthetic biology. A number of tools exist to manipulate the steps in between gene sequence and functional protein in living cells, but out of these the most straightforward approach is to alter the gene expression level by manipulating the promoter sequence. Some of the promoter tuning tools available for accomplishing such altered gene expression levels are discussed here along with examples of their use, and ideas for new tools are described. The road ahead looks very promising for synthetic and systems biologists as tools to achieve just about anything in terms of tuning and timing multiple gene expression...
Gene prediction using the Self-Organizing Map: automatic generation of multiple gene models.

Science.gov (United States)

Mahony, Shaun; McInerney, James O; Smith, Terry J; Golden, Aaron

2004-03-05

Many current gene prediction methods use only one model to represent protein-coding regions in a genome, and so are less likely to predict the location of genes that have an atypical sequence composition. It is likely that future improvements in gene finding will involve the development of methods that can adequately deal with intra-genomic compositional variation. This work explores a new approach to gene-prediction, based on the Self-Organizing Map, which has the ability to automatically identify multiple gene models within a genome. The current implementation, named RescueNet, uses relative synonymous codon usage as the indicator of protein-coding potential. While its raw accuracy rate can be less than other methods, RescueNet consistently identifies some genes that other methods do not, and should therefore be of interest to gene-prediction software developers and genome annotation teams alike. RescueNet is recommended for use in conjunction with, or as a complement to, other gene prediction methods.
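The indicator named in this abstract, relative synonymous codon usage (RSCU), is conventionally defined as the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below computes it for a single in-frame sequence using the standard genetic code. It is an illustration only and says nothing about how RescueNet itself is implemented; the function name `rscu` and the toy sequence are assumptions.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks stop codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Group synonymous codons by amino acid, leaving out stop codons.
SYNONYMS = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    if aa != "*":
        SYNONYMS[aa].append(codon)

def rscu(seq):
    """Return {codon: RSCU} for every amino acid observed in an in-frame sequence."""
    seq = seq.upper().replace("U", "T")
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    counts = Counter(c for c in codons if CODON_TABLE.get(c, "*") != "*")
    values = {}
    for aa, family in SYNONYMS.items():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue                      # amino acid not observed in this sequence
        for c in family:
            # observed count divided by the count expected under equal usage
            values[c] = counts[c] * len(family) / total
    return values

# Toy example: four leucine codons, three CTG and one TTA.
usage = rscu("CTGCTGCTGTTA")
```

Values above 1 mark codons used more often than expected under equal usage, and values below 1 mark codons used less often. A vector of such values per candidate region is the kind of input a clustering method such as a Self-Organizing Map could work with.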
Simulation and estimation of gene number in a biological pathway using almost complete saturation mutagenesis screening of haploid mouse cells.

Science.gov (United States)

Tokunaga, Masahiro; Kokubu, Chikara; Maeda, Yusuke; Sese, Jun; Horie, Kyoji; Sugimoto, Nakaba; Kinoshita, Taroh; Yusa, Kosuke; Takeda, Junji

2014-11-24

Genome-wide saturation mutagenesis and subsequent phenotype-driven screening has been central to a comprehensive understanding of complex biological processes in classical model organisms such as flies, nematodes, and plants. The degree of "saturation" (i.e., the fraction of possible target genes identified) has been shown to be a critical parameter in determining all relevant genes involved in a biological function, without prior knowledge of their products. In mammalian model systems, however, the relatively large scale and labor intensity of experiments have hampered the achievement of actual saturation mutagenesis, especially for recessive traits that require biallelic mutations to manifest detectable phenotypes. By exploiting the recently established haploid mouse embryonic stem cells (ESCs), we present an implementation of almost complete saturation mutagenesis in a mammalian system. The haploid ESCs were mutagenized with the chemical mutagen N-ethyl-N-nitrosourea (ENU) and processed for the screening of mutants defective in various steps of the glycosylphosphatidylinositol-anchor biosynthetic pathway. The resulting 114 independent mutant clones were characterized by a functional complementation assay, and were shown to be defective in any of 20 genes among all 22 known genes essential for this well-characterized pathway. Ten mutants were further validated by whole-exome sequencing. The predominant generation of single-nucleotide substitutions by ENU resulted in a gene mutation rate proportional to the length of the coding sequence, which facilitated the experimental design of saturation mutagenesis screening with the aid of computational simulation. Our study enables mammalian saturation mutagenesis to become a realistic proposition. Computational simulation, combined with a pilot mutagenesis experiment, could serve as a tool for the estimation of the number of genes essential for biological processes such as drug target pathways when a positive selection of
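The relationship this abstract describes between screen size and saturation can be sketched with a small simulation. It assumes, as the abstract states, that each gene is hit at a rate proportional to its coding sequence length; everything else (function names, the 22 hypothetical coding lengths, the number of trials) is an assumption for illustration and not the authors' simulation.

```python
import random

def expected_genes_hit(cds_lengths, n_mutants):
    """Expected number of distinct genes hit at least once by n_mutants mutants."""
    total = sum(cds_lengths)
    return sum(1 - (1 - length / total) ** n_mutants for length in cds_lengths)

def simulate_genes_hit(cds_lengths, n_mutants, n_trials=2000, seed=0):
    """Monte Carlo estimate of the same quantity."""
    rng = random.Random(seed)
    genes = range(len(cds_lengths))
    total_hit = 0
    for _ in range(n_trials):
        # each mutant hits one gene, chosen with probability proportional to CDS length
        hits = rng.choices(genes, weights=cds_lengths, k=n_mutants)
        total_hit += len(set(hits))
    return total_hit / n_trials

# Hypothetical coding-sequence lengths (bp) for a 22-gene pathway.
lengths = [600 + 150 * i for i in range(22)]
analytic = expected_genes_hit(lengths, n_mutants=114)
simulated = simulate_genes_hit(lengths, n_mutants=114)
saturation = analytic / len(lengths)   # expected fraction of pathway genes recovered
```

Running the same functions for a range of `n_mutants` values gives a saturation curve, which is one way a pilot experiment could be used to judge how many mutants a screen needs.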
Partitioning of genomic variance using biological pathways

DEFF Research Database (Denmark)

Edwards, Stefan McKinnon; Janss, Luc; Madsen, Per

...and that these variants are enriched for genes that are connected in biological pathways or for likely functional effects on genes. These biological findings provide valuable insight for developing better genomic models. These are statistical models for predicting complex trait phenotypes on the basis of SNP-data and trait phenotypes and can account for a much larger fraction of the heritable component. A disadvantage is that this “black-box” modelling approach conceals the biological mechanisms underlying the trait. We propose to open the “black-box” by building SNP-set genomic models that evaluate the collective action of multiple SNPs in genes, biological pathways or other external findings on the trait phenotype. As proof of concept we have tested the modelling framework on several traits in dairy cattle.
Genes involved in complex adaptive processes tend to have highly conserved upstream regions in mammalian genomes

Directory of Open Access Journals (Sweden)

Kohane Isaac

2005-11-01

Full Text Available Abstract Background Recent advances in genome sequencing suggest a remarkable conservation in gene content of mammalian organisms. The similarity in gene repertoire present in different organisms has increased interest in studying regulatory mechanisms of gene expression aimed at elucidating the differences in phenotypes. In particular, a proximal promoter region contains a large number of regulatory elements that control the expression of its downstream gene. Although many studies have focused on identification of these elements, a broader picture of the complexity of transcriptional regulation of different biological processes has not been addressed in mammals. The regulatory complexity may strongly correlate with gene function, as different evolutionary forces must act on the regulatory systems under different biological conditions. We investigate this hypothesis by comparing the conservation of promoters upstream of genes classified in different functional categories. Results By conducting a rank correlation analysis between functional annotation and upstream sequence alignment scores obtained by human-mouse and human-dog comparison, we found a significantly greater conservation of the upstream sequence of genes involved in development, cell communication, neural functions and signaling processes than those involved in more basic processes shared with unicellular organisms such as metabolism and ribosomal function. This observation persists after controlling for G+C content. Considering conservation as a functional signature, we hypothesize a higher density of cis-regulatory elements upstream of genes participating in complex and adaptive processes. Conclusion We identified a class of functions that are associated with either high or low promoter conservation in mammals. We detected a significant tendency for complex and adaptive processes to be associated with higher promoter conservation, despite the fact that they have emerged
Comprehensive Transcriptome Analysis of Sex-Biased Expressed Genes Reveals Discrete Biological and Physiological Features of Male and Female Schistosoma japonicum.

Directory of Open Access Journals (Sweden)

Pengfei Cai

2016-04-01

Full Text Available Schistosomiasis is a chronic and debilitating disease caused by blood flukes (digenetic trematodes) of the genus Schistosoma. Schistosomes are sexually dimorphic and exhibit dramatic morphological changes during a complex lifecycle which requires subtle gene regulatory mechanisms to fulfil these complex biological processes. In the current study, a 41,982-feature custom DNA microarray, which represents the most comprehensive probe coverage for any schistosome transcriptome study, was designed based on public domain and local databases to explore differential gene expression in S. japonicum. We found that approximately 1/10 of the total annotated genes in the S. japonicum genome are differentially expressed between adult males and females. In general, genes associated with the cytoskeleton, and motor and neuronal activities were readily expressed in male adult worms, whereas genes involved in amino acid metabolism, nucleotide biosynthesis, gluconeogenesis, glycosylation, cell cycle processes, DNA synthesis and genome fidelity and stability were enriched in females. Further, miRNA target sites within these gene sets were predicted, which provides a scenario whereby the miRNAs potentially regulate these sex-biased expressed genes. The study significantly expands the expressional and regulatory characteristics of gender-biased expressed genes in schistosomes with high accuracy. The data provide a better appreciation of the biological and physiological features of male and female schistosome parasites, which may lead to novel vaccine targets and the development of new therapeutic interventions.
Comprehensive Transcriptome Analysis of Sex-Biased Expressed Genes Reveals Discrete Biological and Physiological Features of Male and Female Schistosoma japonicum.

Science.gov (United States)

Cai, Pengfei; Liu, Shuai; Piao, Xianyu; Hou, Nan; Gobert, Geoffrey N; McManus, Donald P; Chen, Qijun

2016-04-01

Schistosomiasis is a chronic and debilitating disease caused by blood flukes (digenetic trematodes) of the genus Schistosoma. Schistosomes are sexually dimorphic and exhibit dramatic morphological changes during a complex lifecycle which requires subtle gene regulatory mechanisms to fulfil these complex biological processes. In the current study, a 41,982 features custom DNA microarray, which represents the most comprehensive probe coverage for any schistosome transcriptome study, was designed based on public domain and local databases to explore differential gene expression in S. japonicum. We found that approximately 1/10 of the total annotated genes in the S. japonicum genome are differentially expressed between adult males and females. In general, genes associated with the cytoskeleton, and motor and neuronal activities were readily expressed in male adult worms, whereas genes involved in amino acid metabolism, nucleotide biosynthesis, gluconeogenesis, glycosylation, cell cycle processes, DNA synthesis and genome fidelity and stability were enriched in females. Further, miRNAs target sites within these gene sets were predicted, which provides a scenario whereby the miRNAs potentially regulate these sex-biased expressed genes. The study significantly expands the expressional and regulatory characteristics of gender-biased expressed genes in schistosomes with high accuracy. The data provide a better appreciation of the biological and physiological features of male and female schistosome parasites, which may lead to novel vaccine targets and the development of new therapeutic interventions.

Global transcriptomic analysis suggests carbon dioxide as an environmental stressor in spaceflight: A systems biology GeneLab case study.

Science.gov (United States)

Beheshti, Afshin; Cekanaviciute, Egle; Smith, David J; Costes, Sylvain V

2018-03-08

Spaceflight introduces a combination of environmental stressors, including microgravity, ionizing radiation, changes in diet and altered atmospheric gas composition. In order to understand the impact of each environmental component on astronauts it is important to investigate potential influences in isolation. Rodent spaceflight experiments involve both standard vivarium cages and animal enclosure modules (AEMs), which are cages used to house rodents in spaceflight. Ground control AEMs are engineered to match the spaceflight environment. There are limited studies examining the biological response invariably due to the configuration of AEM and vivarium housing. To investigate the innate global transcriptomic patterns of rodents housed in spaceflight-matched AEM compared to standard vivarium cages we utilized publicly available data from the NASA GeneLab repository. Using a systems biology approach, we observed that AEM housing was associated with significant transcriptomic differences, including reduced metabolism, altered immune responses, and activation of possible tumorigenic pathways. Although we did not perform any functional studies, our findings revealed a mild hypoxic phenotype in AEM, possibly due to atmospheric carbon dioxide that was increased to match conditions in spaceflight. Our investigation illustrates the process of generating new hypotheses and informing future experimental research by repurposing multiple space-flown datasets.
Gene expression analysis of zebrafish melanocytes, iridophores, and retinal pigmented epithelium reveals indicators of biological function and developmental origin.

Directory of Open Access Journals (Sweden)

Charles W Higdon

Full Text Available In order to facilitate understanding of pigment cell biology, we developed a method to concomitantly purify melanocytes, iridophores, and retinal pigmented epithelium from zebrafish, and analyzed their transcriptomes. Comparing expression data from these cell types and whole embryos allowed us to reveal gene expression co-enrichment in melanocytes and retinal pigmented epithelium, as well as in melanocytes and iridophores. We found 214 genes co-enriched in melanocytes and retinal pigmented epithelium, indicating the shared functions of melanin-producing cells. We found 62 genes significantly co-enriched in melanocytes and iridophores, illustrative of their shared developmental origins from the neural crest. This is also the first analysis of the iridophore transcriptome. Gene expression analysis for iridophores revealed extensive enrichment of specific enzymes to coordinate production of their guanine-based reflective pigment. We speculate the coordinated upregulation of specific enzymes from several metabolic pathways recycles the rate-limiting substrate for purine synthesis, phosphoribosyl pyrophosphate, thus constituting a guanine cycle. The purification procedure and expression analysis described here, along with the accompanying transcriptome-wide expression data, provide the first mRNA sequencing data for multiple purified zebrafish pigment cell types, and will be a useful resource for further studies of pigment cell biology.
Network Graph Analysis of Gene-Gene Interactions in Genome-Wide Association Study Data

Directory of Open Access Journals (Sweden)

Sungyoung Lee

2012-12-01

Full Text Available Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
Network graph analysis of gene-gene interactions in genome-wide association study data.

Science.gov (United States)

Lee, Sungyoung; Kwon, Min-Seok; Park, Taesung

2012-12-01

Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.
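The second step described above (drawing a network graph from the MDR result) can be sketched roughly as follows. This is only an illustration and not the authors' implementation: the SNP identifiers, the scores, the threshold, and the function names `build_interaction_graph` and `hubs` are all made up, and the input is assumed to be a list of (SNP, SNP, score) triples that an MDR run has already produced.

```python
from collections import defaultdict


def build_interaction_graph(pairs, min_score):
    """Build an undirected adjacency map from scored SNP pairs.

    pairs: iterable of (snp_a, snp_b, score) tuples (hypothetical MDR output).
    min_score: keep only pairs whose score is at least this value.
    """
    graph = defaultdict(set)
    for snp_a, snp_b, score in pairs:
        if score >= min_score:
            graph[snp_a].add(snp_b)
            graph[snp_b].add(snp_a)
    return dict(graph)


def hubs(graph, min_degree):
    """Return SNPs with at least min_degree partners, by degree then name."""
    ranked = sorted(graph.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return [snp for snp, partners in ranked if len(partners) >= min_degree]


# Made-up example values, for illustration only.
example_pairs = [
    ("rs1", "rs2", 0.61),
    ("rs1", "rs3", 0.58),
    ("rs2", "rs3", 0.52),
    ("rs1", "rs4", 0.57),
]
graph = build_interaction_graph(example_pairs, min_score=0.55)
hub_snps = hubs(graph, min_degree=2)
```

Highly connected nodes in such a graph are natural candidates for the third step, the lookup of supporting evidence in external biological databases.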
Mechanisms of multiple production processes

International Nuclear Information System (INIS)

Dremin, I.M.

1977-01-01

Theoretical approaches to multiple production processes are discussed. A large number of models proceed from the notion of a common excited system produced by colliding hadrons. This class of models includes the hydrodynamical, statistical, thermodynamical and statistical bootstrap models. Sometimes the production process is due to excitation and decay of two colliding particles. The fragmentation bremsstrahlung and inelastic diffraction models belong to this group. The largest group of models describes the multiple production process as a result of the formation of many excited centers. The typical example is the multiperipheral model. An interesting direction is given by the attempts to interrelate the mechanism of multiple production with the internal structure of particles, that is, with their constituents (C-group): quarks, gluons, etc. Besides the models there are phenomenological (p group) attempts to connect different features of multiple production. Experimental data indicate the existence of leading and pionization particles, thus giving evidence for applications of different models. The data on the increase of total and inclusive cross sections, the behaviour of the mean multiplicity and correlations at high energies provide a clue for further development of multiple production theory.
MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers.

Science.gov (United States)

Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier; Lecompte, Odile

2017-06-16

The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user's specific interests and provides an efficient way to share information with collaborators. Furthermore, the user's behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi.fr/mygenefriends. ©Alexis Allot, Kirsley Chennen, Yannis
Hybrid Thermochemical/Biological Processing

Science.gov (United States)

Brown, Robert C.

The conventional view of biorefineries is that lignocellulosic plant material will be fractionated into cellulose, hemicellulose, lignin, and terpenes before these components are biochemically converted into market products. Occasionally, these plants include a thermochemical step at the end of the process to convert recalcitrant plant components or mixed waste streams into heat to meet thermal energy demands elsewhere in the facility. However, another possibility for converting high-fiber plant materials is to start by thermochemically processing it into a uniform intermediate product that can be biologically converted into a bio-based product. This alternative route to bio-based products is known as hybrid thermochemical/biological processing. There are two distinct approaches to hybrid processing: (a) gasification followed by fermentation of the resulting gaseous mixture of carbon monoxide (CO), hydrogen (H2), and carbon dioxide (CO2) and (b) fast pyrolysis followed by hydrolysis and/or fermentation of the anhydrosugars found in the resulting bio-oil. This article explores this "cart before the horse" approach to biorefineries.
GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores

Directory of Open Access Journals (Sweden)

Wang Kai

2011-05-01

Full Text Available Abstract Background Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs) have multiple cores, whereas Graphics Processing Units (GPUs) also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Findings Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: (1) the interaction of SNPs within it in parallel, and (2) the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. Conclusions GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/.
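The partitioning scheme in this abstract (non-overlapping SNP fragments, with within-fragment and between-fragment pairs handled as separate units of work) can be illustrated with a short sketch. It is not GENIE's code: the function names `partition` and `pair_tasks`, the fragment size, and the use of consecutive index ranges are assumptions, and how GENIE actually schedules work on GPU or CPU cores is not described in the abstract.

```python
from itertools import combinations, product


def partition(n_snps, fragment_size):
    """Split SNP indices 0..n_snps-1 into consecutive non-overlapping fragments."""
    return [
        list(range(start, min(start + fragment_size, n_snps)))
        for start in range(0, n_snps, fragment_size)
    ]


def pair_tasks(fragments):
    """Yield independent tasks; each task is a list of SNP index pairs.

    Because tasks share no pairs, they could be handed to separate workers.
    """
    for i, frag in enumerate(fragments):
        # (1) pairs of SNPs inside the same fragment
        yield list(combinations(frag, 2))
        # (2) pairs between this fragment and each later fragment
        for other in fragments[i + 1:]:
            yield list(product(frag, other))


fragments = partition(n_snps=7, fragment_size=3)
tasks = list(pair_tasks(fragments))
all_pairs = [pair for task in tasks for pair in task]
```

Pairing each fragment only with later fragments means every SNP pair is tested exactly once, which is the property a parallel interaction scan needs.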
Gene expression profiling reveals multiple toxicity endpoints induced by hepatotoxicants

Energy Technology Data Exchange (ETDEWEB)

Huang Qihong; Jin Xidong; Gaillard, Elias T.; Knight, Brian L.; Pack, Franklin D.; Stoltz, James H.; Jayadev, Supriya; Blanchard, Kerry T

2004-05-18

Microarray technology continues to gain increased acceptance in the drug development process, particularly at the stage of toxicology and safety assessment. In the current study, microarrays were used to investigate gene expression changes associated with hepatotoxicity, the most commonly reported clinical liability with pharmaceutical agents. Acetaminophen, methotrexate, methapyrilene, furan and phenytoin were used as benchmark compounds capable of inducing specific but different types of hepatotoxicity. The goal of the work was to define gene expression profiles capable of distinguishing the different subtypes of hepatotoxicity. Sprague-Dawley rats were orally dosed with acetaminophen (single dose, 4500 mg/kg for 6, 24 and 72 h), methotrexate (1 mg/kg per day for 1, 7 and 14 days), methapyrilene (100 mg/kg per day for 3 and 7 days), furan (40 mg/kg per day for 1, 3, 7 and 14 days) or phenytoin (300 mg/kg per day for 14 days). Hepatic gene expression was assessed using toxicology-specific gene arrays containing 684 target genes or expressed sequence tags (ESTs). Principal component analysis (PCA) of gene expression data was able to provide a clear distinction of each compound, suggesting that gene expression data can be used to discern different hepatotoxic agents and toxicity endpoints. Gene expression data were applied to the multiplicity-adjusted permutation test and significantly changed genes were categorized and correlated to hepatotoxic endpoints. Repression of enzymes involved in lipid oxidation (acyl-CoA dehydrogenase, medium chain, enoyl CoA hydratase, very long-chain acyl-CoA synthetase) was associated with microvesicular lipidosis. Likewise, subsets of genes associated with hepatocellular necrosis, inflammation, hepatitis, bile duct hyperplasia and fibrosis have been identified. The current study illustrates that expression profiling can be used to: (1) distinguish different hepatotoxic endpoints; (2) predict the development of toxic endpoints; and
Online Analytical Processing (OLAP): A Fast and Effective Data Mining Tool for Gene Expression Databases

Directory of Open Access Journals (Sweden)

Alkharouf Nadim W.

2005-01-01

Full Text Available Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP) can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQLServer2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments is stored in the soybean genomics and microarray database (SGMD). A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB.
Online analytical processing (OLAP): a fast and effective data mining tool for gene expression databases.

Science.gov (United States)

Alkharouf, Nadim W; Jamison, D Curtis; Matthews, Benjamin F

2005-06-30

Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP) can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQLServer2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments is stored in the soybean genomics and microarray database (SGMD). A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB.
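The core operation of an OLAP cube, aggregating a measure over a chosen subset of dimensions, can be sketched in a few lines. This is a toy illustration and not the Analysis Services cube used in the study: the dimension names (`gene`, `hours`, `line`), the values, and the function `roll_up` are hypothetical.

```python
from collections import defaultdict


def roll_up(facts, dimensions):
    """Average the 'value' measure over all facts sharing the chosen dimensions."""
    totals = defaultdict(lambda: [0.0, 0])  # key -> [running sum, count]
    for fact in facts:
        key = tuple(fact[d] for d in dimensions)
        totals[key][0] += fact["value"]
        totals[key][1] += 1
    return {key: total / count for key, (total, count) in totals.items()}


# Made-up expression measurements; each row is one cell of the cube.
facts = [
    {"gene": "geneA", "hours": 6, "line": "resistant", "value": 2.0},
    {"gene": "geneA", "hours": 6, "line": "susceptible", "value": 1.0},
    {"gene": "geneA", "hours": 12, "line": "resistant", "value": 4.0},
    {"gene": "geneB", "hours": 6, "line": "resistant", "value": 0.5},
]
by_gene = roll_up(facts, ["gene"])
by_gene_and_line = roll_up(facts, ["gene", "line"])
```

Changing the `dimensions` argument corresponds to rolling up or drilling down in the cube, which is what lets an analyst move quickly between coarse and fine views of a time series experiment.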
Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease.

Science.gov (United States)

Modena, Brian D; Bleecker, Eugene R; Busse, William W; Erzurum, Serpil C; Gaston, Benjamin M; Jarjour, Nizar N; Meyers, Deborah A; Milosevic, Jadranka; Tedrow, John R; Wu, Wei; Kaminski, Naftali; Wenzel, Sally E

2017-06-01

Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. The objective was to identify networks of genes reflective of underlying biological processes that define SA. Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12-21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes.
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

Directory of Open Access Journals (Sweden)

Ning Ye

2015-01-01

Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies presents challenges for exploiting its biological meaning. When searching for coexpressed genes, the data mining process is largely affected by the selection of algorithms. Thus, it is highly desirable to provide multiple algorithm options in a user-friendly analytical toolkit to explore gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify coexpressed genes, and enables users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing gene expression signatures.
Discovery of cancer common and specific driver gene sets

Science.gov (United States)

2017-01-01

Abstract Cancer is known as a disease mainly caused by gene alterations. Discovery of mutated driver pathways or gene sets is becoming an important step to understand molecular mechanisms of carcinogenesis. However, systematically investigating commonalities and specificities of driver gene sets among multiple cancer types is still a great challenge, but this investigation will undoubtedly benefit deciphering cancers and will be helpful for personalized therapy and precision medicine in cancer treatment. In this study, we propose two optimization models to de novo discover common driver gene sets among multiple cancer types (ComMDP) and specific driver gene sets of one certain or multiple cancer types to other cancers (SpeMDP), respectively. We first apply ComMDP and SpeMDP to simulated data to validate their efficiency. Then, we further apply these methods to 12 cancer types from The Cancer Genome Atlas (TCGA) and obtain several biologically meaningful driver pathways. As examples, we construct a common cancer pathway model for BRCA and OV, infer a complex driver pathway model for BRCA carcinogenesis based on common driver gene sets of BRCA with eight cancer types, and investigate specific driver pathways of the liquid cancer lymphoblastic acute myeloid leukemia (LAML) versus other solid cancer types. In these processes more candidate cancer genes are also found. PMID:28168295
Understanding the biological underpinnings of ecohydrological processes

Science.gov (United States)

Huxman, T. E.; Scott, R. L.; Barron-Gafford, G. A.; Hamerlynck, E. P.; Jenerette, D.; Tissue, D. T.; Breshears, D. D.; Saleska, S. R.

2012-12-01

Climate change presents a challenge for predicting ecosystem response, as multiple factors drive both the physical and life processes happening on the land surface and their interactions result in a complex, evolving coupled system. For example, changes in surface temperature and precipitation influence near-surface hydrology through impacts on system energy balance, affecting a range of physical processes. These changes in the salient features of the environment affect biological processes and elicit responses along the hierarchy of life (biochemistry to community composition). Many of these structural or process changes can alter patterns of soil water-use and influence land surface characteristics that affect local climate. Of the many features that affect our ability to predict the future dynamics of ecosystems, it is this hierarchical response of life that creates substantial complexity. Advances in the ability to predict or understand aspects of demography help describe thresholds in coupled ecohydrological systems. Disentangling the physical and biological features that underlie land surface dynamics following disturbance is allowing a better understanding of the partitioning of water in the time-course of recovery. Better prediction of the timing of phenology and key seasonal events allows for a more accurate description of the full functional response of the land surface to climate. In addition, explicitly considering the hierarchical structural features of life is helping to describe complex time-dependent behavior in ecosystems. However, despite this progress, we have yet to build an ability to fully account for the generalization of the main features of living systems into models that can describe ecohydrological processes, especially acclimation, assembly and adaptation. This is unfortunate, given that many key ecosystem services are functions of these coupled co-evolutionary processes. To date, both the lack of controlled measurements and experimentation
Informative gene selection using Adaptive Analytic Hierarchy Process (A2HP)

Directory of Open Access Journals (Sweden)

Abhishek Bhola

2017-12-01

Full Text Available Gene expression datasets derived from microarray experiments are marked by a large number of genes, with gene expression values recorded at different sample conditions or time points. Selection of informative genes from these large datasets is an issue of major concern for researchers and biologists. In this study, we propose a gene selection and dimensionality reduction method called Adaptive Analytic Hierarchy Process (A2HP). The traditional analytic hierarchy process is a multiple-criteria decision analysis method whose result depends upon expert knowledge or decision makers. It is mainly used to solve decision problems in different fields. A2HP, on the other hand, is a fused method that combines the outcomes of five individual gene selection ranking methods: t-test, chi-square variance test, z-test, Wilcoxon test and signal-to-noise ratio (SNR). First, the gene expression dataset is preprocessed, and the reduced set of genes obtained is fed as input to A2HP. A2HP utilizes both quantitative and qualitative factors to select the informative genes. Results demonstrate that A2HP selects an efficient number of genes compared to the individual gene selection methods. The percentage reduction in the number of genes and the time complexity are taken as the performance measures for the proposed method, and it is shown that A2HP outperforms the individual gene selection methods.
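The idea of fusing several gene ranking methods into one selection can be illustrated with plain mean-rank aggregation. This is a deliberately simpler stand-in and not A2HP itself, whose hierarchy and weighting scheme are not given in the abstract. The gene names, the scores, and the functions `ranks`, `mean_rank` and `select_top` are hypothetical, and only three of the five ranking methods are mocked up.

```python
def ranks(scores):
    """Map each gene to its rank (1 = highest score); ties broken by gene name."""
    ordered = sorted(scores, key=lambda g: (-scores[g], g))
    return {gene: position for position, gene in enumerate(ordered, start=1)}


def mean_rank(score_tables):
    """Average each gene's rank across several scoring methods."""
    per_method = [ranks(table) for table in score_tables.values()]
    genes = per_method[0].keys()
    return {g: sum(r[g] for r in per_method) / len(per_method) for g in genes}


def select_top(score_tables, k):
    """Return the k genes with the best (lowest) average rank."""
    avg = mean_rank(score_tables)
    return sorted(avg, key=lambda g: (avg[g], g))[:k]


# Made-up scores; a higher score means the method finds the gene more informative.
score_tables = {
    "t_test": {"geneA": 4.2, "geneB": 1.1, "geneC": 3.0},
    "wilcoxon": {"geneA": 2.5, "geneB": 0.4, "geneC": 2.9},
    "snr": {"geneA": 1.8, "geneB": 0.2, "geneC": 0.9},
}
top = select_top(score_tables, k=2)
```

Working on ranks rather than raw scores keeps methods with different scales (test statistics, ratios) comparable before they are combined.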
Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks.

Science.gov (United States)

Yu, Bowen; Doraiswamy, Harish; Chen, Xi; Miraldi, Emily; Arrieta-Ortiz, Mario Luis; Hafemeister, Christoph; Madar, Aviv; Bonneau, Richard; Silva, Cláudio T

2014-12-01

Elucidation of transcriptional regulatory networks (TRNs) is a fundamental goal in biology, and one of the most important components of TRNs are transcription factors (TFs), proteins that specifically bind to gene promoter and enhancer regions to alter target gene expression patterns. Advances in genomic technologies as well as advances in computational biology have led to multiple large regulatory network models (directed networks) each with a large corpus of supporting data and gene-annotation. There are multiple possible biological motivations for exploring large regulatory network models, including: validating TF-target gene relationships, figuring out co-regulation patterns, and exploring the coordination of cell processes in response to changes in cell state or environment. Here we focus on queries aimed at validating regulatory network models, and on coordinating visualization of primary data and directed weighted gene regulatory networks. The large size of both the network models and the primary data can make such coordinated queries cumbersome with existing tools and, in particular, inhibits the sharing of results between collaborators. In this work, we develop and demonstrate a web-based framework for coordinating visualization and exploration of expression data (RNA-seq, microarray), network models and gene-binding data (ChIP-seq). Using specialized data structures and multiple coordinated views, we design an efficient querying model to support interactive analysis of the data. Finally, we show the effectiveness of our framework through case studies for the mouse immune system (a dataset focused on a subset of key cellular functions) and a model bacteria (a small genome with high data-completeness).
Gene Ontology Consortium: going forward.

Science.gov (United States)

2015-01-01

The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Fluctuating Thermodynamics for Biological Processes

Science.gov (United States)

Ham, Sihyun

Because biomolecular processes are largely under thermodynamic control, dynamic extension of thermodynamics is necessary to uncover the mechanisms and driving factors of fluctuating processes. The fluctuating thermodynamics technology presented in this talk offers a practical means for the thermodynamic characterization of conformational dynamics in biomolecules. The use of fluctuating thermodynamics has the potential to provide a comprehensive picture of fluctuating phenomena in diverse biological processes. Through the application of fluctuating thermodynamics, we provide a thermodynamic perspective on the misfolding and aggregation of the various proteins associated with human diseases. In this talk, I will present the detailed concepts and applications of the fluctuating thermodynamics technology for elucidating biological processes. This work was supported by Samsung Science and Technology Foundation under Project Number SSTF-BA1401-13.
Extracting biologically significant patterns from short time series gene expression data

Directory of Open Access Journals (Sweden)

McGinnis Thomas

2009-08-01

Full Text Available Abstract Background: Time series gene expression data analysis is used widely to study the dynamics of various cell processes. Most of the time series data available today consist of only a few time points, which makes the application of standard clustering techniques difficult. Results: We developed two new algorithms that are capable of extracting biological patterns from short time point series gene expression data. The two algorithms, ASTRO and MiMeSR, are inspired by the rank order preserving framework and the minimum mean squared residue approach, respectively. However, ASTRO and MiMeSR differ from previous approaches in that they take advantage of the relatively small number of time points in order to reduce the problem from NP-hard to linear. Tested on well-defined short time expression data, we found that our approaches are robust to noise, as well as to random patterns, and that they can correctly detect the temporal expression profile of relevant functional categories. Evaluation of our methods was performed using Gene Ontology (GO) annotations and chromatin immunoprecipitation (ChIP-chip) data. Conclusion: Our approaches generally outperform both standard clustering algorithms and algorithms designed specifically for clustering of short time series gene expression data. Both algorithms are available at http://www.benoslab.pitt.edu/astro/.
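
The rank order idea mentioned in this abstract can be illustrated with a minimal sketch. This is not the ASTRO algorithm itself, whose details the abstract does not give; it only assumes that genes can be grouped by the order in which their time points rank from lowest to highest expression, which takes a single pass over the genes when the number of time points is small. The function name `group_by_rank_order` and the example gene names are hypothetical.

```python
from collections import defaultdict


def group_by_rank_order(profiles):
    """Group genes whose time points rank in the same order.

    profiles: dict mapping gene name -> sequence of expression values,
              one value per time point (same length for every gene).
    Returns a dict mapping a rank-order tuple -> list of gene names.
    """
    groups = defaultdict(list)
    for gene, values in profiles.items():
        # Indices of the time points, sorted from lowest to highest expression.
        # Ties are broken by time index because sorted() is stable.
        order = tuple(sorted(range(len(values)), key=lambda i: values[i]))
        groups[order].append(gene)
    return dict(groups)


# Hypothetical three-point profiles: g1 and g2 share a shape, g3 does not.
example = {
    "g1": [1.0, 3.0, 2.0],
    "g2": [10.0, 30.0, 20.0],
    "g3": [3.0, 2.0, 1.0],
}
groups = group_by_rank_order(example)
```

Here `g1` and `g2` land in the same group even though their absolute expression levels differ by a factor of ten, which is the property that makes rank-based grouping attractive for short series.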

The potential of TaqMan Array Cards for detection of multiple biological agents by real-time PCR.

Directory of Open Access Journals (Sweden)

Phillip A Rachwal

Full Text Available The TaqMan Array Card architecture, normally used for gene expression studies, was evaluated for its potential to detect multiple bacterial agents by real-time PCR. Ten PCR assays targeting five biological agents (Bacillus anthracis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, and Yersinia pestis) were incorporated onto Array Cards. A comparison of PCR performance of each PCR in Array Card and singleplex format was conducted using DNA extracted from pure bacterial cultures. When 100 fg of agent DNA was added to Array Card channels, the following levels of agent detection (where at least one agent PCR replicate returned a positive result) were observed: Y. pestis 100%; B. mallei & F. tularensis 93%; B. anthracis 71%; B. pseudomallei 43%. For B. mallei & pseudomallei detection, the BPM2 PCR, which detects both species, outperformed PCR assays specific to each organism, indicating that identification of the respective species would not be reproducible at the 100 fg level. Near 100% levels of detection were observed when 100 fg of DNA was added to each PCR in singleplex format, with singleplex PCRs also returning sporadic positives at the 10 fg per PCR level. Before evaluating the use of Array Cards for the testing of environmental and clinical sample types, with potential levels of background DNA and PCR inhibitors, users would therefore have to accept a 10-fold reduction in sensitivity of PCR assays on the Array Card format, in order to benefit from the capacity to test multiple samples for multiple agents. A two PCR per agent strategy would allow the testing of 7 samples for the presence of 11 biological agents or 3 samples for 23 biological agents per card (with negative control channels).
GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

Directory of Open Access Journals (Sweden)

Chris Cheadle

2007-01-01

Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently. Results: We have developed gene set matrix analysis (GSMA) as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets) against entire distributions of gene expression changes (datasets) for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously. Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.
Biclustering methods: biological relevance and application in gene expression analysis.

Directory of Open Access Journals (Sweden)

Ali Oghabian

Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering) methods where genes (or respectively samples) are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes). An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical) methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1) we examine how well the considered (bi)clustering methods differentiate various sample types; (2) we evaluate how well the groups of genes discovered by the (bi)clustering methods are annotated with similar Gene Ontology categories; (3) we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study; and (4) we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.
Effect of Process-Oriented Guided-Inquiry Learning on Non-majors Biology Students' Understanding of Biological Classification

Science.gov (United States)

Wozniak, Breann M.

The purpose of this study was to examine the effect of process-oriented guided-inquiry learning (POGIL) on non-majors college biology students' understanding of biological classification. This study addressed an area of science instruction, POGIL in the non-majors college biology laboratory, which has yet to be qualitatively and quantitatively researched. A concurrent triangulation mixed methods approach was used. Students' understanding of biological classification was measured in two areas: scores on pre and posttests (consisting of 11 multiple choice questions), and conceptions of classification as elicited in pre and post interviews and instructor reflections. Participants were Minnesota State University, Mankato students enrolled in BIOL 100 Summer Session. One section was taught with the traditional curriculum (n = 6) and the other section in the POGIL curriculum (n = 10) developed by the researcher. Three students from each section were selected to take part in pre and post interviews. There were no significant differences within each teaching method (p familiar animal categories and aquatic habitats, unfamiliar organisms, combining and subdividing initial groupings, and the hierarchical nature of classification. The POGIL students were the only group to surpass these challenges after the teaching intervention. This study shows that POGIL is an effective technique at eliciting students' misconceptions, and addressing these misconceptions, leading to an increase in student understanding of biological classification.
A data science approach to candidate gene selection of pain regarded as a process of learning and neural plasticity.

Science.gov (United States)

Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn

2016-12-01

The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.
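
The notion of a Gene Ontology term being "overrepresented as compared with the annotations expected by chance" can be made concrete with a small sketch. The abstract does not state which test the authors used; the hypergeometric upper tail shown here is a common choice for over-representation analysis and is an assumption, as are the function name `hypergeom_upper_tail` and the toy numbers in the usage line.

```python
from math import comb


def hypergeom_upper_tail(population, annotated, sample, hits):
    """Probability of seeing at least `hits` annotated genes by chance.

    population: total number of genes considered
    annotated:  genes in the population that carry the GO term
    sample:     size of the gene set under study (drawn without replacement)
    hits:       genes in the sample that carry the GO term
    """
    upper = min(annotated, sample)
    favourable = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return favourable / comb(population, sample)


# Toy numbers only: 10 genes, 4 annotated, a set of 3 with 2 annotated.
p_value = hypergeom_upper_tail(population=10, annotated=4, sample=3, hits=2)
```

With these toy inputs the tail probability is (36 + 4) / 120 = 1/3, so two hits out of three would not count as enrichment. Real analyses also correct for testing many GO terms at once, which this sketch leaves out.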
Revealing Pathway Dynamics in Heart Diseases by Analyzing Multiple Differential Networks.

Directory of Open Access Journals (Sweden)

Xiaoke Ma

2015-06-01

Full Text Available Development of heart diseases is driven by dynamic changes in both the activity and connectivity of gene pathways. Understanding these dynamic events is critical for understanding pathogenic mechanisms and development of effective treatment. Currently, there is a lack of computational methods that enable analysis of multiple gene networks, each of which exhibits differential activity compared to the network of the baseline/healthy condition. We describe the iMDM algorithm to identify both unique and shared gene modules across multiple differential co-expression networks, termed M-DMs (multiple differential modules. We applied iMDM to a time-course RNA-Seq dataset generated using a murine heart failure model generated on two genotypes. We showed that iMDM achieves higher accuracy in inferring gene modules compared to using single or multiple co-expression networks. We found that condition-specific M-DMs exhibit differential activities, mediate different biological processes, and are enriched for genes with known cardiovascular phenotypes. By analyzing M-DMs that are present in multiple conditions, we revealed dynamic changes in pathway activity and connectivity across heart failure conditions. We further showed that module dynamics were correlated with the dynamics of disease phenotypes during the development of heart failure. Thus, pathway dynamics is a powerful measure for understanding pathogenesis. iMDM provides a principled way to dissect the dynamics of gene pathways and its relationship to the dynamics of disease phenotype. With the exponential growth of omics data, our method can aid in generating systems-level insights into disease progression.
Selection of internal control genes for quantitative real-time RT-PCR studies during tomato development process

Directory of Open Access Journals (Sweden)

Borges-Pérez Andrés

2008-12-01

Full Text Available Abstract Background: The elucidation of gene expression patterns leads to a better understanding of biological processes. Real-time quantitative RT-PCR has become the standard method for in-depth studies of gene expression. A biologically meaningful reporting of target mRNA quantities requires accurate and reliable normalization in order to identify real gene-specific variation. The purpose of normalization is to control several variables such as different amounts and quality of starting material, variable enzymatic efficiencies of retrotranscription from RNA to cDNA, or differences between tissues or cells in overall transcriptional activity. The validity of a housekeeping gene as endogenous control relies on the stability of its expression level across the sample panel being analysed. In the present report we describe the first systematic evaluation of potential internal controls during the tomato development process to identify which are the most reliable for transcript quantification by real-time RT-PCR. Results: In this study, we assess the expression stability of 7 traditional and 4 novel housekeeping genes in a set of 27 samples representing different tissues and organs of tomato plants at different developmental stages. First, we designed, tested and optimized amplification primers for real-time RT-PCR. Then, expression data from each candidate gene were evaluated with three complementary approaches based on different statistical procedures. Our analysis suggests that SGN-U314153 (CAC), SGN-U321250 (TIP41), SGN-U346908 ("Expressed") and SGN-U316474 (SAND) genes provide superior transcript normalization in tomato development studies. We recommend different combinations of these exceptionally stable housekeeping genes for suitable normalization of different developmental series, including the complete tomato development process. Conclusion: This work constitutes the first effort for the selection of optimal endogenous controls for quantitative real
Moving beyond a descriptive aquatic toxicology: the value of biological process and trait information.

Science.gov (United States)

Segner, Helmut

2011-10-01

In order to improve the ability to link chemical exposure to toxicological and ecological effects, aquatic toxicology will have to move from observing what chemical concentrations induce adverse effects to more explanatory approaches, that is, concepts which build on knowledge of biological processes and pathways leading from exposure to adverse effects, as well as on knowledge of stressor vulnerability as given by the genetic, physiological and ecological (e.g., life history) traits of biota. Developing aquatic toxicology in this direction faces a number of challenges, including (i) taking into account species differences in toxicant responses on the basis of the evolutionarily developed diversity of phenotypic vulnerability to environmental stressors; (ii) utilizing diversified biological response profiles to serve as biological read-across for prioritizing chemicals, categorizing them according to modes of action, and guiding targeted toxicity evaluation; (iii) predicting ecological consequences of toxic exposure from knowledge of how biological processes and phenotypic traits lead to effect propagation across the levels of biological hierarchy; and (iv) searching for concepts to assess the cumulative impact of multiple stressors. An underlying theme in these challenges is that, in addition to the question of what the chemical does to the biological receptor, we should give increasing emphasis to the question of how the biological receptor handles the chemicals, i.e., through which pathways the initial chemical-biological interaction extends to the adverse effects, and how this extension is modulated by adaptive or compensatory processes as well as by phenotypic traits of the biological receptor. 2011 Elsevier B.V. All rights reserved.
Multiple and variable NHEJ-like genes are involved in resistance to DNA damage in Streptomyces ambofaciens

Directory of Open Access Journals (Sweden)

Grégory Hoff

2016-11-01

Full Text Available Non-homologous end-joining (NHEJ) is a double strand break (DSB) repair pathway which does not require any homologous template and can ligate two DNA ends together. The basic bacterial NHEJ machinery involves two partners: the Ku protein, a DNA end binding protein for DSB recognition, and the multifunctional LigD protein, composed of a ligase, a nuclease and a polymerase domain, for end processing and ligation of the broken ends. In silico analyses performed in the 38 sequenced genomes of Streptomyces species revealed the existence of a large panel of NHEJ-like genes. Indeed, ku genes or ligD domain homologues are scattered throughout the genome in multiple copies and can be distinguished in two categories: the core NHEJ gene set, constituted of conserved loci, and the variable NHEJ gene set, constituted of NHEJ-like genes present in only a part of the species. In Streptomyces ambofaciens ATCC 23877, not only the deletion of core genes but also that of variable genes led to an increased sensitivity to DNA damage induced by electron beam irradiation. Multiple mutants of ku, ligase or polymerase encoding genes showed an aggravated phenotype compared to single mutants. Biochemical assays revealed the ability of Ku-like proteins to protect and to stimulate ligation of DNA ends. RT-qPCR and GFP fusion experiments suggested that ku-like genes show a growth phase dependent expression profile consistent with their involvement in DNA repair during spore formation and/or germination.
Circumventing furin enhances factor VIII biological activity and ameliorates bleeding phenotypes in hemophilia models

OpenAIRE

Siner, Joshua I.; Samelson-Jones, Benjamin J.; Crudele, Julie M.; French, Robert A.; Lee, Benjamin J.; Zhou, Shanzhen; Merricks, Elizabeth; Raymer, Robin; Nichols, Timothy C.; Camire, Rodney M.; Arruda, Valder R.

2016-01-01

Processing by the proprotein convertase furin is believed to be critical for the biological activity of multiple proteins involved in hemostasis, including coagulation factor VIII (FVIII). This belief prompted the retention of the furin recognition motif (amino acids 1645–1648) in the design of B-domain–deleted FVIII (FVIII-BDD) products in current clinical use and in the drug development pipeline, as well as in experimental FVIII gene therapy strategies. Here, we report that processing by fu...
Cre/lox-based multiple markerless gene disruption in the genome of the extreme thermophile Thermus thermophilus.

Science.gov (United States)

Togawa, Yoichiro; Nunoshiba, Tatsuo; Hiratsu, Keiichiro

2018-02-01

Markerless gene-disruption technology is particularly useful for effective genetic analyses of Thermus thermophilus (T. thermophilus), which has a limited number of selectable markers. In an attempt to develop a novel system for the markerless disruption of genes in T. thermophilus, we applied a Cre/lox system to construct a triple gene disruptant. To achieve this, we constructed two genetic tools, a loxP-htk-loxP cassette and a cre-expressing plasmid, pSH-Cre, for gene disruption and removal of the selectable marker by Cre-mediated recombination. We found that the Cre/lox system was compatible with the proliferation of the T. thermophilus HB27 strain at the lowest growth temperature (50 °C), and thus succeeded in establishing a triple gene disruptant, the (∆TTC1454::loxP, ∆TTC1535KpnI::loxP, ∆TTC1576::loxP) strain, without leaving behind a selectable marker. During the process of the sequential disruption of multiple genes, we observed the undesired deletion and inversion of the chromosomal region between multiple loxP sites that were induced by Cre-mediated recombination. Therefore, we examined the effects of a lox66-htk-lox71 cassette by exploiting the mutant lox sites, lox66 and lox71, instead of native loxP sites. We successfully constructed a (∆TTC1535::lox72, ∆TTC1537::lox72) double gene disruptant without inducing the undesired deletion of the 0.7-kbp region between the two directly oriented lox72 sites created by the Cre-mediated recombination of the lox66-htk-lox71 cassette. This is the first demonstration of a Cre/lox system being applicable to extreme thermophiles in a genetic manipulation. Our results indicate that this system is a powerful tool for multiple markerless gene disruption in T. thermophilus.
Stochastic Simulation of Process Calculi for Biology

Directory of Open Access Journals (Sweden)

Andrew Phillips

2010-10-01

Full Text Available Biological systems typically involve large numbers of components with complex, highly parallel interactions and intrinsic stochasticity. To model this complexity, numerous programming languages based on process calculi have been developed, many of which are expressive enough to generate unbounded numbers of molecular species and reactions. As a result of this expressiveness, such calculi cannot rely on standard reaction-based simulation methods, which require fixed numbers of species and reactions. Rather than implementing custom stochastic simulation algorithms for each process calculus, we propose to use a generic abstract machine that can be instantiated to a range of process calculi and a range of reaction-based simulation algorithms. The abstract machine functions as a just-in-time compiler, which dynamically updates the set of possible reactions and chooses the next reaction in an iterative cycle. In this short paper we give a brief summary of the generic abstract machine, and show how it can be instantiated with the stochastic simulation algorithm known as Gillespie's Direct Method. We also discuss the wider implications of such an abstract machine, and outline how it can be used to simulate multiple calculi simultaneously within a common framework.
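
As an illustration of the simulation algorithm named in this abstract, here is a minimal sketch of Gillespie's Direct Method. It assumes a fixed list of reactions, so it is the standard reaction-based setting that the abstract contrasts with process calculi, not the authors' generic abstract machine, which updates the reaction set dynamically. The function name `gillespie_direct`, the data layout and the decay example are all hypothetical.

```python
import bisect
import itertools
import math
import random


def gillespie_direct(state, reactions, t_end, rng):
    """Simulate one trajectory with Gillespie's Direct Method.

    state:     dict mapping species name -> molecule count
    reactions: non-empty list of (propensity_fn, change) pairs, where
               propensity_fn(state) returns a non-negative rate and
               change maps species name -> change in count
    t_end:     simulation stops once the next event would pass this time
    rng:       a random.Random instance
    """
    t = 0.0
    state = dict(state)
    trajectory = [(t, dict(state))]
    while True:
        propensities = [fn(state) for fn, _ in reactions]
        cumulative = list(itertools.accumulate(propensities))
        total = cumulative[-1]
        if total <= 0.0:
            break  # no reaction can fire any more
        # Waiting time to the next event is exponential with rate `total`.
        t += -math.log(1.0 - rng.random()) / total
        if t > t_end:
            break
        # Pick a reaction with probability proportional to its propensity.
        index = bisect.bisect_right(cumulative, rng.random() * total)
        for species, delta in reactions[index][1].items():
            state[species] += delta
        trajectory.append((t, dict(state)))
    return trajectory


# Hypothetical example: first-order decay A -> (nothing) with rate 1.0 per molecule.
decay = [(lambda s: 1.0 * s["A"], {"A": -1})]
trajectory = gillespie_direct({"A": 50}, decay, t_end=1e9, rng=random.Random(0))
```

Each loop iteration recomputes every propensity, which is simple but costs time proportional to the number of reactions per event; that cost is one reason a machine that only updates the affected reactions is appealing when the reaction set is large or unbounded.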
Gene stacking of multiple traits for high yield of fermentable sugars in plant biomass

DEFF Research Database (Denmark)

Aznar, Aude; Chalvin, Camille; Shih, Patrick M.

2018-01-01

the ratio of C6 to C5 sugars in the cell wall and decreasing the lignin content are two important targets in engineering of plants that are more suitable for downstream processing for second-generation biofuel production. Results: We have studied the basic mechanisms of cell wall biosynthesis and identified...... genes involved in biosynthesis of pectic galactan, including the GALS1 galactan synthase and the UDP-galactose/UDP-rhamnose transporter URGT1. We have engineered plants with a more suitable biomass composition by applying these findings, in conjunction with synthetic biology and gene stacking tools...... to vessels where this polysaccharide is essential. Finally, the high galactan and low xylan traits were stacked with the low lignin trait obtained by expressing the QsuB gene encoding dehydroshikimate dehydratase in lignifying cells. Conclusion: The results show that approaches to increasing C6 sugar content...
Simple and Efficient Targeting of Multiple Genes Through CRISPR-Cas9 in Physcomitrella patens

Directory of Open Access Journals (Sweden)

Mauricio Lopez-Obando

2016-11-01

Full Text Available Powerful genome editing technologies are needed for efficient gene function analysis. The CRISPR-Cas9 system has been adapted as an efficient gene-knock-out technology in a variety of species. However, in a number of situations, knocking out or modifying a single gene is not sufficient; this is particularly true for genes belonging to a common family, or for genes showing redundant functions. Like many plants, the model organism Physcomitrella patens has experienced multiple events of polyploidization during evolution that has resulted in a number of families of duplicated genes. Here, we report a robust CRISPR-Cas9 system, based on the codelivery of a CAS9 expressing cassette, multiple sgRNA vectors, and a cassette for transient transformation selection, for gene knock-out in multiple gene families. We demonstrate that CRISPR-Cas9-mediated targeting of five different genes allows the selection of a quintuple mutant, and all possible subcombinations of mutants, in one experiment, with no mutations detected in potential off-target sequences. Furthermore, we confirmed the observation that the presence of repeats in the vicinity of the cutting region favors deletion due to the alternative end joining pathway, for which induced frameshift mutations can be potentially predicted. Because the number of multiple gene families in Physcomitrella is substantial, this tool opens new perspectives to study the role of expanded gene families in the colonization of land by plants.
Sieve-based relation extraction of gene regulatory networks from biological literature.

Science.gov (United States)

Žitnik, Slavko; Žitnik, Marinka; Zupan, Blaž; Bajec, Marko

2015-01-01

Relation extraction is an essential procedure in literature mining. It focuses on extracting semantic relations between parts of text, called mentions. Biomedical literature includes an enormous amount of textual descriptions of biological entities, their interactions and results of related experiments. To extract them in an explicit, computer readable format, these relations were at first extracted manually from databases. Manual curation was later replaced with automatic or semi-automatic tools with natural language processing capabilities. The current challenge is the development of information extraction procedures that can directly infer more complex relational structures, such as gene regulatory networks. We develop a computational approach for extraction of gene regulatory networks from textual data. Our method is designed as a sieve-based system and uses linear-chain conditional random fields and rules for relation extraction. With this method we successfully extracted the sporulation gene regulation network in the bacterium Bacillus subtilis for the information extraction challenge at the BioNLP 2013 conference. To enable extraction of distant relations using first-order models, we transform the data into skip-mention sequences. We infer multiple models, each of which is able to extract different relationship types. Following the shared task, we conducted additional analysis using different system settings that resulted in reducing the reconstruction error of bacterial sporulation network from 0.73 to 0.68, measured as the slot error rate between the predicted and the reference network. We observe that all relation extraction sieves contribute to the predictive performance of the proposed approach. Also, features constructed by considering mention words and their prefixes and suffixes are the most important features for higher accuracy of extraction. Analysis of distances between different mention types in the text shows that our choice of transforming
BiologicalNetworks 2.0 - an integrative view of genome biology data

Directory of Open Access Journals (Sweden)

Ponomarenko Julia

2010-12-01

Full Text Available Abstract Background: A significant problem in the study of mechanisms of an organism's development is the elucidation of interrelated factors which have an impact on the different levels of the organism, such as genes, biological molecules, cells, and cell systems. Numerous sources of heterogeneous data which exist for these subsystems are still not integrated sufficiently to give researchers a straightforward opportunity to analyze them together in the same frame of study. Systematic application of data integration methods is also hampered by a multitude of factors such as the orthogonal nature of the integrated data and naming problems. Results: Here we report on a new version of BiologicalNetworks, a research environment for the integral visualization and analysis of heterogeneous biological data. BiologicalNetworks can be queried for properties of thousands of different types of biological entities (genes/proteins, promoters, COGs, pathways, binding sites, and others) and their relations (interactions, co-expression, co-citations, and others). The system includes the build-pathways infrastructure for molecular interactions/relations and module discovery in high-throughput experiments. Also implemented in BiologicalNetworks are the Integrated Genome Viewer and Comparative Genomics Browser applications, which allow for the search and analysis of gene regulatory regions and their conservation in multiple species in conjunction with molecular pathways/networks, experimental data and functional annotations. Conclusions: The new release of BiologicalNetworks together with its back-end database introduces extensive functionality for a more efficient integrated multi-level analysis of microarray, sequence, regulatory, and other data. BiologicalNetworks is freely available at http://www.biologicalnetworks.org.
Bottom-up GGM algorithm for constructing multiple layered hierarchical gene regulatory networks

Science.gov (United States)

Multilayered hierarchical gene regulatory networks (ML-hGRNs) are very important for understanding genetics regulation of biological pathways. However, there are currently no computational algorithms available for directly building ML-hGRNs that regulate biological pathways. A bottom-up graphic Gaus...
Single amino acid substitution in important hemoglobinopathies does not disturb molecular function and biological process

Directory of Open Access Journals (Sweden)

Viroj Wiwanitkit

2008-06-01

Full Text Available Viroj Wiwanitkit, Department of Laboratory Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand. Abstract: Hemoglobin is an important protein found in the red cells of many animals. In humans, the hemoglobin is mainly distributed in the red blood cell. Single amino acid substitution is the main pathogenesis of most hemoglobin disorders. Here, the author used a new gene ontology technology to predict the molecular function and biological process of four important hemoglobin disorders with single substitution. The four studied important abnormal hemoglobins (Hb) with single substitution included Hb S, Hb E, Hb C, and Hb J-Baltimore. Using the GoFigure server, the molecular function and biological process in normal and abnormal hemoglobins was predicted. Compared with normal hemoglobin, all studied abnormal hemoglobins had the same function and biological process. This indicated that the overall function of oxygen transportation is not disturbed in the studied hemoglobin disorders. Clinical findings of oxygen depletion in abnormal hemoglobin should therefore be due to the other processes rather than genomics, proteomics, and expression levels. Keywords: hemoglobin, amino acid, substitution, function
Transmission as a basic process in microbial biology. Lwoff Award Prize Lecture.

Science.gov (United States)

Baquero, Fernando

2017-11-01

Transmission is a basic process in biology and evolution, as it communicates different biological entities within and across hierarchical levels (from genes to holobionts) both in time and space. Vertical descent, replication, is transmission of information across generations (in the time dimension), and horizontal descent is transmission of information across compartments (in the space dimension). Transmission is essentially a communication process that can be studied by analogy of the classic information theory, based on 'emitters', 'messages' and 'receivers'. The analogy can be easily extended to the triad 'emigration', 'migration' and 'immigration'. A number of causes (forces) determine the emission, and another set of causes (energies) assures the reception. The message in fact is essentially constituted by 'meaningful' biological entities. A DNA sequence, a cell and a population have a semiotic dimension, are 'signs' that are eventually recognized (decoded) and integrated by receiver biological entities. In cis-acting or unenclosed transmission, the emitters and receivers correspond to separated entities of the same hierarchical level; in trans-acting or embedded transmission, the information flows between different, but frequently nested, hierarchical levels. The result (as in introgressive events) is constantly producing innovation and feeding natural selection, influencing also the evolution of transmission processes. This review is based on the concepts presented at the André Lwoff Award Lecture in the FEMS Microbiology Congress in Maastricht in 2015. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Information-processing genes

International Nuclear Information System (INIS)

Tahir Shah, K.

1995-01-01

There are an estimated 100,000 genes in the human genome, of which 97% is non-coding. On the other hand, bacteria have little or no non-coding DNA. Non-coding regions include introns, Alu sequences, satellite DNA, and other segments not expressed as proteins. Why do they exist? Why has nature kept non-coding DNA during the long evolutionary period if it has no role in the development of complex life forms? Is the complexity of a species somehow correlated with the existence of apparently useless sequences? What kind of capability is encoded within such nucleotide sequences that is a necessary, but not a sufficient, condition for the evolution of complex life forms, keeping in mind the C-value paradox and the omnipresence of non-coding segments in higher eukaryotes and also in many archaea and prokaryotes? The physico-chemical description of biological processes is hardware oriented and does not highlight the algorithmic or information-processing aspect. However, an algorithm without its hardware implementation is as useless as hardware without the capability to run an algorithm. The nature and type of computation an information-processing hardware can perform depends only on its algorithm and the architecture that reflects the algorithm. Given that enormously difficult tasks such as high fidelity replication, transcription, editing and regulation are all achieved within a long linear sequence, it is natural to think that some parts of a genome are involved in these tasks. If some complex algorithms are encoded within these parts, then it is natural to think that non-coding regions contain information-processing algorithms. A comparison between well-known automatic sequences and sequences constructed out of motifs found in all species proves the point: noncoding regions are a sort of "hardwired" programs, i.e., they are linear representations of information-processing machines. Thus in our model, a noncoding region, e.g., an intron contains a program (or equivalently, it is

Mean associative multiplicities in deep inelastic processes

International Nuclear Information System (INIS)

Dzhaparidze, G.Sh.; Kiselev, A.V.; Petrov, V.A.

1981-01-01

The associative hadron multiplicities in deep inelastic and Drell--Yan processes are studied. In particular the mean multiplicities in different hard processes in QCD are found to be determined by the mean multiplicity in parton jet [ru
Statistical assessment of crosstalk enrichment between gene groups in biological networks.

Science.gov (United States)

McCormack, Theodore; Frings, Oliver; Alexeyenko, Andrey; Sonnhammer, Erik L L

2013-01-01

Analyzing groups of functionally coupled genes or proteins in the context of global interaction networks has become an important aspect of bioinformatic investigations. Assessing the statistical significance of crosstalk enrichment between or within groups of genes can be a valuable tool for functional annotation of experimental gene sets. Here we present CrossTalkZ, a statistical method and software to assess the significance of crosstalk enrichment between pairs of gene or protein groups in large biological networks. We demonstrate that the standard z-score is generally an appropriate and unbiased statistic. We further evaluate the ability of four different methods to reliably recover crosstalk within known biological pathways. We conclude that the methods preserving the second-order topological network properties perform best. Finally, we show how CrossTalkZ can be used to annotate experimental gene sets using known pathway annotations and that its performance at this task is superior to gene enrichment analysis (GEA). CrossTalkZ (available at http://sonnhammer.sbc.su.se/download/software/CrossTalkZ/) is implemented in C++, easy to use, fast, accepts various input file formats, and produces a number of statistics. These include z-score, p-value, false discovery rate, and a test of normality for the null distributions.
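The z-score idea in the abstract above can be illustrated with a minimal sketch. It is not the CrossTalkZ implementation (which is in C++): the function names are hypothetical, and the null model used here, a simple permutation of node labels, is an assumption chosen for brevity. It is a simpler stand-in than the topology-preserving randomizations that the abstract reports as performing best.

```python
import random
import statistics


def count_crosstalk(edges, group_a, group_b):
    """Count edges with one endpoint in group_a and the other in group_b."""
    a, b = set(group_a), set(group_b)
    return sum(1 for u, v in edges if (u in a and v in b) or (u in b and v in a))


def z_score(observed, null_counts):
    """Standard z-score of the observed count against a null distribution."""
    mu = statistics.mean(null_counts)
    sigma = statistics.pstdev(null_counts)
    return (observed - mu) / sigma


def crosstalk_z(edges, group_a, group_b, n_null=200, seed=0):
    """Compare observed crosstalk with a label-permutation null (an assumption)."""
    rng = random.Random(seed)
    nodes = sorted({n for edge in edges for n in edge})
    observed = count_crosstalk(edges, group_a, group_b)
    null_counts = []
    for _ in range(n_null):
        shuffled = nodes[:]
        rng.shuffle(shuffled)
        relabel = dict(zip(nodes, shuffled))  # same wiring, shuffled identities
        null_edges = [(relabel[u], relabel[v]) for u, v in edges]
        null_counts.append(count_crosstalk(null_edges, group_a, group_b))
    if statistics.pstdev(null_counts) == 0:
        return observed, float("nan")  # z is undefined for a constant null
    return observed, z_score(observed, null_counts)
```

A large positive z would suggest more links between the two groups than the null produces. A p-value or false discovery rate, as listed in the abstract, would need further steps not shown here.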
Gene expression profiling in cells with enhanced gamma-secretase activity.

Directory of Open Access Journals (Sweden)

Alexandra I Magold

2009-09-01

Full Text Available Processing by gamma-secretase of many type-I membrane protein substrates triggers signaling cascades by releasing intracellular domains (ICDs) that, following nuclear translocation, modulate the transcription of different genes regulating a diverse array of cellular and biological processes. Because the list of gamma-secretase substrates is growing quickly and this enzyme is a cancer and Alzheimer's disease therapeutic target, the mapping of gamma-secretase activity susceptible gene transcription is important for sharpening our view of specific affected genes, molecular functions and biological pathways. To identify genes and molecular functions transcriptionally affected by gamma-secretase activity, the cellular transcriptomes of Chinese hamster ovary (CHO) cells with enhanced and inhibited gamma-secretase activity were analyzed and compared by cDNA microarray. The functional clustering by FatiGO of the 1,981 identified genes revealed over- and under-represented groups with multiple activities and functions. Single genes with the most pronounced transcriptional susceptibility to gamma-secretase activity were evaluated by real-time PCR. Among the 21 validated genes, the strikingly decreased transcription of PTPRG and AMN1 and increased transcription of UPP1 potentially support data on cell cycle disturbances relevant to cancer, stem cell and neurodegenerative diseases' research. The mapping of interactions of proteins encoded by the validated genes exclusively relied on evidence-based data and revealed broad effects on Wnt pathway members, including WNT3A and DVL3. Intriguingly, the transcription of TERA, a gene of unknown function, is affected by gamma-secretase activity and was significantly altered in the analyzed human Alzheimer's disease brain cortices. Investigating the effects of gamma-secretase activity on gene transcription has revealed several affected clusters of molecular functions and, more specifically, 21 genes that hold significant
High-multiplicity processes

International Nuclear Information System (INIS)

Shelkov, G.; Sisakyan, A.; Mandzhavidze, I.

1999-01-01

We wish to demonstrate that the investigation of asymptotically high multiplicity (AHM) hadron reactions may solve, or at least clarify, a number of problems that cannot be solved in other ways. We rely on two ideas: (i) the entropy of the reaction's final state is proportional to the multiplicity, so it is precisely in the AHM domain that an equilibrium final state may be expected, and (ii) the AHM final state is cold because of the energy-momentum conservation laws. This means that collective phenomena may become important in the AHM domain. The possibility of the dominance of hard processes is also considered.
A Friendly-Biological Reactor SIMulator (BioReSIM) for studying biological processes in wastewater treatment processes

Directory of Open Access Journals (Sweden)

Raul Molina

2014-12-01

Full Text Available Biological processes for wastewater treatments are inherently dynamic systems because of the large variations in the influent wastewater flow rate, concentration composition and the adaptive behavior of the involved microorganisms. Moreover, the sludge retention time (SRT) is a critical factor to understand the bioreactor performances when changes in the influent or in the operation conditions take place. Since SRT are usually in the range of 10-30 days, the performance of biological reactors needs a long time to be monitored in a regular laboratory demonstration, limiting the knowledge that can be obtained in the experimental lab practice. In order to overcome this lack, mathematical models and computer simulations are useful tools to describe biochemical processes and predict the overall performance of bioreactors under different working operation conditions and variations of the inlet wastewater composition. The mathematical solution of the model could be difficult as numerous biochemical processes can be considered. Additionally, the description of biological reactors (mass balance, etc.) needs models represented by partial and/or ordinary differential equations associated with algebraic expressions, which require complex computational codes to obtain the numerical solutions. Different kinds of software for mathematical modeling can be used, from large degree of freedom simulators capable of free model definition (such as AQUASIM), to closed predefined model structure programs (such as BIOWIN). The first ones usually require long learning curves, whereas the second ones could be excessively rigid for specific wastewater treatment systems. As an alternative, we present Biological Reactor SIMulator (BioReSIM), a MATLAB code for the simulation of sequencing batch reactors (SBR) and rotating biological contactors (RBC) as biological systems of suspended and attached biomass for wastewater treatment, respectively. This BioReSIM allows the evaluation of simple and complex
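The kind of mass-balance model mentioned in the abstract above can be sketched as follows. This is not BioReSIM code (which is written in MATLAB), and the choices made here are assumptions for illustration only: Monod growth kinetics, a single batch react phase with no inflow or decay, forward Euler integration, and made-up parameter values. The function names are hypothetical.

```python
def monod_rate(s, mu_max, k_s):
    """Specific growth rate (1/h) as a Monod function of substrate s (mg/L)."""
    return mu_max * s / (k_s + s)


def simulate_batch(x0, s0, mu_max, k_s, yield_xs, t_end, dt):
    """Integrate dX/dt = mu*X and dS/dt = -mu*X/Y with forward Euler.

    x0, s0   : initial biomass and substrate concentrations (mg/L)
    yield_xs : biomass formed per unit of substrate consumed (Y)
    Returns the biomass and substrate concentrations at t_end.
    """
    x, s = x0, s0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        growth = monod_rate(s, mu_max, k_s) * x * dt  # biomass formed this step
        x += growth
        s -= growth / yield_xs  # substrate consumed to form that biomass
    return x, s


if __name__ == "__main__":
    # Illustrative values only; they are not taken from the source.
    x_end, s_end = simulate_batch(100.0, 200.0, 0.25, 20.0, 0.5, 24.0, 0.01)
    print(f"biomass {x_end:.1f} mg/L, substrate {s_end:.3f} mg/L")
```

In this simplified model the quantity X + Y*S stays constant, which gives a quick sanity check on the integration. A full SBR or RBC model would add fill and draw phases, decay terms and, for attached biomass, diffusion, none of which are attempted here.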
Transcriptional regulation of receptor-like protein genes by environmental stresses and hormones and their overexpression activities in Arabidopsis thaliana

NARCIS (Netherlands)

Wu, Jinbin; Liu, Zhijun; Zhang, Zhao; Lv, Yanting; Yang, Nan; Zhang, Guohua; Wu, Menyao; Lv, Shuo; Pan, Lixia; Joosten, Matthieu H.A.J.; Wang, Guodong

2016-01-01

Receptor-like proteins (RLPs) have been implicated in multiple biological processes, including plant development and immunity to microbial infection. Fifty-seven AtRLP genes have been identified in Arabidopsis, whereas only a few have been functionally characterized. This is due to the lack of
Upregulation of Immunoglobulin-related Genes in Cortical Sections from Multiple Sclerosis Patients

NARCIS (Netherlands)

Torkildsen, O.; Stansberg, C.; Angelskar, S.M.; Kooi, E.J.; Geurts, J.J.G.; van der Valk, P.; Myhr, K.M.; Steen, V.M.; Bo, L.

2010-01-01

Multiple sclerosis (MS) is a demyelinating disease of the central nervous system (CNS). Microarray-based global gene expression profiling is a promising method, used to study potential genes involved in the pathogenesis of the disease. In the present study, we have examined global gene expression in
Bayesian inference based modelling for gene transcriptional dynamics by integrating multiple source of knowledge

Directory of Open Access Journals (Sweden)

Wang Shu-Qiang

2012-07-01

Full Text Available Abstract Background A key challenge in the post genome era is to identify genome-wide transcriptional regulatory networks, which specify the interactions between transcription factors and their target genes. Numerous methods have been developed for reconstructing gene regulatory networks from expression data. However, most of them are based on coarse grained qualitative models, and cannot provide a quantitative view of regulatory systems. Results A binding affinity based regulatory model is proposed to quantify the transcriptional regulatory network. Multiple quantities, including binding affinity and the activity level of transcription factor (TF), are incorporated into a general learning model. The sequence features of the promoter and the possible occupancy of nucleosomes are exploited to estimate the binding probability of regulators. Compared with the previous models that only employ microarray data, the proposed model can bridge the gap between the relative background frequency of the observed nucleotide and the gene's transcription rate. Conclusions We test the proposed approach on two real-world microarray datasets. Experimental results show that the proposed model can effectively identify the parameters and the activity level of TF. Moreover, the kinetic parameters introduced in the proposed model can reveal more biological sense than previous models can.
APPRIS 2017: principal isoforms for multiple gene sets

Science.gov (United States)

Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

2018-01-01

Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main protein isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melanogaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475
Disruption of PC1/3 expression in mice causes dwarfism and multiple neuroendocrine peptide processing defects

DEFF Research Database (Denmark)

Zhu, Xiaorong; Zhou, An; Dey, Arunangsu

2002-01-01

vertebrates and invertebrates. Disruption of the gene-encoding mouse PC1/3 has now been accomplished and results in a syndrome of severe postnatal growth impairment and multiple defects in processing many hormone precursors, including hypothalamic growth hormone-releasing hormone (GHRH), pituitary...
Risk score modeling of multiple gene to gene interactions using aggregated-multifactor dimensionality reduction

Directory of Open Access Journals (Sweden)

Dai Hongying

2013-01-01

Full Text Available Abstract Background Multifactor Dimensionality Reduction (MDR) has been widely applied to detect gene-gene (GxG) interactions associated with complex diseases. Existing MDR methods summarize disease risk by a dichotomous predisposing model (high-risk/low-risk) from one optimal GxG interaction, which does not take the accumulated effects from multiple GxG interactions into account. Results We propose an Aggregated-Multifactor Dimensionality Reduction (A-MDR) method that exhaustively searches for and detects significant GxG interactions to generate an epistasis enriched gene network. An aggregated epistasis enriched risk score, which takes into account multiple GxG interactions simultaneously, replaces the dichotomous predisposing risk variable and provides higher resolution in the quantification of disease susceptibility. We evaluate this new A-MDR approach in a broad range of simulations. Also, we present the results of an application of the A-MDR method to a data set derived from Juvenile Idiopathic Arthritis patients treated with methotrexate (MTX) that revealed several GxG interactions in the folate pathway that were associated with treatment response. The epistasis enriched risk score that pooled information from 82 significant GxG interactions distinguished MTX responders from non-responders with 82% accuracy. Conclusions The proposed A-MDR is innovative in the MDR framework to investigate aggregated effects among GxG interactions. New measures (pOR, pRR and pChi) are proposed to detect multiple GxG interactions.
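The contrast between a dichotomous MDR label and an aggregated score can be sketched as below. This is a simplified illustration, not the published A-MDR procedure. The high-risk rule (a genotype cell is high-risk when its case/control ratio exceeds the overall ratio) follows the usual MDR convention. The aggregation by simple counting, the function names and the toy data are assumptions made here, and the pOR, pRR and pChi measures are not reproduced.

```python
from collections import Counter


def high_risk_cells(genotypes_a, genotypes_b, is_case):
    """Return the genotype combinations of one SNP pair labelled high-risk.

    A cell is high-risk when its case/control ratio exceeds the overall
    case/control ratio (compared by cross-multiplication to avoid dividing
    by zero).
    """
    cases, controls = Counter(), Counter()
    for ga, gb, case in zip(genotypes_a, genotypes_b, is_case):
        (cases if case else controls)[(ga, gb)] += 1
    n_cases, n_controls = sum(cases.values()), sum(controls.values())
    return {
        cell
        for cell in set(cases) | set(controls)
        if cases[cell] * n_controls > controls[cell] * n_cases
    }


def aggregated_scores(snps, pairs, is_case):
    """Count, per individual, the SNP pairs whose genotype cell is high-risk.

    snps  : dict mapping SNP name to a list of genotypes (one per individual)
    pairs : list of (snp_name, snp_name) interactions to aggregate; choosing
            which pairs are significant is outside this sketch
    """
    scores = [0] * len(is_case)
    for a, b in pairs:
        cells = high_risk_cells(snps[a], snps[b], is_case)
        for i, cell in enumerate(zip(snps[a], snps[b])):
            if cell in cells:
                scores[i] += 1
    return scores
```

Because the cells are labelled and scored on the same individuals, the scores here are optimistic. Any real evaluation would need held-out data or cross-validation.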
Comparison of multiple gene assembly methods for metabolic engineering

Science.gov (United States)

Chenfeng Lu; Karen Mansoorabadi; Thomas Jeffries

2007-01-01

A universal, rapid DNA assembly method for efficient multigene plasmid construction is important for biological research and for optimizing gene expression in industrial microbes. Three different approaches to achieve this goal were evaluated. These included creating long complementary extensions using a uracil-DNA glycosylase technique, overlap extension polymerase...
Gene expression analysis of interferon-beta treatment in multiple sclerosis

DEFF Research Database (Denmark)

Sellebjerg, F.; Datta, P.; Larsen, J.

2008-01-01

Treatment with interferon-beta (IFN-beta) induces the expression of hundreds of genes in blood mononuclear cells, and the expression of several genes has been proposed as a marker of the effect of treatment with IFN-beta. However, to date no molecules have been identified that are stably induced by treatment with IFN-beta. We use DNA microarrays to study gene expression in 10 multiple sclerosis (MS) patients who began de novo treatment with IFN-beta. After the first injection of IFN-beta, the expression of 74 out of 3428 genes changed at least two-fold and statistically significantly (after Bonferroni...
From gene to structure: Lactobacillus bulgaricus D-lactate dehydrogenase from yogurt as an integrated curriculum model for undergraduate molecular biology and biochemistry laboratory courses.

Science.gov (United States)

Lawton, Jeffrey A; Prescott, Noelle A; Lawton, Ping X

2018-05-01

We have developed an integrated, project-oriented curriculum for undergraduate molecular biology and biochemistry laboratory courses spanning two semesters that is organized around the ldhA gene from the yogurt-fermenting bacterium Lactobacillus bulgaricus, which encodes the enzyme d-lactate dehydrogenase. The molecular biology module, which consists of nine experiments carried out over eleven sessions, begins with the isolation of genomic DNA from L. bulgaricus in yogurt and guides students through the process of cloning the ldhA gene into a prokaryotic expression vector, followed by mRNA isolation and characterization of recombinant gene expression levels using RT-PCR. The biochemistry module, which consists of nine experiments carried out over eight sessions, begins with overexpression of the cloned ldhA gene and guides students through the process of affinity purification, biochemical characterization of the purified LdhA protein, and analysis of enzyme kinetics using various substrates and an inhibitor, concluding with a guided inquiry investigation of structure-function relationships in the three-dimensional structure of LdhA using molecular visualization software. Students conclude by writing a paper describing their work on the project, formatted as a manuscript to be submitted for publication in a scientific journal. Overall, this curriculum, with its emphasis on experiential learning, provides hands-on training with a variety of common laboratory techniques in molecular biology and biochemistry and builds experience with the process of scientific reasoning, along with reinforcement of essential transferrable skills such as critical thinking, information literacy, and written communication, all within the framework of an extended project having the look and feel of a research experience. © 2018 by The International Union of Biochemistry and Molecular Biology, 46(3):270-278, 2018. © 2018 The International Union of Biochemistry and Molecular Biology.
MO-DE-207B-03: Improved Cancer Classification Using Patient-Specific Biological Pathway Information Via Gene Expression Data

Energy Technology Data Exchange (ETDEWEB)

Young, M; Craft, D [Massachusetts General Hospital and Harvard Medical School, Boston, MA (United States)

2016-06-15

Purpose: To develop an efficient, pathway-based classification system using network biology statistics to assist in patient-specific response predictions to radiation and drug therapies across multiple cancer types. Methods: We developed PICS (Pathway Informed Classification System), a novel two-step cancer classification algorithm. In PICS, a matrix m of mRNA expression values for a patient cohort is collapsed into a matrix p of biological pathways. The entries of p, which we term pathway scores, are obtained from either principal component analysis (PCA), normal tissue centroid (NTC), or gene expression deviation (GED). The pathway score matrix is clustered using both k-means and hierarchical clustering, and a clustering is judged by how well it groups patients into distinct survival classes. The most effective pathway scoring/clustering combination, per clustering p-value, thus generates various ‘signatures’ for conventional and functional cancer classification. Results: PICS successfully regularized large dimension gene data, separated normal and cancerous tissues, and clustered a large patient cohort spanning six cancer types. Furthermore, PICS clustered patient cohorts into distinct, statistically-significant survival groups. For a suboptimally-debulked ovarian cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00127) showed significant improvement over that of a prior gene expression-classified study (p = .0179). For a pancreatic cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00141) showed significant improvement over that of a prior gene expression-classified study (p = .04). Pathway-based classification confirmed biomarkers for the pyrimidine, WNT-signaling, glycerophosphoglycerol, beta-alanine, and pantothenic acid pathways for ovarian cancer. Despite its robust nature, PICS requires significantly less run time than current pathway scoring methods. Conclusion: This work validates the PICS method to improve
Systematic reconstruction of autism biology from massive genetic mutation profiles.

Science.gov (United States)

Luo, Weijun; Zhang, Chaolin; Jiang, Yong-Hui; Brouwer, Cory R

2018-04-01

Autism spectrum disorder (ASD) affects 1% of world population and has become a pressing medical and social problem worldwide. As a paradigmatic complex genetic disease, ASD has been intensively studied and thousands of gene mutations have been reported. Because these mutations rarely recur, it is difficult to (i) pinpoint the fewer disease-causing versus majority random events and (ii) replicate or verify independent studies. A coherent and systematic understanding of autism biology has not been achieved. We analyzed 3392 and 4792 autism-related mutations from two large-scale whole-exome studies across multiple resolution levels, that is, variants (single-nucleotide), genes (protein-coding unit), and pathways (molecular module). These mutations do not recur or replicate at the variant level, but significantly and increasingly do so at gene and pathway levels. Genetic association reveals a novel gene + pathway dual-hit model, where the mutation burden becomes less relevant. In multiple independent analyses, hundreds of variants or genes repeatedly converge to several canonical pathways, either novel or literature-supported. These pathways define recurrent and systematic ASD biology, distinct from previously reported gene groups or networks. They also present a catalog of novel ASD risk factors including 118 variants and 72 genes. At a subpathway level, most variants disrupt the pathway-related gene functions, and in the same gene, they tend to hit residues extremely close to each other and in the same domain. Multiple interacting variants spotlight key modules, including the cAMP (adenosine 3',5'-monophosphate) second-messenger system and mGluR (metabotropic glutamate receptor) signaling regulation by GRKs (G protein-coupled receptor kinases). At a superpathway level, distinct pathways further interconnect and converge to three biology themes: synaptic function, morphology, and plasticity.
Impaired global, and compensatory local, biological motion processing in people with high levels of autistic traits.

Science.gov (United States)

van Boxtel, Jeroen J A; Lu, Hongjing

2013-01-01

People with Autism Spectrum Disorder (ASD) are hypothesized to have poor high-level processing but superior low-level processing, causing impaired social recognition, and a focus on non-social stimulus contingencies. Biological motion perception provides an ideal domain to investigate exactly how ASD modulates the interaction between low and high-level processing, because it involves multiple processing stages, and carries many important social cues. We investigated individual differences among typically developing observers in biological motion processing, and whether such individual differences associate with the number of autistic traits. In Experiment 1, we found that individuals with fewer autistic traits were automatically and involuntarily attracted to global biological motion information, whereas individuals with more autistic traits did not show this pre-attentional distraction. We employed an action adaptation paradigm in the second study to show that individuals with more autistic traits were able to compensate for deficits in global processing with an increased involvement in local processing. Our findings can be interpreted within a predictive coding framework, which characterizes the functional relationship between local and global processing stages, and explains how these stages contribute to the perceptual difficulties associated with ASD.
Impaired global, and compensatory local, biological motion processing in people with high levels of autistic traits

Directory of Open Access Journals (Sweden)

Jeroen J A Van Boxtel

2013-04-01

Full Text Available People with Autism Spectrum Disorder (ASD) are hypothesized to have poor high-level processing but superior low-level processing, causing impaired social recognition, and a focus on non-social stimulus contingencies. Biological motion perception provides an ideal domain to investigate exactly how ASD modulates the interaction between low and high-level processing, because it involves multiple processing stages, and carries many important social cues. We investigated individual differences among typically developing observers in biological motion processing, and whether such individual differences associate with the number of autistic traits. In Experiment 1, we found that individuals with fewer autistic traits were automatically and involuntarily attracted to global biological motion information, whereas individuals with more autistic traits did not show this pre-attentional distraction. We employed an action adaptation paradigm in the second study to show that individuals with more autistic traits were able to compensate for deficits in global processing with an increased involvement in local processing. Our findings can be interpreted within a predictive coding framework, which characterizes the functional relationship between local and global processing stages, and explains how these stages contribute to the perceptual difficulties associated with ASD.
Graphics processing units in bioinformatics, computational biology and systems biology.

Science.gov (United States)

Nobile, Marco S; Cazzaniga, Paolo; Tangherloni, Andrea; Besozzi, Daniela

2017-09-01

Several studies in Bioinformatics, Computational Biology and Systems Biology rely on the definition of physico-chemical or mathematical models of biological systems at different scales and levels of complexity, ranging from the interaction of atoms in single molecules up to genome-wide interaction networks. Traditional computational methods and software tools developed in these research fields share a common trait: they can be computationally demanding on Central Processing Units (CPUs), therefore limiting their applicability in many circumstances. To overcome this issue, general-purpose Graphics Processing Units (GPUs) are gaining increasing attention from the scientific community, as they can considerably reduce the running time required by standard CPU-based software, and allow more intensive investigations of biological systems. In this review, we present a collection of GPU tools recently developed to perform computational analyses in life science disciplines, emphasizing the advantages and the drawbacks in the use of these parallel architectures. The complete list of GPU-powered tools here reviewed is available at http://bit.ly/gputools. © The Author 2016. Published by Oxford University Press.
Content-rich biological network constructed by mining PubMed abstracts

Directory of Open Access Journals (Sweden)

Sharp Burt M

2004-10-01

Full Text Available Abstract Background The integration of the rapidly expanding corpus of information about the genome, transcriptome, and proteome, engendered by powerful technological advances, such as microarrays, and the availability of genomic sequence from multiple species, challenges the grasp and comprehension of the scientific community. Despite the existence of text-mining methods that identify biological relationships based on the textual co-occurrence of gene/protein terms or similarities in abstract texts, knowledge of the underlying molecular connections on a large scale, which is prerequisite to understanding novel biological processes, lags far behind the accumulation of data. While computationally efficient, the co-occurrence-based approaches fail to characterize (e.g., inhibition or stimulation, directionality) biological interactions. Programs with natural language processing (NLP) capability have been created to address these limitations; however, they are in general not readily accessible to the public. Results We present an NLP-based text-mining approach, Chilibot, which constructs content-rich relationship networks among biological concepts, genes, proteins, or drugs. Amongst its features, suggestions for new hypotheses can be generated. Lastly, we provide evidence that the connectivity of molecular networks extracted from the biological literature follows the power-law distribution, indicating scale-free topologies consistent with the results of previous experimental analyses. Conclusions Chilibot distills scientific relationships from knowledge available throughout a wide range of biological domains and presents these in a content-rich graphical format, thus integrating general biomedical knowledge with the specialized knowledge and interests of the user. Chilibot (http://www.chilibot.net) can be accessed free of charge to academic users.
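The co-occurrence baseline that the abstract above contrasts with NLP-based mining can be sketched in a few lines. This is not Chilibot: the function name, the naive sentence splitter and the example sentences are assumptions made for illustration.

```python
import re
from collections import Counter
from itertools import combinations


def cooccurrence_edges(abstracts, terms):
    """Count, for each unordered term pair, the sentences that mention both."""
    patterns = {
        term: re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        for term in terms
    }
    counts = Counter()
    for text in abstracts:
        # Naive split on sentence-final punctuation followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            present = sorted(t for t, p in patterns.items() if p.search(sentence))
            for pair in combinations(present, 2):
                counts[pair] += 1
    return counts


if __name__ == "__main__":
    docs = ["TP53 activates MDM2. MDM2 inhibits TP53 in turn."]
    print(cooccurrence_edges(docs, ["TP53", "MDM2"]))
```

Both example sentences contribute to the same undirected pair, even though one describes activation and the other inhibition in the opposite direction. That loss of relationship type and directionality is the limitation the abstract attributes to co-occurrence methods.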

Examining the process of de novo gene birth: an educational primer on "integration of new genes into cellular networks, and their structural maturation".

Science.gov (United States)

Frietze, Seth; Leatherman, Judith

2014-03-01

New genes that arise from modification of the noncoding portion of a genome rather than being duplicated from parent genes are called de novo genes. These genes, identified by their brief evolution and lack of parent genes, provide an opportunity to study the timeframe in which emerging genes integrate into cellular networks, and how the characteristics of these genes change as they mature into bona fide genes. An article by G. Abrusán provides an opportunity to introduce students to fundamental concepts in evolutionary and comparative genetics and to provide a technical background by which to discuss systems biology approaches when studying the evolutionary process of gene birth. Basic background needed to understand the Abrusán study and details on comparative genomic concepts tailored for a classroom discussion are provided, including discussion questions and a supplemental exercise on navigating a genome database.
MiR-210 disturbs mitotic progression through regulating a group of mitosis-related genes

OpenAIRE

He, Jie; Wu, Jiangbin; Xu, Naihan; Xie, Weidong; Li, Mengnan; Li, Jianna; Jiang, Yuyang; Yang, Burton B.; Zhang, Yaou

2012-01-01

MiR-210 is up-regulated in multiple cancer types but its function is disputable and further investigation is necessary. Using a bioinformatics approach, we identified the putative target genes of miR-210 in hypoxia-induced CNE cells from genome-wide scale. Two functional gene groups related to cell cycle and RNA processing were recognized as the major targets of miR-210. Here, we investigated the molecular mechanism and biological consequence of miR-210 in cell cycle regulation, particularly ...
Synthetic Biology: Putting Synthesis into Biology

Science.gov (United States)

Liang, Jing; Luo, Yunzi; Zhao, Huimin

2010-01-01

The ability to manipulate living organisms is at the heart of a range of emerging technologies that serve to address important and current problems in environment, energy, and health. However, with all its complexity and interconnectivity, biology has for many years been recalcitrant to engineering manipulations. The recent advances in synthesis, analysis, and modeling methods have finally provided the tools necessary to manipulate living systems in meaningful ways, and have led to the coining of a field named synthetic biology. The scope of synthetic biology is as complicated as life itself – encompassing many branches of science, and across many scales of application. New DNA synthesis and assembly techniques have made routine the customization of very large DNA molecules. This in turn has allowed the incorporation of multiple genes and pathways. By coupling these with techniques that allow for the modeling and design of protein functions, scientists have now gained the tools to create completely novel biological machineries. Even the ultimate biological machinery – a self-replicating organism – is being pursued at this moment. It is the purpose of this review to dissect and organize these various components of synthetic biology into a coherent picture. PMID:21064036
Ventilator-associated pneumonia caused by carbapenem-resistant Enterobacteriaceae carrying multiple metallo-beta-lactamase genes

Directory of Open Access Journals (Sweden)

Dwivedi Mayank

2009-07-01

Full Text Available Context: Ventilator-associated pneumonia (VAP) is a leading nosocomial infection in the intensive care unit (ICU). Members of Enterobacteriaceae are the most common causative agents and carbapenems are the most commonly used antibiotics. Metallo-beta-lactamase (MBL) production leading to treatment failure may go unnoticed by routine disc diffusion susceptibility testing. Moreover, there is not much information on association of MBL-producing Enterobacteriaceae with ICU-acquired VAP. Therefore, a study was undertaken to find out the association of MBL-producing Enterobacteriaceae with VAP. Settings: This study was conducted in a large tertiary care hospital of North India with an eight-bed critical care unit. Materials and Methods: The respiratory samples (bronchoalveolar lavage, protected brush catheter specimens and endotracheal or transtracheal aspirates) obtained from VAP patients (during January 2005-December 2006) were processed, isolated bacteria identified and their antibiotic susceptibilities tested as per standard protocols. The isolates of Enterobacteriaceae resistant to carbapenem were subjected to phenotypic and genotypic tests for the detection of MBLs. Results: Twelve of 64 isolates of Enterobacteriaceae were detected as MBL producers, bla IMP being the most prevalent gene. Additionally, in three strains, simultaneous coexistence of multiple MBL genes was detected. Conclusion: The coexistence of multiple MBL genes in Enterobacteriaceae is an alarming situation. As MBL genes are associated with integrons that can be embedded in transposons, which in turn can be accommodated on plasmids thereby resulting in a highly mobile genetic apparatus, the further spread of these genes in different pathogens is likely to occur.
Gene expression studies of reference genes for quantitative real-time PCR: an overview in insects.

Science.gov (United States)

Shakeel, Muhammad; Rodriguez, Alicia; Tahir, Urfa Bin; Jin, Fengliang

2018-02-01

Whenever gene expression is being examined, it is essential that a normalization process is carried out to eliminate non-biological variations. The use of reference genes, such as glyceraldehyde-3-phosphate dehydrogenase, actin, and ribosomal protein genes, is the usual method of choice for normalizing gene expression. Although reference genes are used to normalize target gene expression, a major problem is that the stability of these genes differs among tissues, developmental stages, species, and responses to abiotic factors. Therefore, the use and validation of multiple reference genes are required. This review discusses why RT-qPCR has become the preferred method for validating results of gene expression profiles, the use of specific and non-specific dyes, the importance of primers and probes for qPCR, and several statistical algorithms developed to help validate potential reference genes. The conflicts arising from the use of classical reference genes in gene normalization and their replacement with novel references are also discussed, citing the high and low stability of classical and novel reference genes under various biotic and abiotic experimental conditions and the various methods applied for reference gene amplification.
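
As an illustration of normalizing against several reference genes, the sketch below applies the widely used 2^-ΔΔCt calculation with the reference Ct values averaged. It is not taken from the review; the function name, the Ct values, and the assumption of roughly 100% amplification efficiency for every primer pair are all hypothetical.

```python
def relative_expression(target_ct, reference_cts, calib_target_ct, calib_reference_cts):
    """Fold change of a target gene by the 2^-ddCt method with several reference genes.

    Assumes every primer pair amplifies with ~100% efficiency (an assumption).
    Averaging reference Ct values arithmetically corresponds to taking the
    geometric mean of the reference genes' relative quantities.
    """
    ref_mean = sum(reference_cts) / len(reference_cts)
    calib_ref_mean = sum(calib_reference_cts) / len(calib_reference_cts)
    delta_ct_sample = target_ct - ref_mean            # dCt in the treated sample
    delta_ct_calib = calib_target_ct - calib_ref_mean  # dCt in the calibrator sample
    return 2.0 ** -(delta_ct_sample - delta_ct_calib)


# Hypothetical Ct values: one target gene, two reference genes.
fold = relative_expression(
    target_ct=24.0, reference_cts=[18.0, 20.0],
    calib_target_ct=26.0, calib_reference_cts=[18.5, 19.5],
)
print(fold)  # 4.0
```

If one of the reference genes shifts under the treatment, the averaged reference Ct shifts with it, which is why the stability of each candidate needs to be validated before it is used this way.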
Circadian Enhancers Coordinate Multiple Phases of Rhythmic Gene Transcription In Vivo

Science.gov (United States)

Fang, Bin; Everett, Logan J.; Jager, Jennifer; Briggs, Erika; Armour, Sean M.; Feng, Dan; Roy, Ankur; Gerhart-Hines, Zachary; Sun, Zheng; Lazar, Mitchell A.

2014-01-01

SUMMARY Mammalian transcriptomes display complex circadian rhythms with multiple phases of gene expression that cannot be accounted for by current models of the molecular clock. We have determined the underlying mechanisms by measuring nascent RNA transcription around the clock in mouse liver. Unbiased examination of eRNAs that cluster in specific circadian phases identified functional enhancers driven by distinct transcription factors (TFs). We further identify on a global scale the components of the TF cistromes that function to orchestrate circadian gene expression. Integrated genomic analyses also revealed novel mechanisms by which a single circadian factor controls opposing transcriptional phases. These findings shed new light on the diversity and specificity of TF function in the generation of multiple phases of circadian gene transcription in a mammalian organ. PMID:25416951
A non-inheritable maternal Cas9-based multiple-gene editing system in mice

OpenAIRE

Takayuki Sakurai; Akiko Kamiyoshi; Hisaka Kawate; Chie Mori; Satoshi Watanabe; Megumu Tanaka; Ryuichi Uetake; Masahiro Sato; Takayuki Shindo

2016-01-01

The CRISPR/Cas9 system is capable of editing multiple genes through one-step zygote injection. The preexisting method is largely based on the co-injection of Cas9 DNA (or mRNA) and guide RNAs (gRNAs); however, it is unclear how many genes can be simultaneously edited by this method, and a reliable means to generate transgenic (Tg) animals with multiple gene editing has yet to be developed. Here, we employed non-inheritable maternal Cas9 (maCas9) protein derived from Tg mice with systemic Cas9...
StrateGene: object-oriented programming in molecular biology.

Science.gov (United States)

Carhart, R E; Cash, H D; Moore, J F

1988-03-01

This paper describes some of the ways that object-oriented programming methodologies have been used to represent and manipulate biological information in a working application. When running on a Xerox 1100 series computer, StrateGene functions as a genetic engineering workstation for the management of information about cloning experiments. It represents biological molecules, enzymes, fragments, and methods as classes, subclasses, and members in a hierarchy of objects. These objects may have various attributes, which themselves can be defined and classified. The attributes and their values can be passed from the classes of objects down to the subclasses and members. The user can modify the objects and their attributes while using them. New knowledge and changes to the system can be incorporated relatively easily. The operations on the biological objects are associated with the objects themselves. This makes it easier to invoke them correctly and allows generic operations to be customized for the particular object.
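
The class, subclass, and member hierarchy described above can be sketched in a modern language. The following Python example is only a loose illustration of the idea, not StrateGene's actual design; all class names, attributes, and the toy sequence are hypothetical.

```python
class BioObject:
    """Root of a toy hierarchy; class attributes are inherited by subclasses and members."""
    kind = "biological object"

    def __init__(self, name, **attributes):
        self.name = name
        # Per-object attributes that a user could modify at run time.
        self.attributes = dict(attributes)

    def describe(self):
        # Generic operation; subclasses may customize it.
        return f"{self.name} ({self.kind})"


class Molecule(BioObject):
    kind = "molecule"

    def __init__(self, name, sequence, **attributes):
        super().__init__(name, **attributes)
        self.sequence = sequence.upper()

    def describe(self):
        # Customized version of the generic operation.
        return f"{super().describe()}, {len(self.sequence)} bp"


class Fragment(Molecule):
    kind = "fragment"


class Enzyme(BioObject):
    kind = "enzyme"

    def __init__(self, name, site, cut_offset, **attributes):
        super().__init__(name, **attributes)
        self.site = site.upper()
        self.cut_offset = cut_offset  # cut position relative to the start of the site

    def digest(self, molecule):
        """Cut a linear molecule at every occurrence of the recognition site."""
        pieces, start = [], 0
        pos = molecule.sequence.find(self.site)
        while pos != -1:
            cut = pos + self.cut_offset
            pieces.append(molecule.sequence[start:cut])
            start = cut
            pos = molecule.sequence.find(self.site, pos + 1)
        pieces.append(molecule.sequence[start:])
        return [Fragment(f"{molecule.name}-frag{i + 1}", seq) for i, seq in enumerate(pieces)]


ecori = Enzyme("EcoRI", site="GAATTC", cut_offset=1)   # EcoRI cuts G^AATTC
insert = Molecule("toy-insert", "AAGAATTCTTTTGAATTCGG")  # hypothetical sequence
fragments = ecori.digest(insert)
print([f.describe() for f in fragments])
```

Keeping `digest` on the enzyme object mirrors the point made in the abstract: the operation lives with the object it belongs to, so it is hard to invoke on the wrong kind of thing.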
Hybrid-Lambda: simulation of multiple merger and Kingman gene genealogies in species networks and species trees.

Science.gov (United States)

Zhu, Sha; Degnan, James H; Goldstien, Sharyn J; Eldon, Bjarki

2015-09-15

There has been increasing interest in coalescent models that admit multiple mergers of ancestral lineages, and in modeling hybridization and coalescence simultaneously. Hybrid-Lambda is a software package that simulates gene genealogies under multiple merger and Kingman's coalescent processes within species networks or species trees. Hybrid-Lambda allows different coalescent processes to be specified for different populations, and allows for time to be converted between generations and coalescent units, by specifying a population size for each population. In addition, Hybrid-Lambda can generate simulated datasets, assuming the infinitely many sites mutation model, and compute the F_ST statistic. As an illustration, we apply Hybrid-Lambda to infer the time of subdivision of certain marine invertebrates under different coalescent processes. Hybrid-Lambda makes it possible to investigate biogeographic concordance among high fecundity species exhibiting skewed offspring distribution.
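
For readers unfamiliar with coalescent time scales, the sketch below simulates only the simplest case mentioned above, Kingman's coalescent in a single population, and converts coalescent units to generations. It does not use or reproduce Hybrid-Lambda; the function names are hypothetical, and the conversion factor (one coalescent unit = ploidy × N generations) is a common convention assumed here, not one confirmed for the package.

```python
import random


def kingman_tmrca(n_samples, rng):
    """Time to the most recent common ancestor, in coalescent units, under Kingman's coalescent."""
    t = 0.0
    for k in range(n_samples, 1, -1):
        rate = k * (k - 1) / 2.0      # each of the k-choose-2 pairs merges at rate 1
        t += rng.expovariate(rate)    # waiting time until the next pairwise merger
    return t


def to_generations(t_coalescent, pop_size, ploidy=2):
    # Assumed convention: one coalescent unit equals ploidy * N generations.
    return t_coalescent * ploidy * pop_size


rng = random.Random(1)
times = [kingman_tmrca(10, rng) for _ in range(20000)]
mean_tmrca = sum(times) / len(times)
# Theory gives E[TMRCA] = 2 * (1 - 1/n), i.e. 1.8 coalescent units for n = 10.
print(round(mean_tmrca, 2), to_generations(mean_tmrca, pop_size=1000))
```

Multiple-merger models differ in that more than two lineages can merge in a single event, so the pairwise rate used here no longer applies.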
Synthetic biology and occupational risk.

Science.gov (United States)

Howard, John; Murashov, Vladimir; Schulte, Paul

2017-03-01

Synthetic biology is an emerging interdisciplinary field of biotechnology that involves applying the principles of engineering and chemical design to biological systems. Biosafety professionals have done an excellent job in addressing research laboratory safety as synthetic biology and gene editing have emerged from the larger field of biotechnology. Despite these efforts, risks posed by synthetic biology are of increasing concern as research procedures scale up to industrial processes in the larger bioeconomy. A greater number and variety of workers will be exposed to commercial synthetic biology risks in the future, including risks to a variety of workers from the use of lentiviral vectors as gene transfer devices. There is a need to review and enhance current protection measures in the field of synthetic biology, whether in experimental laboratories where new advances are being researched, in health care settings where treatments using viral vectors as gene delivery systems are increasingly being used, or in the industrial bioeconomy. Enhanced worker protection measures should include increased injury and illness surveillance of the synthetic biology workforce; proactive risk assessment and management of synthetic biology products; research on the relative effectiveness of extrinsic and intrinsic biocontainment methods; specific safety guidance for synthetic biology industrial processes; determination of appropriate medical mitigation measures for lentiviral vector exposure incidents; and greater awareness and involvement in synthetic biology safety by the general occupational safety and health community as well as by government occupational safety and health research and regulatory agencies.
Regulation of gene expression by manipulating transcriptional repressor activity using a novel CoSRI technology.

Science.gov (United States)

Xu, Yue; Li, Song Feng; Parish, Roger W

2017-07-01

Targeted gene manipulation is a central strategy for studying gene function and identifying related biological processes. However, a methodology for manipulating the regulatory motifs of transcription factors is lacking as these factors commonly possess multiple motifs (e.g. repression and activation motifs) which collaborate with each other to regulate multiple biological processes. We describe a novel approach designated conserved sequence-guided repressor inhibition (CoSRI) that can specifically reduce or abolish the repressive activities of transcription factors in vivo. The technology was evaluated using the chimeric MYB80-EAR transcription factor and subsequently the endogenous WUS transcription factor. The technology was employed to develop a reversible male sterility system applicable to hybrid seed production. In order to determine the capacity of the technology to regulate the activity of endogenous transcription factors, the WUS repressor was chosen. The WUS repression motif could be inhibited in vivo and the transformed plants exhibited the wus-1 phenotype. Consequently, the technology can be used to manipulate the activities of transcriptional repressor motifs regulating beneficial traits in crop plants and other eukaryotic organisms. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Positron emission tomography and gene therapy: basic concepts and experimental approaches for in vivo gene expression imaging.

Science.gov (United States)

Peñuelas, Iván; Boán, JoséF; Martí-Climent, Josep M; Sangro, Bruno; Mazzolini, Guillermo; Prieto, Jesús; Richter, José A

2004-01-01

More than two decades of intense research have allowed gene therapy to move from the laboratory to the clinical setting, where its use for the treatment of human pathologies has increased considerably in recent years. However, many crucial questions remain to be solved in this challenging field. In vivo imaging with positron emission tomography (PET) by combination of the appropriate PET reporter gene and PET reporter probe could provide invaluable qualitative and quantitative information to answer multiple unsolved questions about gene therapy. PET imaging could be used to define parameters not available by other techniques that are of substantial interest not only for the proper understanding of the gene therapy process, but also for its future development and clinical application in humans. This review focuses on the molecular biology basis of gene therapy and molecular imaging, describing the fundamentals of in vivo gene expression imaging by PET, and the application of PET to gene therapy, as a technology that can be used in many different ways. It could be applied to avoid invasive procedures for gene therapy monitoring; to accurately diagnose the pathology for better planning of the most adequate therapeutic approach; as treatment evaluation to image the functional effects of gene therapy at the biochemical level; and as a quantitative noninvasive way to monitor the location, magnitude and persistence of gene expression over time. It would also contribute to a better understanding of vector biology and pharmacology devoted to the development of safer and more efficient vectors.
Insect-gene-activity detection system for chemical and biological warfare agents and toxic industrial chemicals

Science.gov (United States)

Mackie, Ryan S.; Schilling, Amanda S.; Lopez, Arturo M.; Rayms-Keller, Alfredo

2002-02-01

Detection of multiple chemical and biological weapons (CBW) agents and/or complex mixtures of toxic industrial chemicals (TIC) is imperative for both the commercial and military sectors. In a military scenario, a multi-CBW attack would create confusion, thereby delaying decontamination and therapeutic efforts. In the commercial sector, polluted sites invariably contain a mixture of TIC. Novel detection systems capable of detecting CBW and TIC are sorely needed. While it may be impossible to build a detector capable of discriminating all the possible combinations of CBW, a detection system capable of statistically predicting the most likely composition of a given mixture is within the reach of current emerging technologies. Aquatic insect-gene activity may prove to be a sensitive, discriminating, and elegant paradigm for the detection of CBW and TIC. We propose to systematically establish the expression patterns of selected protein markers in insects exposed to specific mixtures of chemical and biological warfare agents to generate a library of biosignatures of exposure. The predicting capabilities of an operational library of biosignatures of exposures will allow the detection of emerging novel or genetically engineered agents, as well as complex mixtures of chemical and biological weapons agents. CBW and TIC are discussed in the context of war, terrorism, and pollution.
Mean associated multiplicities in deep inelastic processes

International Nuclear Information System (INIS)

Dzhaparidze, G.Sh.; Kiselev, A.V.; Petrov, V.A.

1982-01-01

A formula is derived for the mean hadron multiplicity in the target fragmentation range of deep inelastic scattering processes. It is shown that in the high-x region the ratio of the mean multiplicities in the current fragmentation region and in the target fragmentation region tends to unity at high energies. The mean multiplicity for the Drell-Yan process is considered
Mean associated multiplicities in deep inelastic processes

International Nuclear Information System (INIS)

Dzhaparidze, G.S.; Kiselev, A.V.; Petrov, V.A.

1982-01-01

A formula is derived for the mean multiplicity of hadrons in the target-fragmentation region in the process of deep inelastic scattering. It is shown that in the region of large x the ratio of the mean multiplicities in the current- and target-fragmentation regions tends to unity at high energies. The mean multiplicity in the Drell-Yan process is also discussed
Stochastic biological response to radiation. Comprehensive analysis of gene expression

International Nuclear Information System (INIS)

Inoue, Tohru; Hirabayashi, Yoko

2012-01-01

The authors explain that the radiation effect on biological systems is stochastic, following the laws of physics, and differs from chemical effects, using as examples Cs-137 gamma-ray (GR) and benzene (BZ) exposures of mice and the resulting comprehensive analyses of gene expression. Single GR irradiation is done with Gamma Cell 40 (CSR) to C57BL/6 or C3H/He mouse at 0, 0.6 and 3 Gy. BZ is given orally at 150 mg/kg/day for 5 days x 2 weeks. Bone marrow cells are sampled 1 month after the exposure. Comprehensive gene expression is analyzed by Gene Chip Mouse Genome 430 2.0 Array (Affymetrix) and data are processed by programs for case normalization, statistics, network generation, functional analysis etc. GR irradiation brings about changes of gene expression, which are classifiable into common genes that vary commonly with the dose change and stochastic genes that vary stochastically within each dose: e.g., with the Welch t-test, significant differences are between 0/3 Gy (dose-specific difference, 455 pbs (probe sets), in stochastic 2113 pbs), 0/0.6 Gy (267 in 1284 pbs) and 0.6/3 Gy (532 pbs); and with one-way analysis of variance (ANOVA) and hierarchical/dendrographic analyses, 520 pbs are shown to involve the dose-dependent 226 and dose-specific 294 pbs. It is also shown that at 3 Gy, expression of common genes is rather suppressed, including those related to the proliferation/apoptosis of B/T cells, and of stochastic genes, related to cell division/signaling. A Venn diagram of the common genes of the above 520 pbs, stochastic 2113 pbs at 3 Gy and 1284 pbs at 0.6 Gy shows the overlapping genes 29, 2 and 4, respectively, indicating only 35 pbs are overlapping in total. Network analysis of changes by GR shows the rather high expression of genes around the hub of cAMP response element binding protein (CREB) at 0.6 Gy, and rather variable expression around the CREB hub/suppressed expression of the kinesin hub at 3 Gy; in the network by BZ exposure, unchanged or low expression around p53 hub and suppression
Expression regulation of design process gene in product design

DEFF Research Database (Denmark)

Li, Bo; Fang, Lusheng; Li, Bo

2011-01-01

To improve the design process efficiency, this paper proposes the principle and methodology that design process gene controls the characteristics of design process under the framework of design process reuse and optimization based on design process gene. First, the concept of design process gene...... is proposed and analyzed, as well as its three categories i.e., the operator gene, the structural gene and the regulator gene. Second, the trigger mechanism that design objectives and constraints trigger the operator gene is constructed. Third, the expression principle of structural gene is analyzed...... with the example of design management gene. Last, the regulation mode that the regulator gene regulates the expression of the structural gene is established and it is illustrated by taking the design process management gene as an example. © (2011) Trans Tech Publications....
Glyphosate accumulation, translocation, and biological effects in Coffea arabica after single and multiple exposures

DEFF Research Database (Denmark)

Schrübbers, Lars Christoph; Valverde, Bernal E.; Strobel, Bjarne W.

2016-01-01

In perennial crops like coffee, glyphosate drift exposure can occur multiple times during its commercial life span. Due to limited glyphosate degradation in higher plants, a potential accumulation of glyphosate could lead to increased biological effects with increased exposure frequency....... In this study, we investigated glyphosate translocation over time, and its concentration and biological effects after single and multiple simulated spray-drift exposures. Additionally, shikimic acid/glyphosate ratios were used as biomarkers for glyphosate binding to its target enzyme. Four weeks after...... the exposure, glyphosate was continuously translocated. Shikimic acid levels were linearly correlated with glyphosate levels. After two months, however, glyphosate appeared to have reduced activity. In the greenhouse, multiple applications resulted in higher internal glyphosate concentrations. The time...
A Convenient Cas9-based Conditional Knockout Strategy for Simultaneously Targeting Multiple Genes in Mouse.

Science.gov (United States)

Chen, Jiang; Du, Yinan; He, Xueyan; Huang, Xingxu; Shi, Yun S

2017-03-31

The most powerful way to probe protein function is to characterize the consequence of its deletion. Compared to conventional gene knockout (KO), conditional knockout (cKO) provides an advanced gene targeting strategy with which gene deletion can be performed in a spatially and temporally restricted manner. However, for most species that are amphiploid, the widely used Cre-flox cKO system would need targeting loci in both alleles to be loxP flanked, which in practice requires time- and labor-consuming breeding. This burden becomes considerable when one is dealing with multiple genes. The CRISPR/Cas9 genome modulation system has the advantage of being able to target multiple sites simultaneously. Here we propose a strategy that could achieve conditional KO of multiple genes in mouse with Cre recombinase dependent Cas9 expression. By transgenic construction of loxP-stop-loxP (LSL) controlled Cas9 (LSL-Cas9) together with sgRNAs targeting EGFP, we showed that the fluorescence molecule could be eliminated in a Cre-dependent manner. We further verified the efficacy of this novel strategy to target multiple sites by deleting c-Maf and MafB simultaneously in macrophages specifically. Compared to the traditional Cre-flox cKO strategy, this sgRNAs-LSL-Cas9 cKO system is simpler and faster, and would make conditional manipulation of multiple genes feasible.
Synchronous versus asynchronous modeling of gene regulatory networks.

Science.gov (United States)

Garg, Abhishek; Di Cara, Alessandro; Xenarios, Ioannis; Mendoza, Luis; De Micheli, Giovanni

2008-09-01

In silico modeling of gene regulatory networks has gained some momentum recently due to increased interest in analyzing the dynamics of biological systems. This has been further facilitated by the increasing availability of experimental data on gene-gene, protein-protein and gene-protein interactions. The two dynamical properties that are often experimentally testable are perturbations and stable steady states. Although a lot of work has been done on the identification of steady states, not much work has been reported on in silico modeling of cellular differentiation processes. In this manuscript, we provide algorithms based on reduced ordered binary decision diagrams (ROBDDs) for Boolean modeling of gene regulatory networks. Algorithms for synchronous and asynchronous transition models have been proposed and their corresponding computational properties have been analyzed. These algorithms allow users to compute cyclic attractors of large networks that are currently not feasible using existing software. Hereby we provide a framework to analyze the effect of multiple gene perturbation protocols, and their effect on cell differentiation processes. These algorithms were validated on the T-helper model showing the correct steady state identification and Th1-Th2 cellular differentiation process. The software binaries for Windows and Linux platforms can be downloaded from http://si2.epfl.ch/~garg/genysis.html.
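
The difference between synchronous and asynchronous transition models can be seen on a very small example. The sketch below uses brute-force state enumeration on a hypothetical two-gene mutual-repression network; it does not use ROBDDs and is not the authors' algorithm or software, so it only scales to toy networks.

```python
from itertools import product

# Hypothetical two-gene toggle switch: each gene represses the other.
RULES = (
    lambda s: not s[1],  # gene A is on iff gene B is off
    lambda s: not s[0],  # gene B is on iff gene A is off
)


def sync_step(state):
    """Synchronous model: every gene is updated at once."""
    return tuple(int(rule(state)) for rule in RULES)


def async_successors(state):
    """Asynchronous model: exactly one gene is updated per transition."""
    successors = set()
    for i, rule in enumerate(RULES):
        nxt = list(state)
        nxt[i] = int(rule(state))
        successors.add(tuple(nxt))
    return successors


def sync_attractors(n_genes):
    """Follow each start state until it revisits a state; collect the resulting cycles."""
    attractors = set()
    for start in product((0, 1), repeat=n_genes):
        seen, state = [], start
        while state not in seen:
            seen.append(state)
            state = sync_step(state)
        attractors.add(frozenset(seen[seen.index(state):]))
    return attractors


def steady_states(n_genes):
    """Fixed points are the same under both update schemes."""
    return {s for s in product((0, 1), repeat=n_genes) if sync_step(s) == s}


print(sync_attractors(2))         # two fixed points plus the cycle {(0,0), (1,1)}
print(steady_states(2))           # {(0, 1), (1, 0)}
print(async_successors((0, 0)))   # {(1, 0), (0, 1)}: the synchronous cycle is left
```

In this toy network the two-state cycle exists only under synchronous updating; under asynchronous updating both of its states lead into one of the steady states, which is the kind of discrepancy that motivates comparing the two models.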

Equivalent Gene Expression Profiles between Glatopa™ and Copaxone®.

Directory of Open Access Journals (Sweden)

Josephine S D'Alessandro

Full Text Available Glatopa™ is a generic glatiramer acetate recently approved for the treatment of patients with relapsing forms of multiple sclerosis. Gene expression profiling was performed as a means to evaluate equivalence of Glatopa and Copaxone®. Microarray analysis containing 39,429 unique probes across the entire genome was performed in murine glatiramer acetate-responsive Th2-polarized T cells, a test system highly relevant to the biology of glatiramer acetate. A closely related but nonequivalent glatiramoid molecule was used as a control to establish assay sensitivity. Multiple probe-level (Student's t-test) and sample-level (principal component analysis, multidimensional scaling, and hierarchical clustering) statistical analyses were utilized to look for differences in gene expression induced by the test articles. The analyses were conducted across all genes measured, as well as across a subset of genes that were shown to be modulated by Copaxone. The following observations were made across multiple statistical analyses: the expression of numerous genes was significantly changed by treatment with Copaxone when compared against media-only control; gene expression profiles induced by Copaxone and Glatopa were not significantly different; and gene expression profiles induced by Copaxone and the nonequivalent glatiramoid were significantly different, underscoring the sensitivity of the test system and the multiple analysis methods. Comparative analysis was also performed on sets of transcripts relevant to T-cell biology and antigen presentation, among others that are known to be modulated by glatiramer acetate. No statistically significant differences were observed between Copaxone and Glatopa in the expression levels (magnitude and direction) of these glatiramer acetate-regulated genes. In conclusion, multiple methods consistently supported equivalent gene expression profiles between Copaxone and Glatopa.
Association of circadian rhythm genes ARNTL/BMAL1 and CLOCK with multiple sclerosis.

Directory of Open Access Journals (Sweden)

Polona Lavtar

Full Text Available Prevalence of multiple sclerosis varies with geographic latitude. We hypothesized that this fact might be partially associated with the influence of latitude on circadian rhythm and consequently that genetic variability of key circadian rhythm regulators, ARNTL and CLOCK genes, might contribute to the risk for multiple sclerosis. Our aim was to analyse selected polymorphisms of ARNTL and CLOCK, and their association with multiple sclerosis. A total of 900 Caucasian patients and 1024 healthy controls were compared for genetic signature at 8 SNPs, 4 for each gene. We found a statistically significant difference in genotype (ARNTL rs3789327, P = 7.5 × 10⁻⁵; CLOCK rs6811520, P = 0.02) distributions in patients and controls. The ARNTL rs3789327 CC genotype was associated with higher risk for multiple sclerosis at an OR of 1.67 (95% CI 1.35-2.07, P = 0.0001) and the CLOCK rs6811520 genotype CC at an OR of 1.40 (95% CI 1.13-1.73, P = 0.002). The results of this study suggest that genetic variability in the ARNTL and CLOCK genes might be associated with risk for multiple sclerosis.
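
To make the reported statistics easier to read, the sketch below shows how an odds ratio and its 95% confidence interval are commonly computed from a 2×2 genotype table using the log (Woolf) method. The counts are hypothetical and are not the study's data, and the abstract does not state which interval method the authors used.

```python
import math


def odds_ratio_with_ci(case_with, case_without, control_with, control_without, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval for a 2x2 table.

    case_with / case_without: cases carrying / not carrying the genotype.
    control_with / control_without: the same for controls.
    """
    odds_ratio = (case_with * control_without) / (case_without * control_with)
    se_log_or = math.sqrt(
        1 / case_with + 1 / case_without + 1 / control_with + 1 / control_without
    )
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se_log_or), math.exp(log_or + z * se_log_or)


# Hypothetical counts, chosen only to illustrate the arithmetic.
or_value, ci_low, ci_high = odds_ratio_with_ci(30, 70, 20, 80)
print(f"OR = {or_value:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}")
```

An interval that excludes 1, as in both associations reported above, is what supports the claim of a statistically significant difference between patients and controls.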
Whole exome sequencing reveals concomitant mutations of multiple FA genes in individual Fanconi anemia patients.

Science.gov (United States)

Chang, Lixian; Yuan, Weiping; Zeng, Huimin; Zhou, Quanquan; Wei, Wei; Zhou, Jianfeng; Li, Miaomiao; Wang, Xiaomin; Xu, Mingjiang; Yang, Fengchun; Yang, Yungui; Cheng, Tao; Zhu, Xiaofan

2014-05-15

Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients' clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology.
A postprocessing method in the HMC framework for predicting gene function based on biological instrumental data

Science.gov (United States)

Feng, Shou; Fu, Ping; Zheng, Wenbin

2018-03-01

Predicting gene function based on biological instrumental data is a complicated and challenging hierarchical multi-label classification (HMC) problem. When using local approach methods to solve this problem, a preliminary results processing method is usually needed. This paper proposed a novel preliminary results processing method called the nodes interaction method. The nodes interaction method revises the preliminary results and guarantees that the predictions are consistent with the hierarchy constraint. This method exploits the label dependency and considers the hierarchical interaction between nodes when making decisions based on the Bayesian network in its first phase. In the second phase, this method further adjusts the results according to the hierarchy constraint. Implementing the nodes interaction method in the HMC framework also enhances the HMC performance for solving the gene function prediction problem based on the Gene Ontology (GO), the hierarchy of which is a directed acyclic graph that is more difficult to tackle. The experimental results validate the promising performance of the proposed method compared to state-of-the-art methods on eight benchmark yeast data sets annotated by the GO.
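The hierarchy constraint mentioned above can be illustrated with a minimal sketch. This is not the nodes interaction method itself; it is a simpler, generic correction that caps each term's score by the scores of all of its parents in a directed acyclic graph. The function name `enforce_hierarchy`, the term names, and the scores are hypothetical.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

def enforce_hierarchy(scores, parents):
    """Cap each term's score by the (already capped) scores of all its parents.

    scores:  dict term -> preliminary score in [0, 1]
    parents: dict term -> list of parent terms (a DAG; a term may have several)
    Assumes every parent also appears as a key in `scores`.
    """
    # TopologicalSorter expects node -> predecessors, so parents come out first.
    graph = {term: parents.get(term, ()) for term in scores}
    fixed = {}
    for term in TopologicalSorter(graph).static_order():
        cap = min((fixed[p] for p in parents.get(term, ())), default=1.0)
        fixed[term] = min(scores[term], cap)
    return fixed

# Hypothetical example: term C has two parents, as can happen in the GO graph.
scores = {"root": 0.9, "A": 0.6, "B": 0.8, "C": 0.7}
parents = {"A": ["root"], "B": ["root"], "C": ["A", "B"]}
consistent = enforce_hierarchy(scores, parents)
```

After the correction no term scores higher than any of its ancestors, so thresholding the scores at any cut-off yields a label set that respects the hierarchy.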
Genetic evaluation with major genes and polygenic inheritance when some animals are not genotyped using gene content multiple-trait BLUP.

Science.gov (United States)

Legarra, Andrés; Vitezica, Zulma G

2015-11-17

In pedigreed populations with a major gene segregating for a quantitative trait, it is not clear how to use pedigree, genotype and phenotype information when some individuals are not genotyped. We propose to consider gene content at the major gene as a second trait correlated to the quantitative trait, in a gene content multiple-trait best linear unbiased prediction (GCMTBLUP) method. The genetic covariance between the trait and gene content at the major gene is a function of the substitution effect of the gene. This genetic covariance can be written in a multiple-trait form that accommodates any pattern of missing values for either genotype or phenotype data. Effects of major gene alleles and the genetic covariance between genotype at the major gene and the phenotype can be estimated using standard EM-REML or Gibbs sampling. Prediction of breeding values with genotypes at the major gene can use multiple-trait BLUP software. Major genes with more than two alleles can be considered by including negative covariances between gene contents at each different allele. We simulated two scenarios: a selected and an unselected trait with heritabilities of 0.05 and 0.5, respectively. In both cases, the major gene explained half the genetic variation. Competing methods used imputed gene contents derived by the method of Gengler et al. or by iterative peeling. Imputed gene contents, in contrast to GCMTBLUP, do not consider information on the quantitative trait for genotype prediction. GCMTBLUP gave unbiased estimates of the gene effect, in contrast to the other methods, with less bias and better or equal accuracy of prediction. GCMTBLUP improved estimation of genotypes in non-genotyped individuals, in particular if these individuals had their own phenotype records and the trait had a high heritability. Ignoring the major gene in genetic evaluation led to serious biases and decreased prediction accuracy. GCMTBLUP is the best linear predictor of additive genetic merit including
Novel method to load multiple genes onto a mammalian artificial chromosome.

Science.gov (United States)

Tóth, Anna; Fodor, Katalin; Praznovszky, Tünde; Tubak, Vilmos; Udvardy, Andor; Hadlaczky, Gyula; Katona, Robert L

2014-01-01

Mammalian artificial chromosomes are natural chromosome-based vectors that may carry a vast amount of genetic material in terms of both size and number. They are reasonably stable and segregate well in both mitosis and meiosis. A platform artificial chromosome expression system (ACEs) was earlier described with multiple loading sites for a modified lambda-integrase enzyme. It has been shown that this ACEs is suitable for high-level industrial protein production and the treatment of a mouse model for a devastating human disorder, Krabbe's disease. ACEs-treated mutant mice carrying a therapeutic gene lived more than four times longer than untreated counterparts. This novel gene therapy method is called combined mammalian artificial chromosome-stem cell therapy. At present, this method suffers from the limitation that a new selection marker gene should be present for each therapeutic gene loaded onto the ACEs. Complex diseases require the cooperative action of several genes for treatment, but only a limited number of selection marker genes are available and there is also a risk of serious side-effects caused by the unwanted expression of these marker genes in mammalian cells, organs and organisms. We describe here a novel method to load multiple genes onto the ACEs by using only two selectable marker genes. These markers may be removed from the ACEs before therapeutic application. This novel technology could revolutionize gene therapeutic applications targeting the treatment of complex disorders and cancers. It could also speed up cell therapy by allowing researchers to engineer a chromosome with a predetermined set of genetic factors to differentiate adult stem cells, embryonic stem cells and induced pluripotent stem (iPS) cells into cell types of therapeutic value. It is also a suitable tool for the investigation of complex biochemical pathways in basic science by producing an ACEs with several genes from a signal transduction pathway of interest.
A permutation-based multiple testing method for time-course microarray experiments

Directory of Open Access Journals (Sweden)

George Stephen L

2009-10-01

Background: Time-course microarray experiments are widely used to study the temporal profiles of gene expression. Storey et al. (2005) developed a method for analyzing time-course microarray studies that can be applied to discovering genes whose expression trajectories change over time within a single biological group, or those that follow different time trajectories among multiple groups. They estimated the expression trajectories of each gene using natural cubic splines under the null (no time-course) and alternative (time-course) hypotheses, and used a goodness-of-fit test statistic to quantify the discrepancy. The null distribution of the statistic was approximated through a bootstrap method. Gene expression levels in microarray data are often correlated in complicated ways. Accurate type I error control adjusting for multiple testing requires the joint null distribution of test statistics for a large number of genes. For this purpose, permutation methods have been widely used because of computational ease and their intuitive interpretation. Results: In this paper, we propose a permutation-based multiple testing procedure based on the test statistic used by Storey et al. (2005). We also propose an efficient computation algorithm. Extensive simulations are conducted to investigate the performance of the permutation-based multiple testing procedure. The application of the proposed method is illustrated using the Caenorhabditis elegans dauer developmental data. Conclusion: Our method is computationally efficient and applicable for identifying genes whose expression levels are time-dependent in a single biological group and for identifying the genes for which the time-profile depends on the group in a multi-group setting.
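The idea of using permutations to obtain a joint null distribution can be sketched as follows. This is only an illustration under simplifying assumptions: it uses an absolute difference in group means for a two-group design instead of the spline-based goodness-of-fit statistic, and it applies a single-step max-statistic adjustment, which may differ from the procedure proposed in the paper. The function name and the toy data are hypothetical.

```python
import random

def maxt_adjusted_pvalues(data, labels, n_perm=1000, seed=0):
    """Single-step max-statistic permutation adjustment (illustrative sketch).

    data:   list of genes, each a list of expression values, one per sample
    labels: list of 0/1 group labels, one per sample
    """
    rng = random.Random(seed)

    def stats(lab):
        # Absolute difference in group means for every gene.
        out = []
        for row in data:
            g1 = [x for x, l in zip(row, lab) if l == 1]
            g0 = [x for x, l in zip(row, lab) if l == 0]
            out.append(abs(sum(g1) / len(g1) - sum(g0) / len(g0)))
        return out

    observed = stats(labels)
    exceed = [0] * len(data)
    perm = list(labels)
    for _ in range(n_perm):
        # One shuffle of the sample labels is applied to all genes at once,
        # so the correlation between genes is preserved under the null.
        rng.shuffle(perm)
        max_stat = max(stats(perm))
        for i, obs in enumerate(observed):
            if max_stat >= obs:
                exceed[i] += 1
    # The +1 terms count the observed labelling as one permutation.
    return [(c + 1) / (n_perm + 1) for c in exceed]

# Toy data: gene 0 differs strongly between groups, gene 1 does not.
toy_data = [
    [0.0, 0.0, 0.0, 0.0, 10.0, 10.0, 10.0, 10.0],
    [1.0, 2.0, 1.0, 2.0, 1.0, 2.0, 1.0, 2.0],
]
toy_labels = [0, 0, 0, 0, 1, 1, 1, 1]
adjusted = maxt_adjusted_pvalues(toy_data, toy_labels)
```

Comparing each observed statistic with the maximum over all genes in each permutation is one common way to account for the number of tests without assuming that genes are independent.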
Methodological issues in detecting gene-gene interactions in breast cancer susceptibility: a population-based study in Ontario

Directory of Open Access Journals (Sweden)

Onay Venus

2007-08-01

Background: There is growing evidence that gene-gene interactions are ubiquitous in determining the susceptibility to common human diseases. The investigation of such gene-gene interactions presents new statistical challenges for studies with relatively small sample sizes as the number of potential interactions in the genome can be large. Breast cancer provides a useful paradigm to study genetically complex diseases because commonly occurring single nucleotide polymorphisms (SNPs) may additively or synergistically disturb the system-wide communication of the cellular processes leading to cancer development. Methods: In this study, we systematically studied SNP-SNP interactions among 19 SNPs from 18 key genes involved in major cancer pathways in a sample of 398 breast cancer cases and 372 controls from Ontario. We discuss the methodological issues associated with the detection of SNP-SNP interactions in this dataset by applying and comparing three commonly used methods: the logistic regression model, classification and regression trees (CART), and the multifactor dimensionality reduction (MDR) method. Results: Our analyses show evidence for several simple (two-way) and complex (multi-way) SNP-SNP interactions associated with breast cancer. For example, all three methods identified XPD-[Lys751Gln]*IL10-[G(-1082)A] as the most significant two-way interaction. CART and MDR identified the same critical SNPs participating in complex interactions. Our results suggest that the use of multiple statistical approaches (or an integrated approach) rather than a single methodology could be the best strategy to elucidate complex gene interactions that have generally very different patterns. Conclusion: The strategy used here has the potential to identify complex biological relationships among breast cancer genes and processes. This will lead to the discovery of novel biological information, which will improve breast cancer risk management.
On the Interplay between the Evolvability and Network Robustness in an Evolutionary Biological Network: A Systems Biology Approach

Science.gov (United States)

Chen, Bor-Sen; Lin, Ying-Po

2011-01-01

In the evolutionary process, the random transmission and mutation of genes provide biological diversities for natural selection. In order to preserve functional phenotypes between generations, gene networks need to evolve robustly under the influence of random perturbations. Therefore, the robustness of the phenotype, in the evolutionary process, exerts a selection force on gene networks to keep network functions. However, gene networks need to adjust, by variations in genetic content, to generate phenotypes for new challenges in the network’s evolution, i.e., the evolvability. Hence, there should be some interplay between the evolvability and network robustness in evolutionary gene networks. In this study, the interplay between the evolvability and network robustness of a gene network and a biochemical network is discussed from a nonlinear stochastic system point of view. It was found that if the genetic robustness plus environmental robustness is less than the network robustness, the phenotype of the biological network is robust in evolution. The tradeoff between the genetic robustness and environmental robustness in evolution is discussed from the stochastic stability robustness and sensitivity of the nonlinear stochastic biological network, which may be relevant to the statistical tradeoff between bias and variance, the so-called bias/variance dilemma. Further, the tradeoff could be considered as an antagonistic pleiotropic action of a gene network and discussed from the systems biology perspective. PMID:22084563
Operon Gene Order Is Optimized for Ordered Protein Complex Assembly

Science.gov (United States)

Wells, Jonathan N.; Bergendahl, L. Therese; Marsh, Joseph A.

2016-01-01

Summary The assembly of heteromeric protein complexes is an inherently stochastic process in which multiple genes are expressed separately into proteins, which must then somehow find each other within the cell. Here, we considered one of the ways by which prokaryotic organisms have attempted to maximize the efficiency of protein complex assembly: the organization of subunit-encoding genes into operons. Using structure-based assembly predictions, we show that operon gene order has been optimized to match the order in which protein subunits assemble. Exceptions to this are almost entirely highly expressed proteins for which assembly is less stochastic and for which precisely ordered translation offers less benefit. Overall, these results show that ordered protein complex assembly pathways are of significant biological importance and represent a major evolutionary constraint on operon gene organization. PMID:26804901
The integration of weighted gene association networks based on information entropy.

Science.gov (United States)

Yang, Fan; Wu, Duzhi; Lin, Limei; Yang, Jian; Yang, Tinghong; Zhao, Jing

2017-01-01

Constructing genome-scale weighted gene association networks (WGAN) from multiple data sources is one of the research hot spots in systems biology. In this paper, we employ information entropy to describe the degree of uncertainty of gene-gene links and propose a strategy for data integration of weighted networks. We use this method to integrate four existing human weighted gene association networks and construct a much larger WGAN, which includes richer biological information while still keeping high functional relevance between linked gene pairs. The new WGAN shows satisfactory performance in disease gene prediction, which suggests the reliability of our integration strategy. Compared with existing integration methods, our method takes advantage of the inherent characteristics of the component networks and pays less attention to the biological background of the data. It can make full use of existing biological networks with low computational effort.
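The abstract does not give the integration formula, so the following is only a hedged sketch of how entropy could express link uncertainty. It assumes each link weight lies in [0, 1] and can be read as a probability, scores the uncertainty of a link with binary Shannon entropy, and lets more certain sources contribute more to the combined weight. The functions `binary_entropy` and `integrate_link` are hypothetical and are not taken from the paper.

```python
import math

def binary_entropy(w):
    """Shannon entropy (in bits) of a link weight read as a probability."""
    if w <= 0.0 or w >= 1.0:
        return 0.0  # a weight of exactly 0 or 1 carries no uncertainty
    return -(w * math.log2(w) + (1 - w) * math.log2(1 - w))

def integrate_link(weights):
    """Combine weights for one gene pair reported by several networks.

    Each source is weighted by its certainty, 1 - H(w), so a weight near 0.5
    (maximal uncertainty) influences the result least.
    """
    certainties = [1.0 - binary_entropy(w) for w in weights]
    total = sum(certainties)
    if total == 0.0:
        # Every source is maximally uncertain: fall back to the plain mean.
        return sum(weights) / len(weights)
    return sum(c * w for c, w in zip(certainties, weights)) / total

# One network is undecided about the link (0.5), another is fairly confident (0.9).
combined = integrate_link([0.5, 0.9])
```

In this toy case the undecided source has zero certainty, so the combined weight follows the confident source.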
IGEMS: The Consortium on Interplay of Genes and Environment Across Multiple Studies

DEFF Research Database (Denmark)

Pedersen, Nancy L; Christensen, Kaare; Dahl, Anna K

2013-01-01

The Interplay of Genes and Environment across Multiple Studies (IGEMS) group is a consortium of eight longitudinal twin studies established to explore the nature of social context effects and gene-environment interplay in late-life functioning. The resulting analysis of the combined data from ove...
Action of multiple intra-QTL genes concerted around a co-localized transcription factor underpins a large effect QTL

Science.gov (United States)

Dixit, Shalabh; Kumar Biswal, Akshaya; Min, Aye; Henry, Amelia; Oane, Rowena H.; Raorane, Manish L.; Longkumer, Toshisangba; Pabuayon, Isaiah M.; Mutte, Sumanth K.; Vardarajan, Adithi R.; Miro, Berta; Govindan, Ganesan; Albano-Enriquez, Blesilda; Pueffeld, Mandy; Sreenivasulu, Nese; Slamet-Loedin, Inez; Sundarvelpandian, Kalaipandian; Tsai, Yuan-Ching; Raghuvanshi, Saurabh; Hsing, Yue-Ie C.; Kumar, Arvind; Kohli, Ajay

2015-01-01

Sub-QTLs and multiple intra-QTL genes are hypothesized to underpin large-effect QTLs. Known QTLs over gene families, biosynthetic pathways or certain traits represent functional gene-clusters of genes of the same gene ontology (GO). Gene-clusters containing genes of different GO have not been elaborated, except in silico as coexpressed genes within QTLs. Here we demonstrate the requirement of multiple intra-QTL genes for the full impact of QTL qDTY12.1 on rice yield under drought. Multiple lines of evidence are presented for the need of the transcription factor ‘no apical meristem’ (OsNAM12.1) and its co-localized target genes of separate GO categories for qDTY12.1 function, raising a regulon-like model of genetic architecture. The molecular underpinnings of qDTY12.1 support its effectiveness in further improving a drought tolerant genotype and for its validity in multiple genotypes/ecosystems/environments. Resolving the combinatorial value of OsNAM12.1 with individual intra-QTL genes notwithstanding, identification and analyses of qDTY12.1 has fast-tracked rice improvement towards food security. PMID:26507552
Experimental and Modeling Approaches for Understanding the Effect of Gene Expression Noise in Biological Development

Directory of Open Access Journals (Sweden)

David M. Holloway

2018-04-01

Biological development involves numerous chemical and physical processes which must act in concert to reliably produce a cell, a tissue, or a body. To be successful, the developing organism must be robust to variability at many levels, such as the environment (e.g., temperature, moisture), upstream information (such as long-range positional information gradients), or intrinsic noise due to the stochastic nature of low concentration chemical kinetics. The latter is especially relevant to the regulation of gene expression in cell differentiation. The temporal stochasticity of gene expression has been studied in single-celled organisms for nearly two decades, but only recently have techniques become available to gather temporally-resolved data across spatially-distributed gene expression patterns in developing multicellular organisms. These demonstrate temporal noisy “bursting” in the number of gene transcripts per cell, raising the question of how the transcript number defining a particular cell type is produced, such that one cell type can reliably be distinguished from a neighboring cell of different type along a tissue boundary. Stochastic spatio-temporal modeling of tissue-wide expression patterns can identify signatures for specific types of gene regulation, which can be used to extract regulatory mechanism information from experimental time series. This Perspective focuses on using this type of approach to study gene expression noise during the anterior-posterior segmentation of the fruit fly embryo. Advances in experimental and theoretical techniques will lead to an increasing quantification of expression noise that can be used to understand how regulatory mechanisms contribute to embryonic robustness across a range of developmental processes.
Network analysis of differential expression for the identification of disease-causing genes.

Directory of Open Access Journals (Sweden)

Daniela Nitsch

Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed up for biological validation. Recently, computational methods to identify (prioritize) the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved). We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease-causing genes.
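The soft-neighborhood scoring described above can be sketched on a toy network. The sketch assumes the standard form of the Laplacian exponential diffusion kernel, K = exp(-beta * L) with L = D - A, and approximates the matrix exponential with a truncated Taylor series, which is adequate only for small graphs and moderate beta. The exact aggregation and randomization steps of the paper are not reproduced. The function names, the value of beta, and the toy data are assumptions made for illustration.

```python
def mat_mul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def diffusion_kernel(adj, beta=0.5, terms=40):
    """Approximate K = exp(-beta * L), where L = D - A is the graph Laplacian."""
    n = len(adj)
    lap = [[(sum(adj[i]) if i == j else 0.0) - adj[i][j] for j in range(n)]
           for i in range(n)]
    m = [[-beta * lap[i][j] for j in range(n)] for i in range(n)]
    kernel = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in kernel]  # current Taylor term, starts at identity
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, m)]  # m^k / k!
        for i in range(n):
            for j in range(n):
                kernel[i][j] += term[i][j]
    return kernel

def score_candidates(kernel, diff_expr):
    """Sum each gene's network-wide differential expression, kernel-weighted."""
    return [sum(k_ij * d for k_ij, d in zip(row, diff_expr)) for row in kernel]

# Toy path network 0 - 1 - 2 - 3; only gene 3 is differentially expressed.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
kernel = diffusion_kernel(adjacency, beta=0.5)
scores = score_candidates(kernel, [0.0, 0.0, 0.0, 1.0])
```

Because the kernel sums contributions over all paths rather than only the shortest one, genes closer to the differentially expressed gene receive higher scores, while distant genes still receive a small non-zero weight.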
Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

Science.gov (United States)

Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

2011-11-01

Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.
A searchable cross-platform gene expression database reveals connections between drug treatments and disease

Directory of Open Access Journals (Sweden)

Williams Gareth

2012-01-01

Background: Transcriptional data covering multiple platforms and species is collected and processed into a searchable platform-independent expression database (SPIED). SPIED consists of over 100,000 expression fold profiles defined independently of control/treatment assignment and mapped to non-redundant gene lists. The database is thus searchable with query profiles defined over genes alone. The motivation behind SPIED is that transcriptional profiles can be quantitatively compared and ranked and thus serve as effective surrogates for comparing the underlying biological states across multiple experiments. Results: Drug perturbation, cancer and neurodegenerative disease derived transcriptional profiles are shown to be effective descriptors of the underlying biology as they return related drugs and pathologies from SPIED. In the case of Alzheimer's disease there is high transcriptional overlap with other neurodegenerative conditions and rodent models of neurodegeneration and nerve injury. Combining the query signature with correlating profiles allows for the definition of a tight neurodegeneration signature that successfully highlights many neuroprotective drugs in the Broad connectivity map. Conclusions: Quantitative querying of expression data from across the totality of deposited experiments is an effective way of discovering connections between different biological systems and in particular that between drug action and biological disease state. Examples in cancer and neurodegenerative conditions validate the utility of SPIED.
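Querying a profile database with a gene-level signature can be sketched as a ranking by correlation over shared genes. The abstract does not specify SPIED's scoring scheme, so Pearson correlation, the `min_shared` cut-off, and all names and values below are assumptions made for illustration.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length lists; 0.0 if either is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def rank_profiles(query, database, min_shared=3):
    """Rank stored fold-change profiles by correlation with a query profile.

    query:    dict gene -> fold change
    database: dict profile name -> dict gene -> fold change
    Profiles sharing fewer than `min_shared` genes with the query are skipped.
    """
    hits = []
    for name, profile in database.items():
        shared = sorted(set(query) & set(profile))
        if len(shared) < min_shared:
            continue
        r = pearson([query[g] for g in shared], [profile[g] for g in shared])
        hits.append((name, r))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)

# Hypothetical query signature and stored profiles.
query = {"G1": 2.0, "G2": -1.0, "G3": 0.5, "G4": -2.0}
database = {
    "similar": {"G1": 1.8, "G2": -0.9, "G3": 0.4, "G4": -2.2},
    "opposite": {"G1": -2.0, "G2": 1.0, "G3": -0.5, "G4": 2.0},
    "sparse": {"G1": 1.0},
}
ranked = rank_profiles(query, database)
```

Strongly negative correlations at the bottom of the ranking can be as informative as the top hits, for example when looking for a treatment profile that opposes a disease profile.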
Exploring lipids with nonlinear optical microscopy in multiple biological systems

Science.gov (United States)

Alfonso-Garcia, Alba

Lipids are crucial biomolecules for the well being of humans. Altered lipid metabolism may give rise to a variety of diseases that affect organs from the cardiovascular to the central nervous system. A deeper understanding of lipid metabolic processes would spur medical research towards developing precise diagnostic tools, treatment methods, and preventive strategies for reducing the impact of lipid diseases. Lipid visualization remains a complex task because of the perturbative effect exerted by traditional biochemical assays and most fluorescence markers. Coherent Raman scattering (CRS) microscopy enables interrogation of biological samples with minimum disturbance, and is particularly well suited for label-free visualization of lipids, providing chemical specificity without compromising on spatial resolution. Hyperspectral imaging yields large datasets that benefit from tailored multivariate analysis. In this thesis, CRS microscopy was combined with Raman spectroscopy and other label-free nonlinear optical techniques to analyze lipid metabolism in multiple biological systems. We used nonlinear Raman techniques to characterize Meibum secretions in the progression of dry eye disease, where the lipid and protein contributions change in ratio and phase segregation. We employed similar tools to examine lipid droplets in mice livers aboard a spaceflight mission, which lose their retinol content contributing to the onset of nonalcoholic fatty-liver disease. We also focused on atherosclerosis, a disease that revolves around lipid-rich plaques in arterial walls. We examined the lipid content of macrophages, whose variable phenotype gives rise to contrasting healing and inflammatory activities. We also proposed new label-free markers, based on lifetime imaging, for macrophage phenotype, and to detect products of lipid oxidation. Cholesterol was also detected in hepatitis C virus infected cells, and in specific strains of age-related macular degeneration diseased cells by
Ten good reasons to consider biological processes in prevention and intervention research

Science.gov (United States)

BEAUCHAINE, THEODORE P.; NEUHAUS, EMILY; BRENNER, SHARON L.; GATZKE-KOPP, LISA

2009-01-01

Most contemporary accounts of psychopathology acknowledge the importance of both biological and environmental influences on behavior. In developmental psychopathology, multiple etiological mechanisms for psychiatric disturbance are well recognized, including those operating at genetic, neurobiological, and environmental levels of analysis. However, neuroscientific principles are rarely considered in current approaches to prevention or intervention. In this article, we explain why a deeper understanding of the genetic and neural substrates of behavior is essential for the next generation of preventive interventions, and we outline 10 specific reasons why considering biological processes can improve treatment efficacy. Among these, we discuss (a) the role of biomarkers and endophenotypes in identifying those most in need of prevention; (b) implications for treatment of genetic and neural mechanisms of homotypic comorbidity, heterotypic comorbidity, and heterotypic continuity; (c) ways in which biological vulnerabilities moderate the effects of environmental experience; (d) situations in which Biology×Environment interactions account for more variance in key outcomes than main effects; and (e) sensitivity of neural systems, via epigenesis, programming, and neural plasticity, to environmental moderation across the life span. For each of the 10 reasons outlined we present an example from current literature and discuss critical implications for prevention. PMID:18606030
Improvements in algal lipid production: a systems biology and gene editing approach.

Science.gov (United States)

Banerjee, Avik; Banerjee, Chiranjib; Negi, Sangeeta; Chang, Jo-Shu; Shukla, Pratyoosh

2018-05-01

In the wake of rising energy demands, microalgae have emerged as potential sources of sustainable and renewable carbon-neutral fuels, such as bio-hydrogen and bio-oil. For rational metabolic engineering, the elucidation of metabolic pathways in fine detail and their manipulation according to requirements is the key to exploiting the use of microalgae. The emergence of site-specific nucleases has revolutionized applied research, leading to biotechnological gains. Genome engineering as well as modulation of the endogenous genome with high precision using CRISPR systems is being gradually employed in microalgal research. Further, to optimize and produce better algal platforms, use of systems biology network analysis and integration of omics data is required. This review discusses two important approaches: systems biology and gene editing strategies used on microalgal systems with a focus on biofuel production and sustainable solutions. It also emphasizes that the integration of such systems would contribute to and complement applied research on microalgae. Recent advances in microalgae are discussed, including systems biology, gene editing approaches in lipid bio-synthesis, and antenna engineering. Lastly, this review attempts to showcase how CRISPR/Cas systems are a better editing tool than existing techniques that can be utilized for gene modulation and engineering during biofuel production.
Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

Directory of Open Access Journals (Sweden)

Merchant Sabeeha S

2011-07-01

Full Text Available Abstract Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

Science.gov (United States)

Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

2004-03-01

By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
Spies and Bloggers: New Synthetic Biology Tools to Understand Microbial Processes in Soils and Sediments

Science.gov (United States)

Masiello, C. A.; Silberg, J. J.; Cheng, H. Y.; Del Valle, I.; Fulk, E. M.; Gao, X.; Bennett, G. N.

2017-12-01

Microbes can be programmed through synthetic biology to report on their behavior, informing researchers when their environment has triggered changes in their gene expression (e.g. in response to shifts in O2 or H2O), or when they have participated in a specific step of an elemental cycle (e.g. denitrification). This use of synthetic biology has the potential to significantly improve our understanding of microbes' roles in elemental and water cycling, because it allows reporting on the environment from the perspective of a microbe, matching the measurement scale exactly to the scale that a microbe experiences. However, synthetic microbes have not yet seen wide use in soil and sediment laboratory experiments because synthetic organisms typically report by fluorescing, making their signals difficult to detect outside the petri dish. We are developing a new suite of microbial programs that report instead by releasing easily detected gases, allowing the real-time, noninvasive monitoring of behaviors in sediments and soils. Microbial biosensors can, in theory, be programmed to detect dynamic processes that contribute to a wide range of geobiological processes, including C cycling (biofilm production, methanogenesis, and synthesis of extracellular enzymes that degrade organic matter), N cycling (expression of enzymes that underlie different steps of the N cycle) and potentially S cycling. We will provide an overview of the potential uses of gas-reporting biosensors in soil and sediment lab experiments, and will report the development of the systematics of these sensors. Successful development of gas biosensors for laboratory use will require addressing issues including: engineering the intensity and selectivity of microbial gas production to maximize the signal-to-noise ratio; normalizing the gas reporter signal to cell population size; managing gas diffusion effects on signal shape; and developing multiple gases that can be used in parallel.
A pipeline to determine RT-QPCR control genes for evolutionary studies: application to primate gene expression across multiple tissues.

Directory of Open Access Journals (Sweden)

Olivier Fedrigo

Full Text Available Because many species-specific phenotypic differences are assumed to be caused by differential regulation of gene expression, many recent investigations have focused on measuring transcript abundance. Despite the availability of high-throughput platforms, quantitative real-time polymerase chain reaction (RT-QPCR) is often the method of choice because of its low cost and wider dynamic range. However, the accuracy of this technique heavily relies on the use of multiple valid control genes for normalization. We created a pipeline for choosing genes potentially useful as RT-QPCR control genes for measuring expression between human and chimpanzee samples across multiple tissues, using published microarrays and a measure of tissue-specificity. We identified 13 genes from the pipeline and from commonly used control genes: ACTB, USP49, ARGHGEF2, GSK3A, TBP, SDHA, EIF2B2, GPDH, YWHAZ, HPTR1, RPL13A, HMBS, and EEF2. We then tested these candidate genes and validated their expression stability across species. We established the rank order of the most preferable set of genes for single and combined tissues. Our results suggest that for at least three tissues (cerebral cortex, liver, and skeletal muscle), EIF2B2, EEF2, HMBS, and SDHA are useful genes for normalizing human and chimpanzee expression using RT-QPCR. Interestingly, other commonly used control genes, including TBP, GAPDH, and especially ACTB, do not perform as well. This pipeline could be easily adapted to other species for which expression data exist, providing taxonomically appropriate control genes for comparisons of gene expression among species.
Molecular profiles to biology and pathways: a systems biology approach.

Science.gov (United States)

Van Laere, Steven; Dirix, Luc; Vermeulen, Peter

2016-06-16

Interpreting molecular profiles in a biological context requires specialized analysis strategies. Initially, lists of relevant genes were screened to identify enriched concepts associated with pathways or specific molecular processes. However, the shortcomings of interpreting gene lists by using predefined sets of genes have resulted in the development of novel methods that rely heavily on network-based concepts. These algorithms have the advantage that they allow a more holistic view of the signaling properties of the condition under study and that they are suitable for integrating different data types like gene expression, gene mutation, and even histological parameters.
The fate of retrotransposed processed genes in Arabidopsis thaliana.

Science.gov (United States)

Abdelkarim, Basma T M; Maranda, Vincent; Drouin, Guy

2017-04-20

Processed genes are functional genes that have arisen as a result of the retrotransposition of mRNA molecules. We found 6 genes that generated processed genes in the common ancestor of five Brassicaceae species (Arabidopsis thaliana, Arabidopsis lyrata, Capsella rubella, Brassica rapa and Thellungiella parvula). These processed genes have therefore been kept for at least 30 million years. Analyses of the Ka/Ks ratio of these genes, and of those having given rise to them, show that they evolve relatively slowly and suggest that the processed genes maintained the same function as that of their parental gene. There is a significant negative correlation between the number of ESTs and transcripts produced and the Ka/Ks ratios of the parental genes but not of the processed genes. This suggests that selection has not yet adapted the selective pressure the processed genes experience to their expression level. However, the A. thaliana processed genes tend to be expressed in the same tissues as that of their parental genes. Furthermore, most have a CAATT-box, a TATA-box and are located about 1 kb from another protein-coding gene. Altogether, our results suggest that the processed genes found in the A. thaliana genome have been kept to produce more of the same product, and in the same tissues, as that encoded by their parental gene. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.
Redundancy control in pathway databases (ReCiPa): an application for improving gene-set enrichment analysis in Omics studies and "Big data" biology.

Science.gov (United States)

Vivar, Juan C; Pemu, Priscilla; McPherson, Ruth; Ghosh, Sujoy

2013-08-01

Abstract Unparalleled technological advances have fueled an explosive growth in the scope and scale of biological data and have propelled life sciences into the realm of "Big Data" that cannot be managed or analyzed by conventional approaches. Big Data in the life sciences are driven primarily via a diverse collection of 'omics'-based technologies, including genomics, proteomics, metabolomics, transcriptomics, metagenomics, and lipidomics. Gene-set enrichment analysis is a powerful approach for interrogating large 'omics' datasets, leading to the identification of biological mechanisms associated with observed outcomes. While several factors influence the results from such analysis, the impact from the contents of pathway databases is often under-appreciated. Pathway databases often contain variously named pathways that overlap with one another to varying degrees. Ignoring such redundancies during pathway analysis can lead to the designation of several pathways as being significant due to high content-similarity, rather than truly independent biological mechanisms. Statistically, such dependencies also result in correlated p values and overdispersion, leading to biased results. We investigated the level of redundancies in multiple pathway databases and observed large discrepancies in the nature and extent of pathway overlap. This prompted us to develop the application, ReCiPa (Redundancy Control in Pathway Databases), to control redundancies in pathway databases based on user-defined thresholds. Analysis of genomic and genetic datasets, using ReCiPa-generated overlap-controlled versions of KEGG and Reactome pathways, led to a reduction in redundancy among the top-scoring gene-sets and allowed for the inclusion of additional gene-sets representing possibly novel biological mechanisms. Using obesity as an example, bioinformatic analysis further demonstrated that gene-sets identified from overlap-controlled pathway databases show stronger evidence of prior association
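The idea of controlling pathway redundancy against a user-defined threshold can be illustrated with a minimal sketch. This is not ReCiPa's actual algorithm: the overlap measure (shared genes divided by the size of the smaller set), the greedy merging strategy, the function names and the toy gene sets are all assumptions made for illustration, and gene sets are assumed to be non-empty.

```python
def overlap(a, b):
    """Fraction of the smaller gene set that is shared with the other set."""
    return len(a & b) / min(len(a), len(b))


def merge_redundant(pathways, threshold):
    """Greedily merge pathways whose overlap reaches the threshold.

    pathways: dict mapping pathway name -> set of gene symbols (non-empty).
    Returns a new dict in which redundant pathways are replaced by their union.
    """
    merged = {name: set(genes) for name, genes in pathways.items()}
    changed = True
    while changed:
        changed = False
        names = sorted(merged)
        for i, first in enumerate(names):
            for second in names[i + 1:]:
                if overlap(merged[first], merged[second]) >= threshold:
                    # Replace the two overlapping pathways by a single merged one.
                    union = merged.pop(first) | merged.pop(second)
                    merged[first + "+" + second] = union
                    changed = True
                    break
            if changed:
                break
    return merged


# Hypothetical toy pathways, for illustration only.
pathways = {
    "glycolysis": {"HK1", "PFKM", "PKM", "GAPDH"},
    "gluconeogenesis": {"PCK1", "PFKM", "PKM", "GAPDH"},
    "urea_cycle": {"ARG1", "OTC", "ASS1"},
}
controlled = merge_redundant(pathways, threshold=0.7)
```

With this toy input the two glucose-related sets share three of four genes and are merged, while the unrelated set is left untouched. A lower threshold merges more aggressively.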
A search engine to identify pathway genes from expression data on multiple organisms

Directory of Open Access Journals (Sweden)

Zambon Alexander C

2007-05-01

Full Text Available Abstract Background The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results We developed a search engine, the Multiple-Species Gene Recommender (MSGR), which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.
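The core notion of searching for genes coregulated with a query set can be sketched as follows. This is a deliberately simplified stand-in, not the MSGR's probabilistic method: it ranks candidate genes by Pearson correlation with the mean profile of the query genes within a single organism. The function names, gene labels and expression values are hypothetical, and profiles are assumed to be non-constant.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant profiles."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def rank_candidates(expression, query):
    """Rank non-query genes by correlation with the query genes' mean profile.

    expression: dict mapping gene -> list of expression values per condition.
    query: set of genes assumed to act in the same pathway.
    """
    n_conditions = len(next(iter(expression.values())))
    centroid = [mean(expression[g][i] for g in query) for i in range(n_conditions)]
    scores = {
        gene: pearson(profile, centroid)
        for gene, profile in expression.items()
        if gene not in query
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: two query genes and two candidates over four conditions.
expression = {
    "q1": [1.0, 2.0, 3.0, 4.0],
    "q2": [2.0, 3.0, 4.0, 5.0],
    "cand_a": [2.0, 4.0, 6.0, 8.0],
    "cand_b": [4.0, 3.0, 2.0, 1.0],
}
ranked = rank_candidates(expression, query={"q1", "q2"})
```

Here `cand_a` rises with the query genes and ranks first, while `cand_b` moves in the opposite direction and ranks last. Combining such rankings across organisms would need an additional merging step that this sketch does not attempt.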
Novel Myopia Genes and Pathways Identified From Syndromic Forms of Myopia

Science.gov (United States)

Loughman, James; Wildsoet, Christine F.; Williams, Cathy; Guggenheim, Jeremy A.

2018-01-01

Purpose To test the hypothesis that genes known to cause clinical syndromes featuring myopia also harbor polymorphisms contributing to nonsyndromic refractive errors. Methods Clinical phenotypes and syndromes that have refractive errors as a recognized feature were identified using the Online Mendelian Inheritance in Man (OMIM) database. One hundred fifty-four unique causative genes were identified, of which 119 were specifically linked with myopia and 114 represented syndromic myopia (i.e., myopia and at least one other clinical feature). Myopia was the only refractive error listed for 98 genes and hyperopia the only refractive error noted for 28 genes, with the remaining 28 genes linked to phenotypes with multiple forms of refractive error. Pathway analysis was carried out to find biological processes overrepresented within these sets of genes. Genetic variants located within 50 kb of the 119 myopia-related genes were evaluated for involvement in refractive error by analysis of summary statistics from genome-wide association studies (GWAS) conducted by the CREAM Consortium and 23andMe, using both single-marker and gene-based tests. Results Pathway analysis identified several biological processes already implicated in refractive error development through prior GWAS analyses and animal studies, including extracellular matrix remodeling, focal adhesion, and axon guidance, supporting the research hypothesis. Novel pathways also implicated in myopia development included mannosylation, glycosylation, lens development, gliogenesis, and Schwann cell differentiation. Hyperopia was found to be linked to a different pattern of biological processes, mostly related to organogenesis. Comparison with GWAS findings further confirmed that syndromic myopia genes were enriched for genetic variants that influence refractive errors in the general population. Gene-based analyses implicated 21 novel candidate myopia genes (ADAMTS18, ADAMTS2, ADAMTSL4, AGK, ALDH18A1, ASXL1, COL4A1
Introductory Biology Textbooks Under-Represent Scientific Process

Directory of Open Access Journals (Sweden)

Dara B. Duncan

2011-08-01

Full Text Available Attrition of undergraduates from Biology majors is a long-standing problem. Introductory courses that fail to engage students or spark their curiosity by emphasizing the open-ended and creative nature of biological investigation and discovery could contribute to student detachment from the field. Our hypothesis was that introductory biology books devote relatively few figures to illustration of the design and interpretation of experiments or field studies, thereby de-emphasizing the scientific process. To investigate this possibility, we examined figures in six Introductory Biology textbooks published in 2008. On average, multistep scientific investigations were presented in fewer than 5% of the hundreds of figures in each book. Devoting such a small percentage of figures to the processes by which discoveries are made discourages an emphasis on scientific thinking. We suggest that by increasing significantly the illustration of scientific investigations, textbooks could support undergraduates’ early interest in biology, stimulate the development of design and analytical skills, and inspire some students to participate in investigations of their own.
Development of the Multiple Gene Knockout System with One-Step PCR in Thermoacidophilic Crenarchaeon Sulfolobus acidocaldarius

Directory of Open Access Journals (Sweden)

Shoji Suzuki

2017-01-01

Full Text Available Multiple gene knockout systems developed in the thermoacidophilic crenarchaeon Sulfolobus acidocaldarius are powerful genetic tools. However, plasmid construction typically requires several steps. Alternatively, PCR tailing for high-throughput gene disruption was also developed in S. acidocaldarius, but repeated gene knockout based on PCR tailing has been limited due to lack of a genetic marker system. In this study, we demonstrated efficient homologous recombination frequency (2.8 × 10^4 ± 6.9 × 10^3 colonies/μg DNA) by optimizing the transformation conditions. This optimized protocol allowed us to develop reliable gene knockout via double crossover using short homologous arms and to establish the multiple gene knockout system with one-step PCR (MONSTER). In the MONSTER, a multiple gene knockout cassette was simply and rapidly constructed by one-step PCR without plasmid construction, and the PCR product can be immediately used for target gene deletion. As an example of the applications of this strategy, we successfully made a DNA photolyase (phr)- and arginine decarboxylase (argD)-deficient strain of S. acidocaldarius. In addition, an agmatine selection system consisting of an agmatine-auxotrophic strain and argD marker was also established. The MONSTER provides an alternative strategy that enables the very simple construction of multiple gene knockout cassettes for genetic studies in S. acidocaldarius.
The ALMT Gene Family Performs Multiple Functions in Plants

Directory of Open Access Journals (Sweden)

Jie Liu

2018-02-01

Full Text Available The aluminium activated malate transporter (ALMT) gene family is named after the first member of the family identified in wheat (Triticum aestivum L.). The product of this gene controls resistance to aluminium (Al) toxicity. ALMT genes encode transmembrane proteins that function as anion channels and perform multiple functions involving the transport of organic anions (e.g., carboxylates) and inorganic anions in cells. They share a PF11744 domain and are classified in the Fusaric acid resistance protein-like superfamily, CL0307. The proteins typically have five to seven transmembrane regions in the N-terminal half and a long hydrophilic C-terminal tail, but predictions of secondary structure vary. Although widely spread in plants, relatively little information is available on the roles performed by other members of this family. In this review, we summarized functions of ALMT gene families, including Al resistance, stomatal function, mineral nutrition, microbe interactions, fruit acidity, light response and seed development.
Biological species in the viral world.

Science.gov (United States)

Bobay, Louis-Marie; Ochman, Howard

2018-06-05

Due to their dependence on cellular organisms for metabolism and replication, viruses are typically named and assigned to species according to their genome structure and the original host that they infect. But because viruses often infect multiple hosts and the numbers of distinct lineages within a host can be vast, their delineation into species is often dictated by arbitrary sequence thresholds, which are highly inconsistent across lineages. Here we apply an approach to determine the boundaries of viral species based on the detection of gene flow within populations, thereby defining viral species according to the biological species concept (BSC). Despite the potential for gene transfer between highly divergent genomes, viruses, like the cellular organisms they infect, assort into reproductively isolated groups and can be organized into biological species. This approach revealed that BSC-defined viral species are often congruent with the taxonomic partitioning based on shared gene contents and host tropism, and that bacteriophages can similarly be classified in biological species. These results open the possibility to use a single, universal definition of species that is applicable across cellular and acellular lifeforms.
Analysis of gene and protein name synonyms in Entrez Gene and UniProtKB resources

KAUST Repository

Arkasosy, Basil

2013-05-11

Ambiguity in texts is a well-known problem: words can carry several meanings and hence can be read and interpreted differently. This is also true in the biological literature; names of biological concepts, such as genes and proteins, might be ambiguous, referring in some cases to more than one gene or one protein, or in others, to both genes and proteins at the same time. Public biological databases give very useful insight into gene and protein information, including their names. In this study, we made a thorough analysis of the nomenclatures of genes and proteins in two data sources and for six different species. We developed an automated process that parses, extracts, processes and stores information available in two major biological databases: Entrez Gene and UniProtKB. We analysed gene and protein synonyms, their types, frequencies, and the ambiguities within a species, between data sources and across species. We found that at least 40% of the cross-species ambiguities are caused by names that are already ambiguous within the species. Our study shows that of the six species we analysed (Homo sapiens, Mus musculus, Arabidopsis thaliana, Oryza sativa, Bacillus subtilis and Pseudomonas fluorescens), rice (Oryza sativa) has the best naming model in the Entrez Gene database, with low ambiguities between data sources and across species.
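The two kinds of ambiguity discussed above (within a species and across species) can be made concrete with a small sketch. It is not the authors' pipeline and does not parse Entrez Gene or UniProtKB files. The record layout, the case-insensitive comparison, the function name and the toy identifiers are all assumptions for illustration.

```python
from collections import defaultdict


def ambiguous_names(records):
    """Find names that are ambiguous within a species or across species.

    records: iterable of (species, gene_id, name) tuples.
    Names are compared case-insensitively (an assumption of this sketch).
    Returns (within_species, cross_species):
      within_species: set of (species, name) pairs used by more than one gene
      cross_species:  set of names that occur in more than one species
    """
    genes_by_name = defaultdict(set)     # (species, name) -> gene ids
    species_by_name = defaultdict(set)   # name -> species
    for species, gene_id, name in records:
        key = name.lower()
        genes_by_name[(species, key)].add(gene_id)
        species_by_name[key].add(species)
    within_species = {k for k, ids in genes_by_name.items() if len(ids) > 1}
    cross_species = {n for n, sp in species_by_name.items() if len(sp) > 1}
    return within_species, cross_species


# Hypothetical toy records, for illustration only.
records = [
    ("human", "G1", "ABC1"),
    ("human", "G2", "abc1"),
    ("mouse", "G3", "Abc1"),
    ("mouse", "G4", "Xyz"),
]
within, across = ambiguous_names(records)
```

In this toy input the name `abc1` is ambiguous within human (two genes) and also across species (human and mouse), which mirrors the abstract's observation that many cross-species ambiguities involve names that are already ambiguous within a species.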
fcGENE: a versatile tool for processing and transforming SNP datasets.

Directory of Open Access Journals (Sweden)

Nab Raj Roshyara

Full Text Available Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often relies on specific and incompatible input and output data formats. Therefore, extensive data management including multiple format conversions is necessary during analyses. In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publicly available software fcGENE using the object-oriented programming language C++. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses. fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data management tasks like merging, splitting, and extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality-based filtering processes at the SNP and sample level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications. We have developed a user-friendly open-source software, fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedback indicate that the software is highly recognised and extensively applied by the scientific community.
Fast gene ontology based clustering for microarray experiments.

Science.gov (United States)

Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

2008-11-21

Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
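The statistical testing of Gene Ontology annotations mentioned above is commonly done with a one-sided hypergeometric test, sketched below as background. This is not the semantic-similarity clustering method of the package described in the abstract. The function name and the example counts are hypothetical.

```python
from math import comb


def enrichment_p_value(population, annotated, sample, hits):
    """One-sided hypergeometric p-value for GO term over-representation.

    population: number of genes in the background set
    annotated:  background genes annotated with the GO term
    sample:     number of genes in the list of interest
    hits:       genes in the list that carry the GO term
    Returns P(X >= hits) when `sample` genes are drawn without replacement.
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts: 300 disease-associated genes out of 20000,
# 10 of which carry a term that annotates 150 genes overall.
p = enrichment_p_value(population=20000, annotated=150, sample=300, hits=10)
```

Because many GO terms are tested at once, such p-values would normally be corrected for multiple testing, and related terms tend to be significant together, which is the interpretation problem that term clustering aims to reduce.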
MIDAS: A Modular DNA Assembly System for Synthetic Biology.

Science.gov (United States)

van Dolleweerd, Craig J; Kessans, Sarah A; Van de Bittner, Kyle C; Bustamante, Leyla Y; Bundela, Rudranuj; Scott, Barry; Nicholson, Matthew J; Parker, Emily J

2018-04-20

A modular and hierarchical DNA assembly platform for synthetic biology based on Golden Gate (Type IIS restriction enzyme) cloning is described. This enabling technology, termed MIDAS (for Modular Idempotent DNA Assembly System), can be used to precisely assemble multiple DNA fragments in a single reaction using a standardized assembly design. It can be used to build genes from libraries of sequence-verified, reusable parts and to assemble multiple genes in a single vector, with full user control over gene order and orientation, as well as control of the direction of growth (polarity) of the multigene assembly, a feature that allows genes to be nested between other genes or genetic elements. We describe the detailed design and use of MIDAS, exemplified by the reconstruction, in the filamentous fungus Penicillium paxilli, of the metabolic pathway for production of paspaline and paxilline, key intermediates in the biosynthesis of a range of indole diterpenes, a class of secondary metabolites produced by several species of filamentous fungi. MIDAS was used to efficiently assemble a 25.2 kb plasmid from 21 different modules (seven genes, each composed of three basic parts). By using a parts library-based system for construction of complex assemblies, and a unique set of vectors, MIDAS can provide a flexible route to assembling tailored combinations of genes and other genetic elements, thereby supporting synthetic biology applications in a wide range of expression hosts.
Building gene co-expression networks using transcriptomics data for systems biology investigations

DEFF Research Database (Denmark)

Kadarmideen, Haja; Watson-Haigh, Nathan S.

2012-01-01

Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four......) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT...... (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended....

Synthetic Biology Platform for Sensing and Integrating Endogenous Transcriptional Inputs in Mammalian Cells.

Science.gov (United States)

Angelici, Bartolomeo; Mailand, Erik; Haefliger, Benjamin; Benenson, Yaakov

2016-08-30

One of the goals of synthetic biology is to develop programmable artificial gene networks that can transduce multiple endogenous molecular cues to precisely control cell behavior. Realizing this vision requires interfacing natural molecular inputs with synthetic components that generate functional molecular outputs. Interfacing synthetic circuits with endogenous mammalian transcription factors has been particularly difficult. Here, we describe a systematic approach that enables integration and transduction of multiple mammalian transcription factor inputs by a synthetic network. The approach is facilitated by a proportional amplifier sensor based on synergistic positive autoregulation. The circuits efficiently transduce endogenous transcription factor levels into RNAi, transcriptional transactivation, and site-specific recombination. They also enable AND logic between pairs of arbitrary transcription factors. The results establish a framework for developing synthetic gene networks that interface with cellular processes through transcriptional regulators. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Aberrant microRNA expression in multiple myeloma

DEFF Research Database (Denmark)

Dimopoulos, Konstantinos; Gimsing, Peter; Grønbæk, Kirsten

2013-01-01

Multiple myeloma (MM) is a devastating disease with a complex biology, and in spite of improved survivability by novel treatment strategies over the last decade, MM is still incurable by current therapy. MicroRNAs (miRNAs) are small, non-coding RNA molecules that regulate gene expression at a post...
Modular design of synthetic gene circuits with biological parts and pools.

Science.gov (United States)

Marchisio, Mario Andrea

2015-01-01

Synthetic gene circuits can be designed in an electronic fashion by displaying their basic components (Standard Biological Parts and Pools of molecules) on the computer screen and connecting them with hypothetical wires. This procedure, achieved by our add-on for the software ProMoT, was successfully applied to bacterial circuits. Recently, we have extended this design methodology to eukaryotic cells. Here, highly complex components such as promoters and Pools of mRNA contain hundreds of species and reactions whose calculation demands a rule-based modeling approach. We showed how to build such complex modules via the joint employment of the software BioNetGen (rule-based modeling) and ProMoT (modularization). In this chapter, we illustrate how to utilize our computational tool for synthetic biology with the in silico implementation of a simple eukaryotic gene circuit that performs the logic AND operation.
Understanding Biological Regulation Through Synthetic Biology.

Science.gov (United States)

Bashor, Caleb J; Collins, James J

2018-03-16

Engineering synthetic gene regulatory circuits proceeds through iterative cycles of design, building, and testing. Initial circuit designs must rely on often-incomplete models of regulation established by fields of reductive inquiry: biochemistry and molecular and systems biology. As differences in designed and experimentally observed circuit behavior are inevitably encountered, investigated, and resolved, each turn of the engineering cycle can force a resynthesis in understanding of natural network function. Here, we outline research that uses the process of gene circuit engineering to advance biological discovery. Synthetic gene circuit engineering research has not only refined our understanding of cellular regulation but furnished biologists with a toolkit that can be directed at natural systems to exact precision manipulation of network structure. As we discuss, using circuit engineering to predictively reorganize, rewire, and reconstruct cellular regulation serves as the ultimate means of testing and understanding how cellular phenotype emerges from systems-level network function. Expected final online publication date for the Annual Review of Biophysics Volume 47 is May 20, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Phylogenetic reconstruction and DNA barcoding for closely related pine moth species (Dendrolimus) in China with multiple gene markers.

Science.gov (United States)

Dai, Qing-Yan; Gao, Qiang; Wu, Chun-Sheng; Chesters, Douglas; Zhu, Chao-Dong; Zhang, Ai-Bing

2012-01-01

Unlike distinct species, closely related species offer a great challenge for phylogeny reconstruction and species identification with DNA barcoding due to their often overlapping genetic variation. We tested a sibling species group of pine moth pests in China with a standard cytochrome c oxidase subunit I (COI) gene and two alternative internal transcribed spacer (ITS) genes (ITS1 and ITS2). Five different phylogenetic/DNA barcoding analysis methods (Maximum likelihood (ML)/Neighbor-joining (NJ), "best close match" (BCM), Minimum distance (MD), and BP-based method (BP)), representing commonly used methodology (tree-based and non-tree based) in the field, were applied to both single-gene and multiple-gene analyses. Our results demonstrated clear reciprocal species monophyly for three relatively distant related species, Dendrolimus superans, D. houi, D. kikuchii, as recovered by both single and multiple genes while the phylogenetic relationship of three closely related species, D. punctatus, D. tabulaeformis, D. spectabilis, could not be resolved with the traditional tree-building methods. Additionally, we find the standard COI barcode outperforms two nuclear ITS genes, whatever the methods used. On average, the COI barcode achieved a success rate of 94.10-97.40%, while ITS1 and ITS2 obtained a success rate of 64.70-81.60%, indicating ITS genes are less suitable for species identification in this case. We propose the use of an overall success rate of species identification that takes both sequencing success and assignation success into account, since species identification success rates with multiple-gene barcoding system were generally overestimated, especially by tree-based methods, where only successfully sequenced DNA sequences were used to construct a phylogenetic tree. Non-tree based methods, such as MD, BCM, and BP approaches, presented advantages over tree-based methods by reporting the overall success rates with statistical significance. In addition, our
Phylogenetic reconstruction and DNA barcoding for closely related pine moth species (Dendrolimus) in China with multiple gene markers.

Directory of Open Access Journals (Sweden)

Qing-Yan Dai

Full Text Available Unlike distinct species, closely related species offer a great challenge for phylogeny reconstruction and species identification with DNA barcoding due to their often overlapping genetic variation. We tested a sibling species group of pine moth pests in China with a standard cytochrome c oxidase subunit I (COI) gene and two alternative internal transcribed spacer (ITS) genes (ITS1 and ITS2). Five different phylogenetic/DNA barcoding analysis methods (Maximum likelihood (ML)/Neighbor-joining (NJ), "best close match" (BCM), Minimum distance (MD), and BP-based method (BP)), representing commonly used methodology (tree-based and non-tree based) in the field, were applied to both single-gene and multiple-gene analyses. Our results demonstrated clear reciprocal species monophyly for three relatively distant related species, Dendrolimus superans, D. houi, D. kikuchii, as recovered by both single and multiple genes while the phylogenetic relationship of three closely related species, D. punctatus, D. tabulaeformis, D. spectabilis, could not be resolved with the traditional tree-building methods. Additionally, we find the standard COI barcode outperforms two nuclear ITS genes, whatever the methods used. On average, the COI barcode achieved a success rate of 94.10-97.40%, while ITS1 and ITS2 obtained a success rate of 64.70-81.60%, indicating ITS genes are less suitable for species identification in this case. We propose the use of an overall success rate of species identification that takes both sequencing success and assignation success into account, since species identification success rates with multiple-gene barcoding system were generally overestimated, especially by tree-based methods, where only successfully sequenced DNA sequences were used to construct a phylogenetic tree. Non-tree based methods, such as MD, BCM, and BP approaches, presented advantages over tree-based methods by reporting the overall success rates with statistical significance. In
Characteristics and Validation Techniques for PCA-Based Gene-Expression Signatures

Directory of Open Access Journals (Sweden)

Anders E. Berglund

2017-01-01

Full Text Available Background. Many gene-expression signatures exist for describing the biological state of profiled tumors. Principal Component Analysis (PCA) can be used to summarize a gene signature into a single score. Our hypothesis is that gene signatures can be validated when applied to new datasets, using inherent properties of PCA. Results. This validation is based on four key concepts. Coherence: elements of a gene signature should be correlated beyond chance. Uniqueness: the general direction of the data being examined can drive most of the observed signal. Robustness: if a gene signature is designed to measure a single biological effect, then this signal should be sufficiently strong and distinct compared to other signals within the signature. Transferability: the derived PCA gene signature score should describe the same biology in the target dataset as it does in the training dataset. Conclusions. The proposed validation procedure ensures that PCA-based gene signatures perform as expected when applied to datasets other than those that the signatures were trained upon. Complex signatures, describing multiple independent biological components, are also easily identified.
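The idea of summarizing a signature as a first-principal-component score, and of checking coherence against chance, can be illustrated with a small sketch. This is not the authors' procedure: the function names, the simulated data, and the use of mean absolute pairwise correlation as a coherence measure are all assumptions made for illustration.

```python
import numpy as np

def pca_signature_score(expr):
    """expr: samples x genes matrix. Returns (PC1 scores, fraction of variance on PC1)."""
    centered = expr - expr.mean(axis=0)
    # SVD of the centered matrix; rows of vt are the principal axes.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    scores = centered @ vt[0]
    explained = s**2 / np.sum(s**2)
    return scores, float(explained[0])

def coherence(expr):
    """Mean absolute pairwise Pearson correlation between genes (assumed measure)."""
    corr = np.corrcoef(expr, rowvar=False)
    upper = np.triu_indices_from(corr, k=1)
    return float(np.mean(np.abs(corr[upper])))

# Simulated data (hypothetical): ten genes driven by one latent effect,
# compared with ten unrelated genes drawn at random.
rng = np.random.default_rng(0)
n_samples, n_genes = 60, 10
latent = rng.normal(size=n_samples)
loadings = rng.uniform(0.8, 1.2, n_genes)
signature = latent[:, None] * loadings + 0.3 * rng.normal(size=(n_samples, n_genes))
random_genes = rng.normal(size=(n_samples, n_genes))

sig_scores, sig_pc1 = pca_signature_score(signature)
_, rnd_pc1 = pca_signature_score(random_genes)
sig_coh = coherence(signature)
rnd_coh = coherence(random_genes)
print(f"PC1 variance: signature={sig_pc1:.2f}, random={rnd_pc1:.2f}")
print(f"coherence:    signature={sig_coh:.2f}, random={rnd_coh:.2f}")
```

In this toy setting a coherent signature concentrates most of its variance on the first component, whereas a random gene set does not. The sign of a PCA score is arbitrary, so only its absolute correlation with the underlying effect is meaningful.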
GeneLab Phase 2: Integrated Search Data Federation of Space Biology Experimental Data

Science.gov (United States)

Tran, P. B.; Berrios, D. C.; Gurram, M. M.; Hashim, J. C. M.; Raghunandan, S.; Lin, S. Y.; Le, T. Q.; Heher, D. M.; Thai, H. T.; Welch, J. D.;

2016-01-01

The GeneLab project is a science initiative to maximize the scientific return of omics data collected from spaceflight and from ground simulations of microgravity and radiation experiments, supported by a data system for a public bioinformatics repository and collaborative analysis tools for these data. The mission of GeneLab is to maximize the utilization of the valuable biological research resources aboard the ISS by collecting genomic, transcriptomic, proteomic and metabolomic (so-called omics) data to enable the exploration of the molecular network responses of terrestrial biology to space environments using a systems biology approach. All GeneLab data are made available to a worldwide network of researchers through its open-access data system. GeneLab is currently being developed by NASA to support Open Science biomedical research in order to enable the human exploration of space and improve life on earth. Open access to Phase 1 of the GeneLab Data Systems (GLDS) was implemented in April 2015. Download volumes have grown steadily, mirroring the growth in curated space biology research data sets (61 as of June 2016), now exceeding 10 TB/month, with over 10,000 file downloads since the start of Phase 1. For the period April 2015 to May 2016, most frequently downloaded were data from studies of Mus musculus (39) followed closely by Arabidopsis thaliana (30), with the remaining downloads roughly equally split across 12 other organisms (each 10 of total downloads). GLDS Phase 2 is focusing on interoperability, supporting data federation, including integrated search capabilities, of GLDS-housed data sets with external data sources, such as gene expression data from NIH/NCBI's Gene Expression Omnibus (GEO), proteomic data from EBI's PRIDE system, and metagenomic data from Argonne National Laboratory's MG-RAST. GEO and MG-RAST employ specifications for investigation metadata that are different from those used by the GLDS and PRIDE (e.g., ISA-Tab). The GLDS Phase 2 system

Simultaneous gene finding in multiple genomes.

Science.gov (United States)

König, Stefanie; Romoth, Lars W; Gerischer, Lizzy; Stanke, Mario

2016-11-15

As the tree of life is populated with sequenced genomes ever more densely, the new challenge is the accurate and consistent annotation of entire clades of genomes. We address this problem with a new approach to comparative gene finding that takes a multiple genome alignment of closely related species and simultaneously predicts the location and structure of protein-coding genes in all input genomes, thereby exploiting negative selection and sequence conservation. The model prefers potential gene structures in the different genomes that are in agreement with each other, or, if not, where the exon gains and losses are plausible given the species tree. We formulate the multi-species gene finding problem as a binary labeling problem on a graph. The resulting optimization problem is NP hard, but can be efficiently approximated using a subgradient-based dual decomposition approach. The proposed method was tested on whole-genome alignments of 12 vertebrate and 12 Drosophila species. The accuracy was evaluated for human, mouse and Drosophila melanogaster and compared to competing methods. Results suggest that our method is well-suited for annotation of (a large number of) genomes of closely related species within a clade, in particular, when RNA-Seq data are available for many of the genomes. The transfer of existing annotations from one genome to another via the genome alignment is more accurate than previous approaches that are based on protein-spliced alignments, when the genomes are at close to medium distances. The method is implemented in C++ as part of Augustus and available open source at http://bioinf.uni-greifswald.de/augustus/. Contact: stefaniekoenig@ymail.com or mario.stanke@uni-greifswald.de. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Comparative analysis of human conjunctival and corneal epithelial gene expression with oligonucleotide microarrays.

Science.gov (United States)

Turner, Helen C; Budak, Murat T; Akinci, M A Murat; Wolosin, J Mario

2007-05-01

To determine global mRNA expression levels in corneal and conjunctival epithelia and identify transcripts that exhibit preferential tissue expression. cDNA samples derived from human conjunctival and corneal epithelia were hybridized in three independent experiments to a commercial oligonucleotide array representing more than 22,000 transcripts. The resultant signal intensities and microarray software transcript present/absent calls were used in conjunction with the local pooled error (LPE) statistical method to identify transcripts that are preferentially or exclusively expressed in one of the two tissues at significant levels (expression >1% of the beta-actin level). EASE (Expression Analysis Systematic Explorer software) was used to identify biological systems comparatively overrepresented in either epithelium. Immuno- and cytohistochemistry were performed to validate or expand on selected results of interest. The analysis identified 332 preferential and 93 exclusive significant corneal epithelial transcripts. The corresponding numbers of conjunctival epithelium transcripts were 592 and 211, respectively. The overrepresented biological processes in the cornea were related to cell adhesion and oxiredox equilibria and cytoprotection activities. In the conjunctiva, the biological processes that were most prominent were related to innate immunity and melanogenesis. Immunohistochemistry for antigen-presenting cells and melanocytes was consistent with these gene signatures. The transcript comparison identified a substantial number of genes that have either not been identified previously or are not known to be highly expressed in these two epithelia, including testican-1, ECM1, formin, CRTAC1, and NQO1 in the cornea and, in the conjunctiva, sPLA(2)-IIA, lipocalin 2, IGFBP3, multiple MHC class II proteins, and the Na-Pi cotransporter type IIb. Comparative gene expression profiling leads to the identification of many biological processes and previously unknown genes that
Eco-systems biology-From the gene to the stream

International Nuclear Information System (INIS)

Mothersill, Carmel; Seymour, Colin

2010-01-01

This review considers the implications for environmental health and ecosystem sustainability, of new developments in radiobiology and ecotoxicology. Specifically it considers how the non-targeted effects of low doses of radiation, which are currently being scrutinized experimentally, not only mirror similar effects from low doses of chemical stressors but may actually lead to unpredictable emergent effects at higher hierarchical levels. The position is argued that non-targeted effects are mechanistically important in coordinating phased hierarchical transitions (i.e. transitions which occur in a regulated sequence). The field of multiple stressors (both radiation and chemical) is highly complex and agents can interact in an additive, antagonistic or synergistic manner. The outcome following low dose multiple stressor exposure also is impacted by the context in which the stressors are received, perceived or communicated by the organism or tissue. Modern biology has given us very sensitive methods to examine changes following stressor interaction with biological systems at several levels of organization but the translation of these observations to ultimate risk remains difficult to resolve. Since multiple stressor exposure is the norm in the environment, it is essential to move away from single stressor-based protection and to develop tools, including legal instruments, which will enable us to use response-based risk assessment. Radiation protection in the context of multiple stressors includes consideration of humans and non-humans as separate groups requiring separate assessment frameworks. This is because for humans, individual survival and prevention of cancer are paramount but for animals, it is considered sufficient to protect populations and cancer is not of concern. The need to revisit this position is discussed not only from the environmental perspective but also from the human health perspective because the importance of 'pollution' (a generic term for
Global map of physical interactions among differentially expressed genes in multiple sclerosis relapses and remissions.

Science.gov (United States)

Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat

2011-09-15

Multiple sclerosis (MS) is a central nervous system autoimmune inflammatory T-cell-mediated disease with a relapsing-remitting course in the majority of patients. In this study, we performed a high-resolution systems biology analysis of gene expression and physical interactions in MS relapse and remission. To this end, we integrated 164 large-scale measurements of gene expression in peripheral blood mononuclear cells of MS patients in relapse or remission and healthy subjects, with large-scale information about the physical interactions between these genes obtained from public databases. These data were analyzed with a variety of computational methods. We find that there is a clear and significant global network-level signal that is related to the changes in gene expression of MS patients in comparison to healthy subjects. However, despite the clear differences in the clinical symptoms of MS patients in relapse versus remission, the network-level signal is weaker when comparing patients in these two stages of the disease. This result suggests that most of the genes have relatively similar expression levels in the two stages of the disease. In accordance with previous studies, we found that the pathways related to regulation of cell death, chemotaxis and inflammatory response are differentially expressed in the disease in comparison to healthy subjects, while pathways related to cell adhesion, cell migration and cell-cell signaling are activated in relapse in comparison to remission. However, the current study includes a detailed report of the exact set of genes involved in these pathways and the interactions between them. For example, we found that the genes TP53 and IL1 are 'network hubs' that interact with many of the differentially expressed genes in MS patients versus healthy subjects, and the epidermal growth factor receptor is a 'network hub' in the case of MS patients with relapse versus remission. The statistical approaches employed in this study enabled us
FunGene: the functional gene pipeline and repository.

Science.gov (United States)

Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

2013-01-01

Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
FunGene: the Functional Gene Pipeline and Repository

Directory of Open Access Journals (Sweden)

Jordan A. Fish

2013-10-01

Full Text Available Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
A pathway-based network analysis of hypertension-related genes

Science.gov (United States)

Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

2016-02-01

The complex network approach has become an effective way to describe interrelationships among large amounts of biological data, which is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease with many causes, including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. Using a threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.
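Selecting hubs by thresholding a combined centrality score can be sketched as follows. The abstract does not define "integrated centrality", so the definition used here (the mean of normalized degree and closeness centrality) is an assumption, and the node labels A to F are hypothetical. Only the 0.71 cutoff is taken from the text.

```python
from collections import deque

def degree_centrality(adj):
    """Degree divided by the maximum possible degree (n - 1)."""
    n = len(adj)
    return {v: len(neighbors) / (n - 1) for v, neighbors in adj.items()}

def closeness_centrality(adj):
    """Number of reachable nodes divided by the sum of shortest-path lengths (BFS)."""
    result = {}
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total > 0 else 0.0
    return result

def integrated_centrality(adj):
    """Assumed definition: average of degree and closeness centrality."""
    deg = degree_centrality(adj)
    clo = closeness_centrality(adj)
    return {v: (deg[v] + clo[v]) / 2 for v in adj}

# Hypothetical undirected network with one obvious hub (A).
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("A", "E"), ("B", "C"), ("E", "F")]
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

scores = integrated_centrality(adj)
hubs = sorted(v for v, s in scores.items() if s > 0.71)  # cutoff from the abstract
print(hubs)
```

In this toy graph only node A exceeds the cutoff. Which nodes pass such a threshold depends strongly on which centrality measures are combined and how they are scaled.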
Potential of genes and gene products from Trichoderma sp. and Gliocladium sp. for the development of biological pesticides.

Science.gov (United States)

Lorito, M; Hayes, C K; Zoina, A; Scala, F; Del Sorbo, G; Woo, S L; Harman, G E

1994-12-01

Fungal cell wall degrading enzymes produced by the biocontrol fungi Trichoderma harzianum and Gliocladium virens are strong inhibitors of spore germination and hyphal elongation of a number of phytopathogenic fungi. The purified enzymes include chitinolytic enzymes with different modes of action or different substrate specificity and glucanolytic enzymes with exo-activity. A variety of synergistic interactions were found when different enzymes were combined or associated with biotic or abiotic antifungal agents. The levels of inhibition obtained by using enzyme combinations were, in some cases, comparable with commercial fungicides. Moreover, the antifungal interaction between enzymes and common fungicides allowed the reduction of the chemical doses up to 200-fold. Chitinolytic and glucanolytic enzymes from T. harzianum were able to improve substantially the antifungal ability of a biocontrol strain of Enterobacter cloacae. DNA fragments containing genes encoding for different chitinolytic enzymes were isolated from a cDNA library of T. harzianum and cloned for mechanistic studies and biocontrol purposes. Our results provide additional information on the role of lytic enzymes in processes of biocontrol and strongly suggest the use of lytic enzymes and their genes for biological control of plant diseases.
A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database

Directory of Open Access Journals (Sweden)

Tripputi Mark

2006-10-01

Full Text Available Abstract Background Many of the most popular pre-processing methods for Affymetrix expression arrays, such as RMA, gcRMA, and PLIER, simultaneously analyze data across a set of predetermined arrays to improve precision of the final measures of expression. One problem associated with these algorithms is that expression measurements for a particular sample are highly dependent on the set of samples used for normalization and results obtained by normalization with a different set may not be comparable. A related problem is that an organization producing and/or storing large amounts of data in a sequential fashion will need to either re-run the pre-processing algorithm every time an array is added or store them in batches that are pre-processed together. Furthermore, pre-processing of large numbers of arrays requires loading all the feature-level data into memory which is a difficult task even with modern computers. We utilize a scheme that produces all the information necessary for pre-processing using a very large training set that can be used for summarization of samples outside of the training set. All subsequent pre-processing tasks can be done on an individual array basis. We demonstrate the utility of this approach by defining a new version of the Robust Multi-chip Averaging (RMA) algorithm which we refer to as refRMA. Results We assess performance based on multiple sets of samples processed over HG U133A Affymetrix GeneChip® arrays. We show that the refRMA workflow, when used in conjunction with a large, biologically diverse training set, results in the same general characteristics as that of RMA in its classic form when comparing overall data structure, sample-to-sample correlation, and variation. Further, we demonstrate that the refRMA workflow and reference set can be robustly applied to naïve organ types and to benchmark data where its performance indicates respectable results. Conclusion Our results indicate that a biologically diverse
Stable carbon isotope fractionation of chlorinated ethenes by a microbial consortium containing multiple dechlorinating genes.

Science.gov (United States)

Liu, Na; Ding, Longzhen; Li, Haijun; Zhang, Pengpeng; Zheng, Jixing; Weng, Chih-Huang

2018-08-01

The study aimed to determine the possible contribution of specific growth conditions and community structures to variable carbon enrichment factor (ε-carbon) values for the degradation of chlorinated ethenes (CEs) by a bacterial consortium with multiple dechlorinating genes. ε-carbon values for trichloroethylene, cis-1,2-dichloroethylene, and vinyl chloride were -7.24‰ ± 0.59‰, -14.6‰ ± 1.71‰, and -21.1‰ ± 1.14‰, respectively, during their degradation by a microbial consortium containing multiple dechlorinating genes including tceA and vcrA. The ε-carbon values of all CEs were not greatly affected by changes in growth conditions and community structures, which directly or indirectly affected reductive dechlorination of CEs by this consortium. Stability analysis provided evidence that the presence of multiple dechlorinating genes within a microbial consortium had little effect on carbon isotope fractionation, as long as the genes have definite, non-overlapping functions. Copyright © 2018 Elsevier Ltd. All rights reserved.
A multicolor panel of TALE-KRAB based transcriptional repressor vectors enabling knockdown of multiple gene targets.

Science.gov (United States)

Zhang, Zhonghui; Wu, Elise; Qian, Zhijian; Wu, Wen-Shu

2014-12-05

Stable and efficient knockdown of multiple gene targets is highly desirable for dissection of molecular pathways. Because it allows sequence-specific DNA binding, transcription activator-like effector (TALE) offers a new genetic perturbation technique that allows for gene-specific repression. Here, we constructed a multicolor lentiviral TALE-Kruppel-associated box (KRAB) expression vector platform that enables knockdown of multiple gene targets. This platform is fully compatible with the Golden Gate TALEN and TAL Effector Kit 2.0, a widely used and efficient method for TALE assembly. We showed that this multicolor TALE-KRAB vector system when combined together with bone marrow transplantation could quickly knock down c-kit and PU.1 genes in hematopoietic stem and progenitor cells of recipient mice. Furthermore, our data demonstrated that this platform simultaneously knocked down both c-Kit and PU.1 genes in the same primary cell populations. Together, our results suggest that this multicolor TALE-KRAB vector platform is a promising and versatile tool for knockdown of multiple gene targets and could greatly facilitate dissection of molecular pathways.
Proteomic analysis of Bombyx mori molting fluid: Insights into the molting process.

Science.gov (United States)

Liu, Hua-Wei; Wang, Luo-Ling; Tang, Xin; Dong, Zhao-Ming; Guo, Peng-Chao; Zhao, Dong-Chao; Xia, Qing-You; Zhao, Ping

2018-02-20

Molting is an essential biological process occurring multiple times throughout the life cycle of most Ecdysozoa. Molting fluids accumulate and function in the exuvial space during the molting process. In this study, we used liquid chromatography-tandem mass spectrometry to investigate the molting fluids to analyze the molecular mechanisms of molting in the silkworm, Bombyx mori. In total, 375 proteins were identified in molting fluids from the silkworm at 14-16h before pupation and eclosion, including 12 chitin metabolism-related enzymes, 35 serine proteases, 15 peptidases, and 38 protease inhibitors. Gene ontology analysis indicated that "catalytic" constitutes the most enriched function in the molting fluid. Gene expression patterns and bioinformatic analyses suggested that numerous enzymes are involved in the degradation of cuticle proteins and chitin. Protein-protein interaction network and activity analyses showed that protease inhibitors are involved in the regulation of multiple pathways in molting fluid. Additionally, many immune-related proteins may be involved in the immune defense during molting. These results provide a comprehensive proteomic insight into proteolytic enzymes and protease inhibitors in molting fluid, and will likely improve the current understanding of physiological processes in insect molting. Insect molting constitutes a dynamic physiological process. To better understand this process, we used LC-MS/MS to investigate the proteome of silkworm molting fluids and identified key proteins involved in silkworm molting. The biological processes of the old cuticle degradation pathway and immune defense response were analyzed in the proteome of silkworm molting fluid. We report that protease inhibitors serve as key factors in the regulation of the molting process. The proteomic results provide new insight into biological molting processes in insects. Copyright © 2017 Elsevier B.V. All rights reserved.

Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

Directory of Open Access Journals (Sweden)

Sahra Uygun

2016-12-01

Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by-association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes, using Arabidopsis thaliana as an example. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles. The choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.
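The "percentage of gene pairs in a pathway with significant expression correlation" can be illustrated with a short sketch. The significance criterion below (Pearson correlation above the 95th percentile of correlations among background gene pairs) is an assumption, as are the function names and the simulated expression matrix. The abstract does not state which similarity measure or cutoff the authors used in each analysis.

```python
import numpy as np
from itertools import combinations

def pairwise_r(expr, genes):
    """Pearson r for every unordered pair among `genes` (row indices of expr)."""
    corr = np.corrcoef(expr[genes])
    return np.array([corr[i, j] for i, j in combinations(range(len(genes)), 2)])

def fraction_coexpressed(expr, pathway, background, quantile=0.95):
    """Fraction of pathway gene pairs whose r exceeds a background-derived cutoff."""
    cutoff = np.quantile(pairwise_r(expr, background), quantile)
    r = pairwise_r(expr, pathway)
    return float(np.mean(r > cutoff)), float(cutoff)

# Simulated genes x conditions matrix (hypothetical data).
rng = np.random.default_rng(1)
n_genes, n_conditions = 40, 30
expr = rng.normal(size=(n_genes, n_conditions))
shared = rng.normal(size=n_conditions)
expr[:4] += 3.0 * shared          # genes 0-3 share a strong regulatory signal

pathway = [0, 1, 2, 3, 4, 5]      # four co-regulated genes plus two unrelated ones
background = list(range(6, 40))   # genes assumed to be functionally unrelated

frac, cutoff = fraction_coexpressed(expr, pathway, background)
print(f"cutoff r = {cutoff:.2f}, co-expressed pathway pairs = {frac:.0%}")
```

Even in this idealized case only part of the pathway's gene pairs pass the cutoff, which mirrors the abstract's point that membership in the same pathway does not imply similar transcript profiles.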
Determination of Biological Treatability Processes of Textile Wastewater and Implementation of a Fuzzy Logic Model

Directory of Open Access Journals (Sweden)

Harun Akif Kabuk

2015-01-01

Full Text Available This study investigated the biological treatability of textile wastewater. For this purpose, a membrane bioreactor (MBR) was utilized for biological treatment after the ozonation process. Due to the refractory organic contents of textile wastewater that has a low biodegradability capacity, ozonation was implemented as an advanced oxidation process prior to the MBR system to increase the biodegradability of the wastewater. Textile wastewater, oxidized by ozonation, was fed to the MBR at different hydraulic retention times (HRT). During the process, color, chemical oxygen demand (COD), and biochemical oxygen demand (BOD) removal efficiencies were monitored for 24-hour, 12-hour, 6-hour, and 3-hour retention times. Under these conditions, 94% color, 65% COD, and 55% BOD removal efficiencies were obtained in the MBR system. The experimental outputs were modeled with multiple linear regressions (MLR) and fuzzy logic. MLR results suggested that color removal is more related to COD removal relative to BOD removal. A surface map of this issue was prepared with a fuzzy logic model. Furthermore, fuzzy logic was employed to the whole modeling of the biological system treatment. Determination coefficients for COD, BOD, and color removal efficiencies were 0.96, 0.97, and 0.92, respectively.
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm.

Science.gov (United States)

Tchagang, Alain B; Phan, Sieu; Famili, Fazel; Shearer, Heather; Fobert, Pierre; Huang, Yi; Zou, Jitao; Huang, Daiqing; Cutler, Adrian; Liu, Ziying; Pan, Youlian

2012-04-04

Nowadays, it is possible to collect expression levels of a set of genes from a set of biological samples during a series of time points. Such data have three dimensions: gene-sample-time (GST). Thus they are called 3D microarray gene expression data. To take advantage of the 3D data collected, and to fully understand the biological knowledge hidden in the GST data, novel subspace clustering algorithms have to be developed to effectively address the biological problem in the corresponding space. We developed a subspace clustering algorithm called Order Preserving Triclustering (OPTricluster), for 3D short time-series data mining. OPTricluster is able to identify 3D clusters with coherent evolution from a given 3D dataset using a combinatorial approach on the sample dimension, and the order preserving (OP) concept on the time dimension. The fusion of the two methodologies allows one to study similarities and differences between samples in terms of their temporal expression profile. OPTricluster has been successfully applied to four case studies: immune response in mice infected by malaria (Plasmodium chabaudi), systemic acquired resistance in Arabidopsis thaliana, similarities and differences between inner and outer cotyledon in Brassica napus during seed development, and to Brassica napus whole seed development. These studies showed that OPTricluster is robust to noise and is able to detect the similarities and differences between biological samples. Our analysis showed that OPTricluster generally outperforms other well known clustering algorithms such as the TRICLUSTER, gTRICLUSTER and K-means; it is robust to noise and can effectively mine the biological knowledge hidden in the 3D short time-series gene expression data.
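The order preserving (OP) idea on the time dimension can be sketched as follows: two time series share a pattern when their time points rank in the same order, whatever the absolute expression levels. This is a simplified illustration, not the OPTricluster implementation; the function names and toy data are hypothetical, and ties between time points are broken by position rather than by a tolerance threshold.

```python
from collections import defaultdict


def order_pattern(series):
    """Time-point indices sorted by expression value (the ordering pattern)."""
    return tuple(sorted(range(len(series)), key=lambda t: series[t]))


def group_by_pattern(sample):
    """Group the genes of one sample by their ordering pattern.

    sample maps gene name -> expression values over the time points.
    """
    groups = defaultdict(list)
    for gene, series in sample.items():
        groups[order_pattern(series)].append(gene)
    return dict(groups)


def conserved_genes(sample_a, sample_b):
    """Genes whose ordering pattern is identical in both samples."""
    return sorted(
        g for g in sample_a
        if g in sample_b
        and order_pattern(sample_a[g]) == order_pattern(sample_b[g])
    )


# Toy gene-sample-time data: two samples, three genes, three time points.
sample_a = {"g1": [1, 3, 2], "g2": [10, 30, 20], "g3": [5, 4, 6]}
sample_b = {"g1": [2, 9, 4], "g2": [3, 2, 1], "g3": [7, 1, 8]}

groups_a = group_by_pattern(sample_a)
shared = conserved_genes(sample_a, sample_b)
```

Here g1 and g2 fall into the same group within sample_a despite a tenfold difference in scale, while comparing the two samples shows that g1 and g3 keep their temporal ordering and g2 does not.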
Organization and post-transcriptional processing of focal adhesion kinase gene

Directory of Open Access Journals (Sweden)

Enslen Hervé

2006-08-01

Full Text Available Abstract Background Focal adhesion kinase (FAK) is a non-receptor tyrosine kinase critical for processes ranging from embryo development to cancer progression. Although isoforms with specific molecular and functional properties have been characterized in rodents and chicken, the organization of the FAK gene throughout phylogeny and its potential to generate multiple isoforms are not well understood. Here, we study the phylogeny of FAK, the organization of its gene, and its post-transcriptional processing in rodents and human. Results A single orthologue of FAK and the related PYK2 was found in non-vertebrate species. Gene duplication probably occurred in deuterostomes after the echinoderma embranchment, leading to the evolution of PYK2 with distinct properties. The amino acid sequence of FAK and PYK2 is conserved in their functional domains but not in their linker regions, with the absence of autophosphorylation site in C. elegans. Comparison of mouse and human FAK genes revealed the existence of multiple combinations of conserved and non-conserved 5'-untranslated exons in FAK transcripts suggesting a complex regulation of their expression. Four alternatively spliced coding exons (13, 14, 16, and 31), previously described in rodents, are highly conserved in vertebrates. Cis-regulatory elements known to regulate alternative splicing were found in conserved alternative exons of FAK or in the flanking introns. In contrast, other reported human variant exons were restricted to Homo sapiens, and, in some cases, other primates. Several of these non-conserved exons may correspond to transposable elements. The inclusion of conserved alternative exons was examined by RT-PCR in mouse and human brain during development. Inclusion of exons 14 and 16 peaked at the end of embryonic life, whereas inclusion of exon 13 increased steadily until adulthood. Study of various tissues showed that inclusion of these exons also occurred, independently from each other, in a
Role of hemagglutinin gene in the biology of avian influenza virus

Directory of Open Access Journals (Sweden)

Masoud Soltanialvar

2016-06-01

Full Text Available Hemagglutinin (HA), the major envelope glycoprotein of influenza, plays an important role during the early stage of infection, and changes in the HA gene precede the emergence of pathogenic avian influenza viruses. The HA protein controls viral entry through membrane fusion of the viral envelope with the host cell membrane and allows the released genetic information to initiate new virus synthesis. Sharp antigenic variation of HA remains the critical challenge to the development of effective vaccines. Therefore, we highlight the aspects of HA that need review: the structure of HA, the fusion process, the HA receptor binding specificity in interspecies transmission, the impact of multiple mutations at antigenic sites and of host antibodies to the parental virus, and the host susceptibility to productive infection by the drift strains.
Fast Gene Ontology based clustering for microarray experiments

Directory of Open Access Journals (Sweden)

Ovaska Kristian

2008-11-01

Full Text Available Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

Science.gov (United States)

Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

2017-04-12

Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs); (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.
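The general idea of using k-means as a post-processing step on an existing module partition can be sketched in a few lines: start from the initial module labels, compute one centroid per module, and move each gene to its nearest centroid until nothing changes. This is only an illustration and not the km2gcn code. The function names and toy data are hypothetical, and the choice of mean profiles as centroids with squared Euclidean distance is an assumption; the package may define centroids and distances differently.

```python
def mean_profile(rows):
    """Column-wise mean of a list of equal-length profiles."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two profiles."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def refine_modules(expr, modules, max_iter=20):
    """k-means style refinement of an initial gene-to-module assignment.

    expr maps gene -> expression profile; modules maps gene -> module label
    and must cover every gene in expr.
    """
    assign = dict(modules)
    for _ in range(max_iter):
        labels = sorted(set(assign.values()))
        # One centroid per module still in use (assumed: mean profile).
        centroids = {
            m: mean_profile([expr[g] for g in assign if assign[g] == m])
            for m in labels
        }
        new_assign = {
            g: min(labels, key=lambda m: sq_dist(expr[g], centroids[m]))
            for g in expr
        }
        if new_assign == assign:
            break  # converged: no gene changed module
        assign = new_assign
    return assign


# Toy data: g5 behaves like module A but starts out in module B.
expr = {
    "g1": [1.0, 1.0], "g2": [1.2, 0.9], "g5": [0.9, 1.1],
    "g3": [5.0, 5.0], "g4": [5.1, 4.8],
}
initial = {"g1": "A", "g2": "A", "g3": "B", "g4": "B", "g5": "B"}
refined = refine_modules(expr, initial)
```

In the toy example the misplaced gene g5 moves to module A after one pass, which is the kind of correction the abstract summarizes as "few or zero misplaced genes".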
Towards the understanding of network information processing in biology

Science.gov (United States)

Singh, Vijay

Living organisms perform incredibly well in detecting a signal present in the environment. This information processing is achieved near optimally and quite reliably, even though the sources of signals are highly variable and complex. The work in the last few decades has given us a fair understanding of how individual signal processing units like neurons and cell receptors process signals, but the principles of collective information processing on biological networks are far from clear. Information processing in biological networks, like the brain, metabolic circuits, cellular-signaling circuits, etc., involves complex interactions among a large number of units (neurons, receptors). The combinatorially large number of states such a system can exist in makes it impossible to study these systems from the first principles, starting from the interactions between the basic units. The principles of collective information processing on such complex networks can be identified using coarse graining approaches. This could provide insights into the organization and function of complex biological networks. Here I study models of biological networks using continuum dynamics, renormalization, maximum likelihood estimation and information theory. Such coarse graining approaches identify features that are essential for certain processes performed by underlying biological networks. We find that long-range connections in the brain allow for global scale feature detection in a signal. These also suppress the noise and remove any gaps present in the signal. Hierarchical organization with long-range connections leads to large-scale connectivity at low synapse numbers. Time delays can be utilized to separate a mixture of signals with temporal scales. Our observations indicate that the rules in multivariate signal processing are quite different from traditional single unit signal processing.
Learning style and concept acquisition of community college students in introductory biology

Science.gov (United States)

Bobick, Sandra Burin

This study investigated the influence of learning style on concept acquisition within a sample of community college students in a general biology course. There are two subproblems within the larger problem: (1) the influence of demographic variables (age, gender, number of college credits, prior exposure to scientific information) on learning style, and (2) the correlations between prior scientific knowledge, learning style and student understanding of the concept of the gene. The sample included all students enrolled in an introductory general biology course during two consecutive semesters at an urban community college. Initial data were gathered during the first week of the semester, at which time students filled in a short questionnaire (age, gender, number of college credits, prior exposure to science information either through reading/visual sources or a prior biology course). Subjects were then given the Inventory of Learning Processes-Revised (ILP-R), which measures general preferences in five learning styles: Deep Learning, Elaborative Learning, Agentic Learning, Methodical Learning and Literal Memorization. Subjects were then given the Gene Conceptual Knowledge pretest: a 15-question objective section and an essay section. Subjects were exposed to specific concepts during lecture and laboratory exercises. At the last lab, students were given the Genetics Conceptual Knowledge Posttest. Pretest/posttest gains were correlated with demographic variables and learning styles were analyzed for significant correlations. Learning styles, as the independent variable in a simultaneous multiple regression, were significant predictors of results on the gene assessment tests, including pretest, posttest and gain. Of the learning styles, Deep Learning accounted for the greatest positive predictive value of pretest essay and pretest objective results. Literal Memorization was a significant negative predictor for posttest essay, essay gain and objective gain. Simultaneous
Dominating biological networks.

Directory of Open Access Journals (Sweden)

Tijana Milenković

Full Text Available Proteins are essential macromolecules of life that carry out most cellular processes. Since proteins aggregate to perform function, and since protein-protein interaction (PPI) networks model these aggregations, one would expect to uncover new biology from PPI network topology. Hence, using PPI networks to predict protein function and role of protein pathways in disease has received attention. A debate remains open about whether network properties of "biologically central" (BC) genes (i.e., their protein products), such as those involved in aging, cancer, infectious diseases, or signaling and drug-targeted pathways, exhibit some topological centrality compared to the rest of the proteins in the human PPI network. To help resolve this debate, we design new network-based approaches and apply them to get new insight into biological function and disease. We hypothesize that BC genes have a topologically central (TC) role in the human PPI network. We propose two different concepts of topological centrality. We design a new centrality measure to capture complex wirings of proteins in the network that identifies as TC those proteins that reside in dense extended network neighborhoods. Also, we use the notion of domination and find dominating sets (DSs) in the PPI network, i.e., sets of proteins such that every protein is either in the DS or is a neighbor of the DS. Clearly, a DS has a TC role, as it enables efficient communication between different network parts. We find statistically significant enrichment in BC genes of TC nodes and outperform the existing methods, indicating that genes involved in key biological processes occupy topologically complex and dense regions of the network and correspond to its "spine" that connects all other network parts and can thus pass cellular signals efficiently throughout the network. To our knowledge, this is the first study that explores domination in the context of PPI networks.
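The definition of a dominating set given in this abstract (every protein is either in the set or adjacent to a member of it) can be made concrete with a small sketch. The abstract does not say which algorithm the authors used to find dominating sets, so the greedy heuristic below is an assumption chosen for brevity; it yields a valid dominating set but not necessarily a minimum one. The function names and the toy network are hypothetical.

```python
def greedy_dominating_set(adj):
    """Greedy heuristic for a dominating set of an undirected graph.

    adj maps each node to the set of its neighbours. At every step the node
    that dominates the most still-undominated nodes is added to the set.
    """
    undominated = set(adj)
    ds = set()
    while undominated:
        # sorted() makes tie-breaking deterministic (first name wins).
        best = max(sorted(adj),
                   key=lambda v: len(({v} | adj[v]) & undominated))
        ds.add(best)
        undominated -= {best} | adj[best]
    return ds


def is_dominating(adj, ds):
    """True if every node is in ds or has at least one neighbour in ds."""
    return all(v in ds or bool(adj[v] & ds) for v in adj)


# Toy interaction network: P1 is a hub, P5 hangs off P4.
ppi = {
    "P1": {"P2", "P3", "P4"},
    "P2": {"P1"},
    "P3": {"P1"},
    "P4": {"P1", "P5"},
    "P5": {"P4"},
}
ds = greedy_dominating_set(ppi)
```

On the toy network the heuristic first selects the hub P1 and then P4 to cover P5, so two proteins dominate all five nodes.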
Synthetic Biology: Tools to Design, Build, and Optimize Cellular Processes

Science.gov (United States)

Young, Eric; Alper, Hal

2010-01-01

The general central dogma frames the emergent properties of life, which make biology both necessary and difficult to engineer. In a process engineering paradigm, each biological process stream and process unit is heavily influenced by regulatory interactions and interactions with the surrounding environment. Synthetic biology is developing the tools and methods that will increase control over these interactions, eventually resulting in an integrative synthetic biology that will allow ground-up cellular optimization. In this review, we attempt to contextualize the areas of synthetic biology into three tiers: (1) the process units and associated streams of the central dogma, (2) the intrinsic regulatory mechanisms, and (3) the extrinsic physical and chemical environment. Efforts at each of these three tiers attempt to control cellular systems and take advantage of emerging tools and approaches. Ultimately, it will be possible to integrate these approaches and realize the vision of integrative synthetic biology when cells are completely rewired for biotechnological goals. This review will highlight progress towards this goal as well as areas requiring further research. PMID:20150964
Synthetic Biology: Tools to Design, Build, and Optimize Cellular Processes

Directory of Open Access Journals (Sweden)

Eric Young

2010-01-01

Full Text Available The general central dogma frames the emergent properties of life, which make biology both necessary and difficult to engineer. In a process engineering paradigm, each biological process stream and process unit is heavily influenced by regulatory interactions and interactions with the surrounding environment. Synthetic biology is developing the tools and methods that will increase control over these interactions, eventually resulting in an integrative synthetic biology that will allow ground-up cellular optimization. In this review, we attempt to contextualize the areas of synthetic biology into three tiers: (1) the process units and associated streams of the central dogma, (2) the intrinsic regulatory mechanisms, and (3) the extrinsic physical and chemical environment. Efforts at each of these three tiers attempt to control cellular systems and take advantage of emerging tools and approaches. Ultimately, it will be possible to integrate these approaches and realize the vision of integrative synthetic biology when cells are completely rewired for biotechnological goals. This review will highlight progress towards this goal as well as areas requiring further research.
Synthetic biology: tools to design, build, and optimize cellular processes.

Science.gov (United States)

Young, Eric; Alper, Hal

2010-01-01

The general central dogma frames the emergent properties of life, which make biology both necessary and difficult to engineer. In a process engineering paradigm, each biological process stream and process unit is heavily influenced by regulatory interactions and interactions with the surrounding environment. Synthetic biology is developing the tools and methods that will increase control over these interactions, eventually resulting in an integrative synthetic biology that will allow ground-up cellular optimization. In this review, we attempt to contextualize the areas of synthetic biology into three tiers: (1) the process units and associated streams of the central dogma, (2) the intrinsic regulatory mechanisms, and (3) the extrinsic physical and chemical environment. Efforts at each of these three tiers attempt to control cellular systems and take advantage of emerging tools and approaches. Ultimately, it will be possible to integrate these approaches and realize the vision of integrative synthetic biology when cells are completely rewired for biotechnological goals. This review will highlight progress towards this goal as well as areas requiring further research.
The role of epigenetics in the biology of multiple myeloma

DEFF Research Database (Denmark)

Dimopoulos, K; Gimsing, P; Grønbæk, K

2014-01-01

Several recent studies have highlighted the biological complexity of multiple myeloma (MM) that arises as a result of several disrupted cancer pathways. Apart from the central role of genetic abnormalities, epigenetic aberrations have also been shown to be important players in the development of MM......, and a lot of research during the past decades has focused on the ways DNA methylation, histone modifications and noncoding RNAs contribute to the pathobiology of MM. This has led to, apart from better understanding of the disease biology, the development of epigenetic drugs, such as histone deacetylase...... inhibitors that are already used in clinical trials in MM with promising results. This review will present the role of epigenetic abnormalities in MM and how these can affect specific pathways, and focus on the potential of novel 'epidrugs' as future treatment modalities for MM....
Biological processes influencing contaminant release from sediments

International Nuclear Information System (INIS)

Reible, D.D.

1996-01-01

The influence of biological processes, including bioturbation, on the mobility of contaminants in freshwater sediments is described. Effective mass transfer coefficients are estimated for tubificid oligochaetes as a function of worm behavior and biomass density. The mass transfer coefficients were observed to be inversely proportional to water oxygen content and proportional to the square root of biomass density. The sediment reworking and contaminant release are contrasted with those of freshwater amphipods. The implications of these and other biological processes for contaminant release and in-situ remediation of soils and sediments are summarized. 4 figs., 1 tab
Treatment of slaughter wastewater by coagulation sedimentation-anaerobic biological filter and biological contact oxidation process

Science.gov (United States)

Sun, M.; Yu, P. F.; Fu, J. X.; Ji, X. Q.; Jiang, T.

2017-08-01

The optimal process parameters and conditions for the treatment of slaughterhouse wastewater by the coagulation sedimentation-AF-biological contact oxidation process were studied to solve the problem of treating high-concentration organic wastewater from small and medium-sized slaughter plants. The suitable water temperature and the optimum reaction time were determined by precipitation experiments; the effects of filtration rate and reflux ratio on COD and SS in the anaerobic biological filter, and of biofilm thickness and gas-water ratio on NH3-N and COD in the biological contact oxidation tank, were also studied. The results show that in coagulation sedimentation the optimum temperature is 16-24°C and the reaction time is 20 min, and that in the anaerobic biological filter reactor the optimum filtration rate is 0.6 m/h and the optimum reflux ratio is 300%. In the biological contact oxidation tank, the most suitable biofilm thickness is 1.8-2.2 mm and the most suitable gas-water ratio is 12:1-14:1. During 80 days of continuous operation of the coupled process, the average effluent mass concentrations of COD, TP and TN were 15.57 mg/L, 40 mg/L and 0.63 mg/L, and the average removal rates were 98.93%, 86.10% and 88.95%, respectively. The coupled process operates stably, produces good effluent quality, and is suitable for industrial application.
Integrative analysis of RUNX1 downstream pathways and target genes

Directory of Open Access Journals (Sweden)

Liu Marjorie

2008-07-01

Full Text Available Abstract Background The RUNX1 transcription factor gene is frequently mutated in sporadic myeloid and lymphoid leukemia through translocation, point mutation or amplification. It is also responsible for a familial platelet disorder with predisposition to acute myeloid leukemia (FPD-AML). The disruption of the largely unknown biological pathways controlled by RUNX1 is likely to be responsible for the development of leukemia. We have used multiple microarray platforms and bioinformatic techniques to help identify these biological pathways to aid in the understanding of why RUNX1 mutations lead to leukemia. Results Here we report genes regulated either directly or indirectly by RUNX1 based on the study of gene expression profiles generated from 3 different human and mouse platforms. The platforms used were global gene expression profiling of: 1) cell lines with RUNX1 mutations from FPD-AML patients, 2) over-expression of RUNX1 and CBFβ, and 3) Runx1 knockout mouse embryos using either cDNA or Affymetrix microarrays. We observe that our datasets (lists of differentially expressed genes) significantly correlate with published microarray data from sporadic AML patients with mutations in either RUNX1 or its cofactor, CBFβ. A number of biological processes were identified among the differentially expressed genes and functional assays suggest that heterozygous RUNX1 point mutations in patients with FPD-AML impair cell proliferation, microtubule dynamics and possibly genetic stability. In addition, analysis of the regulatory regions of the differentially expressed genes has for the first time systematically identified numerous potential novel RUNX1 target genes. Conclusion This work is the first large-scale study attempting to identify the genetic networks regulated by RUNX1, a master regulator in the development of the hematopoietic system and leukemia. The biological pathways and target genes controlled by RUNX1 will have considerable importance in disease
Genes from scratch--the evolutionary fate of de novo genes.

Science.gov (United States)

Schlötterer, Christian

2015-04-01

Although considered an extremely unlikely event, many genes emerge from previously noncoding genomic regions. This review covers the entire life cycle of such de novo genes. Two competing hypotheses about the process of de novo gene birth are discussed as well as the high death rate of de novo genes. Despite the high death rate, some de novo genes are retained and remain functional, even in distantly related species, through their integration into gene networks. Further studies combining gene expression with ribosome profiling in multiple populations across different species will be instrumental for an improved understanding of the evolutionary processes operating on de novo genes. Copyright © 2015 The Author. Published by Elsevier Ltd. All rights reserved.
Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

DEFF Research Database (Denmark)

Rossin, Elizabeth J.; Hansen, Kasper Lage; Raychaudhuri, Soumya

2011-01-01

Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed...... in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more...... that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non...
Feature selection and classification of MAQC-II breast cancer and multiple myeloma microarray gene expression data.

Directory of Open Access Journals (Sweden)

Qingzhong Liu

Full Text Available Microarray data has a high dimension of variables but available datasets usually have only a small number of samples, thereby making the study of such datasets interesting and challenging. In the task of analyzing microarray data for the purpose of, e.g., predicting gene-disease association, feature selection is very important because it provides a way to handle the high dimensionality by exploiting information redundancy induced by associations among genetic markers. Judicious feature selection in microarray data analysis can result in significant reduction of cost while maintaining or improving the classification or prediction accuracy of learning machines that are employed to sort out the datasets. In this paper, we propose a gene selection method called Recursive Feature Addition (RFA), which combines supervised learning and statistical similarity measures. We compare our method with the following gene selection methods: Support Vector Machine Recursive Feature Elimination (SVMRFE), Leave-One-Out Calculation Sequential Forward Selection (LOOCSFS), and Gradient based Leave-one-out Gene Selection (GLGS). To evaluate the performance of these gene selection methods, we employ several popular learning classifiers on the MicroArray Quality Control phase II on predictive modeling (MAQC-II) breast cancer dataset and the MAQC-II multiple myeloma dataset. Experimental results show that gene selection is strictly paired with learning classifier. Overall, our approach outperforms other compared methods. The biological functional analysis based on the MAQC-II breast cancer dataset convinced us to apply our method for phenotype prediction. Additionally, learning classifiers also play important roles in the classification of microarray data and our experimental results indicate that the Nearest Mean Scale Classifier (NMSC) is a good choice due to its prediction reliability and its stability across the three performance measurements: Testing accuracy, MCC values, and

Pre-mRNA mis-splicing of sarcomeric genes in heart failure.

Science.gov (United States)

Zhu, Chaoqun; Chen, Zhilong; Guo, Wei

2017-08-01

Pre-mRNA splicing is an important biological process that allows production of multiple proteins from a single gene in the genome, and is a main contributor to protein diversity in eukaryotic organisms. Alternative splicing is commonly governed by RNA binding proteins to meet the ever-changing demands of the cell. However, mis-splicing may lead to human diseases. In the human heart, mis-regulation of alternative splicing has been associated with heart failure. In this short review, we focus on alternative splicing of sarcomeric genes and review mis-splicing-related heart failure in relatively well-studied sarcomeric genes, along with splicing mechanisms that have identified regulatory factors. The prospects for alternative splicing-based therapeutic strategies in heart failure are also discussed. Copyright © 2016 Elsevier B.V. All rights reserved.
Nitrogen Cycle Evaluation (NiCE) Chip for the Simultaneous Analysis of Multiple N-Cycle Associated Genes.

Science.gov (United States)

Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi

2018-02-02

Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible for the N-transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive, especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named the N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide a high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely the Nitrogen Cycle Evaluation (NiCE) chip, which uses microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene
Synaptic genes are extensively downregulated across multiple brain regions in normal human aging and Alzheimer’s disease

Science.gov (United States)

Berchtold, Nicole C.; Coleman, Paul D.; Cribbs, David H.; Rogers, Joseph; Gillen, Daniel L.; Cotman, Carl W.

2014-01-01

Synapses are essential for transmitting, processing, and storing information, all of which decline in aging and Alzheimer’s disease (AD). Because synapse loss only partially accounts for the cognitive declines seen in aging and AD, we hypothesized that existing synapses might undergo molecular changes that reduce their functional capacity. Microarrays were used to evaluate expression profiles of 340 synaptic genes in aging (20–99 years) and AD across 4 brain regions from 81 cases. The analysis revealed an unexpectedly large number of significant expression changes in synapse-related genes in aging, with many undergoing progressive downregulation across aging and AD. Functional classification of the genes showing altered expression revealed that multiple aspects of synaptic function are affected, notably synaptic vesicle trafficking and release, neurotransmitter receptors and receptor trafficking, postsynaptic density scaffolding, cell adhesion regulating synaptic stability, and neuromodulatory systems. The widespread declines in synaptic gene expression in normal aging suggests that function of existing synapses might be impaired, and that a common set of synaptic genes are vulnerable to change in aging and AD. PMID:23273601
A viral microRNA down-regulates multiple cell cycle genes through mRNA 5'UTRs.

Directory of Open Access Journals (Sweden)

Finn Grey

2010-06-01

Global gene expression data combined with bioinformatic analysis provides strong evidence that mammalian miRNAs mediate repression of gene expression primarily through binding sites within the 3' untranslated region (UTR). Using RNA induced silencing complex immunoprecipitation (RISC-IP) techniques we have identified multiple cellular targets for a human cytomegalovirus (HCMV) miRNA, miR-US25-1. Strikingly, this miRNA binds target sites primarily within 5'UTRs, mediating significant reduction in gene expression. Intriguingly, many of the genes targeted by miR-US25-1 are associated with cell cycle control, including cyclin E2, BRCC3, EID1, MAPRE2, and CD147, suggesting that miR-US25-1 is targeting genes within a related pathway. Deletion of miR-US25-1 from HCMV results in overexpression of cyclin E2 in the context of viral infection. Our studies demonstrate that a viral miRNA mediates translational repression of multiple cellular genes by targeting mRNA 5'UTRs.
Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents.

Science.gov (United States)

Usie, Anabel; Karathia, Hiren; Teixidó, Ivan; Alves, Rui; Solsona, Francesc

2014-01-01

One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this 'up-to-dateness' came at the cost of large execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, Pathways, or any combination of the three types of entities and graphically representing those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is at least as good as that of other comparable applications (STRING and iHOP). In addition, we also show that the identification of GO processes is on par with that reported in the latest BioCreAtIvE challenge. Finally, we also report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains 'up-to-dateness' of the results. http://metres.udl.cat/index.php/downloads, metres.cmb@gmail.com.
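As a rough illustration of what document-level co-occurrence counting means, here is a minimal Python sketch. It is not the Biblio-MetReS implementation; the function name `cooccurrence_counts`, the toy documents, and the entity list are all hypothetical, and real tools would also handle synonyms and entity normalization.

```python
import re
from collections import Counter
from itertools import combinations

def cooccurrence_counts(documents, entities):
    """Count how many documents mention each pair of entities together."""
    counts = Counter()
    for text in documents:
        # Tokenize on alphanumeric runs so punctuation does not hide a match.
        tokens = set(re.findall(r"[a-z0-9]+", text.lower()))
        present = sorted(e for e in entities if e.lower() in tokens)
        # Every unordered pair present in this document co-occurs once.
        counts.update(combinations(present, 2))
    return counts

# Hypothetical toy corpus, for illustration only.
docs = [
    "TP53 binds MDM2 in stressed cells.",
    "MDM2 and TP53 levels oscillate.",
    "BRCA1 repairs DNA.",
]
edges = cooccurrence_counts(docs, ["TP53", "MDM2", "BRCA1"])
print(edges)
```

The resulting pair counts can be read as weighted edges of a co-occurrence network.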
Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents

Directory of Open Access Journals (Sweden)

Anabel Usie

2014-02-01

One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this ‘up-to-dateness’ came at the cost of large execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, Pathways, or any combination of the three types of entities and graphically representing those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is at least as good as that of other comparable applications (STRING and iHOP). In addition, we also show that the identification of GO processes is on par with that reported in the latest BioCreAtIvE challenge. Finally, we also report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains ‘up-to-dateness’ of the results. Availability: http://metres.udl.cat/index.php/downloads, Contact: metres.cmb@gmail.com.
Entropy and Multifractality for the Myeloma Multiple TET 2 Gene

Directory of Open Access Journals (Sweden)

Carlo Cattani

2012-01-01

The nucleotide and amino-acid distributions are studied for two variants of the mRNA of a gene that codes for a protein which is involved in multiple myeloid. Some patches and symmetries are singled out, thus showing some distinctions between the two variants. Fractal dimensions and entropy are discussed as well.
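For readers unfamiliar with entropy as applied to a nucleotide distribution, a minimal Python sketch of Shannon entropy follows. The abstract does not say which entropy measure the authors use, so Shannon entropy is an assumption here, and the two sequences are hypothetical toy strings, not the TET2 mRNA variants.

```python
import math
from collections import Counter

def shannon_entropy(sequence):
    """Shannon entropy, in bits per symbol, of the symbol frequencies."""
    counts = Counter(sequence)
    total = len(sequence)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

# Hypothetical toy sequences, for illustration only.
uniform = "ACGT" * 4           # all four nucleotides equally frequent
skewed = "AAAAAAAAAAAAAAAC"    # dominated by a single nucleotide

h_uniform = shannon_entropy(uniform)
h_skewed = shannon_entropy(skewed)
print(h_uniform, h_skewed)
```

A four-letter alphabet reaches its maximum of 2 bits per symbol only when all nucleotides are equally frequent; skewed composition lowers the value.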
The role of biological clock in glucose homeostasis

Directory of Open Access Journals (Sweden)

Piotr Chrościcki

2013-06-01

The mechanism of the biological clock is based on a rhythmic expression of clock genes and clock-controlled genes. As a result of their transcripto-translational associations, endogenous rhythms in the synthesis of key proteins of various physiological and metabolic processes are created. The major timekeeping mechanism for these rhythms exists in the central nervous system. The master circadian clock, localized in the suprachiasmatic nucleus (SCN), regulates multiple metabolic pathways, while feeding behavior and metabolite availability can in turn regulate the circadian clock. It is also suggested that in the brain there is a food entrainable oscillator (FEO) or oscillators, resulting in activation of both food anticipatory activity and hormone secretion that control digestion processes. Moreover, most cells and tissues express autonomous clocks. Maintenance of glucose homeostasis is particularly important for the proper function of the body, as this sugar is the main source of energy for the brain, retina, erythrocytes and skeletal muscles. Thus, glucose production and utilization are synchronized in time. The hypothalamic excited orexin neurons control the energy balance of the organism and modulate glucose production and utilization. Deficiency of orexin action results in narcolepsy and weight gain, whereas glucose and amino acids can affect the activity of the orexin cells. Large-scale genetic studies in rodents and humans provide evidence for the involvement of disrupted clock gene expression rhythms in the pathogenesis of obesity and type 2 diabetes. In general, the current lifestyle of developed modern societies disturbs the action of the biological clock.
Terminator Operon Reporter: combining a transcription termination switch with reporter technology for improved gene synthesis and synthetic biology applications.

Science.gov (United States)

Zampini, Massimiliano; Mur, Luis A J; Rees Stevens, Pauline; Pachebat, Justin A; Newbold, C James; Hayes, Finbarr; Kingston-Smith, Alison

2016-05-25

Synthetic biology is characterized by the development of novel and powerful DNA fabrication methods and by the application of engineering principles to biology. The current study describes Terminator Operon Reporter (TOR), a new gene assembly technology based on the conditional activation of a reporter gene in response to sequence errors occurring at the assembly stage of the synthetic element. These errors are monitored by a transcription terminator that is placed between the synthetic gene and reporter gene. Switching of this terminator between active and inactive states dictates the transcription status of the downstream reporter gene to provide a rapid and facile readout of the accuracy of synthetic assembly. Designed specifically and uniquely for the synthesis of protein coding genes in bacteria, TOR allows the rapid and cost-effective fabrication of synthetic constructs by employing oligonucleotides at the most basic purification level (desalted) and without the need for costly and time-consuming post-synthesis correction methods. Thus, TOR streamlines gene assembly approaches, which are central to the future development of synthetic biology.
Multiple genes encode the major surface glycoprotein of Pneumocystis carinii

DEFF Research Database (Denmark)

Kovacs, J A; Powell, F; Edman, J C

1993-01-01

hydrophobic region at the carboxyl terminus. The presence of multiple related msg genes encoding the major surface glycoprotein of P. carinii suggests that antigenic variation is a possible mechanism for evading host defenses. Further characterization of this family of genes should allow the development......The major surface antigen of Pneumocystis carinii, a life-threatening opportunistic pathogen in human immunodeficiency virus-infected patients, is an abundant glycoprotein that functions in host-organism interactions. A monoclonal antibody to this antigen is protective in animals, and thus...... blot studies using chromosomal or restricted DNA, the major surface glycoproteins are the products of a multicopy family of genes. The predicted protein has an M(r) of approximately 123,000, is relatively rich in cysteine residues (5.5%) that are very strongly conserved, and contains a well conserved...
Association of a novel point mutation in MSH2 gene with familial multiple primary cancers

Directory of Open Access Journals (Sweden)

Hai Hu

2017-10-01

Background: Multiple primary cancers (MPC) have been identified as two or more cancers without any subordinate relationship that occur either simultaneously or metachronously in the same or different organs of an individual. Lynch syndrome is an autosomal dominant genetic disorder that increases the risk of many types of cancers. Lynch syndrome patients who suffer more than two cancers can also be considered as MPC; patients of this kind provide unique resources to learn how genetic mutation causes MPC in different tissues. Methods: We performed whole genome sequencing on blood cells and two tumor samples of a Lynch syndrome patient who was diagnosed with five primary cancers. The mutational landscape of the tumors, including somatic point mutations and copy number alterations, was characterized. We also compared Lynch syndrome with sporadic cancers and proposed a model to illustrate the mutational process by which Lynch syndrome progresses to MPC. Results: We revealed a novel pathologic mutation on the MSH2 gene (G504 splicing) that associates with Lynch syndrome. Systematic comparison of the mutation landscape revealed that multiple cancers in the proband were evolutionarily independent. Integrative analysis showed that truncating mutations of DNA mismatch repair (MMR) genes were significantly enriched in the patient. A mutation progress model that included germline mutations of MMR genes, double hits of the MMR system, mutations in tissue-specific driver genes, and rapid accumulation of additional passenger mutations was proposed to illustrate how MPC occurs in Lynch syndrome patients. Conclusion: Our findings demonstrate that both germline and somatic alterations are driving forces of carcinogenesis, which may resolve the carcinogenic theory of Lynch syndrome.
Discovering time-lagged rules from microarray data using gene profile classifiers

Directory of Open Access Journals (Sweden)

Ponzoni Ignacio

2011-04-01

Background: Gene regulatory networks have an essential role in every process of life. In this regard, genome-wide time series data are becoming increasingly available, providing the opportunity to discover the time-delayed gene regulatory networks that govern the majority of these molecular processes. Results: This paper aims at reconstructing gene regulatory networks from multiple genome-wide microarray time series datasets. In this sense, a new model-free algorithm called GRNCOP2 (Gene Regulatory Network inference by Combinatorial OPtimization 2), which is a significant evolution of the GRNCOP algorithm, was developed using combinatorial optimization of gene profile classifiers. The method is capable of inferring potential time-delay relationships with any span of time between genes from various time series datasets given as input. The proposed algorithm was applied to time series data composed of twenty yeast genes that are highly relevant for the cell-cycle study, and the results were compared against several related approaches. The outcomes have shown that GRNCOP2 outperforms the contrasted methods in terms of the proposed metrics, and that the results are consistent with previous biological knowledge. Additionally, a genome-wide study on multiple publicly available time series data was performed. In this case, the experimentation has exhibited the soundness and scalability of the new method, which inferred highly-related statistically-significant gene associations. Conclusions: A novel method for inferring time-delayed gene regulatory networks from genome-wide time series datasets is proposed in this paper. The method was carefully validated with several publicly available data sets. The results have demonstrated that the algorithm constitutes a usable model-free approach capable of predicting meaningful relationships between genes, revealing the time-trends of gene regulation.
Multiple Genes Cause Postmating Prezygotic Reproductive Isolation in the Drosophila virilis Group.

Science.gov (United States)

Ahmed-Braimah, Yasir H

2016-12-07

Understanding the genetic basis of speciation is a central problem in evolutionary biology. Studies of reproductive isolation have provided several insights into the genetic causes of speciation, especially in taxa that lend themselves to detailed genetic scrutiny. Reproductive barriers have usually been divided into those that occur before zygote formation (prezygotic) and after (postzygotic), with the latter receiving a great deal of attention over several decades. Reproductive barriers that occur after mating but before zygote formation [postmating prezygotic (PMPZ)] are especially understudied at the genetic level. Here, I present a phenotypic and genetic analysis of a PMPZ reproductive barrier between two species of the Drosophila virilis group: D. americana and D. virilis. This species pair shows strong PMPZ isolation, especially when D. americana males mate with D. virilis females: ∼99% of eggs laid after these heterospecific copulations are not fertilized. Previous work has shown that the paternal loci contributing to this incompatibility reside on two chromosomes, one of which (chromosome 5) likely carries multiple factors. The other (chromosome 2) is fixed for a paracentric inversion that encompasses nearly half the chromosome. Here, I present two results. First, I show that PMPZ in this species cross is largely due to defective sperm storage in heterospecific copulations. Second, using advanced intercross and backcross mapping approaches, I identify genomic regions that carry genes capable of rescuing heterospecific fertilization. I conclude that paternal incompatibility between D. americana males and D. virilis females is underlain by four or more genes on chromosomes 2 and 5. Copyright © 2016 Ahmed-Braimah.
Interspecies Systems Biology Uncovers Metabolites Affecting C. elegans Gene Expression and Life History Traits

Science.gov (United States)

Watson, Emma; MacNeil, Lesley T.; Ritter, Ashlyn D.; Yilmaz, L. Safak; Rosebrock, Adam P.; Caudy, Amy A.; Walhout, Albertha J. M.

2014-01-01

SUMMARY Diet greatly influences gene expression and physiology. In mammals, elucidating the effects and mechanisms of individual nutrients is challenging due to the complexity of both the animal and its diet. Here we used an interspecies systems biology approach with Caenorhabditis elegans and two of its bacterial diets, Escherichia coli and Comamonas aquatica, to identify metabolites that affect the animal’s gene expression and physiology. We identify vitamin B12 as the major dilutable metabolite provided by Comamonas aq. that regulates gene expression, accelerates development and reduces fertility, but does not affect lifespan. We find that vitamin B12 has a dual role in the animal: it affects development and fertility via the methionine/S-Adenosylmethionine (SAM) cycle and breaks down the short-chain fatty acid propionic acid, preventing its toxic buildup. Our interspecies systems biology approach provides a paradigm for understanding complex interactions between diet and physiology. PMID:24529378
Step by Step: Biology Undergraduates' Problem-Solving Procedures during Multiple-Choice Assessment.

Science.gov (United States)

Prevost, Luanna B; Lemons, Paula P

2016-01-01

This study uses the theoretical framework of domain-specific problem solving to explore the procedures students use to solve multiple-choice problems about biology concepts. We designed several multiple-choice problems and administered them on four exams. We trained students to produce written descriptions of how they solved the problem, and this allowed us to systematically investigate their problem-solving procedures. We identified a range of procedures and organized them as domain general, domain specific, or hybrid. We also identified domain-general and domain-specific errors made by students during problem solving. We found that students use domain-general and hybrid procedures more frequently when solving lower-order problems than higher-order problems, while they use domain-specific procedures more frequently when solving higher-order problems. Additionally, the more domain-specific procedures students used, the higher the likelihood that they would answer the problem correctly, up to five procedures. However, if students used just one domain-general procedure, they were as likely to answer the problem correctly as if they had used two to five domain-general procedures. Our findings provide a categorization scheme and framework for additional research on biology problem solving and suggest several important implications for researchers and instructors. © 2016 L. B. Prevost and P. P. Lemons. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Mammalian synthetic biology: emerging medical applications.

Science.gov (United States)

Kis, Zoltán; Pereira, Hugo Sant'Ana; Homma, Takayuki; Pedrigi, Ryan M; Krams, Rob

2015-05-06

In this review, we discuss new emerging medical applications of the rapidly evolving field of mammalian synthetic biology. We start with simple mammalian synthetic biological components and move towards more complex and therapy-oriented gene circuits. A comprehensive list of ON-OFF switches, categorized into transcriptional, post-transcriptional, translational and post-translational, is presented in the first sections. Subsequently, Boolean logic gates, synthetic mammalian oscillators and toggle switches will be described. Several synthetic gene networks are further reviewed in the medical applications section, including cancer therapy gene circuits, immuno-regulatory networks, among others. The final sections focus on the applicability of synthetic gene networks to drug discovery, drug delivery, receptor-activating gene circuits and mammalian biomanufacturing processes. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Efficient Processing of Multiple DTW Queries in Time Series Databases

DEFF Research Database (Denmark)

Kremer, Hardy; Günnemann, Stephan; Ivanescu, Anca-Maria

2011-01-01

. In many of today’s applications, however, large numbers of queries arise at any given time. Existing DTW techniques do not process multiple DTW queries simultaneously, a serious limitation which slows down overall processing. In this paper, we propose an efficient processing approach for multiple DTW...... for multiple DTW queries....
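For context, the following Python sketch shows the textbook dynamic-programming form of dynamic time warping (DTW) and a naive baseline that answers several queries one at a time. It is not the multi-query technique proposed in the paper, whose details are not given in this truncated abstract; the function names and the toy series are assumptions for illustration.

```python
def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) DTW distance with |x - y| as local cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] holds the best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] also matches the previous b
                cost[i][j - 1],      # b[j-1] also matches the previous a
                cost[i - 1][j - 1],  # both series advance together
            )
    return cost[n][m]

def answer_queries(queries, series):
    """Naive baseline: one independent DTW computation per query."""
    return [dtw_distance(q, series) for q in queries]

# Hypothetical toy data, for illustration only.
results = answer_queries([[1, 2, 3], [0, 0]], [1, 2, 2, 3])
print(results)
```

The baseline repeats the full quadratic computation for every query, which is the kind of redundancy a simultaneous multi-query approach would aim to reduce.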
PSP: rapid identification of orthologous coding genes under positive selection across multiple closely related prokaryotic genomes.

Science.gov (United States)

Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping

2013-12-27

With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaptation to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, designated PSP (Positive Selection analysis for Prokaryotic genomes), for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and identify the genes undergoing positive selection, which may play key roles in host-pathogen interactions and/or environmental adaptation.
C/EBPβ Mediates Growth Hormone-Regulated Expression of Multiple Target Genes

Science.gov (United States)

Cui, Tracy X.; Lin, Grace; LaPensee, Christopher R.; Calinescu, Anda-Alexandra; Rathore, Maanjot; Streeter, Cale; Piwien-Pilipuk, Graciela; Lanning, Nathan; Jin, Hui; Carter-Su, Christin; Qin, Zhaohui S.

2011-01-01

Regulation of c-Fos transcription by GH is mediated by CCAAT/enhancer binding protein β (C/EBPβ). This study examines the role of C/EBPβ in mediating GH activation of other early response genes, including Cyr61, Btg2, Socs3, Zfp36, and Socs1. C/EBPβ depletion using short hairpin RNA impaired responsiveness of these genes to GH, as seen for c-Fos. Rescue with wild-type C/EBPβ led to GH-dependent recruitment of the coactivator p300 to the c-Fos promoter. In contrast, rescue with C/EBPβ mutated at the ERK phosphorylation site at T188 failed to induce GH-dependent recruitment of p300, indicating that ERK-mediated phosphorylation of C/EBPβ at T188 is required for GH-induced recruitment of p300 to c-Fos. GH also induced the occupancy of phosphorylated C/EBPβ and p300 on Cyr61, Btg2, and Socs3 at predicted C/EBP-cAMP response element-binding protein motifs in their promoters. Consistent with a role for ERKs in GH-induced expression of these genes, treatment with U0126 to block ERK phosphorylation inhibited their GH-induced expression. In contrast, GH-dependent expression of Zfp36 and Socs1 was not inhibited by U0126. Thus, induction of multiple early response genes by GH in 3T3-F442A cells is mediated by C/EBPβ. A subset of these genes is regulated similarly to c-Fos, through a mechanism involving GH-stimulated ERK 1/2 activation, phosphorylation of C/EBPβ, and recruitment of p300. Overall, these studies suggest that C/EBPβ, like the signal transducer and activator of transcription proteins, regulates multiple genes in response to GH. PMID:21292824
Gene expression-based biological test for major depressive disorder: an advanced study

Directory of Open Access Journals (Sweden)

Watanabe S

2017-02-01

Shin-ya Watanabe,1 Shusuke Numata,1 Jun-ichi Iga,2 Makoto Kinoshita,1 Hidehiro Umehara,1 Kazuo Ishii,3 Tetsuro Ohmori1 1Department of Psychiatry, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, 2Department of Neuropsychiatry, Molecules and Function, Ehime University Graduate School of Medicine, Ehime, 3Department of Applied Biological Science, Faculty of Agriculture, Tokyo University of Agriculture and Technology, Tokyo, Japan Purpose: Recently, we distinguished patients with major depressive disorder (MDD) from nonpsychiatric controls with high accuracy using a panel of five gene expression markers (ARHGAP24, HDAC5, PDGFC, PRNP, and SLC6A4) in leukocytes. In the present study, we examined whether this biological test is able to discriminate patients with MDD from those without MDD, including those with schizophrenia and bipolar disorder. Patients and methods: We measured messenger ribonucleic acid expression levels of the aforementioned five genes in peripheral leukocytes in 17 patients with schizophrenia and 36 patients with bipolar disorder using quantitative real-time polymerase chain reaction (PCR), and we combined these expression data with our previous expression data of 25 patients with MDD and 25 controls. Subsequently, a linear discriminant function was developed for use in discriminating between patients with MDD and without MDD. Results: This expression panel was able to segregate patients with MDD from those without MDD with a sensitivity and specificity of 64% and 67.9%, respectively. Conclusion: Further research to identify MDD-specific markers is needed to improve the performance of this biological test. Keywords: depressive disorder, biomarker, gene expression, schizophrenia, bipolar disorder
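To make the reported sensitivity and specificity concrete, the Python sketch below recomputes them from group sizes. The group sizes (25 MDD; 25 controls, 17 schizophrenia and 36 bipolar patients) come from the abstract. The counts of correctly classified subjects (16 and 53) are not reported in the abstract; they are back-calculated here as the integers that reproduce 64% and 67.9%, and should be treated as an assumption.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # fraction of true cases detected
    specificity = tn / (tn + fp)  # fraction of non-cases correctly rejected
    return sensitivity, specificity

n_mdd = 25                 # patients with MDD (from the abstract)
n_non_mdd = 25 + 17 + 36   # controls + schizophrenia + bipolar (from the abstract)

tp = 16  # assumed: MDD patients classified as MDD (16 / 25 = 64%)
tn = 53  # assumed: non-MDD subjects classified as non-MDD (53 / 78 = 67.9%)

sens, spec = sensitivity_specificity(tp, n_mdd - tp, tn, n_non_mdd - tn)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

Under these assumed counts, roughly one in three subjects in each group would be misclassified, which is consistent with the authors' call for more MDD-specific markers.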

Evolution of bow-tie architectures in biology.

Directory of Open Access Journals (Sweden)

Tamar Friedlander

2015-03-01

Bow-tie or hourglass structure is a common architectural feature found in many biological systems. A bow-tie in a multi-layered structure occurs when intermediate layers have far fewer components than the input and output layers. Examples include metabolism, where a handful of building blocks mediate between multiple input nutrients and multiple output biomass components, and signaling networks, where information from numerous receptor types passes through a small set of signaling pathways to regulate multiple output genes. Little is known, however, about how bow-tie architectures evolve. Here, we address the evolution of bow-tie architectures using simulations of multi-layered systems evolving to fulfill a given input-output goal. We find that bow-ties spontaneously evolve when the information in the evolutionary goal can be compressed. Mathematically speaking, bow-ties evolve when the rank of the input-output matrix describing the evolutionary goal is deficient. The maximal compression possible (the rank of the goal) determines the size of the narrowest part of the network, that is, the bow-tie. A further requirement is that a process is active to reduce the number of links in the network, such as product-rule mutations; otherwise a non-bow-tie solution is found in the evolutionary simulations. This offers a mechanism to understand a common architectural principle of biological systems, and a way to quantitate the effective rank of the goals under which they evolved.
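The link between rank deficiency and a narrow middle layer can be illustrated with a small Python sketch. It only computes the rank of a goal matrix by exact Gaussian elimination; it does not reproduce the evolutionary simulations. The 4x4 goal matrices are hypothetical examples chosen for illustration.

```python
from fractions import Fraction

def matrix_rank(rows):
    """Rank of a matrix via Gaussian elimination over exact rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    n_cols = len(m[0]) if m else 0
    rank = 0
    for col in range(n_cols):
        # Find a row at or below the current pivot row with a nonzero entry.
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        # Eliminate this column from all rows below the pivot row.
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            m[r] = [x - factor * y for x, y in zip(m[r], m[rank])]
        rank += 1
    return rank

# Hypothetical goals mapping 4 inputs to 4 outputs.
rank_one_goal = [[1, 2, 3, 4], [2, 4, 6, 8], [3, 6, 9, 12], [4, 8, 12, 16]]
rank_two_goal = [[1, 0, 1, 0], [0, 1, 0, 1], [1, 1, 1, 1], [1, -1, 1, -1]]
full_rank_goal = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

for name, goal in [("rank-one", rank_one_goal),
                   ("rank-two", rank_two_goal),
                   ("full-rank", full_rank_goal)]:
    print(name, matrix_rank(goal))
```

A rank-r goal can be factored through an intermediate layer of r components, so under the abstract's claim the first goal would admit a waist of width one, the second a waist of width two, and the third no compression at all.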
Gene Expression Measurement Module (GEMM) - a fully automated, miniaturized instrument for measuring gene expression in space

Science.gov (United States)

Karouia, Fathi; Ricco, Antonio; Pohorille, Andrew; Peyvan, Kianoosh

2012-07-01

The capability to measure gene expression on board spacecraft opens the door to a large number of experiments on the influence of the space environment on biological systems that will profoundly impact our ability to conduct safe and effective space travel, and might also shed light on terrestrial physiology or biological function and human disease and aging processes. Measurements of gene expression will help us to understand adaptation of terrestrial life to conditions beyond the planet of origin, identify deleterious effects of the space environment on a wide range of organisms from microbes to humans, develop effective countermeasures against these effects, determine the metabolic basis of microbial pathogenicity and drug resistance, test our ability to sustain and grow in space organisms that can be used for life support and in situ resource utilization during long-duration space exploration, and monitor both the spacecraft environment and crew health. These and other applications hold significant potential for discoveries in space biology, biotechnology and medicine. Accordingly, supported by funding from the NASA Astrobiology Science and Technology Instrument Development Program, we are developing a fully automated, miniaturized, integrated fluidic system for small spacecraft capable of measuring, in situ, microbial expression of thousands of genes from multiple samples. The instrument will be capable of (1) lysing bacterial cell walls, (2) extracting and purifying RNA released from cells, (3) hybridizing it on a microarray and (4) providing electrochemical readout, all in a microfluidics cartridge. The prototype under development is suitable for deployment on nanosatellite platforms developed by the NASA Small Spacecraft Office. The first target application is to cultivate and measure gene expression of the photosynthetic bacterium Synechococcus elongatus, i.e. a cyanobacterium known to exhibit remarkable metabolic diversity and resilience to adverse conditions
Visual Comparison of Multiple Gene Expression Datasets in a Genomic Context

Directory of Open Access Journals (Sweden)

Borowski Krzysztof

2008-06-01

Full Text Available The need for novel methods of visualizing microarray data is growing. New perspectives are beneficial to finding patterns in expression data. The Bluejay genome browser provides an integrative way of visualizing gene expression datasets in a genomic context. We have now developed the functionality to display multiple microarray datasets simultaneously in Bluejay, in order to provide researchers with a comprehensive view of their datasets linked to a graphical representation of gene function. This will enable biologists to obtain valuable insights into expression patterns, by allowing them to analyze the expression values in relation to the gene locations as well as to compare expression profiles of related genomes or of different experiments for the same genome.
Processing scarce biological samples for light and transmission electron microscopy

Directory of Open Access Journals (Sweden)

P Taupin

2008-06-01

Full Text Available Light microscopy (LM) and transmission electron microscopy (TEM) aim at understanding the structure-function relationship. With advances in biology, isolation and purification of scarce populations of cells or subcellular structures may not yield enough biological material for LM and TEM processing. A protocol for the preparation of scarce biological samples is presented. It is based on pre-embedding the biological samples, as suspensions or pellets, in bovine serum albumin (BSA) and bis-acrylamide (BA), which are then cross-linked and polymerized. This preparation provides a simple and reproducible technique for processing biological materials that are present in limited quantities and cannot be amplified, for light and transmission electron microscopy.
Fast Construction of Near Parsimonious Hybridization Networks for Multiple Phylogenetic Trees.

Science.gov (United States)

Mirzaei, Sajad; Wu, Yufeng

2016-01-01

Hybridization networks represent plausible evolutionary histories of species that are affected by reticulate evolutionary processes. An established computational problem on hybridization networks is constructing the most parsimonious hybridization network such that each of the given phylogenetic trees (called gene trees) is "displayed" in the network. There have been several previous approaches, including an exact method and several heuristics, for this NP-hard problem. However, the exact method is only applicable to a limited range of data, and heuristic methods can be less accurate and also slow sometimes. In this paper, we develop a new algorithm for constructing near parsimonious networks for multiple binary gene trees. This method is more efficient for large numbers of gene trees than previous heuristics. This new method also produces more parsimonious results on many simulated datasets as well as a real biological dataset than a previous method. We also show that our method produces topologically more accurate networks for many datasets.
A comprehensive experiment for molecular biology: Determination of single nucleotide polymorphism in human REV3 gene using PCR-RFLP.

Science.gov (United States)

Zhang, Xu; Shao, Meng; Gao, Lu; Zhao, Yuanyuan; Sun, Zixuan; Zhou, Liping; Yan, Yongmin; Shao, Qixiang; Xu, Wenrong; Qian, Hui

2017-07-08

Laboratory exercise is helpful for medical students to understand the basic principles of molecular biology and to learn about the practical applications of molecular biology. We have designed a lab course on molecular biology about the determination of single nucleotide polymorphism (SNP) in human REV3 gene, the product of which is a subunit of DNA polymerase ζ and SNPs in this gene are associated with altered susceptibility to cancer. This newly designed experiment is composed of three parts, including genomic DNA extraction, gene amplification by PCR, and genotyping by RFLP. By combining these activities, the students are not only able to learn a series of biotechniques in molecular biology, but also acquire the ability to link the learned knowledge with practical applications. This comprehensive experiment will help the medical students improve the conceptual understanding of SNP and the technical understanding of SNP detection. © 2017 by The International Union of Biochemistry and Molecular Biology, 45(4):299-304, 2017. © 2017 The International Union of Biochemistry and Molecular Biology.
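The genotyping-by-RFLP step can be illustrated with a minimal sketch. The amplicon sequences, the choice of the EcoRI site and the function names below are hypothetical and are not taken from the course described above; the sketch only shows the general idea that a SNP which creates or destroys a restriction site changes the fragment pattern seen on a gel.

```python
def digest(seq, site, cut_offset):
    """Cut seq at every occurrence of site and return the fragment lengths.

    cut_offset is the position inside the recognition site where the
    enzyme cuts (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(len(seq) - start)
    return fragments


def call_genotype(observed_bands, uncut_len, cut_lens):
    """Call a genotype from the set of band sizes observed after digestion."""
    bands = set(observed_bands)
    has_uncut = uncut_len in bands
    has_cut = set(cut_lens) <= bands
    if has_uncut and has_cut:
        return "heterozygous"
    if has_cut:
        return "homozygous (site present)"
    if has_uncut:
        return "homozygous (site absent)"
    return "undetermined"


# Hypothetical 20-bp amplicons that differ at a single base (A vs G),
# so that only one allele carries the GAATTC recognition site.
allele_with_site = "ACGTACGAATTCGGCTAGCT"
allele_without_site = "ACGTACGAGTTCGGCTAGCT"

cut_pattern = digest(allele_with_site, "GAATTC", 1)
uncut_pattern = digest(allele_without_site, "GAATTC", 1)

# A heterozygous sample shows the uncut band plus both cut fragments.
het_call = call_genotype(uncut_pattern + cut_pattern, 20, cut_pattern)
```

In a real exercise the band sizes would be read from the gel with some tolerance rather than matched exactly; exact matching is used here only to keep the sketch short.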
Meta-Analysis of Multiple Sclerosis Microarray Data Reveals Dysregulation in RNA Splicing Regulatory Genes

Directory of Open Access Journals (Sweden)

Elvezia Maria Paraboschi

2015-09-01

Full Text Available Abnormalities in RNA metabolism and alternative splicing (AS) are emerging as important players in complex disease phenotypes. In particular, accumulating evidence suggests the existence of pathogenic links between multiple sclerosis (MS) and altered AS, including functional studies showing that an imbalance in alternatively-spliced isoforms may contribute to disease etiology. Here, we tested whether the altered expression of AS-related genes represents an MS-specific signature. A comprehensive comparative analysis of gene expression profiles of publicly-available microarray datasets (190 MS cases, 182 controls), followed by gene-ontology enrichment analysis, highlighted a significant enrichment for differentially-expressed genes involved in RNA metabolism/AS. In detail, a total of 17 genes were found to be differentially expressed in MS in multiple datasets, with CELF1 being dysregulated in five out of seven studies. We confirmed CELF1 downregulation in MS (p = 0.0015) by real-time RT-PCR on RNA extracted from blood cells of 30 cases and 30 controls. As a proof of concept, we experimentally verified the imbalance in alternatively-spliced isoforms in MS of the NFAT5 gene, a putative CELF1 target. In conclusion, for the first time we provide evidence of a consistent dysregulation of splicing-related genes in MS and we discuss its possible implications in modulating specific AS events in MS susceptibility genes.
Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

Science.gov (United States)

Beauparlant, Marc A; Drouin, Guy

2014-02-01

Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.
Precision control of recombinant gene transcription for CHO cell synthetic biology.

Science.gov (United States)

Brown, Adam J; James, David C

2016-01-01

The next generation of mammalian cell factories for biopharmaceutical production will be genetically engineered to possess both generic and product-specific manufacturing capabilities that may not exist naturally. Introduction of entirely new combinations of synthetic functions (e.g. novel metabolic or stress-response pathways), and retro-engineering of existing functional cell modules will drive disruptive change in cellular manufacturing performance. However, before we can apply the core concepts underpinning synthetic biology (design, build, test) to CHO cell engineering we must first develop practical and robust enabling technologies. Fundamentally, we will require the ability to precisely control the relative stoichiometry of numerous functional components we simultaneously introduce into the host cell factory. In this review we discuss how this can be achieved by design of engineered promoters that enable concerted control of recombinant gene transcription. We describe the specific mechanisms of transcriptional regulation that affect promoter function during bioproduction processes, and detail the highly-specific promoter design criteria that are required in the context of CHO cell engineering. The relative applicability of diverse promoter development strategies is discussed, including re-engineering of natural sequences, design of synthetic transcription factor-based systems, and construction of synthetic promoters. This review highlights the potential of promoter engineering to achieve precision transcriptional control for CHO cell synthetic biology. Copyright © 2015. Published by Elsevier Inc.
Bayesian uncertainty analysis for complex systems biology models: emulation, global parameter searches and evaluation of gene functions.

Science.gov (United States)

Vernon, Ian; Liu, Junli; Goldstein, Michael; Rowe, James; Topping, Jen; Lindsey, Keith

2018-01-02

Many mathematical models have now been employed across every area of systems biology. These models increasingly involve large numbers of unknown parameters, have complex structure which can result in substantial evaluation time relative to the needs of the analysis, and need to be compared to observed data of various forms. The correct analysis of such models usually requires a global parameter search, over a high dimensional parameter space, that incorporates and respects the most important sources of uncertainty. This can be an extremely difficult task, but it is essential for any meaningful inference or prediction to be made about any biological system. It hence represents a fundamental challenge for the whole of systems biology. Bayesian statistical methodology for the uncertainty analysis of complex models is introduced, which is designed to address the high dimensional global parameter search problem. Bayesian emulators that mimic the systems biology model but which are extremely fast to evaluate are embedded within an iterative history match: an efficient method to search high dimensional spaces within a more formal statistical setting, while incorporating major sources of uncertainty. The approach is demonstrated via application to a model of hormonal crosstalk in Arabidopsis root development, which has 32 rate parameters, for which we identify the sets of rate parameter values that lead to acceptable matches between model output and observed trend data. The multiple insights into the model's structure that this analysis provides are discussed. The methodology is applied to a second related model, and the biological consequences of the resulting comparison, including the evaluation of gene functions, are described. Bayesian uncertainty analysis for complex models using both emulators and history matching is shown to be a powerful technique that can greatly aid the study of a large class of systems biology models. It both provides insight into model behaviour
The SH2D2A gene and susceptibility to multiple sclerosis

DEFF Research Database (Denmark)

Lorentzen, A.R.; Smestad, C.; Lie, B.A.

2008-01-01

We previously reported an association between the SH2D2A gene encoding TSAd and multiple sclerosis (MS). Here a total of 2128 Nordic MS patients and 2004 controls were genotyped for the SH2D2A promoter GA repeat polymorphism and rs926103 encoding a serine to asparagine substitution at amino acid...... that the SH2D2A gene may contribute to susceptibility to MS. Publication date: 2008/7/15...
Interspecies systems biology uncovers metabolites affecting C. elegans gene expression and life history traits.

Science.gov (United States)

Watson, Emma; MacNeil, Lesley T; Ritter, Ashlyn D; Yilmaz, L Safak; Rosebrock, Adam P; Caudy, Amy A; Walhout, Albertha J M

2014-02-13

Diet greatly influences gene expression and physiology. In mammals, elucidating the effects and mechanisms of individual nutrients is challenging due to the complexity of both the animal and its diet. Here, we used an interspecies systems biology approach with Caenorhabditis elegans and two of its bacterial diets, Escherichia coli and Comamonas aquatica, to identify metabolites that affect the animal's gene expression and physiology. We identify vitamin B12 as the major dilutable metabolite provided by Comamonas aq. that regulates gene expression, accelerates development, and reduces fertility but does not affect lifespan. We find that vitamin B12 has a dual role in the animal: it affects development and fertility via the methionine/S-Adenosylmethionine (SAM) cycle and breaks down the short-chain fatty acid propionic acid, preventing its toxic buildup. Our interspecies systems biology approach provides a paradigm for understanding complex interactions between diet and physiology. Copyright © 2014 Elsevier Inc. All rights reserved.
CRISPR-Cas9: a promising tool for gene editing on induced pluripotent stem cells

Science.gov (United States)

Kim, Eun Ji; Kang, Ki Ho; Ju, Ji Hyeon

2017-01-01

Recent advances in genome editing with programmable nucleases have opened up new avenues for multiple applications, from basic research to clinical therapy. The ease of use of the technology—and particularly clustered regularly interspaced short palindromic repeats (CRISPR)—will allow us to improve our understanding of genomic variation in disease processes via cellular and animal models. Here, we highlight the progress made in correcting gene mutations in monogenic hereditary disorders and discuss various CRISPR-associated applications, such as cancer research, synthetic biology, and gene therapy using induced pluripotent stem cells. The challenges, ethical issues, and future prospects of CRISPR-based systems for human research are also discussed. PMID:28049282
Functional characterization of the Drosophila MRP (mitochondrial RNA processing) RNA gene.

Science.gov (United States)

Schneider, Mary D; Bains, Anupinder K; Rajendra, T K; Dominski, Zbigniew; Matera, A Gregory; Simmonds, Andrew J

2010-11-01

MRP RNA is a noncoding RNA component of RNase mitochondrial RNA processing (MRP), a multi-protein eukaryotic endoribonuclease reported to function in multiple cellular processes, including ribosomal RNA processing, mitochondrial DNA replication, and cell cycle regulation. A recent study predicted a potential Drosophila ortholog of MRP RNA (CR33682) by computer-based genome analysis. We have confirmed the expression of this gene and characterized the phenotype associated with this locus. Flies with mutations that specifically affect MRP RNA show defects in growth and development that begin in the early larval period and end in larval death during the second instar stage. We present several lines of evidence demonstrating a role for Drosophila MRP RNA in rRNA processing. The nuclear fraction of Drosophila MRP RNA localizes to the nucleolus. Further, a mutant strain shows defects in rRNA processing that include a defect in 5.8S rRNA processing, typical of MRP RNA mutants in other species, as well as defects in early stages of rRNA processing.
Expression Profiling of Genes Related to Endothelial Cells Biology in Patients with Type 2 Diabetes and Patients with Prediabetes

Directory of Open Access Journals (Sweden)

Sara Moradipoor

2016-01-01

Full Text Available Endothelial dysfunction appears to be an early sign indicating vascular damage and predicts the progression of atherosclerosis and cardiovascular disorders. Extensive clinical and experimental evidence suggests that endothelial dysfunction occurs in Type 2 Diabetes Mellitus (T2DM) and prediabetes patients. This study was carried out with an aim to appraise the expression levels in the peripheral blood of 84 genes related to endothelial cells biology in patients with diagnosed T2DM or prediabetes, trying to identify new genes whose expression might be changed under these pathological conditions. The study covered a total of 45 participants. The participants were divided into three groups: group 1, patients with T2DM; group 2, patients with prediabetes; group 3, control group. The gene expression analysis was performed using the Endothelial Cell Biology RT2 Profiler PCR Array. In the case of T2DM, 59 genes were found to be upregulated, and four genes were observed to be downregulated. In prediabetes patients, increased expression was observed for 49 genes, with two downregulated genes observed. Our results indicate that diabetic and prediabetic conditions change the expression levels of genes related to endothelial cells biology and, consequently, may increase the risk for occurrence of endothelial dysfunction.
Bears in a forest of gene trees: phylogenetic inference is complicated by incomplete lineage sorting and gene flow.

Science.gov (United States)

Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel

2014-08-01

Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide analysis of gene expression in primate taste buds reveals links to diverse processes.

Directory of Open Access Journals (Sweden)

Peter Hevezi

Full Text Available Efforts to unravel the mechanisms underlying taste sensation (gustation) have largely focused on rodents. Here we present the first comprehensive characterization of gene expression in primate taste buds. Our findings reveal unique new insights into the biology of taste buds. We generated a taste bud gene expression database using laser capture microdissection (LCM) procured fungiform (FG) and circumvallate (CV) taste buds from primates. We also used LCM to collect the top and bottom portions of CV taste buds. Affymetrix genome wide arrays were used to analyze gene expression in all samples. Known taste receptors are preferentially expressed in the top portion of taste buds. Genes associated with the cell cycle and stem cells are preferentially expressed in the bottom portion of taste buds, suggesting that precursor cells are located there. Several chemokines including CXCL14 and CXCL8 are among the highest expressed genes in taste buds, indicating that immune system related processes are active in taste buds. Several genes expressed specifically in endocrine glands including growth hormone releasing hormone and its receptor are also strongly expressed in taste buds, suggesting a link between metabolism and taste. Cell type-specific expression of transcription factors and signaling molecules involved in cell fate, including KIT, reveals the taste bud as an active site of cell regeneration, differentiation, and development. IKBKAP, a gene mutated in familial dysautonomia, a disease that results in loss of taste buds, is expressed in taste cells that communicate with afferent nerve fibers via synaptic transmission. This database highlights the power of LCM coupled with transcriptional profiling to dissect the molecular composition of normal tissues, represents the most comprehensive molecular analysis of primate taste buds to date, and provides a foundation for further studies in diverse aspects of taste biology.
An introduction to stochastic processes with applications to biology

CERN Document Server

Allen, Linda J S

2010-01-01

An Introduction to Stochastic Processes with Applications to Biology, Second Edition presents the basic theory of stochastic processes necessary in understanding and applying stochastic methods to biological problems in areas such as population growth and extinction, drug kinetics, two-species competition and predation, the spread of epidemics, and the genetics of inbreeding. Because of their rich structure, the text focuses on discrete and continuous time Markov chains and continuous time and state Markov processes. New to the Second Edition: A new chapter on stochastic differential equations th
Polymorphisms in genes encoding leptin, ghrelin and their receptors in German multiple sclerosis patients.

Science.gov (United States)

Rey, Linda K; Wieczorek, Stefan; Akkad, Denis A; Linker, Ralf A; Chan, Andrew; Hoffjan, Sabine

2011-01-01

Multiple sclerosis (MS) is a neuro-inflammatory, autoimmune disease influenced by environmental and polygenic components. There is growing evidence that the peptide hormone leptin, known to regulate energy homeostasis, as well as its antagonist ghrelin play an important role in inflammatory processes in autoimmune diseases, including MS. Recently, single nucleotide polymorphisms (SNPs) in the genes encoding leptin, ghrelin and their receptors were evaluated, amongst others, in Wegener's granulomatosis and Churg-Strauss syndrome. The Lys656Asn SNP in the LEPR gene showed a significant but contrasting association with these vasculitides. We therefore aimed at investigating these polymorphisms in a German MS case-control cohort. Twelve SNPs in the LEP, LEPR, GHRL and GHSR genes were genotyped in 776 MS patients and 878 control subjects. We found an association of a haplotype in the GHSR gene with MS that could not be replicated in a second cohort. Otherwise, no significant differences in allele or genotype frequencies were observed between patients and controls in this particular cohort. Thus, the present results do not support the hypothesis that genetic variation in the leptin/ghrelin system contributes substantially to the pathogenesis of MS. However, a modest effect of GHSR variation cannot be ruled out and needs to be further evaluated in future studies. Copyright © 2011 Elsevier Ltd. All rights reserved.

Mining Functional Modules in Heterogeneous Biological Networks Using Multiplex PageRank Approach.

Science.gov (United States)

Li, Jun; Zhao, Patrick X

2016-01-01

Identification of functional modules/sub-networks in large-scale biological networks is one of the important research challenges in current bioinformatics and systems biology. Approaches have been developed to identify functional modules in single-class biological networks; however, methods for systematically and interactively mining multiple classes of heterogeneous biological networks are lacking. In this paper, we present a novel algorithm (called mPageRank) that utilizes the Multiplex PageRank approach to mine functional modules from two classes of biological networks. We demonstrate the capabilities of our approach by successfully mining functional biological modules through integrating expression-based gene-gene association networks and protein-protein interaction networks. We first compared the performance of our method with that of other methods using simulated data. We then applied our method to identify the cell division cycle related functional module and plant signaling defense-related functional module in the model plant Arabidopsis thaliana. Our results demonstrated that the mPageRank method is effective for mining sub-networks in both expression-based gene-gene association networks and protein-protein interaction networks, and has the potential to be adapted for the discovery of functional modules/sub-networks in other heterogeneous biological networks. The mPageRank executable program, source code, the datasets and results of the presented two case studies are publicly and freely available at http://plantgrn.noble.org/MPageRank/.
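The idea of ranking nodes in one network layer while taking their importance in another layer into account can be sketched as follows. This is a minimal illustration under stated assumptions, not the mPageRank program: the function name, the parameters `beta` and `gamma`, the dangling-node handling and the two toy four-gene networks are all hypothetical, and the abstract does not specify the exact update rule the authors use.

```python
def pagerank(adj, bias=None, beta=0.0, gamma=0.0, alpha=0.85, iters=200):
    """Power-iteration PageRank on a dense adjacency matrix.

    adj[j][i] is the weight of the edge j -> i. `bias` holds per-node
    scores from another layer (assumed strictly positive). beta > 0 makes
    the walker prefer neighbours with a high bias score; gamma > 0 makes
    teleportation prefer nodes with a high bias score. With bias=None this
    reduces to ordinary PageRank.
    """
    n = len(adj)
    if bias is None:
        bias = [1.0] * n
    # Edge weights re-scaled by the bias of the target node.
    w = [[adj[j][i] * bias[i] ** beta for i in range(n)] for j in range(n)]
    out = [sum(row) for row in w]
    tele = [b ** gamma for b in bias]
    tele_sum = sum(tele)
    tele = [t / tele_sum for t in tele]

    rank = [1.0 / n] * n
    for _ in range(iters):
        new = [0.0] * n
        dangling = 0.0
        for j in range(n):
            if out[j] == 0:
                dangling += rank[j]  # node without outgoing edges
                continue
            for i in range(n):
                if w[j][i]:
                    new[i] += alpha * rank[j] * w[j][i] / out[j]
        # Dangling mass and teleportation both follow the teleport vector
        # (a design choice of this sketch).
        for i in range(n):
            new[i] += (alpha * dangling + (1.0 - alpha)) * tele[i]
        rank = new
    return rank


# Toy layer A (e.g. co-expression): gene 0 is a hub.
coexpr = [[0, 1, 1, 1],
          [1, 0, 0, 0],
          [1, 0, 0, 0],
          [1, 0, 0, 0]]
# Toy layer B (e.g. protein-protein interaction): a path 0-1-2-3.
ppi = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]

layer_a = pagerank(coexpr)
plain_b = pagerank(ppi)
biased_b = pagerank(ppi, bias=layer_a, beta=1.0, gamma=1.0)
```

In the toy data, genes 0 and 3 are interchangeable in layer B alone, but gene 0 ranks higher once its hub status in layer A is used as a bias. Candidate modules could then be formed from top-ranked, connected nodes, which is again an assumption of this sketch rather than a description of the published method.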
Gene expression analysis of overwintering mountain pine beetle larvae suggests multiple systems involved in overwintering stress, cold hardiness, and preparation for spring development

Directory of Open Access Journals (Sweden)

Jeanne A. Robert

2016-07-01

Full Text Available Cold-induced mortality has historically been a key aspect of mountain pine beetle, Dendroctonus ponderosae Hopkins (Coleoptera: Curculionidae), population control, but little is known about the molecular basis for cold tolerance in this insect. We used RNA-seq analysis to monitor gene expression patterns of mountain pine beetle larvae at four time points during their overwintering period: early autumn, late autumn, early spring, and late spring. Changing transcript profiles over the winter indicate a multipronged physiological response from larvae that is broadly characterized by gene transcripts involved in insect immune responses and detoxification during the autumn. In the spring, although transcripts associated with developmental processes are present, there was no particular biological process dominating the transcriptome.
Association between expression of random gene sets and survival is evident in multiple cancer types and may be explained by sub-classification

Science.gov (United States)

2018-01-01

One of the goals of cancer research is to identify a set of genes that cause or control disease progression. However, although multiple such gene sets have been published, they are usually in very poor agreement with each other, and very few of the genes have proved to be functional therapeutic targets. Furthermore, recent findings from a breast cancer gene-expression cohort showed that sets of genes selected randomly can be used to predict survival with a much higher probability than expected. These results imply that many of the genes identified in breast cancer gene expression analysis may not be causal of cancer progression, even though they can still be highly predictive of prognosis. We performed a similar analysis on all the cancer types available in The Cancer Genome Atlas (TCGA), namely, estimating the predictive power of random gene sets for survival. Our work shows that most cancer types exhibit the property that random selections of genes are more predictive of survival than expected. In contrast to previous work, this property is not removed by using a proliferation signature, which implies that proliferation may not always be the confounder that drives this property. We suggest one possible solution in the form of data-driven sub-classification to reduce this property significantly. Our results suggest that the predictive power of random gene sets may be used to identify the existence of sub-classes in the data, and thus may allow better understanding of patient stratification. Furthermore, by reducing the observed bias this may allow more direct identification of biologically relevant, and potentially causal, genes. PMID:29470520
A fast and efficient gene-network reconstruction method from multiple over-expression experiments

Directory of Open Access Journals (Sweden)

Thurner Stefan

2009-08-01

Full Text Available Abstract Background: Reverse engineering of gene regulatory networks presents one of the big challenges in systems biology. Gene regulatory networks are usually inferred from a set of single-gene over-expressions and/or knockout experiments. Functional relationships between genes are retrieved either from the steady state gene expressions or from respective time series. Results: We present a novel algorithm for gene network reconstruction on the basis of steady-state gene-chip data from over-expression experiments. The algorithm is based on a straightforward solution of a linear gene-dynamics equation, where experimental data is fed in as a first predictor for the solution. We compare the algorithm's performance with the NIR algorithm, both on the well known E. coli experimental data and on in-silico experiments. Conclusion: We show superiority of the proposed algorithm in the number of correctly reconstructed links and discuss computational time and robustness. The proposed algorithm is not limited by combinatorial explosion problems and can be used in principle for large networks.
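A minimal sketch of steady-state reconstruction from a linear model may help make this concrete. It assumes, as an illustration only, dynamics of the form dx/dt = A x + u, where A is the interaction matrix, x the expression changes and u the over-expression perturbation; the abstract does not give the authors' equation, and their algorithm additionally uses the data as a first predictor, which this naive version does not. At steady state A X = -U, so with one experiment per gene (columns of X and U) and an invertible X, A = -U X^{-1}. All names and numbers below are hypothetical.

```python
def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_inv(m):
    """Invert a square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(m)
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def reconstruct(X, U):
    """Recover A from steady states X and perturbations U via A = -U X^{-1}."""
    neg_u = [[-v for v in row] for row in U]
    return mat_mul(neg_u, mat_inv(X))


# Hypothetical 3-gene network: negative diagonal (degradation) plus three links.
A_true = [[-1.0, 0.5, 0.0],
          [0.0, -1.0, 0.3],
          [0.4, 0.0, -1.0]]
# One unit over-expression per gene: U is the identity matrix.
U = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]
# Simulated noise-free steady states: X = -A_true^{-1} U.
X = mat_mul([[-v for v in row] for row in mat_inv(A_true)], U)

A_est = reconstruct(X, U)
```

With noisy data or fewer experiments than genes the exact inverse is no longer appropriate, and some form of regression or regularization would be needed; that choice is outside the scope of this sketch.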
On the Origin of De Novo Genes in Arabidopsis thaliana Populations.

Science.gov (United States)

Li, Zi-Wen; Chen, Xi; Wu, Qiong; Hagmann, Jörg; Han, Ting-Shen; Zou, Yu-Pan; Ge, Song; Guo, Ya-Long

2016-08-03

De novo genes, which originate from ancestral nongenic sequences, are one of the most important sources of protein-coding genes. This origination process is crucial for the adaptation of organisms. However, how de novo genes arise and become fixed in a population or species remains largely unknown. Here, we identified 782 de novo genes from the model plant Arabidopsis thaliana and divided them into three types based on the availability of translational evidence, transcriptional evidence, and neither transcriptional nor translational evidence for their origin. Importantly, by integrating multiple types of omics data, including data from genomes, epigenomes, transcriptomes, and translatomes, we found that epigenetic modifications (DNA methylation and histone modification) play an important role in the origination process of de novo genes. Intriguingly, using the transcriptomes and methylomes from the same population of 84 accessions, we found that de novo genes that are transcribed in approximately half of the total accessions within the population are highly methylated, with lower levels of transcription than those transcribed at other frequencies within the population. We hypothesized that, during the origin of de novo gene alleles, those neutralized to low expression states via DNA methylation have relatively high probabilities of spreading and becoming fixed in a population. Our results highlight the process underlying the origin of de novo genes at the population level, as well as the importance of DNA methylation in this process. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Investigation of a miRNA-Induced Gene Silencing Technique in Petunia Reveals Alterations in miR173 Precursor Processing and the Accumulation of Secondary siRNAs from Endogenous Genes.

Directory of Open Access Journals (Sweden)

Yao Han

Full Text Available MIGS (miRNA-induced gene silencing) is a straightforward and efficient gene silencing technique in Arabidopsis. It works by exploiting miR173 to trigger the production of phasiRNAs (phased small interfering RNAs). MIGS can be used in plant species other than Arabidopsis by co-expression of miR173 and target gene fragments fused to an upstream miR173 target site. However, the efficiency and technical mechanisms have not been thoroughly investigated in other plants. In this work, two vectors, pMIGS-chs and pMIGS-pds, were constructed and transformed into petunia plants. The transgenic plants showed CHS (chalcone synthase) and PDS (phytoene desaturase) gene-silencing phenotypes respectively, indicating that MIGS functions in petunia. MIGS-chs plants were used to investigate the mechanisms of this technique in petunia. Results of 5'- RACE showed that the miR173 target site was cleaved at the expected position and that endogenous CHS genes were cut at multiple positions. Small RNA deep sequencing analysis showed that the processing of Arabidopsis miR173 precursors in MIGS-chs transgenic petunia plants did not occur in exactly the same way as in Arabidopsis, suggesting differences in the machinery of miRNA processing between plant species. Small RNAs in-phase with the miR173 cleavage register were produced immediately downstream from the cleavage site and out-of-phase small RNAs were accumulated at relatively high levels from processing cycle 5 onwards. Secondary siRNAs were generated from multiple sites of endogenous CHS-A and CHS-J genes, indicating that miR173 cleavage induced siRNAs have the same ability to initiate siRNA transitivity as the siRNAs functioning in co-suppression and hpRNA silencing. On account of the simplicity of vector construction and the transitive amplification of signals from endogenous transcripts, MIGS is a good alternative gene silencing method for plants, especially for silencing a cluster of homologous genes with redundant
Investigation of a miRNA-Induced Gene Silencing Technique in Petunia Reveals Alterations in miR173 Precursor Processing and the Accumulation of Secondary siRNAs from Endogenous Genes.

Science.gov (United States)

Han, Yao; Zhang, Bin; Qin, Xiaoting; Li, Mingyang; Guo, Yulong

2015-01-01

MIGS (miRNA-induced gene silencing) is a straightforward and efficient gene silencing technique in Arabidopsis. It works by exploiting miR173 to trigger the production of phasiRNAs (phased small interfering RNAs). MIGS can be used in plant species other than Arabidopsis by co-expression of miR173 and target gene fragments fused to an upstream miR173 target site. However, the efficiency and technical mechanisms have not been thoroughly investigated in other plants. In this work, two vectors, pMIGS-chs and pMIGS-pds, were constructed and transformed into petunia plants. The transgenic plants showed CHS (chalcone synthase) and PDS (phytoene desaturase) gene-silencing phenotypes respectively, indicating that MIGS functions in petunia. MIGS-chs plants were used to investigate the mechanisms of this technique in petunia. Results of 5'- RACE showed that the miR173 target site was cleaved at the expected position and that endogenous CHS genes were cut at multiple positions. Small RNA deep sequencing analysis showed that the processing of Arabidopsis miR173 precursors in MIGS-chs transgenic petunia plants did not occur in exactly the same way as in Arabidopsis, suggesting differences in the machinery of miRNA processing between plant species. Small RNAs in-phase with the miR173 cleavage register were produced immediately downstream from the cleavage site and out-of-phase small RNAs were accumulated at relatively high levels from processing cycle 5 onwards. Secondary siRNAs were generated from multiple sites of endogenous CHS-A and CHS-J genes, indicating that miR173 cleavage induced siRNAs have the same ability to initiate siRNA transitivity as the siRNAs functioning in co-suppression and hpRNA silencing. On account of the simplicity of vector construction and the transitive amplification of signals from endogenous transcripts, MIGS is a good alternative gene silencing method for plants, especially for silencing a cluster of homologous genes with redundant functions.
From noise to synthetic nucleoli: can synthetic biology achieve new insights?

Science.gov (United States)

Ciechonska, Marta; Grob, Alice; Isalan, Mark

2016-04-18

Synthetic biology aims to re-organise and control biological components to make functional devices. Along the way, the iterative process of designing and testing gene circuits has the potential to yield many insights into the functioning of the underlying chassis of cells. Thus, synthetic biology is converging with disciplines such as systems biology and even classical cell biology, to give a new level of predictability to gene expression, cell metabolism and cellular signalling networks. This review gives an overview of the contributions that synthetic biology has made in understanding gene expression, in terms of cell heterogeneity (noise), the coupling of growth and energy usage to expression, and spatiotemporal considerations. We mainly compare progress in bacterial and mammalian systems, which have some of the most-developed engineering frameworks. Overall, one view of synthetic biology can be neatly summarised as "creating in order to understand."
Multiple production of hadrons in deep inelastic processes

International Nuclear Information System (INIS)

Kiselev, A.V.; Petrov, V.A.

1985-01-01

Formulas are proposed for the description of the mean multiplicity of hadrons in deep inelastic processes. On the basis of the existing data, predictions are made for the behavior of the mean multiplicity at higher energies.
Efficient multitasking: parallel versus serial processing of multiple tasks.

Science.gov (United States)

Fischer, Rico; Plessow, Franziska

2015-01-01

In the context of performance optimizations in multitasking, a central debate has unfolded in multitasking research around whether cognitive processes related to different tasks proceed only sequentially (one at a time), or can operate in parallel (simultaneously). This review features a discussion of theoretical considerations and empirical evidence regarding parallel versus serial task processing in multitasking. In addition, we highlight how methodological differences and theoretical conceptions determine the extent to which parallel processing in multitasking can be detected, to guide their employment in future research. Parallel and serial processing of multiple tasks are not mutually exclusive. Therefore, questions focusing exclusively on either task-processing mode are too simplified. We review empirical evidence and demonstrate that shifting between more parallel and more serial task processing critically depends on the conditions under which multiple tasks are performed. We conclude that efficient multitasking is reflected by the ability of individuals to adjust multitasking performance to environmental demands by flexibly shifting between different processing strategies of multiple task-component scheduling.
Industrial systems biology and its impact on synthetic biology of yeast cell factories

DEFF Research Database (Denmark)

Fletcher, Eugene; Krivoruchko, Anastasia; Nielsen, Jens

2016-01-01

Engineering industrial cell factories to effectively yield a desired product while dealing with industrially relevant stresses is usually the most challenging step in the development of industrial production of chemicals using microbial fermentation processes. Using synthetic biology tools......, microbial cell factories such as Saccharomyces cerevisiae can be engineered to express synthetic pathways for the production of fuels, biopharmaceuticals, fragrances, and food flavors. However, directing fluxes through these synthetic pathways towards the desired product can be demanding due to complex...... regulation or poor gene expression. Systems biology, which applies computational tools and mathematical modeling to understand complex biological networks, can be used to guide synthetic biology design. Here, we present our perspective on how systems biology can impact synthetic biology towards the goal...
Expression of novel rice gibberellin 2-oxidase gene is under homeostatic regulation by biologically active gibberellins.

Science.gov (United States)

Sakai, Miho; Sakamoto, Tomoaki; Saito, Tamio; Matsuoka, Makoto; Tanaka, Hiroshi; Kobayashi, Masatomo

2003-04-01

We have cloned two genes for gibberellin (GA) 2-oxidase from rice (Oryza sativa L.). Expression of OsGA2ox2 was not observed. The other gene, OsGA2ox3, was expressed in every tissue examined and was enhanced by the application of biologically active GA. Recombinant OsGA2ox3 protein catalyzed the metabolism of GA(1) to GA(8) and GA(20) to GA(29)-catabolite. These results indicate that OsGA2ox3 is involved in the homeostatic regulation of the endogenous level of biologically active GA in rice.
[Advance in molecular biology of Dendrobium (Orchidaceae)].

Science.gov (United States)

Li, Qing; Li, Biao; Guo, Shun-Xing

2016-08-01

With the development of molecular biology, research on the molecular biology of Dendrobium is progressing quickly. This research has not only provided new ways to identify Dendrobium species rapidly and revealed the genetic diversity and relationships within Dendrobium, but has also laid a vital foundation for explaining the mechanisms of Dendrobium growth and metabolism. The present paper reviews recent progress in molecular biology research on Dendrobium from three aspects: molecular identification, genetic diversity and functional genes. This review is intended to facilitate the development of this research area and of Dendrobium. Copyright© by the Chinese Pharmaceutical Association.
Stochastic transport processes in discrete biological systems

CERN Document Server

Frehland, Eckart

1982-01-01

These notes are in part based on a course for advanced students in the applications of stochastic processes held in 1978 at the University of Konstanz. These notes contain the results of recent studies on the stochastic description of ion transport through biological membranes. In particular, they serve as an introduction to a unified theory of fluctuations in complex biological transport systems. We emphasize that the subject of this volume is not to introduce the mathematics of stochastic processes but to present a field of theoretical biophysics in which stochastic methods are important. In the last years the study of membrane noise has become an important method in biophysics. Valuable information on the ion transport mechanisms in membranes can be obtained from noise analysis. A number of different processes such as the opening and closing of ion channels have been shown to be sources of the measured current or voltage fluctuations. Biological transport systems can be complex. For example, the tr...
Modeling the Activity of Single Genes

Science.gov (United States)

Mjolsness, Eric; Gibson, Michael

1999-01-01

The central dogma of molecular biology states that information is stored in DNA, transcribed to messenger RNA (mRNA) and then translated into proteins. This picture is significantly augmented when we consider the action of certain proteins in regulating transcription. These transcription factors provide a feedback pathway by which genes can regulate one another's expression as mRNA and then as protein. To review: DNA, RNA and proteins have different functions. DNA is the molecular storehouse of genetic information. When cells divide, the DNA is replicated, so that each daughter cell maintains the same genetic information as the mother cell. RNA acts as a go-between from DNA to proteins. Only a single copy of DNA is present, but multiple copies of the same piece of RNA may be present, allowing cells to make huge amounts of protein. In eukaryotes (organisms with a nucleus), DNA is found in the nucleus only. RNA is copied in the nucleus then translocates (moves) outside the nucleus, where it is translated into proteins. Along the way, the RNA may be spliced, i.e., may have pieces cut out. RNA then attaches to ribosomes and is translated to proteins. Proteins are the machinery of the cell; other than DNA and RNA, all the complex molecules of the cell are proteins. Proteins are specialized machines, each of which fulfills its own task, which may be transporting oxygen, catalyzing reactions, or responding to extracellular signals, just to name a few. One of the more interesting functions a protein may have is binding directly or indirectly to DNA to perform transcriptional regulation, thus forming a closed feedback loop of gene regulation. The structure of DNA and the central dogma were understood in the 50s; in the early 80s it became possible to make arbitrary modifications to DNA and use cellular machinery to transcribe and translate the resulting genes; more recently, genomes (i.e., the complete DNA sequence) of many organisms have been sequenced. This large
The development and application of a multiple gene co-silencing system using endogenous URA3 as a reporter gene in Ganoderma lucidum.

Directory of Open Access Journals (Sweden)

Dashuai Mu

Full Text Available Ganoderma lucidum is one of the most important medicinal mushrooms; however, molecular genetics research on this species has been limited due to a lack of reliable reverse genetic tools. In this study, the endogenous orotidine 5'-monophosphate decarboxylase gene (URA3) was cloned as a silencing reporter, and four gene-silencing methods, using hairpin, sense, antisense, and dual promoter constructs, were introduced into G. lucidum through a simple electroporation procedure. A comparison and evaluation of silencing efficiency demonstrated that all four methods suppressed the expression of URA3, to differing degrees. Our data unequivocally indicate that the dual promoter silencing vector yields the highest rate of URA3 silencing compared with the other vectors (up to 81.9%). To highlight the advantages of the dual promoter system, we constructed a co-silencing system based on the dual promoter method and succeeded in co-silencing URA3 and laccase in G. lucidum. The reductions in the mRNA levels of the two genes were correlated. Thus, the screening efficiency for RNAi knockdown of multiple genes may be improved by the co-silencing of an endogenous reporter gene. The molecular tools developed in this study should facilitate the isolation of genes and the characterization of the functions of multiple genes in this pharmaceutically important species, and these tools should be highly useful for the study of other basidiomycetes.
Development of a multiple-gene-loading method by combining multi-integration system-equipped mouse artificial chromosome vector and CRISPR-Cas9.

Directory of Open Access Journals (Sweden)

Kazuhisa Honma

Full Text Available Mouse artificial chromosome (MAC) vectors have several advantages as gene delivery vectors, such as stable and independent maintenance in host cells without integration, transferability from donor cells to recipient cells via microcell-mediated chromosome transfer (MMCT), and the potential for loading a megabase-sized DNA fragment. Previously, a MAC containing a multi-integrase platform (MI-MAC) was developed to facilitate the transfer of multiple genes into desired cells. Although the MI system can theoretically hold five gene-loading vectors (GLVs), there are a limited number of drugs available for the selection of multiple-GLV integration. To overcome this issue, we attempted to knock out and reuse drug resistance genes (DRGs) using the CRISPR-Cas9 system. In this study, we developed new methods for multiple-GLV integration. As a proof of concept, we introduced five GLVs in the MI-MAC by these methods, in which each GLV contained a gene encoding a fluorescent or luminescent protein (EGFP, mCherry, BFP, Eluc, and Cluc). Genes of interest (GOI) on the MI-MAC were expressed stably and functionally without silencing in the host cells. Furthermore, the MI-MAC carrying five GLVs was transferred to other cells by MMCT, and the resultant recipient cells exhibited all five fluorescence/luminescence signals. Thus, the MI-MAC was successfully used as a multiple-GLV integration vector using the CRISPR-Cas9 system. The MI-MAC employing these methods may resolve bottlenecks in developing multiple-gene humanized models, multiple-gene monitoring models, disease models, reprogramming, and inducible gene expression systems.
Integrative multicellular biological modeling: a case study of 3D epidermal development using GPU algorithms

Directory of Open Access Journals (Sweden)

Christley Scott

2010-08-01

Full Text Available Abstract Background Simulation of sophisticated biological models requires considerable computational power. These models typically integrate together numerous biological phenomena such as spatially-explicit heterogeneous cells, cell-cell interactions, cell-environment interactions and intracellular gene networks. The recent advent of programming for graphical processing units (GPU) opens up the possibility of developing more integrative, detailed and predictive biological models while at the same time decreasing the computational cost to simulate those models. Results We construct a 3D model of epidermal development and provide a set of GPU algorithms that executes significantly faster than sequential central processing unit (CPU) code. We provide a parallel implementation of the subcellular element method for individual cells residing in a lattice-free spatial environment. Each cell in our epidermal model includes an internal gene network, which integrates cellular interaction of Notch signaling together with environmental interaction of basement membrane adhesion, to specify cellular state and behaviors such as growth and division. We take a pedagogical approach to describing how modeling methods are efficiently implemented on the GPU including memory layout of data structures and functional decomposition. We discuss various programmatic issues and provide a set of design guidelines for GPU programming that are instructive to avoid common pitfalls as well as to extract performance from the GPU architecture. Conclusions We demonstrate that GPU algorithms represent a significant technological advance for the simulation of complex biological models. We further demonstrate with our epidermal model that the integration of multiple complex modeling methods for heterogeneous multicellular biological processes is both feasible and computationally tractable using this new technology. We hope that the provided algorithms and source code will be a
Real-time PCR detection of Fe-type nitrile hydratase genes from environmental isolates suggests horizontal gene transfer between multiple genera.

Science.gov (United States)

Coffey, Lee; Owens, Erica; Tambling, Karen; O'Neill, David; O'Connor, Laura; O'Reilly, Catherine

2010-11-01

Nitriles are widespread in the environment as a result of biological and industrial activity. Nitrile hydratases catalyse the hydration of nitriles to the corresponding amide and are often associated with amidases, which catalyze the conversion of amides to the corresponding acids. Nitrile hydratases have potential as biocatalysts in bioremediation and biotransformation applications, and several successful examples demonstrate the advantages. In this work a real-time PCR assay was designed for the detection of Fe-type nitrile hydratase genes from environmental isolates purified from nitrile-enriched soils and seaweeds. Specific PCR primers were also designed for amplification and sequencing of the genes. Identical or highly homologous nitrile hydratase genes were detected from isolates of numerous genera from geographically diverse sites, as were numerous novel genes. The genes were also detected from isolates of genera not previously reported to harbour nitrile hydratases. The results provide further evidence that many bacteria have acquired the genes via horizontal gene transfer. The real-time PCR assay should prove useful in searching for nitrile hydratases that could have novel substrate specificities and therefore potential in industrial applications.
Towards physical principles of biological evolution

Science.gov (United States)

Katsnelson, Mikhail I.; Wolf, Yuri I.; Koonin, Eugene V.

2018-03-01

Biological systems reach organizational complexity that far exceeds the complexity of any known inanimate objects. Biological entities undoubtedly obey the laws of quantum physics and statistical mechanics. However, is modern physics sufficient to adequately describe, model and explain the evolution of biological complexity? Detailed parallels have been drawn between statistical thermodynamics and the population-genetic theory of biological evolution. Based on these parallels, we outline new perspectives on biological innovation and major transitions in evolution, and introduce a biological equivalent of thermodynamic potential that reflects the innovation propensity of an evolving population. Deep analogies have been suggested to also exist between the properties of biological entities and processes, and those of frustrated states in physics, such as glasses. Such systems are characterized by frustration whereby local states with minimal free energy conflict with the global minimum, resulting in ‘emergent phenomena’. We extend such analogies by examining frustration-type phenomena, such as conflicts between different levels of selection, in biological evolution. These frustration effects appear to drive the evolution of biological complexity. We further address evolution in multidimensional fitness landscapes from the point of view of percolation theory and suggest that percolation at a level above the critical threshold dictates the tree-like evolution of complex organisms. Taken together, these multiple connections between fundamental processes in physics and biology imply that construction of a meaningful physical theory of biological evolution might not be a futile effort. However, it is unrealistic to expect that such a theory can be created in one scoop; if it ever comes to being, this can only happen through integration of multiple physical models of evolutionary processes. Furthermore, the existing framework of theoretical physics is unlikely to suffice

Hidden Markov processes theory and applications to biology

CERN Document Server

Vidyasagar, M

2014-01-01

This book explores important aspects of Markov and hidden Markov processes and the applications of these ideas to various problems in computational biology. The book starts from first principles, so that no previous knowledge of probability is necessary. However, the work is rigorous and mathematical, making it useful to engineers and mathematicians, even those not interested in biological applications. A range of exercises is provided, including drills to familiarize the reader with concepts and more advanced problems that require deep thinking about the theory. Biological applications are t
Genome Engineering and Modification Toward Synthetic Biology for the Production of Antibiotics.

Science.gov (United States)

Zou, Xuan; Wang, Lianrong; Li, Zhiqiang; Luo, Jie; Wang, Yunfu; Deng, Zixin; Du, Shiming; Chen, Shi

2018-01-01

Antibiotic production is often governed by large gene clusters composed of genes related to antibiotic scaffold synthesis, tailoring, regulation, and resistance. With the expansion of genome sequencing, a considerable number of antibiotic gene clusters have been isolated and characterized. Emerging genome engineering techniques make more efficient engineering of antibiotics possible. In addition to genomic editing, multiple synthetic biology approaches have been developed for the exploration and improvement of antibiotic natural products. Here, we review the progress in the development of these genome editing techniques used to engineer new antibiotics, focusing on three aspects of genome engineering: direct cloning of large genomic fragments, genome engineering of gene clusters, and regulation of gene cluster expression. This review will not only summarize the current uses of genomic engineering techniques for cloning and assembly of antibiotic gene clusters or for altering antibiotic synthetic pathways but will also provide perspectives on the future directions of rebuilding biological systems for the design of novel antibiotics. © 2017 Wiley Periodicals, Inc.
Molecular evolution of the Paramyxoviridae and Rhabdoviridae multiple-protein-encoding P gene.

Science.gov (United States)

Jordan, I K; Sutter, B A; McClure, M A

2000-01-01

Presented here is an analysis of the molecular evolutionary dynamics of the P gene among 76 representative sequences of the Paramyxoviridae and Rhabdoviridae RNA virus families. In a number of Paramyxoviridae taxa, as well as in vesicular stomatitis viruses of the Rhabdoviridae, the P gene encodes multiple proteins from a single genomic RNA sequence. These products include the phosphoprotein (P), as well as the C and V proteins. The complexity of the P gene makes it an intriguing locus to study from an evolutionary perspective. Amino acid sequence alignments of the proteins encoded at the P and N loci were used in independent phylogenetic reconstructions of the Paramyxoviridae and Rhabdoviridae families. P-gene-coding capacities were mapped onto the Paramyxoviridae phylogeny, and the most parsimonious path of multiple-coding-capacity evolution was determined. Levels of amino acid variation for Paramyxoviridae and Rhabdoviridae P-gene-encoded products were also analyzed. Proteins encoded in overlapping reading frames from the same nucleotides have different levels of amino acid variation. The nucleotide architecture that underlies the amino acid variation was determined in order to evaluate the role of selection in the evolution of the P gene overlapping reading frames. In every case, the evolution of one of the proteins encoded in the overlapping reading frames has been constrained by negative selection while the other has evolved more rapidly. The integrity of the overlapping reading frame that represents a derived state is generally maintained at the expense of the ancestral reading frame encoded by the same nucleotides. The evolution of such multicoding sequences is likely a response by RNA viruses to selective pressure to maximize genomic information content while maintaining small genome size. The ability to evolve such a complex genomic strategy is intimately related to the dynamics of the viral quasispecies, which allow enhanced exploration of the adaptive
Network analysis reveals stage-specific changes in zebrafish embryo development using time course whole transcriptome profiling and prior biological knowledge.

Science.gov (United States)

Zhang, Yuji

2015-01-01

Molecular networks act as the backbone of molecular activities within cells, offering a unique opportunity to better understand the mechanism of diseases. While network data usually constitute only static network maps, integrating them with time course gene expression information can provide clues to the dynamic features of these networks and unravel the mechanistic driver genes characterizing cellular responses. Time course gene expression data allow us to broadly "watch" the dynamics of the system. However, one challenge in the analysis of such data is to establish and characterize the interplay among genes that are altered at different time points in the context of a biological process or functional category. Integrative analysis of these data sources will lead us to a more complete understanding of how biological entities (e.g., genes and proteins) coordinately perform their biological functions in biological systems. In this paper, we introduced a novel network-based approach to extract functional knowledge from time-dependent biological processes at a system level using time course mRNA sequencing data in zebrafish embryo development. The proposed method was applied to investigate 1α, 25(OH)2D3-altered mechanisms in zebrafish embryo development. We applied the proposed method to a public zebrafish time course mRNA-Seq dataset, containing two different treatments along four time points. We constructed networks between gene ontology biological process categories, which were enriched in differentially expressed genes between consecutive time points and different conditions. The temporal propagation of 1α, 25-Dihydroxyvitamin D3-altered transcriptional changes started from a few genes that were altered initially at an earlier stage, to large groups of biologically coherent genes at later stages. The most notable biological processes included neuronal and retinal development and generalized stress response. In addition, we also investigated the relationship among
Process and representation in multiple-cue judgment

OpenAIRE

Olsson, Anna-Carin

2002-01-01

This thesis investigates the cognitive processes and representations underlying human judgment in a multiple-cue judgment task. Several recent models assume that people have several qualitatively distinct and competing levels of knowledge representations (Ashby, Alfonso-Reese, Turken, & Waldron, 1998; Erickson & Kruschke, 1998; Nosofsky, Palmeri, & McKinley, 1994; Sloman, 1996). The most successful cognitive models in categorization and multiple-cue judgment are, respectively, exe...
Positive selection of Plasmodium falciparum parasites with multiple var2csa-type PfEMP1 genes during the course of infection in pregnant women

DEFF Research Database (Denmark)

Sander, Adam F; Salanti, Ali; Lavstsen, Thomas

2011-01-01

multiple genes coding for different VAR2CSA proteins, and parasites with >1 var2csa gene appear to be more common in pregnant women with placental malaria than in nonpregnant individuals. We present evidence that, in pregnant women, parasites containing multiple var2csa-type genes possess a selective...... advantage over parasites with a single var2csa gene. Accumulation of parasites with multiple copies of the var2csa gene during the course of pregnancy was also correlated with the development of antibodies involved in blocking VAR2CSA adhesion. The data suggest that multiplicity of var2csa-type genes...
Recurrent Gene Duplication Leads to Diverse Repertoires of Centromeric Histones in Drosophila Species.

Science.gov (United States)

Kursel, Lisa E; Malik, Harmit S

2017-06-01

Despite their essential role in the process of chromosome segregation in most eukaryotes, centromeric histones show remarkable evolutionary lability. Not only have they been lost in multiple insect lineages, but they have also undergone gene duplication in multiple plant lineages. Based on detailed study of a handful of model organisms including Drosophila melanogaster, centromeric histone duplication is considered to be rare in animals. Using a detailed phylogenomic study, we find that Cid, the centromeric histone gene, has undergone at least four independent gene duplications during Drosophila evolution. We find duplicate Cid genes in D. eugracilis (Cid2), in the montium species subgroup (Cid3, Cid4) and in the entire Drosophila subgenus (Cid5). We show that Cid3, Cid4, and Cid5 all localize to centromeres in their respective species. Some Cid duplicates are primarily expressed in the male germline. With rare exceptions, Cid duplicates have been strictly retained after birth, suggesting that they perform nonredundant centromeric functions, independent from the ancestral Cid. Indeed, each duplicate encodes a distinct N-terminal tail, which may provide the basis for distinct protein-protein interactions. Finally, we show some Cid duplicates evolve under positive selection whereas others do not. Taken together, our results support the hypothesis that Drosophila Cid duplicates have subfunctionalized. Thus, these gene duplications provide an unprecedented opportunity to dissect the multiple roles of centromeric histones. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Fused Regression for Multi-source Gene Regulatory Network Inference.

Directory of Open Access Journals (Sweden)

Kari Y Lam

2016-12-01

Understanding gene regulatory networks is critical to understanding cellular differentiation and response to external stimuli. Methods for global network inference have been developed and applied to a variety of species. Most approaches consider the problem of network inference independently in each species, despite evidence that gene regulation can be conserved even in distantly related species. Further, network inference is often confined to single data-types (single platforms and single cell types). We introduce a method for multi-source network inference that allows simultaneous estimation of gene regulatory networks in multiple species or biological processes through the introduction of priors based on known gene relationships such as orthology incorporated using fused regression. This approach improves network inference performance even when orthology mapping and conservation are incomplete. We refine this method by presenting an algorithm that extracts the true conserved subnetwork from a larger set of potentially conserved interactions and demonstrate the utility of our method in cross species network inference. Last, we demonstrate our method's utility in learning from data collected on different experimental platforms.
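As a rough illustration of the fusion idea described in this abstract, the sketch below fits one regression weight per species and adds a penalty on the difference between the two weights, so that an orthologous interaction is pulled toward a shared value. The quadratic penalty, the single-regulator setting, the function name `fused_fit`, and the toy data are all assumptions made for illustration; none of them is taken from the paper.

```python
def fused_fit(x1, y1, x2, y2, lam):
    """Minimize sum((y1 - w1*x1)^2) + sum((y2 - w2*x2)^2) + lam*(w1 - w2)^2.

    Hypothetical one-regulator example: w1 and w2 are the weights of the
    same (orthologous) interaction in two species. Setting the gradient to
    zero gives a 2x2 linear system, solved here in closed form.
    """
    a1 = sum(x * x for x in x1)
    b1 = sum(x * y for x, y in zip(x1, y1))
    a2 = sum(x * x for x in x2)
    b2 = sum(x * y for x, y in zip(x2, y2))
    det = (a1 + lam) * (a2 + lam) - lam * lam
    w1 = ((a2 + lam) * b1 + lam * b2) / det
    w2 = (lam * b1 + (a1 + lam) * b2) / det
    return w1, w2


# Toy data (invented): species 1 is well sampled, species 2 is sparse and noisy.
x1, y1 = [1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]
x2, y2 = [1.0, 2.0], [0.5, 4.0]

w_independent = fused_fit(x1, y1, x2, y2, lam=0.0)   # separate fits
w_fused = fused_fit(x1, y1, x2, y2, lam=10.0)        # weights pulled together
print(w_independent, w_fused)
```

With `lam=0.0` the two species are fitted independently; increasing `lam` shrinks the gap between the two weights, which is the sense in which the sparse species borrows strength from the well-sampled one.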
GeneLab: NASA's Open Access, Collaborative Platform for Systems Biology and Space Medicine

Science.gov (United States)

Berrios, Daniel C.; Thompson, Terri G.; Fogle, Homer W.; Rask, Jon C.; Coughlan, Joseph C.

2015-01-01

NASA is investing in GeneLab (http://genelab.nasa.gov), a multi-year effort to maximize utilization of the limited resources to conduct biological and medical research in space, principally aboard the International Space Station (ISS). High-throughput genomic, transcriptomic, proteomic or other omics analyses from experiments conducted on the ISS will be stored in the GeneLab Data Systems (GLDS), an open-science information system that will also include a biocomputation platform with collaborative science capabilities, to enable the discovery and validation of molecular networks.
Molecular Biology at the Cutting Edge: A Review on CRISPR/CAS9 Gene Editing for Undergraduates

Science.gov (United States)

Thurtle-Schmidt, Deborah M.; Lo, Te-Wen

2018-01-01

Disrupting a gene to determine its effect on an organism's phenotype is an indispensable tool in molecular biology. Such techniques are critical for understanding how a gene product contributes to the development and cellular identity of organisms. The explosion of genomic sequencing technologies combined with recent advances in genome-editing…
Students’ learning activities while studying biological process diagrams

NARCIS (Netherlands)

Kragten, M.; Admiraal, W.; Rijlaarsdam, G.

2015-01-01

Process diagrams describe how a system functions (e.g. photosynthesis) and are an important type of representation in Biology education. In the present study, we examined students’ learning activities while studying process diagrams, related to their resulting comprehension of these diagrams. Each
Multiple episodes of convergence in genes of the dim light vision pathway in bats.

Directory of Open Access Journals (Sweden)

Yong-Yi Shen

The molecular basis of the evolution of phenotypic characters is very complex and is poorly understood, with few examples documenting the roles of multiple genes. Considering that a single gene cannot fully explain the convergence of phenotypic characters, we chose to study the convergent evolution of rod vision in two divergent bats from a network perspective. The Old World fruit bats (Pteropodidae) are non-echolocating and have binocular vision, whereas the sheath-tailed bats (Emballonuridae) are echolocating and have monocular vision; however, they both have relatively large eyes and rely more on rod vision to find food and navigate in the night. We found that the genes CRX, which plays an essential role in the differentiation of photoreceptor cells, SAG, which is involved in the desensitization of the photoactivated transduction cascade, and the photoreceptor gene RH, which is directly responsible for the perception of dim light, have undergone parallel sequence evolution in two divergent lineages of bats with larger eyes (Pteropodidae and Emballonuroidea). The occurrence of multiple convergent events in the network of genes essential for rod vision is a rare phenomenon that illustrates the importance of investigating pathways and networks in the evolution of the molecular basis of phenotypic convergence.
Gene Expression Measurement Module (GEMM) - A Fully Automated, Miniaturized Instrument for Measuring Gene Expression in Space

Science.gov (United States)

Pohorille, Andrew; Peyvan, Kia; Karouia, Fathi; Ricco, Antonio

2012-01-01

The capability to measure gene expression on board spacecraft opens the door to a large number of high-value experiments on the influence of the space environment on biological systems. For example, measurements of gene expression will help us to understand adaptation of terrestrial life to conditions beyond the planet of origin, identify deleterious effects of the space environment on a wide range of organisms from microbes to humans, develop effective countermeasures against these effects, and determine the metabolic bases of microbial pathogenicity and drug resistance. These and other applications hold significant potential for discoveries in space biology, biotechnology, and medicine. Supported by funding from the NASA Astrobiology Science and Technology Instrument Development Program, we are developing a fully automated, miniaturized, integrated fluidic system for small spacecraft capable of in-situ measurement of expression of several hundreds of microbial genes from multiple samples. The instrument will be capable of (1) lysing cell walls of bacteria sampled from cultures grown in space, (2) extracting and purifying RNA released from cells, (3) hybridizing the RNA on a microarray and (4) providing readout of the microarray signal, all in a single microfluidics cartridge. The device is suitable for deployment on nanosatellite platforms developed by NASA Ames' Small Spacecraft Division. To meet space and other technical constraints imposed by these platforms, a number of technical innovations are being implemented. The integration and end-to-end technological and biological validation of the instrument are carried out using as a model the photosynthetic bacterium Synechococcus elongatus, known for its remarkable metabolic diversity and resilience to adverse conditions. Each step in the measurement process (lysis, nucleic acid extraction, purification, and hybridization to an array) is assessed through comparison of the results obtained using the instrument with
Students' Ability to Solve Process-Diagram Problems in Secondary Biology Education

Science.gov (United States)

Kragten, Marco; Admiraal, Wilfried; Rijlaarsdam, Gert

2015-01-01

Process diagrams are important tools in biology for explaining processes such as protein synthesis, compound cycles and the like. The aim of the present study was to measure the ability to solve process-diagram problems in biology and its relationship with prior knowledge, spatial ability and working memory. For this purpose, we developed a test…
The JCSG high-throughput structural biology pipeline

International Nuclear Information System (INIS)

Elsliger, Marc-André; Deacon, Ashley M.; Godzik, Adam; Lesley, Scott A.; Wooley, John; Wüthrich, Kurt; Wilson, Ian A.

2010-01-01

The Joint Center for Structural Genomics (JCSG) high-throughput structural biology pipeline has delivered more than 1000 structures to the community over the past ten years. The JCSG has made a significant contribution to the overall goal of the NIH Protein Structure Initiative (PSI) of expanding structural coverage of the protein universe, as well as making substantial inroads into structural coverage of an entire organism. Targets are processed through an extensive combination of bioinformatics and biophysical analyses to efficiently characterize and optimize each target prior to selection for structure determination. The pipeline uses parallel processing methods at almost every step in the process and can adapt to a wide range of protein targets from bacterial to human. The construction, expansion and optimization of the JCSG gene-to-structure pipeline over the years have resulted in many technological and methodological advances and developments. The vast number of targets and the enormous amounts of associated data processed through the multiple stages of the experimental pipeline required the development of a variety of valuable resources that, wherever feasible, have been converted to free-access web-based tools and applications.
FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

Science.gov (United States)

Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2018-02-09

Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of
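The comparison with random networks described in this abstract can be illustrated with a small permutation test. The sketch below is a simplified stand-in, not the CrossTalkZ or FunGeneNet algorithm: it counts the edges that fall inside a gene set and compares that count with the counts for randomly drawn node sets of the same size. The toy network, the gene names, and the function names are invented for illustration.

```python
import random
import statistics


def count_internal_edges(edges, genes):
    """Number of network edges whose two endpoints both lie in `genes`."""
    gene_set = set(genes)
    return sum(1 for a, b in edges if a in gene_set and b in gene_set)


def permutation_enrichment(edges, genes, n_perm=1000, seed=0):
    """Compare internal connectivity of `genes` with random node sets.

    Returns (observed edge count, z-score, empirical p-value). This is a
    plain node-label resampling scheme, assumed here for simplicity.
    """
    rng = random.Random(seed)
    nodes = sorted({n for edge in edges for n in edge})
    observed = count_internal_edges(edges, genes)
    null = [
        count_internal_edges(edges, rng.sample(nodes, len(genes)))
        for _ in range(n_perm)
    ]
    mean = statistics.mean(null)
    sd = statistics.pstdev(null)
    z = (observed - mean) / sd if sd > 0 else 0.0
    # Add-one correction keeps the empirical p-value strictly positive.
    p = (1 + sum(1 for x in null if x >= observed)) / (n_perm + 1)
    return observed, z, p


# Toy network (invented): a 4-gene clique attached to a sparse chain.
clique = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("B", "D"), ("C", "D")]
chain = [("D", "E"), ("E", "F"), ("F", "G"), ("G", "H"),
         ("H", "I"), ("I", "J"), ("J", "K"), ("K", "L")]
observed, z, p = permutation_enrichment(clique + chain, ["A", "B", "C", "D"])
print(observed, round(z, 2), round(p, 4))
```

A densely connected set such as the clique yields a large positive z-score and a small empirical p-value, while a set drawn from the sparse chain would not.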
Programming Morphogenesis through Systems and Synthetic Biology.

Science.gov (United States)

Velazquez, Jeremy J; Su, Emily; Cahan, Patrick; Ebrahimkhani, Mo R

2018-04-01

Mammalian tissue development is an intricate, spatiotemporal process of self-organization that emerges from gene regulatory networks of differentiating stem cells. A major goal in stem cell biology is to gain a sufficient understanding of gene regulatory networks and cell-cell interactions to enable the reliable and robust engineering of morphogenesis. Here, we review advances in synthetic biology, single cell genomics, and multiscale modeling, which, when synthesized, provide a framework to achieve the ambitious goal of programming morphogenesis in complex tissues and organoids. Copyright © 2017 Elsevier Ltd. All rights reserved.
Step by Step: Biology Undergraduates’ Problem-Solving Procedures during Multiple-Choice Assessment

Science.gov (United States)

Prevost, Luanna B.; Lemons, Paula P.

2016-01-01

This study uses the theoretical framework of domain-specific problem solving to explore the procedures students use to solve multiple-choice problems about biology concepts. We designed several multiple-choice problems and administered them on four exams. We trained students to produce written descriptions of how they solved the problem, and this allowed us to systematically investigate their problem-solving procedures. We identified a range of procedures and organized them as domain general, domain specific, or hybrid. We also identified domain-general and domain-specific errors made by students during problem solving. We found that students use domain-general and hybrid procedures more frequently when solving lower-order problems than higher-order problems, while they use domain-specific procedures more frequently when solving higher-order problems. Additionally, the more domain-specific procedures students used, the higher the likelihood that they would answer the problem correctly, up to five procedures. However, if students used just one domain-general procedure, they were as likely to answer the problem correctly as if they had used two to five domain-general procedures. Our findings provide a categorization scheme and framework for additional research on biology problem solving and suggest several important implications for researchers and instructors. PMID:27909021
Rapid genome reshaping by multiple-gene loss after whole-genome duplication in teleost fish suggested by mathematical modeling

Science.gov (United States)

Sato, Yukuto; Tsukamoto, Katsumi; Nishida, Mutsumi

2015-01-01

Whole-genome duplication (WGD) is believed to be a significant source of major evolutionary innovation. Redundant genes resulting from WGD are thought to be lost or acquire new functions. However, the rates of gene loss and thus temporal process of genome reshaping after WGD remain unclear. The WGD shared by all teleost fish, one-half of all jawed vertebrates, was more recent than the two ancient WGDs that occurred before the origin of jawed vertebrates, and thus lends itself to analysis of gene loss and genome reshaping. Using a newly developed orthology identification pipeline, we inferred the post–teleost-specific WGD evolutionary histories of 6,892 protein-coding genes from nine phylogenetically representative teleost genomes on a time-calibrated tree. We found that rapid gene loss did occur in the first 60 My, with a loss of more than 70–80% of duplicated genes, and produced similar genomic gene arrangements within teleosts in that relatively short time. Mathematical modeling suggests that rapid gene loss occurred mainly by events involving simultaneous loss of multiple genes. We found that the subsequent 250 My were characterized by slow and steady loss of individual genes. Our pipeline also identified about 1,100 shared single-copy genes that are inferred to have become singletons before the divergence of clupeocephalan teleosts. Therefore, our comparative genome analysis suggests that rapid gene loss just after the WGD reshaped teleost genomes before the major divergence, and provides a useful set of marker genes for future phylogenetic analysis. PMID:26578810
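For scale, the reported loss of 70–80% of duplicated genes within the first 60 My can be converted into an average per-My loss rate. The sketch below assumes a single constant-rate exponential decay purely as a back-of-envelope device; the abstract itself indicates that the actual process was not uniform (rapid multi-gene losses followed by slow loss of individual genes), so this is not the authors' model. The function name is hypothetical.

```python
import math


def mean_loss_rate(fraction_lost, interval_my):
    """Average per-My loss rate if loss were a constant-rate exponential decay.

    Solves 1 - fraction_lost = exp(-rate * interval_my) for rate.
    """
    return -math.log(1.0 - fraction_lost) / interval_my


low = mean_loss_rate(0.70, 60.0)    # 70% of duplicates lost in 60 My
high = mean_loss_rate(0.80, 60.0)   # 80% of duplicates lost in 60 My
print(f"average loss rate: {low:.4f} to {high:.4f} per My")
```

Under this assumption the average rate comes out at roughly 0.020 to 0.027 per My for the first 60 My.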
Co-existence of Blau syndrome and NAID? Diagnostic challenges associated with presence of multiple pathogenic variants in NOD2 gene: a case report.

Science.gov (United States)

Dziedzic, Magdalena; Marjańska, Agata; Bąbol-Pokora, Katarzyna; Urbańczyk, Anna; Grześk, Elżbieta; Młynarski, Wojciech; Kołtan, Sylwia

2017-07-27

Pediatric autoinflammatory diseases are rare and still poorly understood conditions resulting from defective genetic control of the innate immune system, inter alia from anomalies of the NOD2 gene. The product of this gene is the Nod2 protein, which takes part in the maintenance of immune homeostasis. The clinical form of the resultant autoinflammatory condition depends on the NOD2 genotype; usually patients with NOD2 defects present with Blau syndrome, NOD2-associated autoinflammatory disease (NAID) or Crohn's disease. We present the case of a 7-year-old girl with co-existing symptoms of two rare diseases, Blau syndrome and NAID. Overlapping manifestations of the two syndromes posed a significant diagnostic challenge, until a next-generation sequencing (NGS) test identified the presence of three pathogenic variants of the NOD2 gene (P268S, IVS8+158 and 1007fs) and established the ultimate diagnosis. The presence of multiple genetic abnormalities resulted in an ambiguous clinical presentation with overlapping symptoms of Blau syndrome and NAID. The final diagnosis of autoinflammatory disease opened new therapeutic possibilities, including the use of biological treatments.

A differential genome-wide transcriptome analysis: impact of cellular copper on complex biological processes like aging and development.

Directory of Open Access Journals (Sweden)

Jörg Servos

The regulation of cellular copper homeostasis is crucial in biology. Impairments lead to severe dysfunctions and are known to affect aging and development. Previously, a loss-of-function mutation in the gene encoding the copper-sensing and copper-regulated transcription factor GRISEA of the filamentous fungus Podospora anserina was reported to lead to cellular copper depletion and a pleiotropic phenotype with hypopigmentation of the mycelium and the ascospores, affected fertility and increased lifespan by approximately 60% when compared to the wild type. This phenotype is linked to a switch from a copper-dependent standard to an alternative respiration leading to a reduced generation of both reactive oxygen species (ROS) and adenosine triphosphate (ATP). We performed a genome-wide comparative transcriptome analysis of a wild-type strain and the copper-depleted grisea mutant. We unambiguously assigned 9,700 sequences of the transcriptome in both strains to the more than 10,600 predicted and annotated open reading frames of the P. anserina genome, indicating 90% coverage of the transcriptome. 4,752 of the transcripts differed significantly in abundance, with 1,156 transcripts differing at least 3-fold. Selected genes were investigated by qRT-PCR analyses. Apart from this general characterization we analyzed the data with special emphasis on molecular pathways related to the grisea mutation, taking advantage of the available complete genomic sequence of P. anserina. This analysis verified but also corrected conclusions from earlier data obtained by single gene analysis, identified new candidates of factors as part of the cellular copper homeostasis system including target genes of transcription factor GRISEA, and provides a rich reference source of quantitative data for further detailed investigations. Overall, the present study demonstrates the importance of systems biology approaches also in cases where mutations in single genes are analyzed to
Boolean Models of Biological Processes Explain Cascade-Like Behavior.

Science.gov (United States)

Chen, Hao; Wang, Guanyu; Simha, Rahul; Du, Chenghang; Zeng, Chen

2016-01-29

Biological networks play a key role in determining biological function and therefore, an understanding of their structure and dynamics is of central interest in systems biology. In Boolean models of such networks, the status of each molecule is either "on" or "off"; as the molecules interact with each other, their individual status changes from "on" to "off" or vice versa, and the system of molecules in the network collectively goes through a sequence of changes in state. This sequence of changes is termed a biological process. In this paper, we examine the common perception that events in biomolecular networks occur sequentially, in a cascade-like manner, and ask whether this is likely to be an inherent property. In further investigations of the budding and fission yeast cell cycle, we identify two generic dynamical rules. A Boolean system that complies with these rules will automatically have a certain robustness. By considering the biological requirements in robustness and designability, we show that those Boolean dynamical systems, compared to an arbitrary dynamical system, statistically present the characteristics of cascadeness and sequentiality, as observed in the budding and fission yeast cell cycle. These results suggest that cascade-like behavior might be an intrinsic property of biological processes.
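The Boolean picture described in this abstract (each molecule "on" or "off", the whole system stepping through a sequence of states) can be made concrete with a minimal sketch. The three-node network, its update rules, and the synchronous update scheme below are invented for illustration and are not the yeast cell-cycle models analyzed in the paper.

```python
def step(state, rules):
    """Synchronous update: every node is recomputed from the current state."""
    return {node: rule(state) for node, rule in rules.items()}


def trajectory(initial, rules, max_steps=100):
    """Follow the network until a state repeats (fixed point or cycle)."""
    seen = set()
    path = []
    state = dict(initial)
    for _ in range(max_steps):
        key = tuple(sorted(state.items()))
        if key in seen:
            break
        seen.add(key)
        path.append(state)
        state = step(state, rules)
    return path


# Hypothetical cascade: a transient pulse on A switches on B, then C.
rules = {
    "A": lambda s: False,    # A has no activator, so it turns off
    "B": lambda s: s["A"],   # B is on only if A was on
    "C": lambda s: s["B"],   # C is on only if B was on
}
path = trajectory({"A": True, "B": False, "C": False}, rules)
for state in path:
    print([node for node, on in state.items() if on])
```

In this toy network the activity passes from A to B to C and then dies out, which is the kind of sequential, cascade-like trajectory the abstract refers to as a biological process.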
Oral cancer cells with different potential of lymphatic metastasis displayed distinct biologic behaviors and gene expression profiles.

Science.gov (United States)

Zhuang, Zhang; Jian, Pan; Longjiang, Li; Bo, Han; Wenlin, Xiao

2010-02-01

Oral squamous cell carcinoma (OSCC) often spreads from the primary tumor to regional lymph nodes in the early stage. Better understanding of the biology of lymphatic spread of oral cancer cells is important for improving the survival rate of cancer patients. We established the cell line LNMTca8113 by repeated injections in foot pads of nude mice; it had a much higher lymphatic metastasis rate than its parental cell line Tca8113. We then compared the biologic behaviors of the two cell lines. Microarray-based expression profiles of the two lines were also compared, and a panel of differentially expressed genes was validated using real-time PCR. In contrast to Tca8113 cells, LNMTca8113 cells were more proliferative and resistant to apoptosis in the absence of serum, and had an enhanced ability to induce capillary-like structures. Moreover, comparison of the microarray-based expression profiles identified 1341 genes involved in cell cycle, cell adhesion, lymphangiogenesis, regulation of apoptosis, and so on. Some genes contributing to the metastatic potential, including JAM2, TNC, CTSC, LAMB1, VEGFC, HAPLN1, ACPP, GDF9 and FGF11, were upregulated in LNMTca8113 cells. These results suggested that LNMTca8113 and Tca8113 cells were proper models for lymphatic metastasis study because there were differences in biologic behaviors and metastasis-related genes between them. Additionally, the differentially expressed gene profiles in cancer progression may be helpful in exploring therapeutic targets and provide the foundation for further functional validation of these specific candidate genes for OSCC.
Correction of technical bias in clinical microarray data improves concordance with known biological information

DEFF Research Database (Denmark)

Eklund, Aron Charles; Szallasi, Zoltan Imre

2008-01-01

The performance of gene expression microarrays has been well characterized using controlled reference samples, but the performance on clinical samples remains less clear. We identified sources of technical bias affecting many genes in concert, thus causing spurious correlations in clinical data...... sets and false associations between genes and clinical variables. We developed a method to correct for technical bias in clinical microarray data, which increased concordance with known biological relationships in multiple data sets....
Multiple scattering processes: inverse and direct

International Nuclear Information System (INIS)

Kagiwada, H.H.; Kalaba, R.; Ueno, S.

1975-01-01

The purpose of the work is to formulate inverse problems in radiative transfer, to introduce the functions b and h as parameters of internal intensity in homogeneous slabs, and to derive initial value problems to replace the more traditional boundary value problems and integral equations of multiple scattering with high computational efficiency. The discussion covers multiple scattering processes in a one-dimensional medium; isotropic scattering in homogeneous slabs illuminated by parallel rays of radiation; the theory of functions b and h in homogeneous slabs illuminated by isotropic sources of radiation either at the top or at the bottom; inverse and direct problems of multiple scattering in slabs including internal sources; multiple scattering in inhomogeneous media, with particular reference to inverse problems for estimation of layers and total thickness of inhomogeneous slabs and to multiple scattering problems with Lambert's law and specular reflectors underlying slabs; and anisotropic scattering with reduction of the number of relevant arguments through axially symmetric fields and expansion in Legendre functions. Gaussian quadrature data for a seven point formula, a FORTRAN program for computing the functions b and h, and tables of these functions supplement the text
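As a rough modern counterpart to the seven-point Gaussian quadrature data mentioned in this record, the sketch below computes nodes and weights numerically. It assumes the Gauss-Legendre rule on [-1, 1]; the abstract does not say which Gaussian rule or interval the book tabulates, and the function name is hypothetical.

```python
import math


def gauss_legendre(n):
    """Nodes and weights of the n-point Gauss-Legendre rule on [-1, 1].

    Roots of the Legendre polynomial P_n are found by Newton iteration,
    with P_n evaluated through the three-term recurrence.
    """
    nodes, weights = [], []
    for i in range(1, n + 1):
        x = math.cos(math.pi * (i - 0.25) / (n + 0.5))  # initial guess
        for _ in range(100):
            p0, p1 = 1.0, x
            for k in range(2, n + 1):
                p0, p1 = p1, ((2 * k - 1) * x * p1 - (k - 1) * p0) / k
            dp = n * (x * p1 - p0) / (x * x - 1.0)  # derivative of P_n at x
            dx = p1 / dp
            x -= dx
            if abs(dx) < 1e-15:
                break
        nodes.append(x)
        weights.append(2.0 / ((1.0 - x * x) * dp * dp))
    return nodes, weights


nodes, weights = gauss_legendre(7)
# A 7-point rule integrates polynomials up to degree 13 exactly.
approx = sum(w * x ** 12 for x, w in zip(nodes, weights))
print(approx, 2.0 / 13.0)
```

Rules of this kind are typically used to replace an integral over angle by a short weighted sum, which is one way the initial value formulations mentioned above become computable.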
KeyPathwayMiner - De-novo network enrichment by combining multiple OMICS data and biological networks

DEFF Research Database (Denmark)

Baumbach, Jan; Alcaraz, Nicolas; Pauling, Josch K.

We tackle the problem of de-novo pathway extraction. Given a biological network and a set of case-control studies, KeyPathwayMiner efficiently extracts and visualizes all maximal connected sub-networks that contain mainly genes that are dysregulated, e.g., differentially expressed, in most cases ...
Systems biology-guided identification of synthetic lethal gene pairs and its potential use to discover antibiotic combinations

DEFF Research Database (Denmark)

Aziz, Ramy K.; Monk, Jonathan M.; Lewis, R. M.

2015-01-01

Mathematical models of metabolism from bacterial systems biology have proven their utility across multiple fields, for example metabolic engineering, growth phenotype simulation, and biological discovery. The usefulness of the models stems from their ability to compute a link between genotype...... of metabolic models, and highlight one potential application of systems biology to drug discovery and translational medicine....
Detailed assessment of gene activation levels by multiple hypoxia-responsive elements under various hypoxic conditions.

Science.gov (United States)

Takeuchi, Yasuto; Inubushi, Masayuki; Jin, Yong-Nan; Murai, Chika; Tsuji, Atsushi B; Hata, Hironobu; Kitagawa, Yoshimasa; Saga, Tsuneo

2014-12-01

HIF-1/HRE pathway is a promising target for the imaging and the treatment of intractable malignancy (HIF-1; hypoxia-inducible factor 1, HRE; hypoxia-responsive element). The purposes of our study are: (1) to assess the gene activation levels resulting from various numbers of HREs under various hypoxic conditions, (2) to evaluate the bidirectional activity of multiple HREs, and (3) to confirm whether multiple HREs can induce gene expression in vivo. Human colon carcinoma HCT116 cells were transiently transfected by the constructs containing a firefly luciferase reporter gene and various numbers (2, 4, 6, 8, 10, and 12) of HREs (nHRE+, nHRE-). The relative luciferase activities were measured under various durations of hypoxia (6, 12, 18, and 24 h), O2 concentrations (1, 2, 4, 8, and 16 %), and various concentrations of deferoxamine mesylate (20, 40, 80, 160, and 320 µg/mL growth medium). The bidirectional gene activation levels by HREs were examined in the constructs (dual-luc-nHREs) containing firefly and Renilla luciferase reporter genes at each side of nHREs. Finally, to test whether the construct containing 12HRE and the NIS reporter gene (12HRE-NIS) can induce gene expression in vivo, SPECT imaging was performed in a mouse xenograft model. (1) gene activation levels by HREs tended to increase with increasing HRE copy number, but a saturation effect was observed in constructs with more than 6 or 8 copies of an HRE, (2) gene activation levels by HREs increased remarkably during 6-12 h of hypoxia, but not beyond 12 h, (3) gene activation levels by HREs decreased with increasing O2 concentrations, but could be detected even under mild hypoxia at 16 % O2, (4) the bidirectionally proportional activity of the HRE was confirmed regardless of the hypoxic severity, and (5) NIS expression driven by 12 tandem copies of an HRE in response to hypoxia could be visualized on in vivo SPECT imaging. The results of this study will help in the understanding and assessment of
Evaluation of the effectiveness and safety of the thermo-treatment process to dispose of recombinant DNA waste from biological research laboratories.

Science.gov (United States)

Li, Meng-Nan; Zheng, Guang-Hong; Wang, Lei; Xiao, Wei; Fu, Xiao-Hua; Le, Yi-Quan; Ren, Da-Ming

2009-01-01

The discharge of recombinant DNA waste from biological laboratories into the eco-system may be one of the pathways resulting in horizontal gene transfer or "gene pollution". Heating at 100 degrees C for 5-10 min is a common method for treating recombinant DNA waste in biological research laboratories in China. In this study, we evaluated the effectiveness and the safety of the thermo-treatment method in the disposal of recombinant DNA waste. Quantitative PCR, plasmid transformation and electrophoresis technology were used to evaluate the decay/denaturation efficiency during the thermo-treatment process of recombinant plasmid, pET-28b. Results showed that prolonging thermo-treatment time could improve decay efficiency of the plasmid, and its decay half-life was 2.7-4.0 min during the thermo-treatment at 100 degrees C. However, after 30 min of thermo-treatment some transforming activity remained. Higher ionic strength could protect recombinant plasmid from decay during the treatment process. These results indicate that thermo-treatment at 100 degrees C cannot decay and inactivate pET-28b completely. In addition, preliminary results showed that thermo-treated recombinant plasmids were not degraded completely in a short period when they were discharged into an aquatic environment. This implies that when thermo-treated recombinant DNAs are discharged into the eco-system, they may have enough time to re-nature and transform, thus resulting in gene diffusion.
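The reported half-life can be turned into a rough estimate of how much plasmid survives a given treatment time. The sketch below assumes simple first-order decay with a constant half-life throughout the treatment; that is an assumption made here for illustration, not a kinetic model stated in the abstract, and the function name is hypothetical.

```python
def remaining_fraction(t_min, half_life_min):
    """Fraction of intact plasmid left after t_min minutes of first-order decay."""
    return 0.5 ** (t_min / half_life_min)


# Half-life range reported in the abstract: 2.7 to 4.0 min at 100 degrees C.
for half_life in (2.7, 4.0):
    for t in (5, 10, 30):
        frac = remaining_fraction(t, half_life)
        print(f"half-life {half_life} min, {t:>2} min treatment: {frac:.4%} remaining")
```

Under this assumption a 5-10 min treatment still leaves several percent or more of the plasmid intact, and even 30 min leaves a small nonzero fraction (on the order of 0.05% to 0.6%), which is in line with the reported observation that some transforming activity remained after 30 min.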
Evaluation of the effectiveness and safety of the thermo-treatment process to dispose of recombinant DNA waste from biological research laboratories

International Nuclear Information System (INIS)

Li Mengnan; Zheng Guanghong; Wang Lei; Xiao Wei; Fu Xiaohua; Le Yiquan; Ren Daming

2009-01-01

The discharge of recombinant DNA waste from biological laboratories into the eco-system may be one of the pathways resulting in horizontal gene transfer or 'gene pollution'. Heating at 100 °C for 5-10 min is a common method for treating recombinant DNA waste in biological research laboratories in China. In this study, we evaluated the effectiveness and the safety of the thermo-treatment method in the disposal of recombinant DNA waste. Quantitative PCR, plasmid transformation and electrophoresis technology were used to evaluate the decay/denaturation efficiency during the thermo-treatment process of recombinant plasmid, pET-28b. Results showed that prolonging thermo-treatment time could improve decay efficiency of the plasmid, and its decay half-life was 2.7-4.0 min during the thermo-treatment at 100 °C. However, after 30 min of thermo-treatment some transforming activity remained. Higher ionic strength could protect recombinant plasmid from decay during the treatment process. These results indicate that thermo-treatment at 100 °C cannot decay and inactivate pET-28b completely. In addition, preliminary results showed that thermo-treated recombinant plasmids were not degraded completely in a short period when they were discharged into an aquatic environment. This implies that when thermo-treated recombinant DNAs are discharged into the eco-system, they may have enough time to re-nature and transform, thus resulting in gene diffusion.
PGASO: A synthetic biology tool for engineering a cellulolytic yeast

Directory of Open Access Journals (Sweden)

Chang Jui-Jen

2012-07-01

Background: To achieve economical cellulosic ethanol production, a host that can do both cellulosic saccharification and ethanol fermentation is desirable. However, engineering a non-cellulolytic yeast to be such a host requires synthetic biology techniques to transform multiple enzyme genes into its genome. Results: A technique named Promoter-based Gene Assembly and Simultaneous Overexpression (PGASO), which employs overlapping oligonucleotides for recombinatorial assembly of gene cassettes with individual promoters, was developed. PGASO was applied to engineer Kluyveromyces marxianus KY3, which is a thermo- and toxin-tolerant yeast. We obtained a recombinant strain, called KR5, that is capable of simultaneously expressing exoglucanase and endoglucanase (both of Trichoderma reesei), a beta-glucosidase (from a cow rumen fungus), a neomycin phosphotransferase, and a green fluorescent protein. High transformation efficiency and accuracy were achieved, as ~63% of the transformants were confirmed to be correct. KR5 can utilize beta-glycan, cellobiose or CMC as the sole carbon source for growth and can directly convert cellobiose and beta-glycan to ethanol. Conclusions: This study provides the first example of multi-gene assembly in a single step in a yeast species other than Saccharomyces cerevisiae. We successfully engineered a yeast host with a five-gene cassette assembly and the new host is capable of co-expressing three types of cellulase genes. Our study shows that PGASO is an efficient tool for simultaneous expression of multiple enzymes in the kefir yeast KY3 and that KY3 can serve as a host for developing synthetic biology tools.
The Genome Biology of Effector Gene Evolution in Filamentous Plant Pathogens.

Science.gov (United States)

Sánchez-Vallet, Andrea; Fouché, Simone; Fudal, Isabelle; Hartmann, Fanny E; Soyer, Jessica L; Tellier, Aurélien; Croll, Daniel

2018-05-16

Filamentous pathogens, including fungi and oomycetes, pose major threats to global food security. Crop pathogens cause damage by secreting effectors that manipulate the host to the pathogen's advantage. Genes encoding such effectors are among the most rapidly evolving genes in pathogen genomes. Here, we review how the major characteristics of the emergence, function, and regulation of effector genes are tightly linked to the genomic compartments where these genes are located in pathogen genomes. The presence of repetitive elements in these compartments is associated with elevated rates of point mutations and sequence rearrangements with a major impact on effector diversification. The expression of many effectors converges on an epigenetic control mediated by the presence of repetitive elements. Population genomics analyses showed that rapidly evolving pathogens show high rates of turnover at effector loci and display a mosaic in effector presence-absence polymorphism among strains. We conclude that effective pathogen containment strategies require a thorough understanding of the effector genome biology and the pathogen's potential for rapid adaptation. Expected final online publication date for the Annual Review of Phytopathology Volume 56 is August 25, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
The nuclear receptor gene nhr-25 plays multiple roles in the Caenorhabditis elegans heterochronic gene network to control the larva-to-adult transition

Czech Academy of Sciences Publication Activity Database

Hada, K.; Asahina, Masako; Hasegawa, H.; Kanaho, Y.; Slack, F. J.; Niwa, R.

2010-01-01

Vol. 344, No. 2 (2010), pp. 1100-1109 ISSN 0012-1606 R&D Projects: GA ČR(CZ) GA204/07/0948; GA ČR(CZ) GD204/09/H058 Institutional research plan: CEZ:AV0Z60220518 Keywords: apl-1 * Caenorhabditis elegans * heterochronic gene * let-7 * nuclear receptor * nhr-25 Subject RIV: EB - Genetics; Molecular Biology Impact factor: 4.094, year: 2010
Redox processes in radiation biology and cancer

International Nuclear Information System (INIS)

Greenstock, C.L.

1981-01-01

Free-radical intermediates, particularly the activated oxygen species OH, O₂⁻, and ¹O₂, are implicated in many types of radiation damage to biological systems. In addition, these same species may be formed, either directly or indirectly through biochemical redox reactions, in both essential and aberrant metabolic processes. Cell survival and adaptation to an environment containing ionizing radiation and other physical and chemical carcinogens ultimately depend upon the cell's ability to maintain optimal function in response to free-radical damage at the chemical level. Many of these feedback control mechanisms are redox controlled. Radiation chemical techniques using selective radical scavengers, such as product analysis and pulse radiolysis, enable us to generate, observe, and characterize individually the nature and reactivity of potentially damaging free radicals. From an analysis of the chemical kinetics of free-radical involvement in biological damage, redox mechanisms are proposed to describe the early processes of radiation damage, its protection and sensitization, and the role of free radicals in radiation and chemical carcinogenesis.
Construction of coffee transcriptome networks based on gene annotation semantics

Directory of Open Access Journals (Sweden)

Castillo Luis F.

2012-12-01

Full Text Available Gene annotation is a process that encompasses multiple approaches to the analysis of nucleic acid or protein sequences in order to assign structural and functional characteristics to gene models. When thousands of gene models are being described in an organism's genome, construction and visualization of gene networks impose novel challenges in the understanding of complex expression patterns and the generation of new knowledge in genomics research. In order to take advantage of accumulated text data after conventional gene sequence analysis, this work applied semantics in combination with visualization tools to build transcriptome networks from a set of coffee gene annotations. A set of selected coffee transcriptome sequences, chosen by the quality of the sequence comparison reported by Basic Local Alignment Search Tool (BLAST) and InterProScan, were filtered by coverage, identity, length of the query, and e-values. Meanwhile, term descriptors for molecular biology and biochemistry were obtained from the WordNet dictionary in order to construct a Resource Description Framework (RDF) using Ruby scripts and Methontology to find associations between concepts. Relationships between sequence annotations and semantic concepts were graphically represented through a total of 6845 oriented vectors, which were reduced to 745 non-redundant associations. A large gene network connecting transcripts by way of relational concepts was created where detailed connections remain to be validated for biological significance based on current biochemical and genetics frameworks. Besides reusing text information in the generation of gene connections and for data mining purposes, this tool development opens the possibility to visualize complex and abundant transcriptome data, and triggers the formulation of new hypotheses in metabolic pathways analysis.
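The two data-reduction steps described above (filtering sequence hits on coverage, identity, query length and e-value, then collapsing oriented vectors into non-redundant associations) can be sketched as follows. This is a minimal illustration only: the authors report using Ruby scripts, whereas this sketch uses Python, and the `Hit` record, the threshold values and the function names are all hypothetical, since the abstract gives no cut-offs.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One sequence-comparison hit (hypothetical record layout)."""
    query: str
    subject: str
    identity: float   # percent identity
    coverage: float   # percent of the query covered by the alignment
    query_len: int    # query length in residues
    evalue: float


def passes(hit: Hit, min_identity: float = 50.0, min_coverage: float = 70.0,
           min_len: int = 100, max_evalue: float = 1e-10) -> bool:
    """Keep a hit only if it clears every threshold (all values are assumptions)."""
    return (hit.identity >= min_identity
            and hit.coverage >= min_coverage
            and hit.query_len >= min_len
            and hit.evalue <= max_evalue)


def unique_associations(edges):
    """Collapse repeated directed (source, target) pairs, keeping first-seen order."""
    return list(dict.fromkeys(edges))


hits = [
    Hit("transcript_1", "protein_X", 92.0, 88.0, 450, 1e-50),
    Hit("transcript_2", "protein_Y", 35.0, 90.0, 300, 1e-20),  # fails identity
]
kept = [h for h in hits if passes(h)]
edges = [("transcript_1", "kinase"), ("transcript_1", "kinase"), ("kinase", "transcript_1")]
print(len(kept), unique_associations(edges))
```

Direction is preserved when removing duplicates, so `("a", "b")` and `("b", "a")` remain distinct associations, in keeping with the abstract's description of the vectors as oriented.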
Exploring autophagy with Gene Ontology

Science.gov (United States)

2018-01-01

ABSTRACT Autophagy is a fundamental cellular process that is well conserved among eukaryotes. It is one of the strategies that cells use to catabolize substances in a controlled way. Autophagy is used for recycling cellular components, responding to cellular stresses and ridding cells of foreign material. Perturbations in autophagy have been implicated in a number of pathological conditions such as neurodegeneration, cardiac disease and cancer. The growing knowledge about autophagic mechanisms needs to be collected in a computable and shareable format to allow its use in data representation and interpretation. The Gene Ontology (GO) is a freely available resource that describes how and where gene products function in biological systems. It consists of 3 interrelated structured vocabularies that outline what gene products do at the biochemical level, where they act in a cell and the overall biological objectives to which their actions contribute. It also consists of ‘annotations’ that associate gene products with the terms. Here we describe how we represent autophagy in GO, how we create and define terms relevant to autophagy researchers and how we interrelate those terms to generate a coherent view of the process, therefore allowing an interoperable description of its biological aspects. We also describe how annotation of gene products with GO terms improves data analysis and interpretation, hence bringing a significant benefit to this field of study. PMID:29455577
Binaural Processing of Multiple Sound Sources

Science.gov (United States)

2016-08-18

AFRL-AFOSR-VA-TR-2016-0298 Binaural Processing of Multiple Sound Sources William Yost ARIZONA STATE UNIVERSITY 660 S MILL AVE STE 312 TEMPE, AZ 85281...18-08-2016 2. REPORT TYPE Final Performance 3. DATES COVERED (From - To) 15 Jul 2012 to 14 Jul 2016 4. TITLE AND SUBTITLE Binaural Processing of...three topics cited above are entirely within the scope of the AFOSR grant. 15. SUBJECT TERMS Binaural hearing, Sound Localization, Interaural signal
A comparative analysis of multiple-choice and student performance-task assessment in the high school biology classroom

Science.gov (United States)

Cushing, Patrick Ryan

This study compared the performance of high school students on laboratory assessments. Thirty-four high school students who were enrolled in the second semester of a regular biology class or had completed the biology course the previous semester participated in this study. They were randomly assigned to examinations of two formats, performance-task and traditional multiple-choice, from two content areas, using a compound light microscope and diffusion. Students were directed to think-aloud as they performed the assessments. Additional verbal data were obtained during interviews following the assessment. The tape-recorded narrative data were analyzed for type and diversity of knowledge and skill categories, and percentage of in-depth processing demonstrated. While overall mean scores on the assessments were low, elicited statements provided additional insight into student cognition. Results indicated that a greater diversity of knowledge and skill categories was elicited by the two microscope assessments and by the two performance-task assessments. In addition, statements demonstrating in-depth processing were coded most frequently in narratives elicited during clinical interviews following the diffusion performance-task assessment. This study calls for individual teachers to design authentic assessment practices and apply them to daily classroom routines. Authentic assessment should be an integral part of the learning process and not merely an end result. In addition, teachers are encouraged to explicitly identify and model, through think-aloud methods, desired cognitive behaviors in the classroom.
Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation.

Science.gov (United States)

Godec, Jernej; Tan, Yan; Liberzon, Arthur; Tamayo, Pablo; Bhattacharya, Sanchita; Butte, Atul J; Mesirov, Jill P; Haining, W Nicholas

2016-01-19

Gene-expression profiling has become a mainstay in immunology, but subtle changes in gene networks related to biological processes are hard to discern when comparing various datasets. For instance, conservation of the transcriptional response to sepsis in mouse models and human disease remains controversial. To improve transcriptional analysis in immunology, we created ImmuneSigDB: a manually annotated compendium of ∼5,000 gene-sets from diverse cell states, experimental manipulations, and genetic perturbations in immunology. Analysis using ImmuneSigDB identified signatures induced in activated myeloid cells and differentiating lymphocytes that were highly conserved between humans and mice. Sepsis triggered conserved patterns of gene expression in humans and mouse models. However, we also identified species-specific biological processes in the sepsis transcriptional response: although both species upregulated phagocytosis-related genes, a mitosis signature was specific to humans. ImmuneSigDB enables granular analysis of transcriptomic data to improve biological understanding of immune processes of the human and mouse immune systems. Copyright © 2016 Elsevier Inc. All rights reserved.
New insights into the apoptotic process in mollusks: characterization of caspase genes in Mytilus galloprovincialis.

Directory of Open Access Journals (Sweden)

Alejandro Romero

2011-02-01

Full Text Available Apoptosis is an essential biological process in the development and maintenance of immune system homeostasis. Caspase proteins constitute the core of the apoptotic machinery and can be categorized as either initiators or effectors of apoptosis. Although the genes encoding caspase proteins have been described in vertebrates and in almost all invertebrate phyla, there are few reports describing the initiator and executioner caspases or the modulation of their expression by different stimuli in different apoptotic pathways in bivalves. In the present work, we characterized two initiator and four executioner caspases in the mussel Mytilus galloprovincialis. Both initiators and executioners showed structural features that make them different from other caspase proteins already described. Evaluation of the genes' tissue expression patterns revealed extremely high expression levels within the gland and gills, where the apoptotic process is highly active due to the clearance of damaged cells. Hemocytes also showed high expression values, probably due to the role of apoptosis in the defense against pathogens. To understand the mechanisms of caspase gene regulation, hemocytes were treated with UV-light, environmental pollutants and pathogen-associated molecular patterns (PAMPs), and apoptosis was evaluated by microscopy, flow cytometry and qPCR techniques. Our results suggest that the apoptotic process could be tightly regulated in bivalve mollusks by overexpression/suppression of caspase genes; additionally, there is evidence of caspase-specific responses to pathogens and pollutants. The apoptotic process in mollusks has a similar complexity to that of vertebrates, but presents unique features that may be related to recurrent exposure to environmental changes, pollutants and pathogens imposed by their sedentary nature.

Predicting cellular growth from gene expression signatures.

Directory of Open Access Journals (Sweden)

Edoardo M Airoldi

2009-01-01

Full Text Available Maintaining balanced growth in a changing environment is a fundamental systems-level challenge for cellular physiology, particularly in microorganisms. While the complete set of regulatory and functional pathways supporting growth and cellular proliferation is not yet known, portions of them are well understood. In particular, cellular proliferation is governed by mechanisms that are highly conserved from unicellular to multicellular organisms, and the disruption of these processes in metazoans is a major factor in the development of cancer. In this paper, we develop statistical methodology to identify quantitative aspects of the regulatory mechanisms underlying cellular proliferation in Saccharomyces cerevisiae. We find that the expression levels of a small set of genes can be exploited to predict the instantaneous growth rate of any cellular culture with high accuracy. The predictions obtained in this fashion are robust to changing biological conditions, experimental methods, and technological platforms. The proposed model is also effective in predicting growth rates for the related yeast Saccharomyces bayanus and the highly diverged yeast Schizosaccharomyces pombe, suggesting that the underlying regulatory signature is conserved across a wide range of unicellular evolution. We investigate the biological significance of the gene expression signature that the predictions are based upon from multiple perspectives: by perturbing the regulatory network through the Ras/PKA pathway, observing strong upregulation of growth rate even in the absence of appropriate nutrients, and discovering putative transcription factor binding sites, observing enrichment in growth-correlated genes. More broadly, the proposed methodology enables biological insights about growth at an instantaneous time scale, inaccessible by direct experimental methods. Data and tools enabling others to apply our methods are available at http://function.princeton.edu/growthrate.
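To make the idea of predicting growth rate from a small expression signature concrete, here is a deliberately simplified sketch. It is not the authors' statistical method, which the abstract does not specify; it assumes a single summary score (the mean expression of hypothetical signature genes) and an ordinary least-squares line fitted to made-up calibration data.

```python
def signature_score(expression: dict, signature_genes: list) -> float:
    """Mean expression of the signature genes in one sample."""
    return sum(expression[g] for g in signature_genes) / len(signature_genes)


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Hypothetical signature and synthetic calibration data (expression, growth rate).
signature = ["geneA", "geneB"]
training = [
    ({"geneA": 1.0, "geneB": 3.0}, 0.10),
    ({"geneA": 2.0, "geneB": 4.0}, 0.15),
    ({"geneA": 4.0, "geneB": 4.0}, 0.20),
]
scores = [signature_score(expr, signature) for expr, _ in training]
rates = [rate for _, rate in training]
intercept, slope = fit_line(scores, rates)


def predict_growth_rate(expression: dict) -> float:
    """Predict growth rate for a new sample from its signature score."""
    return intercept + slope * signature_score(expression, signature)


print(predict_growth_rate({"geneA": 5.0, "geneB": 5.0}))
```

A real application would need many more genes and samples, plus cross-validation across conditions and platforms, which is the kind of robustness the abstract claims for the published model.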
Biologic phosphorus elimination - influencing parameters, boundary conditions, process optimization

International Nuclear Information System (INIS)

Dai Xiaohu.

1992-01-01

This paper first presents a systematic study of the basic process of biologic phosphorus elimination as employed by the original 'Phoredox (Main Stream) Process'. The conditions governing the process and the factors influencing its performance were determined by trial operation. A stationary model was developed for the purpose of modelling biologic phosphorus elimination in such a main stream process and optimising the dimensioning. The validity of the model was confirmed by operational data given in the literature and by operational data from the authors' own semitechnical-scale experimental plant. The model permits simulation of the values to be expected for effluent phosphorus and phosphate concentrations for given influent data and boundary conditions. It is thus possible to dimension a plant for accommodation of the original Phoredox (Main Stream) Process or any similar phosphorus eliminating plant that is to work according to the principle of the main stream process. (orig./EF) [de
Limited agreement of independent RNAi screens for virus-required host genes owes more to false-negative than false-positive factors.

Directory of Open Access Journals (Sweden)

Linhui Hao

Full Text Available Systematic, genome-wide RNA interference (RNAi) analysis is a powerful approach to identify gene functions that support or modulate selected biological processes. An emerging challenge shared with some other genome-wide approaches is that independent RNAi studies often show limited agreement in their lists of implicated genes. To better understand this, we analyzed four genome-wide RNAi studies that identified host genes involved in influenza virus replication. These studies collectively identified and validated the roles of 614 cell genes, but pair-wise overlap among the four gene lists was only 3% to 15% (average 6.7%). However, a number of functional categories were overrepresented in multiple studies. The pair-wise overlap of these enriched-category lists was high, ∼19%, implying more agreement among studies than apparent at the gene level. Probing this further, we found that the gene lists implicated by independent studies were highly connected in interacting networks by independent functional measures such as protein-protein interactions, at rates significantly higher than predicted by chance. We also developed a general, model-based approach to gauge the effects of false-positive and false-negative factors and to estimate, from a limited number of studies, the total number of genes involved in a process. For influenza virus replication, this novel statistical approach estimates the total number of cell genes involved to be ∼2,800. This and multiple other aspects of our experimental and computational results imply that, when following good quality control practices, the low overlap between studies is primarily due to false negatives rather than false-positive gene identifications. These results and methods have implications for and applications to multiple forms of genome-wide analysis.
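The pair-wise overlap figures quoted above can be illustrated with a short sketch. The abstract does not say how overlap was defined, so the Jaccard index (shared genes divided by the union of both lists) is assumed here, and the screen names and gene identifiers are invented for illustration.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Shared genes divided by the union of both lists (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_overlap(gene_lists: dict) -> dict:
    """Jaccard overlap for every unordered pair of named gene lists."""
    return {
        (name1, name2): jaccard(gene_lists[name1], gene_lists[name2])
        for name1, name2 in combinations(sorted(gene_lists), 2)
    }


# Made-up gene lists, for illustration only.
screens = {
    "screen_A": {"G1", "G2", "G3", "G4"},
    "screen_B": {"G3", "G4", "G5"},
    "screen_C": {"G6", "G7"},
}
overlaps = pairwise_overlap(screens)
for pair, value in overlaps.items():
    print(pair, f"{value:.0%}")
mean_overlap = sum(overlaps.values()) / len(overlaps)
print(f"mean pair-wise overlap: {mean_overlap:.1%}")
```

The same function applies unchanged to lists of enriched functional categories, which is the comparison the abstract reports as showing higher agreement than the gene-level lists.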
A systems biology approach reveals that tissue tropism to West Nile virus is regulated by antiviral genes and innate immune cellular processes.

Directory of Open Access Journals (Sweden)

Mehul S Suthar

2013-02-01

Full Text Available The actions of the RIG-I like receptor (RLR) and type I interferon (IFN) signaling pathways are essential for a protective innate immune response against the emerging flavivirus West Nile virus (WNV). In mice lacking RLR or IFN signaling pathways, WNV exhibits enhanced tissue tropism, indicating that specific host factors of innate immune defense restrict WNV infection and dissemination in peripheral tissues. However, the immune mechanisms by which the RLR and IFN pathways coordinate and function to impart restriction of WNV infection are not well defined. Using a systems biology approach, we defined the host innate immune response signature and actions that restrict WNV tissue tropism. Transcriptional profiling and pathway modeling to compare WNV-infected permissive (spleen) and nonpermissive (liver) tissues showed high enrichment for inflammatory responses, including pattern recognition receptors and IFN signaling pathways, that define restriction of WNV replication in the liver. Assessment of infected livers from Mavs(-/-) × Ifnar(-/-) mice revealed the loss of expression of several key components within the natural killer (NK) cell signaling pathway, including genes associated with NK cell activation, inflammatory cytokine production, and NK cell receptor signaling. In vivo analysis of hepatic immune cell infiltrates from WT mice demonstrated that WNV infection leads to an increase in NK cell numbers with enhanced proliferation, maturation, and effector action. In contrast, livers from Mavs(-/-) × Ifnar(-/-) infected mice displayed reduced immune cell infiltration, including a significant reduction in NK cell numbers. Analysis of cocultures of dendritic and NK cells revealed both cell-intrinsic and -extrinsic roles for the RLR and IFN signaling pathways to regulate NK cell effector activity. Taken together, these observations reveal a complex innate immune signaling network, regulated by the RLR and IFN signaling pathways, that drives tissue
Mathematical methods in biology and neurobiology

CERN Document Server

Jost, Jürgen

2014-01-01

Mathematical models can be used to meet many of the challenges and opportunities offered by modern biology. The description of biological phenomena requires a range of mathematical theories. This is the case particularly for the emerging field of systems biology. Mathematical Methods in Biology and Neurobiology introduces and develops these mathematical structures and methods in a systematic manner. It studies: • discrete structures and graph theory • stochastic processes • dynamical systems and partial differential equations • optimization and the calculus of variations. The biological applications range from molecular to evolutionary and ecological levels, for example: • cellular reaction kinetics and gene regulation • biological pattern formation and chemotaxis • the biophysics and dynamics of neurons • the coding of information in neuronal systems • phylogenetic tree reconstruction • branching processes and population genetics • optimal resource allocation • sexual recombi...
Age by Disease Biological Interactions: Implications for Late-Life Depression

Directory of Open Access Journals (Sweden)

Brandon McKinney

2012-11-01

Full Text Available Onset of depressive symptoms after the age of 65, or late-life depression (LLD), is common and poses a significant burden on affected individuals, caretakers and society. Evidence suggests a unique biological basis for LLD, but current hypotheses do not account for its pathophysiological complexity. Here we propose a novel etiological framework for LLD, the age-by-disease biological interaction hypothesis, based on the observations that the subset of genes that undergoes lifelong progressive changes in expression is restricted to a specific set of biological processes, and that a disproportionate number of these age-dependent genes have been previously and similarly implicated in neurodegenerative and neuropsychiatric disorders, including depression. The age-by-disease biological interaction hypothesis posits that age-dependent biological processes (i) are pushed in LLD-promoting directions by changes in gene expression naturally occurring during brain aging, which (ii) directly contribute to pathophysiological mechanisms of LLD, and (iii) that individual variability in rates of age-dependent changes determines risk or resiliency to develop age-related disorders, including LLD. We review observations supporting this hypothesis, including consistent and specific age-dependent changes in brain gene expression, and their overlap with neuropsychiatric and neurodegenerative disease pathways. We then review preliminary reports supporting the genetic component of this hypothesis. Other potential biological mediators of age-dependent gene changes are proposed. We speculate that studies examining the relative contribution of these mechanisms to age-dependent changes and related disease mechanisms will not only provide critical information on the biology of normal aging of the human brain, but will inform our understanding of age-dependent diseases, in time fostering the development of new interventions for prevention and treatment of age-dependent diseases.
Immuno-Oncology-The Translational Runway for Gene Therapy: Gene Therapeutics to Address Multiple Immune Targets.

Science.gov (United States)

Weß, Ludger; Schnieders, Frank

2017-12-01

Cancer therapy is once again experiencing a paradigm shift. This shift is based on extensive clinical experience demonstrating that cancer cannot be successfully fought by addressing only single targets or pathways. Even the combination of several neo-antigens in cancer vaccines is not sufficient for successful, lasting tumor eradication. The focus has therefore shifted to the immune system's role in cancer and the striking abilities of cancer cells to manipulate and/or deactivate the immune system. Researchers and pharma companies have started to target the processes and cells known to support immune surveillance and the elimination of tumor cells. Immune processes, however, require novel concepts beyond the traditional "single-target-single drug" paradigm and need parallel targeting of diverse cells and mechanisms. This review gives a perspective on the role of gene therapy technologies in the evolving immuno-oncology space and identifies gene therapy as a major driver in the development and regulation of effective cancer immunotherapy. Present challenges and breakthroughs ranging from chimeric antigen receptor T-cell therapy, gene-modified oncolytic viruses, combination cancer vaccines, to RNA therapeutics are spotlighted. Gene therapy is recognized as the most prominent technology enabling effective immuno-oncology strategies.
When things don't add up: quantifying impacts of multiple stressors from individual metabolism to ecosystem processing.

Science.gov (United States)

Galic, Nika; Sullivan, Lauren L; Grimm, Volker; Forbes, Valery E

2018-04-01

Ecosystems are exposed to multiple stressors which can compromise functioning and service delivery. These stressors often co-occur and interact in different ways which are not yet fully understood. Here, we applied a population model representing a freshwater amphipod feeding on leaf litter in forested streams. We simulated impacts of hypothetical stressors, individually and in pairwise combinations that target the individuals' feeding, maintenance, growth and reproduction. Impacts were quantified by examining responses at three levels of biological organisation: individual-level body sizes and cumulative reproduction, population-level abundance and biomass and ecosystem-level leaf litter decomposition. Interactive effects of multiple stressors at the individual level were mostly antagonistic, that is, less negative than expected. Most population- and ecosystem-level responses to multiple stressors were stronger than expected from an additive model, that is, synergistic. Our results suggest that across levels of biological organisation responses to multiple stressors are rarely only additive. We suggest methods for efficiently quantifying impacts of multiple stressors at different levels of biological organisation. © 2018 John Wiley & Sons Ltd/CNRS.
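The comparison against an additive model described above can be sketched as a small classifier. This is only an illustration under stated assumptions: effects are expressed as fractional changes relative to an unstressed control and are assumed to be zero or negative (reductions), the additive expectation is taken as the plain sum of the single-stressor effects, and the `tolerance` band is a hypothetical choice rather than anything given in the abstract.

```python
def classify_interaction(effect_a: float, effect_b: float,
                         effect_combined: float, tolerance: float = 0.05) -> str:
    """Classify a two-stressor response relative to an additive expectation.

    Effects are fractional changes versus control, assumed <= 0 (e.g. -0.2
    means a 20% reduction). `tolerance` is a hypothetical equivalence band.
    """
    expected = effect_a + effect_b
    if abs(effect_combined - expected) <= tolerance:
        return "additive"
    if effect_combined < expected:
        return "synergistic"   # more negative than expected
    return "antagonistic"      # less negative than expected


# Made-up responses for two stressors that reduce a response by 20% and 30%.
print(classify_interaction(-0.2, -0.3, -0.70))  # stronger than the sum
print(classify_interaction(-0.2, -0.3, -0.35))  # weaker than the sum
print(classify_interaction(-0.2, -0.3, -0.50))  # matches the sum
```

Applying the same check separately to individual-, population- and ecosystem-level endpoints would mirror the abstract's observation that the interaction type can differ between levels of biological organisation.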
Multiple photon infrared processes in polyatomic molecules

International Nuclear Information System (INIS)

Harrison, R.G.; Butcher, S.R.

1980-01-01

This paper reviews current understanding of the process of multiple photon excitation and dissociation of polyatomic molecules, whereby in the presence of an intense infrared laser field a molecule may absorb upwards of 30 photons. The application of this process to new photochemistry and in particular laser isotope separation is also discussed. (author)
Silicon Regulates Potential Genes Involved in Major Physiological Processes in Plants to Combat Stress

Directory of Open Access Journals (Sweden)

Abinaya Manivannan

2017-08-01

Full Text Available Silicon (Si), the quasi-essential element, occurs as the second most abundant element in the earth's crust. The biological importance of Si in the plant kingdom has become inevitable, particularly under stressed environments. In general, plants are classified as high, medium, and low silicon accumulators based on the ability of roots to absorb Si. The uptake of Si directly influences the positive effects attributed to the plant, but Si supplementation proves to mitigate stress and recover plant growth even in low accumulating plants like tomato. The application of Si in soil as well as soil-less cultivation systems has resulted in the enhancement of quantitative and qualitative traits of plants even under stressed environments. Silicon possesses several mechanisms to regulate the physiological, biochemical, and antioxidant metabolism in plants to combat abiotic and biotic stresses. Nevertheless, very few reports are available on the aspect of Si-mediated molecular regulation of genes with a potential role in stress tolerance. The recent advancements in the era of genomics and transcriptomics have opened an avenue for the determination of the molecular rationale associated with Si amendment for stress alleviation in plants. Therefore, the present endeavor has attempted to describe the recent discoveries related to the regulation of vital genes involved in photosynthesis, transcription regulation, defense, water transport, polyamine synthesis, and housekeeping genes during abiotic and biotic stress alleviation by Si. Furthermore, an overview of Si-mediated modulation of multiple genes involved in stress response pathways such as the phenylpropanoid pathway, jasmonic acid pathway, and ABA-dependent or independent regulatory pathway has been discussed in this review.
Dioxin exposure of human CD34+ hemopoietic cells induces gene expression modulation that recapitulates its in vivo clinical and biological effects

International Nuclear Information System (INIS)

Fracchiolla, Nicola Stefano; Todoerti, Katia; Bertazzi, Pier Alberto; Servida, Federica; Corradini, Paolo; Carniti, Cristiana; Colombi, Antonio; Cecilia Pesatori, Angela; Neri, Antonino; Deliliers, Giorgio Lambertenghi

2011-01-01

2,3,7,8-Tetrachlorodibenzo-p-dioxin (TCDD) has a large number of biological effects, including skin, cardiovascular, neurologic diseases, diabetes, infertility, cancers and immunotoxicity. We analysed the in vitro TCDD effects on human CD34+ cells and tested the gene expression modulation by means of microarray analyses before and after TCDD exposure. We identified 257 differentially modulated probe sets, identifying 221 well characterized genes. A large proportion of these were associated with cell adhesion and/or angiogenesis and with transcription regulation. Synaptic transmission and visual perception functions, with the particular involvement of the GABAergic pathway, were also significantly modulated. Numerous transcripts involved in cell cycle or cell proliferation, immune response, signal transduction, ion channel activity or calcium ion binding, tissue development and differentiation, female or male fertility or in several metabolic pathways were also affected after dioxin exposure. The transcriptional profile induced by TCDD treatment on human CD34+ cells strikingly reproduces the clinical and biological effects observed in individuals exposed to dioxin and in biological experimental systems. Our data support a role of dioxin in the neoplastic transformation of hemopoietic stem cells and in immune modulation processes after in vivo exposure, as indicated by the epidemiologic data in populations accidentally exposed to dioxin, providing a molecular basis for it. In addition, TCDD alters genes associated with glucidic and lipidic metabolisms, with GABAergic transmission or involved in male and female fertility, thus providing a possible explanation of the diabetogenic, dyslipidemic, neurologic and fertility effects induced by in vivo TCDD exposure.
SBR-Blood: systems biology repository for hematopoietic cells.

Science.gov (United States)

Lichtenberg, Jens; Heuston, Elisabeth F; Mishra, Tejaswini; Keller, Cheryl A; Hardison, Ross C; Bodine, David M

2016-01-04

Extensive research into hematopoiesis (the development of blood cells) over several decades has generated large sets of expression and epigenetic profiles in multiple human and mouse blood cell types. However, there is no single location to analyze how gene regulatory processes lead to different mature blood cells. We have developed a new database framework called hematopoietic Systems Biology Repository (SBR-Blood), available online at http://sbrblood.nhgri.nih.gov, which allows user-initiated analyses for cell type correlations or gene-specific behavior during differentiation using publicly available datasets for array- and sequencing-based platforms from mouse hematopoietic cells. SBR-Blood organizes information by both cell identity and by hematopoietic lineage. The validity and usability of SBR-Blood has been established through the reproduction of workflows relevant to expression data, DNA methylation, histone modifications and transcription factor occupancy profiles. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
The implications of non-linear biological oscillations on human electrophysiology for electrohypersensitivity (EHS) and multiple chemical sensitivity (MCS).

Science.gov (United States)

Sage, Cindy

2015-01-01

The 'informational content' of Earth's electromagnetic signaling is like a set of operating instructions for human life. These environmental cues are dynamic and involve exquisitely low inputs (intensities) of critical frequencies with which all life on Earth evolved. Circadian and other temporal biological rhythms depend on these fluctuating electromagnetic inputs to direct gene expression, cell communication and metabolism, neural development, brainwave activity, neural synchrony, a diversity of immune functions, sleep and wake cycles, behavior and cognition. Oscillation is also a universal phenomenon, and biological systems of the heart, brain and gut are dependent on the cooperative actions of cells that function according to principles of non-linear, coupled biological oscillations for their synchrony. They are dependent on exquisitely timed cues from the environment at vanishingly small levels. Altered 'informational content' of environmental cues can swamp natural electromagnetic cues and result in dysregulation of normal biological rhythms that direct growth, development, metabolism and repair mechanisms. Pulsed electromagnetic fields (PEMF) and radiofrequency radiation (RFR) can have the devastating biological effects of disrupting homeostasis and desynchronizing normal biological rhythms that maintain health. Non-linear, weak field biological oscillations govern body electrophysiology, organize cell and tissue functions and maintain organ systems. Artificial bioelectrical interference can give false information (disruptive signaling) sufficient to affect critical pacemaker cells (of the heart, gut and brain) and desynchronize functions of these important cells that orchestrate function and maintain health. Chronic physiological stress undermines homeostasis whether it is chemically induced or electromagnetically induced (or both exposures are simultaneous contributors). This can eventually break down adaptive biological responses critical to health
Gene coexpression network analysis as a source of functional annotation for rice genes.

Directory of Open Access Journals (Sweden)

Kevin L Childs

Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa) gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional
Average multiplications in deep inelastic processes and their interpretation

International Nuclear Information System (INIS)

Kiselev, A.V.; Petrov, V.A.

1983-01-01

Inclusive production of hadrons in deep inelastic processes is considered. It is shown that at high energies the jet evolution in deep inelastic processes is mainly of nonperturbative character. With the increase of the final hadron state energy, the leading contribution to the average multiplicity comes from a parton subprocess due to production of massive quark and gluon jets and their further fragmentation, as the diquark contribution becomes less and less essential. The ratio of the total average multiplicity in deep inelastic processes to the average multiplicity in e+e- annihilation at high energies tends to unity.
Measuring Strategic Processing when Students Read Multiple Texts

Science.gov (United States)

Braten, Ivar; Stromso, Helge I.

2011-01-01

This study explored the dimensionality of multiple-text comprehension strategies in a sample of 216 Norwegian education undergraduates who read seven separate texts on a science topic and immediately afterwards responded to a self-report inventory focusing on strategic multiple-text processing in that specific task context. Two dimensions were…
Epigenetics: beyond genes | Fossey | Southern Forests: a Journal of ...

African Journals Online (AJOL)

Gene regulatory processes lead to differential gene expression and are referred to as epigenetic phenomena; these are ubiquitous processes in the biological world. These reversible heritable changes concern DNA and RNA, their interactions, and chromatin-mediated and RNA-mediated mechanisms. DNA compaction is ...
Automatically identifying gene/protein terms in MEDLINE abstracts.

Science.gov (United States)

Yu, Hong; Hatzivassiloglou, Vasileios; Rzhetsky, Andrey; Wilbur, W John

2002-01-01

Natural language processing (NLP) techniques are used to extract information automatically from computer-readable literature. In biology, the identification of terms corresponding to biological substances (e.g., genes and proteins) is a necessary step that precedes the application of other NLP systems that extract biological information (e.g., protein-protein interactions, gene regulation events, and biochemical pathways). We have developed GPmarkup (for "gene/protein-full name mark up"), a software system that automatically identifies gene/protein terms (i.e., symbols or full names) in MEDLINE abstracts. As part of the mark-up process, we also automatically generated a knowledge source of paired gene/protein symbols and full names (e.g., LARD for lymphocyte associated receptor of death) from MEDLINE. We found that many of the pairs in our knowledge source do not appear in the current GenBank database. Therefore, our methods may also be used for automatic lexicon generation. GPmarkup has 73% recall and 93% precision in identifying and marking up gene/protein terms in MEDLINE abstracts. A random sample of gene/protein symbols and full names and a sample set of marked up abstracts can be viewed at http://www.cpmc.columbia.edu/homepages/yuh9001/GPmarkup/. Contact: hy52@columbia.edu. Voice: 212-939-7028; fax: 212-666-0140.
The collective biology of the gene: Towards genetic dynamics engineering

International Nuclear Information System (INIS)

Chela-Flores, J.

1985-11-01

Chromatin dynamics is studied in terms of coupled vibrations (phonon pairing); this is shown to lead to a collective variable Δ, interpreted as a gene inhibition factor, which behaves as a biological switch turned off, not only by enzymatic action or metabolic energy, but also by means of an external probe: irradiation. We discuss the inactivation of the X chromosome and puffing. The relevance of being able to modulate Δ is emphasized, since it is equivalent to controlling chromatin dynamics without interfering with chromatin structure, unlike in the usual recombinant DNA techniques. (author)
Searching the literature for proteins facilitates the identification of biological processes, if advanced methods of analysis are linked: a case study on microgravity-caused changes in cells.

Science.gov (United States)

Bauer, Johann; Bussen, Markus; Wise, Petra; Wehland, Markus; Schneider, Sabine; Grimm, Daniela

2016-07-01

More than one hundred reports have been published about the characterization of cells from malignant and healthy tissues, as well as of endothelial cells and stem cells, exposed to microgravity conditions. We retrieved publications about microgravity-related studies on each type of cell, extracted the proteins mentioned therein and analyzed them with the aim of identifying biological processes affected by microgravity culture conditions. The analysis revealed 66 different biological processes, 19 of which were always detected when papers about the four types of cells were analyzed. Since a response to the removal of gravity is common to the different cell types, some of the 19 biological processes could play a role in cellular adaptation to microgravity. Applying computer programs to extract and analyze proteins and genes mentioned in publications is becoming essential for scientists interested in getting an overview of the rapidly growing fields of gravitational biology and space medicine.

REGγ is associated with multiple oncogenic pathways in human cancers

International Nuclear Information System (INIS)

He, Jing; Wang, Zhuo; Shi, Tieliu; Zhang, Pei; Chen, Rui; Li, Xiaotao; Cui, Long; Zeng, Yu; Wang, Guangqiang; Zhou, Ping; Yang, Yuanyuan; Ji, Lei; Zhao, Yanyan; Chen, Jiwu

2012-01-01

Recent studies suggest a role of the proteasome activator, REGγ, in cancer progression. Since there are limited numbers of known REGγ targets, it is not known which cancers and pathways are associated with REGγ. REGγ protein expression in four different cancers was investigated by immunohistochemistry (IHC) analysis. Following an NCBI Gene Expression Omnibus (GEO) database search and microarray platform validation, differential expression of REGγ in the corresponding cancers was statistically analyzed. Genes highly correlated with REGγ were defined based on Pearson's correlation coefficient. Functional links were estimated by Ingenuity Core analysis. Finally, validation was performed by RT-PCR analysis in established cancer cell lines and IHC in human colon cancer tissues. Here, we demonstrate overexpression of REGγ in four different cancer types by micro-tissue array analysis. Using meta-analysis of publicly available microarray databases and biological studies, we verified elevated REGγ gene expression in the four types of cancers and identified genes significantly correlated with REGγ expression, including genes in p53, Myc pathways, and multiple other cancer-related pathways. The predicted correlations were largely consistent with quantitative RT-PCR analysis. This study provides novel insights into REGγ gene expression profiles and their link to multiple cancer-related pathways in cancers. Our results indicate potentially important pathogenic roles of REGγ in multiple cancer types and implicate REGγ as a putative cancer marker.
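The correlation step in the abstract above (defining genes highly correlated with REGγ by Pearson's correlation coefficient) can be illustrated with a minimal sketch. The gene labels, the toy expression values and the 0.8 cut-off are all assumptions made for illustration; the abstract does not state the threshold or the data layout the authors used.

```python
from math import sqrt

def pearson(x, y):
    """Pearson's correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / (sd_x * sd_y)

def correlated_genes(expression, target, threshold=0.8):
    """Return (gene, r) pairs with |r| >= threshold against the target gene.

    `threshold` is a hypothetical cut-off, not a value from the abstract.
    """
    reference = expression[target]
    hits = []
    for gene, values in expression.items():
        if gene == target:
            continue
        r = pearson(reference, values)
        if abs(r) >= threshold:
            hits.append((gene, r))
    return sorted(hits, key=lambda item: -abs(item[1]))

# Toy expression profiles across five samples (invented numbers).
expression = {
    "REG_GAMMA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "GENE_A": [2.0, 4.0, 6.0, 8.0, 10.0],   # rises with the target
    "GENE_B": [5.0, 4.0, 3.0, 2.0, 1.0],    # falls as the target rises
    "GENE_C": [1.0, 3.0, 2.0, 5.0, 1.0],    # weakly related
}
hits = correlated_genes(expression, "REG_GAMMA", threshold=0.8)
```

In this toy data only GENE_A and GENE_B pass the cut-off; a real analysis would also need a significance test and multiple-testing correction, which this sketch omits.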
Process for sewage biological treatment from uranium

International Nuclear Information System (INIS)

Popa, K.; Cecal, A.; Craciun, I.

2004-01-01

The invention relates to sewage treatment, in particular to the biological treatment of sewage to remove radioactive waste, namely uranium. The process for biological treatment of sewage to remove uranium includes cultivation in the sewage of the aquatic plants Lemna minor and Spirulina platensis. The plant cultivation is carried out in two stages: Lemna minor is cultivated in the first stage and Spirulina platensis in the second. After the plant cultivation is finished, the biomass is separated. The result of the invention consists in increasing the accumulation of uranyl ions by the biomass of plants cultivated in the sewage.
Process for sewage biological treatment from uranium

International Nuclear Information System (INIS)

Popa, Karin; Cecal, Alexandru; Craciun, Iftimie Ionel; Rudic, Valeriu; Gulea, Aurelian; Cepoi, Liliana

2004-01-01

The invention relates to sewage treatment, in particular to the biological treatment of sewage to remove radioactive waste, namely uranium. The process for biological treatment of sewage to remove uranium includes cultivation in the sewage of the aquatic plants Lemna minor and Spirulina platensis. The plant cultivation is carried out in two stages: Lemna minor is cultivated in the first stage and Spirulina platensis in the second. After the plant cultivation is finished, the biomass is separated. The result of the invention consists in increasing the accumulation of uranyl ions by the biomass of plants cultivated in the sewage.
Effect of multiplicative noise on stationary stochastic process

Science.gov (United States)

Kargovsky, A. V.; Chikishev, A. Yu.; Chichigina, O. A.

2018-03-01

An open system that can be analyzed using the Langevin equation with multiplicative noise is considered. The stationary state of the system results from a balance of deterministic damping and random pumping simulated as noise with controlled periodicity. The dependence of statistical moments of the variable that characterizes the system on parameters of the problem is studied. A nontrivial decrease in the mean value of the main variable with an increase in noise stochasticity is revealed. Applications of the results in several physical, chemical, biological, and technical problems of natural and humanitarian sciences are discussed.
EasyClone: method for iterative chromosomal integration of multiple genes in Saccharomyces cerevisiae

DEFF Research Database (Denmark)

Jensen, Niels Bjerg; Strucko, Tomas; Kildegaard, Kanchana Rueksomtawin

2014-01-01

of multiple genes with an option of recycling selection markers. The vectors combine the advantage of efficient uracil excision reaction-based cloning and Cre-LoxP-mediated marker recycling system. The episomal and integrative vector sets were tested by inserting genes encoding cyan, yellow, and red...... fluorescent proteins into separate vectors and analyzing for co-expression of proteins by flow cytometry. Cells expressing genes encoding for the three fluorescent proteins from three integrations exhibited a much higher level of simultaneous expression than cells producing fluorescent proteins encoded...... on episomal plasmids, where correspondingly 95% and 6% of the cells were within a fluorescence interval of Log10 mean ± 15% for all three colors. We demonstrate that selective markers can be simultaneously removed using Cre-mediated recombination and all the integrated heterologous genes remain...
Finding Order in Randomness: Single-Molecule Studies Reveal Stochastic RNA Processing | Center for Cancer Research

Science.gov (United States)

Producing a functional eukaryotic messenger RNA (mRNA) requires the coordinated activity of several large protein complexes to initiate transcription, elongate nascent transcripts, splice together exons, and cleave and polyadenylate the 3’ end. Kinetic competition between these various processes has been proposed to regulate mRNA maturation, but this model could lead to multiple, randomly determined, or stochastic, pathways or outcomes. Regulatory checkpoints have been suggested as a means of ensuring quality control. However, current methods have been unable to tease apart the contributions of these processes at a single gene or on a time scale that could provide mechanistic insight. To begin to investigate the kinetic relationship between transcription and splicing, Daniel Larson, Ph.D., of CCR’s Laboratory of Receptor Biology and Gene Expression, and his colleagues employed a single-molecule RNA imaging approach to monitor production and processing of a human β-globin reporter gene in living cells.
Gene expression profiling for molecular classification of multiple myeloma in newly diagnosed patients

NARCIS (Netherlands)

Broyl, Annemiek; Hose, Dirk; Lokhorst, Henk; de Knegt, Yvonne; Peeters, Justine; Jauch, Anna; Bertsch, Uta; Buijs, Arjan; Stevens-Kroef, Marian; Beverloo, H. Berna; Vellenga, Edo; Zweegman, Sonja; Kersten, Marie-Josée; van der Holt, Bronno; el Jarari, Laila; Mulligan, George; Goldschmidt, Hartmut; van Duin, Mark; Sonneveld, Pieter

2010-01-01

To identify molecularly defined subgroups in multiple myeloma, gene expression profiling was performed on purified CD138(+) plasma cells of 320 newly diagnosed myeloma patients included in the Dutch-Belgian/German HOVON-65/GMMG-HD4 trial. Hierarchical clustering identified 10 subgroups; 6
Permethrin induction of multiple cytochrome P450 genes in insecticide resistant mosquitoes, Culex quinquefasciatus.

Science.gov (United States)

Gong, Youhui; Li, Ting; Zhang, Lee; Gao, Xiwu; Liu, Nannan

2013-01-01

The expression of some insect P450 genes can be induced by both exogenous and endogenous compounds, and there is evidence to suggest that multiple constitutively overexpressed P450 genes are co-responsible for the development of resistance to permethrin in resistant mosquitoes. This study characterized the permethrin induction profiles of P450 genes known to be constitutively overexpressed in resistant mosquitoes, Culex quinquefasciatus. The expression of 7 of the 19 P450 genes (CYP325K3v1, CYP4D42v2, CYP9J45, (CYP) CPIJ000926, CYP325G4, CYP4C38, CYP4H40) in the HAmCqG8 strain increased more than 2-fold after exposure to permethrin at an LC50 concentration (10 ppm) compared to their acetone-treated counterparts; no significant differences in the expression of these P450 genes in susceptible S-Lab mosquitoes were observed after permethrin treatment. Eleven of the fourteen P450 genes overexpressed in the MAmCqG6 strain (CYP9M10, CYP6Z12, CYP9J33, CYP9J43, CYP9J34, CYP306A1, CYP6Z15, CYP9J45, CYPPAL1, CYP4C52v1, CYP9J39) were also induced more than 2-fold after exposure to an LC50 (0.7 ppm) dose of permethrin. No significant induction in P450 gene expression was observed in the susceptible S-Lab mosquitoes after permethrin treatment except for CYP6Z15 and CYP9J39, suggesting that permethrin induction of these two P450 genes is common to both susceptible and resistant mosquitoes, while the induction of the others is specific to insecticide-resistant mosquitoes. These results demonstrate that multiple P450 genes are co-up-regulated in insecticide-resistant mosquitoes through both constitutive overexpression and induction mechanisms, providing additional support for their involvement in the detoxification of insecticides and the development of insecticide resistance.
Correlating Information Contents of Gene Ontology Terms to Infer Semantic Similarity of Gene Products

Directory of Open Access Journals (Sweden)

Mingxin Gan

2014-01-01

Full Text Available Successful applications of the gene ontology to the inference of functional relationships between gene products in recent years have raised the need for computational methods to automatically calculate semantic similarity between gene products based on semantic similarity of gene ontology terms. Nevertheless, existing methods, though having been widely used in a variety of applications, may significantly overestimate semantic similarity between genes that are actually not functionally related, thereby yielding misleading results in applications. To overcome this limitation, we propose to represent a gene product as a vector that is composed of information contents of gene ontology terms annotated for the gene product, and we suggest calculating similarity between two gene products as the relatedness of their corresponding vectors using three measures: Pearson’s correlation coefficient, cosine similarity, and the Jaccard index. We focus on the biological process domain of the gene ontology and annotations of yeast proteins to study the effectiveness of the proposed measures. Results show that semantic similarity scores calculated using the proposed measures are more consistent with known biological knowledge than those derived using a list of existing methods, suggesting the effectiveness of our method in characterizing functional relationships between gene products.
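The vector representation described in the abstract above can be sketched as follows. This is a hedged illustration, not the authors' implementation: the term identifiers (`T1`..`T4`), the annotation counts, the use of IC = -log(frequency), and the choice of the weighted (min/max) form of the Jaccard index for real-valued vectors are all assumptions.

```python
from math import log, sqrt

def information_content(term_counts, total):
    """IC of each term as -log of its annotation frequency (assumed definition)."""
    return {term: -log(count / total) for term, count in term_counts.items()}

def ic_vector(annotations, ic, terms):
    """Vector over `terms`: the term's IC if the gene product is annotated, else 0."""
    return [ic[t] if t in annotations else 0.0 for t in terms]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def pearson(u, v):
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv) if su and sv else 0.0

def weighted_jaccard(u, v):
    """Weighted Jaccard: sum of element-wise minima over sum of maxima."""
    denom = sum(max(a, b) for a, b in zip(u, v))
    return sum(min(a, b) for a, b in zip(u, v)) / denom if denom else 0.0

# Hypothetical annotation counts out of 100 annotated gene products.
terms = ["T1", "T2", "T3", "T4"]
ic = information_content({"T1": 50, "T2": 10, "T3": 5, "T4": 100}, total=100)

gene_a = ic_vector({"T1", "T2"}, ic, terms)
gene_b = ic_vector({"T3"}, ic, terms)        # shares no terms with gene_a
gene_c = ic_vector({"T2", "T3"}, ic, terms)  # shares T2 with gene_a

scores = {
    "cosine": cosine(gene_a, gene_c),
    "pearson": pearson(gene_a, gene_c),
    "jaccard": weighted_jaccard(gene_a, gene_c),
}
```

Weighting by information content means that a shared rare term (here `T2`) contributes more to the score than a shared common term, which is the property the abstract relies on to avoid overestimating similarity.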
Large-scale modeling of condition-specific gene regulatory networks by information integration and inference.

Science.gov (United States)

Ellwanger, Daniel Christian; Leonhardt, Jörn Florian; Mewes, Hans-Werner

2014-12-01

Understanding how regulatory networks globally coordinate the response of a cell to changing conditions, such as perturbations by shifting environments, is an elementary challenge in systems biology which has yet to be met. Genome-wide gene expression measurements are high dimensional as these are reflecting the condition-specific interplay of thousands of cellular components. The integration of prior biological knowledge into the modeling process of systems-wide gene regulation enables the large-scale interpretation of gene expression signals in the context of known regulatory relations. We developed COGERE (http://mips.helmholtz-muenchen.de/cogere), a method for the inference of condition-specific gene regulatory networks in human and mouse. We integrated existing knowledge of regulatory interactions from multiple sources to a comprehensive model of prior information. COGERE infers condition-specific regulation by evaluating the mutual dependency between regulator (transcription factor or miRNA) and target gene expression using prior information. This dependency is scored by the non-parametric, nonlinear correlation coefficient η² (eta squared) that is derived by a two-way analysis of variance. We show that COGERE significantly outperforms alternative methods in predicting condition-specific gene regulatory networks on simulated data sets. Furthermore, by inferring the cancer-specific gene regulatory network from the NCI-60 expression study, we demonstrate the utility of COGERE to promote hypothesis-driven clinical research. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
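To make the η² score mentioned above more concrete, here is a deliberately simplified sketch. It computes η² from a one-way layout (between-group sum of squares over total sum of squares), not the two-way analysis of variance that the abstract says COGERE uses, and the grouping of target-gene expression by regulator level is an assumption made purely for illustration.

```python
def eta_squared(groups):
    """One-way eta squared: SS_between / SS_total.

    `groups` is a list of lists, e.g. target-gene expression values
    binned by regulator expression level (a hypothetical binning).
    """
    values = [v for group in groups for v in group]
    grand_mean = sum(values) / len(values)
    ss_total = sum((v - grand_mean) ** 2 for v in values)
    ss_between = sum(
        len(group) * (sum(group) / len(group) - grand_mean) ** 2
        for group in groups
    )
    return ss_between / ss_total if ss_total else 0.0

# Target expression fully determined by regulator level: eta squared is 1.
dependent = eta_squared([[1.0, 1.0, 1.0], [5.0, 5.0, 5.0]])

# Same distribution in both regulator bins: eta squared is 0.
independent = eta_squared([[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]])
```

Because η² only compares group means with overall spread, it can pick up non-monotonic dependencies that a linear correlation coefficient would miss, which fits the abstract's description of it as a nonlinear measure.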
Bioinformatics resource manager v2.3: an integrated software environment for systems biology with microRNA and cross-species analysis tools

Science.gov (United States)

2012-01-01

Background MicroRNAs (miRNAs) are noncoding RNAs that direct post-transcriptional regulation of protein coding genes. Recent studies have shown miRNAs are important for controlling many biological processes, including nervous system development, and are highly conserved across species. Given their importance, computational tools are necessary for analysis, interpretation and integration of high-throughput (HTP) miRNA data in an increasing number of model species. The Bioinformatics Resource Manager (BRM) v2.3 is a software environment for data management, mining, integration and functional annotation of HTP biological data. In this study, we report recent updates to BRM for miRNA data analysis and cross-species comparisons across datasets. Results BRM v2.3 has the capability to query predicted miRNA targets from multiple databases, retrieve potential regulatory miRNAs for known genes, integrate experimentally derived miRNA and mRNA datasets, perform ortholog mapping across species, and retrieve annotation and cross-reference identifiers for an expanded number of species. Here we use BRM to show that developmental exposure of zebrafish to 30 µM nicotine from 6–48 hours post fertilization (hpf) results in behavioral hyperactivity in larval zebrafish and alteration of putative miRNA gene targets in whole embryos at developmental stages that encompass early neurogenesis. We show typical workflows for using BRM to integrate experimental zebrafish miRNA and mRNA microarray datasets with example retrievals for zebrafish, including pathway annotation and mapping to human orthologs. Functional analysis of differentially regulated (p<0.05) gene targets in BRM indicates that nicotine exposure disrupts genes involved in neurogenesis, possibly through misregulation of nicotine-sensitive miRNAs. Conclusions BRM provides the ability to mine complex data for identification of candidate miRNAs or pathways that drive phenotypic outcome and, therefore, is a useful hypothesis
A Comprehensive Experiment for Molecular Biology: Determination of Single Nucleotide Polymorphism in Human REV3 Gene Using PCR-RFLP

Science.gov (United States)

Zhang, Xu; Shao, Meng; Gao, Lu; Zhao, Yuanyuan; Sun, Zixuan; Zhou, Liping; Yan, Yongmin; Shao, Qixiang; Xu, Wenrong; Qian, Hui

2017-01-01

Laboratory exercise is helpful for medical students to understand the basic principles of molecular biology and to learn about the practical applications of molecular biology. We have designed a lab course on molecular biology about the determination of single nucleotide polymorphism (SNP) in human REV3 gene, the product of which is a subunit of…
Stochastic processes, multiscale modeling, and numerical methods for computational cellular biology

CERN Document Server

2017-01-01

This book focuses on the modeling and mathematical analysis of stochastic dynamical systems along with their simulations. The collected chapters review fundamental and current topics and approaches to dynamical systems in cellular biology. This text aims to develop improved mathematical and computational methods with which to study biological processes. At the scale of a single cell, stochasticity becomes important due to low copy numbers of biological molecules, such as mRNA and proteins that take part in biochemical reactions driving cellular processes. When trying to describe such biological processes, the traditional deterministic models are often inadequate, precisely because of these low copy numbers. This book presents stochastic models, which are necessary to account for small particle numbers and extrinsic noise sources. The complexity of these models depends upon whether the biochemical reactions are diffusion-limited or reaction-limited. In the former case, one needs to adopt the framework of s...
Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

Science.gov (United States)

Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

2017-10-01

Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.
The association of telomere length and genetic variation in telomere biology genes.

Science.gov (United States)

Mirabello, Lisa; Yu, Kai; Kraft, Peter; De Vivo, Immaculata; Hunter, David J; Prescott, Jennifer; Wong, Jason Y Y; Chatterjee, Nilanjan; Hayes, Richard B; Savage, Sharon A

2010-09-01

Telomeres cap chromosome ends and are critical for genomic stability. Many telomere-associated proteins are important for telomere length maintenance. Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) in genes encoding telomere-associated proteins (RTEL1 and TERT-CLPTM1L) as markers of cancer risk. We conducted an association study of telomere length and 743 SNPs in 43 telomere biology genes. Telomere length in peripheral blood DNA was determined by Q-PCR in 3,646 participants from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial and Nurses' Health Study. We investigated associations by SNP, gene, and pathway (functional group). We found no associations between telomere length and SNPs in TERT-CLPTM1L or RTEL1. Telomere length was not significantly associated with specific functional groups. Thirteen SNPs from four genes (MEN1, MRE11A, RECQL5, and TNKS) were significantly associated with telomere length. The strongest findings were in MEN1 (gene-based P=0.006), which encodes menin, a protein that associates with the telomerase promoter and may negatively regulate telomerase. This large association study did not find strong associations with telomere length. The combination of limited diversity and evolutionary conservation suggests that these genes may be under selective pressure. More work is needed to explore the role of genetic variants in telomere length regulation. Published 2010 Wiley-Liss, Inc.
Test of Science Process Skills of Biology Students towards Developing of Learning Exercises

Directory of Open Access Journals (Sweden)

Judith S. Rabacal

2016-11-01

Full Text Available This descriptive study aimed to determine the academic achievement in science process skills of the BS Biology students of Northern Negros State College of Science and Technology, Philippines, with the end view of developing learning exercises that will enhance their academic achievement in basic and integrated science process skills. The data were obtained using a validated questionnaire. The mean was used to determine academic achievement in the above-mentioned science process skills; the t-test for independent means was used to determine significant differences in the students' achievement in science process skills; and the Pearson product-moment correlation coefficient was used to determine the relationship between basic and integrated science process skills. A 0.05 level of significance was used to decide whether each hypothesis set in the study would be rejected or accepted. Findings revealed that the students' academic achievement in basic and integrated science process skills was average, and that there were no significant differences in academic performance when students were grouped according to year level and gender. There was, however, a significant difference in academic achievement between basic and integrated science process skills, as well as a significant relationship between achievement in the two.
Genomics-Based Discovery of Plant Genes for Synthetic Biology of Terpenoid Fragrances: A Case Study in Sandalwood oil Biosynthesis.

Science.gov (United States)

Celedon, J M; Bohlmann, J

2016-01-01

Terpenoid fragrances are powerful mediators of ecological interactions in nature and have a long history of traditional and modern industrial applications. Plants produce a great diversity of fragrant terpenoid metabolites, which make them a superb source of biosynthetic genes and enzymes. Advances in fragrance gene discovery have enabled new approaches in synthetic biology of high-value speciality molecules toward applications in the fragrance and flavor, food and beverage, cosmetics, and other industries. Rapid developments in transcriptome and genome sequencing of nonmodel plant species have accelerated the discovery of fragrance biosynthetic pathways. In parallel, advances in metabolic engineering of microbial and plant systems have established platforms for synthetic biology applications of some of the thousands of plant genes that underlie fragrance diversity. While many fragrance molecules (eg, simple monoterpenes) are abundant in readily renewable plant materials, some highly valuable fragrant terpenoids (eg, santalols, ambroxides) are rare in nature and interesting targets for synthetic biology. As a representative example for genomics/transcriptomics enabled gene and enzyme discovery, we describe a strategy used successfully for elucidation of a complete fragrance biosynthetic pathway in sandalwood (Santalum album) and its reconstruction in yeast (Saccharomyces cerevisiae). We address questions related to the discovery of specific genes within large gene families and recovery of rare gene transcripts that are selectively expressed in recalcitrant tissues. To substantiate the validity of the approaches, we describe the combination of methods used in the gene and enzyme discovery of a cytochrome P450 in the fragrant heartwood of tropical sandalwood, responsible for the fragrance defining, final step in the biosynthesis of (Z)-santalols. © 2016 Elsevier Inc. All rights reserved.
Active Interaction Mapping as a tool to elucidate hierarchical functions of biological processes.

Science.gov (United States)

Farré, Jean-Claude; Kramer, Michael; Ideker, Trey; Subramani, Suresh

2017-07-03

Increasingly, various 'omics data are contributing significantly to our understanding of novel biological processes, but it has not been possible to iteratively elucidate hierarchical functions in complex phenomena. We describe a general systems biology approach called Active Interaction Mapping (AI-MAP), which elucidates the hierarchy of functions for any biological process. Existing and new 'omics data sets can be iteratively added to create and improve hierarchical models which enhance our understanding of particular biological processes. The best datatypes to further improve an AI-MAP model are predicted computationally. We applied this approach to our understanding of general and selective autophagy, which are conserved in most eukaryotes, setting the stage for the broader application to other cellular processes of interest. In the particular application to autophagy-related processes, we uncovered and validated new autophagy and autophagy-related processes, expanded known autophagy processes with new components, integrated known non-autophagic processes with autophagy, and predicted other unexplored connections.
LGscore: A method to identify disease-related genes using biological literature and Google data.

Science.gov (United States)

Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

2015-04-01

Since the genome projects of the 1990s, many gene-related studies have been conducted, and researchers have confirmed that genes are involved in disease. For this reason, identifying the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, we first construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.
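The co-occurrence and Z-scoring step described in this abstract can be illustrated with a minimal sketch. It covers only the literature frequency value, not the Google search value, and it is not the authors' implementation: the function names, the toy corpus, and the choice of a population standard deviation are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations
from statistics import mean, pstdev


def cooccurrence_counts(abstract_genes):
    """Count, for every unordered gene pair, the abstracts that mention both."""
    counts = Counter()
    for genes in abstract_genes:
        for pair in combinations(sorted(set(genes)), 2):
            counts[pair] += 1
    return counts


def z_scores(counts):
    """Standardize raw counts: z = (count - mean) / population std. dev."""
    values = list(counts.values())
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        # All pairs co-occur equally often, so no pair stands out.
        return {pair: 0.0 for pair in counts}
    return {pair: (c - mu) / sigma for pair, c in counts.items()}


# Hypothetical toy corpus: each entry is the set of genes named in one abstract.
abstracts = [
    {"APP", "PSEN1", "APOE"},
    {"APP", "PSEN1"},
    {"APP", "APOE"},
    {"PSEN1", "MAPT"},
]
counts = cooccurrence_counts(abstracts)
weights = z_scores(counts)  # edge weights of the gene network
```

Pairs that co-occur more often than average receive positive weights, which is the property a downstream network analysis would rely on when ranking genes.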
An Application of Matrix Multiplication

Indian Academy of Sciences (India)

IAS Admin

intelligence, image processing, mathematical modeling, optimization techniques, environmental science, mathematical linguistics, graph theory applications to biological networks, social networks, electrical engineering. .... diagonal of A are 0; and if G has no multiple edges, then all the entries of A are either 1 or 0.
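The adjacency-matrix properties quoted in this excerpt can be checked on a small example. The sketch below assumes a simple undirected graph (no loops, no multiple edges) and uses a hypothetical four-vertex edge list. That entries of A·A count walks of length two is a standard graph-theory fact and not necessarily the application the article itself develops.

```python
def adjacency_matrix(n, edges):
    """Build the 0/1 adjacency matrix of a simple undirected graph on n vertices."""
    A = [[0] * n for _ in range(n)]
    for u, v in edges:
        if u == v:
            raise ValueError("loops are not allowed in a simple graph")
        A[u][v] = 1
        A[v][u] = 1
    return A


def matmul(X, Y):
    """Multiply two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Hypothetical graph: a triangle 0-1-2 with a pendant vertex 3 attached to 2.
edges = [(0, 1), (1, 2), (2, 0), (2, 3)]
A = adjacency_matrix(4, edges)
A2 = matmul(A, A)  # A2[i][j] = number of walks of length 2 from i to j
degrees = [A2[i][i] for i in range(4)]  # the diagonal of A^2 gives vertex degrees
```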

Modalities of gene action predicted by the classical evolutionary biological theory of aging.

Science.gov (United States)

Martin, George M

2007-04-01

What might now be referred to as the "classical" evolutionary biological theory of why we age has had a number of serious challenges in recent years. While the theory might therefore have to be modified under certain circumstances, in the author's opinion, it still provides the soundest theoretical basis for thinking about how we age. Nine modalities of gene action that have the potential to modulate processes of aging are reviewed, including the two most widely reviewed and accepted concepts ("antagonistic pleiotropy" and "mutation accumulation"). While several of these nine mechanisms can be regarded as derivatives of the antagonistic pleiotropic concept, they frame more specific questions for future research. Such research should pursue what appears to be the dominant factor in the determination of intraspecific variations in longevity: stochastic mechanisms, most likely based upon epigenetics. This contrasts with the dominant factor in the determination of interspecific variations in longevity: the constitutional genome, most likely based upon variations in regulatory loci.
Use of Graph Database for the Integration of Heterogeneous Biological Data.

Science.gov (United States)

Yoon, Byoung-Ha; Kim, Seon-Kyu; Kim, Seon-Young

2017-03-01

Understanding complex relationships among heterogeneous biological data is one of the fundamental goals in biology. In most cases, diverse biological data are stored in relational databases, such as MySQL and Oracle, which store data in multiple tables and then infer relationships by multiple-join statements. Recently, a new type of database, called the graph-based database, was developed to natively represent various kinds of complex relationships, and it is widely used among computer science communities and IT industries. Here, we demonstrate the feasibility of using a graph-based database for complex biological relationships by comparing the performance between MySQL and Neo4j, one of the most widely used graph databases. We collected various biological data (protein-protein interaction, drug-target, gene-disease, etc.) from several existing sources, removed duplicate and redundant data, and finally constructed a graph database containing 114,550 nodes and 82,674,321 relationships. When we tested the query execution performance of MySQL versus Neo4j, we found that Neo4j outperformed MySQL in all cases. While Neo4j exhibited a very fast response for various queries, MySQL exhibited latent or unfinished responses for complex queries with multiple-join statements. These results show that using graph-based databases, such as Neo4j, is an efficient way to store complex biological relationships. Moreover, querying a graph database in diverse ways has the potential to reveal novel relationships among heterogeneous biological data.
Deciphering cancer heterogeneity: the biological space

Directory of Open Access Journals (Sweden)

Stephanie Roessler

2014-04-01

Most lethal solid tumors including hepatocellular carcinoma (HCC) are considered incurable due to extensive heterogeneity in clinical presentation and tumor biology. Tumor heterogeneity may result from different cells of origin, patient ethnicity, etiology, underlying disease and diversity of genomic and epigenomic changes which drive tumor development. Cancer genomic heterogeneity thereby impedes treatment options and poses a significant challenge to cancer management. Studies of the HCC genome have revealed that although various genomic signatures identified in different HCC subgroups share a common prognosis, each carries unique molecular changes which are linked to different sets of cancer hallmarks whose misregulation has been proposed by Hanahan and Weinberg to be essential for tumorigenesis. We hypothesize that these specific sets of cancer hallmarks collectively occupy different tumor biological space representing the misregulation of different biological processes. In principle, a combination of different cancer hallmarks can result in new convergent molecular networks that are unique to each tumor subgroup and represent ideal druggable targets. Due to the ability of the tumor to adapt to external factors such as treatment or changes in the tumor microenvironment, the tumor biological space is elastic. Our ability to identify distinct groups of cancer patients with similar tumor biology who are most likely to respond to a specific therapy would have a significant impact on improving patient outcome. It is currently a challenge to identify a particular hallmark or a newly emerged convergent molecular network for a particular tumor. Thus, it is anticipated that the integration of multiple levels of data such as genomic mutations, somatic copy number aberration, gene expression, proteomics, and metabolomics, may help us grasp the tumor biological space occupied by each individual, leading to improved therapeutic intervention and outcome.
Pathway reconstruction of airway remodeling in chronic lung diseases: a systems biology approach.

Directory of Open Access Journals (Sweden)

Ali Najafi

Airway remodeling is a pathophysiologic process at the clinical, cellular, and molecular level relating to chronic obstructive airway diseases such as chronic obstructive pulmonary disease (COPD), asthma and mustard lung. These diseases are associated with the dysregulation of multiple molecular pathways in the airway cells. Little progress has so far been made in discovering the molecular causes of complex disease in a holistic systems manner. Therefore, pathway and network reconstruction is an essential part of a systems biology approach to solve this challenging problem. In this paper, multiple data sources were used to construct the molecular process of airway remodeling pathway in mustard lung as a model of airway disease. We first compiled a master list of genes that change with airway remodeling in the mustard lung disease and then reconstructed the pathway by generating and merging the protein-protein interaction and the gene regulatory networks. Experimental observations and literature mining were used to identify and validate the master list. The outcome of this paper can provide valuable information about closely related chronic obstructive airway diseases which are of great importance for biologists and their future research. Reconstructing the airway remodeling interactome provides a starting point and reference for the future experimental study of mustard lung, and further analysis and development of these maps will be critical to understanding airway diseases in patients.
Industrial systems biology and its impact on synthetic biology of yeast cell factories.

Science.gov (United States)

Fletcher, Eugene; Krivoruchko, Anastasia; Nielsen, Jens

2016-06-01

Engineering industrial cell factories to effectively yield a desired product while dealing with industrially relevant stresses is usually the most challenging step in the development of industrial production of chemicals using microbial fermentation processes. Using synthetic biology tools, microbial cell factories such as Saccharomyces cerevisiae can be engineered to express synthetic pathways for the production of fuels, biopharmaceuticals, fragrances, and food flavors. However, directing fluxes through these synthetic pathways towards the desired product can be demanding due to complex regulation or poor gene expression. Systems biology, which applies computational tools and mathematical modeling to understand complex biological networks, can be used to guide synthetic biology design. Here, we present our perspective on how systems biology can impact synthetic biology towards the goal of developing improved yeast cell factories. Biotechnol. Bioeng. 2016;113: 1164-1170. © 2015 Wiley Periodicals, Inc.
Multiple-predators-based capture process on complex networks

International Nuclear Information System (INIS)

Sharafat, Rajput Ramiz; Pu Cunlai; Li Jie; Chen Rongbin; Xu Zhongqi

2017-01-01

The predator/prey (capture) problem is a prototype of many network-related applications. We study the capture process on complex networks by considering multiple predators from multiple sources. In our model, several lions start from multiple sources simultaneously and try to capture the lamb by biased random walks, which are controlled by a free parameter α. We derive the distribution of the lamb's lifetime and the expected lifetime ⟨T⟩. Through simulation, we find that the expected lifetime drops substantially as the number of lions increases. Moreover, we study how the underlying topological structure affects the capture process, and find that placing the lamb on a small-degree node prolongs its lifetime more than placing it on a large-degree node. Dense or homogeneous network structures work against the survival of the lamb. We also discuss how to improve the capture efficiency in our model.
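A small simulation can make the capture process concrete. The sketch below rests on several assumptions that the abstract does not confirm: the lamb stays on one node, each lion moves to a neighbor j with probability proportional to (degree of j)^α, and the toy graph and function names are hypothetical. It estimates the expected lifetime by averaging over repeated runs.

```python
import random


def biased_step(graph, node, alpha, rng):
    """Move to a neighbor chosen with probability proportional to degree**alpha."""
    neighbors = graph[node]
    weights = [len(graph[j]) ** alpha for j in neighbors]
    return rng.choices(neighbors, weights=weights, k=1)[0]


def lamb_lifetime(graph, lion_starts, lamb, alpha, rng, max_steps=10_000):
    """Number of steps until any lion first lands on the lamb's node."""
    lions = list(lion_starts)
    if lamb in lions:
        return 0
    for t in range(1, max_steps + 1):
        lions = [biased_step(graph, v, alpha, rng) for v in lions]
        if lamb in lions:
            return t
    return max_steps  # cap reached; treated as a censored lifetime


def mean_lifetime(graph, lion_starts, lamb, alpha, trials=2000, seed=0):
    """Monte Carlo estimate of the expected lifetime of the lamb."""
    rng = random.Random(seed)
    total = sum(lamb_lifetime(graph, lion_starts, lamb, alpha, rng)
                for _ in range(trials))
    return total / trials


# Hypothetical toy network: hub 0, ring 1-2-3-4, and a pendant node 5 on node 4.
graph = {
    0: [1, 2, 3, 4],
    1: [0, 2, 4],
    2: [0, 1, 3],
    3: [0, 2, 4],
    4: [0, 1, 3, 5],
    5: [4],
}
one_lion = mean_lifetime(graph, [0], lamb=5, alpha=0.0)
three_lions = mean_lifetime(graph, [0, 1, 2], lamb=5, alpha=0.0)
```

With α = 0 the walk is unbiased. Positive α steers lions toward hubs and negative α toward low-degree nodes, so sweeping α on a larger graph is one way to explore the capture-efficiency question the abstract raises.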
Signature pathways identified from gene expression profiles in the human uterine cervix before and after spontaneous term parturition

Science.gov (United States)

HASSAN, Sonia S.; ROMERO, Roberto; TARCA, Adi L.; DRAGHICI, Sorin; PINELES, Beth; BUGRIM, Andrej; KHALEK, Nahla; CAMACHO, Natalia; MITTAL, Pooja; YOON, Bo Hyun; ESPINOZA, Jimmy; KIM, Chong Jai; SOROKIN, Yoram; MALONE, John

2008-01-01

Objective: This study aimed to discover ‘signature pathways’ characterizing biological processes based on genes differentially expressed in the uterine cervix before and after spontaneous labor. Study Design: The cervical transcriptome was previously characterized from biopsies taken before and after term labor. Pathway analysis was used to study the differentially expressed genes based on two gene-to-pathway annotation databases (KEGG and Metacore™). Over-represented and highly impacted pathways and connectivity nodes were identified. Results: Fifty-two pathways in the Metacore™ database were significantly enriched in differentially expressed genes. Three of the top 5 pathways were known to be involved in cervical remodeling. Two novel pathways were: plasmin signaling and plasminogen activator urokinase (PLAU) signaling. The same analysis in the KEGG database identified 4 significant pathways, of which impact analysis confirmed. Multiple nodes providing connectivity within the plasmin and PLAU signaling pathways were identified. Conclusions: Three strategies for pathway analysis were consistent in their identification of novel, unexpected as well as expected networks, suggesting that this approach is both valid and effective for the elucidation of biological mechanisms involved in cervical dilation and remodeling. PMID:17826407
When Is Hub Gene Selection Better than Standard Meta-Analysis?

Science.gov (United States)

Langfelder, Peter; Mischel, Paul S.; Horvath, Steve

2013-01-01

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to
When is hub gene selection better than standard meta-analysis?

Directory of Open Access Journals (Sweden)

Peter Langfelder

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques
Efficient Adoption and Assessment of Multiple Process Improvement Reference Models

Directory of Open Access Journals (Sweden)

Simona Jeners

2013-06-01

A variety of reference models such as CMMI, COBIT or ITIL help IT organizations improve their processes. These process improvement reference models (IRMs) cover different domains such as IT development, IT Services or IT Governance but also share some similarities. As there are organizations that address multiple domains and need to coordinate their processes in their improvement, we present MoSaIC, an approach that supports organizations in efficiently adopting and conforming to multiple IRMs. Our solution realizes a semantic integration of IRMs based on common meta-models. The resulting IRM integration model enables organizations to efficiently implement and assess multiple IRMs and to benefit from synergy effects.
The Reconstruction and Analysis of Gene Regulatory Networks.

Science.gov (United States)

Zheng, Guangyong; Huang, Tao

2018-01-01

In the post-genomic era, an important task is to explore the function of individual biological molecules (i.e., gene, noncoding RNA, protein, metabolite) and their organization in living cells. To this end, gene regulatory networks (GRNs) are constructed to show the relationships among biological molecules: the vertices of the network denote biological molecules and the edges represent connections between them (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). By interpreting GRNs, biologists can understand not only the function of biological molecules but also the organization of the components of living cells, since a gene regulatory network is a comprehensive physiological map of living cells and reflects the influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we review inference methods for GRN reconstruction and approaches for analyzing network structure. Because the network method is a powerful tool for studying complex diseases and biological processes, its applications in pathway analysis and disease gene identification are also introduced.
Seven gene deletions in seven days

DEFF Research Database (Denmark)

Ingemann Jensen, Sheila; Lennen, Rebecca; Herrgard, Markus

2015-01-01

Generation of multiple genomic alterations is currently a time consuming process. Here, a method was established that enables highly efficient and simultaneous deletion of multiple genes in Escherichia coli. A temperature sensitive plasmid containing arabinose inducible lambda Red recombineering ...
Modeling stochasticity and robustness in gene regulatory networks.

Science.gov (United States)

Garg, Abhishek; Mohanram, Kartik; Di Cara, Alessandro; De Micheli, Giovanni; Xenarios, Ioannis

2009-06-15

Understanding gene regulation in biological processes and modeling the robustness of underlying regulatory networks is an important problem that is currently being addressed by computational systems biologists. Lately, there has been a renewed interest in Boolean modeling techniques for gene regulatory networks (GRNs). However, due to their deterministic nature, it is often difficult to identify whether these modeling approaches are robust to the addition of stochastic noise that is widespread in gene regulatory processes. Stochasticity in Boolean models of GRNs has been addressed relatively sparingly in the past, mainly by flipping the expression of genes between different expression levels with a predefined probability. This stochasticity in nodes (SIN) model leads to over representation of noise in GRNs and hence non-correspondence with biological observations. In this article, we introduce the stochasticity in functions (SIF) model for simulating stochasticity in Boolean models of GRNs. By providing biological motivation behind the use of the SIF model and applying it to the T-helper and T-cell activation networks, we show that the SIF model provides more biologically robust results than the existing SIN model of stochasticity in GRNs. Algorithms are made available under our Boolean modeling toolbox, GenYsis. The software binaries can be downloaded from http://si2.epfl.ch/~garg/genysis.html.
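The stochasticity in nodes (SIN) scheme summarized above, in which each gene's state is flipped with a predefined probability, can be sketched as follows. This is an illustration only: the three-gene network, the synchronous update rule, and all names are hypothetical, and it does not reproduce GenYsis. The SIF model is not sketched because the abstract does not describe its mechanics.

```python
import random


def sin_step(state, rules, flip_prob, rng):
    """Apply one synchronous Boolean update, then flip each node with flip_prob."""
    updated = {gene: rule(state) for gene, rule in rules.items()}
    return {gene: (not value) if rng.random() < flip_prob else value
            for gene, value in updated.items()}


def attractor_occupancy(rules, start, flip_prob, steps=1000, seed=0):
    """Fraction of steps in which the network sits in its starting state."""
    rng = random.Random(seed)
    state, hits = dict(start), 0
    for _ in range(steps):
        state = sin_step(state, rules, flip_prob, rng)
        hits += state == start
    return hits / steps


# Hypothetical toggle-like network: A and B repress each other, C needs A and not B.
rules = {
    "A": lambda s: not s["B"],
    "B": lambda s: not s["A"],
    "C": lambda s: s["A"] and not s["B"],
}
fixed_point = {"A": True, "B": False, "C": True}  # steady state without noise

noise_free = attractor_occupancy(rules, fixed_point, flip_prob=0.0)
noisy = attractor_occupancy(rules, fixed_point, flip_prob=0.2)
```

Comparing the occupancy at several flip probabilities gives a rough measure of how strongly node-level noise perturbs a steady state in this kind of model.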
iBiology: communicating the process of science.

Science.gov (United States)

Goodwin, Sarah S

2014-08-01

The Internet hosts an abundance of science video resources aimed at communicating scientific knowledge, including webinars, massive open online courses, and TED talks. Although these videos are efficient at disseminating information for diverse types of users, they often do not demonstrate the process of doing science, the excitement of scientific discovery, or how new scientific knowledge is developed. iBiology (www.ibiology.org), a project that creates open-access science videos about biology research and science-related topics, seeks to fill this need by producing videos by science leaders that make their ideas, stories, and experiences available to anyone with an Internet connection. © 2014 Goodwin. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Queueing-Based Synchronization and Entrainment for Synthetic Gene Oscillators

Science.gov (United States)

Mather, William; Butzin, Nicholas; Hochendoner, Philip; Ogle, Curtis

Synthetic gene oscillators have been a major focus of synthetic biology research since the beginning of the field 15 years ago. They have proven to be useful both for biotechnological applications as well as a testing ground to significantly develop our understanding of the design principles behind synthetic and native gene oscillators. In particular, the principles governing synchronization and entrainment of biological oscillators have been explored using a synthetic biology approach. Our work combines experimental and theoretical approaches to specifically investigate how a bottleneck for protein degradation, which is present in most if not all existing synthetic oscillators, can be leveraged to robustly synchronize and entrain biological oscillators. We use both the terminology and mathematical tools of queueing theory to intuitively explain the role of this bottleneck in both synchronization and entrainment, which extends prior work demonstrating the usefulness of queueing theory in synthetic and native gene circuits. We conclude with an investigation of how synchronization and entrainment may be sensitive to the presence of multiple proteolytic pathways in a cell that couple weakly through crosstalk. This work was supported by NSF Grant #1330180.
New genes as drivers of phenotypic evolution

Science.gov (United States)

Chen, Sidi; Krinsky, Benjamin H.; Long, Manyuan

2014-01-01

During the course of evolution, genomes acquire novel genetic elements as sources of functional and phenotypic diversity, including new genes that originated in recent evolution. In the past few years, substantial progress has been made in understanding the evolution and phenotypic effects of new genes. In particular, an emerging picture is that new genes, despite being present in the genomes of only a subset of species, can rapidly evolve indispensable roles in fundamental biological processes, including development, reproduction, brain function and behaviour. The molecular underpinnings of how new genes can develop these roles are starting to be characterized. These recent discoveries yield fresh insights into our broad understanding of biological diversity at refined resolution. PMID:23949544
Heat transfer and fluid flow in biological processes advances and applications

CERN Document Server

Becker, Sid

2015-01-01

Heat Transfer and Fluid Flow in Biological Processes covers emerging areas in fluid flow and heat transfer relevant to biosystems and medical technology. This book uses an interdisciplinary approach to provide a comprehensive perspective on biofluid mechanics and heat transfer advances and includes reviews of the most recent methods in modeling of flows in biological media, such as CFD. Written by internationally recognized researchers in the field, each chapter provides a strong introductory section that is useful to both readers currently in the field and readers interested in learning more about these areas. Heat Transfer and Fluid Flow in Biological Processes is an indispensable reference for professors, graduate students, professionals, and clinical researchers in the fields of biology, biomedical engineering, chemistry and medicine working on applications of fluid flow, heat transfer, and transport phenomena in biomedical technology. Provides a wide range of biological and clinical applications of fluid...
Prevalent Role of Gene Features in Determining Evolutionary Fates of Whole-Genome Duplication Duplicated Genes in Flowering Plants

Science.gov (United States)

Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi

2013-01-01

The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs. PMID:23396833
Analyzing Multiple-Probe Microarray: Estimation and Application of Gene Expression Indexes

KAUST Repository

Maadooliat, Mehdi

2012-07-26

Gene expression index estimation is an essential step in analyzing multiple probe microarray data. Various modeling methods have been proposed in this area. Among these, a popular method proposed in Li and Wong (2001) is based on a multiplicative model, which is similar to the additive model discussed in Irizarry et al. (2003a) at the logarithm scale. Along this line, Hu et al. (2006) proposed data transformation to improve expression index estimation based on an ad hoc entropy criterion and a naive grid search approach. In this work, we re-examined this problem using a new profile likelihood-based transformation estimation approach that is more statistically elegant and computationally efficient. We demonstrate the applicability of the proposed method using a benchmark Affymetrix U95A spiked-in experiment. Moreover, we introduced a new multivariate expression index and used the empirical study to show its promise in terms of improving model fitting and power of detecting differential expression over the commonly used univariate expression index. As the other important content of the work, we discussed two generally encountered practical issues in application of gene expression index: normalization and summary statistic used for detecting differential expression. Our empirical study shows somewhat different findings from the MAQC project (MAQC, 2006).
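
The multiplicative model mentioned above can be illustrated with a small sketch. It assumes the commonly cited form y[i][j] ≈ theta[i] * phi[j] (expression index of array i times affinity of probe j), with the scale fixed by requiring the squared probe affinities to sum to the number of probes. On the logarithm scale the product becomes a sum, log theta[i] + log phi[j], which is why the model resembles an additive one there. The function name `fit_multiplicative_model` and the alternating least-squares fit are illustrative assumptions, not the profile likelihood method of the abstract.

```python
import math


def fit_multiplicative_model(y, n_iter=50):
    """Fit y[i][j] ~ theta[i] * phi[j] by alternating least squares.

    Rows of y are arrays (samples), columns are probes. After every sweep
    phi is rescaled so that sum(phi**2) equals the number of probes, which
    removes the arbitrary scale shared between theta and phi.
    Hypothetical helper for illustration only.
    """
    n_arrays, n_probes = len(y), len(y[0])
    phi = [1.0] * n_probes
    theta = [0.0] * n_arrays
    for _ in range(n_iter):
        # Least-squares update of each array's expression index given phi.
        phi_sq = sum(p * p for p in phi)
        theta = [
            sum(y[i][j] * phi[j] for j in range(n_probes)) / phi_sq
            for i in range(n_arrays)
        ]
        # Least-squares update of each probe's affinity given theta.
        theta_sq = sum(t * t for t in theta)
        phi = [
            sum(y[i][j] * theta[i] for i in range(n_arrays)) / theta_sq
            for j in range(n_probes)
        ]
        # Enforce the constraint sum(phi**2) == n_probes and compensate in
        # theta so that the fitted products theta[i] * phi[j] are unchanged.
        scale = math.sqrt(n_probes / sum(p * p for p in phi))
        phi = [p * scale for p in phi]
        theta = [t / scale for t in theta]
    return theta, phi
```

On noise-free data generated from known theta and phi that satisfy the constraint, the fit should return those values; with real probe intensities it only gives a least-squares approximation.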

Fine tuning of RFX/DAF-19-regulated target gene expression through binding to multiple sites in Caenorhabditis elegans

OpenAIRE

Chu, Jeffery S. C.; Tarailo-Graovac, Maja; Zhang, Di; Wang, Jun; Uyar, Bora; Tu, Domena; Trinh, Joanne; Baillie, David L.; Chen, Nansheng

2011-01-01

In humans, mutations of a growing list of regulatory factor X (RFX) target genes have been associated with devastating genetic disease conditions including ciliopathies. However, mechanisms underlying RFX transcription factors (TFs)-mediated gene expression regulation, especially differential gene expression regulation, are largely unknown. In this study, we explore the functional significance of the co-existence of multiple X-box motifs in regulating differential gene expression in Caenorha...
Phase I metabolic genes and risk of lung cancer: multiple polymorphisms and mRNA expression.

Directory of Open Access Journals (Sweden)

Melissa Rotunno

2009-05-01

Polymorphisms in genes coding for enzymes that activate tobacco lung carcinogens may generate inter-individual differences in lung cancer risk. Previous studies had limited sample sizes, poor exposure characterization, and a few single nucleotide polymorphisms (SNPs) tested in candidate genes. We analyzed 25 SNPs (some previously untested) in 2101 primary lung cancer cases and 2120 population controls from the Environment And Genetics in Lung cancer Etiology (EAGLE) study from six phase I metabolic genes, including cytochrome P450s, microsomal epoxide hydrolase, and myeloperoxidase. We evaluated the main genotype effects and genotype-smoking interactions in lung cancer risk overall and in the major histology subtypes. We tested the combined effect of multiple SNPs on lung cancer risk and on gene expression. Findings were prioritized based on significance thresholds and consistency across different analyses, and accounted for multiple testing and prior knowledge. Two haplotypes in EPHX1 were significantly associated with lung cancer risk in the overall population. In addition, CYP1B1 and CYP2A6 polymorphisms were inversely associated with adenocarcinoma and squamous cell carcinoma risk, respectively. Moreover, the association between CYP1A1 rs2606345 genotype and lung cancer was significantly modified by intensity of cigarette smoking, suggesting an underlying dose-response mechanism. Finally, increasing number of variants at CYP1A1/A2 genes revealed significant protection in never smokers and risk in ever smokers. Results were supported by differential gene expression in non-tumor lung tissue samples with down-regulation of CYP1A1 in never smokers and up-regulation in smokers from CYP1A1/A2 SNPs. The significant haplotype associations emphasize that the effect of multiple SNPs may be important despite null single SNP-associations, and warrants consideration in genome-wide association studies (GWAS). Our findings emphasize the necessity of post
PCA-based bootstrap confidence interval tests for gene-disease association involving multiple SNPs

Directory of Open Access Journals (Sweden)

Xue Fuzhong

2010-01-01

Abstract Background Genetic association study is currently the primary vehicle for identification and characterization of disease-predisposing variant(s), which usually involves multiple available single-nucleotide polymorphisms (SNPs). However, SNP-wise association tests raise concerns over multiple testing. Haplotype-based methods have the advantage of being able to account for correlations between neighbouring SNPs, yet assuming Hardy-Weinberg equilibrium (HWE) and a potentially large number of degrees of freedom can harm their statistical power and robustness. Approaches based on principal component analysis (PCA) are preferable in this regard but their performance varies with methods of extracting principal components (PCs). Results PCA-based bootstrap confidence interval test (PCA-BCIT), which directly uses the PC scores to assess gene-disease association, was developed and evaluated for three ways of extracting PCs, i.e., cases only (CAES), controls only (COES) and cases and controls combined (CES). Extraction of PCs with COES is preferred to that with CAES and CES. Performance of the test was examined via simulations as well as analyses of rheumatoid arthritis and heroin addiction data; the test maintained the nominal level under the null hypothesis and showed performance comparable with the permutation test. Conclusions PCA-BCIT is a valid and powerful method for assessing gene-disease association involving multiple SNPs.
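
The general idea of a PCA-based bootstrap confidence interval test can be sketched as follows. The abstract does not give the exact test statistic, so this sketch makes its own assumptions: it extracts only the first principal axis from the controls (the COES strategy), projects cases and controls onto it, and bootstraps the difference in mean PC scores; an interval that excludes zero is read as evidence of association. All function names (`first_pc`, `pc_scores`, `bootstrap_ci_test`) are hypothetical, and the percentile interval is a stand-in rather than the published PCA-BCIT procedure.

```python
import random


def first_pc(rows, n_iter=200):
    """Return (column means, leading principal axis) via power iteration.

    Assumes the all-ones start vector is not orthogonal to the leading axis.
    """
    n, m = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(m)]
    centered = [[r[j] - means[j] for j in range(m)] for r in rows]
    cov = [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(m)]
        for a in range(m)
    ]
    v = [1.0] * m
    for _ in range(n_iter):
        w = [sum(cov[a][b] * v[b] for b in range(m)) for a in range(m)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return means, v


def pc_scores(rows, means, axis):
    """Project each row onto the principal axis after centering."""
    return [
        sum((r[j] - means[j]) * axis[j] for j in range(len(axis)))
        for r in rows
    ]


def bootstrap_ci_test(cases, controls, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the case-control difference in mean PC1 score.

    Returns (lower, upper, significant), where significant means the
    interval excludes zero. Illustrative statistic, not the published one.
    """
    rng = random.Random(seed)
    means, axis = first_pc(controls)  # controls-only extraction ("COES")
    case_s = pc_scores(cases, means, axis)
    ctrl_s = pc_scores(controls, means, axis)
    diffs = []
    for _ in range(n_boot):
        cs = rng.choices(case_s, k=len(case_s))  # resample with replacement
        ct = rng.choices(ctrl_s, k=len(ctrl_s))
        diffs.append(sum(cs) / len(cs) - sum(ct) / len(ct))
    diffs.sort()
    lower = diffs[int((alpha / 2) * n_boot)]
    upper = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper, not (lower <= 0.0 <= upper)
```

Each row passed to `bootstrap_ci_test` would be one individual's genotype vector across the SNPs of a gene (for example coded 0, 1 or 2 copies of the minor allele, which is also an assumption here).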
Phenol wastewater remediation: advanced oxidation processes coupled to a biological treatment.

Science.gov (United States)

Rubalcaba, A; Suárez-Ojeda, M E; Stüber, F; Fortuny, A; Bengoa, C; Metcalfe, I; Font, J; Carrera, J; Fabregat, A

2007-01-01

Nowadays, there are increasingly stringent regulations requiring more and more treatment of industrial effluents to generate product waters which could be easily reused or disposed of to the environment without any harmful effects. Therefore, different advanced oxidation processes were investigated as suitable precursors for the biological treatment of industrial effluents containing phenol. Wet air oxidation and Fenton process were tested batch wise, while catalytic wet air oxidation and H2O2-promoted catalytic wet air oxidation processes were studied in a trickle bed reactor, the last two using activated carbon as catalyst. Effluent characterisation was made by means of substrate conversion (using high performance liquid chromatography), chemical oxygen demand and total organic carbon. Biodegradation parameters (i.e. maximum oxygen uptake rate and oxygen consumption) were obtained from respirometric tests using activated sludge from an urban biological wastewater treatment plant (WWTP). The main goal was to find the proper conditions in terms of biodegradability enhancement, so that these phenolic effluents could be successfully treated in an urban biological WWTP. Results show promising research ways for the development of efficient coupled processes for the treatment of wastewater containing toxic or biologically non-degradable compounds.
GeneNotes – A novel information management software for biologists

Directory of Open Access Journals (Sweden)

Wong Wing H

2005-02-01

Abstract Background Collecting and managing information is a challenging task in a genome-wide profiling research project. Most databases and online computational tools require a direct human involvement. Information and computational results are presented in various multimedia formats (e.g., text, image, PDF, word files, etc.), many of which cannot be automatically processed by computers in biologically meaningful ways. In addition, the quality of computational results is far from perfect and requires nontrivial manual examination. The timely selection, integration and interpretation of heterogeneous biological information still heavily rely on the sensibility of biologists. Biologists often feel overwhelmed by the huge amount of and the great diversity of distributed heterogeneous biological information. Description We developed an information management application called GeneNotes. GeneNotes is the first application that allows users to collect and manage multimedia biological information about genes/ESTs. GeneNotes provides an integrated environment for users to surf the Internet, collect notes for genes/ESTs, and retrieve notes. GeneNotes is supported by a server that integrates gene annotations from many major databases (e.g., HGNC, MGI, etc.). GeneNotes uses the integrated gene annotations to (a) identify genes given various types of gene IDs (e.g., RefSeq ID, GenBank ID, etc.), and (b) provide quick views of genes. GeneNotes is free for academic usage. The program and the tutorials are available at: http://bayes.fas.harvard.edu/genenotes/. Conclusions GeneNotes provides a novel human-computer interface to assist researchers to collect and manage biological information. It also provides a platform for studying how users behave when they manipulate biological information. The results of such study can lead to innovation of more intelligent human-computer interfaces that greatly shorten the cycle of biology research.
Discovery of possible gene relationships through the application of self-organizing maps to DNA microarray databases.

Science.gov (United States)

Chavez-Alvarez, Rocio; Chavoya, Arturo; Mendez-Vazquez, Andres

2014-01-01

DNA microarrays and cell cycle synchronization experiments have made possible the study of the mechanisms of cell cycle regulation of Saccharomyces cerevisiae by simultaneously monitoring the expression levels of thousands of genes at specific time points. On the other hand, pattern recognition techniques can contribute to the analysis of such massive measurements, providing a model of gene expression level evolution through the cell cycle process. In this paper, we propose the use of one such technique, an unsupervised artificial neural network called a Self-Organizing Map (SOM), which has been successfully applied to processes involving very noisy signals, classifying and organizing them, and assisting in the discovery of behavior patterns without requiring prior knowledge about the process under analysis. As a test bed for the use of SOMs in finding possible relationships among genes and their possible contribution in some biological processes, we selected 282 S. cerevisiae genes that have been shown through biological experiments to have an activity during the cell cycle. The expression level of these genes was analyzed in five of the most cited time series DNA microarray databases used in the study of the cell cycle of this organism. With the use of SOM, it was possible to find clusters of genes with similar behavior in the five databases along two cell cycles. This result suggested that some of these genes might be biologically related or might have a regulatory relationship, as was corroborated by comparing some of the clusters obtained with SOMs against a previously reported regulatory network that was generated using biological knowledge, such as protein-protein interactions, gene expression levels, metabolism dynamics, promoter binding, and modification, regulation and transport of proteins. The methodology described in this paper could be applied to the study of gene relationships of other biological processes in different organisms.
Adaption of Ulva pertusa to multiple-contamination of heavy metals and nutrients: Biological mechanism of outbreak of Ulva sp. green tide.

Science.gov (United States)

Ge, Changzi; Yu, Xiru; Kan, Manman; Qu, Chunfeng

2017-12-15

The multiple-contamination of heavy metals and nutrients is steadily worsening, and Ulva sp. green tide occurs almost simultaneously. To reveal the biological mechanism for outbreak of the green tide, Ulva pertusa was exposed to seven-day-multiple-contamination. The relation between pH variation (V_pH), Chl a content, ratio of (Chl a content)/(Chl b content) (R_chla/chlb), SOD activity of U. pertusa (A_SOD) and contamination concentration is [Formula: see text] (pcontamination concentrations of seawaters where Ulva sp. green tide occurred and the contamination concentrations set in the present work, U. pertusa can adapt to multiple-contaminations in these waters. Thus, the adaption to multiple-contamination may be one biological mechanism for the outbreak of Ulva sp. green tide. Copyright © 2017 Elsevier Ltd. All rights reserved.
Multiple BiP genes of Arabidopsis thaliana are required for male gametogenesis and pollen competitiveness.

Science.gov (United States)

Maruyama, Daisuke; Sugiyama, Tomoyuki; Endo, Toshiya; Nishikawa, Shuh-Ichi

2014-04-01

Immunoglobulin-binding protein (BiP) is a molecular chaperone of the heat shock protein 70 (Hsp70) family. BiP is localized in the endoplasmic reticulum (ER) and plays key roles in protein translocation, protein folding and quality control in the ER. The genomes of flowering plants contain multiple BiP genes. Arabidopsis thaliana has three BiP genes. BIP1 and BIP2 are ubiquitously expressed. BIP3 encodes a less well conserved BiP paralog, and it is expressed only under ER stress conditions in the majority of organs. Here, we report that all BiP genes are expressed and functional in pollen and pollen tubes. Although the bip1 bip2 double mutation does not affect pollen viability, the bip1 bip2 bip3 triple mutation is lethal in pollen. This result indicates that lethality of the bip1 bip2 double mutation is rescued by BiP3 expression. A decrease in the copy number of the ubiquitously expressed BiP genes correlates well with a decrease in pollen tube growth, which leads to reduced fitness of mutant pollen during fertilization. Because an increased protein secretion activity is expected to increase the protein folding demand in the ER, the multiple BiP genes probably cooperate with each other to ensure ER homeostasis in cells with active secretion such as rapidly growing pollen tubes.
Industrial scale gene synthesis.

Science.gov (United States)

Notka, Frank; Liss, Michael; Wagner, Ralf

2011-01-01

The most recent developments in the area of deep DNA sequencing and downstream quantitative and functional analysis are rapidly adding a new dimension to understanding biochemical pathways and metabolic interdependencies. These increasing insights pave the way to designing new strategies that address public needs, including environmental applications and therapeutic inventions, or novel cell factories for sustainable and reconcilable energy or chemicals sources. Adding yet another level is building upon nonnaturally occurring networks and pathways. Recent developments in synthetic biology have created economic and reliable options for designing and synthesizing genes, operons, and eventually complete genomes. Meanwhile, high-throughput design and synthesis of extremely comprehensive DNA sequences have evolved into an enabling technology already indispensable in various life science sectors today. Here, we describe the industrial perspective of modern gene synthesis and its relationship with synthetic biology. Gene synthesis contributed significantly to the emergence of synthetic biology by not only providing the genetic material in high quality and quantity but also enabling its assembly, according to engineering design principles, in a standardized format. Synthetic biology on the other hand, added the need for assembling complex circuits and large complexes, thus fostering the development of appropriate methods and expanding the scope of applications. Synthetic biology has also stimulated interdisciplinary collaboration as well as integration of the broader public by addressing socioeconomic, philosophical, ethical, political, and legal opportunities and concerns. The demand-driven technological achievements of gene synthesis and the implemented processes are exemplified by an industrial setting of large-scale gene synthesis, describing production from order to delivery. Copyright © 2011 Elsevier Inc. All rights reserved.
Cross-species comparison of biological themes and underlying genes on a global gene expression scale in a mouse model of colorectal liver metastasis and in clinical specimens

Directory of Open Access Journals (Sweden)

Schirmacher Peter

2008-09-01

Abstract Background Invasion-related genes over-expressed by tumor cells as well as by reacting host cells represent promising drug targets for anti-cancer therapy. Such candidate genes need to be validated in appropriate animal models. Results This study examined the suitability of a murine model (CT26/Balb/C) of colorectal liver metastasis to represent clinical liver metastasis specimens using a global gene expression approach. Cross-species similarity was examined between pure liver, liver invasion, tumor invasion and pure tumor compartments through overlap of up-regulated genes and gene ontology (GO)-based biological themes on the level of single GO-terms and of condensed GO-term families. Three out of four GO-term families were conserved in a compartment-specific way between the species: secondary metabolism (liver), invasion (invasion front), and immune response (invasion front and liver). Among the individual GO-terms over-represented in the invasion compartments in both species were "extracellular matrix", "cell motility", "cell adhesion" and "antigen presentation", indicating that typical invasion-related processes are operating in both species. This was reflected on the single gene level as well, as cross-species overlap of potential target genes over-expressed in the combined invasion front compartments reached up to 36.5%. Generally, histopathology and gene expression correlated well as the highest single gene overlap was found to be 44% in syn-compartmental comparisons (liver versus liver) whereas cross-compartmental overlaps were much lower (e.g. liver versus tumor: 9.7%). However, single gene overlap was surprisingly high in some cross-compartmental comparisons (e.g. human liver invasion compartment and murine tumor invasion compartment: 9.0%) despite little histopathologic similarity, indicating that invasion relevant genes are not necessarily confined to histologically defined compartments. Conclusion In summary, cross
Organic micropollutants in aerobic and anaerobic membrane bioreactors: Changes in microbial communities and gene expression

KAUST Repository

Harb, Moustapha; Wei, Chunhai; Wang, Nan; Amy, Gary L.; Hong, Pei-Ying

2016-07-09

Organic micro-pollutants (OMPs) are contaminants of emerging concern in wastewater treatment due to the risk of their proliferation into the environment, but their impact on the biological treatment process is not well understood. The purpose of this study is to examine the effects of the presence of OMPs on the core microbial populations of wastewater treatment. Two nanofiltration-coupled membrane bioreactors (aerobic and anaerobic) were subjected to the same operating conditions while treating synthetic municipal wastewater spiked with OMPs. Microbial community dynamics, gene expression levels, and antibiotic resistance genes were analyzed using molecular-based approaches. Results showed that presence of OMPs in the wastewater feed had a clear effect on keystone bacterial populations in both the aerobic and anaerobic sludge while also significantly impacting biodegradation-associated gene expression levels. Finally, multiple antibiotic-type OMPs were found to have higher removal rates in the anaerobic MBR, while associated antibiotic resistance genes were lower.
Peroxisome Proliferator-Activated Receptor Alpha Target Genes

Directory of Open Access Journals (Sweden)

Maryam Rakhshandehroo

2010-01-01

The peroxisome proliferator-activated receptor alpha (PPARα) is a ligand-activated transcription factor involved in the regulation of a variety of processes, ranging from inflammation and immunity to nutrient metabolism and energy homeostasis. PPARα serves as a molecular target for hypolipidemic fibrate drugs which bind the receptor with high affinity. Furthermore, PPARα binds and is activated by numerous fatty acids and fatty acid-derived compounds. PPARα governs biological processes by altering the expression of a large number of target genes. Accordingly, the specific role of PPARα is directly related to the biological function of its target genes. Here, we present an overview of the involvement of PPARα in lipid metabolism and other pathways through a detailed analysis of the different known or putative PPARα target genes. The emphasis is on gene regulation by PPARα in liver although many of the results likely apply to other organs and tissues as well.
Pathway analysis of gene signatures predicting metastasis of node-negative primary breast cancer

International Nuclear Information System (INIS)

Yu, Jack X; Sieuwerts, Anieta M; Zhang, Yi; Martens, John WM; Smid, Marcel; Klijn, Jan GM; Wang, Yixin; Foekens, John A

2007-01-01

Published prognostic gene signatures in breast cancer have few genes in common. Here we provide a rationale for this observation by studying the prognostic power and the underlying biological pathways of different gene signatures. Gene signatures to predict the development of metastases in estrogen receptor-positive and estrogen receptor-negative tumors were identified using 500 re-sampled training sets and mapping to Gene Ontology Biological Process to identify over-represented pathways. The Global Test program confirmed that gene expression profiles in the common pathways were associated with the metastasis of the patients. The apoptotic pathway and cell division, or cell growth regulation and G-protein coupled receptor signal transduction, were most significantly associated with the metastatic capability of estrogen receptor-positive or estrogen receptor-negative tumors, respectively. A gene signature derived from the common pathways predicted metastasis in an independent cohort. Mapping of the pathways represented by different published prognostic signatures showed that they share 53% of the identified pathways. We show that divergent gene sets classifying patients for the same clinical endpoint represent similar biological processes and that pathway-derived signatures can be used to predict prognosis. Furthermore, our study reveals that the underlying biology related to aggressiveness of estrogen receptor subgroups of breast cancer is quite different.
Clinical and biological features of multiple myeloma involving the gastrointestinal system.

Science.gov (United States)

Talamo, Giampaolo; Cavallo, Federica; Zangari, Maurizio; Barlogie, Bart; Lee, Choon-Kee; Pineda-Roman, Mauricio; Kiwan, Elias; Krishna, Somashekar; Tricot, Guido

2006-07-01

We report 24 cases of multiple myeloma (MM) with involvement of the gastrointestinal (GI) system. We found a strong association with high lactate dehydrogenase levels, plasmablastic morphology, and unfavorable karyotype. GI involvement at the time of initial diagnosis was much rarer than later in the course of the disease. The median survival after diagnosis of GI involvement was 7 months. Among 13 patients treated with stem cell transplantation, the response rate was 92%, and median progression-free survival was 4 months. We conclude that MM involving the GI system is associated with adverse biological features and with short-lasting remissions, even after high-dose chemotherapy.
Multiple electron processes of He and Ne by proton impact

Science.gov (United States)

Terekhin, Pavel Nikolaevich; Montenegro, Pablo; Quinto, Michele; Monti, Juan; Fojon, Omar; Rivarola, Roberto

2016-05-01

A detailed investigation of multiple electron processes (single and multiple ionization, single capture, transfer-ionization) of He and Ne is presented for proton impact at intermediate and high collision energies. Exclusive absolute cross sections for these processes have been obtained by calculation of transition probabilities in the independent electron and independent event models as a function of impact parameter in the framework of the continuum distorted wave-eikonal initial state theory. A binomial analysis is employed to calculate exclusive probabilities. The comparison with available theoretical and experimental results shows that exclusive probabilities are needed for a reliable description of the experimental data. The developed approach can be used for obtaining the input database for modeling multiple electron processes of charged particles passing through matter.
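
The binomial analysis mentioned above can be illustrated with a short sketch. In an independent electron picture, if each of N equivalent electrons undergoes a process with single-particle probability p(b) at impact parameter b, the exclusive probability that exactly q of them do so is the binomial term C(N, q) p^q (1 - p)^(N - q), and a cross section follows from integrating 2πb times that probability over b. The function names, the choice of N and the trapezoidal integration are assumptions made for illustration; the single-particle probabilities themselves would come from the continuum distorted wave-eikonal initial state calculation, which is not reproduced here.

```python
from math import comb, pi


def exclusive_probability(p, n_electrons, q):
    """Probability that exactly q of n_electrons undergo the process,
    given the same single-particle probability p for each electron."""
    return comb(n_electrons, q) * p ** q * (1.0 - p) ** (n_electrons - q)


def cross_section(prob_of_b, b_max, n_steps=2000):
    """Trapezoidal estimate of 2*pi * integral from 0 to b_max of b * P(b) db.

    prob_of_b is any callable returning a probability for impact parameter b;
    b_max must be chosen large enough that P(b) is negligible beyond it.
    """
    h = b_max / n_steps
    total = 0.0
    for k in range(n_steps + 1):
        b = k * h
        weight = 0.5 if k in (0, n_steps) else 1.0  # trapezoid end weights
        total += weight * b * prob_of_b(b)
    return 2.0 * pi * h * total
```

For example, `cross_section(lambda b: exclusive_probability(p_single(b), 2, 1), b_max)` would give a single-ionization cross section for a two-electron target, where `p_single` is a hypothetical user-supplied single-particle probability.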
Degradation alternatives for a commercial fungicide in water: biological, photo-Fenton, and coupled biological photo-Fenton processes.

Science.gov (United States)

López-Loveira, Elsa; Ariganello, Federico; Medina, María Sara; Centrón, Daniela; Candal, Roberto; Curutchet, Gustavo

2017-11-01

Imazalil (IMZ) is a widely used fungicide for the post-harvest treatment of citrus, classified as "likely to be carcinogenic in humans" by the EPA, that can be only partially removed by conventional biological treatment. Consequently, specific or combined processes should be applied to prevent its release to the environment. Biological treatment with adapted microorganism consortium, photo-Fenton, and coupled biological photo-Fenton processes were tested as alternatives for the purification of water containing high concentration of the fungicide and the coadjutants present in the commercial formulation. IMZ-resistant consortium with the capacity to degrade IMZ in the presence of a C-rich co-substrate was isolated from sludge coming from a fruit packaging company wastewater treatment plant. This consortium was adapted to resist and degrade the organics present in photo-Fenton-oxidized IMZ water solution. Bacteria colonies from the consortia were isolated and identified. The effect of H2O2 initial concentration and dosage on IMZ degradation rate, average oxidation state (AOS), organic acid concentration, oxidation, and mineralization percentage after photo-Fenton process was determined. The application of biological treatment to the oxidized solutions notably decreased the total organic carbon (TOC) in solution. The effect of the oxidation degree, limited by H2O2 concentration and dosage, on the percentage of mineralization obtained after the biological treatment was determined and explained in terms of changes in AOS. The concentration of H2O2 necessary to eliminate IMZ by photo-Fenton and to reduce TOC and chemical oxygen demand (COD) by biological treatment, in order to allow the release of the effluents to rivers with different flows, was estimated.
A new measure for functional similarity of gene products based on Gene Ontology

Directory of Open Access Journals (Sweden)

Lengauer Thomas

2006-06-01

Abstract Background Gene Ontology (GO) is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures: simRel and funSim. One measure (simRel) is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim) is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.
Metabolic adaptation of a human pathogen during chronic infections - a systems biology approach

DEFF Research Database (Denmark)

Thøgersen, Juliane Charlotte

modeling to uncover how human pathogens adapt to the human host. Pseudomonas aeruginosa infections in cystic fibrosis patients are used as a model system for understanding these adaptation processes. The exploratory systems biology approach facilitates identification of important phenotypes...... by classical molecular biology approaches where genes and reactions typically are investigated in a one to one relationship. This thesis is an example of how mathematical approaches and modeling can facilitate new biological understanding and provide new surprising ideas to important biological processes....... and metabolic pathways that are necessary or related to establishment of chronic infections. Archetypal analysis showed to be successful in extracting relevant phenotypes from global gene expression data. Furthermore, genome-scale metabolic modeling showed to be useful in connecting the genotype...
Stochastic switching in biology: from genotype to phenotype

International Nuclear Information System (INIS)

Bressloff, Paul C

2017-01-01

There has been a resurgence of interest in non-equilibrium stochastic processes in recent years, driven in part by the observation that the number of molecules (genes, mRNA, proteins) involved in gene expression are often of order 1–1000. This means that deterministic mass-action kinetics tends to break down, and one needs to take into account the discrete, stochastic nature of biochemical reactions. One of the major consequences of molecular noise is the occurrence of stochastic biological switching at both the genotypic and phenotypic levels. For example, individual gene regulatory networks can switch between graded and binary responses, exhibit translational/transcriptional bursting, and support metastability (noise-induced switching between states that are stable in the deterministic limit). If random switching persists at the phenotypic level then this can confer certain advantages to cell populations growing in a changing environment, as exemplified by bacterial persistence in response to antibiotics. Gene expression at the single-cell level can also be regulated by changes in cell density at the population level, a process known as quorum sensing. In contrast to noise-driven phenotypic switching, the switching mechanism in quorum sensing is stimulus-driven and thus noise tends to have a detrimental effect. A common approach to modeling stochastic gene expression is to assume a large but finite system and to approximate the discrete processes by continuous processes using a system-size expansion. However, there is a growing need to have some familiarity with the theory of stochastic processes that goes beyond the standard topics of chemical master equations, the system-size expansion, Langevin equations and the Fokker–Planck equation. Examples include stochastic hybrid systems (piecewise deterministic Markov processes), large deviations and the Wentzel–Kramers–Brillouin (WKB) method, adiabatic reductions, and queuing/renewal theory. The major aim of
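
The discrete, stochastic switching described above can be illustrated with a minimal sketch. It simulates a single gene that flips between an OFF and an ON state with exponentially distributed dwell times (a two-state Markov process simulated in the style of the Gillespie algorithm) and reports the fraction of time spent ON. The rate names `k_on` and `k_off`, the function name, and the reduction of the gene to two states are simplifying assumptions for illustration, not a model taken from the review.

```python
import random


def simulate_switching(k_on, k_off, t_end, seed=0):
    """Simulate OFF <-> ON switching of one gene up to time t_end.

    k_on is the OFF -> ON rate and k_off the ON -> OFF rate.
    Returns (fraction of time spent ON, number of switching events).
    """
    rng = random.Random(seed)
    t = 0.0
    state = 0  # 0 = OFF, 1 = ON
    time_on = 0.0
    n_switches = 0
    while True:
        rate = k_on if state == 0 else k_off
        dwell = rng.expovariate(rate)  # exponential waiting time in this state
        if t + dwell >= t_end:
            # The current state persists until the end of the simulation.
            if state == 1:
                time_on += t_end - t
            break
        if state == 1:
            time_on += dwell
        t += dwell
        state = 1 - state
        n_switches += 1
    return time_on / t_end, n_switches
```

For long simulations the ON fraction should settle near k_on / (k_on + k_off), the stationary occupancy of the two-state process; short runs fluctuate around that value, which is the kind of molecular noise the abstract refers to.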

Patterns of population differentiation of candidate genes for cardiovascular disease

Directory of Open Access Journals (Sweden)

Ding Keyue

2007-07-01

Full Text Available Abstract Background The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.
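For readers unfamiliar with FST, the following is a minimal sketch of how population differentiation at a single biallelic SNP can be computed from allele frequencies. The abstract does not say which FST estimator the authors used, so this uses the textbook heterozygosity-based definition, FST = (H_T - H_S) / H_T, as an assumption; the function name and the example frequencies are hypothetical and not taken from the study.

```python
def fst_biallelic(freqs, sizes=None):
    """Heterozygosity-based F_ST for one biallelic SNP.

    freqs: allele frequency of the same allele in each subpopulation.
    sizes: optional relative subpopulation sizes (equal weights if omitted).
    This is the textbook definition, not necessarily the study's estimator.
    """
    if sizes is None:
        sizes = [1.0] * len(freqs)
    total = sum(sizes)
    weights = [s / total for s in sizes]

    # Pooled allele frequency and expected heterozygosity of the total population
    p_bar = sum(w * p for w, p in zip(weights, freqs))
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    if h_t == 0.0:
        return 0.0  # monomorphic site: no differentiation to measure

    # Weighted mean expected heterozygosity within subpopulations
    h_s = sum(w * 2.0 * p * (1.0 - p) for w, p in zip(weights, freqs))
    return (h_t - h_s) / h_t


# Hypothetical frequencies for three populations
print(round(fst_biallelic([0.1, 0.5, 0.9]), 4))
```

Values near 0 indicate similar allele frequencies across populations, and values near 1 indicate nearly fixed differences.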
Biological and geochemical processes involved during denitrification in Callovo-Oxfordian clay

International Nuclear Information System (INIS)

Ollivier, P.; Parmentier, M.; Joulian, C.; Pauwels, H.; Albrecht, A.

2012-01-01

geochemical and biological variations observed in the experiments, biogeochemical modeling is carried out using the geochemical software PHREEQC. The present work builds upon two previous studies done at BRGM: the formulation of a COx pore water model and the creation of a kinetic biological denitrification model. Because of the large uncertainties in the estimation of biomass based on the classic optical microscopy method, the quantification of the NarG gene is used for biogeochemical modeling. To account for the observed presence of two nitrate reduction products, two sets of kinetic parameters are used to correctly represent experimental data: one in the early stage of experiments and another for the rest of experiments. Bacterial growth is modeled using acetate and nitrate as carbon and nitrogen sources. Calculated bacterial concentrations are in good agreement with NarG gene data. The calculated mass-balance indicates that about 40% of the carbon from acetate is used for anabolism and 60% for catabolism. Although some discrepancies are still present between modeled and experimental pH evolution, the model is able to reproduce important changes such as the decrease of dissolved calcium in the experiment with COx. This drop in Ca is explained by calcite precipitation and to a lesser extent by cation exchange. Experiments are still ongoing, and nitrate appears to be still decreasing. Further work should be done. In this study, we use acetate, but other electron donors such as H2 need to be investigated. Also, the synthetic solution representative of COx pore water is amended with Pseudomonas mandelii. Other bacteria should be considered. Finally, it would be interesting to work on the quantification of bacterial messenger RNA. Our preliminary tests show that this approach may provide more precise information on the biomass fraction actively involved in the denitrification process. The next step could be to work with a consortium of bacteria
Finding gene regulatory network candidates using the gene expression knowledge base.

Science.gov (United States)

Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

2014-12-10

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.
A big data pipeline: Identifying dynamic gene regulatory networks from time-course Gene Expression Omnibus data with applications to influenza infection.

Science.gov (United States)

Carey, Michelle; Ramírez, Juan Camilo; Wu, Shuang; Wu, Hulin

2018-07-01

A biological host response to an external stimulus or intervention such as a disease or infection is a dynamic process, which is regulated by an intricate network of many genes and their products. Understanding the dynamics of this gene regulatory network allows us to infer the mechanisms involved in a host response to an external stimulus, and hence aids the discovery of biomarkers of phenotype and biological function. In this article, we propose a modeling/analysis pipeline for dynamic gene expression data, called Pipeline4DGEData, which consists of a series of statistical modeling techniques to construct dynamic gene regulatory networks from the large volumes of high-dimensional time-course gene expression data that are freely available in the Gene Expression Omnibus repository. This pipeline has a consistent and scalable structure that allows it to simultaneously analyze a large number of time-course gene expression data sets, and then integrate the results across different studies. We apply the proposed pipeline to influenza infection data from nine studies and demonstrate that interesting biological findings can be discovered with its implementation.
An evolvable oestrogen receptor activity sensor: development of a modular system for integrating multiple genes into the yeast genome

NARCIS (Netherlands)

Fox, J.E.; Bridgham, J.T.; Bovee, T.F.H.; Thornton, J.W.

2007-01-01

To study a gene interaction network, we developed a gene-targeting strategy that allows efficient and stable genomic integration of multiple genetic constructs at distinct target loci in the yeast genome. This gene-targeting strategy uses a modular plasmid with a recyclable selectable marker and a
Screening of point mutations by multiple SSCP analysis in the dystrophin gene

Energy Technology Data Exchange (ETDEWEB)

Lasa, A.; Baiget, M.; Gallano, P. [Hospital Sant Pau, Barcelona (Spain)

1994-09-01

Duchenne muscular dystrophy (DMD) is a lethal, X-linked neuromuscular disorder. The population frequency of DMD is one in approximately 3500 boys, of whom one third are thought to carry new mutations. The DMD gene is the largest known to date, spanning over 2.3 Mb in band Xp21.2; 79 exons are transcribed into a 14 kb mRNA coding for a protein of 427 kDa which has been named dystrophin. It has been shown that about 65% of affected boys have a gene deletion with a wide variation in localization and size. The remaining affected individuals who have no detectable deletions or duplications would probably carry more subtle mutations that are difficult to detect. These mutations occur in several different exons and seem to be unique to single patients. Their identification represents a formidable goal because of the large size and complexity of the dystrophin gene. SSCP is a very efficient method for the detection of point mutations if the parameters that affect the separation of the strands are optimized for a particular DNA fragment. Multiple SSCP allows the simultaneous study of several exons, and implies the use of different conditions because no single set of conditions will be optimal for all fragments. Seventy-eight DMD patients with no deletion or duplication in the dystrophin gene were selected for the multiple SSCP analysis. Genomic DNA from these patients was amplified using the primers described for the diagnosis procedure (muscle promoter and exons 3, 8, 12, 16, 17, 19, 32, 45, 48 and 51). We have observed different mobility shifts in bands corresponding to exons 8, 12, 43 and 51. In exons 17 and 45, altered electrophoretic patterns were found in different samples identifying polymorphisms already described.
Colloquium paper: uniquely human evolution of sialic acid genetics and biology.

Science.gov (United States)

Varki, Ajit

2010-05-11

Darwinian evolution of humans from our common ancestors with nonhuman primates involved many gene-environment interactions at the population level, and the resulting human-specific genetic changes must contribute to the "Human Condition." Recent data indicate that the biology of sialic acids (which directly involves less than 60 genes) shows more than 10 uniquely human genetic changes in comparison with our closest evolutionary relatives. Known outcomes are tissue-specific changes in abundant cell-surface glycans, changes in specificity and/or expression of multiple proteins that recognize these glycans, and novel pathogen regimes. Specific events include Alu-mediated inactivation of the CMAH gene, resulting in loss of synthesis of the Sia N-glycolylneuraminic acid (Neu5Gc) and increase in expression of the precursor N-acetylneuraminic acid (Neu5Ac); increased expression of alpha2-6-linked Sias (likely because of changed expression of ST6GALI); and multiple changes in SIGLEC genes encoding Sia-recognizing Ig-like lectins (Siglecs). The last includes binding specificity changes (in Siglecs -5, -7, -9, -11, and -12); expression pattern changes (in Siglecs -1, -5, -6, and -11); gene conversion (SIGLEC11); and deletion or pseudogenization (SIGLEC13, SIGLEC14, and SIGLEC16). A nongenetic outcome of the CMAH mutation is human metabolic incorporation of foreign dietary Neu5Gc, in the face of circulating anti-Neu5Gc antibodies, generating a novel "xeno-auto-antigen" situation. Taken together, these data suggest that both the genes associated with Sia biology and the related impacts of the environment comprise a relative "hot spot" of genetic and physiological changes in human evolution, with implications for uniquely human features both in health and disease.
A Partial Least Square Approach for Modeling Gene-gene and Gene-environment Interactions When Multiple Markers Are Genotyped

Science.gov (United States)

Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C.

2008-01-01

Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense SNPs in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey’s 1-df model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women’s Health Initiative (WHI), this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with BMI. PMID:18615621
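The degrees-of-freedom concern raised in this abstract can be made concrete with a small count. The sketch below assumes additive coding (one parameter per SNP and one per SNP-by-SNP product), which is an illustrative simplification and not necessarily the parameterization of the saturated model the authors compare against; the function name and the SNP counts are hypothetical.

```python
def interaction_term_counts(m, n):
    """Count regression terms when every SNP pair across two genes interacts.

    m, n: number of genotyped SNPs in gene A and gene B.
    Assumes additive coding: one main-effect term per SNP and one
    product term per cross-gene SNP pair (an illustrative assumption).
    """
    main_effects = m + n
    pairwise_interactions = m * n
    return {
        "main_effects": main_effects,
        "pairwise_interactions": pairwise_interactions,
        "total": main_effects + pairwise_interactions,
    }


# Hypothetical example: 10 SNPs in one gene, 8 in the other
print(interaction_term_counts(10, 8))
```

Because the interaction count grows as the product of the two SNP counts, even moderately dense genotyping inflates the test's degrees of freedom, which is the motivation the abstract gives for compressing the genotype dimension first.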
A partial least-square approach for modeling gene-gene and gene-environment interactions when multiple markers are genotyped.

Science.gov (United States)

Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C

2009-01-01

Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotide polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one-degree-of-freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index.
Third international congress of plant molecular biology: Molecular biology of plant growth and development

Energy Technology Data Exchange (ETDEWEB)

Hallick, R.B. [ed.

1995-02-01

The Congress was held October 6-11, 1991 in Tucson with approximately 3000 scientists attending and over 300 oral presentations and 1800 posters. Plant molecular biology is one of the most rapidly developing areas of the biological sciences. Recent advances in the ability to isolate genes, to study their expression, and to create transgenic plants have had a major impact on our understanding of many fundamental plant processes. In addition, new approaches have been created to improve plants for agricultural purposes. This is a book of presentations and posters from the conference.
Biomine: predicting links between biological entities using network models of heterogeneous databases

Directory of Open Access Journals (Sweden)

Eronen Lauri

2012-06-01

Full Text Available Abstract Background Biological databases contain large amounts of data concerning the functions and associations of genes and proteins. Integration of data from several such databases into a single repository can aid the discovery of previously unknown connections spanning multiple types of relationships and databases. Results Biomine is a system that integrates cross-references from several biological databases into a graph model with multiple types of edges, such as protein interactions, gene-disease associations and gene ontology annotations. Edges are weighted based on their type, reliability, and informativeness. We present Biomine and evaluate its performance in link prediction, where the goal is to predict pairs of nodes that will be connected in the future, based on current data. In particular, we formulate protein interaction prediction and disease gene prioritization tasks as instances of link prediction. The predictions are based on a proximity measure computed on the integrated graph. We consider and experiment with several such measures, and perform a parameter optimization procedure where different edge types are weighted to optimize link prediction accuracy. We also propose a novel method for disease-gene prioritization, defined as finding a subset of candidate genes that cluster together in the graph. We experimentally evaluate Biomine by predicting future annotations in the source databases and prioritizing lists of putative disease genes. Conclusions The experimental results show that Biomine has strong potential for predicting links when a set of selected candidate links is available. The predictions obtained using the entire Biomine dataset are shown to clearly outperform ones obtained using any single source of data alone, when different types of links are suitably weighted. In the gene prioritization task, an established reference set of disease-associated genes is useful, but the results show that under favorable
Influence of different natural physical fields on biological processes

Science.gov (United States)

Mashinsky, A. L.

2001-01-01

In space flight conditions, gravity, magnetic and electrical fields, as well as ionizing radiation, change both in magnitude and in direction. This causes disruptions in the conduct of some physical processes, chemical reactions, and metabolism in living organisms. In these conditions the metabolic reactions of organisms at different phylogenetic levels undergo changes, such as disturbances in ionic exchange in both lower and higher plants, changes in cell morphology (for example, gyrosity in Proteus (Proteus vulgaris)), spatial disorientation in coleoptiles of wheat (Triticum aestivum) and pea (Pisum sativum) seedlings, and mutational changes in Crepis (Crepis capillaris) and Arabidopsis (Arabidopsis thaliana) seedlings. It has been found that even in the absence of gravity, gravireceptors determining spatial orientation in higher plants under terrestrial conditions are formed in the course of ontogenesis. Under weightlessness this system does not function and spatial orientation is determined by the light flux gradient or by the action of some other factors. Peculiarities of the formation of the gravireceptor apparatus in higher plants, amphibians, fish, and birds under space flight conditions have been observed. It has been found that systems in which responses are accompanied by phase transitions are gravity-sensitive under microgravity conditions. Such reactions also include photosynthesis, which is the main energy production process in plants. In view of the established effects of microgravity and different natural physical fields on biological processes, it has been shown that these processes change due to the absence of initially rigid determination. The established influence of physical fields on biological processes in organisms is the starting point for elucidating the role of gravity and the evolutionary development of various organisms on Earth.
Integration of gene expression and methylation to unravel biological networks in glioblastoma patients.

Science.gov (United States)

Gadaleta, Francesco; Bessonov, Kyrylo; Van Steen, Kristel

2017-02-01

The vast amount of heterogeneous omics data, encompassing a broad range of biomolecular information, requires novel methods of analysis, including those that integrate the available levels of information. In this work, we describe Regression2Net, a computational approach that is able to integrate gene expression and genomic or methylation data in two steps. First, penalized regressions are used to build Expression-Expression (EEnet) and Expression-Genomic or Expression-Methylation (EMnet) networks. Second, network theory is used to highlight important communities of genes. When applying our approach, Regression2Net, to gene expression and methylation profiles for individuals with glioblastoma multiforme, we identified, respectively, 284 and 447 potentially interesting genes in relation to glioblastoma pathology. These genes showed at least one connection in the integrated networks ANDnet and XORnet derived from the aforementioned EEnet and EMnet networks. Whereas the edges in ANDnet occur in both EEnet and EMnet, the edges in XORnet occur in EMnet but not in EEnet. In-depth biological analysis of connected genes in ANDnet and XORnet revealed genes that are related to energy metabolism, cell cycle control (AATF), immune system response, and several cancer types. Importantly, we observed significant overrepresentation of cancer-related pathways including glioma, especially in the XORnet network, suggesting a nonignorable role of methylation in glioblastoma multiforme. In the ANDnet, we furthermore identified potential glioma suppressor genes ACCN3 and ACCN4 linked to the NBPF1 neuroblastoma breakpoint family, as well as numerous ABC transporter genes (ABCA1, ABCB1) suggesting drug resistance of glioblastoma tumors. © 2016 WILEY PERIODICALS, INC.
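The ANDnet and XORnet definitions given in this abstract amount to simple set operations on edge lists. The following is a minimal sketch under that reading; it is not the Regression2Net implementation, the function names are hypothetical, and the gene labels are placeholders. Note that, as the abstract words it, XORnet here keeps edges found in EMnet but not in EEnet, which is a set difference and not a symmetric difference.

```python
def edge(u, v):
    """Undirected edge, so that (u, v) and (v, u) compare equal."""
    return frozenset((u, v))


def and_net(ee_net, em_net):
    """Edges present in both EEnet and EMnet."""
    return ee_net & em_net


def xor_net(ee_net, em_net):
    """Edges present in EMnet but absent from EEnet (per the abstract)."""
    return em_net - ee_net


def connected_genes(net):
    """Genes with at least one connection in the given network."""
    return set().union(*net) if net else set()


# Placeholder networks with hypothetical gene labels
ee = {edge("g1", "g2"), edge("g2", "g3")}
em = {edge("g2", "g1"), edge("g3", "g4")}

print(sorted(connected_genes(and_net(ee, em))))  # genes linked in both networks
print(sorted(connected_genes(xor_net(ee, em))))  # genes linked only via EMnet
```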
Circumventing furin enhances factor VIII biological activity and ameliorates bleeding phenotypes in hemophilia models

Science.gov (United States)

Siner, Joshua I.; Samelson-Jones, Benjamin J.; Crudele, Julie M.; French, Robert A.; Lee, Benjamin J.; Zhou, Shanzhen; Merricks, Elizabeth; Raymer, Robin; Camire, Rodney M.; Arruda, Valder R.

2016-01-01

Processing by the proprotein convertase furin is believed to be critical for the biological activity of multiple proteins involved in hemostasis, including coagulation factor VIII (FVIII). This belief prompted the retention of the furin recognition motif (amino acids 1645–1648) in the design of B-domain–deleted FVIII (FVIII-BDD) products in current clinical use and in the drug development pipeline, as well as in experimental FVIII gene therapy strategies. Here, we report that processing by furin is in fact deleterious to FVIII-BDD secretion and procoagulant activity. Inhibition of furin increases the secretion and decreases the intracellular retention of FVIII-BDD protein in mammalian cells. Our new variant (FVIII-ΔF), in which this recognition motif is removed, efficiently circumvents furin. FVIII-ΔF demonstrates increased recombinant protein yields, enhanced clotting activity, and higher circulating FVIII levels after adeno-associated viral vector–based liver gene therapy in a murine model of severe hemophilia A (HA) compared with FVIII-BDD. Moreover, we observed an amelioration of the bleeding phenotype in severe HA dogs with sustained therapeutic FVIII levels after FVIII-ΔF gene therapy at a lower vector dose than previously employed in this model. The immunogenicity of FVIII-ΔF did not differ from that of FVIII-BDD as a protein or a gene therapeutic. Thus, contrary to previous suppositions, FVIII variants that can avoid furin processing are likely to have enhanced translational potential for HA therapy. PMID:27734034
Effects of simulated microgravity on gene expression and biological phenotypes of a single generation Caenorhabditis elegans cultured on 2 different media.

Science.gov (United States)

Tee, Ling Fei; Neoh, Hui-Min; Then, Sue Mian; Murad, Nor Azian; Asillam, Mohd Fairos; Hashim, Mohd Helmy; Nathan, Sheila; Jamal, Rahman

2017-11-01

Studies of multigenerational Caenorhabditis elegans exposed to long-term spaceflight have revealed expression changes of genes involved in longevity, DNA repair, and locomotion. However, results from spaceflight experiments are difficult to reproduce as space missions are costly and opportunities are rather limited for researchers. In addition, multigenerational cultures of C. elegans used in previous studies contribute to mixture of gene expression profiles from both larvae and adult worms, which were recently reported to be different. Usage of different culture media during microgravity simulation experiments might also give rise to differences in the gene expression and biological phenotypes of the worms. In this study, we investigated the effects of simulated microgravity on the gene expression and biological phenotype profiles of a single generation of C. elegans worms cultured on 2 different culture media. A desktop Random Positioning Machine (RPM) was used to simulate microgravity on the worms for approximately 52 to 54 h. Gene expression profile was analysed using the Affymetrix GeneChip® C. elegans 1.0 ST Array. Only one gene (R01H2.2) was found to be downregulated in nematode growth medium (NGM)-cultured worms exposed to simulated microgravity. On the other hand, eight genes were differentially expressed for C. elegans Maintenance Medium (CeMM)-cultured worms in microgravity; six were upregulated, while two were downregulated. Five of the upregulated genes (C07E3.15, C34H3.21, C32D5.16, F35H8.9 and C34F11.17) encode non-coding RNAs. In terms of biological phenotype, we observed that microgravity-simulated worms experienced minimal changes in terms of lifespan, locomotion and reproductive capabilities in comparison with the ground controls. Taking it all together, simulated microgravity on a single generation of C. elegans did not confer major changes to their gene expression and biological phenotype. Nevertheless, exposure of the worms to microgravity
Comprehensive annotation of secondary metabolite biosynthetic genes and gene clusters of Aspergillus nidulans, A. fumigatus, A. niger and A. oryzae

Science.gov (United States)

2013-01-01

Background Secondary metabolite production, a hallmark of filamentous fungi, is an expanding area of research for the Aspergilli. These compounds are potent chemicals, ranging from deadly toxins to therapeutic antibiotics to potential anti-cancer drugs. The genome sequences for multiple Aspergilli have been determined, and provide a wealth of predictive information about secondary metabolite production. Sequence analysis and gene overexpression strategies have enabled the discovery of novel secondary metabolites and the genes involved in their biosynthesis. The Aspergillus Genome Database (AspGD) provides a central repository for gene annotation and protein information for Aspergillus species. These annotations include Gene Ontology (GO) terms, phenotype data, gene names and descriptions and they are crucial for interpreting both small- and large-scale data and for aiding in the design of new experiments that further Aspergillus research. Results We have manually curated Biological Process GO annotations for all genes in AspGD with recorded functions in secondary metabolite production, adding new GO terms that specifically describe each secondary metabolite. We then leveraged these new annotations to predict roles in secondary metabolism for genes lacking experimental characterization. As a starting point for manually annotating Aspergillus secondary metabolite gene clusters, we used antiSMASH (antibiotics and Secondary Metabolite Analysis SHell) and SMURF (Secondary Metabolite Unknown Regions Finder) algorithms to identify potential clusters in A. nidulans, A. fumigatus, A. niger and A. oryzae, which we subsequently refined through manual curation. Conclusions This set of 266 manually curated secondary metabolite gene clusters will facilitate the investigation of novel Aspergillus secondary metabolites. PMID:23617571
Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

Science.gov (United States)

Rossin, Elizabeth J.; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J.; Tatar, Diana; Benita, Yair

2011-01-01

Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in
Circadian cycles of gene expression in the coral, Acropora millepora.

Directory of Open Access Journals (Sweden)

Aisling K Brady

Full Text Available Circadian rhythms regulate many physiological, behavioral and reproductive processes. These rhythms are often controlled by light, and daily cycles of solar illumination entrain many clock regulated processes. In scleractinian corals a number of different processes and behaviors are associated with specific periods of solar illumination or non-illumination, for example, skeletal deposition, feeding and both brooding and broadcast spawning. We have undertaken an analysis of diurnal expression of the whole transcriptome and more focused studies on a number of candidate circadian genes in the coral Acropora millepora using deep RNA sequencing and quantitative PCR. Many examples of diurnal cycles of RNA abundance were identified, some of which are light responsive and damped quickly under constant darkness, for example, cryptochrome 1 and timeless, but others continue to cycle in a robust manner when kept in constant darkness, for example, clock, cryptochrome 2, cycle and eyes absent, indicating that their transcription is regulated by an endogenous clock entrained to the light-dark cycle. Many other biological processes that varied between day and night were also identified by a clustering analysis of gene ontology annotations. Corals exhibit diurnal patterns of gene expression that may participate in the regulation of circadian biological processes. Rhythmic cycles of gene expression occur under constant darkness in both populations of coral larvae that lack zooxanthellae and in individual adult tissue containing zooxanthellae, indicating that transcription is under the control of a biological clock. In addition to genes potentially involved in regulating circadian processes, many other pathways were found to display diel cycles of transcription.
Quantitative Structure-Activity Relationships and Docking Studies of Calcitonin Gene-Related Peptide Antagonists

DEFF Research Database (Denmark)

Jenssen, Håvard; Mehrabian, Mohadeseh; Kyani, Anahita

2012-01-01

Defining the role of calcitonin gene-related peptide in migraine pathogenesis could lead to the application of calcitonin gene-related peptide antagonists as novel migraine therapeutics. In this work, quantitative structure-activity relationship modeling of biological activities of a large range of calcitonin gene-related peptide antagonists was performed using a panel of physicochemical descriptors. The computational studies evaluated different variable selection techniques and demonstrated shuffling stepwise multiple linear regression to be superior to genetic algorithm-multiple linear regression. The linear quantitative structure-activity relationship model revealed better statistical parameters of cross-validation in comparison with the non-linear support vector regression technique. Implementing only five peptide descriptors into this linear quantitative structure-activity relationship model...
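The cross-validation statistics mentioned in this abstract are not specified in the truncated text. One commonly reported quantity in QSAR work is the leave-one-out q², sketched below in Python for a single-descriptor linear model. The descriptor and activity values are made up, the function names `fit_line` and `loo_q2` are hypothetical, and the one-descriptor fit is a deliberate simplification of the multi-descriptor regression the abstract refers to.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loo_q2(xs, ys):
    """Leave-one-out q2 = 1 - PRESS / total sum of squares.

    Each compound is predicted by a model fitted to all other compounds;
    PRESS is the sum of the squared prediction errors.
    """
    press = 0.0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    mean_y = sum(ys) / len(ys)
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - press / ss_total


# Made-up descriptor values and activities, for illustration only.
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
activity = [2.1, 3.9, 6.2, 7.8, 10.1, 11.9]
print(loo_q2(descriptor, activity))
```

Comparing two modeling techniques in this way would mean computing the same statistic for each on the same compounds; a value closer to 1 indicates better held-out prediction.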
Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates

Science.gov (United States)

Wang, Yongcui; Zhao, Weiling; Zhou, Xiaobo

2016-10-01

Accurate identification of coherent transcriptional modules (subnetworks) in adipose and muscle tissues is important for revealing the mechanisms and co-regulated pathways involved in the development of aging-related diseases. Here, we propose a systematic computational approach, called ICEGM, to Identify the Co-Expression Gene Modules through a novel mathematical framework of Higher-Order Generalized Singular Value Decomposition (HO-GSVD). ICEGM was applied to the adipose, heart and skeletal muscle tissues of old and young female African green vervet monkeys. ICEGM revealed genes associated with the development of inflammation, cardiovascular and skeletal disorders, and cancer. Genes in the ICEGM modules were also enriched in adipocytes, smooth muscle cells, cardiac myocytes, and immune cells. Comprehensive disease annotation and canonical pathway analysis indicated that immune cells, adipocytes, cardiomyocytes, and smooth muscle cells played a synergistic role in cardiac and physical functions in the aged monkeys by regulating the biological processes associated with metabolism, inflammation, and atherosclerosis. In conclusion, ICEGM provides an efficient, systematic framework for decoding the co-expression gene modules in multiple tissues. Analysis of genes in the ICEGM modules yielded important insights into the cooperative role of multiple tissues in the development of diseases.
