WorldWideScience

Sample records for gene selection method

  1. Ultrahigh-dimensional variable selection method for whole-genome gene-gene interaction analysis

    Directory of Open Access Journals (Sweden)

    Ueki Masao

    2012-05-01

    Full Text Available Abstract Background Genome-wide gene-gene interaction analysis using single nucleotide polymorphisms (SNPs is an attractive way for identification of genetic components that confers susceptibility of human complex diseases. Individual hypothesis testing for SNP-SNP pairs as in common genome-wide association study (GWAS however involves difficulty in setting overall p-value due to complicated correlation structure, namely, the multiple testing problem that causes unacceptable false negative results. A large number of SNP-SNP pairs than sample size, so-called the large p small n problem, precludes simultaneous analysis using multiple regression. The method that overcomes above issues is thus needed. Results We adopt an up-to-date method for ultrahigh-dimensional variable selection termed the sure independence screening (SIS for appropriate handling of numerous number of SNP-SNP interactions by including them as predictor variables in logistic regression. We propose ranking strategy using promising dummy coding methods and following variable selection procedure in the SIS method suitably modified for gene-gene interaction analysis. We also implemented the procedures in a software program, EPISIS, using the cost-effective GPGPU (General-purpose computing on graphics processing units technology. EPISIS can complete exhaustive search for SNP-SNP interactions in standard GWAS dataset within several hours. The proposed method works successfully in simulation experiments and in application to real WTCCC (Wellcome Trust Case–control Consortium data. Conclusions Based on the machine-learning principle, the proposed method gives powerful and flexible genome-wide search for various patterns of gene-gene interaction.

  2. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba

    2016-07-20

    The native nature of high dimension low sample size of gene expression data make the classification task more challenging. Therefore, feature (gene) selection become an apparent need. Selecting a meaningful and relevant genes for classifier not only decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we present a new feature selection technique that takes advantage of clustering both samples and genes. Materials and methods We used leukemia gene expression dataset [1]. The effectiveness of the selected features were evaluated by four different classification methods; support vector machines, k-nearest neighbor, random forest, and linear discriminate analysis. The method evaluate the importance and relevance of each gene cluster by summing the expression level for each gene belongs to this cluster. The gene cluster consider important, if it satisfies conditions depend on thresholds and percentage otherwise eliminated. Results Initial analysis identified 7120 differentially expressed genes of leukemia (Fig. 15a), after applying our feature selection methodology we end up with specific 1117 genes discriminating two classes of leukemia (Fig. 15b). Further applying the same method with more stringent higher positive and lower negative threshold condition, number reduced to 58 genes have be tested to evaluate the effectiveness of the method (Fig. 15c). The results of the four classification methods are summarized in Table 11. Conclusions The feature selection method gave good results with minimum classification error. Our heat-map result shows distinct pattern of refines genes discriminating between two classes of leukemia.

  3. Detection of biomarkers for Hepatocellular Carcinoma using a hybrid univariate gene selection methods

    Directory of Open Access Journals (Sweden)

    Abdel Samee Nagwan M

    2012-08-01

    Full Text Available Abstract Background Discovering new biomarkers has a great role in improving early diagnosis of Hepatocellular carcinoma (HCC. The experimental determination of biomarkers needs a lot of time and money. This motivates this work to use in-silico prediction of biomarkers to reduce the number of experiments required for detecting new ones. This is achieved by extracting the most representative genes in microarrays of HCC. Results In this work, we provide a method for extracting the differential expressed genes, up regulated ones, that can be considered candidate biomarkers in high throughput microarrays of HCC. We examine the power of several gene selection methods (such as Pearson’s correlation coefficient, Cosine coefficient, Euclidean distance, Mutual information and Entropy with different estimators in selecting informative genes. A biological interpretation of the highly ranked genes is done using KEGG (Kyoto Encyclopedia of Genes and Genomes pathways, ENTREZ and DAVID (Database for Annotation, Visualization, and Integrated Discovery databases. The top ten genes selected using Pearson’s correlation coefficient and Cosine coefficient contained six genes that have been implicated in cancer (often multiple cancers genesis in previous studies. A fewer number of genes were obtained by the other methods (4 genes using Mutual information, 3genes using Euclidean distance and only one gene using Entropy. A better result was obtained by the utilization of a hybrid approach based on intersecting the highly ranked genes in the output of all investigated methods. This hybrid combination yielded seven genes (2 genes for HCC and 5 genes in different types of cancer in the top ten genes of the list of intersected genes. Conclusions To strengthen the effectiveness of the univariate selection methods, we propose a hybrid approach by intersecting several of these methods in a cascaded manner. This approach surpasses all of univariate selection methods when

  4. A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data

    KAUST Repository

    Abusamra, Heba

    2013-05-01

    Microarray technology has enriched the study of gene expression in such a way that scientists are now able to measure the expression levels of thousands of genes in a single experiment. Microarray gene expression data gained great importance in recent years due to its role in disease diagnoses and prognoses which help to choose the appropriate treatment plan for patients. This technology has shifted a new era in molecular classification, interpreting gene expression data remains a difficult problem and an active research area due to their native nature of “high dimensional low sample size”. Such problems pose great challenges to existing classification methods. Thus, effective feature selection techniques are often needed in this case to aid to correctly classify different tumor types and consequently lead to a better understanding of genetic signatures as well as improve treatment strategies. This thesis aims on a comparative study of state-of-the-art feature selection methods, classification methods, and the combination of them, based on gene expression data. We compared the efficiency of three different classification methods including: support vector machines, k- nearest neighbor and random forest, and eight different feature selection methods, including: information gain, twoing rule, sum minority, max minority, gini index, sum of variances, t- statistics, and one-dimension support vector machine. Five-fold cross validation was used to evaluate the classification performance. Two publicly available gene expression data sets of glioma were used for this study. Different experiments have been applied to compare the performance of the classification methods with and without performing feature selection. Results revealed the important role of feature selection in classifying gene expression data. By performing feature selection, the classification accuracy can be significantly boosted by using a small number of genes. The relationship of features selected in

  5. A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data

    KAUST Repository

    Abusamra, Heba

    2013-01-01

    Different experiments have been applied to compare the performance of the classification methods with and without performing feature selection. Results revealed the important role of feature selection in classifying gene expression data. By performing feature selection, the classification accuracy can be significantly boosted by using a small number of genes. The relationship of features selected in different feature selection methods is investigated and the most frequent features selected in each fold among all methods for both datasets are evaluated.

  6. An Entropy-based gene selection method for cancer classification using microarray data

    Directory of Open Access Journals (Sweden)

    Krishnan Arun

    2005-03-01

    Full Text Available Abstract Background Accurate diagnosis of cancer subtypes remains a challenging problem. Building classifiers based on gene expression data is a promising approach; yet the selection of non-redundant but relevant genes is difficult. The selected gene set should be small enough to allow diagnosis even in regular clinical laboratories and ideally identify genes involved in cancer-specific regulatory pathways. Here an entropy-based method is proposed that selects genes related to the different cancer classes while at the same time reducing the redundancy among the genes. Results The present study identifies a subset of features by maximizing the relevance and minimizing the redundancy of the selected genes. A merit called normalized mutual information is employed to measure the relevance and the redundancy of the genes. In order to find a more representative subset of features, an iterative procedure is adopted that incorporates an initial clustering followed by data partitioning and the application of the algorithm to each of the partitions. A leave-one-out approach then selects the most commonly selected genes across all the different runs and the gene selection algorithm is applied again to pare down the list of selected genes until a minimal subset is obtained that gives a satisfactory accuracy of classification. The algorithm was applied to three different data sets and the results obtained were compared to work done by others using the same data sets Conclusion This study presents an entropy-based iterative algorithm for selecting genes from microarray data that are able to classify various cancer sub-types with high accuracy. In addition, the feature set obtained is very compact, that is, the redundancy between genes is reduced to a large extent. This implies that classifiers can be built with a smaller subset of genes.

  7. Robust gene selection methods using weighting schemes for microarray data analysis.

    Science.gov (United States)

    Kang, Suyeon; Song, Jongwoo

    2017-09-02

    A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.

  8. A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data of Glioma

    KAUST Repository

    Abusamra, Heba

    2013-11-01

    Microarray gene expression data gained great importance in recent years due to its role in disease diagnoses and prognoses which help to choose the appropriate treatment plan for patients. This technology has shifted a new era in molecular classification. Interpreting gene expression data remains a difficult problem and an active research area due to their native nature of “high dimensional low sample size”. Such problems pose great challenges to existing classification methods. Thus, effective feature selection techniques are often needed in this case to aid to correctly classify different tumor types and consequently lead to a better understanding of genetic signatures as well as improve treatment strategies. This paper aims on a comparative study of state-of-the- art feature selection methods, classification methods, and the combination of them, based on gene expression data. We compared the efficiency of three different classification methods including: support vector machines, k-nearest neighbor and random forest, and eight different feature selection methods, including: information gain, twoing rule, sum minority, max minority, gini index, sum of variances, t-statistics, and one-dimension support vector machine. Five-fold cross validation was used to evaluate the classification performance. Two publicly available gene expression data sets of glioma were used in the experiments. Results revealed the important role of feature selection in classifying gene expression data. By performing feature selection, the classification accuracy can be significantly boosted by using a small number of genes. The relationship of features selected in different feature selection methods is investigated and the most frequent features selected in each fold among all methods for both datasets are evaluated.

  9. A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data of Glioma

    KAUST Repository

    Abusamra, Heba

    2013-01-01

    Microarray gene expression data gained great importance in recent years due to its role in disease diagnoses and prognoses which help to choose the appropriate treatment plan for patients. This technology has shifted a new era in molecular classification. Interpreting gene expression data remains a difficult problem and an active research area due to their native nature of “high dimensional low sample size”. Such problems pose great challenges to existing classification methods. Thus, effective feature selection techniques are often needed in this case to aid to correctly classify different tumor types and consequently lead to a better understanding of genetic signatures as well as improve treatment strategies. This paper aims on a comparative study of state-of-the- art feature selection methods, classification methods, and the combination of them, based on gene expression data. We compared the efficiency of three different classification methods including: support vector machines, k-nearest neighbor and random forest, and eight different feature selection methods, including: information gain, twoing rule, sum minority, max minority, gini index, sum of variances, t-statistics, and one-dimension support vector machine. Five-fold cross validation was used to evaluate the classification performance. Two publicly available gene expression data sets of glioma were used in the experiments. Results revealed the important role of feature selection in classifying gene expression data. By performing feature selection, the classification accuracy can be significantly boosted by using a small number of genes. The relationship of features selected in different feature selection methods is investigated and the most frequent features selected in each fold among all methods for both datasets are evaluated.

  10. Locating disease genes using Bayesian variable selection with the Haseman-Elston method

    Directory of Open Access Journals (Sweden)

    He Qimei

    2003-12-01

    Full Text Available Abstract Background We applied stochastic search variable selection (SSVS, a Bayesian model selection method, to the simulated data of Genetic Analysis Workshop 13. We used SSVS with the revisited Haseman-Elston method to find the markers linked to the loci determining change in cholesterol over time. To study gene-gene interaction (epistasis and gene-environment interaction, we adopted prior structures, which incorporate the relationship among the predictors. This allows SSVS to search in the model space more efficiently and avoid the less likely models. Results In applying SSVS, instead of looking at the posterior distribution of each of the candidate models, which is sensitive to the setting of the prior, we ranked the candidate variables (markers according to their marginal posterior probability, which was shown to be more robust to the prior. Compared with traditional methods that consider one marker at a time, our method considers all markers simultaneously and obtains more favorable results. Conclusions We showed that SSVS is a powerful method for identifying linked markers using the Haseman-Elston method, even for weak effects. SSVS is very effective because it does a smart search over the entire model space.

  11. Characteristic gene selection via weighting principal components by singular values.

    Directory of Open Access Journals (Sweden)

    Jin-Xing Liu

    Full Text Available Conventional gene selection methods based on principal component analysis (PCA use only the first principal component (PC of PCA or sparse PCA to select characteristic genes. These methods indeed assume that the first PC plays a dominant role in gene selection. However, in a number of cases this assumption is not satisfied, so the conventional PCA-based methods usually provide poor selection results. In order to improve the performance of the PCA-based gene selection method, we put forward the gene selection method via weighting PCs by singular values (WPCS. Because different PCs have different importance, the singular values are exploited as the weights to represent the influence on gene selection of different PCs. The ROC curves and AUC statistics on artificial data show that our method outperforms the state-of-the-art methods. Moreover, experimental results on real gene expression data sets show that our method can extract more characteristic genes in response to abiotic stresses than conventional gene selection methods.

  12. Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification.

    Science.gov (United States)

    Alshamlan, Hala M; Badr, Ghada H; Alohali, Yousef A

    2015-06-01

    Naturally inspired evolutionary algorithms prove effectiveness when used for solving feature selection and classification problems. Artificial Bee Colony (ABC) is a relatively new swarm intelligence method. In this paper, we propose a new hybrid gene selection method, namely Genetic Bee Colony (GBC) algorithm. The proposed algorithm combines the used of a Genetic Algorithm (GA) along with Artificial Bee Colony (ABC) algorithm. The goal is to integrate the advantages of both algorithms. The proposed algorithm is applied to a microarray gene expression profile in order to select the most predictive and informative genes for cancer classification. In order to test the accuracy performance of the proposed algorithm, extensive experiments were conducted. Three binary microarray datasets are use, which include: colon, leukemia, and lung. In addition, another three multi-class microarray datasets are used, which are: SRBCT, lymphoma, and leukemia. Results of the GBC algorithm are compared with our recently proposed technique: mRMR when combined with the Artificial Bee Colony algorithm (mRMR-ABC). We also compared the combination of mRMR with GA (mRMR-GA) and Particle Swarm Optimization (mRMR-PSO) algorithms. In addition, we compared the GBC algorithm with other related algorithms that have been recently published in the literature, using all benchmark datasets. The GBC algorithm shows superior performance as it achieved the highest classification accuracy along with the lowest average number of selected genes. This proves that the GBC algorithm is a promising approach for solving the gene selection problem in both binary and multi-class cancer classification. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. A BAYESIAN NONPARAMETRIC MIXTURE MODEL FOR SELECTING GENES AND GENE SUBNETWORKS.

    Science.gov (United States)

    Zhao, Yize; Kang, Jian; Yu, Tianwei

    2014-06-01

    It is very challenging to select informative features from tens of thousands of measured features in high-throughput data analysis. Recently, several parametric/regression models have been developed utilizing the gene network information to select genes or pathways strongly associated with a clinical/biological outcome. Alternatively, in this paper, we propose a nonparametric Bayesian model for gene selection incorporating network information. In addition to identifying genes that have a strong association with a clinical outcome, our model can select genes with particular expressional behavior, in which case the regression models are not directly applicable. We show that our proposed model is equivalent to an infinity mixture model for which we develop a posterior computation algorithm based on Markov chain Monte Carlo (MCMC) methods. We also propose two fast computing algorithms that approximate the posterior simulation with good accuracy but relatively low computational cost. We illustrate our methods on simulation studies and the analysis of Spellman yeast cell cycle microarray data.

  14. Multiclass gene selection using Pareto-fronts.

    Science.gov (United States)

    Rajapakse, Jagath C; Mundra, Piyushkumar A

    2013-01-01

    Filter methods are often used for selection of genes in multiclass sample classification by using microarray data. Such techniques usually tend to bias toward a few classes that are easily distinguishable from other classes due to imbalances of strong features and sample sizes of different classes. It could therefore lead to selection of redundant genes while missing the relevant genes, leading to poor classification of tissue samples. In this manuscript, we propose to decompose multiclass ranking statistics into class-specific statistics and then use Pareto-front analysis for selection of genes. This alleviates the bias induced by class intrinsic characteristics of dominating classes. The use of Pareto-front analysis is demonstrated on two filter criteria commonly used for gene selection: F-score and KW-score. A significant improvement in classification performance and reduction in redundancy among top-ranked genes were achieved in experiments with both synthetic and real-benchmark data sets.

  15. Parameters selection in gene selection using Gaussian kernel support vector machines by genetic algorithm

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear statistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs as better alternatives to the common practice of selecting the apparently best parameters by using a genetic algorithm to search for a couple of optimal parameter. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracies with these genes.

  16. Informative gene selection using Adaptive Analytic Hierarchy Process (A2HP

    Directory of Open Access Journals (Sweden)

    Abhishek Bhola

    2017-12-01

    Full Text Available Gene expression dataset derived from microarray experiments are marked by large number of genes, which contains the gene expression values at different sample conditions/time-points. Selection of informative genes from these large datasets is an issue of major concern for various researchers and biologists. In this study, we propose a gene selection and dimensionality reduction method called Adaptive Analytic Hierarchy Process (A2HP. Traditional analytic hierarchy process is a multiple-criteria based decision analysis method whose result depends upon the expert knowledge or decision makers. It is mainly used to solve the decision problems in different fields. On the other hand, A2HP is a fused method that combines the outcomes of five individual gene selection ranking methods t-test, chi-square variance test, z-test, wilcoxon test and signal-to-noise ratio (SNR. At first, the preprocessing of gene expression dataset is done and then the reduced number of genes obtained, will be fed as input for A2HP. A2HP utilizes both quantitative and qualitative factors to select the informative genes. Results demonstrate that A2HP selects efficient number of genes as compared to the individual gene selection methods. The percentage of deduction in number of genes and time complexity are taken as the performance measure for the proposed method. And it is shown that A2HP outperforms individual gene selection methods.

  17. Statistical Redundancy Testing for Improved Gene Selection in Cancer Classification Using Microarray Data

    Directory of Open Access Journals (Sweden)

    J. Sunil Rao

    2007-01-01

    Full Text Available In gene selection for cancer classifi cation using microarray data, we define an eigenvalue-ratio statistic to measure a gene’s contribution to the joint discriminability when this gene is included into a set of genes. Based on this eigenvalueratio statistic, we define a novel hypothesis testing for gene statistical redundancy and propose two gene selection methods. Simulation studies illustrate the agreement between statistical redundancy testing and gene selection methods. Real data examples show the proposed gene selection methods can select a compact gene subset which can not only be used to build high quality cancer classifiers but also show biological relevance.

  18. FiGS: a filter-based gene selection workbench for microarray data

    Directory of Open Access Journals (Sweden)

    Yun Taegyun

    2010-01-01

    Full Text Available Abstract Background The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous test of different methodological strategies for a given microarray dataset. Results FiGS is a web-based workbench that automatically compares various gene selection procedures and provides the optimal gene selection result for an input microarray dataset. FiGS builds up diverse gene selection procedures by aligning different feature selection techniques and classifiers. In addition to the highly reputed techniques, FiGS diversifies the gene selection procedures by incorporating gene clustering options in the feature selection step and different data pre-processing options in classifier training step. All candidate gene selection procedures are evaluated by the .632+ bootstrap errors and listed with their classification accuracies and selected gene sets. FiGS runs on parallelized computing nodes that capacitate heavy computations. FiGS is freely accessible at http://gexp.kaist.ac.kr/figs. Conclusion FiGS is an web-based application that automates an extensive search for the optimized gene selection analysis for a microarray dataset in a parallel computing environment. FiGS will provide both an efficient and comprehensive means of acquiring optimal gene sets that discriminate disease states from microarray datasets.

  19. Gene selection for microarray data classification via subspace learning and manifold regularization.

    Science.gov (United States)

    Tang, Chang; Cao, Lijuan; Zheng, Xiao; Wang, Minhui

    2017-12-19

    With the rapid development of DNA microarray technology, large amount of genomic data has been generated. Classification of these microarray data is a challenge task since gene expression data are often with thousands of genes but a small number of samples. In this paper, an effective gene selection method is proposed to select the best subset of genes for microarray data with the irrelevant and redundant genes removed. Compared with original data, the selected gene subset can benefit the classification task. We formulate the gene selection task as a manifold regularized subspace learning problem. In detail, a projection matrix is used to project the original high dimensional microarray data into a lower dimensional subspace, with the constraint that the original genes can be well represented by the selected genes. Meanwhile, the local manifold structure of original data is preserved by a Laplacian graph regularization term on the low-dimensional data space. The projection matrix can serve as an importance indicator of different genes. An iterative update algorithm is developed for solving the problem. Experimental results on six publicly available microarray datasets and one clinical dataset demonstrate that the proposed method performs better when compared with other state-of-the-art methods in terms of microarray data classification. Graphical Abstract The graphical abstract of this work.

  20. Gene selection heuristic algorithm for nutrigenomics studies.

    Science.gov (United States)

    Valour, D; Hue, I; Grimard, B; Valour, B

    2013-07-15

    Large datasets from -omics studies need to be deeply investigated. The aim of this paper is to provide a new method (LEM method) for the search of transcriptome and metabolome connections. The heuristic algorithm here described extends the classical canonical correlation analysis (CCA) to a high number of variables (without regularization) and combines well-conditioning and fast-computing in "R." Reduced CCA models are summarized in PageRank matrices, the product of which gives a stochastic matrix that resumes the self-avoiding walk covered by the algorithm. Then, a homogeneous Markov process applied to this stochastic matrix converges the probabilities of interconnection between genes, providing a selection of disjointed subsets of genes. This is an alternative to regularized generalized CCA for the determination of blocks within the structure matrix. Each gene subset is thus linked to the whole metabolic or clinical dataset that represents the biological phenotype of interest. Moreover, this selection process reaches the aim of biologists who often need small sets of genes for further validation or extended phenotyping. The algorithm is shown to work efficiently on three published datasets, resulting in meaningfully broadened gene networks.

  1. Methods for selecting fixed-effect models for heterogeneous codon evolution, with comments on their application to gene and genome data.

    Science.gov (United States)

    Bao, Le; Gu, Hong; Dunn, Katherine A; Bielawski, Joseph P

    2007-02-08

    Models of codon evolution have proven useful for investigating the strength and direction of natural selection. In some cases, a priori biological knowledge has been used successfully to model heterogeneous evolutionary dynamics among codon sites. These are called fixed-effect models, and they require that all codon sites are assigned to one of several partitions which are permitted to have independent parameters for selection pressure, evolutionary rate, transition to transversion ratio or codon frequencies. For single gene analysis, partitions might be defined according to protein tertiary structure, and for multiple gene analysis partitions might be defined according to a gene's functional category. Given a set of related fixed-effect models, the task of selecting the model that best fits the data is not trivial. In this study, we implement a set of fixed-effect codon models which allow for different levels of heterogeneity among partitions in the substitution process. We describe strategies for selecting among these models by a backward elimination procedure, Akaike information criterion (AIC) or a corrected Akaike information criterion (AICc). We evaluate the performance of these model selection methods via a simulation study, and make several recommendations for real data analysis. Our simulation study indicates that the backward elimination procedure can provide a reliable method for model selection in this setting. We also demonstrate the utility of these models by application to a single-gene dataset partitioned according to tertiary structure (abalone sperm lysin), and a multi-gene dataset partitioned according to the functional category of the gene (flagellar-related proteins of Listeria). Fixed-effect models have advantages and disadvantages. Fixed-effect models are desirable when data partitions are known to exhibit significant heterogeneity or when a statistical test of such heterogeneity is desired. They have the disadvantage of requiring a priori

  2. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Directory of Open Access Journals (Sweden)

    Landfors Mattias

    2010-10-01

    Full Text Available Abstract Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered, missing value imputation (2, standardization of data (2, gene selection (19 or clustering method (11. The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that

  3. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    Science.gov (United States)

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is

  4. Selection of Phototransduction Genes in Homo sapiens.

    Science.gov (United States)

    Christopher, Mark; Scheetz, Todd E; Mullins, Robert F; Abràmoff, Michael D

    2013-08-13

    We investigated the evidence of recent positive selection in the human phototransduction system at single nucleotide polymorphism (SNP) and gene level. SNP genotyping data from the International HapMap Project for European, Eastern Asian, and African populations was used to discover differences in haplotype length and allele frequency between these populations. Numeric selection metrics were computed for each SNP and aggregated into gene-level metrics to measure evidence of recent positive selection. The level of recent positive selection in phototransduction genes was evaluated and compared to a set of genes shown previously to be under recent selection, and a set of highly conserved genes as positive and negative controls, respectively. Six of 20 phototransduction genes evaluated had gene-level selection metrics above the 90th percentile: RGS9, GNB1, RHO, PDE6G, GNAT1, and SLC24A1. The selection signal across these genes was found to be of similar magnitude to the positive control genes and much greater than the negative control genes. There is evidence for selective pressure in the genes involved in retinal phototransduction, and traces of this selective pressure can be demonstrated using SNP-level and gene-level metrics of allelic variation. We hypothesize that the selective pressure on these genes was related to their role in low light vision and retinal adaptation to ambient light changes. Uncovering the underlying genetics of evolutionary adaptations in phototransduction not only allows greater understanding of vision and visual diseases, but also the development of patient-specific diagnostic and intervention strategies.

  5. Genetic signature of strong recent positive selection at interleukin-32 gene in goat

    Directory of Open Access Journals (Sweden)

    Akhtar Rasool Asif

    2017-07-01

    Full Text Available Objective Identification of the candidate genes that play key roles in phenotypic variations can provide new information about evolution and positive selection. Interleukin (IL-32 is involved in many biological processes, however, its role for the immune response against various diseases in mammals is poorly understood. Therefore, the current investigation was performed for the better understanding of the molecular evolution and the positive selection of single nucleotide polymorphisms in IL-32 gene. Methods By using fixation index (FST based method, IL-32 (9375 gene was found to be outlier and under significant positive selection with the provisional combined allocation of mean heterozygosity and FST. Using nucleotide sequences of 11 mammalian species from National Center for Biotechnology Information database, the evolutionary selection of IL-32 gene was determined using Maximum likelihood model method, through four models (M1a, M2a, M7, and M8 in Codeml program of phylogenetic analysis by maximum liklihood. Results IL-32 is detected under positive selection using the FST simulations method. The phylogenetic tree revealed that goat IL-32 was in close resemblance with sheep IL-32. The coding nucleotide sequences were compared among 11 species and it was found that the goat IL-32 gene shared identity with sheep (96.54%, bison (91.97%, camel (58.39%, cat (56.59%, buffalo (56.50%, human (56.13%, dog (50.97%, horse (54.04%, and rabbit (53.41% respectively. Conclusion This study provides evidence for IL-32 gene as under significant positive selection in goat.

  6. An Efficient Ensemble Learning Method for Gene Microarray Classification

    Directory of Open Access Journals (Sweden)

    Alireza Osareh

    2013-01-01

    Full Text Available The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.

  7. Prospective identification of malaria parasite genes under balancing selection.

    Directory of Open Access Journals (Sweden)

    Kevin K A Tetteh

    Full Text Available Endemic human pathogens are subject to strong immune selection, and interrogation of pathogen genome variation for signatures of balancing selection can identify important target antigens. Several major antigen genes in the malaria parasite Plasmodium falciparum have shown such signatures in polymorphism-versus-divergence indices (comparing with the chimpanzee parasite P. reichenowi, and in allele frequency based indices.To compare methods for prospective identification of genes under balancing selection, 26 additional genes known or predicted to encode surface-exposed proteins of the invasive blood stage merozoite were first sequenced from a panel of 14 independent P. falciparum cultured lines and P. reichenowi. Six genes at the positive extremes of one or both of the Hudson-Kreitman-Aguade (HKA and McDonald-Kreitman (MK indices were identified. Allele frequency based analysis was then performed on a Gambian P. falciparum population sample for these six genes and three others as controls. Tajima's D (TjD index was most highly positive for the msp3/6-like PF10_0348 (TjD = 1.96 as well as the positive control ama1 antigen gene (TjD = 1.22. Across the genes there was a strong correlation between population TjD values and the relative HKA indices (whether derived from the population or the panel of cultured laboratory isolates, but no correlation with the MK indices.Although few individual parasite genes show significant evidence of balancing selection, analysis of population genomic and comparative sequence data with the HKA and TjD indices should discriminate those that do, and thereby identify likely targets of immunity.

  8. A feature selection approach for identification of signature genes from SAGE data

    Directory of Open Access Journals (Sweden)

    Silva Paulo JS

    2007-05-01

    Full Text Available Abstract Background One goal of gene expression profiling is to identify signature genes that robustly distinguish different types or grades of tumors. Several tumor classifiers based on expression profiling have been proposed using microarray technique. Due to important differences in the probabilistic models of microarray and SAGE technologies, it is important to develop suitable techniques to select specific genes from SAGE measurements. Results A new framework to select specific genes that distinguish different biological states based on the analysis of SAGE data is proposed. The new framework applies the bolstered error for the identification of strong genes that separate the biological states in a feature space defined by the gene expression of a training set. Credibility intervals defined from a probabilistic model of SAGE measurements are used to identify the genes that distinguish the different states with more reliability among all gene groups selected by the strong genes method. A score taking into account the credibility and the bolstered error values in order to rank the groups of considered genes is proposed. Results obtained using SAGE data from gliomas are presented, thus corroborating the introduced methodology. Conclusion The model representing counting data, such as SAGE, provides additional statistical information that allows a more robust analysis. The additional statistical information provided by the probabilistic model is incorporated in the methodology described in the paper. The introduced method is suitable to identify signature genes that lead to a good separation of the biological states using SAGE and may be adapted for other counting methods such as Massive Parallel Signature Sequencing (MPSS or the recent Sequencing-By-Synthesis (SBS technique. Some of such genes identified by the proposed method may be useful to generate classifiers.

  9. Statistical approach for selection of biologically informative genes.

    Science.gov (United States)

    Das, Samarendra; Rai, Anil; Mishra, D C; Rai, Shesh N

    2018-05-20

    Selection of informative genes from high dimensional gene expression data has emerged as an important research area in genomics. Many gene selection techniques have been proposed so far are either based on relevancy or redundancy measure. Further, the performance of these techniques has been adjudged through post selection classification accuracy computed through a classifier using the selected genes. This performance metric may be statistically sound but may not be biologically relevant. A statistical approach, i.e. Boot-MRMR, was proposed based on a composite measure of maximum relevance and minimum redundancy, which is both statistically sound and biologically relevant for informative gene selection. For comparative evaluation of the proposed approach, we developed two biological sufficient criteria, i.e. Gene Set Enrichment with QTL (GSEQ) and biological similarity score based on Gene Ontology (GO). Further, a systematic and rigorous evaluation of the proposed technique with 12 existing gene selection techniques was carried out using five gene expression datasets. This evaluation was based on a broad spectrum of statistically sound (e.g. subject classification) and biological relevant (based on QTL and GO) criteria under a multiple criteria decision-making framework. The performance analysis showed that the proposed technique selects informative genes which are more biologically relevant. The proposed technique is also found to be quite competitive with the existing techniques with respect to subject classification and computational time. Our results also showed that under the multiple criteria decision-making setup, the proposed technique is best for informative gene selection over the available alternatives. Based on the proposed approach, an R Package, i.e. BootMRMR has been developed and available at https://cran.r-project.org/web/packages/BootMRMR. This study will provide a practical guide to select statistical techniques for selecting informative genes

  10. Recent adaptive events in human brain revealed by meta-analysis of positively selected genes.

    Directory of Open Access Journals (Sweden)

    Yue Huang

    Full Text Available BACKGROUND AND OBJECTIVES: Analysis of positively-selected genes can help us understand how human evolved, especially the evolution of highly developed cognitive functions. However, previous works have reached conflicting conclusions regarding whether human neuronal genes are over-represented among genes under positive selection. METHODS AND RESULTS: We divided positively-selected genes into four groups according to the identification approaches, compiling a comprehensive list from 27 previous studies. We showed that genes that are highly expressed in the central nervous system are enriched in recent positive selection events in human history identified by intra-species genomic scan, especially in brain regions related to cognitive functions. This pattern holds when different datasets, parameters and analysis pipelines were used. Functional category enrichment analysis supported these findings, showing that synapse-related functions are enriched in genes under recent positive selection. In contrast, immune-related functions, for instance, are enriched in genes under ancient positive selection revealed by inter-species coding region comparison. We further demonstrated that most of these patterns still hold even after controlling for genomic characteristics that might bias genome-wide identification of positively-selected genes including gene length, gene density, GC composition, and intensity of negative selection. CONCLUSION: Our rigorous analysis resolved previous conflicting conclusions and revealed recent adaptation of human brain functions.

  11. Entropy-based gene ranking without selection bias for the predictive classification of microarray data

    Directory of Open Access Journals (Sweden)

    Serafini Maria

    2003-11-01

    Full Text Available Abstract Background We describe the E-RFE method for gene ranking, which is useful for the identification of markers in the predictive classification of array data. The method supports a practical modeling scheme designed to avoid the construction of classification rules based on the selection of too small gene subsets (an effect known as the selection bias, in which the estimated predictive errors are too optimistic due to testing on samples already considered in the feature selection process. Results With E-RFE, we speed up the recursive feature elimination (RFE with SVM classifiers by eliminating chunks of uninteresting genes using an entropy measure of the SVM weights distribution. An optimal subset of genes is selected according to a two-strata model evaluation procedure: modeling is replicated by an external stratified-partition resampling scheme, and, within each run, an internal K-fold cross-validation is used for E-RFE ranking. Also, the optimal number of genes can be estimated according to the saturation of Zipf's law profiles. Conclusions Without a decrease of classification accuracy, E-RFE allows a speed-up factor of 100 with respect to standard RFE, while improving on alternative parametric RFE reduction strategies. Thus, a process for gene selection and error estimation is made practical, ensuring control of the selection bias, and providing additional diagnostic indicators of gene importance.

  12. Recursive Cluster Elimination (RCE for classification and feature selection from gene expression data

    Directory of Open Access Journals (Sweden)

    Showe Louise C

    2007-05-01

    Full Text Available Abstract Background Classification studies using gene expression datasets are usually based on small numbers of samples and tens of thousands of genes. The selection of those genes that are important for distinguishing the different sample classes being compared, poses a challenging problem in high dimensional data analysis. We describe a new procedure for selecting significant genes as recursive cluster elimination (RCE rather than recursive feature elimination (RFE. We have tested this algorithm on six datasets and compared its performance with that of two related classification procedures with RFE. Results We have developed a novel method for selecting significant genes in comparative gene expression studies. This method, which we refer to as SVM-RCE, combines K-means, a clustering method, to identify correlated gene clusters, and Support Vector Machines (SVMs, a supervised machine learning classification method, to identify and score (rank those gene clusters for the purpose of classification. K-means is used initially to group genes into clusters. Recursive cluster elimination (RCE is then applied to iteratively remove those clusters of genes that contribute the least to the classification performance. SVM-RCE identifies the clusters of correlated genes that are most significantly differentially expressed between the sample classes. Utilization of gene clusters, rather than individual genes, enhances the supervised classification accuracy of the same data as compared to the accuracy when either SVM or Penalized Discriminant Analysis (PDA with recursive feature elimination (SVM-RFE and PDA-RFE are used to remove genes based on their individual discriminant weights. Conclusion SVM-RCE provides improved classification accuracy with complex microarray data sets when it is compared to the classification accuracy of the same datasets using either SVM-RFE or PDA-RFE. SVM-RCE identifies clusters of correlated genes that when considered together

  13. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

    Directory of Open Access Journals (Sweden)

    Hala Alshamlan

    2015-01-01

    Full Text Available An artificial bee colony (ABC is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR, and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO. The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  14. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

    Science.gov (United States)

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  15. Natural Selection in Virulence Genes of Francisella tularensis.

    Science.gov (United States)

    Gunnell, Mark K; Robison, Richard A; Adams, Byron J

    2016-06-01

    A fundamental tenet of evolution is that alleles that are under negative selection are often deleterious and confer no evolutionary advantage. Negatively selected alleles are removed from the gene pool and are eventually extinguished from the population. Conversely, alleles under positive selection do confer an evolutionary advantage and lead to an increase in the overall fitness of the organism. These alleles increase in frequency until they eventually become fixed in the population. Francisella tularensis is a zoonotic pathogen and a potential biothreat agent. The most virulent type of F. tularensis, Type A, is distributed across North America with Type A.I occurring mainly in the east and Type A.II appearing mainly in the west. F. tularensis is thought to be a genome in decay (losing genes) because of the relatively large number of pseudogenes present in its genome. We hypothesized that the observed frequency of gene loss/pseudogenes may be an artifact of evolution in response to a changing environment, and that genes involved in virulence should be under strong positive selection. To test this hypothesis, we sequenced and compared whole genomes of Type A.I and A.II isolates. We analyzed a subset of virulence and housekeeping genes from several F. tularensis subspecies genomes to ascertain the presence and extent of positive selection. Eleven previously identified virulence genes were screened for positive selection along with 10 housekeeping genes. Analyses of selection yielded one housekeeping gene and 7 virulence genes which showed significant evidence of positive selection at loci implicated in cell surface structures and membrane proteins, metabolism and biosynthesis, transcription, translation and cell separation, and substrate binding and transport. Our results suggest that while the loss of functional genes through disuse could be accelerated by negative selection, the genome decay in Francisella could also be the byproduct of adaptive evolution

  16. Gene selection and classification for cancer microarray data based on machine learning and similarity measures

    Directory of Open Access Journals (Sweden)

    Liu Qingzhong

    2011-12-01

    Full Text Available Abstract Background Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money. Results To deal with redundant information and improve classification, we propose a gene selection method, Recursive Feature Addition, which combines supervised learning and statistical similarity measures. To determine the final optimal gene set for prediction and classification, we propose an algorithm, Lagging Prediction Peephole Optimization. By using six benchmark microarray gene expression data sets, we compared Recursive Feature Addition with recently developed gene selection methods: Support Vector Machine Recursive Feature Elimination, Leave-One-Out Calculation Sequential Forward Selection and several others. Conclusions On average, with the use of popular learning machines including Nearest Mean Scaled Classifier, Support Vector Machine, Naive Bayes Classifier and Random Forest, Recursive Feature Addition outperformed other methods. Our studies also showed that Lagging Prediction Peephole Optimization is superior to random strategy; Recursive Feature Addition with Lagging Prediction Peephole Optimization obtained better testing accuracies than the gene selection method varSelRF.

  17. Gene selection for cancer classification with the help of bees.

    Science.gov (United States)

    Moosa, Johra Muhammad; Shakur, Rameen; Kaykobad, Mohammad; Rahman, Mohammad Sohel

    2016-08-10

    Development of biologically relevant models from gene expression data notably, microarray data has become a topic of great interest in the field of bioinformatics and clinical genetics and oncology. Only a small number of gene expression data compared to the total number of genes explored possess a significant correlation with a certain phenotype. Gene selection enables researchers to obtain substantial insight into the genetic nature of the disease and the mechanisms responsible for it. Besides improvement of the performance of cancer classification, it can also cut down the time and cost of medical diagnoses. This study presents a modified Artificial Bee Colony Algorithm (ABC) to select minimum number of genes that are deemed to be significant for cancer along with improvement of predictive accuracy. The search equation of ABC is believed to be good at exploration but poor at exploitation. To overcome this limitation we have modified the ABC algorithm by incorporating the concept of pheromones which is one of the major components of Ant Colony Optimization (ACO) algorithm and a new operation in which successive bees communicate to share their findings. The proposed algorithm is evaluated using a suite of ten publicly available datasets after the parameters are tuned scientifically with one of the datasets. Obtained results are compared to other works that used the same datasets. The performance of the proposed method is proved to be superior. The method presented in this paper can provide subset of genes leading to more accurate classification results while the number of selected genes is smaller. Additionally, the proposed modified Artificial Bee Colony Algorithm could conceivably be applied to problems in other areas as well.

  18. Feature selection and classification of MAQC-II breast cancer and multiple myeloma microarray gene expression data.

    Directory of Open Access Journals (Sweden)

    Qingzhong Liu

    Full Text Available Microarray data has a high dimension of variables but available datasets usually have only a small number of samples, thereby making the study of such datasets interesting and challenging. In the task of analyzing microarray data for the purpose of, e.g., predicting gene-disease association, feature selection is very important because it provides a way to handle the high dimensionality by exploiting information redundancy induced by associations among genetic markers. Judicious feature selection in microarray data analysis can result in significant reduction of cost while maintaining or improving the classification or prediction accuracy of learning machines that are employed to sort out the datasets. In this paper, we propose a gene selection method called Recursive Feature Addition (RFA, which combines supervised learning and statistical similarity measures. We compare our method with the following gene selection methods: Support Vector Machine Recursive Feature Elimination (SVMRFE, Leave-One-Out Calculation Sequential Forward Selection (LOOCSFS, Gradient based Leave-one-out Gene Selection (GLGS. To evaluate the performance of these gene selection methods, we employ several popular learning classifiers on the MicroArray Quality Control phase II on predictive modeling (MAQC-II breast cancer dataset and the MAQC-II multiple myeloma dataset. Experimental results show that gene selection is strictly paired with learning classifier. Overall, our approach outperforms other compared methods. The biological functional analysis based on the MAQC-II breast cancer dataset convinced us to apply our method for phenotype prediction. Additionally, learning classifiers also play important roles in the classification of microarray data and our experimental results indicate that the Nearest Mean Scale Classifier (NMSC is a good choice due to its prediction reliability and its stability across the three performance measurements: Testing accuracy, MCC values, and

  19. Confirming candidate genes for longevity in Drosophila melanogaster using two different genetic backgrounds and selection methods

    DEFF Research Database (Denmark)

    Wit, Janneke; Frydenberg, Jane; Sarup, Pernille Merete

    2013-01-01

    usually focussed on one sex and on flies originating from one genetic background, and results from different studies often do not overlap. Using D. melanogaster selected for increased longevity we aimed to find robust longevity related genes by examining gene expression in both sexes of flies originating......Elucidating genes that affect life span or that can be used as biomarkers for ageing has received attention in diverse studies in recent years. Using model organisms and various approaches several genes have been linked to the longevity phenotype. For Drosophila melanogaster those studies have...... from different genetic backgrounds. Further, we compared expression changes across three ages, when flies were young, middle aged or old, to examine how candidate gene expression changes with the onset of ageing. We selected 10 genes based on their expression differences in prior microarray studies...

  20. ReliefSeq: a gene-wise adaptive-K nearest-neighbor feature selection tool for finding gene-gene interactions and main effects in mRNA-Seq gene expression data.

    Directory of Open Access Journals (Sweden)

    Brett A McKinney

    Full Text Available Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k for each gene to optimize the Relief-F test statistics (importance scores for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to

  1. A metaheuristic optimization framework for informative gene selection

    Directory of Open Access Journals (Sweden)

    Kaberi Das

    Full Text Available This paper presents a metaheuristic framework using Harmony Search (HS with Genetic Algorithm (GA for gene selection. The internal architecture of the proposed model broadly works in two phases, in the first phase, the model allows the hybridization of HS with GA to compute and evaluate the fitness of the randomly selected solutions of binary strings and then HS ranks the solutions in descending order of their fitness. In the second phase, the offsprings are generated using crossover and mutation operations of GA and finally, those offsprings were selected for the next generation whose fitness value is more than their parents evaluated by SVM classifier. The accuracy of the final gene subsets obtained from this model has been evaluated using SVM classifiers. The merit of this approach is analyzed by experimental results on five benchmark datasets and the results showed an impressive accuracy over existing feature selection approaches. The occurrence of gene subsets selected from this model have also been computed and the most often selected gene subsets with the probability of [0.1–0.9] have been chosen as optimal sets of informative genes. Finally, the performance of those selected informative gene subsets have been measured and established through probabilistic measures. Keywords: Gene Selection, Metaheuristic, Harmony Search Algorithm, Genetic Algorithm, SVM

  2. Multi-level gene/MiRNA feature selection using deep belief nets and active learning.

    Science.gov (United States)

    Ibrahim, Rania; Yousri, Noha A; Ismail, Mohamed A; El-Makky, Nagwa M

    2014-01-01

    Selecting the most discriminative genes/miRNAs has been raised as an important task in bioinformatics to enhance disease classifiers and to mitigate the dimensionality curse problem. Original feature selection methods choose genes/miRNAs based on their individual features regardless of how they perform together. Considering group features instead of individual ones provides a better view for selecting the most informative genes/miRNAs. Recently, deep learning has proven its ability in representing the data in multiple levels of abstraction, allowing for better discrimination between different classes. However, the idea of using deep learning for feature selection is not widely used in the bioinformatics field yet. In this paper, a novel multi-level feature selection approach named MLFS is proposed for selecting genes/miRNAs based on expression profiles. The approach is based on both deep and active learning. Moreover, an extension to use the technique for miRNAs is presented by considering the biological relation between miRNAs and genes. Experimental results show that the approach was able to outperform classical feature selection methods in hepatocellular carcinoma (HCC) by 9%, lung cancer by 6% and breast cancer by around 10% in F1-measure. Results also show the enhancement in F1-measure of our approach over recently related work in [1] and [2].

  3. Evaluation of gene importance in microarray data based upon probability of selection

    Directory of Open Access Journals (Sweden)

    Fu Li M

    2005-03-01

    Full Text Available Abstract Background Microarray devices permit a genome-scale evaluation of gene function. This technology has catalyzed biomedical research and development in recent years. As many important diseases can be traced down to the gene level, a long-standing research problem is to identify specific gene expression patterns linking to metabolic characteristics that contribute to disease development and progression. The microarray approach offers an expedited solution to this problem. However, it has posed a challenging issue to recognize disease-related genes expression patterns embedded in the microarray data. In selecting a small set of biologically significant genes for classifier design, the nature of high data dimensionality inherent in this problem creates substantial amount of uncertainty. Results Here we present a model for probability analysis of selected genes in order to determine their importance. Our contribution is that we show how to derive the P value of each selected gene in multiple gene selection trials based on different combinations of data samples and how to conduct a reliability analysis accordingly. The importance of a gene is indicated by its associated P value in that a smaller value implies higher information content from information theory. On the microarray data concerning the subtype classification of small round blue cell tumors, we demonstrate that the method is capable of finding the smallest set of genes (19 genes with optimal classification performance, compared with results reported in the literature. Conclusion In classifier design based on microarray data, the probability value derived from gene selection based on multiple combinations of data samples enables an effective mechanism for reducing the tendency of fitting local data particularities.

  4. Feature Genes Selection Using Supervised Locally Linear Embedding and Correlation Coefficient for Microarray Classification.

    Science.gov (United States)

    Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou

    2018-01-01

    The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC 2 ), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.

  5. A dual selection based, targeted gene replacement tool for Magnaporthe grisea and Fusarium oxysporum.

    Science.gov (United States)

    Khang, Chang Hyun; Park, Sook-Young; Lee, Yong-Hwan; Kang, Seogchan

    2005-06-01

    Rapid progress in fungal genome sequencing presents many new opportunities for functional genomic analysis of fungal biology through the systematic mutagenesis of the genes identified through sequencing. However, the lack of efficient tools for targeted gene replacement is a limiting factor for fungal functional genomics, as it often necessitates the screening of a large number of transformants to identify the desired mutant. We developed an efficient method of gene replacement and evaluated factors affecting the efficiency of this method using two plant pathogenic fungi, Magnaporthe grisea and Fusarium oxysporum. This method is based on Agrobacterium tumefaciens-mediated transformation with a mutant allele of the target gene flanked by the herpes simplex virus thymidine kinase (HSVtk) gene as a conditional negative selection marker against ectopic transformants. The HSVtk gene product converts 5-fluoro-2'-deoxyuridine to a compound toxic to diverse fungi. Because ectopic transformants express HSVtk, while gene replacement mutants lack HSVtk, growing transformants on a medium amended with 5-fluoro-2'-deoxyuridine facilitates the identification of targeted mutants by counter-selecting against ectopic transformants. In addition to M. grisea and F. oxysporum, the method and associated vectors are likely to be applicable to manipulating genes in a broad spectrum of fungi, thus potentially serving as an efficient, universal functional genomic tool for harnessing the growing body of fungal genome sequence data to study fungal biology.

  6. Novel method to load multiple genes onto a mammalian artificial chromosome.

    Directory of Open Access Journals (Sweden)

    Anna Tóth

    Full Text Available Mammalian artificial chromosomes are natural chromosome-based vectors that may carry a vast amount of genetic material in terms of both size and number. They are reasonably stable and segregate well in both mitosis and meiosis. A platform artificial chromosome expression system (ACEs was earlier described with multiple loading sites for a modified lambda-integrase enzyme. It has been shown that this ACEs is suitable for high-level industrial protein production and the treatment of a mouse model for a devastating human disorder, Krabbe's disease. ACEs-treated mutant mice carrying a therapeutic gene lived more than four times longer than untreated counterparts. This novel gene therapy method is called combined mammalian artificial chromosome-stem cell therapy. At present, this method suffers from the limitation that a new selection marker gene should be present for each therapeutic gene loaded onto the ACEs. Complex diseases require the cooperative action of several genes for treatment, but only a limited number of selection marker genes are available and there is also a risk of serious side-effects caused by the unwanted expression of these marker genes in mammalian cells, organs and organisms. We describe here a novel method to load multiple genes onto the ACEs by using only two selectable marker genes. These markers may be removed from the ACEs before therapeutic application. This novel technology could revolutionize gene therapeutic applications targeting the treatment of complex disorders and cancers. It could also speed up cell therapy by allowing researchers to engineer a chromosome with a predetermined set of genetic factors to differentiate adult stem cells, embryonic stem cells and induced pluripotent stem (iPS cells into cell types of therapeutic value. It is also a suitable tool for the investigation of complex biochemical pathways in basic science by producing an ACEs with several genes from a signal transduction pathway of interest.

  7. Novel method to load multiple genes onto a mammalian artificial chromosome.

    Science.gov (United States)

    Tóth, Anna; Fodor, Katalin; Praznovszky, Tünde; Tubak, Vilmos; Udvardy, Andor; Hadlaczky, Gyula; Katona, Robert L

    2014-01-01

    Mammalian artificial chromosomes are natural chromosome-based vectors that may carry a vast amount of genetic material in terms of both size and number. They are reasonably stable and segregate well in both mitosis and meiosis. A platform artificial chromosome expression system (ACEs) was earlier described with multiple loading sites for a modified lambda-integrase enzyme. It has been shown that this ACEs is suitable for high-level industrial protein production and the treatment of a mouse model for a devastating human disorder, Krabbe's disease. ACEs-treated mutant mice carrying a therapeutic gene lived more than four times longer than untreated counterparts. This novel gene therapy method is called combined mammalian artificial chromosome-stem cell therapy. At present, this method suffers from the limitation that a new selection marker gene should be present for each therapeutic gene loaded onto the ACEs. Complex diseases require the cooperative action of several genes for treatment, but only a limited number of selection marker genes are available and there is also a risk of serious side-effects caused by the unwanted expression of these marker genes in mammalian cells, organs and organisms. We describe here a novel method to load multiple genes onto the ACEs by using only two selectable marker genes. These markers may be removed from the ACEs before therapeutic application. This novel technology could revolutionize gene therapeutic applications targeting the treatment of complex disorders and cancers. It could also speed up cell therapy by allowing researchers to engineer a chromosome with a predetermined set of genetic factors to differentiate adult stem cells, embryonic stem cells and induced pluripotent stem (iPS) cells into cell types of therapeutic value. It is also a suitable tool for the investigation of complex biochemical pathways in basic science by producing an ACEs with several genes from a signal transduction pathway of interest.

  8. Minimal gene selection for classification and diagnosis prediction based on gene expression profile

    Directory of Open Access Journals (Sweden)

    Alireza Mehridehnavi

    2013-01-01

    Conclusion: We have shown that the use of two most significant genes based on their S/N ratios and selection of suitable training samples can lead to classify DLBCL patients with a rather good result. Actually with the aid of mentioned methods we could compensate lack of enough number of patients, improve accuracy of classifying and reduce complication of computations and so running time.

  9. Genes under positive selection in Escherichia coli

    DEFF Research Database (Denmark)

    Petersen, Lise; Bollback, Jonathan P; Dimmic, Matt

    2007-01-01

    We used a comparative genomics approach to identify genes that are under positive selection in six strains of Escherichia coli and Shigella flexneri, including five strains that are human pathogens. We find that positive selection targets a wide range of different functions in the E. coli genome......, including cell surface proteins such as beta barrel porins, presumably because of the involvement of these genes in evolutionary arms races with other bacteria, phages, and/or the host immune system. Structural mapping of positively selected sites on trans-membrane beta barrel porins reveals...... that the residues under positive selection occur almost exclusively in the extracellular region of the proteins that are enriched with sites known to be targets of phages, colicins, or the host immune system. More surprisingly, we also find a number of other categories of genes that show very strong evidence...

  10. Translational selection in human: More pronounced in housekeeping genes

    KAUST Repository

    Ma, Lina

    2014-07-10

    Background: Translational selection is a ubiquitous and significant mechanism to regulate protein expression in prokaryotes and unicellular eukaryotes. Recent evidence has shown that translational selection is weakly operative in highly expressed genes in human and other vertebrates. However, it remains unclear whether translational selection acts differentially on human genes depending on their expression patterns.Results: Here we report that human housekeeping (HK) genes that are strictly defined as genes that are expressed ubiquitously and consistently in most or all tissues, are under stronger translational selection.Conclusions: These observations clearly show that translational selection is also closely associated with expression pattern. Our results suggest that human HK genes are more efficiently and/or accurately translated into proteins, which will inevitably open up a new understanding of HK genes and the regulation of gene expression.Reviewers: This article was reviewed by Yuan Yuan, Baylor College of Medicine; Han Liang, University of Texas MD Anderson Cancer Center (nominated by Dr Laura Landweber) Eugene Koonin, NCBI, NLM, NIH, United States of America Sandor Pongor, International Centre for Genetic Engineering and biotechnology (ICGEB), Italy. © 2014 Ma et al.; licensee BioMed Central Ltd.

  11. Gene expression network reconstruction by convex feature selection when incorporating genetic perturbations.

    Directory of Open Access Journals (Sweden)

    Benjamin A Logsdon

    Full Text Available Cellular gene expression measurements contain regulatory information that can be used to discover novel network relationships. Here, we present a new algorithm for network reconstruction powered by the adaptive lasso, a theoretically and empirically well-behaved method for selecting the regulatory features of a network. Any algorithms designed for network discovery that make use of directed probabilistic graphs require perturbations, produced by either experiments or naturally occurring genetic variation, to successfully infer unique regulatory relationships from gene expression data. Our approach makes use of appropriately selected cis-expression Quantitative Trait Loci (cis-eQTL, which provide a sufficient set of independent perturbations for maximum network resolution. We compare the performance of our network reconstruction algorithm to four other approaches: the PC-algorithm, QTLnet, the QDG algorithm, and the NEO algorithm, all of which have been used to reconstruct directed networks among phenotypes leveraging QTL. We show that the adaptive lasso can outperform these algorithms for networks of ten genes and ten cis-eQTL, and is competitive with the QDG algorithm for networks with thirty genes and thirty cis-eQTL, with rich topologies and hundreds of samples. Using this novel approach, we identify unique sets of directed relationships in Saccharomyces cerevisiae when analyzing genome-wide gene expression data for an intercross between a wild strain and a lab strain. We recover novel putative network relationships between a tyrosine biosynthesis gene (TYR1, and genes involved in endocytosis (RCY1, the spindle checkpoint (BUB2, sulfonate catabolism (JLP1, and cell-cell communication (PRM7. Our algorithm provides a synthesis of feature selection methods and graphical model theory that has the potential to reveal new directed regulatory relationships from the analysis of population level genetic and gene expression data.

  12. Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

    Science.gov (United States)

    Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

    2016-01-15

    As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor difference for stability rankings among the three methods. The use of different single reference gene might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest that the necessity of selection and test of reference genes for different tissues under varying environmental stresses. The results obtained here are expected to reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Construction of human antibody gene libraries and selection of antibodies by phage display.

    Science.gov (United States)

    Frenzel, André; Kügler, Jonas; Wilke, Sonja; Schirrmann, Thomas; Hust, Michael

    2014-01-01

    Antibody phage display is the most commonly used in vitro selection technology and has yielded thousands of useful antibodies for research, diagnostics, and therapy.The prerequisite for successful generation and development of human recombinant antibodies using phage display is the construction of a high-quality antibody gene library. Here, we describe the methods for the construction of human immune and naive scFv gene libraries.The success also depends on the panning strategy for the selection of binders from these libraries. In this article, we describe a panning strategy that is high-throughput compatible and allows parallel selection in microtiter plates.

  14. Gene-Based Multiclass Cancer Diagnosis with Class-Selective Rejections

    Science.gov (United States)

    Jrad, Nisrine; Grall-Maës, Edith; Beauseroy, Pierre

    2009-01-01

    Supervised learning of microarray data is receiving much attention in recent years. Multiclass cancer diagnosis, based on selected gene profiles, are used as adjunct of clinical diagnosis. However, supervised diagnosis may hinder patient care, add expense or confound a result. To avoid this misleading, a multiclass cancer diagnosis with class-selective rejection is proposed. It rejects some patients from one, some, or all classes in order to ensure a higher reliability while reducing time and expense costs. Moreover, this classifier takes into account asymmetric penalties dependant on each class and on each wrong or partially correct decision. It is based on ν-1-SVM coupled with its regularization path and minimizes a general loss function defined in the class-selective rejection scheme. The state of art multiclass algorithms can be considered as a particular case of the proposed algorithm where the number of decisions is given by the classes and the loss function is defined by the Bayesian risk. Two experiments are carried out in the Bayesian and the class selective rejection frameworks. Five genes selected datasets are used to assess the performance of the proposed method. Results are discussed and accuracies are compared with those computed by the Naive Bayes, Nearest Neighbor, Linear Perceptron, Multilayer Perceptron, and Support Vector Machines classifiers. PMID:19584932

  15. Positive selection on gene expression in the human brain

    DEFF Research Database (Denmark)

    Khaitovich, Philipp; Tang, Kun; Franz, Henriette

    2006-01-01

    Recent work has shown that the expression levels of genes transcribed in the brains of humans and chimpanzees have changed less than those of genes transcribed in other tissues [1] . However, when gene expression changes are mapped onto the evolutionary lineage in which they occurred, the brain...... shows more changes than other tissues in the human lineage compared to the chimpanzee lineage [1] , [2] and [3] . There are two possible explanations for this: either positive selection drove more gene expression changes to fixation in the human brain than in the chimpanzee brain, or genes expressed...... in the brain experienced less purifying selection in humans than in chimpanzees, i.e. gene expression in the human brain is functionally less constrained. The first scenario would be supported if genes that changed their expression in the brain in the human lineage showed more selective sweeps than other genes...

  16. A GMM-IG framework for selecting genes as expression panel biomarkers.

    Science.gov (United States)

    Wang, Mingyi; Chen, Jake Y

    2010-01-01

    The limitation of small sample size of functional genomics experiments has made it necessary to integrate DNA microarray experimental data from different sources. However, experimentation noises and biases of different microarray platforms have made integrated data analysis challenging. In this work, we propose an integrative computational framework to identify candidate biomarker genes from publicly available functional genomics studies. We developed a new framework, Gaussian Mixture Modeling-Coupled Information Gain (GMM-IG). In this framework, we first apply a two-component Gaussian mixture model (GMM) to estimate the conditional probability distributions of gene expression data between two different types of samples, for example, normal versus cancer. An expectation-maximization algorithm is then used to estimate the maximum likelihood parameters of a mixture of two Gaussian models in the feature space and determine the underlying expression levels of genes. Gene expression results from different studies are discretized, based on GMM estimations and then unified. Significantly differentially-expressed genes are filtered and assessed with information gain (IG) measures. DNA microarray experimental data for lung cancers from three different prior studies was processed using the new GMM-IG method. Target gene markers from a gene expression panel were selected and compared with several conventional computational biomarker data analysis methods. GMM-IG showed consistently high accuracy for several classification assessments. A high reproducibility of gene selection results was also determined from statistical validations. Our study shows that the GMM-IG framework can overcome poor reliability issues from single-study DNA microarray experiment while maintaining high accuracies by combining true signals from multiple studies. We present a conceptually simple framework that enables reliable integration of true differential gene expression signals from multiple

  17. Gene-Based Multiclass Cancer Diagnosis with Class-Selective Rejections

    Directory of Open Access Journals (Sweden)

    Nisrine Jrad

    2009-01-01

    rejection scheme. The state of art multiclass algorithms can be considered as a particular case of the proposed algorithm where the number of decisions is given by the classes and the loss function is defined by the Bayesian risk. Two experiments are carried out in the Bayesian and the class selective rejection frameworks. Five genes selected datasets are used to assess the performance of the proposed method. Results are discussed and accuracies are compared with those computed by the Naive Bayes, Nearest Neighbor, Linear Perceptron, Multilayer Perceptron, and Support Vector Machines classifiers.

  18. When natural selection gives gene function the cold shoulder.

    Science.gov (United States)

    Cutter, Asher D; Jovelin, Richard

    2015-11-01

    It is tempting to invoke organismal selection as perpetually optimizing the function of any given gene. However, natural selection can drive genic functional change without improvement of biochemical activity, even to the extinction of gene activity. Detrimental mutations can creep in owing to linkage with other selectively favored loci. Selection can promote functional degradation, irrespective of genetic drift, when adaptation occurs by loss of gene function. Even stabilizing selection on a trait can lead to divergence of the underlying molecular constituents. Selfish genetic elements can also proliferate independent of any functional benefits to the host genome. Here we review the logic and evidence for these diverse processes acting in genome evolution. This collection of distinct evolutionary phenomena - while operating through easily understandable mechanisms - all contribute to the seemingly counterintuitive notion that maintenance or improvement of a gene's biochemical function sometimes do not determine its evolutionary fate. © 2015 WILEY Periodicals, Inc.

  19. A general method for identifying major hybrid male sterility genes in Drosophila.

    Science.gov (United States)

    Zeng, L W; Singh, R S

    1995-10-01

    The genes responsible for hybrid male sterility in species crosses are usually identified by introgressing chromosome segments, monitored by visible markers, between closely related species by continuous backcrosses. This commonly used method, however, suffers from two problems. First, it relies on the availability of markers to monitor the introgressed regions and so the portion of the genome examined is limited to the marked regions. Secondly, the introgressed regions are usually large and it is impossible to tell if the effects of the introgressed regions are the result of single (or few) major genes or many minor genes (polygenes). Here we introduce a simple and general method for identifying putative major hybrid male sterility genes which is free of these problems. In this method, the actual hybrid male sterility genes (rather than markers), or tightly linked gene complexes with large effects, are selectively introgressed from one species into the background of another species by repeated backcrosses. This is performed by selectively backcrossing heterozygous (for hybrid male sterility gene or genes) females producing fertile and sterile sons in roughly equal proportions to males of either parental species. As no marker gene is required for this procedure, this method can be used with any species pairs that produce unisexual sterility. With the application of this method, a small X chromosome region of Drosophila mauritiana which produces complete hybrid male sterility (aspermic testes) in the background of D. simulans was identified. Recombination analysis reveals that this region contains a second major hybrid male sterility gene linked to the forked locus located at either 62.7 +/- 0.66 map units or at the centromere region of the X chromosome of D. mauritiana.

  20. Ageing Drosophila selected for longevity retain a young gene expression profile

    DEFF Research Database (Denmark)

    Sarup, Pernille Merete

    and longevity selected lines. Among the latter genes we found a clear overrepresentation of genes involved in immune functions supporting the hypothesis of the life shortening effect of an overactive immune system (inflammaging). Eighty-four genes were differentially expressed at the same physiological age...... between control and longevity selected lines, and the overlap between the same chronological and physiological age gene lists counted 40 candidate genes for increased longevity. Among these were genes with functions in starvation resistance, a regulator of immune responses and several genes which have......  We have investigated how the gene-expression profile of longevity selected lines of Drosophila melanogaster differed from control lines in young, middle-aged and old male flies. 530 genes were differentially expressed between selected and control flies at the same chronological age. We used...

  1. Cytogenetic analysis and mapping of leaf rust resistance in Aegilops speltoides Tausch derived bread wheat line Selection2427 carrying putative gametocidal gene(s).

    Science.gov (United States)

    Niranjana, M; Vinod; Sharma, J B; Mallick, Niharika; Tomar, S M S; Jha, S K

    2017-12-01

    Leaf rust (Puccinia triticina) is a major biotic stress affecting wheat yields worldwide. Host-plant resistance is the best method for controlling leaf rust. Aegilops speltoides is a good source of resistance against wheat rusts. To date, five Lr genes, Lr28, Lr35, Lr36, Lr47, and Lr51, have been transferred from Ae. speltoides to bread wheat. In Selection2427, a bread wheat introgresed line with Ae. speltoides as the donor parent, a dominant gene for leaf rust resistance was mapped to the long arm of chromosome 3B (LrS2427). None of the Lr genes introgressed from Ae. speltoides have been mapped to chromosome 3B. Since none of the designated seedling leaf rust resistance genes have been located on chromosome 3B, LrS2427 seems to be a novel gene. Selection2427 showed a unique property typical of gametocidal genes, that when crossed to other bread wheat cultivars, the F 1 showed partial pollen sterility and poor seed setting, whilst Selection2427 showed reasonable male and female fertility. Accidental co-transfer of gametocidal genes with LrS2427 may have occurred in Selection2427. Though LrS2427 did not show any segregation distortion and assorted independently of putative gametocidal gene(s), its utilization will be difficult due to the selfish behavior of gametocidal genes.

  2. GAP1, a novel selection and counter-selection marker for multiple gene disruptions in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Regenberg, Birgitte; Hansen, J.

    2000-01-01

    the GAP1 gene. This is caused by recombination between two Salmonella typuimurium hisG direct repeats embracing GAP1, and will result in a sub-population of gap1 cells. Such cells are selected on a medium containing D-histidine, and may subsequently be used for a second gene disruption. Hence, multiple...... flanked by short (60 bp) stretches of the gene in question. Through homologous recombination, the cassette will integrate into the target gene, which is thus replaced by GAP1, and mutants are selected for on minimal L-citrulline medium. When propagated under non-selective conditions, some cells will lose...... gene disruptions can be made fast, cheaply and easily in a gap1 strain, with two positive selection steps for each disruption. Copyright (C) 2000 John Wiley & Sons, Ltd....

  3. Empirical Bayes ranking and selection methods via semiparametric hierarchical mixture models in microarray studies.

    Science.gov (United States)

    Noma, Hisashi; Matsui, Shigeyuki

    2013-05-20

    The main purpose of microarray studies is screening of differentially expressed genes as candidates for further investigation. Because of limited resources in this stage, prioritizing genes are relevant statistical tasks in microarray studies. For effective gene selections, parametric empirical Bayes methods for ranking and selection of genes with largest effect sizes have been proposed (Noma et al., 2010; Biostatistics 11: 281-289). The hierarchical mixture model incorporates the differential and non-differential components and allows information borrowing across differential genes with separation from nuisance, non-differential genes. In this article, we develop empirical Bayes ranking methods via a semiparametric hierarchical mixture model. A nonparametric prior distribution, rather than parametric prior distributions, for effect sizes is specified and estimated using the "smoothing by roughening" approach of Laird and Louis (1991; Computational statistics and data analysis 12: 27-37). We present applications to childhood and infant leukemia clinical studies with microarrays for exploring genes related to prognosis or disease progression. Copyright © 2012 John Wiley & Sons, Ltd.

  4. A comparative study of three different gene expression analysis methods.

    Science.gov (United States)

    Choe, Jae Young; Han, Hyung Soo; Lee, Seon Duk; Lee, Hanna; Lee, Dong Eun; Ahn, Jae Yun; Ryoo, Hyun Wook; Seo, Kang Suk; Kim, Jong Kun

    2017-12-04

    TNF-α regulates immune cells and acts as an endogenous pyrogen. Reverse transcription polymerase chain reaction (RT-PCR) is one of the most commonly used methods for gene expression analysis. Among the alternatives to PCR, loop-mediated isothermal amplification (LAMP) shows good potential in terms of specificity and sensitivity. However, few studies have compared RT-PCR and LAMP for human gene expression analysis. Therefore, in the present study, we compared one-step RT-PCR, two-step RT-LAMP and one-step RT-LAMP for human gene expression analysis. We compared three gene expression analysis methods using the human TNF-α gene as a biomarker from peripheral blood cells. Total RNA from the three selected febrile patients were subjected to the three different methods of gene expression analysis. In the comparison of three gene expression analysis methods, the detection limit of both one-step RT-PCR and one-step RT-LAMP were the same, while that of two-step RT-LAMP was inferior. One-step RT-LAMP takes less time, and the experimental result is easy to determine. One-step RT-LAMP is a potentially useful and complementary tool that is fast and reasonably sensitive. In addition, one-step RT-LAMP could be useful in environments lacking specialized equipment or expertise.

  5. Positive-negative-selection-mediated gene targeting in rice

    Directory of Open Access Journals (Sweden)

    Zenpei eShimatani

    2015-01-01

    Full Text Available Gene targeting (GT refers to the designed modification of genomic sequence(s through homologous recombination (HR. GT is a powerful tool both for the study of gene function and for molecular breeding. However, in transformation of higher plants, non-homologous end joining (NHEJ occurs overwhelmingly in somatic cells, masking HR-mediated GT. Positive-negative selection (PNS is an approach for finding HR-mediated GT events because it can eliminate NHEJ effectively by expression of a negative-selection marker gene. In rice—a major crop worldwide—reproducible PNS-mediated GT of endogenous genes has now been successfully achieved. The procedure is based on strong PNS using diphtheria toxin A-fragment as a negative marker, and has succeeded in the directed modification of several endogenous rice genes in various ways. In addition to gene knock-outs and knock-ins, a nucleotide substitution in a target gene was also achieved recently. This review presents a summary of the development of the rice PNS system, highlighting its advantages. Different types of gene modification and gene editing aimed at developing new plant breeding technology (NPBT based on PNS are discussed.

  6. Altering the selection capabilities of common cloning vectors via restriction enzyme mediated gene disruption

    Science.gov (United States)

    2013-01-01

    Background The cloning of gene sequences forms the basis for many molecular biological studies. One important step in the cloning process is the isolation of bacterial transformants carrying vector DNA. This involves a vector-encoded selectable marker gene, which in most cases, confers resistance to an antibiotic. However, there are a number of circumstances in which a different selectable marker is required or may be preferable. Such situations can include restrictions to host strain choice, two phase cloning experiments and mutagenesis experiments, issues that result in additional unnecessary cloning steps, in which the DNA needs to be subcloned into a vector with a suitable selectable marker. Results We have used restriction enzyme mediated gene disruption to modify the selectable marker gene of a given vector by cloning a different selectable marker gene into the original marker present in that vector. Cloning a new selectable marker into a pre-existing marker was found to change the selection phenotype conferred by that vector, which we were able to demonstrate using multiple commonly used vectors and multiple resistance markers. This methodology was also successfully applied not only to cloning vectors, but also to expression vectors while keeping the expression characteristics of the vector unaltered. Conclusions Changing the selectable marker of a given vector has a number of advantages and applications. This rapid and efficient method could be used for co-expression of recombinant proteins, optimisation of two phase cloning procedures, as well as multiple genetic manipulations within the same host strain without the need to remove a pre-existing selectable marker in a previously genetically modified strain. PMID:23497512

  7. A novel mutual information-based Boolean network inference method from time-series gene expression data.

    Directory of Open Access Journals (Sweden)

    Shohag Barman

    Full Text Available Inferring a gene regulatory network from time-series gene expression data in systems biology is a challenging problem. Many methods have been suggested, most of which have a scalability limitation due to the combinatorial cost of searching a regulatory set of genes. In addition, they have focused on the accurate inference of a network structure only. Therefore, there is a pressing need to develop a network inference method to search regulatory genes efficiently and to predict the network dynamics accurately.In this study, we employed a Boolean network model with a restricted update rule scheme to capture coarse-grained dynamics, and propose a novel mutual information-based Boolean network inference (MIBNI method. Given time-series gene expression data as an input, the method first identifies a set of initial regulatory genes using mutual information-based feature selection, and then improves the dynamics prediction accuracy by iteratively swapping a pair of genes between sets of the selected regulatory genes and the other genes. Through extensive simulations with artificial datasets, MIBNI showed consistently better performance than six well-known existing methods, REVEAL, Best-Fit, RelNet, CST, CLR, and BIBN in terms of both structural and dynamics prediction accuracy. We further tested the proposed method with two real gene expression datasets for an Escherichia coli gene regulatory network and a fission yeast cell cycle network, and also observed better results using MIBNI compared to the six other methods.Taken together, MIBNI is a promising tool for predicting both the structure and the dynamics of a gene regulatory network.

  8. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    Science.gov (United States)

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  9. A hybrid computational method for the discovery of novel reproduction-related genes.

    Science.gov (United States)

    Chen, Lei; Chu, Chen; Kong, Xiangyin; Huang, Guohua; Huang, Tao; Cai, Yu-Dong

    2015-01-01

    Uncovering the molecular mechanisms underlying reproduction is of great importance to infertility treatment and to the generation of healthy offspring. In this study, we discovered novel reproduction-related genes with a hybrid computational method, integrating three different types of method, which offered new clues for further reproduction research. This method was first executed on a weighted graph, constructed based on known protein-protein interactions, to search the shortest paths connecting any two known reproduction-related genes. Genes occurring in these paths were deemed to have a special relationship with reproduction. These newly discovered genes were filtered with a randomization test. Then, the remaining genes were further selected according to their associations with known reproduction-related genes measured by protein-protein interaction score and alignment score obtained by BLAST. The in-depth analysis of the high confidence novel reproduction genes revealed hidden mechanisms of reproduction and provided guidelines for further experimental validations.

  10. "Contrasting patterns of selection at Pinus pinaster Ait. Drought stress candidate genes as revealed by genetic differentiation analyses".

    Science.gov (United States)

    Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H

    2008-02-01

    The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. Outlier patterns are discussed

  11. Two-Way Gene Interaction From Microarray Data Based on Correlation Methods.

    Science.gov (United States)

    Alavi Majd, Hamid; Talebi, Atefeh; Gilany, Kambiz; Khayyer, Nasibeh

    2016-06-01

    Gene networks have generated a massive explosion in the development of high-throughput techniques for monitoring various aspects of gene activity. Networks offer a natural way to model interactions between genes, and extracting gene network information from high-throughput genomic data is an important and difficult task. The purpose of this study is to construct a two-way gene network based on parametric and nonparametric correlation coefficients. The first step in constructing a Gene Co-expression Network is to score all pairs of gene vectors. The second step is to select a score threshold and connect all gene pairs whose scores exceed this value. In the foundation-application study, we constructed two-way gene networks using nonparametric methods, such as Spearman's rank correlation coefficient and Blomqvist's measure, and compared them with Pearson's correlation coefficient. We surveyed six genes of venous thrombosis disease, made a matrix entry representing the score for the corresponding gene pair, and obtained two-way interactions using Pearson's correlation, Spearman's rank correlation, and Blomqvist's coefficient. Finally, these methods were compared with Cytoscape, based on BIND, and Gene Ontology, based on molecular function visual methods; R software version 3.2 and Bioconductor were used to perform these methods. Based on the Pearson and Spearman correlations, the results were the same and were confirmed by Cytoscape and GO visual methods; however, Blomqvist's coefficient was not confirmed by visual methods. Some results of the correlation coefficients are not the same with visualization. The reason may be due to the small number of data.

  12. Gene features selection for three-class disease classification via multiple orthogonal partial least square discriminant analysis and S-plot using microarray data.

    Science.gov (United States)

    Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu

    2013-01-01

    DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.

  13. Genes under positive selection in a model plant pathogenic fungus, Botrytis.

    Science.gov (United States)

    Aguileta, Gabriela; Lengelle, Juliette; Chiapello, Hélène; Giraud, Tatiana; Viaud, Muriel; Fournier, Elisabeth; Rodolphe, François; Marthey, Sylvain; Ducasse, Aurélie; Gendrault, Annie; Poulain, Julie; Wincker, Patrick; Gout, Lilian

    2012-07-01

    The rapid evolution of particular genes is essential for the adaptation of pathogens to new hosts and new environments. Powerful methods have been developed for detecting targets of selection in the genome. Here we used divergence data to compare genes among four closely related fungal pathogens adapted to different hosts to elucidate the functions putatively involved in adaptive processes. For this goal, ESTs were sequenced in the specialist fungal pathogens Botrytis tulipae and Botrytis ficariarum, and compared with genome sequences of Botrytis cinerea and Sclerotinia sclerotiorum, responsible for diseases on over 200 plant species. A maximum likelihood-based analysis of 642 predicted orthologs detected 21 genes showing footprints of positive selection. These results were validated by resequencing nine of these genes in additional Botrytis species, showing they have also been rapidly evolving in other related species. Twenty of the 21 genes had not previously been identified as pathogenicity factors in B. cinerea, but some had functions related to plant-fungus interactions. The putative functions were involved in respiratory and energy metabolism, protein and RNA metabolism, signal transduction or virulence, similarly to what was detected in previous studies using the same approach in other pathogens. Mutants of B. cinerea were generated for four of these genes as a first attempt to elucidate their functions. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. Construction and applications of exon-trapping gene-targeting vectors with a novel strategy for negative selection.

    Science.gov (United States)

    Saito, Shinta; Ura, Kiyoe; Kodama, Miho; Adachi, Noritaka

    2015-06-30

    Targeted gene modification by homologous recombination provides a powerful tool for studying gene function in cells and animals. In higher eukaryotes, non-homologous integration of targeting vectors occurs several orders of magnitude more frequently than does targeted integration, making the gene-targeting technology highly inefficient. For this reason, negative-selection strategies have been employed to reduce the number of drug-resistant clones associated with non-homologous vector integration, particularly when artificial nucleases to introduce a DNA break at the target site are unavailable or undesirable. As such, an exon-trap strategy using a promoterless drug-resistance marker gene provides an effective way to counterselect non-homologous integrants. However, constructing exon-trapping targeting vectors has been a time-consuming and complicated process. By virtue of highly efficient att-mediated recombination, we successfully developed a simple and rapid method to construct plasmid-based vectors that allow for exon-trapping gene targeting. These exon-trap vectors were useful in obtaining correctly targeted clones in mouse embryonic stem cells and human HT1080 cells. Most importantly, with the use of a conditionally cytotoxic gene, we further developed a novel strategy for negative selection, thereby enhancing the efficiency of counterselection for non-homologous integration of exon-trap vectors. Our methods will greatly facilitate exon-trapping gene-targeting technologies in mammalian cells, particularly when combined with the novel negative selection strategy.

  15. Reference gene selection for molecular studies of dormancy in wild oat (Avena fatua L. caryopses by RT-qPCR method.

    Directory of Open Access Journals (Sweden)

    Izabela Ruduś

    Full Text Available Molecular studies of primary and secondary dormancy in Avena fatua L., a serious weed of cereal and other crops, are intended to reveal the species-specific details of underlying molecular mechanisms which in turn may be useable in weed management. Among others, quantitative real-time PCR (RT-qPCR data of comparative gene expression analysis may give some insight into the involvement of particular wild oat genes in dormancy release, maintenance or induction by unfavorable conditions. To assure obtaining biologically significant results using this method, the expression stability of selected candidate reference genes in different data subsets was evaluated using four statistical algorithms i.e. geNorm, NormFinder, Best Keeper and ΔCt method. Although some discrepancies in their ranking outputs were noticed, evidently two ubiquitin-conjugating enzyme homologs, AfUBC1 and AfUBC2, as well as one homolog of glyceraldehyde 3-phosphate dehydrogenase AfGAPDH1 and TATA-binding protein AfTBP2 appeared as more stably expressed than AfEF1a (translation elongation factor 1α, AfGAPDH2 or the least stable α-tubulin homolog AfTUA1 in caryopses and seedlings of A. fatua. Gene expression analysis of a dormancy-related wild oat transcription factor VIVIPAROUS1 (AfVP1 allowed for a validation of candidate reference genes performance. Based on the obtained results it can be recommended that the normalization factor calculated as a geometric mean of Cq values of AfUBC1, AfUBC2 and AfGAPDH1 would be optimal for RT-qPCR results normalization in the experiments comprising A. fatua caryopses of different dormancy status.

  16. Reference gene selection for molecular studies of dormancy in wild oat (Avena fatua L.) caryopses by RT-qPCR method.

    Science.gov (United States)

    Ruduś, Izabela; Kępczyński, Jan

    2018-01-01

    Molecular studies of primary and secondary dormancy in Avena fatua L., a serious weed of cereal and other crops, are intended to reveal the species-specific details of underlying molecular mechanisms which in turn may be useable in weed management. Among others, quantitative real-time PCR (RT-qPCR) data of comparative gene expression analysis may give some insight into the involvement of particular wild oat genes in dormancy release, maintenance or induction by unfavorable conditions. To assure obtaining biologically significant results using this method, the expression stability of selected candidate reference genes in different data subsets was evaluated using four statistical algorithms i.e. geNorm, NormFinder, Best Keeper and ΔCt method. Although some discrepancies in their ranking outputs were noticed, evidently two ubiquitin-conjugating enzyme homologs, AfUBC1 and AfUBC2, as well as one homolog of glyceraldehyde 3-phosphate dehydrogenase AfGAPDH1 and TATA-binding protein AfTBP2 appeared as more stably expressed than AfEF1a (translation elongation factor 1α), AfGAPDH2 or the least stable α-tubulin homolog AfTUA1 in caryopses and seedlings of A. fatua. Gene expression analysis of a dormancy-related wild oat transcription factor VIVIPAROUS1 (AfVP1) allowed for a validation of candidate reference genes performance. Based on the obtained results it can be recommended that the normalization factor calculated as a geometric mean of Cq values of AfUBC1, AfUBC2 and AfGAPDH1 would be optimal for RT-qPCR results normalization in the experiments comprising A. fatua caryopses of different dormancy status.

  17. A Cancer Gene Selection Algorithm Based on the K-S Test and CFS

    Directory of Open Access Journals (Sweden)

    Qiang Su

    2017-01-01

    Full Text Available Background. To address the challenging problem of selecting distinguished genes from cancer gene expression datasets, this paper presents a gene subset selection algorithm based on the Kolmogorov-Smirnov (K-S test and correlation-based feature selection (CFS principles. The algorithm selects distinguished genes first using the K-S test, and then, it uses CFS to select genes from those selected by the K-S test. Results. We adopted support vector machines (SVM as the classification tool and used the criteria of accuracy to evaluate the performance of the classifiers on the selected gene subsets. This approach compared the proposed gene subset selection algorithm with the K-S test, CFS, minimum-redundancy maximum-relevancy (mRMR, and ReliefF algorithms. The average experimental results of the aforementioned gene selection algorithms for 5 gene expression datasets demonstrate that, based on accuracy, the performance of the new K-S and CFS-based algorithm is better than those of the K-S test, CFS, mRMR, and ReliefF algorithms. Conclusions. The experimental results show that the K-S test-CFS gene selection algorithm is a very effective and promising approach compared to the K-S test, CFS, mRMR, and ReliefF algorithms.

  18. A scan for positively selected genes in the genomes of humans and chimpanzees.

    Directory of Open Access Journals (Sweden)

    Rasmus Nielsen

    2005-06-01

    Full Text Available Since the divergence of humans and chimpanzees about 5 million years ago, these species have undergone a remarkable evolution with drastic divergence in anatomy and cognitive abilities. At the molecular level, despite the small overall magnitude of DNA sequence divergence, we might expect such evolutionary changes to leave a noticeable signature throughout the genome. We here compare 13,731 annotated genes from humans to their chimpanzee orthologs to identify genes that show evidence of positive selection. Many of the genes that present a signature of positive selection tend to be involved in sensory perception or immune defenses. However, the group of genes that show the strongest evidence for positive selection also includes a surprising number of genes involved in tumor suppression and apoptosis, and of genes involved in spermatogenesis. We hypothesize that positive selection in some of these genes may be driven by genomic conflict due to apoptosis during spermatogenesis. Genes with maximal expression in the brain show little or no evidence for positive selection, while genes with maximal expression in the testis tend to be enriched with positively selected genes. Genes on the X chromosome also tend to show an elevated tendency for positive selection. We also present polymorphism data from 20 Caucasian Americans and 19 African Americans for the 50 annotated genes showing the strongest evidence for positive selection. The polymorphism analysis further supports the presence of positive selection in these genes by showing an excess of high-frequency derived nonsynonymous mutations.

  19. Use of Maximum Likelihood-Mixed Models to select stable reference genes: a case of heat stress response in sheep

    Directory of Open Access Journals (Sweden)

    Salces Judit

    2011-08-01

    Full Text Available Abstract Background Reference genes with stable expression are required to normalize expression differences of target genes in qPCR experiments. Several procedures and companion software have been proposed to find the most stable genes. Model based procedures are attractive because they provide a solid statistical framework. NormFinder, a widely used software, uses a model based method. The pairwise comparison procedure implemented in GeNorm is a simpler procedure but one of the most extensively used. In the present work a statistical approach based in Maximum Likelihood estimation under mixed models was tested and compared with NormFinder and geNorm softwares. Sixteen candidate genes were tested in whole blood samples from control and heat stressed sheep. Results A model including gene and treatment as fixed effects, sample (animal, gene by treatment, gene by sample and treatment by sample interactions as random effects with heteroskedastic residual variance in gene by treatment levels was selected using goodness of fit and predictive ability criteria among a variety of models. Mean Square Error obtained under the selected model was used as indicator of gene expression stability. Genes top and bottom ranked by the three approaches were similar; however, notable differences for the best pair of genes selected for each method and the remaining genes of the rankings were shown. Differences among the expression values of normalized targets for each statistical approach were also found. Conclusions Optimal statistical properties of Maximum Likelihood estimation joined to mixed model flexibility allow for more accurate estimation of expression stability of genes under many different situations. Accurate selection of reference genes has a direct impact over the normalized expression values of a given target gene. This may be critical when the aim of the study is to compare expression rate differences among samples under different environmental

  20. Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.

    Science.gov (United States)

    Crossa, José; Pérez-Rodríguez, Paulino; Cuevas, Jaime; Montesinos-López, Osval; Jarquín, Diego; de Los Campos, Gustavo; Burgueño, Juan; González-Camacho, Juan M; Pérez-Elizalde, Sergio; Beyene, Yoseph; Dreisigacker, Susanne; Singh, Ravi; Zhang, Xuecai; Gowda, Manje; Roorkiwal, Manish; Rutkoski, Jessica; Varshney, Rajeev K

    2017-11-01

    Genomic selection (GS) facilitates the rapid selection of superior genotypes and accelerates the breeding cycle. In this review, we discuss the history, principles, and basis of GS and genomic-enabled prediction (GP) as well as the genetics and statistical complexities of GP models, including genomic genotype×environment (G×E) interactions. We also examine the accuracy of GP models and methods for two cereal crops and two legume crops based on random cross-validation. GS applied to maize breeding has shown tangible genetic gains. Based on GP results, we speculate how GS in germplasm enhancement (i.e., prebreeding) programs could accelerate the flow of genes from gene bank accessions to elite lines. Recent advances in hyperspectral image technology could be combined with GS and pedigree-assisted breeding. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  2. Gene selection using hybrid binary black hole algorithm and modified binary particle swarm optimization.

    Science.gov (United States)

    Pashaei, Elnaz; Pashaei, Elham; Aydin, Nizamettin

    2018-04-14

    In cancer classification, gene selection is an important data preprocessing technique, but it is a difficult task due to the large search space. Accordingly, the objective of this study is to develop a hybrid meta-heuristic Binary Black Hole Algorithm (BBHA) and Binary Particle Swarm Optimization (BPSO) (4-2) model that emphasizes gene selection. In this model, the BBHA is embedded in the BPSO (4-2) algorithm to make the BPSO (4-2) more effective and to facilitate the exploration and exploitation of the BPSO (4-2) algorithm to further improve the performance. This model has been associated with Random Forest Recursive Feature Elimination (RF-RFE) pre-filtering technique. The classifiers which are evaluated in the proposed framework are Sparse Partial Least Squares Discriminant Analysis (SPLSDA); k-nearest neighbor and Naive Bayes. The performance of the proposed method was evaluated on two benchmark and three clinical microarrays. The experimental results and statistical analysis confirm the better performance of the BPSO (4-2)-BBHA compared with the BBHA, the BPSO (4-2) and several state-of-the-art methods in terms of avoiding local minima, convergence rate, accuracy and number of selected genes. The results also show that the BPSO (4-2)-BBHA model can successfully identify known biologically and statistically significant genes from the clinical datasets. Copyright © 2018 Elsevier Inc. All rights reserved.

  3. Rapid and targeted introgression of genes into popular wheat cultivars using marker-assisted background selection.

    Directory of Open Access Journals (Sweden)

    Harpinder S Randhawa

    Full Text Available A marker-assisted background selection (MABS-based gene introgression approach in wheat (Triticum aestivum L. was optimized, where 97% or more of a recurrent parent genome (RPG can be recovered in just two backcross (BC generations. A four-step MABS method was developed based on 'Plabsim' computer simulations and wheat genome structure information. During empirical optimization of this method, double recombinants around the target gene were selected in a step-wise fashion during the two BC cycles followed by selection for recurrent parent genotype on non-carrier chromosomes. The average spacing between carrier chromosome markers was <4 cM. For non-carrier chromosome markers that flanked each of the 48 wheat gene-rich regions, this distance was approximately 12 cM. Employed to introgress seedling stripe rust (Puccinia striiformis f. sp. tritici resistance gene Yr15 into the spring wheat cultivar 'Zak', marker analysis of 2,187 backcross-derived progeny resulted in the recovery of a BC(2F(2ratio3 plant with 97% of the recurrent parent genome. In contrast, only 82% of the recurrent parent genome was recovered in phenotypically selected BC(4F(7 plants developed without MABS. Field evaluation results from 17 locations indicated that the MABS-derived line was either equal or superior to the recurrent parent for the tested agronomic characteristics. Based on these results, MABS is recommended as a strategy for rapidly introgressing a targeted gene into a wheat genotype in just two backcross generations while recovering 97% or more of the recurrent parent genotype.

  4. A Comparison of Selective Pressures in Plant X-Linked and Autosomal Genes.

    Science.gov (United States)

    Krasovec, Marc; Nevado, Bruno; Filatov, Dmitry A

    2018-05-03

    Selection is expected to work differently in autosomal and X-linked genes because of their ploidy difference and the exposure of recessive X-linked mutations to haploid selection in males. However, it is not clear whether these expectations apply to recently evolved sex chromosomes, where many genes retain functional X- and Y-linked gametologs. We took advantage of the recently evolved sex chromosomes in the plant Silene latifolia and its closely related species to compare the selective pressures between hemizygous and non-hemizygous X-linked genes as well as between X-linked genes and autosomal genes. Our analysis, based on over 1000 genes, demonstrated that, similar to animals, X-linked genes in Silene evolve significantly faster than autosomal genes—the so-called faster-X effect. Contrary to expectations, faster-X divergence was detectable only for non-hemizygous X-linked genes. Our phylogeny-based analyses of selection revealed no evidence for faster adaptation in X-linked genes compared to autosomal genes. On the other hand, partial relaxation of purifying selection was apparent on the X-chromosome compared to the autosomes, consistent with a smaller genetic diversity in S. latifolia X-linked genes (π x = 0.016; π aut = 0.023). Thus, the faster-X divergence in S. latifolia appears to be a consequence of the smaller effective population size rather than of a faster adaptive evolution on the X-chromosome. We argue that this may be a general feature of “young” sex chromosomes, where the majority of X-linked genes are not hemizygous, preventing haploid selection in heterogametic sex.

  5. Evidence for natural selection at the melanocortin-3 receptor gene in European and African populations.

    Science.gov (United States)

    Yoshiuchi, Issei

    2016-08-01

    Obesity is increasing steadily in worldwide prevalence and is known to cause serious health problems in association with type 2 diabetes mellitus (T2DM), including hypertension, stroke, and cardiovascular diseases. According to the thrifty gene hypothesis, the natural selection of obesity-related genes is important during feast and famine because they control body weight and fat levels. Past human adaptations to environmental changes in food supply, lifestyle, and geography may have influenced the selection of genes associated with the metabolism of glucose, lipids, and energy. The melanocortin-3 receptor gene (MC3R) is associated with obesity, with MC3R-deficient mice showing increased fat mass. MC3R variations are also linked with childhood obesity and insulin resistance. Here, we aimed to uncover evidence of selection at MC3R. We performed a three-step method to detect selection at MC3R using HapMap population data. We used Wright's F statistics as a measure of population differentiation, the long-range haplotype test to identify extended haplotypes, and the integrated haplotype score (iHS) to detect selection at MC3R. We observed high population differentiation between European and African populations at two MC3R childhood obesity- and insulin resistance-associated single-nucleotide polymorphisms (rs3746619 and rs3827103) using Wright's F statistics. The iHS revealed evidence of natural selection at MC3R. These findings provide evidence for natural selection at MC3R. Further investigation is warranted into adaptive evolution at T2DM- and obesity-associated genes.

  6. Selection on the Major Color Gene Melanocortin-1-Receptor Shaped the Evolution of the Melanocortin System Genes

    Directory of Open Access Journals (Sweden)

    Linda Dib

    2017-12-01

    Full Text Available Modular genetic systems and networks have complex evolutionary histories shaped by selection acting on single genes as well as on their integrated function within the network. However, uncovering molecular coevolution requires the detection of coevolving sites in sequences. Detailed knowledge of the functions of each gene in the system is also necessary to identify the selective agents driving coevolution. Using recently developed computational tools, we investigated the effect of positive selection on the coevolution of ten major genes in the melanocortin system, responsible for multiple physiological functions and human diseases. Substitutions driven by positive selection at the melanocortin-1-receptor (MC1R induced more coevolutionary changes on the system than positive selection on other genes in the system. Contrarily, selection on the highly pleiotropic POMC gene, which orchestrates the activation of the different melanocortin receptors, had the lowest coevolutionary influence. MC1R and possibly its main function, melanin pigmentation, seems to have influenced the evolution of the melanocortin system more than functions regulated by MC2-5Rs such as energy homeostasis, glucocorticoid-dependent stress and anti-inflammatory responses. Although replication in other regulatory systems is needed, this suggests that single functional aspects of a genetic network or system can be of higher importance than others in shaping coevolution among the genes that integrate it.

  7. When Is Hub Gene Selection Better than Standard Meta-Analysis?

    Science.gov (United States)

    Langfelder, Peter; Mischel, Paul S.; Horvath, Steve

    2013-01-01

    Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to

  8. When is hub gene selection better than standard meta-analysis?

    Directory of Open Access Journals (Sweden)

    Peter Langfelder

    Full Text Available Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data. Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA in three comprehensive and unbiased empirical studies: (1 Finding genes predictive of lung cancer survival, (2 finding methylation markers related to age, and (3 finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1. However, standard meta-analysis methods perform as good as (if not better than a consensus network approach in terms of validation success (criterion 2. The article also reports a comparison of meta-analysis techniques

  9. When is hub gene selection better than standard meta-analysis?

    Science.gov (United States)

    Langfelder, Peter; Mischel, Paul S; Horvath, Steve

    2013-01-01

    Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to

  10. Positive selection on MHC class II DRB and DQB genes in the bank vole (Myodes glareolus).

    Science.gov (United States)

    Scherman, Kristin; Råberg, Lars; Westerdahl, Helena

    2014-05-01

    The major histocompatibility complex (MHC) class IIB genes show considerable sequence similarity between loci. The MHC class II DQB and DRB genes are known to exhibit a high level of polymorphism, most likely maintained by parasite-mediated selection. Studies of the MHC in wild rodents have focused on DRB, whilst DQB has been given much less attention. Here, we characterised DQB genes in Swedish bank voles Myodes glareolus, using full-length transcripts. We then designed primers that specifically amplify exon 2 from DRB (202 bp) and DQB (205 bp) and investigated molecular signatures of natural selection on DRB and DQB alleles. The presence of two separate gene clusters was confirmed using BLASTN and phylogenetic analysis, where our seven transcripts clustered according to either DQB or DRB homologues. These gene clusters were again confirmed on exon 2 data from 454-amplicon sequencing. Our DRB primers amplify a similar number of alleles per individual as previously published DRB primers, though our reads are longer. Traditional d N/d S analyses of DRB sequences in the bank vole have not found a conclusive signal of positive selection. Using a more advanced substitution model (the Kumar method) we found positive selection in the peptide binding region (PBR) of both DRB and DQB genes. Maximum likelihood models of codon substitutions detected positively selected sites located in the PBR of both DQB and DRB. Interestingly, these analyses detected at least twice as many positively selected sites in DQB than DRB, suggesting that DQB has been under stronger positive selection than DRB over evolutionary time.

  11. A comparative study of covariance selection models for the inference of gene regulatory networks.

    Science.gov (United States)

    Stifanelli, Patrizia F; Creanza, Teresa M; Anglani, Roberto; Liuzzi, Vania C; Mukherjee, Sayan; Schena, Francesco P; Ancona, Nicola

    2013-10-01

    The inference, or 'reverse-engineering', of gene regulatory networks from expression data and the description of the complex dependency structures among genes are open issues in modern molecular biology. In this paper we compared three regularized methods of covariance selection for the inference of gene regulatory networks, developed to circumvent the problems raising when the number of observations n is smaller than the number of genes p. The examined approaches provided three alternative estimates of the inverse covariance matrix: (a) the 'PINV' method is based on the Moore-Penrose pseudoinverse, (b) the 'RCM' method performs correlation between regression residuals and (c) 'ℓ(2C)' method maximizes a properly regularized log-likelihood function. Our extensive simulation studies showed that ℓ(2C) outperformed the other two methods having the most predictive partial correlation estimates and the highest values of sensitivity to infer conditional dependencies between genes even when a few number of observations was available. The application of this method for inferring gene networks of the isoprenoid biosynthesis pathways in Arabidopsis thaliana allowed to enlighten a negative partial correlation coefficient between the two hubs in the two isoprenoid pathways and, more importantly, provided an evidence of cross-talk between genes in the plastidial and the cytosolic pathways. When applied to gene expression data relative to a signature of HRAS oncogene in human cell cultures, the method revealed 9 genes (p-value<0.0005) directly interacting with HRAS, sharing the same Ras-responsive binding site for the transcription factor RREB1. This result suggests that the transcriptional activation of these genes is mediated by a common transcription factor downstream of Ras signaling. Software implementing the methods in the form of Matlab scripts are available at: http://users.ba.cnr.it/issia/iesina18/CovSelModelsCodes.zip. Copyright © 2013 The Authors. Published by

  12. A large-scale benchmark of gene prioritization methods.

    Science.gov (United States)

    Guala, Dimitri; Sonnhammer, Erik L L

    2017-04-21

    In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.

  13. Whole-gene positive selection, elevated synonymous substitution rates, duplication, and indel evolution of the chloroplast clpP1 gene.

    Directory of Open Access Journals (Sweden)

    Per Erixon

    Full Text Available BACKGROUND: Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. METHODOLOGY/PRINCIPLE FINDINGS: We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family. Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. CONCLUSIONS/SIGNIFICANCE: We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the

  14. Selection on meiosis genes in diploid and tetraploid Arabidopsis arenosa.

    Science.gov (United States)

    Wright, Kevin M; Arnold, Brian; Xue, Katherine; Šurinová, Maria; O'Connell, Jeremy; Bomblies, Kirsten

    2015-04-01

    Meiotic chromosome segregation is critical for fertility across eukaryotes, and core meiotic processes are well conserved even between kingdoms. Nevertheless, recent work in animals has shown that at least some meiosis genes are highly diverse or strongly differentiated among populations. What drives this remains largely unknown. We previously showed that autotetraploid Arabidopsis arenosa evolved stable meiosis, likely through reduced crossover rates, and that associated with this there is strong evidence for selection in a subset of meiosis genes known to affect axis formation, synapsis, and crossover frequency. Here, we use genome-wide data to study the molecular evolution of 70 meiosis genes in a much wider sample of A. arenosa. We sample the polyploid lineage, a diploid lineage from the Carpathian Mountains, and a more distantly related diploid lineage from the adjacent, but biogeographically distinct Pannonian Basin. We find that not only did selection act on meiosis genes in the polyploid lineage but also independently on a smaller subset of meiosis genes in Pannonian diploids. Functionally related genes are targeted by selection in these distinct contexts, and in two cases, independent sweeps occurred in the same loci. The tetraploid lineage has sustained selection on more genes, has more amino acid changes in each, and these more often affect conserved or potentially functional sites. We hypothesize that Pannonian diploid and tetraploid A. arenosa experienced selection on structural proteins that mediate sister chromatid cohesion, the formation of meiotic chromosome axes, and synapsis, likely for different underlying reasons. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. A Novel ‘Gene Insertion/Marker Out’ (GIMO) Method for Transgene Expression and Gene Complementation in Rodent Malaria Parasites

    Science.gov (United States)

    Sajid, Mohammed; Chevalley-Maurel, Séverine; Ramesar, Jai; Klop, Onny; Franke-Fayard, Blandine M. D.; Janse, Chris J.; Khan, Shahid M.

    2011-01-01

    Research on the biology of malaria parasites has greatly benefited from the application of reverse genetic technologies, in particular through the analysis of gene deletion mutants and studies on transgenic parasites that express heterologous or mutated proteins. However, transfection in Plasmodium is limited by the paucity of drug-selectable markers that hampers subsequent genetic modification of the same mutant. We report the development of a novel ‘gene insertion/marker out’ (GIMO) method for two rodent malaria parasites, which uses negative selection to rapidly generate transgenic mutants ready for subsequent modifications. We have created reference mother lines for both P. berghei ANKA and P. yoelii 17XNL that serve as recipient parasites for GIMO-transfection. Compared to existing protocols GIMO-transfection greatly simplifies and speeds up the generation of mutants expressing heterologous proteins, free of drug-resistance genes, and requires far fewer laboratory animals. In addition we demonstrate that GIMO-transfection is also a simple and fast method for genetic complementation of mutants with a gene deletion or mutation. The implementation of GIMO-transfection procedures should greatly enhance Plasmodium reverse-genetic research. PMID:22216235

  16. Sexual selection, genetic conflict, selfish genes, and the atypical patterns of gene expression in spermatogenic cells.

    Science.gov (United States)

    Kleene, Kenneth C

    2005-01-01

    This review proposes that the peculiar patterns of gene expression in spermatogenic cells are the consequence of powerful evolutionary forces known as sexual selection. Sexual selection is generally characterized by intense competition of males for females, an enormous variety of the strategies to maximize male reproductive success, exaggerated male traits at all levels of biological organization, co-evolution of sexual traits in males and females, and conflict between the sexual advantage of the male trait and the reproductive fitness of females and the individual fitness of both sexes. In addition, spermatogenesis is afflicted by selfish genes that promote their transmission to progeny while causing deleterious effects. Sexual selection, selfish genes, and genetic conflict provide compelling explanations for many atypical features of gene expression in spermatogenic cells including the gross overexpression of certain mRNAs, transcripts encoding truncated proteins that cannot carry out basic functions of the proteins encoded by the same genes in somatic cells, the large number of gene families containing paralogous genes encoding spermatogenic cell-specific isoforms, the large number of testis-cancer-associated genes that are expressed only in spermatogenic cells and malignant cells, and the overbearing role of Sertoli cells in regulating the number and quality of spermatozoa.

  17. Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don.

    Science.gov (United States)

    Xiao, Zheng; Sun, Xiaobo; Liu, Xiaoqing; Li, Chang; He, Lisi; Chen, Shangping; Su, Jiale

    2016-01-01

    The quantitative real-time polymerase chain reaction (qRT-PCR) approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder, and BestKeeper. The results showed that EF1- α (elongation factor 1-alpha), 18S (18s ribosomal RNA), and RPL3 (ribosomal protein L3) were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin ( TUB ) was the least stable. ACT5 (actin), RPL3 , 18S , and EF1- α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle . Furthermore, the expression profiles of RmPSY (phytoene synthase) and RmPDS (phytoene dehydrogenase) were assessed using EF1- α, 18S , ACT5 , RPL3 , and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle .

  18. Selection of Reliable Reference Genes for Gene Expression Studies on Rhododendron molle G. Don

    Directory of Open Access Journals (Sweden)

    Zheng Xiao

    2016-10-01

    Full Text Available The quantitative real-time polymerase chain reaction (qRT-PCR approach has become a widely used method to analyze expression patterns of target genes. The selection of an optimal reference gene is a prerequisite for the accurate normalization of gene expression in qRT-PCR. The present study constitutes the first systematic evaluation of potential reference genes in Rhododendron molle G. Don. Eleven candidate reference genes in different tissues and flowers at different developmental stages of R. molle were assessed using the following three software packages: GeNorm, NormFinder and BestKeeper. The results showed that EF1-α (elongation factor 1-alpha, 18S (18s ribosomal RNA and RPL3 (ribosomal protein L3 were the most stable reference genes in developing rhododendron flowers and, thus, in all of the tested samples, while tublin (TUB was the least stable. ACT5 (actin, RPL3, 18S and EF1-α were found to be the top four choices for different tissues, whereas TUB was not found to favor qRT-PCR normalization in these tissues. Three stable reference genes are recommended for the normalization of qRT-PCR data in R. molle. Furthermore, the expression profiles of RmPSY (phytoene synthase and RmPDS (phytoene dehydrogenase were assessed using EF1-α, 18S, ACT5, and RPL3 and their combination as internals. Similar trends were found, but these trends varied when the least stable reference gene TUB was used. The results further prove that it is necessary to validate the stability of reference genes prior to their use for normalization under different experimental conditions. This study provides useful information for reliable qRT-PCR data normalization in gene studies of R. molle.

  19. Comparison of genome-wide selection strategies to identify furfural tolerance genes in Escherichia coli.

    Science.gov (United States)

    Glebes, Tirzah Y; Sandoval, Nicholas R; Gillis, Jacob H; Gill, Ryan T

    2015-01-01

    Engineering both feedstock and product tolerance is important for transitioning towards next-generation biofuels derived from renewable sources. Tolerance to chemical inhibitors typically results in complex phenotypes, for which multiple genetic changes must often be made to confer tolerance. Here, we performed a genome-wide search for furfural-tolerant alleles using the TRackable Multiplex Recombineering (TRMR) method (Warner et al. (2010), Nature Biotechnology), which uses chromosomally integrated mutations directed towards increased or decreased expression of virtually every gene in Escherichia coli. We employed various growth selection strategies to assess the role of selection design towards growth enrichments. We also compared genes with increased fitness from our TRMR selection to those from a previously reported genome-wide identification study of furfural tolerance genes using a plasmid-based genomic library approach (Glebes et al. (2014) PLOS ONE). In several cases, growth improvements were observed for the chromosomally integrated promoter/RBS mutations but not for the plasmid-based overexpression constructs. Through this assessment, four novel tolerance genes, ahpC, yhjH, rna, and dicA, were identified and confirmed for their effect on improving growth in the presence of furfural. © 2014 Wiley Periodicals, Inc.

  20. Pearl millet transformation system using the positive selectable marker gene phosphomannose isomerase

    CSIR Research Space (South Africa)

    O'Kennedy, MM

    2004-04-01

    Full Text Available of the transgene. Similar to results obtained from previous studies with maize and wheat, the manA gene was shown to be a superior selectable marker gene for improving transformation efficiencies when compared to antibiotic or herbicide selectable marker genes....

  1. A Combinatory Approach for Selecting Prognostic Genes in Microarray Studies of Tumour Survivals

    Directory of Open Access Journals (Sweden)

    Qihua Tan

    2009-01-01

    Full Text Available Different from significant gene expression analysis which looks for genes that are differentially regulated, feature selection in the microarray-based prognostic gene expression analysis aims at finding a subset of marker genes that are not only differentially expressed but also informative for prediction. Unfortunately feature selection in literature of microarray study is predominated by the simple heuristic univariate gene filter paradigm that selects differentially expressed genes according to their statistical significances. We introduce a combinatory feature selection strategy that integrates differential gene expression analysis with the Gram-Schmidt process to identify prognostic genes that are both statistically significant and highly informative for predicting tumour survival outcomes. Empirical application to leukemia and ovarian cancer survival data through-within- and cross-study validations shows that the feature space can be largely reduced while achieving improved testing performances.

  2. Evaluation of gene association methods for coexpression network construction and biological knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Sapna Kumari

    Full Text Available BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods - Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson - and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction.

  3. Selection for the compactness of highly expressed genes in Gallus gallus

    Directory of Open Access Journals (Sweden)

    Zhou Ming

    2010-05-01

    Full Text Available Abstract Background Coding sequence (CDS length, gene size, and intron length vary within a genome and among genomes. Previous studies in diverse organisms, including human, D. Melanogaster, C. elegans, S. cerevisiae, and Arabidopsis thaliana, indicated that there are negative relationships between expression level and gene size, CDS length as well as intron length. Different models such as selection for economy model, genomic design model, and mutational bias hypotheses have been proposed to explain such observation. The debate of which model is a superior one to explain the observation has not been settled down. The chicken (Gallus gallus is an important model organism that bridges the evolutionary gap between mammals and other vertebrates. As D. Melanogaster, chicken has a larger effective population size, selection for chicken genome is expected to be more effective in increasing protein synthesis efficiency. Therefore, in this study the chicken was used as a model organism to elucidate the interaction between gene features and expression pattern upon selection pressure. Results Based on different technologies, we gathered expression data for nuclear protein coding, single-splicing genes from Gallus gallus genome and compared them with gene parameters. We found that gene size, CDS length, first intron length, average intron length, and total intron length are negatively correlated with expression level and expression breadth significantly. The tissue specificity is positively correlated with the first intron length but negatively correlated with the average intron length, and not correlated with the CDS length and protein domain numbers. Comparison analyses showed that ubiquitously expressed genes and narrowly expressed genes with the similar expression levels do not differ in compactness. Our data provided evidence that the genomic design model can not, at least in part, explain our observations. We grouped all somatic-tissue-specific genes

  4. Selection of appropriate reference genes for the detection of rhythmic gene expression via quantitative real-time PCR in Tibetan hulless barley.

    Directory of Open Access Journals (Sweden)

    Jing Cai

    Full Text Available Hulless barley (Hordeum vulgare L. var. nudum. hook. f. has been cultivated as a major crop in the Qinghai-Tibet plateau of China for thousands of years. Compared to other cereal crops, the Tibetan hulless barley has developed stronger endogenous resistances to survive in the severe environment of its habitat. To understand the unique resistant mechanisms of this plant, detailed genetic studies need to be performed. The quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR is the most commonly used method in detecting gene expression. However, the selection of stable reference genes under limited experimental conditions was considered to be an essential step for obtaining accurate results in qRT-PCR. In this study, 10 candidate reference genes-ACT (Actin, E2 (Ubiquitin conjugating enzyme 2, TUBα (Alpha-tubulin, TUBβ6 (Beta-tubulin 6, GAPDH (Glyceraldehyde 3-phosphate dehydrogenase, EF-1α (Elongation factor 1-alpha, SAMDC (S-adenosylmethionine decarboxylase, PKABA1 (Gene for protein kinase HvPKABA1, PGK (Phosphoglycerate kinase, and HSP90 (Heat shock protein 90-were selected from the NCBI gene database of barley. Following qRT-PCR amplifications of all candidate reference genes in Tibetan hulless barley seedlings under various stressed conditions, the stabilities of these candidates were analyzed by three individual software packages including geNorm, NormFinder, and BestKeeper. The results demonstrated that TUBβ6, E2, TUBα, and HSP90 were generally the most suitable sets under all tested conditions; similarly, TUBα and HSP90 showed peak stability under salt stress, TUBα and EF-1α were the most suitable reference genes under cold stress, and ACT and E2 were the most stable under drought stress. Finally, a known circadian gene CCA1 was used to verify the service ability of chosen reference genes. The results confirmed that all recommended reference genes by the three software were suitable for gene expression

  5. Selection of appropriate reference genes for the detection of rhythmic gene expression via quantitative real-time PCR in Tibetan hulless barley.

    Science.gov (United States)

    Cai, Jing; Li, Pengfei; Luo, Xiao; Chang, Tianliang; Li, Jiaxing; Zhao, Yuwei; Xu, Yao

    2018-01-01

    Hulless barley (Hordeum vulgare L. var. nudum. hook. f.) has been cultivated as a major crop in the Qinghai-Tibet plateau of China for thousands of years. Compared to other cereal crops, the Tibetan hulless barley has developed stronger endogenous resistances to survive in the severe environment of its habitat. To understand the unique resistant mechanisms of this plant, detailed genetic studies need to be performed. The quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR) is the most commonly used method in detecting gene expression. However, the selection of stable reference genes under limited experimental conditions was considered to be an essential step for obtaining accurate results in qRT-PCR. In this study, 10 candidate reference genes-ACT (Actin), E2 (Ubiquitin conjugating enzyme 2), TUBα (Alpha-tubulin), TUBβ6 (Beta-tubulin 6), GAPDH (Glyceraldehyde 3-phosphate dehydrogenase), EF-1α (Elongation factor 1-alpha), SAMDC (S-adenosylmethionine decarboxylase), PKABA1 (Gene for protein kinase HvPKABA1), PGK (Phosphoglycerate kinase), and HSP90 (Heat shock protein 90)-were selected from the NCBI gene database of barley. Following qRT-PCR amplifications of all candidate reference genes in Tibetan hulless barley seedlings under various stressed conditions, the stabilities of these candidates were analyzed by three individual software packages including geNorm, NormFinder, and BestKeeper. The results demonstrated that TUBβ6, E2, TUBα, and HSP90 were generally the most suitable sets under all tested conditions; similarly, TUBα and HSP90 showed peak stability under salt stress, TUBα and EF-1α were the most suitable reference genes under cold stress, and ACT and E2 were the most stable under drought stress. Finally, a known circadian gene CCA1 was used to verify the service ability of chosen reference genes. The results confirmed that all recommended reference genes by the three software were suitable for gene expression analysis

  6. Detecting negative selection on recurrent mutations using gene genealogy

    Science.gov (United States)

    2013-01-01

    Background Whether or not a mutant allele in a population is under selection is an important issue in population genetics, and various neutrality tests have been invented so far to detect selection. However, detection of negative selection has been notoriously difficult, partly because negatively selected alleles are usually rare in the population and have little impact on either population dynamics or the shape of the gene genealogy. Recently, through studies of genetic disorders and genome-wide analyses, many structural variations were shown to occur recurrently in the population. Such “recurrent mutations” might be revealed as deleterious by exploiting the signal of negative selection in the gene genealogy enhanced by their recurrence. Results Motivated by the above idea, we devised two new test statistics. One is the total number of mutants at a recurrently mutating locus among sampled sequences, which is tested conditionally on the number of forward mutations mapped on the sequence genealogy. The other is the size of the most common class of identical-by-descent mutants in the sample, again tested conditionally on the number of forward mutations mapped on the sequence genealogy. To examine the performance of these two tests, we simulated recurrently mutated loci each flanked by sites with neutral single nucleotide polymorphisms (SNPs), with no recombination. Using neutral recurrent mutations as null models, we attempted to detect deleterious recurrent mutations. Our analyses demonstrated high powers of our new tests under constant population size, as well as their moderate power to detect selection in expanding populations. We also devised a new maximum parsimony algorithm that, given the states of the sampled sequences at a recurrently mutating locus and an incompletely resolved genealogy, enumerates mutation histories with a minimum number of mutations while partially resolving genealogical relationships when necessary. Conclusions With their

  7. GSEH: A Novel Approach to Select Prostate Cancer-Associated Genes Using Gene Expression Heterogeneity.

    Science.gov (United States)

    Kim, Hyunjin; Choi, Sang-Min; Park, Sanghyun

    2018-01-01

    When a gene shows varying levels of expression among normal people but similar levels in disease patients or shows similar levels of expression among normal people but different levels in disease patients, we can assume that the gene is associated with the disease. By utilizing this gene expression heterogeneity, we can obtain additional information that abets discovery of disease-associated genes. In this study, we used collaborative filtering to calculate the degree of gene expression heterogeneity between classes and then scored the genes on the basis of the degree of gene expression heterogeneity to find "differentially predicted" genes. Through the proposed method, we discovered more prostate cancer-associated genes than 10 comparable methods. The genes prioritized by the proposed method are potentially significant to biological processes of a disease and can provide insight into them.

  8. Antibody-directed lentiviral gene transduction for live-cell monitoring and selection of human iPS and hES cells.

    Directory of Open Access Journals (Sweden)

    Dai-tze Wu

    Full Text Available The identification of stem cells within a mixed population of cells is a major hurdle for stem cell biology--in particular, in the identification of induced pluripotent stem (iPS cells during the reprogramming process. Based on the selective expression of stem cell surface markers, a method to specifically infect stem cells through antibody-conjugated lentiviral particles has been developed that can deliver both visual markers for live-cell imaging as well as selectable markers to enrich for iPS cells. Antibodies recognizing SSEA4 and CD24 mediated the selective infection of the iPS cells over the parental human fibroblasts, allowing for rapid expansion of these cells by puromycin selection. Adaptation of the vector allows for the selective marking of human embryonic stem (hES cells for their removal from a population of differentiated cells. This method has the benefit that it not only identifies stem cells, but that specific genes, including positive and negative selection markers, regulatory genes or miRNA can be delivered to the targeted stem cells. The ability to specifically target gene delivery to human pluripotent stem cells has broad applications in tissue engineering and stem cell therapies.

  9. The roles of gene duplication, gene conversion and positive selection in rodent Esp and Mup pheromone gene families with comparison to the Abp family.

    Science.gov (United States)

    Karn, Robert C; Laukaitis, Christina M

    2012-01-01

    Three proteinaceous pheromone families, the androgen-binding proteins (ABPs), the exocrine-gland secreting peptides (ESPs) and the major urinary proteins (MUPs) are encoded by large gene families in the genomes of Mus musculus and Rattus norvegicus. We studied the evolutionary histories of the Mup and Esp genes and compared them with what is known about the Abp genes. Apparently gene conversion has played little if any role in the expansion of the mouse Class A and Class B Mup genes and pseudogenes, and the rat Mups. By contrast, we found evidence of extensive gene conversion in many Esp genes although not in all of them. Our studies of selection identified at least two amino acid sites in β-sheets as having evolved under positive selection in the mouse Class A and Class B MUPs and in rat MUPs. We show that selection may have acted on the ESPs by determining K(a)/K(s) for Exon 3 sequences with and without the converted sequence segment. While it appears that purifying selection acted on the ESP signal peptides, the secreted portions of the ESPs probably have undergone much more rapid evolution. When the inner gene converted fragment sequences were removed, eleven Esp paralogs were present in two or more pairs with K(a)/K(s) >1.0 and thus we propose that positive selection is detectable by this means in at least some mouse Esp paralogs. We compare and contrast the evolutionary histories of all three mouse pheromone gene families in light of their proposed functions in mouse communication.

  10. Positive selection in the SLC11A1 gene in the family Equidae

    DEFF Research Database (Denmark)

    Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan

    2016-01-01

    Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes...... a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans...... and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence...

  11. Research progress in machine learning methods for gene-gene interaction detection.

    Science.gov (United States)

    Peng, Zhe-Ye; Tang, Zi-Jun; Xie, Min-Zhu

    2018-03-20

    Complex diseases are results of gene-gene and gene-environment interactions. However, the detection of high-dimensional gene-gene interactions is computationally challenging. In the last two decades, machine-learning approaches have been developed to detect gene-gene interactions with some successes. In this review, we summarize the progress in research on machine learning methods, as applied to gene-gene interaction detection. It systematically examines the principles and limitations of the current machine learning methods used in genome wide association studies (GWAS) to detect gene-gene interactions, such as neural networks (NN), random forest (RF), support vector machines (SVM) and multifactor dimensionality reduction (MDR), and provides some insights on the future research directions in the field.

  12. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  13. The Integrative Method Based on the Module-Network for Identifying Driver Genes in Cancer Subtypes

    Directory of Open Access Journals (Sweden)

    Xinguo Lu

    2018-01-01

    Full Text Available With advances in next-generation sequencing(NGS technologies, a large number of multiple types of high-throughput genomics data are available. A great challenge in exploring cancer progression is to identify the driver genes from the variant genes by analyzing and integrating multi-types genomics data. Breast cancer is known as a heterogeneous disease. The identification of subtype-specific driver genes is critical to guide the diagnosis, assessment of prognosis and treatment of breast cancer. We developed an integrated frame based on gene expression profiles and copy number variation (CNV data to identify breast cancer subtype-specific driver genes. In this frame, we employed statistical machine-learning method to select gene subsets and utilized an module-network analysis method to identify potential candidate driver genes. The final subtype-specific driver genes were acquired by paired-wise comparison in subtypes. To validate specificity of the driver genes, the gene expression data of these genes were applied to classify the patient samples with 10-fold cross validation and the enrichment analysis were also conducted on the identified driver genes. The experimental results show that the proposed integrative method can identify the potential driver genes and the classifier with these genes acquired better performance than with genes identified by other methods.

  14. Self-assembled pentablock copolymers for selective and sustained gene delivery

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Bingqi [Iowa State Univ., Ames, IA (United States)

    2011-05-15

    The poly(diethylaminoethyl methacrylate) (PDEAEM) - Pluronic F127 - PDEAEM pentablock copolymer (PB) gene delivery vector system has been found to possess an inherent selectivity in transfecting cancer cells over non-cancer cells in vitro, without attaching any targeting ligands. In order to understand the mechanism of this selective transfection, three possible intracellular barriers to transfection were investigated in both cancer and non-cancer cells. We concluded that escape from the endocytic pathway served as the primary intracellular barrier for PB-mediated transfection. Most likely, PB vectors were entrapped and rendered non-functional in acidic lysosomes of non-cancer cells, but survived in less acidic lysosomes of cancer cells. The work highlights the importance of identifying intracellular barriers for different gene delivery systems and provides a new paradigm for designing targeting vectors based on intracellular differences between cell types, rather than through the use of targeting ligands. The PB vector was further developed to simultaneously deliver anticancer drugs and genes, which showed a synergistic effect demonstrated by significantly enhanced gene expression in vitro. Due to the thermosensitive gelation behavior, the PB vector packaging both drug and gene was also investigated for its in vitro sustained release properties by using polyethylene glycol diacrylate as a barrier gel to mimic the tumor matrix in vivo. Overall, this work resulted in the development of a gene delivery vector for sustained and selective gene delivery to tumor cells for cancer therapy.

  15. Population Level Purifying Selection and Gene Expression Shape Subgenome Evolution in Maize.

    Science.gov (United States)

    Pophaly, Saurabh D; Tellier, Aurélien

    2015-12-01

    The maize ancestor experienced a recent whole-genome duplication (WGD) followed by gene erosion which generated two subgenomes, the dominant subgenome (maize1) experiencing fewer deletions than maize2. We take advantage of available extensive polymorphism and gene expression data in maize to study purifying selection and gene expression divergence between WGD retained paralog pairs. We first report a strong correlation in nucleotide diversity between duplicate pairs, except for upstream regions. We then show that maize1 genes are under stronger purifying selection than maize2. WGD retained genes have higher gene dosage and biased Gene Ontologies consistent with previous studies. The relative gene expression of paralogs across tissues demonstrates that 98% of duplicate pairs have either subfunctionalized in a tissuewise manner or have diverged consistently in their expression thereby preventing functional complementation. Tissuewise subfunctionalization seems to be a hallmark of transcription factors, whereas consistent repression occurs for macromolecular complexes. We show that dominant gene expression is a strong determinant of the strength of purifying selection, explaining the inferred stronger negative selection on maize1 genes. We propose a novel expression-based classification of duplicates which is more robust to explain observed polymorphism patterns than the subgenome location. Finally, upstream regions of repressed genes exhibit an enrichment in transposable elements which indicates a possible mechanism for expression divergence. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  16. Repeated adaptive introgression at a gene under multiallelic balancing selection.

    Directory of Open Access Journals (Sweden)

    Vincent Castric

    2008-08-01

    Full Text Available Recently diverged species typically have incomplete reproductive barriers, allowing introgression of genetic material from one species into the genomic background of the other. The role of natural selection in preventing or promoting introgression remains contentious. Because of genomic co-adaptation, some chromosomal fragments are expected to be selected against in the new background and resist introgression. In contrast, natural selection should favor introgression for alleles at genes evolving under multi-allelic balancing selection, such as the MHC in vertebrates, disease resistance, or self-incompatibility genes in plants. Here, we test the prediction that negative, frequency-dependent selection on alleles at the multi-allelic gene controlling pistil self-incompatibility specificity in two closely related species, Arabidopsis halleri and A. lyrata, caused introgression at this locus at a higher rate than the genomic background. Polymorphism at this gene is largely shared, and we have identified 18 pairs of S-alleles that are only slightly divergent between the two species. For these pairs of S-alleles, divergence at four-fold degenerate sites (K = 0.0193 is about four times lower than the genomic background (K = 0.0743. We demonstrate that this difference cannot be explained by differences in effective population size between the two types of loci. Rather, our data are most consistent with a five-fold increase of introgression rates for S-alleles as compared to the genomic background, making this study the first documented example of adaptive introgression facilitated by balancing selection. We suggest that this process plays an important role in the maintenance of high allelic diversity and divergence at the S-locus in flowering plant families. Because genes under balancing selection are expected to be among the last to stop introgressing, their comparison in closely related species provides a lower-bound estimate of the time since the

  17. The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer

    Directory of Open Access Journals (Sweden)

    Rosa Aghdam

    2017-12-01

    Full Text Available Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/.

  18. Selective Gene Delivery for Integrating Exogenous DNA into Plastid and Mitochondrial Genomes Using Peptide-DNA Complexes.

    Science.gov (United States)

    Yoshizumi, Takeshi; Oikawa, Kazusato; Chuah, Jo-Ann; Kodama, Yutaka; Numata, Keiji

    2018-05-14

    Selective gene delivery into organellar genomes (mitochondrial and plastid genomes) has been limited because of a lack of appropriate platform technology, even though these organelles are essential for metabolite and energy production. Techniques for selective organellar modification are needed to functionally improve organelles and produce transplastomic/transmitochondrial plants. However, no method for mitochondrial genome modification has yet been established for multicellular organisms including plants. Likewise, modification of plastid genomes has been limited to a few plant species and algae. In the present study, we developed ionic complexes of fusion peptides containing organellar targeting signal and plasmid DNA for selective delivery of exogenous DNA into the plastid and mitochondrial genomes of intact plants. This is the first report of exogenous DNA being integrated into the mitochondrial genomes of not only plants, but also multicellular organisms in general. This fusion peptide-mediated gene delivery system is a breakthrough platform for both plant organellar biotechnology and gene therapy for mitochondrial diseases in animals.

  19. Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica.

    Science.gov (United States)

    Guo, Yue; Liu, Jing; Zhang, Jiefu; Liu, Shengyi; Du, Jianchang

    2017-07-01

    It has been well documented that most nuclear protein-coding genes in organisms can be classified into two categories: positively selected genes (PSGs) and negatively selected genes (NSGs). The characteristics and evolutionary fates of different types of genes, however, have been poorly understood. In this study, the rates of nonsynonymous substitution (K a ) and the rates of synonymous substitution (K s ) were investigated by comparing the orthologs between the two sequenced Brassica species, Brassica rapa and Brassica oleracea, and the evolutionary rates, gene structures, expression patterns, and codon bias were compared between PSGs and NSGs. The resulting data show that PSGs have higher protein evolutionary rates, lower synonymous substitution rates, shorter gene length, fewer exons, higher functional specificity, lower expression level, higher tissue-specific expression and stronger codon bias than NSGs. Although the quantities and values are different, the relative features of PSGs and NSGs have been largely verified in the model species Arabidopsis. These data suggest that PSGs and NSGs differ not only under selective pressure (K a /K s ), but also in their evolutionary, structural and functional properties, indicating that selective modes may serve as a determinant factor for measuring evolutionary rates, gene compactness and expression patterns in Brassica. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  20. Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes

    Directory of Open Access Journals (Sweden)

    Lotz Meredith J

    2008-01-01

    Full Text Available Abstract Background Gene expression data frequently contain missing values, however, most down-stream analyses for microarray experiments require complete data. In the literature many methods have been proposed to estimate missing values via information of the correlation patterns within the gene expression matrix. Each method has its own advantages, but the specific conditions for which each method is preferred remains largely unclear. In this report we describe an extensive evaluation of eight current imputation methods on multiple types of microarray experiments, including time series, multiple exposures, and multiple exposures × time series data. We then introduce two complementary selection schemes for determining the most appropriate imputation method for any given data set. Results We found that the optimal imputation algorithms (LSA, LLS, and BPCA are all highly competitive with each other, and that no method is uniformly superior in all the data sets we examined. The success of each method can also depend on the underlying "complexity" of the expression data, where we take complexity to indicate the difficulty in mapping the gene expression matrix to a lower-dimensional subspace. We developed an entropy measure to quantify the complexity of expression matrixes and found that, by incorporating this information, the entropy-based selection (EBS scheme is useful for selecting an appropriate imputation algorithm. We further propose a simulation-based self-training selection (STS scheme. This technique has been used previously for microarray data imputation, but for different purposes. The scheme selects the optimal or near-optimal method with high accuracy but at an increased computational cost. Conclusion Our findings provide insight into the problem of which imputation method is optimal for a given data set. Three top-performing methods (LSA, LLS and BPCA are competitive with each other. Global-based imputation methods (PLS, SVD, BPCA

  1. Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes.

    Science.gov (United States)

    Brock, Guy N; Shaffer, John R; Blakesley, Richard E; Lotz, Meredith J; Tseng, George C

    2008-01-10

    Gene expression data frequently contain missing values, however, most down-stream analyses for microarray experiments require complete data. In the literature many methods have been proposed to estimate missing values via information of the correlation patterns within the gene expression matrix. Each method has its own advantages, but the specific conditions for which each method is preferred remains largely unclear. In this report we describe an extensive evaluation of eight current imputation methods on multiple types of microarray experiments, including time series, multiple exposures, and multiple exposures x time series data. We then introduce two complementary selection schemes for determining the most appropriate imputation method for any given data set. We found that the optimal imputation algorithms (LSA, LLS, and BPCA) are all highly competitive with each other, and that no method is uniformly superior in all the data sets we examined. The success of each method can also depend on the underlying "complexity" of the expression data, where we take complexity to indicate the difficulty in mapping the gene expression matrix to a lower-dimensional subspace. We developed an entropy measure to quantify the complexity of expression matrixes and found that, by incorporating this information, the entropy-based selection (EBS) scheme is useful for selecting an appropriate imputation algorithm. We further propose a simulation-based self-training selection (STS) scheme. This technique has been used previously for microarray data imputation, but for different purposes. The scheme selects the optimal or near-optimal method with high accuracy but at an increased computational cost. Our findings provide insight into the problem of which imputation method is optimal for a given data set. Three top-performing methods (LSA, LLS and BPCA) are competitive with each other. Global-based imputation methods (PLS, SVD, BPCA) performed better on mcroarray data with lower complexity

  2. A probabilistic method for testing and estimating selection differences between populations.

    Science.gov (United States)

    He, Yungang; Wang, Minxian; Huang, Xin; Li, Ran; Xu, Hongyang; Xu, Shuhua; Jin, Li

    2015-12-01

    Human populations around the world encounter various environmental challenges and, consequently, develop genetic adaptations to different selection forces. Identifying the differences in natural selection between populations is critical for understanding the roles of specific genetic variants in evolutionary adaptation. Although numerous methods have been developed to detect genetic loci under recent directional selection, a probabilistic solution for testing and quantifying selection differences between populations is lacking. Here we report the development of a probabilistic method for testing and estimating selection differences between populations. By use of a probabilistic model of genetic drift and selection, we showed that logarithm odds ratios of allele frequencies provide estimates of the differences in selection coefficients between populations. The estimates approximate a normal distribution, and variance can be estimated using genome-wide variants. This allows us to quantify differences in selection coefficients and to determine the confidence intervals of the estimate. Our work also revealed the link between genetic association testing and hypothesis testing of selection differences. It therefore supplies a solution for hypothesis testing of selection differences. This method was applied to a genome-wide data analysis of Han and Tibetan populations. The results confirmed that both the EPAS1 and EGLN1 genes are under statistically different selection in Han and Tibetan populations. We further estimated differences in the selection coefficients for genetic variants involved in melanin formation and determined their confidence intervals between continental population groups. Application of the method to empirical data demonstrated the outstanding capability of this novel approach for testing and quantifying differences in natural selection. © 2015 He et al.; Published by Cold Spring Harbor Laboratory Press.

  3. Positive selection in the SLC11A1 gene in the family Equidae.

    Science.gov (United States)

    Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr

    2016-05-01

    Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.

  4. Analysis of polymorphisms and selective pressures on ama1 gene ...

    Indian Academy of Sciences (India)

    Chuen Yang Chua

    2017-09-05

    Sep 5, 2017 ... The presence of purifying selection and low nucleotide diversity ... (2000) studied the gene substitution of ama1 ... in the gene coding for AMA-1 protein in Plasmodium ... Health Malaysia. ...... X. Asembo Bay Cohort Project.

  5. The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer.

    Science.gov (United States)

    Aghdam, Rosa; Baghfalaki, Taban; Khosravi, Pegah; Saberi Ansari, Elnaz

    2017-12-01

    Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM) method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/. Copyright © 2017. Production and hosting by Elsevier B.V.

  6. Robust variable selection method for nonparametric differential equation models with application to nonlinear dynamic gene regulatory network analysis.

    Science.gov (United States)

    Lu, Tao

    2016-01-01

    The gene regulation network (GRN) evaluates the interactions between genes and look for models to describe the gene expression behavior. These models have many applications; for instance, by characterizing the gene expression mechanisms that cause certain disorders, it would be possible to target those genes to block the progress of the disease. Many biological processes are driven by nonlinear dynamic GRN. In this article, we propose a nonparametric differential equation (ODE) to model the nonlinear dynamic GRN. Specially, we address following questions simultaneously: (i) extract information from noisy time course gene expression data; (ii) model the nonlinear ODE through a nonparametric smoothing function; (iii) identify the important regulatory gene(s) through a group smoothly clipped absolute deviation (SCAD) approach; (iv) test the robustness of the model against possible shortening of experimental duration. We illustrate the usefulness of the model and associated statistical methods through a simulation and a real application examples.

  7. Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion.

    Science.gov (United States)

    Zhao, Shanrong; Zhang, Ying; Gamini, Ramya; Zhang, Baohong; von Schack, David

    2018-03-19

    To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.

  8. A data science approach to candidate gene selection of pain regarded as a process of learning and neural plasticity.

    Science.gov (United States)

    Ultsch, Alfred; Kringel, Dario; Kalso, Eija; Mogil, Jeffrey S; Lötsch, Jörn

    2016-12-01

    The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 535 genes identified empirically as relevant to pain with the knowledge about the functions of thousands of genes. Starting from an accepted description of chronic pain as displaying systemic features described by the terms "learning" and "neuronal plasticity," a functional genomics analysis proposed that among the functions of the 535 "pain genes," the biological processes "learning or memory" (P = 8.6 × 10) and "nervous system development" (P = 2.4 × 10) are statistically significantly overrepresented as compared with the annotations to these processes expected by chance. After establishing that the hypothesized biological processes were among important functional genomics features of pain, a subset of n = 34 pain genes were found to be annotated with both Gene Ontology terms. Published empirical evidence supporting their involvement in chronic pain was identified for almost all these genes, including 1 gene identified in March 2016 as being involved in pain. By contrast, such evidence was virtually absent in a randomly selected set of 34 other human genes. Hence, the present computational functional genomics-based method can be used for candidate gene selection, providing an alternative to established methods.

  9. Balancing selection on immunity genes: review of the current literature and new analysis in Drosophila melanogaster.

    Science.gov (United States)

    Croze, Myriam; Živković, Daniel; Stephan, Wolfgang; Hutter, Stephan

    2016-08-01

    Balancing selection has been widely assumed to be an important evolutionary force, yet even today little is known about its abundance and its impact on the patterns of genetic diversity. Several studies have shown examples of balancing selection in humans, plants or parasites, and many genes under balancing selection are involved in immunity. It has been proposed that host-parasite coevolution is one of the main forces driving immune genes to evolve under balancing selection. In this paper, we review the literature on balancing selection on immunity genes in several organisms, including Drosophila. Furthermore, we performed a genome scan for balancing selection in an African population of Drosophila melanogaster using coalescent simulations of a demographic model with and without selection. We find very few genes under balancing selection and only one novel candidate gene related to immunity. Finally, we discuss the possible causes of the low number of genes under balancing selection. Copyright © 2016 The Authors. Published by Elsevier GmbH.. All rights reserved.

  10. Genome-wide analysis of positively selected genes in seasonal and non-seasonal breeding species.

    Directory of Open Access Journals (Sweden)

    Yuhuan Meng

    Full Text Available Some mammals breed throughout the year, while others breed only at certain times of year. These differences in reproductive behavior can be explained by evolution. We identified positively-selected genes in two sets of species with different degrees of relatedness including seasonal and non-seasonal breeding species, using branch-site models. After stringent filtering by sum of pairs scoring, we revealed that more genes underwent positive selection in seasonal compared with non-seasonal breeding species. Positively-selected genes were verified by cDNA mapping of the positive sites with the corresponding cDNA sequences. The design of the evolutionary analysis can effectively lower the false-positive rate and thus identify valid positive genes. Validated, positively-selected genes, including CGA, DNAH1, INVS, and CD151, were related to reproductive behaviors such as spermatogenesis and cell proliferation in non-seasonal breeding species. Genes in seasonal breeding species, including THRAP3, TH1L, and CMTM6, may be related to the evolution of sperm and the circadian rhythm system. Identification of these positively-selected genes might help to identify the molecular mechanisms underlying seasonal and non-seasonal reproductive behaviors.

  11. Bacterial evolution through the selective loss of beneficial Genes. Trade-offs in expression involving two loci.

    Science.gov (United States)

    Zinser, Erik R; Schneider, Dominique; Blot, Michel; Kolter, Roberto

    2003-01-01

    The loss of preexisting genes or gene activities during evolution is a major mechanism of ecological specialization. Evolutionary processes that can account for gene loss or inactivation have so far been restricted to one of two mechanisms: direct selection for the loss of gene activities that are disadvantageous under the conditions of selection (i.e., antagonistic pleiotropy) and selection-independent genetic drift of neutral (or nearly neutral) mutations (i.e., mutation accumulation). In this study we demonstrate with an evolved strain of Escherichia coli that a third, distinct mechanism exists by which gene activities can be lost. This selection-dependent mechanism involves the expropriation of one gene's upstream regulatory element by a second gene via a homologous recombination event. Resulting from this genetic exchange is the activation of the second gene and a concomitant inactivation of the first gene. This gene-for-gene expression tradeoff provides a net fitness gain, even if the forfeited activity of the first gene can play a positive role in fitness under the conditions of selection. PMID:12930738

  12. Bacterial evolution through the selective loss of beneficial Genes. Trade-offs in expression involving two loci.

    Science.gov (United States)

    Zinser, Erik R; Schneider, Dominique; Blot, Michel; Kolter, Roberto

    2003-08-01

    The loss of preexisting genes or gene activities during evolution is a major mechanism of ecological specialization. Evolutionary processes that can account for gene loss or inactivation have so far been restricted to one of two mechanisms: direct selection for the loss of gene activities that are disadvantageous under the conditions of selection (i.e., antagonistic pleiotropy) and selection-independent genetic drift of neutral (or nearly neutral) mutations (i.e., mutation accumulation). In this study we demonstrate with an evolved strain of Escherichia coli that a third, distinct mechanism exists by which gene activities can be lost. This selection-dependent mechanism involves the expropriation of one gene's upstream regulatory element by a second gene via a homologous recombination event. Resulting from this genetic exchange is the activation of the second gene and a concomitant inactivation of the first gene. This gene-for-gene expression tradeoff provides a net fitness gain, even if the forfeited activity of the first gene can play a positive role in fitness under the conditions of selection.

  13. A review for detecting gene-gene interactions using machine learning methods in genetic epidemiology.

    Science.gov (United States)

    Koo, Ching Lee; Liew, Mei Jing; Mohamad, Mohd Saberi; Salleh, Abdul Hakim Mohamed

    2013-01-01

    Recently, the greatest statistical computational challenge in genetic epidemiology is to identify and characterize the genes that interact with other genes and environment factors that bring the effect on complex multifactorial disease. These gene-gene interactions are also denoted as epitasis in which this phenomenon cannot be solved by traditional statistical method due to the high dimensionality of the data and the occurrence of multiple polymorphism. Hence, there are several machine learning methods to solve such problems by identifying such susceptibility gene which are neural networks (NNs), support vector machine (SVM), and random forests (RFs) in such common and multifactorial disease. This paper gives an overview on machine learning methods, describing the methodology of each machine learning methods and its application in detecting gene-gene and gene-environment interactions. Lastly, this paper discussed each machine learning method and presents the strengths and weaknesses of each machine learning method in detecting gene-gene interactions in complex human disease.

  14. A Review for Detecting Gene-Gene Interactions Using Machine Learning Methods in Genetic Epidemiology

    Directory of Open Access Journals (Sweden)

    Ching Lee Koo

    2013-01-01

    Full Text Available Recently, the greatest statistical computational challenge in genetic epidemiology is to identify and characterize the genes that interact with other genes and environment factors that bring the effect on complex multifactorial disease. These gene-gene interactions are also denoted as epitasis in which this phenomenon cannot be solved by traditional statistical method due to the high dimensionality of the data and the occurrence of multiple polymorphism. Hence, there are several machine learning methods to solve such problems by identifying such susceptibility gene which are neural networks (NNs, support vector machine (SVM, and random forests (RFs in such common and multifactorial disease. This paper gives an overview on machine learning methods, describing the methodology of each machine learning methods and its application in detecting gene-gene and gene-environment interactions. Lastly, this paper discussed each machine learning method and presents the strengths and weaknesses of each machine learning method in detecting gene-gene interactions in complex human disease.

  15. Evidence for positive selection on the leptin gene in Cetacea and Pinnipedia.

    Directory of Open Access Journals (Sweden)

    Li Yu

    Full Text Available The leptin gene has received intensive attention and scientific investigation for its importance in energy homeostasis and reproductive regulation in mammals. Furthermore, study of the leptin gene is of crucial importance for public health, particularly for its role in obesity, as well as for other numerous physiological roles that it plays in mammals. In the present work, we report the identification of novel leptin genes in 4 species of Cetacea, and a comparison with 55 publicly available leptin sequences from mammalian genome assemblies and previous studies. Our study provides evidence for positive selection in the suborder Odontoceti (toothed whales of the Cetacea and the family Phocidae (earless seals of the Pinnipedia. We also detected positive selection in several leptin gene residues in these two lineages. To test whether leptin and its receptor evolved in a coordinated manner, we analyzed 24 leptin receptor gene (LPR sequences from available mammalian genome assemblies and other published data. Unlike the case of leptin, our analyses did not find evidence of positive selection for LPR across the Cetacea and Pinnipedia lineages. In line with this, positively selected sites identified in the leptin genes of these two lineages were located outside of leptin receptor binding sites, which at least partially explains why co-evolution of leptin and its receptor was not observed in the present study. Our study provides interesting insights into current understanding of the evolution of mammalian leptin genes in response to selective pressures from life in an aquatic environment, and leads to a hypothesis that new tissue specificity or novel physiologic functions of leptin genes may have arisen in both odontocetes and phocids. Additional data from other species encompassing varying life histories and functional tests of the adaptive role of the amino acid changes identified in this study will help determine the factors that promote the adaptive

  16. Flies selected for longevity retain a young gene expression profile

    DEFF Research Database (Denmark)

    Sarup, Pernille Merete; Sørensen, Peter; Loeschcke, Volker

    2011-01-01

      We investigated correlated responses in the transcriptomes of longevity-selected lines of Drosophila melanogaster to identify pathways that affect life span in metazoan systems. We evaluated the gene expression profile in young, middle-aged, and old male flies, finding that 530 genes were...

  17. Selecting and validating reference genes for quantitative real-time PCR in Plutella xylostella (L.).

    Science.gov (United States)

    You, Yanchun; Xie, Miao; Vasseur, Liette; You, Minsheng

    2018-05-01

    Gene expression analysis provides important clues regarding gene functions, and quantitative real-time PCR (qRT-PCR) is a widely used method in gene expression studies. Reference genes are essential for normalizing and accurately assessing gene expression. In the present study, 16 candidate reference genes (ACTB, CyPA, EF1-α, GAPDH, HSP90, NDPk, RPL13a, RPL18, RPL19, RPL32, RPL4, RPL8, RPS13, RPS4, α-TUB, and β-TUB) from Plutella xylostella were selected to evaluate gene expression stability across different experimental conditions using five statistical algorithms (geNorm, NormFinder, Delta Ct, BestKeeper, and RefFinder). The results suggest that different reference genes or combinations of reference genes are suitable for normalization in gene expression studies of P. xylostella according to the different developmental stages, strains, tissues, and insecticide treatments. Based on the given experimental sets, the most stable reference genes were RPS4 across different developmental stages, RPL8 across different strains and tissues, and EF1-α across different insecticide treatments. A comprehensive and systematic assessment of potential reference genes for gene expression normalization is essential for post-genomic functional research in P. xylostella, a notorious pest with worldwide distribution and a high capacity to adapt and develop resistance to insecticides.

  18. Selection of reference genes for gene expression studies in heart failure for left and right ventricles.

    Science.gov (United States)

    Li, Mengmeng; Rao, Man; Chen, Kai; Zhou, Jianye; Song, Jiangping

    2017-07-15

    Real-time quantitative reverse transcriptase-PCR (qRT-PCR) is a feasible tool for determining gene expression profiles, but the accuracy and reliability of the results depends on the stable expression of selected housekeeping genes in different samples. By far, researches on stable housekeeping genes in human heart failure samples are rare. Moreover the effect of heart failure on the expression of housekeeping genes in right and left ventricles is yet to be studied. Therefore we aim to provide stable housekeeping genes for both ventricles in heart failure and normal heart samples. In this study, we selected seven commonly used housekeeping genes as candidates. By using the qRT-PCR, the expression levels of ACTB, RAB7A, GAPDH, REEP5, RPL5, PSMB4 and VCP in eight heart failure and four normal heart samples were assessed. The stability of candidate housekeeping genes was evaluated by geNorm and Normfinder softwares. GAPDH showed the least variation in all heart samples. Results also indicated the difference of gene expression existed in heart failure left and right ventricles. GAPDH had the highest expression stability in both heart failure and normal heart samples. We also propose using different sets of housekeeping genes for left and right ventricles respectively. The combination of RPL5, GAPDH and PSMB4 is suitable for the right ventricle and the combination of GAPDH, REEP5 and RAB7A is suitable for the left ventricle. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Natural selection on protein-coding genes in the human genome

    DEFF Research Database (Denmark)

    Bustamente, Carlos D.; Fledel-Alon, Adi; Williamson, Scott

    2005-01-01

    , showing an excess of deleterious variation within local populations 9, 10 . Here we contrast patterns of coding sequence polymorphism identified by direct sequencing of 39 humans for over 11,000 genes to divergence between humans and chimpanzees, and find strong evidence that natural selection has shaped......Comparisons of DNA polymorphism within species to divergence between species enables the discovery of molecular adaptation in evolutionarily constrained genes as well as the differentiation of weak from strong purifying selection 1, 2, 3, 4 . The extent to which weak negative and positive darwinian...... selection have driven the molecular evolution of different species varies greatly 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 , with some species, such as Drosophila melanogaster, showing strong evidence of pervasive positive selection 6, 7, 8, 9 , and others, such as the selfing weed Arabidopsis thaliana...

  20. Selection and validation of reference genes for quantitative gene expression analyses in various tissues and seeds at different developmental stages in Bixa orellana L.

    Science.gov (United States)

    Moreira, Viviane S; Soares, Virgínia L F; Silva, Raner J S; Sousa, Aurizangela O; Otoni, Wagner C; Costa, Marcio G C

    2018-05-01

    Bixa orellana L., popularly known as annatto, produces several secondary metabolites of pharmaceutical and industrial interest, including bixin, whose molecular basis of biosynthesis remain to be determined. Gene expression analysis by quantitative real-time PCR (qPCR) is an important tool to advance such knowledge. However, correct interpretation of qPCR data requires the use of suitable reference genes in order to reduce experimental variations. In the present study, we have selected four different candidates for reference genes in B. orellana , coding for 40S ribosomal protein S9 (RPS9), histone H4 (H4), 60S ribosomal protein L38 (RPL38) and 18S ribosomal RNA (18SrRNA). Their expression stabilities in different tissues (e.g. flower buds, flowers, leaves and seeds at different developmental stages) were analyzed using five statistical tools (NormFinder, geNorm, BestKeeper, ΔCt method and RefFinder). The results indicated that RPL38 is the most stable gene in different tissues and stages of seed development and 18SrRNA is the most unstable among the analyzed genes. In order to validate the candidate reference genes, we have analyzed the relative expression of a target gene coding for carotenoid cleavage dioxygenase 1 (CCD1) using the stable RPL38 and the least stable gene, 18SrRNA , for normalization of the qPCR data. The results demonstrated significant differences in the interpretation of the CCD1 gene expression data, depending on the reference gene used, reinforcing the importance of the correct selection of reference genes for normalization.

  1. Selection and validation of endogenous reference genes for qRT-PCR analysis in leafy spurge (Euphorbia esula.

    Directory of Open Access Journals (Sweden)

    Wun S Chao

    Full Text Available Quantitative real-time polymerase chain reaction (qRT-PCR is the most important tool in measuring levels of gene expression due to its accuracy, specificity, and sensitivity. However, the accuracy of qRT-PCR analysis strongly depends on transcript normalization using stably expressed reference genes. The aim of this study was to find internal reference genes for qRT-PCR analysis in various experimental conditions for seed, adventitious underground bud, and other organs of leafy spurge. Eleven candidate reference genes (BAM4, PU1, TRP-like, FRO1, ORE9, BAM1, SEU, ARF2, KAPP, ZTL, and MPK4 were selected from among 171 genes based on expression stabilities during seed germination and bud growth. The other ten candidate reference genes were selected from three different sources: (1 3 stably expressed leafy spurge genes (60S, bZIP21, and MD-100 identified from the analyses of leafy spurge microarray data; (2 3 orthologs of Arabidopsis "general purpose" traditional reference genes (GAPDH_1, GAPDH_2, and UBC; and (3 4 orthologs of Arabidopsis stably expressed genes (UBC9, SAND, PTB, and F-box identified from Affymetrix ATH1 whole-genome GeneChip studies. The expression stabilities of these 21 genes were ranked based on the C(T values of 72 samples using four different computation programs including geNorm, Normfinder, BestKeeper, and the comparative ΔC(T method. Our analyses revealed SAND, PTB, ORE9, and ARF2 to be the most appropriate reference genes for accurate normalization of gene expression data. Since SAND and PTB were obtained from 4 orthologs of Arabidopsis, while ORE9 and ARF2 were selected from 171 leafy spurge genes, it was more efficient to identify good reference genes from the orthologs of other plant species that were known to be stably expressed than that of randomly testing endogenous genes. Nevertheless, the two newly identified leafy spurge genes, ORE9 and ARF2, can serve as orthologous candidates in the search for reference genes

  2. Selection of Suitable Endogenous Reference Genes for Relative Copy Number Detection in Sugarcane

    Directory of Open Access Journals (Sweden)

    Bantong Xue

    2014-05-01

    Full Text Available Transgene copy number has a great impact on the expression level and stability of exogenous gene in transgenic plants. Proper selection of endogenous reference genes is necessary for detection of genetic components in genetically modification (GM crops by quantitative real-time PCR (qPCR or by qualitative PCR approach, especially in sugarcane with polyploid and aneuploid genomic structure. qPCR technique has been widely accepted as an accurate, time-saving method on determination of copy numbers in transgenic plants and on detection of genetically modified plants to meet the regulatory and legislative requirement. In this study, to find a suitable endogenous reference gene and its real-time PCR assay for sugarcane (Saccharum spp. hybrids DNA content quantification, we evaluated a set of potential “single copy” genes including P4H, APRT, ENOL, CYC, TST and PRR, through qualitative PCR and absolute quantitative PCR. Based on copy number comparisons among different sugarcane genotypes, including five S. officinarum, one S. spontaneum and two S. spp. hybrids, these endogenous genes fell into three groups: ENOL-3—high copy number group, TST-1 and PRR-1—medium copy number group, P4H-1, APRT-2 and CYC-2—low copy number group. Among these tested genes, P4H, APRT and CYC were the most stable, while ENOL and TST were the least stable across different sugarcane genotypes. Therefore, three primer pairs of P4H-3, APRT-2 and CYC-2 were then selected as the suitable reference gene primer pairs for sugarcane. The test of multi-target reference genes revealed that the APRT gene was a specific amplicon, suggesting this gene is the most suitable to be used as an endogenous reference target for sugarcane DNA content quantification. These results should be helpful for establishing accurate and reliable qualitative and quantitative PCR analysis of GM sugarcane.

  3. Clustering based gene expression feature selection method: A computational approach to enrich the classifier efficiency of differentially expressed genes

    KAUST Repository

    Abusamra, Heba; Bajic, Vladimir B.

    2016-01-01

    decrease the computational time and cost, but also improve the classification performance. Among different approaches of feature selection methods, however most of them suffer from several problems such as lack of robustness, validation issues etc. Here, we

  4. Actin gene identification from selected medicinal plants for their use as internal controls for gene expression studies

    International Nuclear Information System (INIS)

    Mufti, F.U.D.; Banaras, S.

    2015-01-01

    Internal control genes are the constitutive genes which maintain the basic cellular functions and regularly express in both normal and stressed conditions in living organisms. They are used in normalization of gene expression studies in comparative analysis of target genes, as their expression remains comparatively unchanged in all varied conditions. Among internal control genes, actin is considered as a candidate gene for expression studies due to its vital role in shaping cytoskeleton and plant physiology. Unfortunately most of such knowledge is limited to only model plants or crops, not much is known about important medicinal plants. Therefore, we selected seven important medicinal wild plants for molecular identification of actin gene. We used gene specific primers designed from the conserved regions of several known orthologues or homologues of actin genes from other plants. The amplified products of 370-380 bp were sequenced and submitted to GeneBank after their confirmation using different bioinformatics tools. All the novel partial sequences of putative actin genes were submitted to GeneBank (Parthenium hysterophorus (KJ774023), Fagonia indica (KJ774024), Rhazya stricta (KJ774025), Whithania coagulans (KJ774026), Capparis decidua (KJ774027), Verbena officinalis (KJ774028) and Aerva javanica (KJ774029)). The comparisons of these partial sequences by Basic Local Alignment Search Tool (BLAST) and phylogenetic trees demonstrated high similarity with known actin genes of other plants. Our findings illustrated highly conserved nature of actin gene among these selected plants. These novel partial fragments of actin genes from these wild medicinal plants can be used as internal controls for future gene expression studies of these important plants after precise validations of their stable expression in such plants. (author)

  5. Identification of Subtype-Specific Prognostic Genes for Early-Stage Lung Adenocarcinoma and Squamous Cell Carcinoma Patients Using an Embedded Feature Selection Algorithm.

    Directory of Open Access Journals (Sweden)

    Suyan Tian

    Full Text Available The existence of fundamental differences between lung adenocarcinoma (AC and squamous cell carcinoma (SCC in their underlying mechanisms motivated us to postulate that specific genes might exist relevant to prognosis of each histology subtype. To test on this research hypothesis, we previously proposed a simple Cox-regression model based feature selection algorithm and identified successfully some subtype-specific prognostic genes when applying this method to real-world data. In this article, we continue our effort on identification of subtype-specific prognostic genes for AC and SCC, and propose a novel embedded feature selection method by extending Threshold Gradient Descent Regularization (TGDR algorithm and minimizing on a corresponding negative partial likelihood function. Using real-world datasets and simulated ones, we show these two proposed methods have comparable performance whereas the new proposal is superior in terms of model parsimony. Our analysis provides some evidence on the existence of such subtype-specific prognostic genes, more investigation is warranted.

  6. Adaptive L1/2 Shooting Regularization Method for Survival Analysis Using Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Xiao-Ying Liu

    2013-01-01

    Full Text Available A new adaptive L1/2 shooting regularization method for variable selection based on the Cox’s proportional hazards mode being proposed. This adaptive L1/2 shooting algorithm can be easily obtained by the optimization of a reweighed iterative series of L1 penalties and a shooting strategy of L1/2 penalty. Simulation results based on high dimensional artificial data show that the adaptive L1/2 shooting regularization method can be more accurate for variable selection than Lasso and adaptive Lasso methods. The results from real gene expression dataset (DLBCL also indicate that the L1/2 regularization method performs competitively.

  7. Selection of Reference Genes for qRT-PCR Analysis of Gene Expression in Stipa grandis during Environmental Stresses.

    Directory of Open Access Journals (Sweden)

    Dongli Wan

    Full Text Available Stipa grandis P. Smirn. is a dominant plant species in the typical steppe of the Xilingole Plateau of Inner Mongolia. Selection of suitable reference genes for the quantitative real-time reverse transcription polymerase chain reaction (qRT-PCR is important for gene expression analysis and research into the molecular mechanisms underlying the stress responses of S. grandis. In the present study, 15 candidate reference genes (EF1 beta, ACT, GAPDH, SamDC, CUL4, CAP, SNF2, SKIP1, SKIP5, SKIP11, UBC2, UBC15, UBC17, UCH, and HERC2 were evaluated for their stability as potential reference genes for qRT-PCR under different stresses. Four algorithms were used: GeNorm, NormFinder, BestKeeper, and RefFinder. The results showed that the most stable reference genes were different under different stress conditions: EF1beta and UBC15 during drought and salt stresses; ACT and GAPDH under heat stress; SKIP5 and UBC17 under cold stress; UBC15 and HERC2 under high pH stress; UBC2 and UBC15 under wounding stress; EF1beta and UBC17 under jasmonic acid treatment; UBC15 and CUL4 under abscisic acid treatment; and HERC2 and UBC17 under salicylic acid treatment. EF1beta and HERC2 were the most suitable genes for the global analysis of all samples. Furthermore, six target genes, SgPOD, SgPAL, SgLEA, SgLOX, SgHSP90 and SgPR1, were selected to validate the most and least stable reference genes under different treatments. Our results provide guidelines for reference gene selection for more accurate qRT-PCR quantification and will promote studies of gene expression in S. grandis subjected to environmental stress.

  8. Comparison of normalization methods for the analysis of metagenomic gene abundance data.

    Science.gov (United States)

    Pereira, Mariana Buongermino; Wallroth, Mikael; Jonsson, Viktor; Kristiansson, Erik

    2018-04-20

    In shotgun metagenomics, microbial communities are studied through direct sequencing of DNA without any prior cultivation. By comparing gene abundances estimated from the generated sequencing reads, functional differences between the communities can be identified. However, gene abundance data is affected by high levels of systematic variability, which can greatly reduce the statistical power and introduce false positives. Normalization, which is the process where systematic variability is identified and removed, is therefore a vital part of the data analysis. A wide range of normalization methods for high-dimensional count data has been proposed but their performance on the analysis of shotgun metagenomic data has not been evaluated. Here, we present a systematic evaluation of nine normalization methods for gene abundance data. The methods were evaluated through resampling of three comprehensive datasets, creating a realistic setting that preserved the unique characteristics of metagenomic data. Performance was measured in terms of the methods ability to identify differentially abundant genes (DAGs), correctly calculate unbiased p-values and control the false discovery rate (FDR). Our results showed that the choice of normalization method has a large impact on the end results. When the DAGs were asymmetrically present between the experimental conditions, many normalization methods had a reduced true positive rate (TPR) and a high false positive rate (FPR). The methods trimmed mean of M-values (TMM) and relative log expression (RLE) had the overall highest performance and are therefore recommended for the analysis of gene abundance data. For larger sample sizes, CSS also showed satisfactory performance. This study emphasizes the importance of selecting a suitable normalization methods in the analysis of data from shotgun metagenomics. Our results also demonstrate that improper methods may result in unacceptably high levels of false positives, which in turn may lead

  9. Genomic consequences of selection on self-incompatibility genes

    DEFF Research Database (Denmark)

    Schierup, Mikkel Heide; Vekemans, Xavier

    2008-01-01

    closely related species. We review recent empirical studies demonstrating these features and relate the empirical findings to theoretical predictions. We show how these features are being exploited in searches for other genes under multi-allelic balancing selection and for inference on recent breakdown...

  10. Network-Based Method for Identifying Co- Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues.

    Science.gov (United States)

    Chen, Lei; Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Huang, Tao; Cai, Yu-Dong

    2017-10-02

    Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein-protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method.

  11. FCERI AND HISTAMINE METABOLISM GENE VARIABILITY IN SELECTIVE RESPONDERS TO NSAIDS

    Directory of Open Access Journals (Sweden)

    Gemma Amo

    2016-09-01

    Full Text Available The high-affinity IgE receptor (Fcε RI is a heterotetramer of three subunits: Fcε RIα, Fcε RIβ and Fcε RIγ (αβγ2 encoded by three genes designated as FCER1A, FCER1B (MS4A2 and FCER1G, respectively. Recent evidence points to FCERI gene variability as a relevant factor in the risk of developing allergic diseases. Because Fcε RI plays a key role in the events downstream of the triggering factors in immunological response, we hypothesized that FCERI gene variants might be related with the risk of, or with the clinical response to, selective (IgE mediated non-steroidal anti-inflammatory (NSAID hypersensitivity.From a cohort of 314 patients suffering from selective hypersensitivity to metamizole, ibuprofen, diclofenac, paracetamol, acetylsalicylic acid (ASA, propifenazone, naproxen, ketoprofen, dexketoprofen, etofenamate, aceclofenac, etoricoxib, dexibuprofen, indomethacin, oxyphenylbutazone or piroxicam, and 585 unrelated healthy controls that tolerated these NSAIDs, we analyzed the putative effects of the FCERI SNPs FCER1A rs2494262, rs2427837 and rs2251746; FCER1B rs1441586, rs569108 and rs512555; FCER1G rs11587213, rs2070901 and rs11421. Furthermore, in order to identify additional genetic markers which might be associated with the risk of developing selective NSAID hypersensitivity, or which may modify the putative association of FCERI gene variations with risk, we analyzed polymorphisms known to affect histamine synthesis or metabolism, such as rs17740607, rs2073440, rs1801105, rs2052129, rs10156191, rs1049742 and rs1049793 in the HDC, HNMT and DAO genes.No major genetic associations with risk or with clinical presentation, and no gene-gene interactions, or gene-phenotype interactions (including age, gender, IgE concentration, antecedents of atopy, culprit drug or clinical presentation were identified in patients. However, logistic regression analyses indicated that the presence of antecedents of atopy and the DAO SNP rs2052129 (GG

  12. Allopatric integrations selectively change host transcriptomes, leading to varied expression efficiencies of exotic genes in Myxococcus xanthus.

    Science.gov (United States)

    Zhu, Li-Ping; Yue, Xin-Jing; Han, Kui; Li, Zhi-Feng; Zheng, Lian-Shuai; Yi, Xiu-Nan; Wang, Hai-Long; Zhang, You-Ming; Li, Yue-Zhong

    2015-07-22

    Exotic genes, especially clustered multiple-genes for a complex pathway, are normally integrated into chromosome for heterologous expression. The influences of insertion sites on heterologous expression and allotropic expressions of exotic genes on host remain mostly unclear. We compared the integration and expression efficiencies of single and multiple exotic genes that were inserted into Myxococcus xanthus genome by transposition and attB-site-directed recombination. While the site-directed integration had a rather stable chloramphenicol acetyl transferase (CAT) activity, the transposition produced varied CAT enzyme activities. We attempted to integrate the 56-kb gene cluster for the biosynthesis of antitumor polyketides epothilones into M. xanthus genome by site-direction but failed, which was determined to be due to the insertion size limitation at the attB site. The transposition technique produced many recombinants with varied production capabilities of epothilones, which, however, were not paralleled to the transcriptional characteristics of the local sites where the genes were integrated. Comparative transcriptomics analysis demonstrated that the allopatric integrations caused selective changes of host transcriptomes, leading to varied expressions of epothilone genes in different mutants. With the increase of insertion fragment size, transposition is a more practicable integration method for the expression of exotic genes. Allopatric integrations selectively change host transcriptomes, which lead to varied expression efficiencies of exotic genes.

  13. Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease

    Directory of Open Access Journals (Sweden)

    Maria V. Fernández

    2018-04-01

    Full Text Available Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235 with late-onset Alzheimer disease (LOAD. After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L as candidate genes for familial LOAD.

  14. Estimation of genetic variability and selection response for clutch length in dwarf brown-egg layers carrying or not the naked neck gene

    Directory of Open Access Journals (Sweden)

    Tixier-Boichard Michèle

    2003-03-01

    Full Text Available Abstract In order to investigate the possibility of using the dwarf gene for egg production, two dwarf brown-egg laying lines were selected for 16 generations on average clutch length; one line (L1 was normally feathered and the other (L2 was homozygous for the naked neck gene NA. A control line from the same base population, dwarf and segregating for the NA gene, was maintained during the selection experiment under random mating. The average clutch length was normalized using a Box-Cox transformation. Genetic variability and selection response were estimated either with the mixed model methodology, or with the classical methods for calculating genetic gain, as the deviation from the control line, and the realized heritability, as the ratio of the selection response on cumulative selection differentials. Heritability of average clutch length was estimated to be 0.42 ± 0.02, with a multiple trait animal model, whereas the estimates of the realized heritability were lower, being 0.28 and 0.22 in lines L1 and L2, respectively. REML estimates of heritability were found to decline with generations of selection, suggesting a departure from the infinitesimal model, either because a limited number of genes was involved, or their frequencies were changed. The yearly genetic gains in average clutch length, after normalization, were estimated to be 0.37 ± 0.02 and 0.33 ± 0.04 with the classical methods, 0.46 ± 0.02 and 0.43 ± 0.01 with animal model methodology, for lines L1 and L2 respectively, which represented about 30% of the genetic standard deviation on the transformed scale. Selection response appeared to be faster in line L2, homozygous for the NA gene, but the final cumulated selection response for clutch length was not different between the L1 and L2 lines at generation 16.

  15. Enhanced gene ranking approaches using modified trace ratio algorithm for gene expression data

    Directory of Open Access Journals (Sweden)

    Shruti Mishra

    Full Text Available Microarray technology enables the understanding and investigation of gene expression levels by analyzing high dimensional datasets that contain few samples. Over time, microarray expression data have been collected for studying the underlying biological mechanisms of disease. One such application for understanding the mechanism is by constructing a gene regulatory network (GRN. One of the foremost key criteria for GRN discovery is gene selection. Choosing a generous set of genes for the structure of the network is highly desirable. For this role, two suitable methods were proposed for selection of appropriate genes. The first approach comprises a gene selection method called Information gain, where the dataset is reformed and fused with another distinct algorithm called Trace Ratio (TR. Our second method is the implementation of our projected modified TR algorithm, where the scoring base for finding weight matrices has been re-designed. Both the methods' efficiency was shown with different classifiers that include variants of the Artificial Neural Network classifier, such as Resilient Propagation, Quick Propagation, Back Propagation, Manhattan Propagation and Radial Basis Function Neural Network and also the Support Vector Machine (SVM classifier. In the study, it was confirmed that both of the proposed methods worked well and offered high accuracy with a lesser number of iterations as compared to the original Trace Ratio algorithm. Keywords: Gene regulatory network, Gene selection, Information gain, Trace ratio, Canonical correlation analysis, Classification

  16. Methods for the Isolation of Genes Encoding Novel PHA Metabolism Enzymes from Complex Microbial Communities.

    Science.gov (United States)

    Cheng, Jiujun; Nordeste, Ricardo; Trainer, Maria A; Charles, Trevor C

    2017-01-01

    Development of different PHAs as alternatives to petrochemically derived plastics can be facilitated by mining metagenomic libraries for diverse PHA cycle genes that might be useful for synthesis of bio-plastics. The specific phenotypes associated with mutations of the PHA synthesis pathway genes in Sinorhizobium meliloti and Pseudomonas putida, allows the use of powerful selection and screening tools to identify complementing novel PHA synthesis genes. Identification of novel genes through their function rather than sequence facilitates the functional proteins that may otherwise have been excluded through sequence-only screening methodology. We present here methods that we have developed for the isolation of clones expressing novel PHA metabolism genes from metagenomic libraries.

  17. Methods for the isolation of genes encoding novel PHB cycle enzymes from complex microbial communities.

    Science.gov (United States)

    Nordeste, Ricardo F; Trainer, Maria A; Charles, Trevor C

    2010-01-01

    Development of different PHAs as alternatives to petrochemically derived plastics can be facilitated by mining metagenomic libraries for diverse PHA cycle genes that might be useful for synthesis of bioplastics. The specific phenotypes associated with mutations of the PHA synthesis pathway genes in Sinorhizobium meliloti allows for the use of powerful selection and screening tools to identify complementing novel PHA synthesis genes. Identification of novel genes through their function rather than sequence facilitates finding functional proteins that may otherwise have been excluded through sequence-only screening methodology. We present here methods that we have developed for the isolation of clones expressing novel PHA metabolism genes from metagenomic libraries.

  18. [Methylation of selected tumor-supressor genes in benign and malignant ovarian tumors].

    Science.gov (United States)

    Cul'bová, M; Lasabová, Z; Stanclová, A; Tilandyová, P; Zúbor, P; Fiolka, R; Danko, J; Visnovský, J

    2011-09-01

    To evaluate the usefullness of examination of methylation status of selected tumor-supressor genes in early diagnosis of ovarian cancer. Prospective clinical study. Department of Gynecology and Obstetrics, Department of Molecular Biology, Jessenius Medical Faculty, Commenius University, Martin, Slovak Republic. In this study we analyzed hypermethylation of 5 genes RASSF1A, GSTP, E-cadherin, p16 and APC in ovarian tumor samples from 34 patients - 13 patients with epithelial ovarian cancer, 2 patients with border-line ovarian tumors, 12 patients with benign lesions of ovaries and 7 patients with healthy ovarian tissue. The methylation status of promoter region of tumor-supressor genes was determined by Methylation Specific Polymerase Chain Reaction (MSP) using a nested two-step approach with bisulfite modified DNA template and specific primers. Gene methylation analysis revealed hypermethylation of gene RASSF1A (46%) and GSTP (8%) only in malignant ovarian tissue samples. Ecad, p16 and APC genes were methylated both in maignant and benign tissue samples. Methylation positivity in observed genes was present independently to all clinical stages of ovarian cancer and to tumor grades. However, there was observed a trend of increased number and selective involvement of methylated genes with increasing disease stages. Furthermore, there was no association between positive methylation status and histological subtypes of ovarian carcinomas. RASSF1A and GSTP promoter methylation positivity is associated with ovarian cancer. The revealed gene-selective methylation positivity and the increased number of methylated genes with advancing disease stages could be considered as a useful molecular marker for early detection of ovarian cancer. However, there is need to find diagnostic approach of specifically and frequently methylated genes to determining a methylation phenotype for early detection of ovarian malignancies.

  19. Development of a rapid method for direct detection of tet(M) genes in soil from Danish farmland

    DEFF Research Database (Denmark)

    Agersø, Yvonne; Sengeløv, Gitte; Jensen, Lars Bogø

    2004-01-01

    . The tet(M) gene was directly detected in 10-80% of the samples from the various farmland soils and could be detected in all samples tested after selective enrichment. To validate the obtained results, the method was applied to garden soil samples where lower prevalence of resistance was found. Result......A method for direct detection of antibiotic resistance genes in soil samples has been developed. The tetracycline resistance gene, tet(M), was used as a model. The method was validated on Danish farmland soil that had repeatedly been treated with pig manure slurry containing resistant bacteria......: A detection limit of 10(2)-10(3) copies of the tet(M) gene per gram of soil (in a Bacillus cereus group bacterium) was achieved. tet(M) gene was detected in soil samples with the highest prevalence on farmland treated with pig manure slurry....

  20. Investigation of the molecular relationship between breast cancer and obesity by candidate gene prioritization methods

    Directory of Open Access Journals (Sweden)

    Saba Garshasbi

    2015-10-01

    Full Text Available Background: Cancer and obesity are two major public health concerns. More than 12 million cases of cancer are reported annually. Many reports confirmed obesity as a risk factor for cancer. The molecular relationship between obesity and breast cancer has not been clear yet. The purpose of this study was to investigate priorities of effective genes in the molecular relationship between obesity and breast cancer. Methods: In this study, computer simulation method was used for prioritizing the genes that involved in the molecular links between obesity and breast cancer in laboratory of systems biology and bioinformatics (LBB, Tehran University, Tehran, Iran, from March to July 2014. In this study, ENDEAVOUR software was used for prioritizing the genes and integrating multiple data sources was used for data analysis. Training genes were selected from effective genes in obesity and/or breast cancer. Two groups of candidate genes were selected. The first group was included the existential genes in 5 common region chromosomes (between obesity and breast cancer and the second group was included the results of genes microarray data analysis of research Creighton, et al (In 2012 on patients with breast cancer. The microarray data were analyzed with GER2 software (R online software on GEO website. Finally, both training and candidate genes were entered in ENDEAVOUR software package. Results: The candidate genes were prioritized to four style and five genes in ten of the first priorities were repeated twice. In other word, the outcome of prioritizing of 72 genes (Product of microarray data analysis and genes of 5 common chromosome regions (Between obesity and breast cancer showed, 5 genes (TNFRSF10B, F2, IGFALS, NTRK3 and HSP90B1 were the priorities in the molecular connection between obesity and breast cancer. Conclusion: There are some common genes between breast cancer and obesity. So, molecular relationship is confirmed. In this study the possible effect

  1. Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs

    Directory of Open Access Journals (Sweden)

    Ye Zhi-Qiang

    2011-08-01

    Full Text Available Abstract Background Differential coexpression analysis (DCEA is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from those trivial ones. Especially, the correlation-reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links. Bearing the uniqueness of exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D expression dataset from the perspective of differential coexpression analysis led to additional discoveries than those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will make contribution to the development of DCEA and help extend it to a broader spectrum.

  2. HMM-Based Gene Annotation Methods

    Energy Technology Data Exchange (ETDEWEB)

    Haussler, David; Hughey, Richard; Karplus, Keven

    1999-09-20

    Development of new statistical methods and computational tools to identify genes in human genomic DNA, and to provide clues to their functions by identifying features such as transcription factor binding sites, tissue, specific expression and splicing patterns, and remove homologies at the protein level with genes of known function.

  3. A high efficiency gene disruption strategy using a positive-negative split selection marker and electroporation for Fusarium oxysporum.

    Science.gov (United States)

    Liang, Liqin; Li, Jianqiang; Cheng, Lin; Ling, Jian; Luo, Zhongqin; Bai, Miao; Xie, Bingyan

    2014-11-01

    The Fusarium oxysporum species complex consists of fungal pathogens that cause serial vascular wilt disease on more than 100 cultivated species throughout the world. Gene function analysis is rapidly becoming more and more important as the whole-genome sequences of various F. oxysporum strains are being completed. Gene-disruption techniques are a common molecular tool for studying gene function, yet are often a limiting step in gene function identification. In this study we have developed a F. oxysporum high-efficiency gene-disruption strategy based on split-marker homologous recombination cassettes with dual selection and electroporation transformation. The method was efficiently used to delete three RNA-dependent RNA polymerase (RdRP) genes. The gene-disruption cassettes of three genes can be constructed simultaneously within a short time using this technique. The optimal condition for electroporation is 10μF capacitance, 300Ω resistance, 4kV/cm field strength, with 1μg of DNA (gene-disruption cassettes). Under these optimal conditions, we were able to obtain 95 transformants per μg DNA. And after positive-negative selection, the transformants were efficiently screened by PCR, screening efficiency averaged 85%: 90% (RdRP1), 85% (RdRP2) and 77% (RdRP3). This gene-disruption strategy should pave the way for high throughout genetic analysis in F. oxysporum. Copyright © 2014 Elsevier GmbH. All rights reserved.

  4. Targeting of breast metastases using a viral gene vector with tumour-selective transcription.

    LENUS (Irish Health Repository)

    Rajendran, Simon

    2012-01-31

    BACKGROUND: Adeno-associated virus (AAV) vectors have significant potential as gene delivery vectors for cancer gene therapy. However, broad AAV2 tissue tropism results in nonspecific gene expression. MATERIALS AND METHODS: We investigated use of the C-X-C chemokine receptor type 4 (CXCR4) promoter to restrict AAV expression to tumour cells, in subcutaneous MCF-7 xenograft mouse models of breast cancer and in patient samples, using bioluminescent imaging and flow cytometric analysis. RESULTS: Higher transgene expression levels were observed in subcutaneous MCF-7 tumours relative to normal tissue (muscle) using the CXCR4 promoter, unlike a ubiquitously expressing Cytomegalovirus promoter construct, with preferential AAVCXCR4 expression in epithelial tumour and CXCR4-positive cells. Transgene expression following intravenously administered AAVCXCR4 in a model of liver metastasis was detected specifically in livers of tumour bearing mice. Ex vivo analysis using patient samples also demonstrated higher AAVCXCR4 expression in tumour compared with normal liver tissue. CONCLUSION: This study demonstrates for the first time, the potential for systemic administration of AAV2 vector for tumour-selective gene therapy.

  5. Flexible metabolic pathway construction using modular and divisible selection gene regulators

    DEFF Research Database (Denmark)

    Rugbjerg, Peter; Myling-Petersen, Nils; Sommer, Morten Otto Alexander

    2015-01-01

    Genetic selections are important to biological engineering. Although selectable traits are limited,currently each trait only permits simultaneous introduction of a single DNA fragment. Complex pathwayand strain construction however depends on rapid, combinatorial introduction of many genes thaten...

  6. Combining gene prediction methods to improve metagenomic gene annotation

    Directory of Open Access Journals (Sweden)

    Rosen Gail L

    2011-01-01

    Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.

  7. Global patterns of diversity and selection in human tyrosinase gene.

    Science.gov (United States)

    Hudjashov, Georgi; Villems, Richard; Kivisild, Toomas

    2013-01-01

    Global variation in skin pigmentation is one of the most striking examples of environmental adaptation in humans. More than two hundred loci have been identified as candidate genes in model organisms and a few tens of these have been found to be significantly associated with human skin pigmentation in genome-wide association studies. However, the evolutionary history of different pigmentation genes is rather complex: some loci have been subjected to strong positive selection, while others evolved under the relaxation of functional constraints in low UV environment. Here we report the results of a global study of the human tyrosinase gene, which is one of the key enzymes in melanin production, to assess the role of its variation in the evolution of skin pigmentation differences among human populations. We observe a higher rate of non-synonymous polymorphisms in the European sample consistent with the relaxation of selective constraints. A similar pattern was previously observed in the MC1R gene and concurs with UV radiation-driven model of skin color evolution by which mutations leading to lower melanin levels and decreased photoprotection are subject to purifying selection at low latitudes while being tolerated or even favored at higher latitudes because they facilitate UV-dependent vitamin D production. Our coalescent date estimates suggest that the non-synonymous variants, which are frequent in Europe and North Africa, are recent and have emerged after the separation of East and West Eurasian populations.

  8. Genetic diversity of selected genes that are potentially economically important in feral sheep of New Zealand

    Directory of Open Access Journals (Sweden)

    Sedcole J Richard

    2010-12-01

    Full Text Available Abstract Background Feral sheep are considered to be a source of genetic variation that has been lost from their domestic counterparts through selection. Methods This study investigates variation in the genes KRTAP1-1, KRT33, ADRB3 and DQA2 in Merino-like feral sheep populations from New Zealand and its offshore islands. These genes have previously been shown to influence wool, lamb survival and animal health. Results All the genes were polymorphic, but no new allele was identified in the feral populations. In some of these populations, allele frequencies differed from those observed in commercial Merino sheep and other breeds found in New Zealand. Heterozygosity levels were comparable to those observed in other studies on feral sheep. Our results suggest that some of the feral populations may have been either inbred or outbred over the duration of their apparent isolation. Conclusion The variation described here allows us to draw some conclusions about the likely genetic origin of the populations and selective pressures that may have acted upon them, but they do not appear to be a source of new genetic material, at least for these four genes.

  9. Mining method selection by integrated AHP and PROMETHEE method.

    Science.gov (United States)

    Bogdanovic, Dejan; Nikolic, Djordje; Ilic, Ivana

    2012-03-01

    Selecting the best mining method among many alternatives is a multicriteria decision making problem. The aim of this paper is to demonstrate the implementation of an integrated approach that employs AHP and PROMETHEE together for selecting the most suitable mining method for the "Coka Marin" underground mine in Serbia. The related problem includes five possible mining methods and eleven criteria to evaluate them. Criteria are accurately chosen in order to cover the most important parameters that impact on the mining method selection, such as geological and geotechnical properties, economic parameters and geographical factors. The AHP is used to analyze the structure of the mining method selection problem and to determine weights of the criteria, and PROMETHEE method is used to obtain the final ranking and to make a sensitivity analysis by changing the weights. The results have shown that the proposed integrated method can be successfully used in solving mining engineering problems.

  10. A complex selection signature at the human AVPR1B gene

    Directory of Open Access Journals (Sweden)

    Cagliani Rachele

    2009-06-01

    Full Text Available Abstract Background The vasopressin receptor type 1b (AVPR1B is mainly expressed by pituitary corticotropes and it mediates the stimulatory effects of AVP on ACTH release; common AVPR1B haplotypes have been involved in mood and anxiety disorders in humans, while rodents lacking a functional receptor gene display behavioral defects and altered stress responses. Results Here we have analyzed the two exons of the gene and the data we present suggest that AVPR1B has been subjected to natural selection in humans. In particular, analysis of exon 2 strongly suggests the action of balancing selection in African populations and Europeans: the region displays high nucleotide diversity, an excess of intermediate-frequency alleles, a higher level of within-species diversity compared to interspecific divergence and a genealogy with common haplotypes separated by deep branches. This relatively unambiguous situation coexists with unusual features across exon 1, raising the possibility that a nonsynonymous variant (Gly191Arg in this region has been subjected to directional selection. Conclusion Although the underlying selective pressure(s remains to be identified, we consider this to be among the first documented examples of a gene involved in mood disorders and subjected to natural selection in humans; this observation might add support to the long-debated idea that depression/low mood might have played an adaptive role during human evolution.

  11. A complex selection signature at the human AVPR1B gene.

    Science.gov (United States)

    Cagliani, Rachele; Fumagalli, Matteo; Pozzoli, Uberto; Riva, Stefania; Cereda, Matteo; Comi, Giacomo P; Pattini, Linda; Bresolin, Nereo; Sironi, Manuela

    2009-06-01

    The vasopressin receptor type 1b (AVPR1B) is mainly expressed by pituitary corticotropes and it mediates the stimulatory effects of AVP on ACTH release; common AVPR1B haplotypes have been involved in mood and anxiety disorders in humans, while rodents lacking a functional receptor gene display behavioral defects and altered stress responses. Here we have analyzed the two exons of the gene and the data we present suggest that AVPR1B has been subjected to natural selection in humans. In particular, analysis of exon 2 strongly suggests the action of balancing selection in African populations and Europeans: the region displays high nucleotide diversity, an excess of intermediate-frequency alleles, a higher level of within-species diversity compared to interspecific divergence and a genealogy with common haplotypes separated by deep branches. This relatively unambiguous situation coexists with unusual features across exon 1, raising the possibility that a nonsynonymous variant (Gly191Arg) in this region has been subjected to directional selection. Although the underlying selective pressure(s) remains to be identified, we consider this to be among the first documented examples of a gene involved in mood disorders and subjected to natural selection in humans; this observation might add support to the long-debated idea that depression/low mood might have played an adaptive role during human evolution.

  12. Lentiviral gene ontology (LeGO) vectors equipped with novel drug-selectable fluorescent proteins: new building blocks for cell marking and multi-gene analysis.

    Science.gov (United States)

    Weber, K; Mock, U; Petrowitz, B; Bartsch, U; Fehse, B

    2010-04-01

    Vector-encoded fluorescent proteins (FPs) facilitate unambiguous identification or sorting of gene-modified cells by fluorescence-activated cell sorting (FACS). Exploiting this feature, we have recently developed lentiviral gene ontology (LeGO) vectors (www.LentiGO-Vectors.de) for multi-gene analysis in different target cells. In this study, we extend the LeGO principle by introducing 10 different drug-selectable FPs created by fusing one of the five selection marker (protecting against blasticidin, hygromycin, neomycin, puromycin and zeocin) and one of the five FP genes (Cerulean, eGFP, Venus, dTomato and mCherry). All tested fusion proteins allowed both fluorescence-mediated detection and drug-mediated selection of LeGO-transduced cells. Newly generated codon-optimized hygromycin- and neomycin-resistance genes showed improved expression as compared with their ancestors. New LeGO constructs were produced at titers >10(6) per ml (for non-concentrated supernatants). We show efficient combinatorial marking and selection of various cells, including mesenchymal stem cells, simultaneously transduced with different LeGO constructs. Inclusion of the cytomegalovirus early enhancer/chicken beta-actin promoter into LeGO vectors facilitated robust transgene expression in and selection of neural stem cells and their differentiated progeny. We suppose that the new drug-selectable markers combining advantages of FACS and drug selection are well suited for numerous applications and vector systems. Their inclusion into LeGO vectors opens new possibilities for (stem) cell tracking and functional multi-gene analysis.

  13. Recurrent selection on the Winters sex-ratio genes in Drosophila simulans.

    Science.gov (United States)

    Kingan, Sarah B; Garrigan, Daniel; Hartl, Daniel L

    2010-01-01

    Selfish genes, such as meiotic drive elements, propagate themselves through a population without increasing the fitness of host organisms. X-linked (or Y-linked) meiotic drive elements reduce the transmission of the Y (X) chromosome and skew progeny and population sex ratios, leading to intense conflict among genomic compartments. Drosophila simulans is unusual in having a least three distinct systems of X chromosome meiotic drive. Here, we characterize naturally occurring genetic variation at the Winters sex-ratio driver (Distorter on the X or Dox), its progenitor gene (Mother of Dox or MDox), and its suppressor gene (Not Much Yang or Nmy), which have been previously mapped and characterized. We survey three North American populations as well as 13 globally distributed strains and present molecular polymorphism data at the three loci. We find that all three genes show signatures of selection in North America, judging from levels of polymorphism and skews in the site-frequency spectrum. These signatures likely result from the biased transmission of the driver and selection on the suppressor for the maintenance of equal sex ratios. Coalescent modeling indicates that the timing of selection is more recent than the age of the alleles, suggesting that the driver and suppressor are coevolving under an evolutionary "arms race." None of the Winters sex-ratio genes are fixed in D. simulans, and at all loci we find ancestral alleles, which lack the gene insertions and exhibit high levels of nucleotide polymorphism compared to the derived alleles. In addition, we find several "null" alleles that have mutations on the derived Dox background, which result in loss of drive function. We discuss the possible causes of the maintenance of presence-absence polymorphism in the Winters sex-ratio genes.

  14. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  15. B lymphocyte selection and age-related changes in VH gene usage in mutant Alicia rabbits.

    Science.gov (United States)

    Zhu, X; Boonthum, A; Zhai, S K; Knight, K L

    1999-09-15

    Young Alicia rabbits use VHa-negative genes, VHx and VHy, in most VDJ genes, and their serum Ig is VHa negative. However, as Alicia rabbits age, VHa2 allotype Ig is produced at high levels. We investigated which VH gene segments are used in the VDJ genes of a2 Ig-secreting hybridomas and of a2 Ig+ B cells from adult Alicia rabbits. We found that 21 of the 25 VDJ genes used the a2-encoding genes, VH4 or VH7; the other four VDJ genes used four unknown VH gene segments. Because VH4 and VH7 are rarely found in VDJ genes of normal or young Alicia rabbits, we investigated the timing of rearrangement of these genes in Alicia rabbits. During fetal development, VH4 was used in 60-80% of nonproductively rearranged VDJ genes, and VHx and VHy together were used in 10-26%. These data indicate that during B lymphopoiesis VH4 is preferentially rearranged. However, the percentage of productive VHx- and VHy-utilizing VDJ genes increased from 38% at day 21 of gestation to 89% at birth (gestation day 31), whereas the percentage of VH4-utilizing VDJ genes remained at 15%. These data suggest that during fetal development, either VH4-utilizing B-lineage cells are selectively eliminated, or B cells with VHx- and VHy-utilizing VDJ genes are selectively expanded, or both. The accumulation of peripheral VH4-utilizing a2 B cells with age indicates that these B cells might be selectively expanded in the periphery. We discuss the possible selection mechanisms that regulate VH gene segment usage in rabbit B cells during lymphopoiesis and in the periphery.

  16. A Selection Method for COTS Systems

    DEFF Research Database (Denmark)

    Hedman, Jonas

    new skills and methods supporting the process of evaluating and selecting information systems. This paper presents a method for selecting COTS systems. The method includes the following phases: problem framing, requirements and appraisal, and selection of systems. The idea and distinguishing feature...... behind the method is that improved understanding of organizational' ends' or goals should govern the selection of a COTS system. This can also be expressed as a match or fit between ‘ends' (e.g. improved organizational effectiveness) and ‘means' (e.g. implementing COTS systems). This way of approaching...

  17. Expression stability and selection of optimal reference genes for gene expression normalization in early life stage rainbow trout exposed to cadmium and copper.

    Science.gov (United States)

    Shekh, Kamran; Tang, Song; Niyogi, Som; Hecker, Markus

    2017-09-01

    Gene expression analysis represents a powerful approach to characterize the specific mechanisms by which contaminants interact with organisms. One of the key considerations when conducting gene expression analyses using quantitative real-time reverse transcription-polymerase chain reaction (qPCR) is the selection of appropriate reference genes, which is often overlooked. Specifically, to reach meaningful conclusions when using relative quantification approaches, expression levels of reference genes must be highly stable and cannot vary as a function of experimental conditions. However, to date, information on the stability of commonly used reference genes across developmental stages, tissues and after exposure to contaminants such as metals is lacking for many vertebrate species including teleost fish. Therefore, in this study, we assessed the stability of expression of 8 reference gene candidates in the gills and skin of three different early life-stages of rainbow trout after acute exposure (24h) to two metals, cadmium (Cd) and copper (Cu) using qPCR. Candidate housekeeping genes were: beta actin (b-actin), DNA directed RNA polymerase II subunit I (DRP2), elongation factor-1 alpha (EF1a), glyceraldehyde 3-phosphate dehydrogenase (GAPDH), glucose-6-phosphate dehydrogenase (G6PD), hypoxanthine phosphoribosyltransferase (HPRT), ribosomal protein L8 (RPL8), and 18S ribosomal RNA (18S). Four algorithms, geNorm, NormFinder, BestKeeper, and the comparative ΔCt method were employed to systematically evaluate the expression stability of these candidate genes under control and exposed conditions as well as across three different life-stages. Finally, stability of genes was ranked by taking geometric means of the ranks established by the different methods. Stability of reference genes was ranked in the following order (from lower to higher stability): HPRT

  18. Selecting patients with young-onset colorectal cancer for mismatch repair gene analysis

    DEFF Research Database (Denmark)

    Walker, M; O'Sullivan, B; Perakath, B

    2007-01-01

    BACKGROUND: Young patients with colorectal cancer are at increased risk of carrying a germline mutation in mismatch repair (MMR) genes. This study investigated the role of clinical criteria and immunohistochemistry for MMR proteins in selecting young patients for mutation testing. METHODS: A cohort...... of 56 consecutive patients with colorectal cancer aged less than 45 years were stratified into three groups based on clinical criteria: 'Amsterdam criteria', 'high risk' and 'young onset only'. Immunohistochemistry for four MMR proteins was carried out and the rate of compliance with clinical guidelines...

  19. Selection of reference genes for qPCR in hairy root cultures of peanut

    Directory of Open Access Journals (Sweden)

    Medrano Giuliana

    2011-10-01

    Full Text Available Abstract Background Hairy root cultures produced via Agrobacterium rhizogenes-mediated transformation have emerged as practical biological models to elucidate the biosynthesis of specialized metabolites. To effectively understand the expression patterns of the genes involved in the metabolic pathways of these compounds, reference genes need to be systematically validated under specific experimental conditions as established by the MIQE (Minimum Information for Publication of Quantitative Real-Time PCR Experiments guidelines. In the present report we describe the first validation of reference genes for RT-qPCR in hairy root cultures of peanut which produce stilbenoids upon elicitor treatments. Results A total of 21 candidate reference genes were evaluated. Nineteen genes were selected based on previous qPCR studies in plants and two were from the T-DNAs transferred from A. rhizogenes. Nucleotide sequences of peanut candidate genes were obtained using their homologous sequences in Arabidopsis. To identify the suitable primers, calibration curves were obtained for each candidate reference gene. After data analysis, 12 candidate genes meeting standard efficiency criteria were selected. The expression stability of these genes was analyzed using geNorm and NormFinder algorithms and a ranking was established based on expression stability of the genes. Candidate reference gene expression was shown to have less variation in methyl jasmonate (MeJA treated root cultures than those treated with sodium acetate (NaOAc. Conclusions This work constitutes the first effort to validate reference genes for RT-qPCR in hairy roots. While these genes were selected under conditions of NaOAc and MeJA treatment, we anticipate these genes to provide good targets for reference genes for hairy roots under a variety of stress conditions. The lead reference genes were a gene encoding for a TATA box binding protein (TBP2 and a gene encoding a ribosomal protein (RPL8C. A

  20. Diagnosis of Neisseria gonorrhoeae among pregnant women by culture method and PCR on cppB gene

    Directory of Open Access Journals (Sweden)

    Jalal Mardaneh

    2013-11-01

    Full Text Available Background: Neisseria gonorrhoeae is a human obligate pathogen and the etiological agent of gonorrhea. Health irreparable complications resulting from gonorrhea disease occur mainly in pregnant women and neonates. Aim of this study was diagnosis of Neisseria gonorrhoeae among pregnant women with using culture and molecular method by amplification of cppB gene with PCR. Material and Methods: In this cross-sectional study, two endocervical swab specimens were obtained from 1100 pregnant women who referred to Shiraz Hospitals. Culture on nonselective and selective media and nucleic acid amplification test (NAAT were performed for detection of Neisseria gonorrhoeae cppB gene. Results: All endocervical swabs cultures on selective and nonselective media were negative for Neisseria gonorrhoeae. Among examined endocervical swabs, 13samples (1.18% were positive by nucleic acid amplification of Neisseria gonorrgoeae cppB gene. Conclusion: Negative results of culture and positive results of PCR in this study indicate that however culture is gold standard method for detection of Neisseria gonorrhoeae but due to bacterial autolysis, poor sampling techniques and improper specimen storage and transport, its value decline as compared with Nucleic acid amplification test (NAAT.

  1. Selection of reference genes for transcriptional analysis of edible tubers of potato (Solanum tuberosum L..

    Directory of Open Access Journals (Sweden)

    Roberta Fogliatto Mariot

    Full Text Available Potato (Solanum tuberosum yield has increased dramatically over the last 50 years and this has been achieved by a combination of improved agronomy and biotechnology efforts. Gene studies are taking place to improve new qualities and develop new cultivars. Reverse transcriptase quantitative polymerase chain reaction (RT-qPCR is a bench-marking analytical tool for gene expression analysis, but its accuracy is highly dependent on a reliable normalization strategy of an invariant reference genes. For this reason, the goal of this work was to select and validate reference genes for transcriptional analysis of edible tubers of potato. To do so, RT-qPCR primers were designed for ten genes with relatively stable expression in potato tubers as observed in RNA-Seq experiments. Primers were designed across exon boundaries to avoid genomic DNA contamination. Differences were observed in the ranking of candidate genes identified by geNorm, NormFinder and BestKeeper algorithms. The ranks determined by geNorm and NormFinder were very similar and for all samples the most stable candidates were C2, exocyst complex component sec3 (SEC3 and ATCUL3/ATCUL3A/CUL3/CUL3A (CUL3A. According to BestKeeper, the importin alpha and ubiquitin-associated/ts-n genes were the most stable. Three genes were selected as reference genes for potato edible tubers in RT-qPCR studies. The first one, called C2, was selected in common by NormFinder and geNorm, the second one is SEC3, selected by NormFinder, and the third one is CUL3A, selected by geNorm. Appropriate reference genes identified in this work will help to improve the accuracy of gene expression quantification analyses by taking into account differences that may be observed in RNA quality or reverse transcription efficiency across the samples.

  2. Selection of reference genes for transcriptional analysis of edible tubers of potato (Solanum tuberosum L.).

    Science.gov (United States)

    Mariot, Roberta Fogliatto; de Oliveira, Luisa Abruzzi; Voorhuijzen, Marleen M; Staats, Martijn; Hutten, Ronald C B; Van Dijk, Jeroen P; Kok, Esther; Frazzon, Jeverson

    2015-01-01

    Potato (Solanum tuberosum) yield has increased dramatically over the last 50 years and this has been achieved by a combination of improved agronomy and biotechnology efforts. Gene studies are taking place to improve new qualities and develop new cultivars. Reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) is a bench-marking analytical tool for gene expression analysis, but its accuracy is highly dependent on a reliable normalization strategy of an invariant reference genes. For this reason, the goal of this work was to select and validate reference genes for transcriptional analysis of edible tubers of potato. To do so, RT-qPCR primers were designed for ten genes with relatively stable expression in potato tubers as observed in RNA-Seq experiments. Primers were designed across exon boundaries to avoid genomic DNA contamination. Differences were observed in the ranking of candidate genes identified by geNorm, NormFinder and BestKeeper algorithms. The ranks determined by geNorm and NormFinder were very similar and for all samples the most stable candidates were C2, exocyst complex component sec3 (SEC3) and ATCUL3/ATCUL3A/CUL3/CUL3A (CUL3A). According to BestKeeper, the importin alpha and ubiquitin-associated/ts-n genes were the most stable. Three genes were selected as reference genes for potato edible tubers in RT-qPCR studies. The first one, called C2, was selected in common by NormFinder and geNorm, the second one is SEC3, selected by NormFinder, and the third one is CUL3A, selected by geNorm. Appropriate reference genes identified in this work will help to improve the accuracy of gene expression quantification analyses by taking into account differences that may be observed in RNA quality or reverse transcription efficiency across the samples.

  3. A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees

    DEFF Research Database (Denmark)

    Nielsen, Rasmus; Bustamente, Carlos; Clark, Andrew G.

    2005-01-01

    Since the divergence of humans and chimpanzees about 5 million years ago, these species have undergone a remarkable evolution with drastic divergence in anatomy and cognitive abilities. At the molecular level, despite the small overall magnitude of DNA sequence divergence, we might expect...... such evolutionary changes to leave a noticeable signature throughout the genome. We here compare 13,731 annotated genes from humans to their chimpanzee orthologs to identify genes that show evidence of positive selection. Many of the genes that present a signature of positive selection tend to be involved...

  4. A hybrid gene selection approach for microarray data classification using cellular learning automata and ant colony optimization.

    Science.gov (United States)

    Vafaee Sharbaf, Fatemeh; Mosafer, Sara; Moattar, Mohammad Hossein

    2016-06-01

    This paper proposes an approach for gene selection in microarray data. The proposed approach consists of a primary filter approach using Fisher criterion which reduces the initial genes and hence the search space and time complexity. Then, a wrapper approach which is based on cellular learning automata (CLA) optimized with ant colony method (ACO) is used to find the set of features which improve the classification accuracy. CLA is applied due to its capability to learn and model complicated relationships. The selected features from the last phase are evaluated using ROC curve and the most effective while smallest feature subset is determined. The classifiers which are evaluated in the proposed framework are K-nearest neighbor; support vector machine and naïve Bayes. The proposed approach is evaluated on 4 microarray datasets. The evaluations confirm that the proposed approach can find the smallest subset of genes while approaching the maximum accuracy. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. Selection in the dopamine receptor 2 gene: a candidate SNP study

    Directory of Open Access Journals (Sweden)

    Tobias Göllner

    2015-08-01

    Full Text Available Dopamine is a major neurotransmitter in the human brain and is associated with various diseases. Schizophrenia, for example, is treated by blocking the dopamine receptors type 2. Shaner, Miller & Mintz (2004 stated that schizophrenia was the low fitness variant of a highly variable mental trait. We therefore explore whether the dopamine receptor 2 gene (DRD2 underwent any selection processes. We acquired genotype data of the 1,000 Genomes project (phase I, which contains 1,093 individuals from 14 populations. We included single nucleotide polymorphisms (SNPs with two minor allele frequencies (MAFs in the analysis: MAF over 0.05 and over 0.01. This is equivalent to 151 SNPs (MAF > 0.05 and 246 SNPs (MAF > 0.01 for DRD2. We used two different approaches (an outlier approach and a Bayesian approach to detect loci under selection. The combined results of both approaches yielded nine (MAF > 0.05 and two candidate SNPs (MAF > 0.01, under balancing selection. We also found weak signs for directional selection on DRD2, but in our opinion these were too weak to draw any final conclusions on directional selection in DRD2. All candidates for balancing selection are in the intronic region of the gene and only one (rs12574471 has been mentioned in the literature. Two of our candidate SNPs are located in specific regions of the gene: rs80215768 lies within a promoter flanking region and rs74751335 lies within a transcription factor binding site. We strongly encourage research on our candidate SNPs and their possible effects.

  6. Selecting Question-Specific Genes to Reduce Incongruence in Phylogenomics: A Case Study of Jawed Vertebrate Backbone Phylogeny.

    Science.gov (United States)

    Chen, Meng-Yun; Liang, Dan; Zhang, Peng

    2015-11-01

    Incongruence between different phylogenomic analyses is the main challenge faced by phylogeneticists in the genomic era. To reduce incongruence, phylogenomic studies normally adopt some data filtering approaches, such as reducing missing data or using slowly evolving genes, to improve the signal quality of data. Here, we assembled a phylogenomic data set of 58 jawed vertebrate taxa and 4682 genes to investigate the backbone phylogeny of jawed vertebrates under both concatenation and coalescent-based frameworks. To evaluate the efficiency of extracting phylogenetic signals among different data filtering methods, we chose six highly intractable internodes within the backbone phylogeny of jawed vertebrates as our test questions. We found that our phylogenomic data set exhibits substantial conflicting signal among genes for these questions. Our analyses showed that non-specific data sets that are generated without bias toward specific questions are not sufficient to produce consistent results when there are several difficult nodes within a phylogeny. Moreover, phylogenetic accuracy based on non-specific data is considerably influenced by the size of data and the choice of tree inference methods. To address such incongruences, we selected genes that resolve a given internode but not the entire phylogeny. Notably, not only can this strategy yield correct relationships for the question, but it also reduces inconsistency associated with data sizes and inference methods. Our study highlights the importance of gene selection in phylogenomic analyses, suggesting that simply using a large amount of data cannot guarantee correct results. Constructing question-specific data sets may be more powerful for resolving problematic nodes. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Expression of Root Genes in Arabidopsis Seedlings Grown by Standard and Improved Growing Methods.

    Science.gov (United States)

    Qu, Yanli; Liu, Shuai; Bao, Wenlong; Xue, Xian; Ma, Zhengwen; Yokawa, Ken; Baluška, František; Wan, Yinglang

    2017-05-03

    Roots of Arabidopsis thaliana seedlings grown in the laboratory using the traditional plant-growing culture system (TPG) were covered to maintain them in darkness. This new method is based on a dark chamber and is named the improved plant-growing method (IPG). We measured the light conditions in dark chambers, and found that the highest light intensity was dramatically reduced deeper in the dark chamber. In the bottom and side parts of dark chambers, roots were almost completely shaded. Using the high-throughput RNA sequencing method on the whole RNA extraction from roots, we compared the global gene expression levels in roots of seedlings from these two conditions and identified 141 differently expressed genes (DEGs) between them. According to the KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment, the flavone and flavonol biosynthesis and flavonoid biosynthesis pathways were most affected among all annotated pathways. Surprisingly, no genes of known plant photoreceptors were identified as DEGs by this method. Considering that the light intensity was decreased in the IPG system, we collected four sections (1.5 cm for each) of Arabidopsis roots grown in TPG and IPG conditions, and the spatial-related differential gene expression levels of plant photoreceptors and polar auxin transporters, including CRY1 , CRY2 , PHYA , PHYB , PHOT1 , PHOT2 , and UVR8 were analyzed by qRT-PCR. Using these results, we generated a map of the spatial-related expression patterns of these genes under IPG and TPG conditions. The expression levels of light-related genes in roots is highly sensitive to illumination and it provides a background reference for selecting an improved culture method for laboratory-maintained Arabidopsis seedlings.

  8. Tagging of blast resistance gene(s) to DNA markers and marker-assisted selection (MAS) in rice improvement

    International Nuclear Information System (INIS)

    Zhuang, J.Y.; Lu, J.; Qian, H.R.; Lin, H.X.; Zheng, K.L.

    1998-01-01

    This paper reports progress made on the tagging of blast resistance gene(s) to DNA markers and on the initiation of marker-assisted selection (MAS) for blast resistance in rice improvement. A pair of near isogenic lines, K8OR and K79S, were developed using a Chinese landrace Hong-jiao-zhan as the resistance donor. Ten putatively positive markers were identified by screening 177 mapped DNA markers. Using the F 2 population of 143 plants and the derived F 3 lines, three Restriction Fragment Length Polymorphism (RFLP) markers (RG81, RG869 and RZ397) on chromosome 12 of rice were identified to be closely linked to the blast resistance gene Pi-12(t). The genetic distance between Pi-12(t) and the closest marker RG869 was 5.1 cM. By employing the bulk segregant analysis (BSA) procedure, six of 199 arbitrary primers were found to produce positive Randomly Amplified Polymorphic DNA (RAPD) bands. Tight linkage between Pi-12(t) and three RAPD bands, each from a different primer, was confirmed after amplification of DNA of all F 2 individuals. Two fragments were cloned and sequenced, and two sequence characterised amplified re-ion (SCAR) markers were established. In two other F 3 populations, Xian-feng I/Tetep and Xian-feng, 1/Hong-jiao-zhan, the blast resistance was found to be controlled by interactions of two or more genes. One resistance gene was located in the vicinity of RG81 in both populations. Work to identify other gene(s) is currently under way. Marker assisted selection for blast resistance was initiated. Crosses were made between elite varieties and blast resistance donors to develop populations for DNA marker-assisted selection of blast resistance. In addition, 48 varieties widely used in current rice breeding programs were provided by rice breeders. DNA marker-based polymorphism among, these varieties and resistance donors were analysed to produce a database for future MAS program. (author)

  9. Antibiosis and bmyB Gene Presence As Prevalent Traits for the Selection of Efficient Bacillus Biocontrol Agents against Crown Gall Disease

    Directory of Open Access Journals (Sweden)

    Olfa Frikha-Gargouri

    2017-08-01

    Full Text Available This study aimed to improve the screening method for the selection of Bacillus biocontrol agents against crown gall disease. The relationship between the strain biocontrol ability and their in vitro studied traits was investigated to identify the most important factors to be considered for the selection of effective biocontrol agents. In fact, previous selection procedure relying only on in vitro antibacterial activity was shown to be not suitable in some cases. A direct plant-protection strategy was performed to screen the 32 Bacillus biocontrol agent candidates. Moreover, potential in vitro biocontrol traits were investigated including biofilm formation, motility, hemolytic activity, detection of lipopeptide biosynthetic genes (sfp, ituC and bmyB and production of antibacterial compounds. The obtained results indicated high correlations of the efficiency of the biocontrol with the reduction of gall weight (p = 0.000 and the antibacterial activity in vitro (p = 0.000. Moreover, there was strong correlations of the efficiency of the biocontrol (p = 0.004 and the reduction in gall weight (p = 0.000 with the presence of the bmyB gene. This gene directs the synthesis of the lipopeptide bacillomycin belonging to the iturinic family of lipopeptides. These results were also confirmed by the two-way hierarchical cluster analysis and the correspondence analysis showing the relatedness of these four variables. According to the obtained results a new screening procedure of Bacillus biocontrol agents against crown gall disease could be advanced consisting on two step selection procedure. The first consists on selecting strains with high antibacterial activity in vitro or those harbouring the bmyB gene. Further selection has to be performed on tomato plants in vivo. Moreover, based on the results of the biocontrol assay, five potent strains exhibiting high biocontrol abilities were selected. They were identified as Bacillus subtilis or Bacillus

  10. An evolutionarily conserved gene family encodes proton-selective ion channels.

    Science.gov (United States)

    Tu, Yu-Hsiang; Cooper, Alexander J; Teng, Bochuan; Chang, Rui B; Artiga, Daniel J; Turner, Heather N; Mulhall, Eric M; Ye, Wenlei; Smith, Andrew D; Liman, Emily R

    2018-03-02

    Ion channels form the basis for cellular electrical signaling. Despite the scores of genetically identified ion channels selective for other monatomic ions, only one type of proton-selective ion channel has been found in eukaryotic cells. By comparative transcriptome analysis of mouse taste receptor cells, we identified Otopetrin1 (OTOP1), a protein required for development of gravity-sensing otoconia in the vestibular system, as forming a proton-selective ion channel. We found that murine OTOP1 is enriched in acid-detecting taste receptor cells and is required for their zinc-sensitive proton conductance. Two related murine genes, Otop2 and Otop3 , and a Drosophila ortholog also encode proton channels. Evolutionary conservation of the gene family and its widespread tissue distribution suggest a broad role for proton channels in physiology and pathophysiology. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

  11. Multiplex Real-Time PCR for Detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL) Genes from Selective Enrichments from Animals and Retail Meat

    Science.gov (United States)

    Velasco, Valeria; Sherwood, Julie S.; Rojas-García, Pedro P.; Logue, Catherine M.

    2014-01-01

    The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL) genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat). The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus), mecA (associated with methicillin resistance) and PVL (virulence factor), and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68–0.88 (from substantial to almost perfect agreement) and 0.29–0.77 (from fair to substantial agreement) for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0–0.49 (from no agreement beyond that expected by chance to moderate agreement) for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA) using

  12. Multiplex real-time PCR for detection of Staphylococcus aureus, mecA and Panton-Valentine Leukocidin (PVL genes from selective enrichments from animals and retail meat.

    Directory of Open Access Journals (Sweden)

    Valeria Velasco

    Full Text Available The aim of this study was to compare a real-time PCR assay, with a conventional culture/PCR method, to detect S. aureus, mecA and Panton-Valentine Leukocidin (PVL genes in animals and retail meat, using a two-step selective enrichment protocol. A total of 234 samples were examined (77 animal nasal swabs, 112 retail raw meat, and 45 deli meat. The multiplex real-time PCR targeted the genes: nuc (identification of S. aureus, mecA (associated with methicillin resistance and PVL (virulence factor, and the primary and secondary enrichment samples were assessed. The conventional culture/PCR method included the two-step selective enrichment, selective plating, biochemical testing, and multiplex PCR for confirmation. The conventional culture/PCR method recovered 95/234 positive S. aureus samples. Application of real-time PCR on samples following primary and secondary enrichment detected S. aureus in 111/234 and 120/234 samples respectively. For detection of S. aureus, the kappa statistic was 0.68-0.88 (from substantial to almost perfect agreement and 0.29-0.77 (from fair to substantial agreement for primary and secondary enrichments, using real-time PCR. For detection of mecA gene, the kappa statistic was 0-0.49 (from no agreement beyond that expected by chance to moderate agreement for primary and secondary enrichment samples. Two pork samples were mecA gene positive by all methods. The real-time PCR assay detected the mecA gene in samples that were negative for S. aureus, but positive for Staphylococcus spp. The PVL gene was not detected in any sample by the conventional culture/PCR method or the real-time PCR assay. Among S. aureus isolated by conventional culture/PCR method, the sequence type ST398, and multi-drug resistant strains were found in animals and raw meat samples. The real-time PCR assay may be recommended as a rapid method for detection of S. aureus and the mecA gene, with further confirmation of methicillin-resistant S. aureus (MRSA

  13. Sexual selection and magic traits in speciation with gene flow

    Directory of Open Access Journals (Sweden)

    Maria R. SERVEDIO, Michael KOPP

    2012-06-01

    Full Text Available The extent to which sexual selection is involved in speciation with gene flow remains an open question and the subject of much research. Here, we propose that some insight can be gained from considering the concept of magic traits (i.e., traits involved in both reproductive isolation and ecological divergence. Both magic traits and other, “non-magic”, traits can contribute to speciation via a number of specific mechanisms. We argue that many of these mechanisms are likely to differ widely in the extent to which they involve sexual selection. Furthermore, in some cases where sexual selection is present, it may be prone to inhibit rather than drive speciation. Finally, there are a priori reasons to believe that certain categories of traits are much more effective than others in driving speciation. The combination of these points suggests a classification of traits that may shed light on the broader role of sexual selection in speciation with gene flow. In particular, we suggest that sexual selection can act as a driver of speciation in some scenarios, but may play a negligible role in potentially common categories of magic traits, and may be likely to inhibit speciation in common categories of non-magic traits [Current Zoology 58 (3: 507–513, 2012].

  14. Differential selection on carotenoid biosynthesis genes as a function of gene position in the metabolic pathway: a study on the carrot and dicots.

    Directory of Open Access Journals (Sweden)

    Jérémy Clotault

    Full Text Available Selection of genes involved in metabolic pathways could target them differently depending on the position of genes in the pathway and on their role in controlling metabolic fluxes. This hypothesis was tested in the carotenoid biosynthesis pathway using population genetics and phylogenetics.Evolutionary rates of seven genes distributed along the carotenoid biosynthesis pathway, IPI, PDS, CRTISO, LCYB, LCYE, CHXE and ZEP, were compared in seven dicot taxa. A survey of deviations from neutrality expectations at these genes was also undertaken in cultivated carrot (Daucus carota subsp. sativus, a species that has been intensely bred for carotenoid pattern diversification in its root during its cultivation history. Parts of sequences of these genes were obtained from 46 individuals representing a wide diversity of cultivated carrots. Downstream genes exhibited higher deviations from neutral expectations than upstream genes. Comparisons of synonymous and nonsynonymous substitution rates between genes among dicots revealed greater constraints on upstream genes than on downstream genes. An excess of intermediate frequency polymorphisms, high nucleotide diversity and/or high differentiation of CRTISO, LCYB1 and LCYE in cultivated carrot suggest that balancing selection may have targeted genes acting centrally in the pathway.Our results are consistent with relaxed constraints on downstream genes and selection targeting the central enzymes of the carotenoid biosynthesis pathway during carrot breeding history.

  15. The utility of optical detection system (qPCR) and bioinformatics methods in reference gene expression analysis

    Science.gov (United States)

    Skarzyńska, Agnieszka; Pawełkowicz, Magdalena; PlÄ der, Wojciech; Przybecki, Zbigniew

    2016-09-01

    Real-time quantitative polymerase chain reaction is consider as the most reliable method for gene expression studies. However, the expression of target gene could be misinterpreted due to improper normalization. Therefore, the crucial step for analysing of qPCR data is selection of suitable reference genes, which should be validated experimentally. In order to choice the gene with stable expression in the designed experiment, we performed reference gene expression analysis. In this study genes described in the literature and novel genes predicted as control genes, based on the in silico analysis of transcriptome data were used. Analysis with geNorm and NormFinder algorithms allow to create the ranking of candidate genes and indicate the best reference for flower morphogenesis study. According to the results, genes CACS and CYCL were characterised the most stable expression, but the least suitable genes were TUA and EF.

  16. Selection of Highly Expressed Gene Variants in Escherichia coli Using Translationally Coupled Antibiotic Selection Markers

    DEFF Research Database (Denmark)

    Rennig, Maja; Daley, Daniel O.; Nørholm, Morten H. H.

    2018-01-01

    Strategies to select highly expressed variants of a protein coding sequence are usually based on trial-and-error approaches, which are time-consuming and expensive. We address this problem using translationally coupled antibiotic resistance markers. The system requires that the target gene can...

  17. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  18. "Every Gene Is Everywhere but the Environment Selects": Global Geolocalization of Gene Sharing in Environmental Samples through Network Analysis.

    Science.gov (United States)

    Fondi, Marco; Karkman, Antti; Tamminen, Manu V; Bosi, Emanuele; Virta, Marko; Fani, Renato; Alm, Eric; McInerney, James O

    2016-05-13

    The spatial distribution of microbes on our planet is famously formulated in the Baas Becking hypothesis as "everything is everywhere but the environment selects." While this hypothesis does not strictly rule out patterns caused by geographical effects on ecology and historical founder effects, it does propose that the remarkable dispersal potential of microbes leads to distributions generally shaped by environmental factors rather than geographical distance. By constructing sequence similarity networks from uncultured environmental samples, we show that microbial gene pool distributions are not influenced nearly as much by geography as ecology, thus extending the Bass Becking hypothesis from whole organisms to microbial genes. We find that gene pools are shaped by their broad ecological niche (such as sea water, fresh water, host, and airborne). We find that freshwater habitats act as a gene exchange bridge between otherwise disconnected habitats. Finally, certain antibiotic resistance genes deviate from the general trend of habitat specificity by exhibiting a high degree of cross-habitat mobility. The strong cross-habitat mobility of antibiotic resistance genes is a cause for concern and provides a paradigmatic example of the rate by which genes colonize new habitats when new selective forces emerge. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Selection-independent generation of gene knockout mouse embryonic stem cells using zinc-finger nucleases.

    Directory of Open Access Journals (Sweden)

    Anna Osiak

    Full Text Available Gene knockout in murine embryonic stem cells (ESCs has been an invaluable tool to study gene function in vitro or to generate animal models with altered phenotypes. Gene targeting using standard techniques, however, is rather inefficient and typically does not exceed frequencies of 10(-6. In consequence, the usage of complex positive/negative selection strategies to isolate targeted clones has been necessary. Here, we present a rapid single-step approach to generate a gene knockout in mouse ESCs using engineered zinc-finger nucleases (ZFNs. Upon transient expression of ZFNs, the target gene is cleaved by the designer nucleases and then repaired by non-homologous end-joining, an error-prone DNA repair process that introduces insertions/deletions at the break site and therefore leads to functional null mutations. To explore and quantify the potential of ZFNs to generate a gene knockout in pluripotent stem cells, we generated a mouse ESC line containing an X-chromosomally integrated EGFP marker gene. Applying optimized conditions, the EGFP locus was disrupted in up to 8% of ESCs after transfection of the ZFN expression vectors, thus obviating the need of selection markers to identify targeted cells, which may impede or complicate downstream applications. Both activity and ZFN-associated cytotoxicity was dependent on vector dose and the architecture of the nuclease domain. Importantly, teratoma formation assays of selected ESC clones confirmed that ZFN-treated ESCs maintained pluripotency. In conclusion, the described ZFN-based approach represents a fast strategy for generating gene knockouts in ESCs in a selection-independent fashion that should be easily transferrable to other pluripotent stem cells.

  20. Examination of Signatures of Recent Positive Selection on Genes Involved in Human Sialic Acid Biology.

    Science.gov (United States)

    Moon, Jiyun M; Aronoff, David M; Capra, John A; Abbot, Patrick; Rokas, Antonis

    2018-03-28

    Sialic acids are nine carbon sugars ubiquitously found on the surfaces of vertebrate cells and are involved in various immune response-related processes. In humans, at least 58 genes spanning diverse functions, from biosynthesis and activation to recycling and degradation, are involved in sialic acid biology. Because of their role in immunity, sialic acid biology genes have been hypothesized to exhibit elevated rates of evolutionary change. Consistent with this hypothesis, several genes involved in sialic acid biology have experienced higher rates of non-synonymous substitutions in the human lineage than their counterparts in other great apes, perhaps in response to ancient pathogens that infected hominins millions of years ago (paleopathogens). To test whether sialic acid biology genes have also experienced more recent positive selection during the evolution of the modern human lineage, reflecting adaptation to contemporary cosmopolitan or geographically-restricted pathogens, we examined whether their protein-coding regions showed evidence of recent hard and soft selective sweeps. This examination involved the calculation of four measures that quantify changes in allele frequency spectra, extent of population differentiation, and haplotype homozygosity caused by recent hard and soft selective sweeps for 55 sialic acid biology genes using publicly available whole genome sequencing data from 1,668 humans from three ethnic groups. To disentangle evidence for selection from confounding demographic effects, we compared the observed patterns in sialic acid biology genes to simulated sequences of the same length under a model of neutral evolution that takes into account human demographic history. We found that the patterns of genetic variation of most sialic acid biology genes did not significantly deviate from neutral expectations and were not significantly different among genes belonging to different functional categories. Those few sialic acid biology genes that

  1. Tracing evolutionary relicts of positive selection on eight malaria-related immune genes in mammals.

    Science.gov (United States)

    Huang, Bing-Hong; Liao, Pei-Chun

    2015-07-01

    Plasmodium-induced malaria widely infects primates and other mammals. Multiple past studies have revealed that positive selection could be the main evolutionary force triggering the genetic diversity of anti-malaria resistance-associated genes in human or primates. However, researchers focused most of their attention on the infra-generic and intra-specific genome evolution rather than analyzing the complete evolutionary history of mammals. Here we extend previous research by testing the evolutionary link of natural selection on eight candidate genes associated with malaria resistance in mammals. Three of the eight genes were detected to be affected by recombination, including TNF-α, iNOS and DARC. Positive selection was detected in the rest five immunogenes multiple times in different ancestral lineages of extant species throughout the mammalian evolution. Signals of positive selection were exposed in four malaria-related immunogenes in primates: CCL2, IL-10, HO1 and CD36. However, selection signals of G6PD have only been detected in non-primate eutherians. Significantly higher evolutionary rates and more radical amino acid replacement were also detected in primate CD36, suggesting its functional divergence from other eutherians. Prevalent positive selection throughout the evolutionary trajectory of mammalian malaria-related genes supports the arms race evolutionary hypothesis of host genetic response of mammalian immunogenes to infectious pathogens. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.

  2. A Double Selection Approach to Achieve Specific Expression of Toxin Genes for Ovarian Cancer Gene Therapy

    National Research Council Canada - National Science Library

    Curiel, David T; Siegal, Gene; Wang, Minghui

    2007-01-01

    ...) to achieve efficient and selective gene transfer to target tumor cells. Proposed herein is a strategy to modify one candidate vector, recombinant adenovirus, such that it embodies the requisite properties of efficacy and specificity...

  3. Bayesian nonparametric variable selection as an exploratory tool for discovering differentially expressed genes.

    Science.gov (United States)

    Shahbaba, Babak; Johnson, Wesley O

    2013-05-30

    High-throughput scientific studies involving no clear a priori hypothesis are common. For example, a large-scale genomic study of a disease may examine thousands of genes without hypothesizing that any specific gene is responsible for the disease. In these studies, the objective is to explore a large number of possible factors (e.g., genes) in order to identify a small number that will be considered in follow-up studies that tend to be more thorough and on smaller scales. A simple, hierarchical, linear regression model with random coefficients is assumed for case-control data that correspond to each gene. The specific model used will be seen to be related to a standard Bayesian variable selection model. Relatively large regression coefficients correspond to potential differences in responses for cases versus controls and thus to genes that might 'matter'. For large-scale studies, and using a Dirichlet process mixture model for the regression coefficients, we are able to find clusters of regression effects of genes with increasing potential effect or 'relevance', in relation to the outcome of interest. One cluster will always correspond to genes whose coefficients are in a neighborhood that is relatively close to zero and will be deemed least relevant. Other clusters will correspond to increasing magnitudes of the random/latent regression coefficients. Using simulated data, we demonstrate that our approach could be quite effective in finding relevant genes compared with several alternative methods. We apply our model to two large-scale studies. The first study involves transcriptome analysis of infection by human cytomegalovirus. The second study's objective is to identify differentially expressed genes between two types of leukemia. Copyright © 2012 John Wiley & Sons, Ltd.

  4. Prediction of Genes Related to Positive Selection Using Whole-Genome Resequencing in Three Commercial Pig Breeds

    Directory of Open Access Journals (Sweden)

    HyoYoung Kim

    2015-12-01

    Full Text Available Selective sweep can cause genetic differentiation across populations, which allows for the identification of possible causative regions/genes underlying important traits. The pig has experienced a long history of allele frequency changes through artificial selection in the domestication process. We obtained an average of 329,482,871 sequence reads for 24 pigs from three pig breeds: Yorkshire (n = 5, Landrace (n = 13, and Duroc (n = 6. An average read depth of 11.7 was obtained using whole-genome resequencing on an Illumina HiSeq2000 platform. In this study, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio tests were implemented to detect genes experiencing positive selection for the genome-wide resequencing data generated from three commercial pig breeds. In our results, 26, 7, and 14 genes from Yorkshire, Landrace, and Duroc, respectively were detected by two kinds of statistical tests. Significant evidence for positive selection was identified on genes ST6GALNAC2 and EPHX1 in Yorkshire, PARK2 in Landrace, and BMP6, SLA-DQA1, and PRKG1 in Duroc.These genes are reportedly relevant to lactation, reproduction, meat quality, and growth traits. To understand how these single nucleotide polymorphisms (SNPs related positive selection affect protein function, we analyzed the effect of non-synonymous SNPs. Three SNPs (rs324509622, rs80931851, and rs80937718 in the SLA-DQA1 gene were significant in the enrichment tests, indicating strong evidence for positive selection in Duroc. Our analyses identified genes under positive selection for lactation, reproduction, and meat-quality and growth traits in Yorkshire, Landrace, and Duroc, respectively.

  5. Drug and cell type-specific regulation of genes with different classes of estrogen receptor beta-selective agonists.

    Directory of Open Access Journals (Sweden)

    Sreenivasan Paruthiyil

    2009-07-01

    Full Text Available Estrogens produce biological effects by interacting with two estrogen receptors, ERalpha and ERbeta. Drugs that selectively target ERalpha or ERbeta might be safer for conditions that have been traditionally treated with non-selective estrogens. Several synthetic and natural ERbeta-selective compounds have been identified. One class of ERbeta-selective agonists is represented by ERB-041 (WAY-202041 which binds to ERbeta much greater than ERalpha. A second class of ERbeta-selective agonists derived from plants include MF101, nyasol and liquiritigenin that bind similarly to both ERs, but only activate transcription with ERbeta. Diarylpropionitrile represents a third class of ERbeta-selective compounds because its selectivity is due to a combination of greater binding to ERbeta and transcriptional activity. However, it is unclear if these three classes of ERbeta-selective compounds produce similar biological activities. The goals of these studies were to determine the relative ERbeta selectivity and pattern of gene expression of these three classes of ERbeta-selective compounds compared to estradiol (E(2, which is a non-selective ER agonist. U2OS cells stably transfected with ERalpha or ERbeta were treated with E(2 or the ERbeta-selective compounds for 6 h. Microarray data demonstrated that ERB-041, MF101 and liquiritigenin were the most ERbeta-selective agonists compared to estradiol, followed by nyasol and then diarylpropionitrile. FRET analysis showed that all compounds induced a similar conformation of ERbeta, which is consistent with the finding that most genes regulated by the ERbeta-selective compounds were similar to each other and E(2. However, there were some classes of genes differentially regulated by the ERbeta agonists and E(2. Two ERbeta-selective compounds, MF101 and liquiritigenin had cell type-specific effects as they regulated different genes in HeLa, Caco-2 and Ishikawa cell lines expressing ERbeta. Our gene profiling studies

  6. Gene selection for the reconstruction of stem cell differentiation trees: a linear programming approach.

    Science.gov (United States)

    Ghadie, Mohamed A; Japkowicz, Nathalie; Perkins, Theodore J

    2015-08-15

    Stem cell differentiation is largely guided by master transcriptional regulators, but it also depends on the expression of other types of genes, such as cell cycle genes, signaling genes, metabolic genes, trafficking genes, etc. Traditional approaches to understanding gene expression patterns across multiple conditions, such as principal components analysis or K-means clustering, can group cell types based on gene expression, but they do so without knowledge of the differentiation hierarchy. Hierarchical clustering can organize cell types into a tree, but in general this tree is different from the differentiation hierarchy itself. Given the differentiation hierarchy and gene expression data at each node, we construct a weighted Euclidean distance metric such that the minimum spanning tree with respect to that metric is precisely the given differentiation hierarchy. We provide a set of linear constraints that are provably sufficient for the desired construction and a linear programming approach to identify sparse sets of weights, effectively identifying genes that are most relevant for discriminating different parts of the tree. We apply our method to microarray gene expression data describing 38 cell types in the hematopoiesis hierarchy, constructing a weighted Euclidean metric that uses just 175 genes. However, we find that there are many alternative sets of weights that satisfy the linear constraints. Thus, in the style of random-forest training, we also construct metrics based on random subsets of the genes and compare them to the metric of 175 genes. We then report on the selected genes and their biological functions. Our approach offers a new way to identify genes that may have important roles in stem cell differentiation. tperkins@ohri.ca Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. An efficient parallel stochastic simulation method for analysis of nonviral gene delivery systems

    KAUST Repository

    Kuwahara, Hiroyuki

    2011-01-01

    Gene therapy has a great potential to become an effective treatment for a wide variety of diseases. One of the main challenges to make gene therapy practical in clinical settings is the development of efficient and safe mechanisms to deliver foreign DNA molecules into the nucleus of target cells. Several computational and experimental studies have shown that the design process of synthetic gene transfer vectors can be greatly enhanced by computational modeling and simulation. This paper proposes a novel, effective parallelization of the stochastic simulation algorithm (SSA) for pharmacokinetic models that characterize the rate-limiting, multi-step processes of intracellular gene delivery. While efficient parallelizations of the SSA are still an open problem in a general setting, the proposed parallel simulation method is able to substantially accelerate the next reaction selection scheme and the reaction update scheme in the SSA by exploiting and decomposing the structures of stochastic gene delivery models. This, thus, makes computationally intensive analysis such as parameter optimizations and gene dosage control for specific cell types, gene vectors, and transgene expression stability substantially more practical than that could otherwise be with the standard SSA. Here, we translated the nonviral gene delivery model based on mass-action kinetics by Varga et al. [Molecular Therapy, 4(5), 2001] into a more realistic model that captures intracellular fluctuations based on stochastic chemical kinetics, and as a case study we applied our parallel simulation to this stochastic model. Our results show that our simulation method is able to increase the efficiency of statistical analysis by at least 50% in various settings. © 2011 ACM.

  8. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA

    2012-05-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  9. Methods for monitoring multiple gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Berka, Randy; Bachkirova, Elena; Rey, Michael

    2013-10-01

    The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.

  10. Selection of Reference Genes for Expression Studies in Diaphorina citri (Hemiptera: Liviidae).

    Science.gov (United States)

    Bassan, Meire Menezes; Angelotti-Mendonc A, Je Ssika; Alves, Gustavo Rodrigues; Yamamoto, Pedro Takao; Moura O Filho, Francisco de Assis Alves

    2017-12-05

    The Asian citrus psyllid, Diaphorina citri Kuwayama (Hemiptera: Liviidae), is considered the main vector of the bacteria associated with huanglongbing, a very serious disease that has threatened the world citrus industry. The absence of efficient control management protocols, including a lack of resistant cultivars, has led to the development of different approaches to study this pathosystem. The production of resistant genotypes relies on D. citri gene expression analyses by RT-qPCR to assess control of the vector population. High-quality, reliable RT-qPCR analyses depend upon proper reference gene selection and validation. However, adequate D. citri reference genes have not yet been identified. In the present study, we evaluated the genes EF 1-α, ACT, GAPDH, RPL7, RPL17, and TUB as candidate reference genes for this insect. Gene expression stability was evaluated using the mathematical algorithms deltaCt, NormFinder, BestKeeper, and geNorm, at five insect developmental stages, grown on two different plant hosts [Citrus sinensis (L.) Osbeck (Sapindales: Rutaceae) and Murraya paniculata (L.) Jack (Sapindales: Rutaceae)]. The final gene ranking was calculated using RefFinder software, and the V-ATPase-A gene was selected for validation. According to our results, two reference genes are recommended when different plant hosts and developmental stages are considered. Considering gene expression studies in D. citri grown on M. paniculata, regardless of the insect developmental stage, GAPDH and RPL7 have the best fit as reference genes in RT-qPCR analyses, whereas GAPDH and EF 1-α are recommended as reference genes in insect studies using C. sinensis. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  11. Effective selection of transgenic papaya plants with the PMI/Man selection system.

    Science.gov (United States)

    Zhu, Yun J; Agbayani, Ricelle; McCafferty, Heather; Albert, Henrik H; Moore, Paul H

    2005-09-01

    The selectable marker gene phospho-mannose isomerase (pmi), which encodes the enzyme phospho-mannose isomerase (PMI) to enable selection of transformed cell lines on media containing mannose (Man), was evaluated for genetic transformation of papaya (Carica papaya L.). We found that papaya embryogenic calli have little or no PMI activity and cannot utilize Man as a carbon source; however, when calli were transformed with a pmi gene, the PMI activity was greatly increased and they could utilize Man as efficiently as sucrose. Plants regenerated from selected callus lines also exhibited PMI activity but at a lower specific activity level. Our transformation efficiency with Man selection was higher than that reported using antibiotic selection or with a visual marker. For papaya, the PMI/Man selection system for producing transgenic plants is a highly efficient addition to previously published methods for selection and may facilitate the stacking of multiple transgenes of interest. Additionally, since the PMI/Man selection system does not involve antibiotic or herbicide resistance genes, its use might reduce environmental concerns about the potential flow of those genes into related plant populations.

  12. A Screening Method for the ALK Fusion Gene in NSCLC

    International Nuclear Information System (INIS)

    Murakami, Yoshiko; Mitsudomi, Tetsuya; Yatabe, Yasushi

    2012-01-01

    Lung cancer research has recently made significant progress in understanding the molecular pathogenesis of lung cancer and in developing treatments for it. Such achievements are directly utilized in clinical practice. Indeed, the echinoderm microtubule-associated protein-like 4–anaplastic lymphoma kinase (ALK) fusion gene was first described in non-small cell lung cancer in 2007, and a molecularly targeted drug against the fusion was approved in 2011. However, lung cancer with the ALK fusion constitutes only a small fraction of lung cancers; therefore, efficient patient selection is crucial for successful treatment using the ALK inhibitor. Currently, RT-PCR, fluorescent in situ hybridization (FISH), and immunohistochemistry are commonly used to detect the ALK fusion. Although FISH is currently the gold standard technique, there are no perfect methods for detecting these genetic alterations. In this article, we discuss the advantages and disadvantages of each method and the possible criteria for selecting patients who are more likely to have the ALK fusion. If we can successfully screen patients, then ALK inhibitor treatment will be the best example of personalized therapy in terms of selecting patients with an uncommon genotype from a larger group with the same tumor phenotype. In other words, the personalized therapy may offer a new challenge for current clinical oncology.

  13. Preclinical evaluation of gene delivery methods for the treatment of loco-regional disease in breast cancer.

    LENUS (Irish Health Repository)

    Rajendran, Simon

    2011-04-01

    Preclinical results with various gene therapy strategies indicate significant potential for new cancer treatments. However, many therapeutics fail at clinical trial, often due to differences in tissue physiology between animal models and humans, and tumor phenotype variation. Clinical data relevant to treatment strategies may be generated prior to clinical trial through experimentation using intact patient tissue ex vivo. We developed a novel tumor slice model culture system that is universally applicable to gene delivery methods, using a realtime luminescence detection method to assess gene delivery. Methods investigated include viruses (adenovirus [Ad] and adeno-associated virus), lipofection, ultrasound (US), electroporation and naked DNA. Viability and tumor populations within the slices were well maintained for seven days, and gene delivery was qualitatively and quantitatively examinable for all vectors. Ad was the most efficient gene delivery vector with transduction efficiency >50%. US proved the optimal non-viral gene delivery method in human tumor slices. The nature of the ex vivo culture system permitted examination of specific elements. Parameters shown to diminish Ad gene delivery included blood, regions of low viability and secondary disease. US gene delivery was significantly reduced by blood and skin, while tissue hyperthermia improved gene delivery. US achieved improved efficacy for secondary disease. The ex vivo model was also suitable for examination of tissue-specific effects on vector expression, with Ad expression mediated by the CXCR4 promoter shown to provide a tumor selective advantage over the ubiquitously active cytomegalovirus promoter. In conclusion, this is the first study incorporating patient tissue models in comparing gene delivery from various vectors, providing knowledge on cell-type specificity and examining the crucial biological factors determining successful gene delivery. The results highlight the importance of in

  14. Preclinical evaluation of gene delivery methods for the treatment of loco-regional disease in breast cancer.

    LENUS (Irish Health Repository)

    Rajendran, Simon

    2012-01-31

    Preclinical results with various gene therapy strategies indicate significant potential for new cancer treatments. However, many therapeutics fail at clinical trial, often due to differences in tissue physiology between animal models and humans, and tumor phenotype variation. Clinical data relevant to treatment strategies may be generated prior to clinical trial through experimentation using intact patient tissue ex vivo. We developed a novel tumor slice model culture system that is universally applicable to gene delivery methods, using a realtime luminescence detection method to assess gene delivery. Methods investigated include viruses (adenovirus [Ad] and adeno-associated virus), lipofection, ultrasound (US), electroporation and naked DNA. Viability and tumor populations within the slices were well maintained for seven days, and gene delivery was qualitatively and quantitatively examinable for all vectors. Ad was the most efficient gene delivery vector with transduction efficiency >50%. US proved the optimal non-viral gene delivery method in human tumor slices. The nature of the ex vivo culture system permitted examination of specific elements. Parameters shown to diminish Ad gene delivery included blood, regions of low viability and secondary disease. US gene delivery was significantly reduced by blood and skin, while tissue hyperthermia improved gene delivery. US achieved improved efficacy for secondary disease. The ex vivo model was also suitable for examination of tissue-specific effects on vector expression, with Ad expression mediated by the CXCR4 promoter shown to provide a tumor selective advantage over the ubiquitously active cytomegalovirus promoter. In conclusion, this is the first study incorporating patient tissue models in comparing gene delivery from various vectors, providing knowledge on cell-type specificity and examining the crucial biological factors determining successful gene delivery. The results highlight the importance of in

  15. A Bayesian variable selection procedure for ranking overlapping gene sets

    DEFF Research Database (Denmark)

    Skarman, Axel; Mahdi Shariati, Mohammad; Janss, Luc

    2012-01-01

    Background Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been de...

  16. Use of the alr gene as a food-grade selection marker in lactic acid bacteria.

    Science.gov (United States)

    Bron, Peter A; Benchimol, Marcos G; Lambert, Jolanda; Palumbo, Emmanuelle; Deghorain, Marie; Delcour, Jean; De Vos, Willem M; Kleerebezem, Michiel; Hols, Pascal

    2002-11-01

    Both Lactococcus lactis and Lactobacillus plantarum contain a single alr gene, encoding an alanine racemase (EC 5.1.1.1), which catalyzes the interconversion of D-alanine and L-alanine. The alr genes of these lactic acid bacteria were investigated for their application as food-grade selection markers in a heterologous complementation approach. Since isogenic mutants of both species carrying an alr deletion (Deltaalr) showed auxotrophy for D-alanine, plasmids carrying a heterologous alr were constructed and could be selected, since they complemented D-alanine auxotrophy in the L. plantarum Deltaalr and L. lactis Deltaalr strains. Selection was found to be highly stringent, and plasmids were stably maintained over 200 generations of culturing. Moreover, the plasmids carrying the heterologous alr genes could be stably maintained in wild-type strains of L. plantarum and L. lactis by selection for resistance to D-cycloserine, a competitive inhibitor of Alr (600 and 200 micro g/ml, respectively). In addition, a plasmid carrying the L. plantarum alr gene under control of the regulated nisA promoter was constructed to demonstrate that D-cycloserine resistance of L. lactis is linearly correlated to the alr expression level. Finally, the L. lactis alr gene controlled by the nisA promoter, together with the nisin-regulatory genes nisRK, were integrated into the chromosome of L. plantarum Deltaalr. The resulting strain could grow in the absence of D-alanine only when expression of the alr gene was induced with nisin.

  17. Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

    Directory of Open Access Journals (Sweden)

    Alok Arun

    Full Text Available Real-time quantitative reverse transcription PCR (qRT-PCR is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae, two developmental stages (pupal and adult and two sexes (male and female, all of which were subjected to two food treatments (food stress and control feeding ad libitum. The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the

  18. Selection and validation of reference genes for qRT-PCR expression analysis of candidate genes involved in olfactory communication in the butterfly Bicyclus anynana.

    Science.gov (United States)

    Arun, Alok; Baumlé, Véronique; Amelot, Gaël; Nieberding, Caroline M

    2015-01-01

    Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at identifying reference genes for accurate data normalization for any butterfly is available. The African bush brown butterfly Bicyclus anynana has drawn considerable attention owing to its suitability as a model for evolutionary ecology, and we here provide a maiden extensive study to identify suitable reference gene in this species. We monitored the expression profile of twelve reference genes: eEF-1α, FK506, UBQL40, RpS8, RpS18, HSP, GAPDH, VATPase, ACT3, TBP, eIF2 and G6PD. We tested the stability of their expression profiles in three different tissues (wings, brains, antennae), two developmental stages (pupal and adult) and two sexes (male and female), all of which were subjected to two food treatments (food stress and control feeding ad libitum). The expression stability and ranking of twelve reference genes was assessed using two algorithm-based methods, NormFinder and geNorm. Both methods identified RpS8 as the best suitable reference gene for expression data normalization. We also showed that the use of two reference genes is sufficient to effectively normalize the qRT-PCR data under varying tissues and experimental conditions that we used in B. anynana. Finally, we tested the effect of choosing reference genes with different stability on the normalization of the transcript abundance of a candidate gene involved in olfactory communication in B. anynana, the Fatty Acyl Reductase 2, and we confirmed that using an unstable reference gene can drastically alter the expression

  19. Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes.

    Science.gov (United States)

    Karn, Robert C; Laukaitis, Christina M

    2014-08-01

    In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.

  20. Positive selection and ancient duplications in the evolution of class B floral homeotic genes of orchids and grasses

    Directory of Open Access Journals (Sweden)

    Koch Marcus A

    2009-04-01

    Full Text Available Abstract Background Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged. Results The DEF-like genes from Orchidaceae form four well-supported, ancient clades of orthologues. In contrast, orchid GLO-like genes form a single clade of ancient orthologues and recent paralogues. DEF-like genes from orchid clade 2 (OMADS3-like genes are under less stringent purifying selection than the other orchid DEF-like and GLO-like genes. In comparison with orchids, purifying selection

  1. The relationship between PMI (manA) gene expression and optimal selection pressure in Indica rice transformation.

    Science.gov (United States)

    Gui, Huaping; Li, Xia; Liu, Yubo; Han, Kai; Li, Xianggan

    2014-07-01

    An efficient mannose selection system was established for transformation of Indica cultivar IR58025B . Different selection pressures were required to achieve optimum transformation frequency for different PMI selectable marker cassettes. This study was conducted to establish an efficient transformation system for Indica rice, cultivar IR58025B. Four combinations of two promoters, rice Actin 1 and maize Ubiquitin 1, and two manA genes, native gene from E. coli (PMI-01) and synthetic maize codon-optimized gene (PMI-09) were compared under various concentrations of mannose. Different selection pressures were required for different gene cassettes to achieve corresponding optimum transformation frequency (TF). Higher TFs as 54 and 53% were obtained when 5 g/L mannose was used for selection of prActin-PMI-01 cassette and 7.5 g/L mannose used for selection of prActin-PMI-09, respectively. TFs as 67 and 56% were obtained when 7.5 and 15 g/L mannose were used for selection of prUbi-PMI-01 and prUbi-PMI-09, respectively. We conclude that higher TFs can be achieved for different gene cassettes when an optimum selection pressure is applied. By investigating the PMI expression level in transgenic calli and leaves, we found there was a significant positive correlation between the protein expression level and the optimal selection pressure. Higher optimal selection pressure is required for those constructs which confer higher expression of PMI protein. The single copy rate of those transgenic events for prActin-PMI-01 cassette is lower than that for other three cassettes. We speculate some of low copy events with low protein expression levels might not have been able to survive in the mannose selection.

  2. Selection of relatively exact reference genes for gene expression studies in goosegrass (Eleusine indica) under herbicide stress.

    Science.gov (United States)

    Chen, Jingchao; Huang, Zhaofeng; Huang, Hongjuan; Wei, Shouhui; Liu, Yan; Jiang, Cuilan; Zhang, Jie; Zhang, Chaoxian

    2017-04-21

    Goosegrass (Eleusine indica) is one of the most serious annual grassy weeds worldwide, and its evolved herbicide-resistant populations are more difficult to control. Quantitative real-time PCR (qPCR) is a common technique for investigating the resistance mechanism; however, there is as yet no report on the systematic selection of stable reference genes for goosegrass. This study proposed to test the expression stability of 9 candidate reference genes in goosegrass in different tissues and developmental stages and under stress from three types of herbicide. The results show that for different developmental stages and organs (control), eukaryotic initiation factor 4 A (eIF-4) is the most stable reference gene. Chloroplast acetolactate synthase (ALS) is the most stable reference gene under glyphosate stress. Under glufosinate stress, eIF-4 is the best reference gene. Ubiquitin-conjugating enzyme (UCE) is the most stable reference gene under quizalofop-p-ethyl stress. The gene eIF-4 is the recommended reference gene for goosegrass under the stress of all three herbicides. Moreover, pairwise analysis showed that seven reference genes were sufficient to normalize the gene expression data under three herbicides treatment. This study provides a list of reliable reference genes for transcript normalization in goosegrass, which will facilitate resistance mechanism studies in this weed species.

  3. Selection of reliable reference genes for quantitative real-time PCR in human T cells and neutrophils

    Directory of Open Access Journals (Sweden)

    Ledderose Carola

    2011-10-01

    Full Text Available Abstract Background The choice of reliable reference genes is a prerequisite for valid results when analyzing gene expression with real-time quantitative PCR (qPCR. This method is frequently applied to study gene expression patterns in immune cells, yet a thorough validation of potential reference genes is still lacking for most leukocyte subtypes and most models of their in vitro stimulation. In the current study, we evaluated the expression stability of common reference genes in two widely used cell culture models-anti-CD3/CD28 activated T cells and lipopolysaccharide stimulated neutrophils-as well as in unselected untreated leukocytes. Results The mRNA expression of 17 (T cells, 7 (neutrophils or 8 (unselected leukocytes potential reference genes was quantified by reverse transcription qPCR, and a ranking of the preselected candidate genes according to their expression stability was calculated using the programs NormFinder, geNorm and BestKeeper. IPO8, RPL13A, TBP and SDHA were identified as suitable reference genes in T cells. TBP, ACTB and SDHA were stably expressed in neutrophils. TBP and SDHA were also the most stable genes in untreated total blood leukocytes. The critical impact of reference gene selection on the estimated target gene expression is demonstrated for IL-2 and FIH expression in T cells. Conclusions The study provides a shortlist of suitable reference genes for normalization of gene expression data in unstimulated and stimulated T cells, unstimulated and stimulated neutrophils and in unselected leukocytes.

  4. A copula method for modeling directional dependence of genes

    Directory of Open Access Journals (Sweden)

    Park Changyi

    2008-05-01

    Full Text Available Abstract Background Genes interact with each other as basic building blocks of life, forming a complicated network. The relationship between groups of genes with different functions can be represented as gene networks. With the deposition of huge microarray data sets in public domains, study on gene networking is now possible. In recent years, there has been an increasing interest in the reconstruction of gene networks from gene expression data. Recent work includes linear models, Boolean network models, and Bayesian networks. Among them, Bayesian networks seem to be the most effective in constructing gene networks. A major problem with the Bayesian network approach is the excessive computational time. This problem is due to the interactive feature of the method that requires large search space. Since fitting a model by using the copulas does not require iterations, elicitation of the priors, and complicated calculations of posterior distributions, the need for reference to extensive search spaces can be eliminated leading to manageable computational affords. Bayesian network approach produces a discretely expression of conditional probabilities. Discreteness of the characteristics is not required in the copula approach which involves use of uniform representation of the continuous random variables. Our method is able to overcome the limitation of Bayesian network method for gene-gene interaction, i.e. information loss due to binary transformation. Results We analyzed the gene interactions for two gene data sets (one group is eight histone genes and the other group is 19 genes which include DNA polymerases, DNA helicase, type B cyclin genes, DNA primases, radiation sensitive genes, repaire related genes, replication protein A encoding gene, DNA replication initiation factor, securin gene, nucleosome assembly factor, and a subunit of the cohesin complex by adopting a measure of directional dependence based on a copula function. We have compared

  5. Reranking candidate gene models with cross-species comparison for improved gene prediction

    Directory of Open Access Journals (Sweden)

    Pereira Fernando CN

    2008-10-01

    Full Text Available Abstract Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc. Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.

  6. Selection of Reliable Reference Genes for Gene Expression Studies of a Promising Oilseed Crop, Plukenetia volubilis, by Real-Time Quantitative PCR

    Directory of Open Access Journals (Sweden)

    Longjian Niu

    2015-06-01

    Full Text Available Real-time quantitative PCR (RT-qPCR is a reliable and widely used method for gene expression analysis. The accuracy of the determination of a target gene expression level by RT-qPCR demands the use of appropriate reference genes to normalize the mRNA levels among different samples. However, suitable reference genes for RT-qPCR have not been identified in Sacha inchi (Plukenetia volubilis, a promising oilseed crop known for its polyunsaturated fatty acid (PUFA-rich seeds. In this study, using RT-qPCR, twelve candidate reference genes were examined in seedlings and adult plants, during flower and seed development and for the entire growth cycle of Sacha inchi. Four statistical algorithms (delta cycle threshold (ΔCt, BestKeeper, geNorm, and NormFinder were used to assess the expression stabilities of the candidate genes. The results showed that ubiquitin-conjugating enzyme (UCE, actin (ACT and phospholipase A22 (PLA were the most stable genes in Sacha inchi seedlings. For roots, stems, leaves, flowers, and seeds from adult plants, 30S ribosomal protein S13 (RPS13, cyclophilin (CYC and elongation factor-1alpha (EF1α were recommended as reference genes for RT-qPCR. During the development of reproductive organs, PLA, ACT and UCE were the optimal reference genes for flower development, whereas UCE, RPS13 and RNA polymerase II subunit (RPII were optimal for seed development. Considering the entire growth cycle of Sacha inchi, UCE, ACT and EF1α were sufficient for the purpose of normalization. Our results provide useful guidelines for the selection of reliable reference genes for the normalization of RT-qPCR data for seedlings and adult plants, for reproductive organs, and for the entire growth cycle of Sacha inchi.

  7. Selection of Reliable Reference Genes for Gene Expression Studies of a Promising Oilseed Crop, Plukenetia volubilis, by Real-Time Quantitative PCR

    Science.gov (United States)

    Niu, Longjian; Tao, Yan-Bin; Chen, Mao-Sheng; Fu, Qiantang; Li, Chaoqiong; Dong, Yuling; Wang, Xiulan; He, Huiying; Xu, Zeng-Fu

    2015-01-01

    Real-time quantitative PCR (RT-qPCR) is a reliable and widely used method for gene expression analysis. The accuracy of the determination of a target gene expression level by RT-qPCR demands the use of appropriate reference genes to normalize the mRNA levels among different samples. However, suitable reference genes for RT-qPCR have not been identified in Sacha inchi (Plukenetia volubilis), a promising oilseed crop known for its polyunsaturated fatty acid (PUFA)-rich seeds. In this study, using RT-qPCR, twelve candidate reference genes were examined in seedlings and adult plants, during flower and seed development and for the entire growth cycle of Sacha inchi. Four statistical algorithms (delta cycle threshold (ΔCt), BestKeeper, geNorm, and NormFinder) were used to assess the expression stabilities of the candidate genes. The results showed that ubiquitin-conjugating enzyme (UCE), actin (ACT) and phospholipase A22 (PLA) were the most stable genes in Sacha inchi seedlings. For roots, stems, leaves, flowers, and seeds from adult plants, 30S ribosomal protein S13 (RPS13), cyclophilin (CYC) and elongation factor-1alpha (EF1α) were recommended as reference genes for RT-qPCR. During the development of reproductive organs, PLA, ACT and UCE were the optimal reference genes for flower development, whereas UCE, RPS13 and RNA polymerase II subunit (RPII) were optimal for seed development. Considering the entire growth cycle of Sacha inchi, UCE, ACT and EF1α were sufficient for the purpose of normalization. Our results provide useful guidelines for the selection of reliable reference genes for the normalization of RT-qPCR data for seedlings and adult plants, for reproductive organs, and for the entire growth cycle of Sacha inchi. PMID:26047338

  8. Improvement Method of Gene Transfer in Kappaphycus Alvarezii

    OpenAIRE

    Triana, St. Hidayah; Alimuddin,; Widyastuti, Utut; Suharsono,; Suryati, Emma; Parenrengi, Andi

    2016-01-01

    Method of foreign gene transfer in red seaweed Kappaphycus alvarezii has been reported, however, li-mited number of transgenic F0 (broodstock) was obtained. This study was conducted to improve the method of gene transfer mediated by Agrobacterium tumefaciens in order to obtain high percentage of K. alvarezii transgenic. Superoxide dismutase gene from Melastoma malabatrichum (MmCu/Zn-SOD) was used as model towards increasing adaptability of K. alvarezii to environmental stress. The treat-ment...

  9. Genealogy-based methods for inference of historical recombination and gene flow and their application in Saccharomyces cerevisiae.

    Science.gov (United States)

    Jenkins, Paul A; Song, Yun S; Brem, Rachel B

    2012-01-01

    Genetic exchange between isolated populations, or introgression between species, serves as a key source of novel genetic material on which natural selection can act. While detecting historical gene flow from DNA sequence data is of much interest, many existing methods can be limited by requirements for deep population genomic sampling. In this paper, we develop a scalable genealogy-based method to detect candidate signatures of gene flow into a given population when the source of the alleles is unknown. Our method does not require sequenced samples from the source population, provided that the alleles have not reached fixation in the sampled recipient population. The method utilizes recent advances in algorithms for the efficient reconstruction of ancestral recombination graphs, which encode genealogical histories of DNA sequence data at each site, and is capable of detecting the signatures of gene flow whose footprints are of length up to single genes. Further, we employ a theoretical framework based on coalescent theory to test for statistical significance of certain recombination patterns consistent with gene flow from divergent sources. Implementing these methods for application to whole-genome sequences of environmental yeast isolates, we illustrate the power of our approach to highlight loci with unusual recombination histories. By developing innovative theory and methods to analyze signatures of gene flow from population sequence data, our work establishes a foundation for the continued study of introgression and its evolutionary relevance.

  10. Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions.

    Science.gov (United States)

    Evans, Ciaran; Hardin, Johanna; Stoebel, Daniel M

    2017-02-27

    RNA-Seq is a widely used method for studying the behavior of genes under different biological conditions. An essential step in an RNA-Seq study is normalization, in which raw data are adjusted to account for factors that prevent direct comparison of expression measures. Errors in normalization can have a significant impact on downstream analysis, such as inflated false positives in differential expression analysis. An underemphasized feature of normalization is the assumptions on which the methods rely and how the validity of these assumptions can have a substantial impact on the performance of the methods. In this article, we explain how assumptions provide the link between raw RNA-Seq read counts and meaningful measures of gene expression. We examine normalization methods from the perspective of their assumptions, as an understanding of methodological assumptions is necessary for choosing methods appropriate for the data at hand. Furthermore, we discuss why normalization methods perform poorly when their assumptions are violated and how this causes problems in subsequent analysis. To analyze a biological experiment, researchers must select a normalization method with assumptions that are met and that produces a meaningful measure of expression for the given experiment. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  11. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  12. A new electrospray method for targeted gene delivery.

    Science.gov (United States)

    Boehringer, Stephan; Ruzgys, Paulius; Tamò, Luca; Šatkauskas, Saulius; Geiser, Thomas; Gazdhar, Amiq; Hradetzky, David

    2018-03-05

    A challenge for gene therapy is absence of safe and efficient local delivery of therapeutic genetic material. An efficient and reproducible physical method of electrospray for localized and targeted gene delivery is presented. Electrospray works on the principle of coulombs repulsion, under influence of electric field the liquid carrying genetic material is dispersed into micro droplets and is accelerated towards the targeted tissue, acting as a counter electrode. The accelerated droplets penetrate the targeted cells thus facilitating the transfer of genetic material into the cell. The work described here presents the principle of electrospray for gene delivery, the basic instrument design, and the various optimized parameters to enhance gene transfer in vitro. We estimate a transfection efficiency of up to 60% was achieved. We describe an efficient gene transfer method and a potential electrospray-mediated gene transfer mechanism.

  13. Selection of reliable reference genes in Caenorhabditis elegans for analysis of nanotoxicity.

    Science.gov (United States)

    Zhang, Yanqiong; Chen, Dongliang; Smith, Michael A; Zhang, Baohong; Pan, Xiaoping

    2012-01-01

    Despite rapid development and application of a wide range of manufactured metal oxide nanoparticles (NPs), the understanding of potential risks of using NPs is less completed, especially at the molecular level. The nematode Caenorhabditis elegans (C.elegans) has been emerging as an environmental model to study the molecular mechanism of environmental contaminations, using standard genetic tools such as the real-time quantitative PCR (RT-qPCR). The most important factor that may affect the accuracy of RT-qPCR is to choose appropriate genes for normalization. In this study, we selected 13 reference gene candidates (act-1, cdc-42, pmp-3, eif-3.C, actin, act-2, csq-1, Y45F10D.4, tba-1, mdh-1, ama-1, F35G12.2, and rbd-1) to test their expression stability under different doses of nano-copper oxide (CuO 0, 1, 10, and 50 µg/mL) using RT-qPCR. Four algorithms, geNorm, NormFinder, BestKeeper, and the comparative ΔCt method, were employed to evaluate these 13 candidates expressions. As a result, tba-1, Y45F10D.4 and pmp-3 were the most reliable, which may be used as reference genes in future study of nanoparticle-induced genetic response using C.elegans.

  14. Selection of Housekeeping Genes for Transgene Expression Analysis in Eucommia ulmoides Oliver Using Real-Time RT-PCR

    Directory of Open Access Journals (Sweden)

    Ren Chen

    2010-01-01

    Full Text Available In order to select appropriate housekeeping genes for accurate calibration of experimental variations in real-time (RT- PCR results in transgene expression analysis, particularly with respect to the influence of transgene on stability of endogenous housekeeping gene expression in transgenic plants, we outline a reliable strategy to identify the optimal housekeeping genes from a set of candidates by combining statistical analyses of their (RT- PCR amplification efficiency, gene expression stability, and transgene influences. We used the strategy to select two genes, ACTα and EF1α, from 10 candidate housekeeping genes, as the optimal housekeeping genes to evaluate transgenic Eucommia ulmoides Oliver root lines overexpressing IPPI or FPPS1 genes, which are involved in isoprenoid biosynthesis.

  15. Selection of reference genes for tissue/organ samples on day 3 fifth-instar larvae in silkworm, Bombyx mori.

    Science.gov (United States)

    Wang, Genhong; Chen, Yanfei; Zhang, Xiaoying; Bai, Bingchuan; Yan, Hao; Qin, Daoyuan; Xia, Qingyou

    2018-06-01

    The silkworm, Bombyx mori, is one of the world's most economically important insect. Surveying variations in gene expression among multiple tissue/organ samples will provide clues for gene function assignments and will be helpful for identifying genes related to economic traits or specific cellular processes. To ensure their accuracy, commonly used gene expression quantification methods require a set of stable reference genes for data normalization. In this study, 24 candidate reference genes were assessed in 10 tissue/organ samples of day 3 fifth-instar B. mori larvae using geNorm and NormFinder. The results revealed that, using the combination of the expression of BGIBMGA003186 and BGIBMGA008209 was the optimum choice for normalizing the expression data of the B. mori tissue/organ samples. The most stable gene, BGIBMGA003186, is recommended if just one reference gene is used. Moreover, the commonly used reference gene encoding cytoplasmic actin was the least appropriate reference gene of the samples investigated. The reliability of the selected reference genes was further confirmed by evaluating the expression profiles of two cathepsin genes. Our results may be useful for future studies involving the quantification of relative gene expression levels of different tissue/organ samples in B. mori. © 2018 Wiley Periodicals, Inc.

  16. The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

    Science.gov (United States)

    Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

    2006-10-01

    Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.

  17. GxGrare: gene-gene interaction analysis method for rare variants from high-throughput sequencing data.

    Science.gov (United States)

    Kwon, Minseok; Leem, Sangseob; Yoon, Joon; Park, Taesung

    2018-03-19

    With the rapid advancement of array-based genotyping techniques, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with common complex diseases. However, it has been shown that only a small proportion of the genetic etiology of complex diseases could be explained by the genetic factors identified from GWAS. This missing heritability could possibly be explained by gene-gene interaction (epistasis) and rare variants. There has been an exponential growth of gene-gene interaction analysis for common variants in terms of methodological developments and practical applications. Also, the recent advancement of high-throughput sequencing technologies makes it possible to conduct rare variant analysis. However, little progress has been made in gene-gene interaction analysis for rare variants. Here, we propose GxGrare which is a new gene-gene interaction method for the rare variants in the framework of the multifactor dimensionality reduction (MDR) analysis. The proposed method consists of three steps; 1) collapsing the rare variants, 2) MDR analysis for the collapsed rare variants, and 3) detect top candidate interaction pairs. GxGrare can be used for the detection of not only gene-gene interactions, but also interactions within a single gene. The proposed method is illustrated with 1080 whole exome sequencing data of the Korean population in order to identify causal gene-gene interaction for rare variants for type 2 diabetes. The proposed GxGrare performs well for gene-gene interaction detection with collapsing of rare variants. GxGrare is available at http://bibs.snu.ac.kr/software/gxgrare which contains simulation data and documentation. Supported operating systems include Linux and OS X.

  18. Selection and validation of reference genes for gene expression analysis in switchgrass (Panicum virgatum using quantitative real-time RT-PCR.

    Directory of Open Access Journals (Sweden)

    Jacinta Gimeno

    Full Text Available Switchgrass (Panicum virgatum has received a lot of attention as a forage and bioenergy crop during the past few years. Gene expression studies are in progress to improve new traits and develop new cultivars. Quantitative real time PCR (qRT-PCR has emerged as an important technique to study gene expression analysis. For accurate and reliable results, normalization of data with reference genes is essential. In this work, we evaluate the stability of expression of genes to use as reference for qRT-PCR in the grass P. virgatum. Eleven candidate reference genes, including eEF-1α, UBQ6, ACT12, TUB6, eIF-4a, GAPDH, SAMDC, TUA6, CYP5, U2AF, and FTSH4, were validated for qRT-PCR normalization in different plant tissues and under different stress conditions. The expression stability of these genes was verified by the use of two distinct algorithms, geNorm and NormFinder. Differences were observed after comparison of the ranking of the candidate reference genes identified by both programs but eEF-1α, eIF-4a, CYP5 and U2AF are ranked as the most stable genes in the samples sets under study. Both programs discard the use of SAMDC and TUA6 for normalization. Validation of the reference genes proposed by geNorm and NormFinder were performed by normalization of transcript abundance of a group of target genes in different samples. Results show similar expression patterns when the best reference genes selected by both programs were used but differences were detected in the transcript abundance of the target genes. Based on the above research, we recommend the use of different statistical algorithms to identify the best reference genes for expression data normalization. The best genes selected in this study will help to improve the quality of gene expression data in a wide variety of samples in switchgrass.

  19. Gene Ontology and KEGG Enrichment Analyses of Genes Related to Age-Related Macular Degeneration

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-01-01

    Full Text Available Identifying disease genes is one of the most important topics in biomedicine and may facilitate studies on the mechanisms underlying disease. Age-related macular degeneration (AMD is a serious eye disease; it typically affects older adults and results in a loss of vision due to retina damage. In this study, we attempt to develop an effective method for distinguishing AMD-related genes. Gene ontology and KEGG enrichment analyses of known AMD-related genes were performed, and a classification system was established. In detail, each gene was encoded into a vector by extracting enrichment scores of the gene set, including it and its direct neighbors in STRING, and gene ontology terms or KEGG pathways. Then certain feature-selection methods, including minimum redundancy maximum relevance and incremental feature selection, were adopted to extract key features for the classification system. As a result, 720 GO terms and 11 KEGG pathways were deemed the most important factors for predicting AMD-related genes.

  20. Nucleofection: A New Method for Cutaneous Gene Transfer?

    Directory of Open Access Journals (Sweden)

    Frank Jacobsen

    2006-01-01

    Full Text Available Background. Transfection efficacy after nonviral gene transfer in primary epithelial cells is limited. The aim of this study was to compare transfection efficacy of the recently available method of nucleofection with the established transfection reagent FuGENE6. Methods. Primary human keratinocytes (HKC, primary human fibroblasts (HFB, and a human keratinocyte cell line (HaCaT were transfected with reporter gene construct by FuGENE6 or Amaxa Nucleofector device. At corresponding time points, β-galactosidase expression, cell proliferation (MTT-Test, transduction efficiency (X-gal staining, cell morphology, and cytotoxicity (CASY were determined. Results. Transgene expression after nucleofection was significantly higher in HKC and HFB and detected earlier (3 h vs. 24 h than in FuGENE6. After lipofection 80%–90% of the cells remained proliferative without any influence on cell morphology. In contrast, nucleofection led to a decrease in keratinocyte cell size, with only 20%–42% proliferative cells. Conclusion. Related to the method-dependent increase of cytotoxicity, transgene expression after nucleofection was earlier and higher than after lipofection.

  1. Nucleofection: A New Method for Cutaneous Gene Transfer?

    Science.gov (United States)

    Jacobsen, Frank; Mertens-Rill, Janine; Beller, Juergen; Hirsch, Tobias; Daigeler, Adrien; Langer, Stefan; Lehnhardt, Marcus; Steinau, Hans-Ulrich; Steinstraesser, Lars

    2006-01-01

    Background. Transfection efficacy after nonviral gene transfer in primary epithelial cells is limited. The aim of this study was to compare transfection efficacy of the recently available method of nucleofection with the established transfection reagent FuGENE6. Methods. Primary human keratinocytes (HKC), primary human fibroblasts (HFB), and a human keratinocyte cell line (HaCaT) were transfected with reporter gene construct by FuGENE6 or Amaxa Nucleofector device. At corresponding time points, β-galactosidase expression, cell proliferation (MTT-Test), transduction efficiency (X-gal staining), cell morphology, and cytotoxicity (CASY) were determined. Results. Transgene expression after nucleofection was significantly higher in HKC and HFB and detected earlier (3 h vs. 24 h) than in FuGENE6. After lipofection 80%–90% of the cells remained proliferative without any influence on cell morphology. In contrast, nucleofection led to a decrease in keratinocyte cell size, with only 20%–42% proliferative cells. Conclusion. Related to the method-dependent increase of cytotoxicity, transgene expression after nucleofection was earlier and higher than after lipofection. PMID:17489014

  2. Selection and evaluation of potential reference genes for gene expression analysis in the brown planthopper, Nilaparvata lugens (Hemiptera: Delphacidae using reverse-transcription quantitative PCR.

    Directory of Open Access Journals (Sweden)

    Miao Yuan

    Full Text Available The brown planthopper (BPH, Nilaparvata lugens (Hemiptera, Delphacidae, is one of the most important rice pests. Abundant genetic studies on BPH have been conducted using reverse-transcription quantitative real-time PCR (qRT-PCR. Using qRT-PCR, the expression levels of target genes are calculated on the basis of endogenous controls. These genes need to be appropriately selected by experimentally assessing whether they are stably expressed under different conditions. However, such studies on potential reference genes in N. lugens are lacking. In this paper, we presented a systematic exploration of eight candidate reference genes in N. lugens, namely, actin 1 (ACT, muscle actin (MACT, ribosomal protein S11 (RPS11, ribosomal protein S15e (RPS15, alpha 2-tubulin (TUB, elongation factor 1 delta (EF, 18S ribosomal RNA (18S, and arginine kinase (AK and used four alternative methods (BestKeeper, geNorm, NormFinder, and the delta Ct method to evaluate the suitability of these genes as endogenous controls. We examined their expression levels among different experimental factors (developmental stage, body part, geographic population, temperature variation, pesticide exposure, diet change, and starvation following the MIQE (Minimum Information for publication of Quantitative real time PCR Experiments guidelines. Based on the results of RefFinder, which integrates four currently available major software programs to compare and rank the tested candidate reference genes, RPS15, RPS11, and TUB were found to be the most suitable reference genes in different developmental stages, body parts, and geographic populations, respectively. RPS15 was the most suitable gene under different temperature and diet conditions, while RPS11 was the most suitable gene under different pesticide exposure and starvation conditions. This work sheds light on establishing a standardized qRT-PCR procedure in N. lugens, and serves as a starting point for screening for reference genes for

  3. YamiPred: A novel evolutionary method for predicting pre-miRNAs and selecting relevant features

    KAUST Repository

    Kleftogiannis, Dimitrios A.; Theofilatos, Konstantinos; Likothanassis, Spiros; Mavroudi, Seferina

    2015-01-01

    MicroRNAs (miRNAs) are small non-coding RNAs, which play a significant role in gene regulation. Predicting miRNA genes is a challenging bioinformatics problem and existing experimental and computational methods fail to deal with it effectively. We developed YamiPred, an embedded classification method that combines the efficiency and robustness of Support Vector Machines (SVM) with Genetic Algorithms (GA) for feature selection and parameters optimization. YamiPred was tested in a new and realistic human dataset and was compared with state-of-the-art computational intelligence approaches and the prevalent SVM-based tools for miRNA prediction. Experimental results indicate that YamiPred outperforms existing approaches in terms of accuracy and of geometric mean of sensitivity and specificity. The embedded feature selection component selects a compact feature subset that contributes to the performance optimization. Further experimentation with this minimal feature subset has achieved very high classification performance and revealed the minimum number of samples required for developing a robust predictor. YamiPred also confirmed the important role of commonly used features such as entropy and enthalpy, and uncovered the significance of newly introduced features, such as %A-U aggregate nucleotide frequency and positional entropy. The best model trained on human data has successfully predicted pre-miRNAs to other organisms including the category of viruses.

  4. YamiPred: A novel evolutionary method for predicting pre-miRNAs and selecting relevant features

    KAUST Repository

    Kleftogiannis, Dimitrios A.

    2015-01-23

    MicroRNAs (miRNAs) are small non-coding RNAs, which play a significant role in gene regulation. Predicting miRNA genes is a challenging bioinformatics problem and existing experimental and computational methods fail to deal with it effectively. We developed YamiPred, an embedded classification method that combines the efficiency and robustness of Support Vector Machines (SVM) with Genetic Algorithms (GA) for feature selection and parameters optimization. YamiPred was tested in a new and realistic human dataset and was compared with state-of-the-art computational intelligence approaches and the prevalent SVM-based tools for miRNA prediction. Experimental results indicate that YamiPred outperforms existing approaches in terms of accuracy and of geometric mean of sensitivity and specificity. The embedded feature selection component selects a compact feature subset that contributes to the performance optimization. Further experimentation with this minimal feature subset has achieved very high classification performance and revealed the minimum number of samples required for developing a robust predictor. YamiPred also confirmed the important role of commonly used features such as entropy and enthalpy, and uncovered the significance of newly introduced features, such as %A-U aggregate nucleotide frequency and positional entropy. The best model trained on human data has successfully predicted pre-miRNAs to other organisms including the category of viruses.

  5. Biclustering methods: biological relevance and application in gene expression analysis.

    Directory of Open Access Journals (Sweden)

    Ali Oghabian

    Full Text Available DNA microarray technologies are used extensively to profile the expression levels of thousands of genes under various conditions, yielding extremely large data-matrices. Thus, analyzing this information and extracting biologically relevant knowledge becomes a considerable challenge. A classical approach for tackling this challenge is to use clustering (also known as one-way clustering methods where genes (or respectively samples are grouped together based on the similarity of their expression profiles across the set of all samples (or respectively genes. An alternative approach is to develop biclustering methods to identify local patterns in the data. These methods extract subgroups of genes that are co-expressed across only a subset of samples and may feature important biological or medical implications. In this study we evaluate 13 biclustering and 2 clustering (k-means and hierarchical methods. We use several approaches to compare their performance on two real gene expression data sets. For this purpose we apply four evaluation measures in our analysis: (1 we examine how well the considered (biclustering methods differentiate various sample types; (2 we evaluate how well the groups of genes discovered by the (biclustering methods are annotated with similar Gene Ontology categories; (3 we evaluate the capability of the methods to differentiate genes that are known to be specific to the particular sample types we study and (4 we compare the running time of the algorithms. In the end, we conclude that as long as the samples are well defined and annotated, the contamination of the samples is limited, and the samples are well replicated, biclustering methods such as Plaid and SAMBA are useful for discovering relevant subsets of genes and samples.

  6. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  7. Vancomycin gene selection in the microbiome of urban Rattus norvegicus from hospital environment

    DEFF Research Database (Denmark)

    Arn Hansen, Thomas; Joshi, Tejal; Larsen, Anders Rhod

    2016-01-01

    for the presence of antibiotic resistance genes. We show that despite the shared resistome within samples from the same geographic locations, samples from hospital area carry significantly abundant vancomycin resistance genes. The observed pattern is consistent with a selection for vancomycin genes in the R...

  8. Selection on plant male function genes identifies candidates for reproductive isolation of yellow monkeyflowers.

    Directory of Open Access Journals (Sweden)

    Jan E Aagaard

    Full Text Available Understanding the genetic basis of reproductive isolation promises insight into speciation and the origins of biological diversity. While progress has been made in identifying genes underlying barriers to reproduction that function after fertilization (post-zygotic isolation, we know much less about earlier acting pre-zygotic barriers. Of particular interest are barriers involved in mating and fertilization that can evolve extremely rapidly under sexual selection, suggesting they may play a prominent role in the initial stages of reproductive isolation. A significant challenge to the field of speciation genetics is developing new approaches for identification of candidate genes underlying these barriers, particularly among non-traditional model systems. We employ powerful proteomic and genomic strategies to study the genetic basis of conspecific pollen precedence, an important component of pre-zygotic reproductive isolation among yellow monkeyflowers (Mimulus spp. resulting from male pollen competition. We use isotopic labeling in combination with shotgun proteomics to identify more than 2,000 male function (pollen tube proteins within maternal reproductive structures (styles of M. guttatus flowers where pollen competition occurs. We then sequence array-captured pollen tube exomes from a large outcrossing population of M. guttatus, and identify those genes with evidence of selective sweeps or balancing selection consistent with their role in pollen competition. We also test for evidence of positive selection on these genes more broadly across yellow monkeyflowers, because a signal of adaptive divergence is a common feature of genes causing reproductive isolation. Together the molecular evolution studies identify 159 pollen tube proteins that are candidate genes for conspecific pollen precedence. Our work demonstrates how powerful proteomic and genomic tools can be readily adapted to non-traditional model systems, allowing for genome-wide screens

  9. A synbio approach for selection of highly expressed gene variants in Gram-positive bacteria

    DEFF Research Database (Denmark)

    Ferro, Roberto; Rennig, Maja; Hernández Rollán, Cristina

    2018-01-01

    with a long history in food fermentation. We have developed a synbio approach for increasing gene expression in two Gram-positive bacteria. First of all, the gene of interest was coupled to an antibiotic resistance gene to create a growth-based selection system. We then randomised the translation initiation...... region (TIR) preceding the gene of interest and selected clones that produced high protein titres, as judged by their ability to survive on high concentrations of antibiotic. Using this approach, we were able to significantly increase production of two industrially relevant proteins; sialidase in B....... subtilis and tyrosine ammonia lyase in L. lactis. Gram-positive bacteria are widely used to produce industrial enzymes. High titres are necessary to make the production economically feasible. The synbio approach presented here is a simple and inexpensive way to increase protein titres, which can be carried...

  10. Systematic differences in the response of genetic variation to pedigree and genome-based selection methods.

    Science.gov (United States)

    Heidaritabar, M; Vereijken, A; Muir, W M; Meuwissen, T; Cheng, H; Megens, H-J; Groenen, M A M; Bastiaansen, J W M

    2014-12-01

    Genomic selection (GS) is a DNA-based method of selecting for quantitative traits in animal and plant breeding, and offers a potentially superior alternative to traditional breeding methods that rely on pedigree and phenotype information. Using a 60 K SNP chip with markers spaced throughout the entire chicken genome, we compared the impact of GS and traditional BLUP (best linear unbiased prediction) selection methods applied side-by-side in three different lines of egg-laying chickens. Differences were demonstrated between methods, both at the level and genomic distribution of allele frequency changes. In all three lines, the average allele frequency changes were larger with GS, 0.056 0.064 and 0.066, compared with BLUP, 0.044, 0.045 and 0.036 for lines B1, B2 and W1, respectively. With BLUP, 35 selected regions (empirical P selected regions were identified. Empirical thresholds for local allele frequency changes were determined from gene dropping, and differed considerably between GS (0.167-0.198) and BLUP (0.105-0.126). Between lines, the genomic regions with large changes in allele frequencies showed limited overlap. Our results show that GS applies selection pressure much more locally than BLUP, resulting in larger allele frequency changes. With these results, novel insights into the nature of selection on quantitative traits have been gained and important questions regarding the long-term impact of GS are raised. The rapid changes to a part of the genetic architecture, while another part may not be selected, at least in the short term, require careful consideration, especially when selection occurs before phenotypes are observed.

  11. Natural selection in avian protein-coding genes expressed in brain.

    Science.gov (United States)

    Axelsson, Erik; Hultin-Rosenberg, Lina; Brandström, Mikael; Zwahlén, Martin; Clayton, David F; Ellegren, Hans

    2008-06-01

    The evolution of birds from theropod dinosaurs took place approximately 150 million years ago, and was associated with a number of specific adaptations that are still evident among extant birds, including feathers, song and extravagant secondary sexual characteristics. Knowledge about the molecular evolutionary background to such adaptations is lacking. Here, we analyse the evolution of > 5000 protein-coding gene sequences expressed in zebra finch brain by comparison to orthologous sequences in chicken. Mean d(N)/d(S) is 0.085 and genes with their maximal expression in the eye and central nervous system have the lowest mean d(N)/d(S) value, while those expressed in digestive and reproductive tissues exhibit the highest. We find that fast-evolving genes (those which have higher than expected rate of nonsynonymous substitution, indicative of adaptive evolution) are enriched for biological functions such as fertilization, muscle contraction, defence response, response to stress, wounding and endogenous stimulus, and cell death. After alignment to mammalian orthologues, we identify a catalogue of 228 genes that show a significantly higher rate of protein evolution in the two bird lineages than in mammals. These accelerated bird genes, representing candidates for avian-specific adaptations, include genes implicated in vocal learning and other cognitive processes. Moreover, colouration genes evolve faster in birds than in mammals, which may have been driven by sexual selection for extravagant plumage characteristics.

  12. [Construction of plant expression vectors with PMI gene as selection marker and their utilization in transformation of Salvia miltiorrhiza f. alba].

    Science.gov (United States)

    Tao, Ru; Zhang, You-Can; Fang, Qian; Shi, Ren-Jiu; Li, Yan-Ling; Huang, Lu-Qi; Hao, Gang-Ping

    2014-04-01

    To construct plant expression pCAMBIA1301-PMI by substituting PMI for hygromycin resistance gene in pCAMBIA1301 and obtain transgenic Salvia miltiorrhiza f. alba using PMI-mannose selection system. The 6-phosphomannose isomerase gene (PMI) of Escherichia coli was amplified by PCR. Sequence analysis showed that it shared 100% amino acids identities with the sequences of PMI genes isolates reported in the NCBI. Based on pCAMBIA1305, the plant expression pCAMBIA1305-PMI was constructed successfully by substituting PMI for hygromycin resistance gene in pCAMBIA1305. pCAMBIA1305-PMI was transformed into Agrobacterium tumefaciens LBA4404, and then the leaves of S. miltiorrhiza f. alba were inoculated in LBA4404 with pCAMBIA1305-PMI. Plant expression pCAMBIA1301-PMI was successfully constructed and the leaves of S. miltiorrhiza f. alba inoculated in LBA4404 with pCAMBIA1305-PMI were selected on medium supplemented with a combination of 20 g x L(-1) mannose and 10 g x L(-1) sucrose as a carbon source. The transformation efficiency rate was 23.7%. Genetic transformation was confirmed by PCR, indicating that a new method for obtaining transgenic S. miltiorrhiza f. alba plants was developed using PMI-mannose selection system.

  13. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E.; Re, Matteo

    2014-01-01

    Objective In the context of “network medicine”, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different “informativeness” embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Conclusions Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further

  14. A simple method for analyzing exome sequencing data shows distinct levels of nonsynonymous variation for human immune and nervous system genes.

    Directory of Open Access Journals (Sweden)

    Jan Freudenberg

    Full Text Available To measure the strength of natural selection that acts upon single nucleotide variants (SNVs in a set of human genes, we calculate the ratio between nonsynonymous SNVs (nsSNVs per nonsynonymous site and synonymous SNVs (sSNVs per synonymous site. We transform this ratio with a respective factor f that corrects for the bias of synonymous sites towards transitions in the genetic code and different mutation rates for transitions and transversions. This method approximates the relative density of nsSNVs (rdnsv in comparison with the neutral expectation as inferred from the density of sSNVs. Using SNVs from a diploid genome and 200 exomes, we apply our method to immune system genes (ISGs, nervous system genes (NSGs, randomly sampled genes (RSGs, and gene ontology annotated genes. The estimate of rdnsv in an individual exome is around 20% for NSGs and 30-40% for ISGs and RSGs. This smaller rdnsv of NSGs indicates overall stronger purifying selection. To quantify the relative shift of nsSNVs towards rare variants, we next fit a linear regression model to the estimates of rdnsv over different SNV allele frequency bins. The obtained regression models show a negative slope for NSGs, ISGs and RSGs, supporting an influence of purifying selection on the frequency spectrum of segregating nsSNVs. The y-intercept of the model predicts rdnsv for an allele frequency close to 0. This parameter can be interpreted as the proportion of nonsynonymous sites where mutations are tolerated to segregate with an allele frequency notably greater than 0 in the population, given the performed normalization of the observed nsSNV to sSNV ratio. A smaller y-intercept is displayed by NSGs, indicating more nonsynonymous sites under strong negative selection. This predicts more monogenically inherited or de-novo mutation diseases that affect the nervous system.

  15. Detecting loci under recent positive selection in dairy and beef cattle by combining different genome-wide scan methods.

    Directory of Open Access Journals (Sweden)

    Yuri Tani Utsunomiya

    Full Text Available As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different methodologies, it is strong evidence that the locus has been under past selection. In this paper, a straightforward frequentist approach based on the Stouffer Method to combine P-values across different tests for evidence of recent positive selection in common variations, as well as strategies for extracting biological information from the detected signals, were described and applied to high density single nucleotide polymorphism (SNP data generated from dairy and beef cattle (taurine and indicine. The ancestral Bovinae allele state of over 440,000 SNP is also reported. Using this combination of methods, highly significant (P<3.17×10(-7 population-specific sweeps pointing out to candidate genes and pathways that may be involved in beef and dairy production were identified. The most significant signal was found in the Cornichon homolog 3 gene (CNIH3 in Brown Swiss (P = 3.82×10(-12, and may be involved in the regulation of pre-ovulatory luteinizing hormone surge. Other putative pathways under selection are the glucolysis/gluconeogenesis, transcription machinery and chemokine/cytokine activity in Angus; calpain-calpastatin system and ribosome biogenesis in Brown Swiss; and gangliosides deposition in milk fat globules in Gyr. The composite method, combined with the strategies applied to retrieve functional information, may be a useful tool for surveying genome-wide selective sweeps and providing insights in to the source of selection.

  16. EEG feature selection method based on decision tree.

    Science.gov (United States)

    Duan, Lijuan; Ge, Hui; Ma, Wei; Miao, Jun

    2015-01-01

    This paper aims to solve automated feature selection problem in brain computer interface (BCI). In order to automate feature selection process, we proposed a novel EEG feature selection method based on decision tree (DT). During the electroencephalogram (EEG) signal processing, a feature extraction method based on principle component analysis (PCA) was used, and the selection process based on decision tree was performed by searching the feature space and automatically selecting optimal features. Considering that EEG signals are a series of non-linear signals, a generalized linear classifier named support vector machine (SVM) was chosen. In order to test the validity of the proposed method, we applied the EEG feature selection method based on decision tree to BCI Competition II datasets Ia, and the experiment showed encouraging results.

  17. Selection of reliable reference genes in Caenorhabditis elegans for analysis of nanotoxicity.

    Directory of Open Access Journals (Sweden)

    Yanqiong Zhang

    Full Text Available Despite rapid development and application of a wide range of manufactured metal oxide nanoparticles (NPs, the understanding of potential risks of using NPs is less completed, especially at the molecular level. The nematode Caenorhabditis elegans (C.elegans has been emerging as an environmental model to study the molecular mechanism of environmental contaminations, using standard genetic tools such as the real-time quantitative PCR (RT-qPCR. The most important factor that may affect the accuracy of RT-qPCR is to choose appropriate genes for normalization. In this study, we selected 13 reference gene candidates (act-1, cdc-42, pmp-3, eif-3.C, actin, act-2, csq-1, Y45F10D.4, tba-1, mdh-1, ama-1, F35G12.2, and rbd-1 to test their expression stability under different doses of nano-copper oxide (CuO 0, 1, 10, and 50 µg/mL using RT-qPCR. Four algorithms, geNorm, NormFinder, BestKeeper, and the comparative ΔCt method, were employed to evaluate these 13 candidates expressions. As a result, tba-1, Y45F10D.4 and pmp-3 were the most reliable, which may be used as reference genes in future study of nanoparticle-induced genetic response using C.elegans.

  18. PSP: rapid identification of orthologous coding genes under positive selection across multiple closely related prokaryotic genomes.

    Science.gov (United States)

    Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping

    2013-12-27

    With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaption to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, which is designated as PSP (Positive Selection analysis for Prokaryotic genomes) for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at the multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and identify the genes undergoing positive selection, which may play key roles in the interactions of host-pathogen and/or environmental adaptation.

  19. Selection signature in domesticated animals.

    Science.gov (United States)

    Pan, Zhang-yuan; He, Xiao-yun; Wang, Xiang-yu; Guo, Xiao-fei; Cao, Xiao-han; Hu, Wen-ping; Di, Ran; Liu, Qiu-yue; Chu, Ming-xing

    2016-12-20

    Domesticated animals play an important role in the life of humanity. All these domesticated animals undergo same process, first domesticated from wild animals, then after long time natural and artificial selection, formed various breeds that adapted to the local environment and human needs. In this process, domestication, natural and artificial selection will leave the selection signal in the genome. The research on these selection signals can find functional genes directly, is one of the most important strategies in screening functional genes. The current studies of selection signal have been performed in pigs, chickens, cattle, sheep, goats, dogs and other domestic animals, and found a great deal of functional genes. This paper provided an overview of the types and the detected methods of selection signal, and outlined researches of selection signal in domestic animals, and discussed the key issues in selection signal analysis and its prospects.

  20. A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules

    Science.gov (United States)

    Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried

    2015-01-01

    A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis. PMID:26355961

  1. Shifts in Selective Pressures on Snake Phototransduction Genes Associated with Photoreceptor Transmutation and Dim-Light Ancestry.

    Science.gov (United States)

    Schott, Ryan K; Van Nynatten, Alexander; Card, Daren C; Castoe, Todd A; S W Chang, Belinda

    2018-06-01

    The visual systems of snakes are heavily modified relative to other squamates, a condition often thought to reflect their fossorial origins. Further modifications are seen in caenophidian snakes, where evolutionary transitions between rod and cone photoreceptors, termed photoreceptor transmutations, have occurred in many lineages. Little previous work, however, has focused on the molecular evolutionary underpinnings of these morphological changes. To address this, we sequenced seven snake eye transcriptomes and utilized new whole-genome and targeted capture sequencing data. We used these data to analyze gene loss and shifts in selection pressures in phototransduction genes that may be associated with snake evolutionary origins and photoreceptor transmutation. We identified the surprising loss of rhodopsin kinase (GRK1), despite a low degree of gene loss overall and a lack of relaxed selection early during snake evolution. These results provide some of the first evolutionary genomic corroboration for a dim-light ancestor that lacks strong fossorial adaptations. Our results also indicate that snakes with photoreceptor transmutation experienced significantly different selection pressures from other reptiles. Significant positive selection was found primarily in cone-specific genes, but not rod-specific genes, contrary to our expectations. These results reveal potential molecular adaptations associated with photoreceptor transmutation and also highlight unappreciated functional differences between rod- and cone-specific phototransduction proteins. This intriguing example of snake visual system evolution illustrates how the underlying molecular components of a complex system can be reshaped in response to changing selection pressures.

  2. Identification of landscape features influencing gene flow: How useful are habitat selection models?

    Science.gov (United States)

    Gretchen H. Roffler; Michael K. Schwartz; Kristine Pilgrim; Sandra L. Talbot; George K. Sage; Layne G. Adams; Gordon Luikart

    2016-01-01

    Understanding how dispersal patterns are influenced by landscape heterogeneity is critical for modeling species connectivity. Resource selection function (RSF) models are increasingly used in landscape genetics approaches. However, because the ecological factors that drive habitat selection may be different from those influencing dispersal and gene flow, it is...

  3. Likelihood analysis of the chalcone synthase genes suggests the role of positive selection in morning glories (Ipomoea).

    Science.gov (United States)

    Yang, Ji; Gu, Hongya; Yang, Ziheng

    2004-01-01

    Chalcone synthase (CHS) is a key enzyme in the biosynthesis of flavonoides, which are important for the pigmentation of flowers and act as attractants to pollinators. Genes encoding CHS constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. In morning glories (Ipomoea), five functional CHS genes (A-E) have been described. Phylogenetic analysis of the Ipomoea CHS gene family revealed that CHS A, B, and C experienced accelerated rates of amino acid substitution relative to CHS D and E. To examine whether the CHS genes of the morning glories underwent adaptive evolution, maximum-likelihood models of codon substitution were used to analyze the functional sequences in the Ipomoea CHS gene family. These models used the nonsynonymous/synonymous rate ratio (omega = d(N)/ d(S)) as an indicator of selective pressure and allowed the ratio to vary among lineages or sites. Likelihood ratio test suggested significant variation in selection pressure among amino acid sites, with a small proportion of them detected to be under positive selection along the branches ancestral to CHS A, B, and C. Positive Darwinian selection appears to have promoted the divergence of subfamily ABC and subfamily DE and is at least partially responsible for a rate increase following gene duplication.

  4. Assessment of ptxD gene as an alternative selectable marker for Agrobacterium-mediated maize transformation.

    Science.gov (United States)

    Nahampun, Hartinio N; López-Arredondo, Damar; Xu, Xing; Herrera-Estrella, Luis; Wang, Kan

    2016-05-01

    Bacterial phosphite oxidoreductase gene and chemical phosphite can be used as a selection system for Agrobacterium -mediated maize transformation. Application of phosphite (Phi) on plants can interfere the plant metabolic system leading to stunted growth and lethality. On the other hand, ectopic expression of the ptxD gene in tobacco and Arabidopsis allowed plants to grow in media with Phi as the sole phosphorous source. The phosphite oxidoreductase (PTXD) enzyme catalyzes the conversion of Phi into phosphate (Pi) that can then be metabolized by plants and utilized as their essential phosphorous source. Here we assess an alternative selectable marker based on a bacterial ptxD gene for Agrobacterium-mediated maize transformation. We compared the transformation frequencies of maize using either the ptxD/Phi selection system or a standard herbicide bar/bialaphos selection system. Two maize genotypes, a transformation amenable hybrid Hi II and an inbred B104, were tested. Transgene presence, insertion copy numbers, and ptxD transcript levels were analyzed and compared. This work demonstrates that the ptxD/Phi selection system can be used for Agrobacterium-mediated maize transformation of both type I and type II callus culture and achieve a comparable frequency as that of the herbicide bar/bialaphos selection system.

  5. IMPROVEMENT METHOD OF GENE TRANSFER IN Kappaphycus alvarezii

    Directory of Open Access Journals (Sweden)

    St. Hidayah Triana

    2016-11-01

    Full Text Available Method of foreign gene transfer in red seaweed Kappaphycus alvarezii has been reported, however, li-mited number of transgenic F0 (broodstock was obtained. This study was conducted to improve the method of gene transfer mediated by Agrobacterium tumefaciens in order to obtain high percentage of K. alvarezii transgenic. Superoxide dismutase gene from Melastoma malabatrichum (MmCu/Zn-SOD was used as model towards increasing adaptability of K. alvarezii to environmental stress. The treat-ments were the culture media and recovery duration, and each treatment consisted of three replica-tions. The best method was co-cultivation using liquid media, then recovery was conducted in liquid media for 10 days. That treatment allowed higher transformation percentage (90%, regeneration effi-ciency (90%, putative bud efficiency (100%, number of buds and explants sprouted (100% and transgenic explants (100%. The transgenic explants showed an amplification PCR product of Mm-Cu/Zn-SOD gene fragment, whereas the non-transgenic explants showed no amplification product.  All results revealed that suitable method of transgenesis for K. alvarezii has been developed. Keywords:       Agrobacterium tumefaciens, culture media, Kappaphycus alvarezii, recovery duration, transformation

  6. Selection and Verification of Candidate Reference Genes for Mature MicroRNA Expression by Quantitative RT-PCR in the Tea Plant (Camellia sinensis

    Directory of Open Access Journals (Sweden)

    Hui Song

    2016-05-01

    Full Text Available Quantitative reverse transcription-polymerase chain reaction (qRT-PCR is a rapid and sensitive method for analyzing microRNA (miRNA expression. However, accurate qRT-PCR results depend on the selection of reliable reference genes as internal positive controls. To date, few studies have identified reliable reference genes for differential expression analysis of miRNAs among tissues, and among experimental conditions in plants. In this study, three miRNAs and four non-coding small RNAs (ncRNA were selected as reference candidates, and the stability of their expression was evaluated among different tissues and under different experimental conditions in the tea plant (Camellia sinensis using the geNorm and NormFinder programs. It was shown that miR159a was the best single reference gene in the bud to the fifth leaf, 5S rRNA was the most suitable gene in different organs, miR6149 was the most stable gene when the leaves were attacked by Ectropis oblique and U4, miR5368n and miR159a were the best genes when the leaves were treated by methyl jasmonate (MeJA, salicylic acid (SA and abscisic acid (ABA, respectively. Our results provide suitable reference genes for future investigations on miRNA functions in tea plants.

  7. The joint effects of background selection and genetic recombination on local gene genealogies.

    Science.gov (United States)

    Zeng, Kai; Charlesworth, Brian

    2011-09-01

    Background selection, the effects of the continual removal of deleterious mutations by natural selection on variability at linked sites, is potentially a major determinant of DNA sequence variability. However, the joint effects of background selection and genetic recombination on the shape of the neutral gene genealogy have proved hard to study analytically. The only existing formula concerns the mean coalescent time for a pair of alleles, making it difficult to assess the importance of background selection from genome-wide data on sequence polymorphism. Here we develop a structured coalescent model of background selection with recombination and implement it in a computer program that efficiently generates neutral gene genealogies for an arbitrary sample size. We check the validity of the structured coalescent model against forward-in-time simulations and show that it accurately captures the effects of background selection. The model produces more accurate predictions of the mean coalescent time than the existing formula and supports the conclusion that the effect of background selection is greater in the interior of a deleterious region than at its boundaries. The level of linkage disequilibrium between sites is elevated by background selection, to an extent that is well summarized by a change in effective population size. The structured coalescent model is readily extendable to more realistic situations and should prove useful for analyzing genome-wide polymorphism data.

  8. Efficacious and safe tissue-selective controlled gene therapy approaches for the cornea.

    Directory of Open Access Journals (Sweden)

    Rajiv R Mohan

    2011-04-01

    Full Text Available Untargeted and uncontrolled gene delivery is a major cause of gene therapy failure. This study aimed to define efficient and safe tissue-selective targeted gene therapy approaches for delivering genes into keratocytes of the cornea in vivo using a normal or diseased rabbit model. New Zealand White rabbits, adeno-associated virus serotype 5 (AAV5, and a minimally invasive hair-dryer based vector-delivery technique were used. Fifty microliters of AAV5 titer (6.5×10(12 vg/ml expressing green fluorescent protein gene (GFP was topically applied onto normal or diseased (fibrotic or neovascularized rabbit corneas for 2-minutes with a custom vector-delivery technique. Corneal fibrosis and neovascularization in rabbit eyes were induced with photorefractive keratectomy using excimer laser and VEGF (630 ng using micropocket assay, respectively. Slit-lamp biomicroscopy and immunocytochemistry were used to confirm fibrosis and neovascularization in rabbit corneas. The levels, location and duration of delivered-GFP gene expression in the rabbit stroma were measured with immunocytochemistry and/or western blotting. Slot-blot measured delivered-GFP gene copy number. Confocal microscopy performed in whole-mounts of cornea and thick corneal sections determined geometric and spatial localization of delivered-GFP in three-dimensional arrangement. AAV5 toxicity and safety were evaluated with clinical eye exam, stereomicroscopy, slit-lamp biomicroscopy, and H&E staining. A single 2-minute AAV5 topical application via custom delivery-technique efficiently and selectively transduced keratocytes in the anterior stroma of normal and diseased rabbit corneas as evident from immunocytochemistry and confocal microscopy. Transgene expression was first detected at day 3, peaked at day 7, and was maintained up to 16 weeks (longest tested time point. Clinical and slit-lamp eye examination in live rabbits and H&E staining did not reveal any significant changes between AAV5

  9. Disentangling the effects of selection and loss bias on gene dynamics

    NARCIS (Netherlands)

    Iranzo, J.; Cuesta, J.A.; Manrubia, S.; Katsnelson, M.I.; Koonin, E.V.

    2017-01-01

    We combine mathematical modeling of genome evolution with comparative analysis of prokaryotic genomes to estimate the relative contributions of selection and intrinsic loss bias to the evolution of different functional classes of genes and mobile genetic elements (MGE). An exact solution for the

  10. AMES: Towards an Agile Method for ERP Selection

    OpenAIRE

    Juell-Skielse, Gustaf; Nilsson, Anders G.; Nordqvist, Andreas; Westergren, Mattias

    2012-01-01

    Conventional on-premise installations of ERP are now rapidly being replaced by ERP as service. Although ERP becomes more accessible and no longer requires local infrastructure, current selection methods do not take full advantage of the provided agility. In this paper we present AMES (Agile Method for ERP Selection), a novel method for ERP selection which better utilizes the strengths of service oriented ERP. AMES is designed to shorten lead time for selection, support identification of essen...

  11. MAINTENANCE OF ECOLOGICALLY SIGNIFICANT GENETIC VARIATION IN THE TIGER SWALLOWTAIL BUTTERFLY THROUGH DIFFERENTIAL SELECTION AND GENE FLOW.

    Science.gov (United States)

    Bossart, J L; Scriber, J M

    1995-12-01

    Differential selection in a heterogeneous environment is thought to promote the maintenance of ecologically significant genetic variation. Variation is maintained when selection is counterbalanced by the homogenizing effects of gene flow and random mating. In this study, we examine the relative importance of differential selection and gene flow in maintaining genetic variation in Papilio glaucus. Differential selection on traits contributing to successful use of host plants (oviposition preference and larval performance) was assessed by comparing the responses of southern Ohio, north central Georgia, and southern Florida populations of P. glaucus to three hosts: Liriodendron tulipifera, Magnolia virginiana, and Prunus serotina. Gene flow among populations was estimated using allozyme frequencies from nine polymorphic loci. Significant genetic differentiation was observed among populations for both oviposition preference and larval performance. This differentiation was interpreted to be the result of selection acting on Florida P. glaucus for enhanced use of Magnolia, the prevalent host in Florida. In contrast, no evidence of population differentiation was revealed by allozyme frequencies. F ST -values were very small and Nm, an estimate of the relative strengths of gene flow and genetic drift, was large, indicating that genetic exchange among P. glaucus populations is relatively unrestricted. The contrasting patterns of spatial differentiation for host-use traits and lack of differentiation for electrophoretically detectable variation implies that differential selection among populations will be counterbalanced by gene flow, thereby maintaining genetic variation for host-use traits. © 1995 The Society for the Study of Evolution.

  12. Lack of direct evidence for natural selection at the candidate thrifty gene locus, PPARGC1A.

    Science.gov (United States)

    Cadzow, Murray; Merriman, Tony R; Boocock, James; Dalbeth, Nicola; Stamp, Lisa K; Black, Michael A; Visscher, Peter M; Wilcox, Phillip L

    2016-11-15

    The gene PPARGC1A, in particular the Gly482Ser variant (rs8192678), had been proposed to be subject to natural selection, particularly in recent progenitors of extant Polynesian populations. Reasons include high levels of population differentiation and increased frequencies of the derived type 2 diabetes (T2D) risk 482Ser allele, and association with body mass index (BMI) in a small Tongan population. However, no direct statistical tests for selection have been applied. Using a range of Polynesian populations (Tongan, Māori, Samoan) we re-examined evidence for association between Gly482Ser with T2D and BMI as well as gout. Using also Asian, European, and African 1000 Genome Project samples a range of statistical tests for selection (F ST , integrated haplotype score (iHS), cross population extended haplotype homozygosity (XP-EHH), Tajima's D and Fay and Wu's H) were conducted on the PPARGC1A locus. No statistically significant evidence for association between Gly482Ser and any of BMI, T2D or gout was found. Population differentiation (F ST ) was smallest between Asian and Pacific populations (New Zealand Māori ≤ 0.35, Samoan ≤ 0.20). When compared to European (New Zealand Māori ≤ 0.40, Samoan ≤ 0.25) or African populations (New Zealand Māori ≤ 0.80, Samoan ≤ 0.66) this differentiation was larger. We did not find any strong evidence for departure from neutral evolution at this locus when applying any of the other statistical tests for selection. However, using the same analytical methods, we found evidence for selection in specific populations at previously identified loci, indicating that lack of selection was the most likely explanation for the lack of evidence of selection in PPARGC1A. We conclude that there is no compelling evidence for selection at this locus, and that this gene should not be considered a candidate thrifty gene locus in Pacific populations. High levels of population differentiation at this locus and the

  13. Sex linkage, sex-specific selection, and the role of recombination in the evolution of sexually dimorphic gene expression.

    Science.gov (United States)

    Connallon, Tim; Clark, Andrew G

    2010-12-01

    Sex-biased genes--genes that are differentially expressed within males and females--are nonrandomly distributed across animal genomes, with sex chromosomes and autosomes often carrying markedly different concentrations of male- and female-biased genes. These linkage patterns are often gene- and lineage-dependent, differing between functional genetic categories and between species. Although sex-specific selection is often hypothesized to shape the evolution of sex-linked and autosomal gene content, population genetics theory has yet to account for many of the gene- and lineage-specific idiosyncrasies emerging from the empirical literature. With the goal of improving the connection between evolutionary theory and a rapidly growing body of genome-wide empirical studies, we extend previous population genetics theory of sex-specific selection by developing and analyzing a biologically informed model that incorporates sex linkage, pleiotropy, recombination, and epistasis, factors that are likely to vary between genes and between species. Our results demonstrate that sex-specific selection and sex-specific recombination rates can generate, and are compatible with, the gene- and species-specific linkage patterns reported in the genomics literature. The theory suggests that sexual selection may strongly influence the architectures of animal genomes, as well as the chromosomal distribution of fixed substitutions underlying sexually dimorphic traits. © 2010 The Author(s). Evolution© 2010 The Society for the Study of Evolution.

  14. Dominant negative selection of vaccinia virus using a thymidine kinase/thymidylate kinase fusion gene and the prodrug azidothymidine

    International Nuclear Information System (INIS)

    Holzer, Georg W.; Mayrhofer, Josef; Gritschenberger, Werner; Falkner, Falko G.

    2005-01-01

    The Escherichia coli thymidine kinase/thymidylate kinase (tk/tmk) fusion gene encodes an enzyme that efficiently converts the prodrug 3'-azido-2',3'-dideoxythymidine (AZT) into its toxic triphosphate derivative, a substance which stops DNA chain elongation. Integration of this marker gene into vaccinia virus that normally is not inhibited by AZT allowed the establishment of a powerful selection procedure for recombinant viruses. In contrast to the conventional vaccinia thymidine kinase (tk) selection that is performed in tk-negative cell lines, AZT selection can be performed in normal (tk-positive) cell lines. The technique is especially useful for the generation of replication-deficient vaccinia viruses and may also be used for gene knock-out studies of essential vaccinia genes

  15. Selection of housekeeping genes and demonstration of RNAi in cotton leafhopper, Amrasca biguttula biguttula (Ishida)

    Science.gov (United States)

    Gupta, Mridula; Pandher, Suneet; Kaur, Gurmeet; Rathore, Pankaj; Palli, Subba Reddy

    2018-01-01

    Amrasca biguttula biguttula (Ishida) commonly known as cotton leafhopper is a severe pest of cotton and okra. Not much is known on this insect at molecular level due to lack of genomic and transcriptomic data. To prepare for functional genomic studies in this insect, we evaluated 15 common housekeeping genes (Tub, B-Tub, EF alpha, GADPH, UbiCF, RP13, Ubiq, G3PD, VATPase, Actin, 18s, 28s, TATA, ETF, SOD and Cytolytic actin) during different developmental stages and under starvation stress. We selected early (1st and 2nd), late (3rd and 4th) stage nymphs and adults for identification of stable housekeeping genes using geNorm, NormFinder, BestKeeper and RefFinder software. Based on the different algorithms, RP13 and VATPase are identified as the most suitable reference genes for quantification of gene expression by reverse transcriptase quantitative PCR (RT-qPCR). Based on RefFinder which comprehended the results of three algorithms, RP13 in adults, Tubulin (Tub) in late nymphs, 28S in early nymph and UbiCF under starvation stress were identified as the most stable genes. We also developed methods for feeding double-stranded RNA (dsRNA) incorporated in the diet. Feeding dsRNA targeting Snf7, IAP, AQP1, and VATPase caused 56.17–77.12% knockdown of targeted genes compared to control and 16 to 48% mortality of treated insects when compared to control. PMID:29329327

  16. Selection of housekeeping genes and demonstration of RNAi in cotton leafhopper, Amrasca biguttula biguttula (Ishida.

    Directory of Open Access Journals (Sweden)

    Satnam Singh

    Full Text Available Amrasca biguttula biguttula (Ishida commonly known as cotton leafhopper is a severe pest of cotton and okra. Not much is known on this insect at molecular level due to lack of genomic and transcriptomic data. To prepare for functional genomic studies in this insect, we evaluated 15 common housekeeping genes (Tub, B-Tub, EF alpha, GADPH, UbiCF, RP13, Ubiq, G3PD, VATPase, Actin, 18s, 28s, TATA, ETF, SOD and Cytolytic actin during different developmental stages and under starvation stress. We selected early (1st and 2nd, late (3rd and 4th stage nymphs and adults for identification of stable housekeeping genes using geNorm, NormFinder, BestKeeper and RefFinder software. Based on the different algorithms, RP13 and VATPase are identified as the most suitable reference genes for quantification of gene expression by reverse transcriptase quantitative PCR (RT-qPCR. Based on RefFinder which comprehended the results of three algorithms, RP13 in adults, Tubulin (Tub in late nymphs, 28S in early nymph and UbiCF under starvation stress were identified as the most stable genes. We also developed methods for feeding double-stranded RNA (dsRNA incorporated in the diet. Feeding dsRNA targeting Snf7, IAP, AQP1, and VATPase caused 56.17-77.12% knockdown of targeted genes compared to control and 16 to 48% mortality of treated insects when compared to control.

  17. Purifying Selection Maintains Dosage-Sensitive Genes during Degeneration of the Threespine Stickleback Y Chromosome

    Science.gov (United States)

    White, Michael A.; Kitano, Jun; Peichel, Catherine L.

    2015-01-01

    Sex chromosomes are subject to unique evolutionary forces that cause suppression of recombination, leading to sequence degeneration and the formation of heteromorphic chromosome pairs (i.e., XY or ZW). Although progress has been made in characterizing the outcomes of these evolutionary processes on vertebrate sex chromosomes, it is still unclear how recombination suppression and sequence divergence typically occur and how gene dosage imbalances are resolved in the heterogametic sex. The threespine stickleback fish (Gasterosteus aculeatus) is a powerful model system to explore vertebrate sex chromosome evolution, as it possesses an XY sex chromosome pair at relatively early stages of differentiation. Using a combination of whole-genome and transcriptome sequencing, we characterized sequence evolution and gene expression across the sex chromosomes. We uncovered two distinct evolutionary strata that correspond with known structural rearrangements on the Y chromosome. In the oldest stratum, only a handful of genes remain, and these genes are under strong purifying selection. By comparing sex-linked gene expression with expression of autosomal orthologs in an outgroup, we show that dosage compensation has not evolved in threespine sticklebacks through upregulation of the X chromosome in males. Instead, in the oldest stratum, the genes that still possess a Y chromosome allele are enriched for genes predicted to be dosage sensitive in mammals and yeast. Our results suggest that dosage imbalances may have been avoided at haploinsufficient genes by retaining function of the Y chromosome allele through strong purifying selection. PMID:25818858

  18. [Gene method for inconsistent hydrological frequency calculation. 2: Diagnosis system of hydrological genes and method of hydrological moment genes with inconsistent characters].

    Science.gov (United States)

    Xie, Ping; Zhao, Jiang Yan; Wu, Zi Yi; Sang, Yan Fang; Chen, Jie; Li, Bin Bin; Gu, Hai Ting

    2018-04-01

    The analysis of inconsistent hydrological series is one of the major problems that should be solved for engineering hydrological calculation in changing environment. In this study, the diffe-rences of non-consistency and non-stationarity were analyzed from the perspective of composition of hydrological series. The inconsistent hydrological phenomena were generalized into hydrological processes with inheritance, variability and evolution characteristics or regulations. Furthermore, the hydrological genes were identified following the theory of biological genes, while their inheritance bases and variability bases were determined based on composition of hydrological series under diffe-rent time scales. To identify and test the components of hydrological genes, we constructed a diagnosis system of hydrological genes. With the P-3 distribution as an example, we described the process of construction and expression of the moment genes to illustrate the inheritance, variability and evolution principles of hydrological genes. With the annual minimum 1-month runoff series of Yunjinghong station in Lancangjiang River basin as an example, we verified the feasibility and practicability of hydrological gene theory for the calculation of inconsistent hydrological frequency. The results showed that the method could be used to reveal the evolution of inconsistent hydrological series. Therefore, it provided a new research pathway for engineering hydrological calculation in changing environment and an essential reference for the assessment of water security.

  19. Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space

    Science.gov (United States)

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R.; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J.

    2013-01-01

    For the vast majority of species – including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. PMID:23592960

  20. Combinatorial pooling enables selective sequencing of the barley gene space.

    Directory of Open Access Journals (Sweden)

    Stefano Lonardi

    2013-04-01

    Full Text Available For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  1. Combinatorial pooling enables selective sequencing of the barley gene space.

    Science.gov (United States)

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J

    2013-04-01

    For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  2. HisB as novel selection marker for gene targeting approaches in Aspergillus niger.

    Science.gov (United States)

    Fiedler, Markus R M; Gensheimer, Tarek; Kubisch, Christin; Meyer, Vera

    2017-03-08

    For Aspergillus niger, a broad set of auxotrophic and dominant resistance markers is available. However, only few offer targeted modification of a gene of interest into or at a genomic locus of choice, which hampers functional genomics studies. We thus aimed to extend the available set by generating a histidine auxotrophic strain with a characterized hisB locus for targeted gene integration and deletion in A. niger. A histidine-auxotrophic strain was established via disruption of the A. niger hisB gene by using the counterselectable pyrG marker. After curing, a hisB - , pyrG - strain was obtained, which served as recipient strain for further studies. We show here that both hisB orthologs from A. nidulans and A. niger can be used to reestablish histidine prototrophy in this recipient strain. Whereas the hisB gene from A. nidulans was suitable for efficient gene targeting at different loci in A. niger, the hisB gene from A. niger allowed efficient integration of a Tet-on driven luciferase reporter construct at the endogenous non-functional hisB locus. Subsequent analysis of the luciferase activity revealed that the hisB locus is tight under non-inducing conditions and allows even higher luciferase expression levels compared to the pyrG integration locus. Taken together, we provide here an alternative selection marker for A. niger, hisB, which allows efficient homologous integration rates as well as high expression levels which compare favorably to the well-established pyrG selection marker.

  3. Transcriptome-wide selection of a reliable set of reference genes for gene expression studies in potato cyst nematodes (Globodera spp.).

    Science.gov (United States)

    Sabeh, Michael; Duceppe, Marc-Olivier; St-Arnaud, Marc; Mimee, Benjamin

    2018-01-01

    Relative gene expression analyses by qRT-PCR (quantitative reverse transcription PCR) require an internal control to normalize the expression data of genes of interest and eliminate the unwanted variation introduced by sample preparation. A perfect reference gene should have a constant expression level under all the experimental conditions. However, the same few housekeeping genes selected from the literature or successfully used in previous unrelated experiments are often routinely used in new conditions without proper validation of their stability across treatments. The advent of RNA-Seq and the availability of public datasets for numerous organisms are opening the way to finding better reference genes for expression studies. Globodera rostochiensis is a plant-parasitic nematode that is particularly yield-limiting for potato. The aim of our study was to identify a reliable set of reference genes to study G. rostochiensis gene expression. Gene expression levels from an RNA-Seq database were used to identify putative reference genes and were validated with qRT-PCR analysis. Three genes, GR, PMP-3, and aaRS, were found to be very stable within the experimental conditions of this study and are proposed as reference genes for future work.

  4. Inferring Gene Regulatory Networks Using Conditional Regulation Pattern to Guide Candidate Genes.

    Directory of Open Access Journals (Sweden)

    Fei Xiao

    Full Text Available Combining path consistency (PC algorithms with conditional mutual information (CMI are widely used in reconstruction of gene regulatory networks. CMI has many advantages over Pearson correlation coefficient in measuring non-linear dependence to infer gene regulatory networks. It can also discriminate the direct regulations from indirect ones. However, it is still a challenge to select the conditional genes in an optimal way, which affects the performance and computation complexity of the PC algorithm. In this study, we develop a novel conditional mutual information-based algorithm, namely RPNI (Regulation Pattern based Network Inference, to infer gene regulatory networks. For conditional gene selection, we define the co-regulation pattern, indirect-regulation pattern and mixture-regulation pattern as three candidate patterns to guide the selection of candidate genes. To demonstrate the potential of our algorithm, we apply it to gene expression data from DREAM challenge. Experimental results show that RPNI outperforms existing conditional mutual information-based methods in both accuracy and time complexity for different sizes of gene samples. Furthermore, the robustness of our algorithm is demonstrated by noisy interference analysis using different types of noise.

  5. Gene set analysis: limitations in popular existing methods and proposed improvements.

    Science.gov (United States)

    Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

    2014-10-01

    Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-value that requires large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistics is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. Evidence of Positive Selection of Aquaporins Genes from Pontoporia blainvillei during the Evolutionary Process of Cetaceans.

    Directory of Open Access Journals (Sweden)

    Simone Lima São Pedro

    Full Text Available Marine mammals are well adapted to their hyperosmotic environment. Several morphological and physiological adaptations for water conservation and salt excretion are known to be present in cetaceans, being responsible for regulating salt balance. However, most previous studies have focused on the unique renal physiology of marine mammals, but the molecular bases of these mechanisms remain poorly explored. Many genes have been identified to be involved in osmotic regulation, including the aquaporins. Considering that aquaporin genes were potentially subject to strong selective pressure, the aim of this study was to analyze the molecular evolution of seven aquaporin genes (AQP1, AQP2, AQP3, AQP4, AQP6, AQP7, and AQP9 comparing the lineages of cetaceans and terrestrial mammals.Our results demonstrated strong positive selection in cetacean-specific lineages acting only in the gene for AQP2 (amino acids 23, 83, 107,179, 180, 181, 182, whereas no selection was observed in terrestrial mammalian lineages. We also analyzed the changes in the 3D structure of the aquaporin 2 protein. Signs of strong positive selection in AQP2 sites 179, 180, 181, and 182 were unexpectedly identified only in the baiji lineage, which was the only river dolphin examined in this study. Positive selection in aquaporins AQP1 (45, AQP4 (74, AQP7 (342, 343, 356 was detected in cetaceans and artiodactyls, suggesting that these events are not related to maintaining water and electrolyte homeostasis in seawater.Our results suggest that the AQP2 gene might reflect different selective pressures in maintaining water balance in cetaceans, contributing to the passage from the terrestrial environment to the aquatic. Further studies are necessary, especially those including other freshwater dolphins, who exhibit osmoregulatory mechanisms different from those of marine cetaceans for the same essential task of maintaining serum electrolyte balance.

  7. Gene transfer preferentially selects MHC class I positive tumour cells and enhances tumour immunogenicity.

    Science.gov (United States)

    Hacker, Ulrich T; Schildhauer, Ines; Barroso, Margarita Céspedes; Kofler, David M; Gerner, Franz M; Mysliwietz, Josef; Buening, Hildegard; Hallek, Michael; King, Susan B S

    2006-05-01

    The modulated expression of MHC class I on tumour tissue is well documented. Although the effect of MHC class I expression on the tumorigenicity and immunogenicity of MHC class I negative tumour cell lines has been rigorously studied, less is known about the validity of gene transfer and selection in cell lines with a mixed MHC class I phenotype. To address this issue we identified a C26 cell subline that consists of distinct populations of MHC class I (H-2D/K) positive and negative cells. Transient transfection experiments using liposome-based transfer showed a lower transgene expression in MHC class I negative cells. In addition, MHC class I negative cells were more sensitive to antibiotic selection. This led to the generation of fully MHC class I positive cell lines. In contrast to C26 cells, all transfectants were rejected in vivo and induced protection against the parental tumour cells in rechallenge experiments. Tumour cell specificity of the immune response was demonstrated in in vitro cytokine secretion and cytotoxicity assays. Transfectants expressing CD40 ligand and hygromycin phosphotransferase were not more immunogenic than cells expressing hygromycin resistance alone. We suggest that the MHC class I positive phenotype of the C26 transfectants had a bearing on their immunogenicity, because selected MHC class I positive cells were more immunogenic than parental C26 cells and could induce specific anti-tumour immune responses. These data demonstrate that the generation of tumour cell transfectants can lead to the selection of subpopulations that show an altered phenotype compared to the parental cell line and display altered immunogenicity independent of selection marker genes or other immune modulatory genes. Our results show the importance of monitoring gene transfer in the whole tumour cell population, especially for the evaluation of in vivo therapies targeted to heterogeneous tumour cell populations.

  8. On theoretical models of gene expression evolution with random genetic drift and natural selection.

    Directory of Open Access Journals (Sweden)

    Osamu Ogasawara

    2009-11-01

    Full Text Available The relative contributions of natural selection and random genetic drift are a major source of debate in the study of gene expression evolution, which is hypothesized to serve as a bridge from molecular to phenotypic evolution. It has been suggested that the conflict between views is caused by the lack of a definite model of the neutral hypothesis, which can describe the long-run behavior of evolutionary change in mRNA abundance. Therefore previous studies have used inadequate analogies with the neutral prediction of other phenomena, such as amino acid or nucleotide sequence evolution, as the null hypothesis of their statistical inference.In this study, we introduced two novel theoretical models, one based on neutral drift and the other assuming natural selection, by focusing on a common property of the distribution of mRNA abundance among a variety of eukaryotic cells, which reflects the result of long-term evolution. Our results demonstrated that (1 our models can reproduce two independently found phenomena simultaneously: the time development of gene expression divergence and Zipf's law of the transcriptome; (2 cytological constraints can be explicitly formulated to describe long-term evolution; (3 the model assuming that natural selection optimized relative mRNA abundance was more consistent with previously published observations than the model of optimized absolute mRNA abundances.The models introduced in this study give a formulation of evolutionary change in the mRNA abundance of each gene as a stochastic process, on the basis of previously published observations. This model provides a foundation for interpreting observed data in studies of gene expression evolution, including identifying an adequate time scale for discriminating the effect of natural selection from that of random genetic drift of selectively neutral variations.

  9. Alternative microbial methods: An overview and selection criteria.

    Science.gov (United States)

    Jasson, Vicky; Jacxsens, Liesbeth; Luning, Pieternel; Rajkovic, Andreja; Uyttendaele, Mieke

    2010-09-01

    This study provides an overview and criteria for the selection of a method, other than the reference method, for microbial analysis of foods. In a first part an overview of the general characteristics of rapid methods available, both for enumeration and detection, is given with reference to relevant bibliography. Perspectives on future development and the potential of the rapid method for routine application in food diagnostics are discussed. As various alternative "rapid" methods in different formats are available on the market, it can be very difficult for a food business operator or for a control authority to select the most appropriate method which fits its purpose. Validation of a method by a third party, according to international accepted protocol based upon ISO 16140, may increase the confidence in the performance of a method. A list of at the moment validated methods for enumeration of both utility indicators (aerobic plate count) and hygiene indicators (Enterobacteriaceae, Escherichia coli, coagulase positive Staphylococcus) as well as for detection of the four major pathogens (Salmonella spp., Listeria monocytogenes, E. coli O157 and Campylobacter spp.) is included with reference to relevant websites to check for updates. In a second part of this study, selection criteria are introduced to underpin the choice of the appropriate method(s) for a defined application. The selection criteria link the definition of the context in which the user of the method functions - and thus the prospective use of the microbial test results - with the technical information on the method and its operational requirements and sustainability. The selection criteria can help the end user of the method to obtain a systematic insight into all relevant factors to be taken into account for selection of a method for microbial analysis. Copyright 2010 Elsevier Ltd. All rights reserved.

  10. Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm

    Science.gov (United States)

    Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder

    2018-01-01

    Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525

  11. Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm.

    Science.gov (United States)

    Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder; Murphy, Denis J

    2018-01-01

    Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops.

  12. Differential gene expression and clonal selection during cellular transformation induced by adhesion deprivation

    Directory of Open Access Journals (Sweden)

    Kumar Mahesh J

    2010-12-01

    Full Text Available Abstract Background Anchorage independent growth is an important hallmark of oncogenic transformation. Previous studies have shown that when adhesion dependent fibroblasts were prevented from adhering to a substrate they underwent anoikis. In the present study we have demonstrated how anoikis resistant cells gain the transformation related properties with sequential selection of genes. We have proposed this process as a model system for selection of transformed cells from normal cells. Results This report demonstrates that some fibroblasts can survive during late stages of anoikis, at which time they exhibit transformation-associated properties such as in vitro colony formation in soft agar and in vivo subcutaneous tumour formation in nude mice. Cytogenetic characterisation of these cells revealed that they contained a t (2; 2 derivative chromosome and they have a selective survival advantage in non adherent conditions. Gene expression profile indicated that these cells over expressed genes related to hypoxia, glycolysis and tumor suppression/metastasis which could be helpful in their retaining a transformed phenotype. Conclusion Our results reveal some new links between anoikis and cell transformation and they provide a reproducible model system which can potentially be useful to study multistage cancer and to identify new targets for drug development.

  13. Gene duplication, loss and selection in the evolution of saxitoxin biosynthesis in alveolates.

    Science.gov (United States)

    Murray, Shauna A; Diwan, Rutuja; Orr, Russell J S; Kohli, Gurjeet S; John, Uwe

    2015-11-01

    A group of marine dinoflagellates (Alveolata, Eukaryota), consisting of ∼10 species of the genus Alexandrium, Gymnodinium catenatum and Pyrodinium bahamense, produce the toxin saxitoxin and its analogues (STX), which can accumulate in shellfish, leading to ecosystem and human health impacts. The genes, sxt, putatively involved in STX biosynthesis, have recently been identified, however, the evolution of these genes within dinoflagellates is not clear. There are two reasons for this: uncertainty over the phylogeny of dinoflagellates; and that the sxt genes of many species of Alexandrium and other dinoflagellate genera are not known. Here, we determined the phylogeny of STX-producing and other dinoflagellates based on a concatenated eight-gene alignment. We determined the presence, diversity and phylogeny of sxtA, domains A1 and A4 and sxtG in 52 strains of Alexandrium, and a further 43 species of dinoflagellates and thirteen other alveolates. We confirmed the presence and high sequence conservation of sxtA, domain A4, in 40 strains (35 Alexandrium, 1 Pyrodinium, 4 Gymnodinium) of 8 species of STX-producing dinoflagellates, and absence from non-producing species. We found three paralogs of sxtA, domain A1, and a widespread distribution of sxtA1 in non-STX producing dinoflagellates, indicating duplication events in the evolution of this gene. One paralog, clade 2, of sxtA1 may be particularly related to STX biosynthesis. Similarly, sxtG appears to be generally restricted to STX-producing species, while three amidinotransferase gene paralogs were found in dinoflagellates. We investigated the role of positive (diversifying) selection following duplication in sxtA1 and sxtG, and found negative selection in clades of sxtG and sxtA1, clade 2, suggesting they were functionally constrained. Significant episodic diversifying selection was found in some strains in clade 3 of sxtA1, a clade that may not be involved in STX biosynthesis, indicating pressure for diversification

  14. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both

  15. Candidate gene identification of ovulation-inducing genes by RNA sequencing with an in vivo assay in zebrafish.

    Directory of Open Access Journals (Sweden)

    Wanlada Klangnurak

    Full Text Available We previously reported the microarray-based selection of three ovulation-related genes in zebrafish. We used a different selection method in this study, RNA sequencing analysis. An additional eight up-regulated candidates were found as specifically up-regulated genes in ovulation-induced samples. Changes in gene expression were confirmed by qPCR analysis. Furthermore, up-regulation prior to ovulation during natural spawning was verified in samples from natural pairing. Gene knock-out zebrafish strains of one of the candidates, the starmaker gene (stm, were established by CRISPR genome editing techniques. Unexpectedly, homozygous mutants were fertile and could spawn eggs. However, a high percentage of unfertilized eggs and abnormal embryos were produced from these homozygous females. The results suggest that the stm gene is necessary for fertilization. In this study, we selected additional ovulation-inducing candidate genes, and a novel function of the stm gene was investigated.

  16. Inventory of LCIA selection methods for assessing toxic releases. Methods and typology report part B

    DEFF Research Database (Denmark)

    Larsen, Henrik Fred; Birkved, Morten; Hauschild, Michael Zwicky

    method(s) in Work package 8 (WP8) of the OMNIITOX project. The selection methods and the other CRS methods are described in detail, a set of evaluation criteria are developed and the methods are evaluated against these criteria. This report (Deliverable 11B (D11B)) gives the results from task 7.1d, 7.1e......This report describes an inventory of Life Cycle Impact Assessment (LCIA) selection methods for assessing toxic releases. It consists of an inventory of current selection methods and other Chemical Ranking and Scoring (CRS) methods assessed to be relevant for the development of (a) new selection...... and 7.1f of WP 7 for selection methods. The other part of D11 (D11A) is reported in another report and deals with characterisation methods. A selection method is a method for prioritising chemical emissions to be included in an LCIA characterisation of toxic releases, i.e. calculating indicator scores...

  17. Positive selection within the Schizophrenia-associated GABA(A receptor beta(2 gene.

    Directory of Open Access Journals (Sweden)

    Wing-Sze Lo

    Full Text Available The gamma-aminobutyric acid type-A (GABA(A receptor plays a major role in inhibitory neurotransmissions. Intronic SNPs and haplotypes in GABRB2, the gene for GABA(A receptor beta(2 subunit, are associated with schizophrenia and correlated with the expression of two alternatively spliced beta(2 isoforms. In the present study, using chimpanzee as an ancestral reference, high frequencies were observed for the derived (D alleles of the four SNPs rs6556547, rs187269, rs1816071 and rs1816072 in GABRB2, suggesting the occurrence of positive selection for these derived alleles. Coalescence-based simulation showed that the population frequency spectra and the frequencies of H56, the haplotype having all four D alleles, significantly deviated from neutral-evolution expectation in various demographic models. Haplotypes containing the derived allele of rs1816072 displayed significantly less diversity compared to haplotypes containing its ancestral allele, further supporting positive selection. The variations in DD-genotype frequencies in five human populations provided a snapshot of the evolutionary history, which suggested that the positive selections of the D alleles are recent and likely ongoing. The divergence between the DD-genotype profiles of schizophrenic and control samples pointed to the schizophrenia-relevance of positive selections, with the schizophrenic samples showing weakened selections compared to the controls. These DD-genotypes were previously found to increase the expression of beta(2, especially its long isoform. Electrophysiological analysis showed that this long beta(2 isoform favored by the positive selections is more sensitive than the short isoform to the inhibition of GABA(A receptor function by energy depletion. These findings represent the first demonstration of positive selection in a schizophrenia-associated gene.

  18. Selection of suitable reference genes for RT-qPCR analyses in cyanobacteria.

    Directory of Open Access Journals (Sweden)

    Filipe Pinto

    Full Text Available Cyanobacteria are a group of photosynthetic prokaryotes that have a diverse morphology, minimal nutritional requirements and metabolic plasticity that has made them attractive organisms to use in biotechnological applications. The use of these organisms as cell factories requires the knowledge of their physiology and metabolism at a systems level. For the quantification of gene transcripts real-time quantitative polymerase chain reaction (RT-qPCR is the standard technique. However, to obtain reliable RT-qPCR results the use and validation of reference genes is mandatory. Towards this goal we have selected and analyzed twelve candidate reference genes from three morphologically distinct cyanobacteria grown under routinely used laboratory conditions. The six genes exhibiting less variation in each organism were evaluated in terms of their expression stability using geNorm, NormFinder and BestKeeper. In addition, the minimum number of reference genes required for normalization was determined. Based on the three algorithms, we provide a list of genes for cyanobacterial RT-qPCR data normalization. To our knowledge, this is the first work on the validation of reference genes for cyanobacteria constituting a valuable starting point for future works.

  19. Inspection methods and their selection

    International Nuclear Information System (INIS)

    Maier, H.J.

    1980-01-01

    First those nondestructive testing methods, which are used in quality assurance, are to be treated, e.g. - ultrasonics - radiography - magnetic particle testing - dye penetrant testing - eddy currents, and their capabilities and limitations are shown. Second the selection of optimal testing methods under the aspect of defect recognition in different materials and components are shown. (orig./RW)

  20. A relative variation-based method to unraveling gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Yali Wang

    Full Text Available Gene regulatory network (GRN reconstruction is essential in understanding the functioning and pathology of a biological system. Extensive models and algorithms have been developed to unravel a GRN. The DREAM project aims to clarify both advantages and disadvantages of these methods from an application viewpoint. An interesting yet surprising observation is that compared with complicated methods like those based on nonlinear differential equations, etc., methods based on a simple statistics, such as the so-called Z-score, usually perform better. A fundamental problem with the Z-score, however, is that direct and indirect regulations can not be easily distinguished. To overcome this drawback, a relative expression level variation (RELV based GRN inference algorithm is suggested in this paper, which consists of three major steps. Firstly, on the basis of wild type and single gene knockout/knockdown experimental data, the magnitude of RELV of a gene is estimated. Secondly, probability for the existence of a direct regulation from a perturbed gene to a measured gene is estimated, which is further utilized to estimate whether a gene can be regulated by other genes. Finally, the normalized RELVs are modified to make genes with an estimated zero in-degree have smaller RELVs in magnitude than the other genes, which is used afterwards in queuing possibilities of the existence of direct regulations among genes and therefore leads to an estimate on the GRN topology. This method can in principle avoid the so-called cascade errors under certain situations. Computational results with the Size 100 sub-challenges of DREAM3 and DREAM4 show that, compared with the Z-score based method, prediction performances can be substantially improved, especially the AUPR specification. Moreover, it can even outperform the best team of both DREAM3 and DREAM4. Furthermore, the high precision of the obtained most reliable predictions shows that the suggested algorithm may be

  1. EasyClone: method for iterative chromosomal integration of multiple genes in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Jensen, Niels Bjerg; Strucko, Tomas; Kildegaard, Kanchana Rueksomtawin

    2014-01-01

    of multiple genes with an option of recycling selection markers. The vectors combine the advantage of efficient uracil excision reaction-based cloning and Cre-LoxP-mediated marker recycling system. The episomal and integrative vector sets were tested by inserting genes encoding cyan, yellow, and red...... fluorescent proteins into separate vectors and analyzing for co-expression of proteins by flow cytometry. Cells expressing genes encoding for the three fluorescent proteins from three integrations exhibited a much higher level of simultaneous expression than cells producing fluorescent proteins encoded...... on episomal plasmids, where correspondingly 95% and 6% of the cells were within a fluorescence interval of Log10 mean ± 15% for all three colors. We demonstrate that selective markers can be simultaneously removed using Cre-mediated recombination and all the integrated heterologous genes remain...

  2. High amino acid diversity and positive selection at a putative coral immunity gene (tachylectin-2

    Directory of Open Access Journals (Sweden)

    Hellberg Michael E

    2010-05-01

    Full Text Available Abstract Background Genes involved in immune functions, including pathogen recognition and the activation of innate defense pathways, are among the most genetically variable known, and the proteins that they encode are often characterized by high rates of amino acid substitutions, a hallmark of positive selection. The high levels of variation characteristic of immunity genes make them useful tools for conservation genetics. To date, highly variable immunity genes have yet to be found in corals, keystone organisms of the world's most diverse marine ecosystem, the coral reef. Here, we examine variation in and selection on a putative innate immunity gene from Oculina, a coral genus previously used as a model for studies of coral disease and bleaching. Results In a survey of 244 Oculina alleles, we find high nonsynonymous variation and a signature of positive selection, consistent with a putative role in immunity. Using computational protein structure prediction, we generate a structural model of the Oculina protein that closely matches the known structure of tachylectin-2 from the Japanese horseshoe crab (Tachypleus tridentatus, a protein with demonstrated function in microbial recognition and agglutination. We also demonstrate that at least three other genera of anthozoan cnidarians (Acropora, Montastrea and Nematostella possess proteins structurally similar to tachylectin-2. Conclusions Taken together, the evidence of high amino acid diversity, positive selection and structural correspondence to the horseshoe crab tachylectin-2 suggests that this protein is 1 part of Oculina's innate immunity repertoire, and 2 evolving adaptively, possibly under selective pressure from coral-associated microorganisms. Tachylectin-2 may serve as a candidate locus to screen coral populations for their capacity to respond adaptively to future environmental change.

  3. A method for selecting cis-acting regulatory sequences that respond to small molecule effectors

    Directory of Open Access Journals (Sweden)

    Allas Ülar

    2010-08-01

    Full Text Available Abstract Background Several cis-acting regulatory sequences functioning at the level of mRNA or nascent peptide and specifically influencing transcription or translation have been described. These regulatory elements often respond to specific chemicals. Results We have developed a method that allows us to select cis-acting regulatory sequences that respond to diverse chemicals. The method is based on the β-lactamase gene containing a random sequence inserted into the beginning of the ORF. Several rounds of selection are used to isolate sequences that suppress β-lactamase expression in response to the compound under study. We have isolated sequences that respond to erythromycin, troleandomycin, chloramphenicol, meta-toluate and homoserine lactone. By introducing synonymous and non-synonymous mutations we have shown that at least in the case of erythromycin the sequences act at the peptide level. We have also tested the cross-activities of the constructs and found that in most cases the sequences respond most strongly to the compound on which they were isolated. Conclusions Several selected peptides showed ligand-specific changes in amino acid frequencies, but no consensus motif could be identified. This is consistent with previous observations on natural cis-acting peptides, showing that it is often impossible to demonstrate a consensus. Applying the currently developed method on a larger scale, by selecting and comparing an extended set of sequences, might allow the sequence rules underlying the activity of cis-acting regulatory peptides to be identified.

  4. Metabolomic analysis of the selection response of Drosophila melanogaster to environmental stress: are there links to gene expression and phenotypic traits?

    Science.gov (United States)

    Malmendal, Anders; Sørensen, Jesper Givskov; Overgaard, Johannes; Holmstrup, Martin; Nielsen, Niels Chr.; Loeschcke, Volker

    2013-05-01

    We investigated the global metabolite response to artificial selection for tolerance to stressful conditions such as cold, heat, starvation, and desiccation, and for longevity in Drosophila melanogaster. Our findings were compared to data from other levels of biological organization, including gene expression, physiological traits, and organismal stress tolerance phenotype. Overall, we found that selection for environmental stress tolerance changes the metabolomic 1H NMR fingerprint largely in a similar manner independent of the trait selected for, indicating that experimental evolution led to a general stress selection response at the metabolomic level. Integrative analyses across data sets showed little similarity when general correlations between selection effects at the level of the metabolome and gene expression were compared. This is likely due to the fact that the changes caused by these selection regimes were rather mild and/or that the dominating determinants for gene expression and metabolite levels were different. However, expression of a number of genes was correlated with the metabolite data. Many of the identified genes were general stress response genes that are down-regulated in response to selection for some of the stresses in this study. Overall, the results illustrate that selection markedly alters the metabolite profile and that the coupling between different levels of biological organization indeed is present though not very strong for stress selection at this level. The results highlight the extreme complexity of environmental stress adaptation and the difficulty of extrapolating and interpreting responses across levels of biological organization.

  5. Use of the alr gene as a food-grade selection marker in lactic acid bacteria

    NARCIS (Netherlands)

    Bron, P.A.; Benchimol, M.G.; Lambert, J.; Palumbo, E.; Deghorain, M.; Delcour, J.; Vos, de W.M.; Kleerebezem, M.; Hols, P.

    2002-01-01

    Both Lactococcus lactis and Lactobacillus plantarum contain a single alr gene, encoding an alanine racemase (EC 5.1.1.1), which catalyzes the interconversion of D-alanine and L-alanine. The alr genes of these lactic acid bacteria were investigated for their application as food-grade selection

  6. Transcriptome profile and unique genetic evolution of positively selected genes in yak lungs.

    Science.gov (United States)

    Lan, DaoLiang; Xiong, XianRong; Ji, WenHui; Li, Jian; Mipam, Tserang-Donko; Ai, Yi; Chai, ZhiXin

    2018-04-01

    The yak (Bos grunniens), which is a unique bovine breed that is distributed mainly in the Qinghai-Tibetan Plateau, is considered a good model for studying plateau adaptability in mammals. The lungs are important functional organs that enable animals to adapt to their external environment. However, the genetic mechanism underlying the adaptability of yak lungs to harsh plateau environments remains unknown. To explore the unique evolutionary process and genetic mechanism of yak adaptation to plateau environments, we performed transcriptome sequencing of yak and cattle (Bos taurus) lungs using RNA-Seq technology and a subsequent comparison analysis to identify the positively selected genes in the yak. After deep sequencing, a normal transcriptome profile of yak lung that containing a total of 16,815 expressed genes was obtained, and the characteristics of yak lungs transcriptome was described by functional analysis. Furthermore, Ka/Ks comparison statistics result showed that 39 strong positively selected genes are identified from yak lungs. Further GO and KEGG analysis was conducted for the functional annotation of these genes. The results of this study provide valuable data for further explorations of the unique evolutionary process of high-altitude hypoxia adaptation in yaks in the Tibetan Plateau and the genetic mechanism at the molecular level.

  7. AST: an automated sequence-sampling method for improving the taxonomic diversity of gene phylogenetic trees.

    Science.gov (United States)

    Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

    2014-01-01

    A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php.

  8. Selection of reference genes for expression analysis in the entomophthoralean fungus Pandora neoaphidis.

    Science.gov (United States)

    Chen, Chun; Xie, Tingna; Ye, Sudan; Jensen, Annette Bruun; Eilenberg, Jørgen

    2016-01-01

    The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host-pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S), 28S rRNA(28S) and elongation factor 1 alpha-like protein (EF1), were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae), and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189), 28S (1.414) and EF1 (3). The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.

  9. Selection of reference genes for expression analysis in the entomophthoralean fungus Pandora neoaphidis

    Directory of Open Access Journals (Sweden)

    Chun Chen

    2016-03-01

    Full Text Available Abstract The selection of suitable reference genes is crucial for accurate quantification of gene expression and can add to our understanding of host–pathogen interactions. To identify suitable reference genes in Pandora neoaphidis, an obligate aphid pathogenic fungus, the expression of three traditional candidate genes including 18S rRNA(18S, 28S rRNA(28S and elongation factor 1 alpha-like protein (EF1, were measured by quantitative polymerase chain reaction at different developmental stages (conidia, conidia with germ tubes, short hyphae and elongated hyphae, and under different nutritional conditions. We calculated the expression stability of candidate reference genes using four algorithms including geNorm, NormFinder, BestKeeper and Delta Ct. The analysis results revealed that the comprehensive ranking of candidate reference genes from the most stable to the least stable was 18S (1.189, 28S (1.414 and EF1 (3. The 18S was, therefore, the most suitable reference gene for real-time RT-PCR analysis of gene expression under all conditions. These results will support further studies on gene expression in P. neoaphidis.

  10. Reference gene selection for real-time quantitative PCR analysis of the mouse uterus in the peri-implantation period.

    Directory of Open Access Journals (Sweden)

    Pengfei Lin

    Full Text Available The study of uterine gene expression patterns is valuable for understanding the biological and molecular mechanisms that occur during embryo implantation. Real-time quantitative RT-PCR (qRT-PCR is an extremely sensitive technique that allows for the precise quantification of mRNA abundance; however, selecting stable reference genes suitable for the normalization of qRT-PCR data is required to avoid the misinterpretation of experimental results and erroneous analyses. This study employs several mouse models, including an early pregnancy, a pseudopregnancy, a delayed implantation and activation, an artificial decidualization and a hormonal treatment model; ten candidate reference genes (PPIA, RPLP0, HPRT1, GAPDH, ACTB, TBP, B2M, 18S, UBC and TUBA that are found in uterine tissues were assessed for their suitability as internal controls for relative qRT-PCR quantification. GeNorm(PLUS, NormFinder, and BestKeeper were used to evaluate these candidate reference genes, and all of these methods identified RPLP0 and GAPDH as the most stable candidates and B2M and 18S as the least stable candidates. However, when the different models were analyzed separately, the reference genes exhibited some variation in their expression levels.

  11. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    Science.gov (United States)

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  12. Predator-induced defences in Daphnia pulex: Selection and evaluation of internal reference genes for gene expression studies with real-time PCR

    Directory of Open Access Journals (Sweden)

    Gilbert Don

    2010-06-01

    Full Text Available Abstract Background The planktonic microcrustacean Daphnia pulex is among the best-studied animals in ecological, toxicological and evolutionary research. One aspect that has sustained interest in the study system is the ability of D. pulex to develop inducible defence structures when exposed to predators, such as the phantom midge larvae Chaoborus. The available draft genome sequence for D. pulex is accelerating research to identify genes that confer plastic phenotypes that are regularly cued by environmental stimuli. Yet for quantifying gene expression levels, no experimentally validated set of internal control genes exists for the accurate normalization of qRT-PCR data. Results In this study, we tested six candidate reference genes for normalizing transcription levels of D. pulex genes; alpha tubulin (aTub, glyceraldehyde-3-phosphate dehydrogenase (GAPDH, TATA box binding protein (Tbp syntaxin 16 (Stx16, X-box binding protein 1 (Xbp1 and CAPON, a protein associated with the neuronal nitric oxide synthase, were selected on the basis of an earlier study and from microarray studies. One additional gene, a matrix metalloproteinase (MMP, was tested to validate its transcriptional response to Chaoborus, which was earlier observed in a microarray study. The transcription profiles of these seven genes were assessed by qRT-PCR from RNA of juvenile D. pulex that showed induced defences in comparison to untreated control animals. We tested the individual suitability of genes for expression normalization using the programs geNorm, NormFinder and BestKeeper. Intriguingly, Xbp1, Tbp, CAPON and Stx16 were selected as ideal reference genes. Analyses on the relative expression level using the software REST showed that both classical housekeeping candidate genes (aTub and GAPDH were significantly downregulated, whereas the MMP gene was shown to be significantly upregulated, as predicted. aTub is a particularly ill suited reference gene because five copies are

  13. Tracking microorganisms and gene in the environment

    International Nuclear Information System (INIS)

    Atlas, R.M.; Sayler, G.S.

    1988-01-01

    Studies have been conducted to determine the sensitivities and limitations of various methods for determining the fate of genetically engineered microorganisms (GEMs) and their genes in the environment. Selective viable plate count procedures can be designed to detect the introduced organisms with high sensitivity; but they are restricted by potential mutations affecting the expression of the selective characteristic in the introduced organism, the occurrence of the particular selective characteristic in the indigenous organisms, and the need to culture the organism. The accuracy of this approach is greatly improved by colony hybridization procedures that use a specific gene probe to detect the introduced genes, but this approach is still only as sensitive as the plating procedure. Direct extraction of DNA from environmental samples, coupled with dot blot hybridization with radiolabeled probe DNA or solution hybridization, gives a high degree of both sensitivity and precision. This approach does not require culturing of the organism; and even if an introduced gene moves into a new organism or if the introduced organism is viable but nonculturable, the gene probe methods will detect the persistence of the introduced genes in the environment. Efficient direct DNA extraction methods have been developed and tested following in vitro experimental additions of GEMs to sediment and water samples

  14. Using MACBETH method for supplier selection in manufacturing environment

    Directory of Open Access Journals (Sweden)

    Prasad Karande

    2013-04-01

    Full Text Available Supplier selection is always found to be a complex decision-making problem in manufacturing environment. The presence of several independent and conflicting evaluation criteria, either qualitative or quantitative, makes the supplier selection problem a candidate to be solved by multi-criteria decision-making (MCDM methods. Even several MCDM methods have already been proposed for solving the supplier selection problems, the need for an efficient method that can deal with qualitative judgments related to supplier selection still persists. In this paper, the applicability and usefulness of measuring attractiveness by a categorical-based evaluation technique (MACBETH is demonstrated to act as a decision support tool while solving two real time supplier selection problems having qualitative performance measures. The ability of MACBETH method to quantify the qualitative performance measures helps to provide a numerical judgment scale for ranking the alternative suppliers and selecting the best one. The results obtained from MACBETH method exactly corroborate with those derived by the past researchers employing different mathematical approaches.

  15. The importance of the selection of appropriate reference genes for gene expression profiling in adrenal medulla or sympathetic ganglia of spontaneously hypertensive rat

    Czech Academy of Sciences Publication Activity Database

    Vavřínová, Anna; Behuliak, Michal; Zicha, Josef

    2016-01-01

    Roč. 65, č. 3 (2016), s. 401-411 ISSN 0862-8408 R&D Projects: GA ČR(CZ) GP14-16225P Institutional support: RVO:67985823 Keywords : adrenal medulla * gene expression profiling * reference gene selection * sympathetic nervous system Subject RIV: FA - Cardiovascular Diseases incl. Cardiotharic Surgery Impact factor: 1.461, year: 2016

  16. Adiponectin gene polymorphism is selectively associated with the concomitant presence of metabolic syndrome and essential hypertension.

    Directory of Open Access Journals (Sweden)

    Hsin-Bang Leu

    Full Text Available OBJECTIVE: Cardiovascular risk increases with the presence of both metabolic syndrome (MetS and hypertension (HTN. Although the adiponectin (ADIPOQ gene has been reported to be involved in MetS, its association with HTN remained undetermined. This study aimed to investigate the association of ADIPOQ gene with the phenotypes of HTN and MetS. METHODS: A total of 962 participants from 302 families from the Taiwan young-onset hypertension genetic study were enrolled. Plasma adiponectin were measured, and association analysis was conducted by using GEE regression-based method. Another study, of 1448 unrelated participants, was conducted to replicate the association between ADIPOQ gene and variable phenotypes of MetS with or without HTN. RESULTS: Among 962 subjects from family samples, the lowest plasma adiponectin value was observed in MetS with HTN component (9.3±0.47 µg/ml compared with hypertensives (13.4±0.74 µg /ml or MetS without HTN (11.9±0.60 µg/ml, P<0.05. The SNP rs1501299 (G276T in ADIPOQ gene was found associated with the presence of HTN in MetS (odds ratio for GG+GT vs. TT = 2.46; 95% CI: 1.14-5.3, p = 0.02, but not rs2241766 (T45G. No association of ADIPOQ gene with HTN alone or MetS without HTN was observed. The significant association of the SNP rs1501299 (G276T with the phenotype of presence of HTN in MetS was confirmed (odds ratio for GG+GT vs. TT = 2.15; 95% CI: 1.1-4.3 in the replication study. CONCLUSIONS: ADIPOQ genetic variants were selectively and specifically associated with the concomitant presence of MetS and HTN, suggesting potential genetic linkage between MetS and HTN.

  17. Selectable antibiotic resistance marker gene-free transgenic rice harbouring the garlic leaf lectin gene exhibits resistance to sap-sucking planthoppers.

    Science.gov (United States)

    Sengupta, Subhadipa; Chakraborti, Dipankar; Mondal, Hossain A; Das, Sampa

    2010-03-01

    Rice, the major food crop of world is severely affected by homopteran sucking pests. We introduced coding sequence of Allium sativum leaf agglutinin, ASAL, in rice cultivar IR64 to develop sustainable resistance against sap-sucking planthoppers as well as eliminated the selectable antibiotic-resistant marker gene hygromycin phosphotransferase (hpt) exploiting cre/lox site-specific recombination system. An expression vector was constructed containing the coding sequence of ASAL, a potent controlling agent against green leafhoppers (GLH, Nephotettix virescens) and brown planthopper (BPH, Nilaparvata lugens). The selectable marker (hpt) gene cassette was cloned within two lox sites of the same vector. Alongside, another vector was developed with chimeric cre recombinase gene cassette. Reciprocal crosses were performed between three single-copy T(0) plants with ASAL- lox-hpt-lox T-DNA and three single-copy T(0) plants with cre-bar T-DNA. Marker gene excisions were detected in T(1) hybrids through hygromycin sensitivity assay. Molecular analysis of T(1) plants exhibited 27.4% recombination efficiency. T(2) progenies of L03C04(1) hybrid parent showed 25% cre negative ASAL-expressing plants. Northern blot, western blot and ELISA showed significant level of ASAL expression in five marker-free T(2) progeny plants. In planta bioassay of GLH and BPH performed on these T(2) progenies exhibited radical reduction in survivability and fecundity compared with the untransformed control plants.

  18. Duplication and independent selection of cell-wall invertase genes GIF1 and OsCIN1 during rice evolution and domestication

    Directory of Open Access Journals (Sweden)

    Ge Song

    2010-04-01

    Full Text Available Abstract Background Various evolutionary models have been proposed to interpret the fate of paralogous duplicates, which provides substrates on which evolution selection could act. In particular, domestication, as a special selection, has played important role in crop cultivation with divergence of many genes controlling important agronomic traits. Recent studies have indicated that a pair of duplicate genes was often sub-functionalized from their ancestral functions held by the parental genes. We previously demonstrated that the rice cell-wall invertase (CWI gene GIF1 that plays an important role in the grain-filling process was most likely subjected to domestication selection in the promoter region. Here, we report that GIF1 and another CWI gene OsCIN1 constitute a pair of duplicate genes with differentiated expression and function through independent selection. Results Through synteny analysis, we show that GIF1 and another cell-wall invertase gene OsCIN1 were paralogues derived from a segmental duplication originated during genome duplication of grasses. Results based on analyses of population genetics and gene phylogenetic tree of 25 cultivars and 25 wild rice sequences demonstrated that OsCIN1 was also artificially selected during rice domestication with a fixed mutation in the coding region, in contrast to GIF1 that was selected in the promoter region. GIF1 and OsCIN1 have evolved into different expression patterns and probable different kinetics parameters of enzymatic activity with the latter displaying less enzymatic activity. Overexpression of GIF1 and OsCIN1 also resulted in different phenotypes, suggesting that OsCIN1 might regulate other unrecognized biological process. Conclusion How gene duplication and divergence contribute to genetic novelty and morphological adaptation has been an interesting issue to geneticists and biologists. Our discovery that the duplicated pair of GIF1 and OsCIN1 has experienced sub

  19. Intrinsic incompatibilities evolving as a by-product of divergent ecological selection: Considering them in empirical studies on divergence with gene flow.

    Science.gov (United States)

    Kulmuni, J; Westram, A M

    2017-06-01

    The possibility of intrinsic barriers to gene flow is often neglected in empirical research on local adaptation and speciation with gene flow, for example when interpreting patterns observed in genome scans. However, we draw attention to the fact that, even with gene flow, divergent ecological selection may generate intrinsic barriers involving both ecologically selected and other interacting loci. Mechanistically, the link between the two types of barriers may be generated by genes that have multiple functions (i.e., pleiotropy), and/or by gene interaction networks. Because most genes function in complex networks, and their evolution is not independent of other genes, changes evolving in response to ecological selection can generate intrinsic barriers as a by-product. A crucial question is to what extent such by-product barriers contribute to divergence and speciation-that is whether they stably reduce gene flow. We discuss under which conditions by-product barriers may increase isolation. However, we also highlight that, depending on the conditions (e.g., the amount of gene flow and the strength of selection acting on the intrinsic vs. the ecological barrier component), the intrinsic incompatibility may actually destabilize barriers to gene flow. In practice, intrinsic barriers generated as a by-product of divergent ecological selection may generate peaks in genome scans that cannot easily be interpreted. We argue that empirical studies on divergence with gene flow should consider the possibility of both ecological and intrinsic barriers. Future progress will likely come from work combining population genomic studies, experiments quantifying fitness and molecular studies on protein function and interactions. © 2017 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  20. Cross-organism learning method to discover new gene functionalities.

    Science.gov (United States)

    Domeniconi, Giacomo; Masseroli, Marco; Moro, Gianluca; Pinoli, Pietro

    2016-04-01

    Knowledge of gene and protein functions is paramount for the understanding of physiological and pathological biological processes, as well as in the development of new drugs and therapies. Analyses for biomedical knowledge discovery greatly benefit from the availability of gene and protein functional feature descriptions expressed through controlled terminologies and ontologies, i.e., of gene and protein biomedical controlled annotations. In the last years, several databases of such annotations have become available; yet, these valuable annotations are incomplete, include errors and only some of them represent highly reliable human curated information. Computational techniques able to reliably predict new gene or protein annotations with an associated likelihood value are thus paramount. Here, we propose a novel cross-organisms learning approach to reliably predict new functionalities for the genes of an organism based on the known controlled annotations of the genes of another, evolutionarily related and better studied, organism. We leverage a new representation of the annotation discovery problem and a random perturbation of the available controlled annotations to allow the application of supervised algorithms to predict with good accuracy unknown gene annotations. Taking advantage of the numerous gene annotations available for a well-studied organism, our cross-organisms learning method creates and trains better prediction models, which can then be applied to predict new gene annotations of a target organism. We tested and compared our method with the equivalent single organism approach on different gene annotation datasets of five evolutionarily related organisms (Homo sapiens, Mus musculus, Bos taurus, Gallus gallus and Dictyostelium discoideum). Results show both the usefulness of the perturbation method of available annotations for better prediction model training and a great improvement of the cross-organism models with respect to the single-organism ones

  1. Tagging of four Rf genes with selective genotyping analysis in rice (Oryza sativa L.

    Directory of Open Access Journals (Sweden)

    Yarahmadi Saeid

    2017-01-01

    Full Text Available Wild abortive type of cytoplasmic male sterility (WA-CMS is commercially used for hybrid rice seed production. The linked markers can be used for selection of plants with desirable traits. Tagging of Rf genes was carried out using recessive and dominant class analysis in a large F2 population from the cross IR58025A×IR42686R. Pollen fertility and seed setting were evaluated at the flowering and maturity stages, respectively. Forty-seven highly sterile and 23 fertile homozygous plants were selected from F2 population for molecular marker assay. Four Rf genes identified in a good restorer line with high-quality derived from a random mating composite population at the International Rice Research Institute (IRRI. The genetic distance from Rf3 locus with flanking markers RM443 and RM315 on chromosome 1 was 3.7 and 21.2 cM, respectively. RM258, RM591, RM271 and RM6737 on the long arm of chromosome 10 were linked with the Rf6 gene with distance of 7.4, 22.6, 6 and 2.9 cM, respectively. Rf6 was flanked by RM6737 and RM591. The Rf4 gene located on chromosome 7 was linked with RM6344 at a genetic distance of 10.6 cM. RM519 and RM7003 were linked with other Rf gene on chromosome 12 at a genetic distance of 8.5 and 20.8 cM, respectively. Closely linked markers identified in this study could be used for marker assisted selection in a hybrid rice breeding program. A new Rf locus on chromosome 12 that designated Rf7 was linked with RM7003 and RM519.

  2. A synbio approach for selection of highly expressed gene variants in Gram-positive bacteria.

    Science.gov (United States)

    Ferro, Roberto; Rennig, Maja; Hernández-Rollán, Cristina; Daley, Daniel O; Nørholm, Morten H H

    2018-03-08

    The market for recombinant proteins is on the rise, and Gram-positive strains are widely exploited for this purpose. Bacillus subtilis is a profitable host for protein production thanks to its ability to secrete large amounts of proteins, and Lactococcus lactis is an attractive production organism with a long history in food fermentation. We have developed a synbio approach for increasing gene expression in two Gram-positive bacteria. First of all, the gene of interest was coupled to an antibiotic resistance gene to create a growth-based selection system. We then randomised the translation initiation region (TIR) preceding the gene of interest and selected clones that produced high protein titres, as judged by their ability to survive on high concentrations of antibiotic. Using this approach, we were able to significantly increase production of two industrially relevant proteins; sialidase in B. subtilis and tyrosine ammonia lyase in L. lactis. Gram-positive bacteria are widely used to produce industrial enzymes. High titres are necessary to make the production economically feasible. The synbio approach presented here is a simple and inexpensive way to increase protein titres, which can be carried out in any laboratory within a few days. It could also be implemented as a tool for applications beyond TIR libraries, such as screening of synthetic, homologous or domain-shuffled genes.

  3. Selection of Reference Genes for Expression Study in Pulp and Seeds of Theobroma grandiflorum (Willd. ex Spreng. Schum.

    Directory of Open Access Journals (Sweden)

    Lucas Ferraz Dos Santos

    Full Text Available Cupuassu (Theobroma grandiflorum [Willd. ex Spreng.] Schum is a species of high economic importance in Brazil with great potential at international level due to the multiple uses of both its seeds and pulp in the industry of sweets and cosmetics. For this reason, the cupuassu breeding program focused on the selection of genotypes with high pulp and seed quality-selection associated with the understanding of the mechanisms involved in fruit formation. Gene expression is one of the most used approaches related to such understanding. In this sense, quantitative real-time PCR (qPCR is a powerful tool, since it rapidly and reliably quantifies gene expression levels across different experimental conditions. The analysis by qPCR and the correct interpretation of data depend on signal normalization using reference genes, i.e. genes presenting a uniform pattern of expression in the analyzed samples. Here, we selected and analyzed the expression of five genes from cupuassu (ACP, ACT, GAPDH, MDH, TUB to be used as candidates for reference genes on pulp and seed of young, maturing and mature cupuassu fruits. The evaluation of the gene expression stability was obtained using the NormFinder, geNorm and BestKeeper programs. In general, our results indicated that the GAPDH and MDH genes constituted the best combination as reference genes to analyze the expression of cupuassu samples. To our knowledge, this is the first report of reference gene definition in cupuassu, and these results will support subsequent analysis related to gene expression studies in cupuassu plants subjected to different biotic or abiotic conditions as well as serve as a tool for diversity analysis based on pulp and seed quality.

  4. Selection of reference genes for quantitative RT-PCR studies in striped dolphin (Stenella coeruleoalba skin biopsies

    Directory of Open Access Journals (Sweden)

    Casini Silvia

    2006-09-01

    Full Text Available Abstract Background Odontocete cetaceans occupy the top position of the marine food-web and are particularly sensitive to the bioaccumulation of lipophilic contaminants. The effects of environmental pollution on these species are highly debated and various ecotoxicological studies have addressed the impact of xenobiotic compounds on marine mammals, raising conservational concerns. Despite its sensitivity, quantitative real-time PCR (qRT-PCR has never been used to quantify gene induction caused by exposure of cetaceans to contaminants. A limitation for the application of qRT-PCR is the need for appropriate reference genes which allow the correct quantification of gene expression. A systematic evaluation of potential reference genes in cetacean skin biopsies is presented, in order to validate future qRT-PCR studies aiming at using the expression of selected genes as non-lethal biomarkers. Results Ten commonly used housekeeping genes (HKGs were partially sequenced in the striped dolphin (Stenella coeruleoalba and, for each gene, PCR primer pairs were specifically designed and tested in qRT-PCR assays. The expression of these potential control genes was examined in 30 striped dolphin skin biopsy samples, obtained from specimens sampled in the north-western Mediterranean Sea. The stability of selected control genes was determined using three different specific VBA applets (geNorm, NormFinder and BestKeeper which produce highly comparable results. Glyceraldehyde-3P-dehydrogenase (GAPDH and tyrosine 3-monooxygenase (YWHAZ always rank as the two most stably expressed HKGs according to the analysis with geNorm and Normfinder, and are defined as optimal control genes by BestKepeer. Ribosomal protein L4 (RPL4 and S18 (RPS18 also exhibit a remarkable stability of their expression levels. On the other hand, transferrin receptor (TFRC, phosphoglycerate kinase 1 (PGK1, hypoxanthine ribosyltransferase (HPRT1 and β-2-microglobin (B2M show variable expression

  5. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO, which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s of infection. It can also aid in the discovery of genes associated with specific function(s for investigation as a novel vaccine or therapeutic targets.http://turing.ersa.edu.au/BacteriaGO.

  6. Comparative analysis of clustering methods for gene expression time course data

    Directory of Open Access Journals (Sweden)

    Ivan G. Costa

    2004-01-01

    Full Text Available This work performs a data driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series. Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by the comparison of the partitions obtained in these experiments with gene annotation, such as protein function and series classification.

  7. Variable selection by lasso-type methods

    Directory of Open Access Journals (Sweden)

    Sohail Chand

    2011-09-01

    Full Text Available Variable selection is an important property of shrinkage methods. The adaptive lasso is an oracle procedure and can do consistent variable selection. In this paper, we provide an explanation that how use of adaptive weights make it possible for the adaptive lasso to satisfy the necessary and almost sufcient condition for consistent variable selection. We suggest a novel algorithm and give an important result that for the adaptive lasso if predictors are normalised after the introduction of adaptive weights, it makes the adaptive lasso performance identical to the lasso.

  8. Relaxin gene family in teleosts: phylogeny, syntenic mapping, selective constraint, andexpression analysis

    Directory of Open Access Journals (Sweden)

    Glen Peter

    2009-12-01

    Full Text Available Abstract Background In recent years, the relaxin family of signaling molecules has been shown to play diverse roles in mammalian physiology, but little is known about its diversity or physiology in teleosts, an infraclass of the bony fishes comprising ~ 50% of all extant vertebrates. In this paper, 32 relaxin family sequences were obtained by searching genomic and cDNA databases from eight teleost species; phylogenetic, molecular evolutionary, and syntenic data analyses were conducted to understand the relationship and differential patterns of evolution of relaxin family genes in teleosts compared with mammals. Additionally, real-time quantitative PCR was used to confirm and assess the tissues of expression of five relaxin family genes in Danio rerio and in situ hybridization used to assess the site-specific expression of the insulin 3-like gene in D. rerio testis. Results Up to six relaxin family genes were identified in each teleost species. Comparative syntenic mapping revealed that fish possess two paralogous copies of human RLN3, which we call rln3a and rln3b, an orthologue of human RLN2, rln, two paralogous copies of human INSL5, insl5a and insl5b, and an orthologue of human INSL3, insl3. Molecular evolutionary analyses indicated that: rln3a, rln3b and rln are under strong evolutionary constraint, that insl3 has been subject to moderate rates of sequence evolution with two amino acids in insl3/INSL3 showing evidence of positively selection, and that insl5b exhibits a higher rate of sequence evolution than its paralogue insl5a suggesting that it may have been neo-functionalized after the teleost whole genome duplication. Quantitative PCR analyses in D. rerio indicated that rln3a and rln3b are expressed in brain, insl3 is highly expressed in gonads, and that there was low expression of both insl5 genes in adult zebrafish. Finally, in situ hybridization of insl3 in D. rerio testes showed highly specific hybridization to interstitial Leydig

  9. The creation and selection of mutations resistant to a gene drive over multiple generations in the malaria mosquito.

    Directory of Open Access Journals (Sweden)

    Andrew M Hammond

    2017-10-01

    Full Text Available Gene drives have enormous potential for the control of insect populations of medical and agricultural relevance. By preferentially biasing their own inheritance, gene drives can rapidly introduce genetic traits even if these confer a negative fitness effect on the population. We have recently developed gene drives based on CRISPR nuclease constructs that are designed to disrupt key genes essential for female fertility in the malaria mosquito. The construct copies itself and the associated genetic disruption from one homologous chromosome to another during gamete formation, a process called homing that ensures the majority of offspring inherit the drive. Such drives have the potential to cause long-lasting, sustainable population suppression, though they are also expected to impose a large selection pressure for resistance in the mosquito. One of these population suppression gene drives showed rapid invasion of a caged population over 4 generations, establishing proof of principle for this technology. In order to assess the potential for the emergence of resistance to the gene drive in this population we allowed it to run for 25 generations and monitored the frequency of the gene drive over time. Following the initial increase of the gene drive we observed a gradual decrease in its frequency that was accompanied by the spread of small, nuclease-induced mutations at the target gene that are resistant to further cleavage and restore its functionality. Such mutations showed rates of increase consistent with positive selection in the face of the gene drive. Our findings represent the first documented example of selection for resistance to a synthetic gene drive and lead to important design recommendations and considerations in order to mitigate for resistance in future gene drive applications.

  10. Vancomycin gene selection in the microbiome of urban Rattus norvegicus from hospital environment

    DEFF Research Database (Denmark)

    Arn Hansen, Thomas; Joshi, Tejal; Larsen, Anders Rhod

    2016-01-01

    Widespread use of antibiotics has resulted in selection pressure on genes that make bacteria non-responsive to antibiotics. These antibiotic-resistant bacteria are currently a major threat to global health. There are various possibilities for the transfer of antibiotic resistance genes. It has be....... norvegicus microbiome, potentially driven by the outflow of antibiotics and antibiotic-resistant bacteria into the wastewater systems. Carriage of vancomycin resistance may suggest that R. norvegicus is acting as a reservoir for possible transmission to the human population....

  11. Selection of internal control genes for quantitative real-time RT-PCR studies during tomato development process

    Directory of Open Access Journals (Sweden)

    Borges-Pérez Andrés

    2008-12-01

    Full Text Available Abstract Background The elucidation of gene expression patterns leads to a better understanding of biological processes. Real-time quantitative RT-PCR has become the standard method for in-depth studies of gene expression. A biologically meaningful reporting of target mRNA quantities requires accurate and reliable normalization in order to identify real gene-specific variation. The purpose of normalization is to control several variables such as different amounts and quality of starting material, variable enzymatic efficiencies of retrotranscription from RNA to cDNA, or differences between tissues or cells in overall transcriptional activity. The validity of a housekeeping gene as endogenous control relies on the stability of its expression level across the sample panel being analysed. In the present report we describe the first systematic evaluation of potential internal controls during tomato development process to identify which are the most reliable for transcript quantification by real-time RT-PCR. Results In this study, we assess the expression stability of 7 traditional and 4 novel housekeeping genes in a set of 27 samples representing different tissues and organs of tomato plants at different developmental stages. First, we designed, tested and optimized amplification primers for real-time RT-PCR. Then, expression data from each candidate gene were evaluated with three complementary approaches based on different statistical procedures. Our analysis suggests that SGN-U314153 (CAC, SGN-U321250 (TIP41, SGN-U346908 ("Expressed" and SGN-U316474 (SAND genes provide superior transcript normalization in tomato development studies. We recommend different combinations of these exceptionally stable housekeeping genes for suited normalization of different developmental series, including the complete tomato development process. Conclusion This work constitutes the first effort for the selection of optimal endogenous controls for quantitative real

  12. Personnel Selection Based on Fuzzy Methods

    Directory of Open Access Journals (Sweden)

    Lourdes Cañós

    2011-03-01

    Full Text Available The decisions of managers regarding the selection of staff strongly determine the success of the company. A correct choice of employees is a source of competitive advantage. We propose a fuzzy method for staff selection, based on competence management and the comparison with the valuation that the company considers the best in each competence (ideal candidate. Our method is based on the Hamming distance and a Matching Level Index. The algorithms, implemented in the software StaffDesigner, allow us to rank the candidates, even when the competences of the ideal candidate have been evaluated only in part. Our approach is applied in a numerical example.

  13. Selection and Validation of Reference Genes for qRT-PCR Expression Analysis of Candidate Genes Involved in Olfactory Communication in the Butterfly Bicyclus anynana

    OpenAIRE

    Arun, Alok; Bauml?, V?ronique; Amelot, Ga?l; Nieberding, Caroline M.

    2015-01-01

    Real-time quantitative reverse transcription PCR (qRT-PCR) is a technique widely used to quantify the transcriptional expression level of candidate genes. qRT-PCR requires the selection of one or several suitable reference genes, whose expression profiles remain stable across conditions, to normalize the qRT-PCR expression profiles of candidate genes. Although several butterfly species (Lepidoptera) have become important models in molecular evolutionary ecology, so far no study aimed at ident...

  14. Reference Gene Screening for Analyzing Gene Expression Across Goat Tissue

    Directory of Open Access Journals (Sweden)

    Yu Zhang

    2013-12-01

    Full Text Available Real-time quantitative PCR (qRT-PCR is one of the important methods for investigating the changes in mRNA expression levels in cells and tissues. Selection of the proper reference genes is very important when calibrating the results of real-time quantitative PCR. Studies on the selection of reference genes in goat tissues are limited, despite the economic importance of their meat and dairy products. We used real-time quantitative PCR to detect the expression levels of eight reference gene candidates (18S, TBP, HMBS, YWHAZ, ACTB, HPRT1, GAPDH and EEF1A2 in ten tissues types sourced from Boer goats. The optimal reference gene combination was selected according to the results determined by geNorm, NormFinder and Bestkeeper software packages. The analyses showed that tissue is an important variability factor in genes expression stability. When all tissues were considered, 18S, TBP and HMBS is the optimal reference combination for calibrating quantitative PCR analysis of gene expression from goat tissues. Dividing data set by tissues, ACTB was the most stable in stomach, small intestine and ovary, 18S in heart and spleen, HMBS in uterus and lung, TBP in liver, HPRT1 in kidney and GAPDH in muscle. Overall, this study provided valuable information about the goat reference genes that can be used in order to perform a proper normalisation when relative quantification by qRT-PCR studies is undertaken.

  15. Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.

    Directory of Open Access Journals (Sweden)

    Neutelings Godfrey

    2010-04-01

    Full Text Available Abstract Background Quantitative real-time PCR (qRT-PCR is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs. Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L. Results Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups. qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59. LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both ge

  16. Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.).

    Science.gov (United States)

    Huis, Rudy; Hawkins, Simon; Neutelings, Godfrey

    2010-04-19

    Quantitative real-time PCR (qRT-PCR) is currently the most accurate method for detecting differential gene expression. Such an approach depends on the identification of uniformly expressed 'housekeeping genes' (HKGs). Extensive transcriptomic data mining and experimental validation in different model plants have shown that the reliability of these endogenous controls can be influenced by the plant species, growth conditions and organs/tissues examined. It is therefore important to identify the best reference genes to use in each biological system before using qRT-PCR to investigate differential gene expression. In this paper we evaluate different candidate HKGs for developmental transcriptomic studies in the economically-important flax fiber- and oil-crop (Linum usitatissimum L). Specific primers were designed in order to quantify the expression levels of 20 different potential housekeeping genes in flax roots, internal- and external-stem tissues, leaves and flowers at different developmental stages. After calculations of PCR efficiencies, 13 HKGs were retained and their expression stabilities evaluated by the computer algorithms geNorm and NormFinder. According to geNorm, 2 Transcriptional Elongation Factors (TEFs) and 1 Ubiquitin gene are necessary for normalizing gene expression when all studied samples are considered. However, only 2 TEFs are required for normalizing expression in stem tissues. In contrast, NormFinder identified glyceraldehyde-3-phosphate dehydrogenase (GADPH) as the most stably expressed gene when all samples were grouped together, as well as when samples were classed into different sub-groups.qRT-PCR was then used to investigate the relative expression levels of two splice variants of the flax LuMYB1 gene (homologue of AtMYB59). LuMYB1-1 and LuMYB1-2 were highly expressed in the internal stem tissues as compared to outer stem tissues and other samples. This result was confirmed with both geNorm-designated- and Norm

  17. Methods for model selection in applied science and engineering.

    Energy Technology Data Exchange (ETDEWEB)

    Field, Richard V., Jr.

    2004-10-01

    Mathematical models are developed and used to study the properties of complex systems and/or modify these systems to satisfy some performance requirements in just about every area of applied science and engineering. A particular reason for developing a model, e.g., performance assessment or design, is referred to as the model use. Our objective is the development of a methodology for selecting a model that is sufficiently accurate for an intended use. Information on the system being modeled is, in general, incomplete, so that there may be two or more models consistent with the available information. The collection of these models is called the class of candidate models. Methods are developed for selecting the optimal member from a class of candidate models for the system. The optimal model depends on the available information, the selected class of candidate models, and the model use. Classical methods for model selection, including the method of maximum likelihood and Bayesian methods, as well as a method employing a decision-theoretic approach, are formulated to select the optimal model for numerous applications. There is no requirement that the candidate models be random. Classical methods for model selection ignore model use and require data to be available. Examples are used to show that these methods can be unreliable when data is limited. The decision-theoretic approach to model selection does not have these limitations, and model use is included through an appropriate utility function. This is especially important when modeling high risk systems, where the consequences of using an inappropriate model for the system can be disastrous. The decision-theoretic method for model selection is developed and applied for a series of complex and diverse applications. These include the selection of the: (1) optimal order of the polynomial chaos approximation for non-Gaussian random variables and stationary stochastic processes, (2) optimal pressure load model to be

  18. Selection of reference genes for normalisation of real-time RT-PCR in brain-stem death injury in Ovis aries

    Directory of Open Access Journals (Sweden)

    Fraser John F

    2009-07-01

    Full Text Available Abstract Background Heart and lung transplantation is frequently the only therapeutic option for patients with end stage cardio respiratory disease. Organ donation post brain stem death (BSD is a pre-requisite, yet BSD itself causes such severe damage that many organs offered for donation are unusable, with lung being the organ most affected by BSD. In Australia and New Zealand, less than 50% of lungs offered for donation post BSD are suitable for transplantation, as compared with over 90% of kidneys, resulting in patients dying for lack of suitable lungs. Our group has developed a novel 24 h sheep BSD model to mimic the physiological milieu of the typical human organ donor. Characterisation of the gene expression changes associated with BSD is critical and will assist in determining the aetiology of lung damage post BSD. Real-time PCR is a highly sensitive method involving multiple steps from extraction to processing RNA so the choice of housekeeping genes is important in obtaining reliable results. Little information however, is available on the expression stability of reference genes in the sheep pulmonary artery and lung. We aimed to establish a set of stably expressed reference genes for use as a standard for analysis of gene expression changes in BSD. Results We evaluated the expression stability of 6 candidate normalisation genes (ACTB, GAPDH, HGPRT, PGK1, PPIA and RPLP0 using real time quantitative PCR. There was a wide range of Ct-values within each tissue for pulmonary artery (15–24 and lung (16–25 but the expression pattern for each gene was similar across the two tissues. After geNorm analysis, ACTB and PPIA were shown to be the most stably expressed in the pulmonary artery and ACTB and PGK1 in the lung tissue of BSD sheep. Conclusion Accurate normalisation is critical in obtaining reliable and reproducible results in gene expression studies. This study demonstrates tissue associated variability in the selection of these

  19. ptxD gene in combination with phosphite serves as a highly effective selection system to generate transgenic cotton (Gossypium hirsutum L.).

    Science.gov (United States)

    Pandeya, Devendra; Campbell, LeAnne M; Nunes, Eugenia; Lopez-Arredondo, Damar L; Janga, Madhusudhana R; Herrera-Estrella, Luis; Rathore, Keerti S

    2017-12-01

    This report demonstrates the usefulness of ptxD/phosphite as a selection system that not only provides a highly efficient and simple means to generate transgenic cotton plants, but also helps address many of the concerns related to the use of antibiotic and herbicide resistance genes in the production of transgenic crops. Two of the most popular dominant selectable marker systems for plant transformation are based on either antibiotic or herbicide resistance genes. Due to concerns regarding their safety and in order to stack multiple traits in a single plant, there is a need for alternative selectable marker genes. The ptxD gene, derived from Pseudomonas stutzeri WM88, that confers to cells the ability to convert phosphite (Phi) into orthophosphate (Pi) offers an alternative selectable marker gene as demonstrated for tobacco and maize. Here, we show that the ptxD gene in combination with a protocol based on selection medium containing Phi, as the sole source of phosphorus (P), can serve as an effective and efficient system to select for transformed cells and generate transgenic cotton plants. Fluorescence microscopy examination of the cultures under selection and molecular analyses on the regenerated plants demonstrate the efficacy of the system in recovering cotton transformants following Agrobacterium-mediated transformation. Under the ptxD/Phi selection, an average of 3.43 transgenic events per 100 infected explants were recovered as opposed to only 0.41% recovery when bar/phosphinothricin (PPT) selection was used. The event recovery rates for nptII/kanamycin and hpt/hygromycin systems were 2.88 and 2.47%, respectively. Molecular analysis on regenerated events showed a selection efficiency of ~ 97% under the ptxD/Phi system. Thus, ptxD/Phi has proven to be a very efficient, positive selection system for the generation of transgenic cotton plants with equal or higher transformation efficiencies compared to the commonly used, negative selection systems.

  20. SELECTION OF NON-CONVENTIONAL MACHINING PROCESSES USING THE OCRA METHOD

    Directory of Open Access Journals (Sweden)

    Miloš Madić

    2015-04-01

    Full Text Available Selection of the most suitable nonconventional machining process (NCMP for a given machining application can be viewed as multi-criteria decision making (MCDM problem with many conflicting and diverse criteria. To aid these selection processes, different MCDM methods have been proposed. This paper introduces the use of an almost unexplored MCDM method, i.e. operational competitiveness ratings analysis (OCRA method for solving the NCMP selection problems. Applicability, suitability and computational procedure of OCRA method have been demonstrated while solving three case studies dealing with selection of the most suitable NCMP. In each case study the obtained rankings were compared with those derived by the past researchers using different MCDM methods. The results obtained using the OCRA method have good correlation with those derived by the past researchers which validate the usefulness of this method while solving complex NCMP selection problems.

  1. Genetic diversity and natural selection footprints of the glycine amidinotransferase gene in various human populations.

    Science.gov (United States)

    Khan, Asifullah; Tian, Lei; Zhang, Chao; Yuan, Kai; Xu, Shuhua

    2016-01-05

    The glycine amidinotransferase gene (GATM) plays a vital role in energy metabolism in muscle tissues and is associated with multiple clinically important phenotypes. However, the genetic diversity of the GATM gene remains poorly understood within and between human populations. Here we analyzed the 1,000 Genomes Project data through population genetics approaches and observed significant genetic diversity across the GATM gene among various continental human populations. We observed considerable variations in GATM allele frequencies and haplotype composition among different populations. Substantial genetic differences were observed between East Asian and European populations (FST = 0.56). In addition, the frequency of a distinct major GATM haplotype in these groups was congruent with population-wide diversity at this locus. Furthermore, we identified GATM as the top differentiated gene compared to the other statin drug response-associated genes. Composite multiple analyses identified signatures of positive selection at the GATM locus, which was estimated to have occurred around 850 generations ago in European populations. As GATM catalyzes the key step of creatine biosynthesis involved in energy metabolism, we speculate that the European prehistorical demographic transition from hunter-gatherer to farming cultures was the driving force of selection that fulfilled creatine-based metabolic requirement of the populations.

  2. Selection of Suitable Reference Genes for Quantitative Real-time PCR in Sapium sebiferum

    Directory of Open Access Journals (Sweden)

    Xue Chen

    2017-05-01

    Full Text Available Chinese tallow (Sapium sebiferum L. is a promising landscape and bioenergy plant. Measuring gene expression by quantitative real-time polymerase chain reaction (qRT-PCR can provide valuable information on gene function. Stably expressed reference genes for normalization are a prerequisite for ensuring the accuracy of the target gene expression level among different samples. However, the reference genes in Chinese tallow have not been systematically validated. In this study, 12 candidate reference genes (18S, GAPDH, UBQ, RPS15, SAND, TIP41, 60S, ACT7, PDF2, APT, TBP, and TUB were investigated with qRT-PCR in 18 samples, including those from different tissues, from plants treated with sucrose and cold stresses. The data were calculated with four common algorithms, geNorm, BestKeeper, NormFinder, and the delta cycle threshold (ΔCt. TIP41 and GAPDH were the most stable for the tissue-specific experiment, GAPDH and 60S for cold treatment, and GAPDH and UBQ for sucrose stresses, while the least stable genes were 60S, TIP41, and 18S respectively. The comprehensive results showed APT, GAPDH, and UBQ to be the top-ranked stable genes across all the samples. The stability of 60S was the lowest during all experiments. These selected reference genes were further validated by comparing the expression profiles of the chalcone synthase gene in Chinese tallow in different samples. The results will help to improve the accuracy of gene expression studies in Chinese tallow.

  3. Balancing Selection at the Tomato RCR3 Guardee Gene Family Maintains Variation in Strength of Pathogen Defense

    Science.gov (United States)

    Hörger, Anja C.; Ilyas, Muhammad; Stephan, Wolfgang; Tellier, Aurélien; van der Hoorn, Renier A. L.; Rose, Laura E.

    2012-01-01

    Coevolution between hosts and pathogens is thought to occur between interacting molecules of both species. This results in the maintenance of genetic diversity at pathogen antigens (or so-called effectors) and host resistance genes such as the major histocompatibility complex (MHC) in mammals or resistance (R) genes in plants. In plant–pathogen interactions, the current paradigm posits that a specific defense response is activated upon recognition of pathogen effectors via interaction with their corresponding R proteins. According to the “Guard-Hypothesis,” R proteins (the “guards”) can sense modification of target molecules in the host (the “guardees”) by pathogen effectors and subsequently trigger the defense response. Multiple studies have reported high genetic diversity at R genes maintained by balancing selection. In contrast, little is known about the evolutionary mechanisms shaping the guardee, which may be subject to contrasting evolutionary forces. Here we show that the evolution of the guardee RCR3 is characterized by gene duplication, frequent gene conversion, and balancing selection in the wild tomato species Solanum peruvianum. Investigating the functional characteristics of 54 natural variants through in vitro and in planta assays, we detected differences in recognition of the pathogen effector through interaction with the guardee, as well as substantial variation in the strength of the defense response. This variation is maintained by balancing selection at each copy of the RCR3 gene. Our analyses pinpoint three amino acid polymorphisms with key functional consequences for the coevolution between the guardee (RCR3) and its guard (Cf-2). We conclude that, in addition to coevolution at the “guardee-effector” interface for pathogen recognition, natural selection acts on the “guard-guardee” interface. Guardee evolution may be governed by a counterbalance between improved activation in the presence and prevention of auto

  4. Balancing selection at the tomato RCR3 Guardee gene family maintains variation in strength of pathogen defense.

    Directory of Open Access Journals (Sweden)

    Anja C Hörger

    Full Text Available Coevolution between hosts and pathogens is thought to occur between interacting molecules of both species. This results in the maintenance of genetic diversity at pathogen antigens (or so-called effectors and host resistance genes such as the major histocompatibility complex (MHC in mammals or resistance (R genes in plants. In plant-pathogen interactions, the current paradigm posits that a specific defense response is activated upon recognition of pathogen effectors via interaction with their corresponding R proteins. According to the "Guard-Hypothesis," R proteins (the "guards" can sense modification of target molecules in the host (the "guardees" by pathogen effectors and subsequently trigger the defense response. Multiple studies have reported high genetic diversity at R genes maintained by balancing selection. In contrast, little is known about the evolutionary mechanisms shaping the guardee, which may be subject to contrasting evolutionary forces. Here we show that the evolution of the guardee RCR3 is characterized by gene duplication, frequent gene conversion, and balancing selection in the wild tomato species Solanum peruvianum. Investigating the functional characteristics of 54 natural variants through in vitro and in planta assays, we detected differences in recognition of the pathogen effector through interaction with the guardee, as well as substantial variation in the strength of the defense response. This variation is maintained by balancing selection at each copy of the RCR3 gene. Our analyses pinpoint three amino acid polymorphisms with key functional consequences for the coevolution between the guardee (RCR3 and its guard (Cf-2. We conclude that, in addition to coevolution at the "guardee-effector" interface for pathogen recognition, natural selection acts on the "guard-guardee" interface. Guardee evolution may be governed by a counterbalance between improved activation in the presence and prevention of auto-immune responses in

  5. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures

    Directory of Open Access Journals (Sweden)

    Hayes Michael L

    2012-05-01

    Full Text Available Abstract Background Pentatricopeptide repeat (PPR proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82 and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. Results All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3’ UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. Conclusion PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative

  6. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

    Science.gov (United States)

    Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

    2012-05-14

    Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target

  7. Sibling genes as environment: Sibling dopamine genotypes and adolescent health support frequency dependent selection.

    Science.gov (United States)

    Rauscher, Emily; Conley, Dalton; Siegal, Mark L

    2015-11-01

    While research consistently suggests siblings matter for individual outcomes, it remains unclear why. At the same time, studies of genetic effects on health typically correlate variants of a gene with the average level of behavioral or health measures, ignoring more complicated genetic dynamics. Using National Longitudinal Study of Adolescent Health data, we investigate whether sibling genes moderate individual genetic expression. We compare twin variation in health-related absences and self-rated health by genetic differences at three locations related to dopamine regulation and transport to test sibship-level cross-person gene-gene interactions. Results suggest effects of variation at these genetic locations are moderated by sibling genes. Although the mechanism remains unclear, this evidence is consistent with frequency dependent selection and suggests much genetic research may violate the stable unit treatment value assumption. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Selection of internal control genes for real-time quantitative PCR in ovary and uterus of sows across pregnancy.

    Directory of Open Access Journals (Sweden)

    María Martínez-Giner

    Full Text Available BACKGROUND: Reproductive traits play a key role in pig production in order to reduce costs and increase economic returns. Among others, gene expression analyses represent a useful approach to study genetic mechanisms underlying reproductive traits in pigs. The application of reverse-transcription quantitative PCR requires the selection of appropriate reference genes, whose expression levels should not be affected by the experimental conditions, especially when comparing gene expression across different physiological stages. RESULTS: The gene expression stability of ten potential reference genes was studied by three different methods (geNorm, NormFinder and BestKeeper in ovary and uterus collected at five different physiological time points (heat, and 15, 30, 45 and 60 days of pregnancy. Although final ranking differed, the three algorithms gave very similar results. Thus, the most stable genes across time were TBP and UBC in uterus and TBP and HPRT1 in ovary, while HMBS and ACTB showed the less stable expression in uterus and ovary, respectively. When studied as a systematic effect, the reproductive stage did not significantly affect the expression of the candidate reference genes except at 30d and 60d of pregnancy, when a general drop in expression was observed in ovary. CONCLUSIONS: Based in our results, we propose the use of TBP, UBC and SDHA in uterus and TBP, GNB2L1 and HPRT1 in ovary for normalization of longitudinal expression studies using quantitative PCR in sows.

  9. Selection of industrial robots using the Polygons area method

    Directory of Open Access Journals (Sweden)

    Mortaza Honarmande Azimi

    2014-08-01

    Full Text Available Selection of robots from the several proposed alternatives is a very important and tedious task. Decision makers are not limited to one method and several methods have been proposed for solving this problem. This study presents Polygons Area Method (PAM as a multi attribute decision making method for robot selection problem. In this method, the maximum polygons area obtained from the attributes of an alternative robot on the radar chart is introduced as a decision-making criterion. The results of this method are compared with other typical multiple attribute decision-making methods (SAW, WPM, TOPSIS, and VIKOR by giving two examples. To find similarity in ranking given by different methods, Spearman’s rank correlation coefficients are obtained for different pairs of MADM methods. It was observed that the introduced method is in good agreement with other well-known MADM methods in the robot selection problem.

  10. Equipment Selection by using Fuzzy TOPSIS Method

    Science.gov (United States)

    Yavuz, Mahmut

    2016-10-01

    In this study, Fuzzy TOPSIS method was performed for the selection of open pit truck and the optimal solution of the problem was investigated. Data from Turkish Coal Enterprises was used in the application of the method. This paper explains the Fuzzy TOPSIS approaches with group decision-making application in an open pit coal mine in Turkey. An algorithm of the multi-person multi-criteria decision making with fuzzy set approach was applied an equipment selection problem. It was found that Fuzzy TOPSIS with a group decision making is a method that may help decision-makers in solving different decision-making problems in mining.

  11. Using Variable Precision Rough Set for Selection and Classification of Biological Knowledge Integrated in DNA Gene Expression

    Directory of Open Access Journals (Sweden)

    Calvo-Dmgz D.

    2012-12-01

    Full Text Available DNA microarrays have contributed to the exponential growth of genomic and experimental data in the last decade. This large amount of gene expression data has been used by researchers seeking diagnosis of diseases like cancer using machine learning methods. In turn, explicit biological knowledge about gene functions has also grown tremendously over the last decade. This work integrates explicit biological knowledge, provided as gene sets, into the classication process by means of Variable Precision Rough Set Theory (VPRS. The proposed model is able to highlight which part of the provided biological knowledge has been important for classification. This paper presents a novel model for microarray data classification which is able to incorporate prior biological knowledge in the form of gene sets. Based on this knowledge, we transform the input microarray data into supergenes, and then we apply rough set theory to select the most promising supergenes and to derive a set of easy interpretable classification rules. The proposed model is evaluated over three breast cancer microarrays datasets obtaining successful results compared to classical classification techniques. The experimental results shows that there are not significat differences between our model and classical techniques but it is able to provide a biological-interpretable explanation of how it classifies new samples.

  12. Adaptive Evolution of Gene Expression in Drosophila.

    Science.gov (United States)

    Nourmohammad, Armita; Rambeau, Joachim; Held, Torsten; Kovacova, Viera; Berg, Johannes; Lässig, Michael

    2017-08-08

    Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Adaptive Evolution of Gene Expression in Drosophila

    Directory of Open Access Journals (Sweden)

    Armita Nourmohammad

    2017-08-01

    Full Text Available Gene expression levels are important quantitative traits that link genotypes to molecular functions and fitness. In Drosophila, population-genetic studies have revealed substantial adaptive evolution at the genomic level, but the evolutionary modes of gene expression remain controversial. Here, we present evidence that adaptation dominates the evolution of gene expression levels in flies. We show that 64% of the observed expression divergence across seven Drosophila species are adaptive changes driven by directional selection. Our results are derived from time-resolved data of gene expression divergence across a family of related species, using a probabilistic inference method for gene-specific selection. Adaptive gene expression is stronger in specific functional classes, including regulation, sensory perception, sexual behavior, and morphology. Moreover, we identify a large group of genes with sex-specific adaptation of expression, which predominantly occurs in males. Our analysis opens an avenue to map system-wide selection on molecular quantitative traits independently of their genetic basis.

  14. Evolution of the complementary sex-determination gene of honey bees: balancing selection and trans-species polymorphisms.

    Science.gov (United States)

    Cho, Soochin; Huang, Zachary Y; Green, Daniel R; Smith, Deborah R; Zhang, Jianzhi

    2006-11-01

    The mechanism of sex determination varies substantively among evolutionary lineages. One important mode of genetic sex determination is haplodiploidy, which is used by approximately 20% of all animal species, including >200,000 species of the entire insect order Hymenoptera. In the honey bee Apis mellifera, a hymenopteran model organism, females are heterozygous at the csd (complementary sex determination) locus, whereas males are hemizygous (from unfertilized eggs). Fertilized homozygotes develop into sterile males that are eaten before maturity. Because homozygotes have zero fitness and because common alleles are more likely than rare ones to form homozygotes, csd should be subject to strong overdominant selection and negative frequency-dependent selection. Under these selective forces, together known as balancing selection, csd is expected to exhibit a high degree of intraspecific polymorphism, with long-lived alleles that may be even older than the species. Here we sequence the csd genes as well as randomly selected neutral genomic regions from individuals of three closely related species, A. mellifera, Apis cerana, and Apis dorsata. The polymorphic level is approximately seven times higher in csd than in the neutral regions. Gene genealogies reveal trans-species polymorphisms at csd but not at any neutral regions. Consistent with the prediction of rare-allele advantage, nonsynonymous mutations are found to be positively selected in csd only in early stages after their appearances. Surprisingly, three different hypervariable repetitive regions in csd are present in the three species, suggesting variable mechanisms underlying allelic specificities. Our results provide a definitive demonstration of balancing selection acting at the honey bee csd gene, offer insights into the molecular determinants of csd allelic specificities, and help avoid homozygosity in bee breeding.

  15. Assessing the joint effect of population stratification and sample selection in studies of gene-gene (environment interactions

    Directory of Open Access Journals (Sweden)

    Cheng KF

    2012-01-01

    Full Text Available Abstract Background It is well known that the presence of population stratification (PS may cause the usual test in case-control studies to produce spurious gene-disease associations. However, the impact of the PS and sample selection (SS is less known. In this paper, we provide a systematic study of the joint effect of PS and SS under a more general risk model containing genetic and environmental factors. We provide simulation results to show the magnitude of the bias and its impact on type I error rate of the usual chi-square test under a wide range of PS level and selection bias. Results The biases to the estimation of main and interaction effect are quantified and then their bounds derived. The estimated bounds can be used to compute conservative p-values for the association test. If the conservative p-value is smaller than the significance level, we can safely claim that the association test is significant regardless of the presence of PS or not, or if there is any selection bias. We also identify conditions for the null bias. The bias depends on the allele frequencies, exposure rates, gene-environment odds ratios and disease risks across subpopulations and the sampling of the cases and controls. Conclusion Our results show that the bias cannot be ignored even the case and control data were matched in ethnicity. A real example is given to illustrate application of the conservative p-value. These results are useful to the genetic association studies of main and interaction effects.

  16. An ensemble method to predict target genes and pathways in uveal melanoma

    Directory of Open Access Journals (Sweden)

    Wei Chao

    2018-04-01

    Full Text Available This work proposes to predict target genes and pathways for uveal melanoma (UM based on an ensemble method and pathway analyses. Methods: The ensemble method integrated a correlation method (Pearson correlation coefficient, PCC, a causal inference method (IDA and a regression method (Lasso utilizing the Borda count election method. Subsequently, to validate the performance of PIL method, comparisons between confirmed database and predicted miRNA targets were performed. Ultimately, pathway enrichment analysis was conducted on target genes in top 1000 miRNA-mRNA interactions to identify target pathways for UM patients. Results: Thirty eight of the predicted interactions were matched with the confirmed interactions, indicating that the ensemble method was a suitable and feasible approach to predict miRNA targets. We obtained 50 seed miRNA-mRNA interactions of UM patients and extracted target genes from these interactions, such as ASPG, BSDC1 and C4BP. The 601 target genes in top 1,000 miRNA-mRNA interactions were enriched in 12 target pathways, of which Phototransduction was the most significant one. Conclusion: The target genes and pathways might provide a new way to reveal the molecular mechanism of UM and give hand for target treatments and preventions of this malignant tumor.

  17. Mining gene expression data of multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Pi Guo

    Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.

  18. Niewirusowy transfer genów do komórek skóry – wybrane metody = Non-viral gene transfer into skin cells – selected methods

    Directory of Open Access Journals (Sweden)

    Ewelina Wędrowska

    2016-01-01

    . Therefore, there is an urgent need for alternative, non-viral methods of gene transfer. Aim of the study: To present methods for non-viral gene transfer used in gene therapy of skin diseases. Short description of knowledge state: Gene therapy for skin diseases include the usage of plasmid vectors as a carrier for therapeutic genes and different methods for their delivery into cells such as: electroporation, microinjection, sonication, lipid carriers and cationic polymers. Summary: Non-viral gene transfer methods offer some advantages including lower toxicity, non-infectious properties, ease of production and low costs as compared to viral techniques. Non-viral approaches are the promising tool in gene therapy of skin diseases, in particular in skin cancer ceases.   Key words: plasmids, gene transfer, skin, viruses, gene therapy.

  19. Double suicide genes selectively kill human umbilical vein endothelial cells

    Directory of Open Access Journals (Sweden)

    Liu Lunxu

    2011-02-01

    Full Text Available Abstract Background To construct a recombinant adenovirus containing CDglyTK double suicide genes and evaluate the killing effect of the double suicide genes driven by kinase domain insert containing receptor (KDR promoter on human umbilical vein endothelial cells. Methods Human KDR promoter, Escherichia coli (E. coli cytosine deaminase (CD gene and the herpes simplex virus-thymidine kinase (TK gene were cloned using polymerase chain reaction (PCR. Plasmid pKDR-CDglyTK was constructed with the KDR promoter and CDglyTK genes. A recombinant adenoviral plasmid AdKDR-CDglyTK was then constructed and transfected into 293 packaging cells to grow and harvest adenoviruses. KDR-expressing human umbilical vein endothelial cells (ECV304 and KDR-negative liver cancer cell line (HepG2 were infected with the recombinant adenoviruses at different multiplicity of infection (MOI. The infection rate was measured by green fluorescent protein (GFP expression. The infected cells were cultured in culture media containing different concentrations of prodrugs ganciclovir (GCV and/or 5-fluorocytosine (5-FC. The killing effects were measured using two different methods, i.e. annexin V-FITC staining and terminal transferase-mediated dUTP nick end-labeling (TUNEL staining. Results Recombinant adenoviruses AdKDR-CDglyTK were successfully constructed and they infected ECV304 and HepG2 cells efficiently. The infection rate was dependent on MOI of recombinant adenoviruses. ECV304 cells infected with AdKDR-CDglyTK were highly sensitive to GCV and 5-FC. The cell survival rate was dependent on both the concentration of the prodrugs and the MOI of recombinant adenoviruses. In contrast, there were no killing effects in the HepG2 cells. The combination of two prodrugs was much more effective in killing ECV304 cells than GCV or 5-FC alone. The growth of transgenic ECV304 cells was suppressed in the presence of prodrugs. Conclusion AdKDR-CDglyTK/double prodrog system may be a useful

  20. Optimization methods for activities selection problems

    Science.gov (United States)

    Mahad, Nor Faradilah; Alias, Suriana; Yaakop, Siti Zulaika; Arshad, Norul Amanina Mohd; Mazni, Elis Sofia

    2017-08-01

    Co-curriculum activities must be joined by every student in Malaysia and these activities bring a lot of benefits to the students. By joining these activities, the students can learn about the time management and they can developing many useful skills. This project focuses on the selection of co-curriculum activities in secondary school using the optimization methods which are the Analytic Hierarchy Process (AHP) and Zero-One Goal Programming (ZOGP). A secondary school in Negeri Sembilan, Malaysia was chosen as a case study. A set of questionnaires were distributed randomly to calculate the weighted for each activity based on the 3 chosen criteria which are soft skills, interesting activities and performances. The weighted was calculated by using AHP and the results showed that the most important criteria is soft skills. Then, the ZOGP model will be analyzed by using LINGO Software version 15.0. There are two priorities to be considered. The first priority which is to minimize the budget for the activities is achieved since the total budget can be reduced by RM233.00. Therefore, the total budget to implement the selected activities is RM11,195.00. The second priority which is to select the co-curriculum activities is also achieved. The results showed that 9 out of 15 activities were selected. Thus, it can concluded that AHP and ZOGP approach can be used as the optimization methods for activities selection problem.

  1. Red Queen Processes Drive Positive Selection on Major Histocompatibility Complex (MHC Genes.

    Directory of Open Access Journals (Sweden)

    Maciej Jan Ejsmond

    2015-11-01

    Full Text Available Major Histocompatibility Complex (MHC genes code for proteins involved in the incitation of the adaptive immune response in vertebrates, which is achieved through binding oligopeptides (antigens of pathogenic origin. Across vertebrate species, substitutions of amino acids at sites responsible for the specificity of antigen binding (ABS are positively selected. This is attributed to pathogen-driven balancing selection, which is also thought to maintain the high polymorphism of MHC genes, and to cause the sharing of allelic lineages between species. However, the nature of this selection remains controversial. We used individual-based computer simulations to investigate the roles of two phenomena capable of maintaining MHC polymorphism: heterozygote advantage and host-pathogen arms race (Red Queen process. Our simulations revealed that levels of MHC polymorphism were high and driven mostly by the Red Queen process at a high pathogen mutation rate, but were low and driven mostly by heterozygote advantage when the pathogen mutation rate was low. We found that novel mutations at ABSs are strongly favored by the Red Queen process, but not by heterozygote advantage, regardless of the pathogen mutation rate. However, while the strong advantage of novel alleles increased the allele turnover rate, under a high pathogen mutation rate, allelic lineages persisted for a comparable length of time under Red Queen and under heterozygote advantage. Thus, when pathogens evolve quickly, the Red Queen is capable of explaining both positive selection and long coalescence times, but the tension between the novel allele advantage and persistence of alleles deserves further investigation.

  2. Relationship of Source Selection Methods to Contract Outcomes: an Analysis of Air Force Source Selection

    Science.gov (United States)

    2015-12-01

    some occasions, performance is terminated early; this can occur due to either mutual agreement or a breach of contract by one of the parties (Garrett...Relationship of Source Selection Methods to Contract Outcomes: an Analysis of Air Force Source Selection December 2015 Capt Jacques Lamoureux, USAF...on the contract management process, with special emphasis on the source selection methods of tradeoff and lowest price technically acceptable (LPTA

  3. Selection Method for COTS Systems

    DEFF Research Database (Denmark)

    Hedman, Jonas; Andersson, Bo

    2014-01-01

    feature behind the method is that improved understanding of organizational ‘ends’ or goals should govern the selection of a COTS system. This can also be expressed as a match or fit between ‘ends’ (e.g. improved organizational effectiveness) and ‘means’ (e.g. implementing COTS systems). This way...

  4. Selection of reliable reference genes for gene expression studies in Trichoderma afroharzianum LTR-2 under oxalic acid stress.

    Science.gov (United States)

    Lyu, Yuping; Wu, Xiaoqing; Ren, He; Zhou, Fangyuan; Zhou, Hongzi; Zhang, Xinjian; Yang, Hetong

    2017-10-01

    An appropriate reference gene is required to get reliable results from gene expression analysis by quantitative real-time reverse transcription PCR (qRT-PCR). In order to identify stable and reliable reference genes in Trichoderma afroharzianum under oxalic acid (OA) stress, six commonly used housekeeping genes, i.e., elongation factor 1, ubiquitin, ubiquitin-conjugating enzyme, glyceraldehyde-3-phosphate dehydrogenase, α-tubulin, actin, from the effective biocontrol isolate T. afroharzianum strain LTR-2 were tested for their expression during growth in liquid culture amended with OA. Four in silico programs (comparative ΔCt, NormFinder, geNorm and BestKeeper) were used to evaluate the expression stabilities of six candidate reference genes. The elongation factor 1 gene EF-1 was identified as the most stably expressed reference gene, and was used as the normalizer to quantify the expression level of the oxalate decarboxylase coding gene OXDC in T. afroharzianum strain LTR-2 under OA stress. The result showed that the expression of OXDC was significantly up-regulated as expected. This study provides an effective method to quantify expression changes of target genes in T. afroharzianum under OA stress. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Is preimplantation genetic diagnosis the ideal embryo selection method in aneuploidy screening?

    Directory of Open Access Journals (Sweden)

    Levent Sahin

    2014-10-01

    Full Text Available To select cytogenetically normal embryos, preimplantation genetic diagnosis (PGD aneuploidy screening (AS is used in numerous centers around the world. Chromosomal abnormalities lead to developmental problems, implantation failure, and early abortion of embryos. The usefulness of PGD in identifying single-gene diseases, human leukocyte antigen typing, X-linked diseases, and specific genetic diseases is well-known. In this review, preimplantation embryo genetics, PGD research studies, and the European Society of Human Reproduction and Embryology PGD Consortium studies and reports are examined. In addition, criteria for embryo selection, technical aspects of PGD-AS, and potential noninvasive embryo selection methods are described. Indications for PGD and possible causes of discordant PGD results between the centers are discussed. The limitations of fluorescence in situ hybridization, and the advantages of the array comparative genomic hybridization are included in this review. Although PGD-AS for patients of advanced maternal age has been shown to improve in vitro fertilization outcomes in some studies, to our knowledge, there is not sufficient evidence to use advanced maternal age as the sole indication for PGD-AS. PGD-AS might be harmful and may not increase the success rates of in vitro fertilization. At the same time PGD, is not recommended for recurrent implantation failure and unexplained recurrent pregnancy loss.

  6. Development of a multiple-gene-loading method by combining multi-integration system-equipped mouse artificial chromosome vector and CRISPR-Cas9.

    Directory of Open Access Journals (Sweden)

    Kazuhisa Honma

    Full Text Available Mouse artificial chromosome (MAC vectors have several advantages as gene delivery vectors, such as stable and independent maintenance in host cells without integration, transferability from donor cells to recipient cells via microcell-mediated chromosome transfer (MMCT, and the potential for loading a megabase-sized DNA fragment. Previously, a MAC containing a multi-integrase platform (MI-MAC was developed to facilitate the transfer of multiple genes into desired cells. Although the MI system can theoretically hold five gene-loading vectors (GLVs, there are a limited number of drugs available for the selection of multiple-GLV integration. To overcome this issue, we attempted to knock out and reuse drug resistance genes (DRGs using the CRISPR-Cas9 system. In this study, we developed new methods for multiple-GLV integration. As a proof of concept, we introduced five GLVs in the MI-MAC by these methods, in which each GLV contained a gene encoding a fluorescent or luminescent protein (EGFP, mCherry, BFP, Eluc, and Cluc. Genes of interest (GOI on the MI-MAC were expressed stably and functionally without silencing in the host cells. Furthermore, the MI-MAC carrying five GLVs was transferred to other cells by MMCT, and the resultant recipient cells exhibited all five fluorescence/luminescence signals. Thus, the MI-MAC was successfully used as a multiple-GLV integration vector using the CRISPR-Cas9 system. The MI-MAC employing these methods may resolve bottlenecks in developing multiple-gene humanized models, multiple-gene monitoring models, disease models, reprogramming, and inducible gene expression systems.

  7. An improved selective sampling method

    International Nuclear Information System (INIS)

    Miyahara, Hiroshi; Iida, Nobuyuki; Watanabe, Tamaki

    1986-01-01

    The coincidence methods which are currently used for the accurate activity standardisation of radio-nuclides, require dead time and resolving time corrections which tend to become increasingly uncertain as countrates exceed about 10 K. To reduce the dependence on such corrections, Muller, in 1981, proposed the selective sampling method using a fast multichannel analyser (50 ns ch -1 ) for measuring the countrates. It is, in many ways, more convenient and possibly potentially more reliable to replace the MCA with scalers and a circuit is described employing five scalers; two of them serving to measure the background correction. Results of comparisons using our new method and the coincidence method for measuring the activity of 60 Co sources yielded agree-ment within statistical uncertainties. (author)

  8. Positive Selection on Hemagglutinin and Neuraminidase Genes of H1N1 Influenza Viruses

    LENUS (Irish Health Repository)

    Li, Wenfu

    2011-04-21

    Abstract Background Since its emergence in March 2009, the pandemic 2009 H1N1 influenza A virus has posed a serious threat to public health. To trace the evolutionary path of these new pathogens, we performed a selection-pressure analysis of a large number of hemagglutinin (HA) and neuraminidase (NA) gene sequences of H1N1 influenza viruses from different hosts. Results Phylogenetic analysis revealed that both HA and NA genes have evolved into five distinct clusters, with further analyses indicating that the pandemic 2009 strains have experienced the strongest positive selection. We also found evidence of strong selection acting on the seasonal human H1N1 isolates. However, swine viruses from North America and Eurasia were under weak positive selection, while there was no significant evidence of positive selection acting on the avian isolates. A site-by-site analysis revealed that the positively selected sites were located in both of the cleaved products of HA (HA1 and HA2), as well as NA. In addition, the pandemic 2009 strains were subject to differential selection pressures compared to seasonal human, North American swine and Eurasian swine H1N1 viruses. Conclusions Most of these positively and\\/or differentially selected sites were situated in the B-cell and\\/or T-cell antigenic regions, suggesting that selection at these sites might be responsible for the antigenic variation of the viruses. Moreover, some sites were also associated with glycosylation and receptor-binding ability. Thus, selection at these positions might have helped the pandemic 2009 H1N1 viruses to adapt to the new hosts after they were introduced from pigs to humans. Positive selection on position 274 of NA protein, associated with drug resistance, might account for the prevalence of drug-resistant variants of seasonal human H1N1 influenza viruses, but there was no evidence that positive selection was responsible for the spread of the drug resistance of the pandemic H1N1 strains.

  9. An Identification Key for Selecting Methods for Sustainability Assessments

    Directory of Open Access Journals (Sweden)

    Michiel C. Zijp

    2015-03-01

    Full Text Available Sustainability assessments can play an important role in decision making. This role starts with selecting appropriate methods for a given situation. We observed that scientists, consultants, and decision-makers often do not systematically perform a problem analyses that guides the choice of the method, partly related to a lack of systematic, though sufficiently versatile approaches to do so. Therefore, we developed and propose a new step towards method selection on the basis of question articulation: the Sustainability Assessment Identification Key. The identification key was designed to lead its user through all important choices needed for comprehensive question articulation. Subsequently, methods that fit the resulting specific questions are suggested by the key. The key consists of five domains, of which three determine method selection and two the design or use of the method. Each domain consists of four or more criteria that need specification. For example in the domain “system boundaries”, amongst others, the spatial and temporal scales are specified. The key was tested (retrospectively on a set of thirty case studies. Using the key appeared to contribute to improved: (i transparency in the link between the question and method selection; (ii consistency between questions asked and answers provided; and (iii internal consistency in methodological design. There is latitude to develop the current initial key further, not only for selecting methods pertinent to a problem definition, but also as a principle for associated opportunities such as stakeholder identification.

  10. Evolution of High Cellulolytic Activity in Symbiotic Streptomyces through Selection of Expanded Gene Content and Coordinated Gene Expression

    Science.gov (United States)

    McDonald, Bradon R.; Takasuka, Taichi E.; Wendt-Pienkowski, Evelyn; Doering, Drew T.; Raffa, Kenneth F.; Fox, Brian G.; Currie, Cameron R.

    2016-01-01

    The evolution of cellulose degradation was a defining event in the history of life. Without efficient decomposition and recycling, dead plant biomass would quickly accumulate and become inaccessible to terrestrial food webs and the global carbon cycle. On land, the primary drivers of plant biomass deconstruction are fungi and bacteria in the soil or associated with herbivorous eukaryotes. While the ecological importance of plant-decomposing microbes is well established, little is known about the distribution or evolution of cellulolytic activity in any bacterial genus. Here we show that in Streptomyces, a genus of Actinobacteria abundant in soil and symbiotic niches, the ability to rapidly degrade cellulose is largely restricted to two clades of host-associated strains and is not a conserved characteristic of the Streptomyces genus or host-associated strains. Our comparative genomics identify that while plant biomass degrading genes (CAZy) are widespread in Streptomyces, key enzyme families are enriched in highly cellulolytic strains. Transcriptomic analyses demonstrate that cellulolytic strains express a suite of multi-domain CAZy enzymes that are coregulated by the CebR transcriptional regulator. Using targeted gene deletions, we verify the importance of a highly expressed cellulase (GH6 family cellobiohydrolase) and the CebR transcriptional repressor to the cellulolytic phenotype. Evolutionary analyses identify complex genomic modifications that drive plant biomass deconstruction in Streptomyces, including acquisition and selective retention of CAZy genes and transcriptional regulators. Our results suggest that host-associated niches have selected some symbiotic Streptomyces for increased cellulose degrading activity and that symbiotic bacteria are a rich biochemical and enzymatic resource for biotechnology. PMID:27276034

  11. A Selection Method That Succeeds!

    Science.gov (United States)

    Weitman, Catheryn J.

    Provided a structural selection method is carried out, it is possible to find quality early childhood personnel. The hiring process involves five definite steps, each of which establishes a base for the next. A needs assessment formulating basic minimal qualifications is the first step. The second step involves review of current job descriptions…

  12. Natural selection in a population of Drosophila melanogaster explained by changes in gene expression caused by sequence variation in core promoter regions.

    Science.gov (United States)

    Sato, Mitsuhiko P; Makino, Takashi; Kawata, Masakado

    2016-02-09

    Understanding the evolutionary forces that influence variation in gene regulatory regions in natural populations is an important challenge for evolutionary biology because natural selection for such variations could promote adaptive phenotypic evolution. Recently, whole-genome sequence analyses have identified regulatory regions subject to natural selection. However, these studies could not identify the relationship between sequence variation in the detected regions and change in gene expression levels. We analyzed sequence variations in core promoter regions, which are critical regions for gene regulation in higher eukaryotes, in a natural population of Drosophila melanogaster, and identified core promoter sequence variations associated with differences in gene expression levels subjected to natural selection. Among the core promoter regions whose sequence variation could change transcription factor binding sites and explain differences in expression levels, three core promoter regions were detected as candidates associated with purifying selection or selective sweep and seven as candidates associated with balancing selection, excluding the possibility of linkage between these regions and core promoter regions. CHKov1, which confers resistance to the sigma virus and related insecticides, was identified as core promoter regions that has been subject to selective sweep, although it could not be denied that selection for variation in core promoter regions was due to linked single nucleotide polymorphisms in the regulatory region outside core promoter regions. Nucleotide changes in core promoter regions of CHKov1 caused the loss of two basal transcription factor binding sites and acquisition of one transcription factor binding site, resulting in decreased gene expression levels. Of nine core promoter regions regions associated with balancing selection, brat, and CG9044 are associated with neuromuscular junction development, and Nmda1 are associated with learning

  13. Signatures of selection acting on the innate immunity gene Toll-like receptor 2 (TLR2) during the evolutionary history of rodents.

    Science.gov (United States)

    Tschirren, B; Råberg, L; Westerdahl, H

    2011-06-01

    Patterns of selection acting on immune defence genes have recently been the focus of considerable interest. Yet, when it comes to vertebrates, studies have mainly focused on the acquired branch of the immune system. Consequently, the direction and strength of selection acting on genes of the vertebrate innate immune defence remain poorly understood. Here, we present a molecular analysis of selection on an important receptor of the innate immune system of vertebrates, the Toll-like receptor 2 (TLR2), across 17 rodent species. Although purifying selection was the prevalent evolutionary force acting on most parts of the rodent TLR2, we found that codons in close proximity to pathogen-binding and TLR2-TLR1 heterodimerization sites have been subject to positive selection. This indicates that parasite-mediated selection is not restricted to acquired immune system genes like the major histocompatibility complex, but also affects innate defence genes. To obtain a comprehensive understanding of evolutionary processes in host-parasite systems, both innate and acquired immunity thus need to be considered. © 2011 The Authors. Journal of Evolutionary Biology © 2011 European Society For Evolutionary Biology.

  14. The Impact of Selection, Gene Conversion, and Biased Sampling on the Assessment of Microbial Demography.

    Science.gov (United States)

    Lapierre, Marguerite; Blin, Camille; Lambert, Amaury; Achaz, Guillaume; Rocha, Eduardo P C

    2016-07-01

    Recent studies have linked demographic changes and epidemiological patterns in bacterial populations using coalescent-based approaches. We identified 26 studies using skyline plots and found that 21 inferred overall population expansion. This surprising result led us to analyze the impact of natural selection, recombination (gene conversion), and sampling biases on demographic inference using skyline plots and site frequency spectra (SFS). Forward simulations based on biologically relevant parameters from Escherichia coli populations showed that theoretical arguments on the detrimental impact of recombination and especially natural selection on the reconstructed genealogies cannot be ignored in practice. In fact, both processes systematically lead to spurious interpretations of population expansion in skyline plots (and in SFS for selection). Weak purifying selection, and especially positive selection, had important effects on skyline plots, showing patterns akin to those of population expansions. State-of-the-art techniques to remove recombination further amplified these biases. We simulated three common sampling biases in microbiological research: uniform, clustered, and mixed sampling. Alone, or together with recombination and selection, they further mislead demographic inferences producing almost any possible skyline shape or SFS. Interestingly, sampling sub-populations also affected skyline plots and SFS, because the coalescent rates of populations and their sub-populations had different distributions. This study suggests that extreme caution is needed to infer demographic changes solely based on reconstructed genealogies. We suggest that the development of novel sampling strategies and the joint analyzes of diverse population genetic methods are strictly necessary to estimate demographic changes in populations where selection, recombination, and biased sampling are present. © The Author 2016. Published by Oxford University Press on behalf of the Society for

  15. Supplier selection based on multi-criterial AHP method

    Directory of Open Access Journals (Sweden)

    Jana Pócsová

    2010-03-01

    Full Text Available This paper describes a case-study of supplier selection based on multi-criterial Analytic Hierarchy Process (AHP method.It is demonstrated that using adequate mathematical method can bring us “unprejudiced” conclusion, even if the alternatives (suppliercompanies are very similar in given selection-criteria. The result is the best possible supplier company from the viewpoint of chosen criteriaand the price of the product.

  16. Cloning and selection of reference genes for gene expression ...

    African Journals Online (AJOL)

    Full length mRNA sequences of Ac-β-actin and Ac-gapdh, and partial mRNA sequences of Ac-18SrRNA and Ac-ubiquitin were cloned from pineapple in this study. The four genes were tested as housekeeping genes in three experimental sets. GeNorm and NormFinder analysis revealed that β-actin was the most ...

  17. Methods for producing thin film charge selective transport layers

    Science.gov (United States)

    Hammond, Scott Ryan; Olson, Dana C.; van Hest, Marinus Franciscus Antonius Maria

    2018-01-02

    Methods for producing thin film charge selective transport layers are provided. In one embodiment, a method for forming a thin film charge selective transport layer comprises: providing a precursor solution comprising a metal containing reactive precursor material dissolved into a complexing solvent; depositing the precursor solution onto a surface of a substrate to form a film; and forming a charge selective transport layer on the substrate by annealing the film.

  18. Models of Aire-dependent gene regulation for thymic negative selection

    Directory of Open Access Journals (Sweden)

    Dina eDanso-Abeam

    2011-05-01

    Full Text Available Mutations in the Autoimmune Regulator (AIRE gene lead to Autoimmune Polyendocrinopathy Syndrome type 1 (APS1, characterized by the development of multi-organ autoimmune damage. The mechanism by which defects in AIRE result in autoimmunity has been the subject of intense scrutiny. At the cellular level, the working model explains most of the clinical and immunological characteristics of APS1, with AIRE driving the expression of tissue restricted antigens (TRAs in the epithelial cells of the thymic medulla. This TRA expression results in effective negative selection of TRA-reactive thymocytes, preventing autoimmune disease. At the molecular level, the mechanism by which AIRE initiates TRA expression in the thymic medulla remains unclear. Multiple different models for the molecular mechanism have been proposed, ranging from classical transcriptional activity, to random induction of gene expression, to epigenetic tag recognition effect, to altered cell biology. In this review, we evaluate each of these models and discuss their relative strengths and weaknesses.

  19. Reference satellite selection method for GNSS high-precision relative positioning

    Directory of Open Access Journals (Sweden)

    Xiao Gao

    2017-03-01

    Full Text Available Selecting the optimal reference satellite is an important component of high-precision relative positioning because the reference satellite directly influences the strength of the normal equation. The reference satellite selection methods based on elevation and positional dilution of precision (PDOP value were compared. Results show that all the above methods cannot select the optimal reference satellite. We introduce condition number of the design matrix in the reference satellite selection method to improve structure of the normal equation, because condition number can indicate the ill condition of the normal equation. The experimental results show that the new method can improve positioning accuracy and reliability in precise relative positioning.

  20. Systematic hybrid LOH: a new method to reduce false positives and negatives during screening of yeast gene deletion libraries

    DEFF Research Database (Denmark)

    Alvaro, D.; Sunjevaric, I.; Reid, R. J.

    2006-01-01

    We have developed a new method, systematic hybrid loss of heterozygosity, to facilitate genomic screens utilizing the yeast gene deletion library. Screening is performed using hybrid diploid strains produced through mating the library haploids with strains from a different genetic background......, to minimize the contribution of unpredicted recessive genetic factors present in the individual library strains. We utilize a set of strains where each contains a conditional centromere construct on one of the 16 yeast chromosomes that allows the destabilization and selectable loss of that chromosome. After...... complementation of any spurious recessive mutations in the library strain, facilitating attribution of the observed phenotype to the documented gene deletion and dramatically reducing false positive results commonly obtained in library screens. The systematic hybrid LOH method can be applied to virtually any...

  1. Molecular detection of genes encoding AcrAB , Qep A efflux pumps in Klebsiella pneumoniae strains isolated from hospitalized patients in selected hospitals in Tehran

    Directory of Open Access Journals (Sweden)

    Mohsen Heidary

    2017-03-01

    Full Text Available Abstract Background and Objectives: Increasing emergence of fluoroquinolone resistance among clinical isolates of Klebsiella pneumoniae  (K. pneumoniae, has limited the treatment options for treatment of infections caused by these bacteria. The aim of this study was to investigate the dissemination of genes encoding AcrAB and QepA efflux pumps among K. pneumoniae strains. Methods: This study was carried out on 117 K. pneumoniae strains isolated from patients hospitalized in selected hospitals in Tehran city, 2015-2016, Iran. Antimicrobial susceptibility tests were performed using disk diffusion method (based on CLSI guidelines and identification of acr A, acr B and qep A genes using PCR assay. Results: In this study, colistin and tigecycline had the best effect against clinical isolates of K. pneumoniae. According to PCR results, 110 (94% isolates had acrA gene and 102 (87% isolates had acrB gene, respectively. The qepA gene was not found in any of the K. pneumoniae strains. Conclusion: According to the results of the present study, dissemination of the genes encoding AcrAB efflux pumps among K. pneumoniae strains, which cause resistance to fluoroquinolones, is a matter of concern. Therefore, infection control and prevention of the spread of drug-resistant bacteria requires careful management in drug prescription and identification of resistant isolates.

  2. Comparing Patterns of Natural Selection across Species Using Selective Signatures

    Energy Technology Data Exchange (ETDEWEB)

    Shapiro, Jesse; Alm, Eric J.

    2007-12-01

    Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 c-proteobacterial species. We describe the pattern of fast or slow evolution across species as the"selective signature" of a gene. Selective signatures represent aprofile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example,glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell.

  3. Comparing Patterns of Natural Selection Across Species Using Selective Signatures

    Energy Technology Data Exchange (ETDEWEB)

    Alm, Eric J.; Shapiro, B. Jesse; Alm, Eric J.

    2007-12-18

    Comparing gene expression profiles over many different conditions has led to insights that were not obvious from single experiments. In the same way, comparing patterns of natural selection across a set of ecologically distinct species may extend what can be learned from individual genome-wide surveys. Toward this end, we show how variation in protein evolutionary rates, after correcting for genome-wide effects such as mutation rate and demographic factors, can be used to estimate the level and types of natural selection acting on genes across different species. We identify unusually rapidly and slowly evolving genes, relative to empirically derived genome-wide and gene family-specific background rates for 744 core protein families in 30 gamma-proteobacterial species. We describe the pattern of fast or slow evolution across species as the 'selective signature' of a gene. Selective signatures represent a profile of selection across species that is predictive of gene function: pairs of genes with correlated selective signatures are more likely to share the same cellular function, and genes in the same pathway can evolve in concert. For example, glycolysis and phenylalanine metabolism genes evolve rapidly in Idiomarina loihiensis, mirroring an ecological shift in carbon source from sugars to amino acids. In a broader context, our results suggest that the genomic landscape is organized into functional modules even at the level of natural selection, and thus it may be easier than expected to understand the complex evolutionary pressures on a cell.

  4. Comparative analysis of methods for gene transcription profiling data derived from different microarray technologies in rat and mouse models of diabetes

    Directory of Open Access Journals (Sweden)

    Bihoreau Marie-Thérèse

    2009-02-01

    Full Text Available Abstract Background Microarray technologies are widely used to quantify the abundance of transcripts corresponding to thousands of genes. To maximise the robustness of transcriptome results, we have tested the performance and reproducibility of rat and mouse gene expression data obtained with Affymetrix, Illumina and Operon platforms. Results We present a thorough analysis of the degree of reproducibility provided by analysing the transcriptomic profile of the same animals of several experimental groups under different popular microarray technologies in different tissues. Concordant results from inter- and intra-platform comparisons were maximised by testing many popular computational methods for generating fold changes and significances and by only considering oligonucleotides giving high expression levels. The choice of Affymetrix signal extraction technique was shown to have the greatest effect on the concordance across platforms. In both species, when choosing optimal methods, the agreement between data generated on the Affymetrix and Illumina was excellent; this was verified using qRT-PCR on a selection of genes present on all platforms. Conclusion This study provides an extensive assessment of analytical methods best suited for processing data from different microarray technologies and can assist integration of technologically different gene expression datasets in biological systems.

  5. A novel method to discover fluoroquinolone antibiotic resistance (qnr genes in fragmented nucleotide sequences

    Directory of Open Access Journals (Sweden)

    Boulund Fredrik

    2012-12-01

    Full Text Available Abstract Background Broad-spectrum fluoroquinolone antibiotics are central in modern health care and are used to treat and prevent a wide range of bacterial infections. The recently discovered qnr genes provide a mechanism of resistance with the potential to rapidly spread between bacteria using horizontal gene transfer. As for many antibiotic resistance genes present in pathogens today, qnr genes are hypothesized to originate from environmental bacteria. The vast amount of data generated by shotgun metagenomics can therefore be used to explore the diversity of qnr genes in more detail. Results In this paper we describe a new method to identify qnr genes in nucleotide sequence data. We show, using cross-validation, that the method has a high statistical power of correctly classifying sequences from novel classes of qnr genes, even for fragments as short as 100 nucleotides. Based on sequences from public repositories, the method was able to identify all previously reported plasmid-mediated qnr genes. In addition, several fragments from novel putative qnr genes were identified in metagenomes. The method was also able to annotate 39 chromosomal variants of which 11 have previously not been reported in literature. Conclusions The method described in this paper significantly improves the sensitivity and specificity of identification and annotation of qnr genes in nucleotide sequence data. The predicted novel putative qnr genes in the metagenomic data support the hypothesis of a large and uncharacterized diversity within this family of resistance genes in environmental bacterial communities. An implementation of the method is freely available at http://bioinformatics.math.chalmers.se/qnr/.

  6. A hybrid network-based method for the detection of disease-related genes

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Dai, Yang; Stanley, H. Eugene

    2018-02-01

    Detecting disease-related genes is crucial in disease diagnosis and drug design. The accepted view is that neighbors of a disease-causing gene in a molecular network tend to cause the same or similar diseases, and network-based methods have been recently developed to identify novel hereditary disease-genes in available biomedical networks. Despite the steady increase in the discovery of disease-associated genes, there is still a large fraction of disease genes that remains under the tip of the iceberg. In this paper we exploit the topological properties of the protein-protein interaction (PPI) network to detect disease-related genes. We compute, analyze, and compare the topological properties of disease genes with non-disease genes in PPI networks. We also design an improved random forest classifier based on these network topological features, and a cross-validation test confirms that our method performs better than previous similar studies.

  7. Selection of reference genes for quantitative real-time PCR in bovine preimplantation embryos

    Directory of Open Access Journals (Sweden)

    Van Zeveren Alex

    2005-12-01

    Full Text Available Abstract Background Real-time quantitative PCR is a sensitive and very efficient technique to examine gene transcription patterns in preimplantation embryos, in order to gain information about embryo development and to optimize assisted reproductive technologies. Critical to the succesful application of real-time PCR is careful assay design, reaction optimization and validation to maximize sensitivity and accuracy. In most of the studies published GAPD, ACTB or 18S rRNA have been used as a single reference gene without prior verification of their expression stability. Normalization of the data using unstable controls can result in erroneous conclusions, especially when only one reference gene is used. Results In this study the transcription levels of 8 commonly used reference genes (ACTB, GAPD, Histone H2A, TBP, HPRT1, SDHA, YWHAZ and 18S rRNA were determined at different preimplantation stages (2-cell, 8-cell, blastocyst and hatched blastocyst in order to select the most stable genes to normalize quantitative data within different preimplantation embryo stages. Conclusion Using the geNorm application YWHAZ, GAPD and SDHA were found to be the most stable genes across the examined embryonic stages, while the commonly used ACTB was shown to be highly regulated. We recommend the use of the geometric mean of those 3 reference genes as an accurate normalization factor, which allows small expression differences to be reliably measured.

  8. Signatures of positive selection in Toll-like receptor (TLR genes in mammals

    Directory of Open Access Journals (Sweden)

    Areal Helena

    2011-12-01

    Full Text Available Abstract Background Toll-like receptors (TLRs are a major class of pattern recognition receptors (PRRs expressed in the cell surface or membrane compartments of immune and non-immune cells. TLRs are encoded by a multigene family and represent the first line of defense against pathogens by detecting foreigner microbial molecular motifs, the pathogen-associated molecular patterns (PAMPs. TLRs are also important by triggering the adaptive immunity in vertebrates. They are characterized by the presence of leucine-rich repeats (LRRs in the ectodomain, which are associated with the PAMPs recognition. The direct recognition of different pathogens by TLRs might result in different evolutionary adaptations important to understand the dynamics of the host-pathogen interplay. Ten mammal TLR genes, viral (TLR3, 7, 8, 9 and non-viral (TLR1-6, 10, were selected to identify signatures of positive selection that might have been imposed by interacting pathogens and to clarify if viral and non-viral TLRs might display different patterns of molecular evolution. Results By using Maximum Likelihood approaches, evidence of positive selection was found in all the TLRs studied. The number of positively selected codons (PSC ranged between 2-26 codons (0.25%-2.65% with the non-viral TLR4 as the receptor with higher percentage of positively selected codons (2.65%, followed by the viral TLR8 (2.50%. The results indicated that viral and non-viral TLRs are similarly under positive selection. Almost all TLRs have at least one PSC located in the LRR ectodomain which underlies the importance of the pathogen recognition by this region. Conclusions Our results are not in line with previous studies on primates and birds that identified more codons under positive selection in non-viral TLRs. This might be explained by the fact that both primates and birds are homogeneous groups probably being affected by only a restricted number of related viruses with equivalent motifs to be

  9. A new computational method for the detection of horizontal gene transfer events.

    Science.gov (United States)

    Tsirigos, Aristotelis; Rigoutsos, Isidore

    2005-01-01

    In recent years, the increase in the amounts of available genomic data has made it easier to appreciate the extent by which organisms increase their genetic diversity through horizontally transferred genetic material. Such transfers have the potential to give rise to extremely dynamic genomes where a significant proportion of their coding DNA has been contributed by external sources. Because of the impact of these horizontal transfers on the ecological and pathogenic character of the recipient organisms, methods are continuously sought that are able to computationally determine which of the genes of a given genome are products of transfer events. In this paper, we introduce and discuss a novel computational method for identifying horizontal transfers that relies on a gene's nucleotide composition and obviates the need for knowledge of codon boundaries. In addition to being applicable to individual genes, the method can be easily extended to the case of clusters of horizontally transferred genes. With the help of an extensive and carefully designed set of experiments on 123 archaeal and bacterial genomes, we demonstrate that the new method exhibits significant improvement in sensitivity when compared to previously published approaches. In fact, it achieves an average relative improvement across genomes of between 11 and 41% compared to the Codon Adaptation Index method in distinguishing native from foreign genes. Our method's horizontal gene transfer predictions for 123 microbial genomes are available online at http://cbcsrv.watson.ibm.com/HGT/.

  10. Gene Transfection Method Using Atmospheric Pressure Dielectric-Barrier Discharge Plasmas

    Science.gov (United States)

    Sasaki, Shota; Kanzaki, Makoto; Kaneko, Toshiro

    2013-09-01

    Gene transfection which is the process of deliberately introducing nucleic acids into cells is expected to play an important role in medical treatment because the process is necessary for gene therapy and creation of induced pluripotent stem (iPS) cells. However, the conventional transfection methods have some problems, so we focus attention on promising transfection methods by atmospheric pressure dielectric-barrier discharge (AP-DBD) plasmas. AP-DBD He plasmas are irradiated to the living cell covered with genes. Preliminarily, we use fluorescent dye YOYO-1 instead of the genes and use LIVE/DEAD Stain for cell viability test, and we analyze the transfection efficiency and cell viability under the various conditions. It is clarified that the transfection efficiency is strongly dependence on the plasma irradiation time and cell viability rates is high rates (>90%) regardless of long plasma irradiation time. These results suggest that ROS (Reactive Oxygen Species) and electric field generated by the plasma affect the gene transfection. In addition to this (the plasma irradiation time) dependency, we now investigate the effect of the plasma irradiation under the various conditions.

  11. Neuroplasticity of selective attention: Research foundations and preliminary evidence for a gene by intervention interaction

    Science.gov (United States)

    Stevens, Courtney; Pakulak, Eric; Hampton Wray, Amanda; Bell, Theodore A.; Neville, Helen J.

    2017-01-01

    This article reviews the trajectory of our research program on selective attention, which has moved from basic research on the neural processes underlying selective attention to translational studies using selective attention as a neurobiological target for evidence-based interventions. We use this background to present a promising preliminary investigation of how genetic and experiential factors interact during development (i.e., gene × intervention interactions). Our findings provide evidence on how exposure to a family-based training can modify the associations between genotype (5-HTTLPR) and the neural mechanisms of selective attention in preschool children from lower socioeconomic status backgrounds. PMID:28819066

  12. Neuroplasticity of selective attention: Research foundations and preliminary evidence for a gene by intervention interaction.

    Science.gov (United States)

    Isbell, Elif; Stevens, Courtney; Pakulak, Eric; Hampton Wray, Amanda; Bell, Theodore A; Neville, Helen J

    2017-08-29

    This article reviews the trajectory of our research program on selective attention, which has moved from basic research on the neural processes underlying selective attention to translational studies using selective attention as a neurobiological target for evidence-based interventions. We use this background to present a promising preliminary investigation of how genetic and experiential factors interact during development (i.e., gene × intervention interactions). Our findings provide evidence on how exposure to a family-based training can modify the associations between genotype (5-HTTLPR) and the neural mechanisms of selective attention in preschool children from lower socioeconomic status backgrounds.

  13. Selection and validation of appropriate reference genes for quantitative real-time PCR analysis in Salvia hispanica.

    Directory of Open Access Journals (Sweden)

    Rahul Gopalam

    Full Text Available Quantitative real-time polymerase chain reaction (qRT-PCR has become the most popular choice for gene expression studies. For accurate expression analysis, it is pertinent to select a stable reference gene to normalize the data. It is now known that the expression of internal reference genes varies considerably during developmental stages and under different experimental conditions. For Salvia hispanica, an economically important oilseed crop, there are no reports of stable reference genes till date. In this study, we chose 13 candidate reference genes viz. Actin11 (ACT, Elongation factor 1-alpha (EF1-α, Eukaryotic translation initiation factor 3E (ETIF3E, alpha tubulin (α-TUB, beta tubulin (β-TUB, Glyceraldehyde 3-phosphate dehydrogenase (GAPDH, Cyclophilin (CYP, Clathrin adaptor complex (CAC, Serine/threonine-protein phosphatase 2A (PP2A, FtsH protease (FtsH, 18S ribosomal RNA (18S rRNA, S-adenosyl methionine decarboxylase (SAMDC and Rubisco activase (RCA and the expression levels of these genes were assessed in a diverse set of tissue samples representing vegetative stages, reproductive stages and various abiotic stress treatments. Two of the widely used softwares, geNorm and Normfinder were used to evaluate the expression stabilities of these 13 candidate reference genes under different conditions. Results showed that GAPDH and CYP expression remain stable throughout in the different abiotic stress treatments, CAC and PP2A expression were relatively stable under reproductive stages and α-TUB, PP2A and ETIF3E were found to be stably expressed in vegetative stages. Further, the expression levels of Diacylglycerol acyltransferase (DGAT1, a key enzyme in triacylglycerol synthesis was analyzed to confirm the validity of reference genes identified in the study. This is the first systematic study of selection of reference genes in S. hispanica, and will benefit future expression studies in this crop.

  14. The impact of selection, gene flow and demographic history on heterogeneous genomic divergence: three-spine sticklebacks in divergent environments.

    Science.gov (United States)

    Ferchaud, Anne-Laure; Hansen, Michael M

    2016-01-01

    Heterogeneous genomic divergence between populations may reflect selection, but should also be seen in conjunction with gene flow and drift, particularly population bottlenecks. Marine and freshwater three-spine stickleback (Gasterosteus aculeatus) populations often exhibit different lateral armour plate morphs. Moreover, strikingly parallel genomic footprints across different marine-freshwater population pairs are interpreted as parallel evolution and gene reuse. Nevertheless, in some geographic regions like the North Sea and Baltic Sea, different patterns are observed. Freshwater populations in coastal regions are often dominated by marine morphs, suggesting that gene flow overwhelms selection, and genomic parallelism may also be less pronounced. We used RAD sequencing for analysing 28 888 SNPs in two marine and seven freshwater populations in Denmark, Europe. Freshwater populations represented a variety of environments: river populations accessible to gene flow from marine sticklebacks and large and small isolated lakes with and without fish predators. Sticklebacks in an accessible river environment showed minimal morphological and genomewide divergence from marine populations, supporting the hypothesis of gene flow overriding selection. Allele frequency spectra suggested bottlenecks in all freshwater populations, and particularly two small lake populations. However, genomic footprints ascribed to selection could nevertheless be identified. No genomic regions were consistent freshwater-marine outliers, and parallelism was much lower than in other comparable studies. Two genomic regions previously described to be under divergent selection in freshwater and marine populations were outliers between different freshwater populations. We ascribe these patterns to stronger environmental heterogeneity among freshwater populations in our study as compared to most other studies, although the demographic history involving bottlenecks should also be considered in the

  15. Underground Mining Method Selection Using WPM and PROMETHEE

    Science.gov (United States)

    Balusa, Bhanu Chander; Singam, Jayanthu

    2018-04-01

    The aim of this paper is to represent the solution to the problem of selecting suitable underground mining method for the mining industry. It is achieved by using two multi-attribute decision making techniques. These two techniques are weighted product method (WPM) and preference ranking organization method for enrichment evaluation (PROMETHEE). In this paper, analytic hierarchy process is used for weight's calculation of the attributes (i.e. parameters which are used in this paper). Mining method selection depends on physical parameters, mechanical parameters, economical parameters and technical parameters. WPM and PROMETHEE techniques have the ability to consider the relationship between the parameters and mining methods. The proposed techniques give higher accuracy and faster computation capability when compared with other decision making techniques. The proposed techniques are presented to determine the effective mining method for bauxite mine. The results of these techniques are compared with methods used in the earlier research works. The results show, conventional cut and fill method is the most suitable mining method.

  16. Population SAMC vs SAMC: Convergence and Applications to Gene Selection Problems

    KAUST Repository

    Faming Liang, Mingqi Wu

    2013-01-01

    The Bayesian model selection approach has been adopted by more and more people when analyzing a large data. However, it is known that the reversible jump MCMC (RJMCMC) algorithm, which is perhaps the most popular MCMC algorithm for Bayesian model selection, is prone to get trapped into local modes when the model space is complex. The stochastic approximation Monte Carlo (SAMC) algorithm essentially overcomes the local trap problem suffered by conventional MCMC algorithms by introducing a self-adjusting mechanism based on the past samples. In this paper, we propose a population SAMC (Pop-SAMC) algorithm, which works on a population of SAMC chains and can make use of crossover operators from genetic algorithms to further improve its efficiency. Under mild conditions, we show the convergence of this algorithm. Comparing to the single chain SAMC algorithm, Pop-SAMC provides a more efficient self-adjusting mechanism and thus can converge faster. The effectiveness of Pop-SAMC for Bayesian model selection problems is examined through a change-point identification problem and a gene selection problem. The numerical results indicate that Pop-SAMC significantly outperforms both the single chain SAMC and RJMCMC.

  17. Wild Carrot Differentiation in Europe and Selection at DcAOX1 Gene?

    Directory of Open Access Journals (Sweden)

    Tânia Nobre

    Full Text Available By definition, the domestication process leads to an overall reduction of crop genetic diversity. This lead to the current search of genomic regions in wild crop relatives (CWR, an important task for modern carrot breeding. Nowadays massive sequencing possibilities can allow for discovery of novel genetic resources in wild populations, but this quest could be aided by the use of a surrogate gene (to first identify and prioritize novel wild populations for increased sequencing effort. Alternative oxidase (AOX gene family seems to be linked to all kinds of abiotic and biotic stress reactions in various organisms and thus have the potential to be used in the identification of CWR hotspots of environment-adapted diversity. High variability of DcAOX1 was found in populations of wild carrot sampled across a West-European environmental gradient. Even though no direct relation was found with the analyzed climatic conditions or with physical distance, population differentiation exists and results mainly from the polymorphisms associated with DcAOX1 exon 1 and intron 1. The relatively high number of amino acid changes and the identification of several unusually variable positions (through a likelihood ratio test, suggests that DcAOX1 gene might be under positive selection. However, if positive selection is considered, it only acts on some specific populations (i.e. is in the form of adaptive differences in different population locations given the observed high genetic diversity. We were able to identify two populations with higher levels of differentiation which are promising as hot spots of specific functional diversity.

  18. LGscore: A method to identify disease-related genes using biological literature and Google data.

    Science.gov (United States)

    Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

    2015-04-01

    Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.

  19. Development of modelling method selection tool for health services management: from problem structuring methods to modelling and simulation methods.

    Science.gov (United States)

    Jun, Gyuchan T; Morris, Zoe; Eldabi, Tillal; Harper, Paul; Naseer, Aisha; Patel, Brijesh; Clarkson, John P

    2011-05-19

    There is an increasing recognition that modelling and simulation can assist in the process of designing health care policies, strategies and operations. However, the current use is limited and answers to questions such as what methods to use and when remain somewhat underdeveloped. The aim of this study is to provide a mechanism for decision makers in health services planning and management to compare a broad range of modelling and simulation methods so that they can better select and use them or better commission relevant modelling and simulation work. This paper proposes a modelling and simulation method comparison and selection tool developed from a comprehensive literature review, the research team's extensive expertise and inputs from potential users. Twenty-eight different methods were identified, characterised by their relevance to different application areas, project life cycle stages, types of output and levels of insight, and four input resources required (time, money, knowledge and data). The characterisation is presented in matrix forms to allow quick comparison and selection. This paper also highlights significant knowledge gaps in the existing literature when assessing the applicability of particular approaches to health services management, where modelling and simulation skills are scarce let alone money and time. A modelling and simulation method comparison and selection tool is developed to assist with the selection of methods appropriate to supporting specific decision making processes. In particular it addresses the issue of which method is most appropriate to which specific health services management problem, what the user might expect to be obtained from the method, and what is required to use the method. In summary, we believe the tool adds value to the scarce existing literature on methods comparison and selection.

  20. Physical non-viral gene delivery methods for tissue engineering.

    Science.gov (United States)

    Mellott, Adam J; Forrest, M Laird; Detamore, Michael S

    2013-03-01

    The integration of gene therapy into tissue engineering to control differentiation and direct tissue formation is not a new concept; however, successful delivery of nucleic acids into primary cells, progenitor cells, and stem cells has proven exceptionally challenging. Viral vectors are generally highly effective at delivering nucleic acids to a variety of cell populations, both dividing and non-dividing, yet these viral vectors are marred by significant safety concerns. Non-viral vectors are preferred for gene therapy, despite lower transfection efficiencies, and possess many customizable attributes that are desirable for tissue engineering applications. However, there is no single non-viral gene delivery strategy that "fits-all" cell types and tissues. Thus, there is a compelling opportunity to examine different non-viral vectors, especially physical vectors, and compare their relative degrees of success. This review examines the advantages and disadvantages of physical non-viral methods (i.e., microinjection, ballistic gene delivery, electroporation, sonoporation, laser irradiation, magnetofection, and electric field-induced molecular vibration), with particular attention given to electroporation because of its versatility, with further special emphasis on Nucleofection™. In addition, attributes of cellular character that can be used to improve differentiation strategies are examined for tissue engineering applications. Ultimately, electroporation exhibits a high transfection efficiency in many cell types, which is highly desirable for tissue engineering applications, but electroporation and other physical non-viral gene delivery methods are still limited by poor cell viability. Overcoming the challenge of poor cell viability in highly efficient physical non-viral techniques is the key to using gene delivery to enhance tissue engineering applications.

  1. Physical non-viral gene delivery methods for tissue engineering

    Science.gov (United States)

    Mellott, Adam J.; Forrest, M. Laird; Detamore, Michael S.

    2016-01-01

    The integration of gene therapy into tissue engineering to control differentiation and direct tissue formation is not a new concept; however, successful delivery of nucleic acids into primary cells, progenitor cells, and stem cells has proven exceptionally challenging. Viral vectors are generally highly effective at delivering nucleic acids to a variety of cell populations, both dividing and non-dividing, yet these viral vectors are marred by significant safety concerns. Non-viral vectors are preferred for gene therapy, despite lower transfection efficiencies, and possess many customizable attributes that are desirable for tissue engineering applications. However, there is no single non-viral gene delivery strategy that “fits-all” cell types and tissues. Thus, there is a compelling opportunity to examine different non-viral vectors, especially physical vectors, and compare their relative degrees of success. This review examines the advantages and disadvantages of physical non-viral methods (i.e., microinjection, ballistic gene delivery, electroporation, sonoporation, laser irradiation, magnetofection, and electric field-induced molecular vibration), with particular attention given to electroporation because of its versatility, with further special emphasis on Nucleofection™. In addition, attributes of cellular character that can be used to improve differentiation strategies are examined for tissue engineering applications. Ultimately, electroporation exhibits a high transfection efficiency in many cell types, which is highly desirable for tissue engineering applications, but electroporation and other physical non-viral gene delivery methods are still limited by poor cell viability. Overcoming the challenge of poor cell viability in highly efficient physical non-viral techniques is the key to using gene delivery to enhance tissue engineering applications. PMID:23099792

  2. Optimal Reference Genes for Gene Expression Normalization in Trichomonas vaginalis

    Science.gov (United States)

    dos Santos, Odelta; de Vargas Rigo, Graziela; Frasson, Amanda Piccoli; Macedo, Alexandre José; Tasca, Tiana

    2015-01-01

    Trichomonas vaginalis is the etiologic agent of trichomonosis, the most common non-viral sexually transmitted disease worldwide. This infection is associated with several health consequences, including cervical and prostate cancers and HIV acquisition. Gene expression analysis has been facilitated because of available genome sequences and large-scale transcriptomes in T. vaginalis, particularly using quantitative real-time polymerase chain reaction (qRT-PCR), one of the most used methods for molecular studies. Reference genes for normalization are crucial to ensure the accuracy of this method. However, to the best of our knowledge, a systematic validation of reference genes has not been performed for T. vaginalis. In this study, the transcripts of nine candidate reference genes were quantified using qRT-PCR under different cultivation conditions, and the stability of these genes was compared using the geNorm and NormFinder algorithms. The most stable reference genes were α-tubulin, actin and DNATopII, and, conversely, the widely used T. vaginalis reference genes GAPDH and β-tubulin were less stable. The PFOR gene was used to validate the reliability of the use of these candidate reference genes. As expected, the PFOR gene was upregulated when the trophozoites were cultivated with ferrous ammonium sulfate when the DNATopII, α-tubulin and actin genes were used as normalizing gene. By contrast, the PFOR gene was downregulated when the GAPDH gene was used as an internal control, leading to misinterpretation of the data. These results provide an important starting point for reference gene selection and gene expression analysis with qRT-PCR studies of T. vaginalis. PMID:26393928

  3. Gene-expression profiling after exposure to C-ion beams

    International Nuclear Information System (INIS)

    Saegusa, Kumiko; Furuno, Aki; Ishikawa, Kenichi; Ishikawa, Atsuko; Ohtsuka, Yoshimi; Kawai, Seiko; Imai, Takashi; Nojima, Kumie

    2005-01-01

    It is recognized that carbon-ion beam kills cancer cells more efficiently than X-ray. In this study we have compared cellular gene expression response after carbon-ion beam exposure with that after X-ray exposure. Gene expression profiles of cultured neonatal human dermal fibroblasts (NHDF) at 0, 1, 3, 6, 12, 18, and 24 hr after exposure to 0.1, 2 and 5 Gy of X-ray or carbon-ion beam were obtained using 22K oligonucleotide microarray. N-way ANOVA analysis of whole gene expression data sets selected 960 genes for carbon-ion beam and 977 genes for X-ray, respectively. Interestingly, majority of these genes (91% for carbon-ion beam and 88% for X-ray, respectively) were down regulated. The selected genes were further classified by their dose-dependence or time-dependence of gene expression change (fold change>1.5). It was revealed that genes involved in cell proliferation had tendency to show time-dependent up regulation by carbon-ion beam. Another N-way ANOVA analysis was performed to select 510 genes, and further selection was made to find 70 genes that showed radiation species-dependent gene expression change (fold change>1.25). These genes were then categorized by the K-Mean clustering method into 4 clusters. Each cluster showed tendency to contain genes involved in cell cycle regulation, cell death, responses to stress and metabolisms, respectively. (author)

  4. Selection Signatures in the First Exon of Paralogous Receptor Kinase Genes from the Sym2 Region of the Pisum sativum L. Genome

    Directory of Open Access Journals (Sweden)

    Anton S. Sulima

    2017-11-01

    Full Text Available During the initial step of the symbiosis between legumes (Fabaceae and nitrogen-fixing bacteria (rhizobia, the bacterial signal molecule known as the Nod factor (nodulation factor is recognized by plant LysM motif-containing receptor-like kinases (LysM-RLKs. The fifth chromosome of barrel medic (Medicago truncatula Gaertn. contains a cluster of paralogous LysM-RLK genes, one of which is known to participate in symbiosis. In the syntenic region of the pea (Pisum sativum L. genome, three genes have been identified: PsK1 and PsSym37, two symbiosis-related LysM-RLK genes with known sequences, and the unsequenced PsSym2 gene which presumably encodes a LysM-RLK and is associated with increased selectivity to certain Nod factors. In this work, we identified a new gene encoding a LysM-RLK, designated as PsLykX, within the Sym2 genomic region. We sequenced the first exons (corresponding to the protein receptor domain of PsSym37, PsK1, and PsLykX from a large set of pea genotypes of diverse origin. The nucleotide diversity of these fragments was estimated and groups of haplotypes for each gene were revealed. Footprints of selection pressure were detected via comparative analyses of SNP distribution across the first exons of these genes and their homologs MtLYK2, MtLYK3, and MtLYK4 from M. truncatula retrieved from the Medicago Hapmap project. Despite the remarkable similarity among all the studied genes, they exhibited contrasting selection signatures, possibly pointing to diversification of their functions. Signatures of balancing selection were found in LysM1-encoding parts of PsSym37 and PsK1, suggesting that the diversity of these parts may be important for pea LysM-RLKs. The first exons of PsSym37 and PsK1 displayed signatures of purifying selection, as well as MtLYK2 of M. truncatula. Evidence of positive selection affecting primarily LysM domains was found in all three investigated M. truncatula genes, as well as in the pea gene PsLykX. The data

  5. A Selective Review of Multimodal Fusion Methods in Schizophrenia

    Directory of Open Access Journals (Sweden)

    Jing eSui

    2012-02-01

    Full Text Available Schizophrenia (SZ is one of the most cryptic and costly mental disorders in terms of human suffering and societal expenditure (van Os and Kapur, 2009. Though strong evidences for functional, structural and genetic abnormalities associated with this disease exist, there is yet no replicable finding which has proven accurate enough to be useful in clinical decision making (Fornito et al., 2009, and its diagnosis relies primarily upon symptom assessment (Williams et al., 2010a. It is likely in part that the lack of consistent neuroimaging findings is because most models favor only one data type or do not combine data from different imaging modalities effectively, thus missing potentially important differences which are only partially detected by each modality (Calhoun et al., 2006a. It is becoming increasingly clear that multi-modal fusion, a technique which takes advantage of the fact that each modality provides a limited view of the brain/gene and may uncover hidden relationships, is an important tool to help unravel the black box of schizophrenia. In this review paper, we survey a number of multimodal fusion applications which enable us to study the schizophrenia macro-connectome, including brain functional, structural and genetic aspects and may help us understand the disorder in a more comprehensive and integrated manner. We also provide a table that characterizes these applications by the methods used and compare these methods in detail, especially for multivariate models, which may serve as a valuable reference that helps readers select an appropriate method based on a given research.

  6. Transcriptome sequencing and positive selected genes analysis of Bombyx mandarina.

    Directory of Open Access Journals (Sweden)

    Tingcai Cheng

    Full Text Available The wild silkworm Bombyx mandarina is widely believed to be an ancestor of the domesticated silkworm, Bombyx mori. Silkworms are often used as a model for studying the mechanism of species domestication. Here, we performed transcriptome sequencing of the wild silkworm using an Illumina HiSeq2000 platform. We produced 100,004,078 high-quality reads and assembled them into 50,773 contigs with an N50 length of 1764 bp and a mean length of 941.62 bp. A total of 33,759 unigenes were identified, with 12,805 annotated in the Nr database, 8273 in the Pfam database, and 9093 in the Swiss-Prot database. Expression profile analysis found significant differential expression of 1308 unigenes between the middle silk gland (MSG and posterior silk gland (PSG. Three sericin genes (sericin 1, sericin 2, and sericin 3 were expressed specifically in the MSG and three fibroin genes (fibroin-H, fibroin-L, and fibroin/P25 were expressed specifically in the PSG. In addition, 32,297 Single-nucleotide polymorphisms (SNPs and 361 insertion-deletions (INDELs were detected. Comparison with the domesticated silkworm p50/Dazao identified 5,295 orthologous genes, among which 400 might have experienced or to be experiencing positive selection by Ka/Ks analysis. These data and analyses presented here provide insights into silkworm domestication and an invaluable resource for wild silkworm genomics research.

  7. In vitro selection of mutants: Inducible gene regulation for salt tolerance

    International Nuclear Information System (INIS)

    Winicov, I.; Bastola, D.R.; Deutch, C.E.; Pethe, V.V.; Petrusa, L.

    2001-01-01

    Regulation of differentially expressed genes in plants may be involved in inducing tolerance to stress. Isogenic salt-sensitive and salt-tolerant alfalfa lines were investigated for molecular differences in their response to salt. The genes, which are differentially induced by salt in the salt-tolerant alfalfa cells and are also regulated by salt at the whole plant level, were cloned. Both transcriptional and post- transcriptional mechanisms influenced salt-induced product accumulation in the salt-tolerant alfalfa. The salt-tolerant plants doubled proline concentration rapidly in roots, while salt-sensitive plants showed a delayed response. To understand the regulatory system in the salt-tolerant alfalfa, two genes that are expressed in roots were studied. Alfin1 encodes a zinc-finger type putative DNA transcription factor conserved in alfalfa, rice and Arabidopsis, and MsPRP2 encodes a protein that serves as a cell wall- membrane linker in roots. Recombinant Alfin1 protein was selected, amplified, cloned and its consensus sequence was identified. The recombinant Alfin1 also bound specifically to fragments of the MsPRP2 promoter in vitro, containing the Alfin1 binding consensus sequence. The results show unambiguously binding specificity of Alfin1 DNA, supporting its role in gene regulation. Alfin1 function was tested in transformed alfalfa in vivo by over-expressing Alfin1 from 35S CaMV promoter. The transgenic plants appeared normal. However, plants harboring the anti-sense construct did not grow well in soil, indicating that Alfin1 expression was essential. Alfin1 over-expression in transgenic alfalfa led to enhanced levels of MsPRP2 transcript accumulation, demonstrating that Alfin1 functioned in vivo in gene regulation. Since MsPRP2 gene is also induced by salt, it is likely that Alfin1 is an important transcription factor for gene regulation in salt-tolerant alfalfa, and an excellent target for manipulation to improve salt tolerance. (author)

  8. A genetic ensemble approach for gene-gene interaction identification

    Directory of Open Access Journals (Sweden)

    Ho Joshua WK

    2010-10-01

    Full Text Available Abstract Background It has now become clear that gene-gene interactions and gene-environment interactions are ubiquitous and fundamental mechanisms for the development of complex diseases. Though a considerable effort has been put into developing statistical models and algorithmic strategies for identifying such interactions, the accurate identification of those genetic interactions has been proven to be very challenging. Methods In this paper, we propose a new approach for identifying such gene-gene and gene-environment interactions underlying complex diseases. This is a hybrid algorithm and it combines genetic algorithm (GA and an ensemble of classifiers (called genetic ensemble. Using this approach, the original problem of SNP interaction identification is converted into a data mining problem of combinatorial feature selection. By collecting various single nucleotide polymorphisms (SNP subsets as well as environmental factors generated in multiple GA runs, patterns of gene-gene and gene-environment interactions can be extracted using a simple combinatorial ranking method. Also considered in this study is the idea of combining identification results obtained from multiple algorithms. A novel formula based on pairwise double fault is designed to quantify the degree of complementarity. Conclusions Our simulation study demonstrates that the proposed genetic ensemble algorithm has comparable identification power to Multifactor Dimensionality Reduction (MDR and is slightly better than Polymorphism Interaction Analysis (PIA, which are the two most popular methods for gene-gene interaction identification. More importantly, the identification results generated by using our genetic ensemble algorithm are highly complementary to those obtained by PIA and MDR. Experimental results from our simulation studies and real world data application also confirm the effectiveness of the proposed genetic ensemble algorithm, as well as the potential benefits of

  9. Reference Gene Selection in the Desert Plant Eremosparton songoricum

    Directory of Open Access Journals (Sweden)

    Dao-Yuan Zhang

    2012-06-01

    Full Text Available Eremosparton songoricum (Litv. Vass. (E. songoricum is a rare and extremely drought-tolerant desert plant that holds promise as a model organism for the identification of genes associated with water deficit stress. Here, we cloned and evaluated the expression of eight candidate reference genes using quantitative real-time reverse transcriptase polymerase chain reactions. The expression of these candidate reference genes was analyzed in a diverse set of 20 samples including various E. songoricum plant tissues exposed to multiple environmental stresses. GeNorm analysis indicated that expression stability varied between the reference genes in the different experimental conditions, but the two most stable reference genes were sufficient for normalization in most conditions. EsEF and Esα-TUB were sufficient for various stress conditions, EsEF and EsACT were suitable for samples of differing germination stages, and EsGAPDHand EsUBQ were most stable across multiple adult tissue samples. The Es18S gene was unsuitable as a reference gene in our analysis. In addition, the expression level of the drought-stress related transcription factor EsDREB2 verified the utility of E. songoricum reference genes and indicated that no single gene was adequate for normalization on its own. This is the first systematic report on the selection of reference genes in E. songoricum, and these data will facilitate future work on gene expression in this species.

  10. FEATURE SELECTION METHODS BASED ON MUTUAL INFORMATION FOR CLASSIFYING HETEROGENEOUS FEATURES

    Directory of Open Access Journals (Sweden)

    Ratri Enggar Pawening

    2016-06-01

    Full Text Available Datasets with heterogeneous features can affect feature selection results that are not appropriate because it is difficult to evaluate heterogeneous features concurrently. Feature transformation (FT is another way to handle heterogeneous features subset selection. The results of transformation from non-numerical into numerical features may produce redundancy to the original numerical features. In this paper, we propose a method to select feature subset based on mutual information (MI for classifying heterogeneous features. We use unsupervised feature transformation (UFT methods and joint mutual information maximation (JMIM methods. UFT methods is used to transform non-numerical features into numerical features. JMIM methods is used to select feature subset with a consideration of the class label. The transformed and the original features are combined entirely, then determine features subset by using JMIM methods, and classify them using support vector machine (SVM algorithm. The classification accuracy are measured for any number of selected feature subset and compared between UFT-JMIM methods and Dummy-JMIM methods. The average classification accuracy for all experiments in this study that can be achieved by UFT-JMIM methods is about 84.47% and Dummy-JMIM methods is about 84.24%. This result shows that UFT-JMIM methods can minimize information loss between transformed and original features, and select feature subset to avoid redundant and irrelevant features.

  11. Selective sweep on human amylase genes postdates the split with Neanderthals

    OpenAIRE

    Inchley, Charlotte E.; Larbey, Cynthia D. A.; Shwan, Nzar A. A.; Pagani, Luca; Saag, Lauri; Antão, Tiago; Jacobs, Guy; Hudjashov, Georgi; Metspalu, Ene; Mitt, Mario; Eichstaedt, Christina A.; Malyarchuk, Boris; Derenko, Miroslava; Wee, Joseph; Abdullah, Syafiq

    2016-01-01

    Humans have more copies of amylase genes than other primates. It is still poorly understood, however, when the copy number expansion occurred and whether its spread was enhanced by selection. Here we assess amylase copy numbers in a global sample of 480 high coverage genomes and find that regions flanking the amylase locus show notable depression of genetic diversity both in African and non-African populations. Analysis of genetic variation in these regions supports the model of an early sele...

  12. Selecting a set of housekeeping genes for quantitative real-time PCR in normal and tetraploid haemocytes of soft-shell clams, Mya arenaria.

    Science.gov (United States)

    Siah, A; Dohoo, C; McKenna, P; Delaporte, M; Berthe, F C J

    2008-09-01

    The transcripts involved in the molecular mechanisms of haemic neoplasia in relation to the haemocyte ploidy status of the soft-shell clam, Mya arenaria, have yet to be identified. For this purpose, real-time quantitative RT-PCR constitutes a sensitive and efficient technique, which can help determine the gene expression involved in haemocyte tetraploid status in clams affected by haemic neoplasia. One of the critical steps in comparing transcription profiles is the stability of selected housekeeping genes, as well as an accurate normalization. In this study, we selected five reference genes, S18, L37, EF1, EF2 and actin, generally used as single control genes. Their expression was analyzed by real-time quantitative RT-PCR at different levels of haemocyte ploidy status in order to select the most stable genes. Using the geNorm software, our results showed that L37, EF1 and S18 represent the most stable gene expressions related to various ploidy status ranging from 0 to 78% of tetraploid haemocytes in clams sampled in North River (Prince Edward Island, Canada). However, actin gene expression appeared to be highly regulated. Hence, using it as a housekeeping gene in tetraploid haemocytes can result in inaccurate data. To compare gene expression levels related to haemocyte ploidy status in Mya arenaria, using L37, EF1 and S18 as housekeeping genes for accurate normalization is therefore recommended.

  13. Supplier Selection Using Weighted Utility Additive Method

    Science.gov (United States)

    Karande, Prasad; Chakraborty, Shankar

    2015-10-01

    Supplier selection is a multi-criteria decision-making (MCDM) problem which mainly involves evaluating a number of available suppliers according to a set of common criteria for choosing the best one to meet the organizational needs. For any manufacturing or service organization, selecting the right upstream suppliers is a key success factor that will significantly reduce purchasing cost, increase downstream customer satisfaction and improve competitive ability. The past researchers have attempted to solve the supplier selection problem employing different MCDM techniques which involve active participation of the decision makers in the decision-making process. This paper deals with the application of weighted utility additive (WUTA) method for solving supplier selection problems. The WUTA method, an extension of utility additive approach, is based on ordinal regression and consists of building a piece-wise linear additive decision model from a preference structure using linear programming (LP). It adopts preference disaggregation principle and addresses the decision-making activities through operational models which need implicit preferences in the form of a preorder of reference alternatives or a subset of these alternatives present in the process. The preferential preorder provided by the decision maker is used as a restriction of a LP problem, which has its own objective function, minimization of the sum of the errors associated with the ranking of each alternative. Based on a given reference ranking of alternatives, one or more additive utility functions are derived. Using these utility functions, the weighted utilities for individual criterion values are combined into an overall weighted utility for a given alternative. It is observed that WUTA method, having a sound mathematical background, can provide accurate ranking to the candidate suppliers and choose the best one to fulfill the organizational requirements. Two real time examples are illustrated to prove

  14. Maximizing biomarker discovery by minimizing gene signatures

    Directory of Open Access Journals (Sweden)

    Chang Chang

    2011-12-01

    Full Text Available Abstract Background The use of gene signatures can potentially be of considerable value in the field of clinical diagnosis. However, gene signatures defined with different methods can be quite various even when applied the same disease and the same endpoint. Previous studies have shown that the correct selection of subsets of genes from microarray data is key for the accurate classification of disease phenotypes, and a number of methods have been proposed for the purpose. However, these methods refine the subsets by only considering each single feature, and they do not confirm the association between the genes identified in each gene signature and the phenotype of the disease. We proposed an innovative new method termed Minimize Feature's Size (MFS based on multiple level similarity analyses and association between the genes and disease for breast cancer endpoints by comparing classifier models generated from the second phase of MicroArray Quality Control (MAQC-II, trying to develop effective meta-analysis strategies to transform the MAQC-II signatures into a robust and reliable set of biomarker for clinical applications. Results We analyzed the similarity of the multiple gene signatures in an endpoint and between the two endpoints of breast cancer at probe and gene levels, the results indicate that disease-related genes can be preferably selected as the components of gene signature, and that the gene signatures for the two endpoints could be interchangeable. The minimized signatures were built at probe level by using MFS for each endpoint. By applying the approach, we generated a much smaller set of gene signature with the similar predictive power compared with those gene signatures from MAQC-II. Conclusions Our results indicate that gene signatures of both large and small sizes could perform equally well in clinical applications. Besides, consistency and biological significances can be detected among different gene signatures, reflecting the

  15. Multiplex cDNA quantification method that facilitates the standardization of gene expression data

    Science.gov (United States)

    Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira

    2011-01-01

    Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because of the microarray method can provide merely the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008

  16. A statistical method for predicting splice variants between two groups of samples using GeneChip® expression array data

    Directory of Open Access Journals (Sweden)

    Olson James M

    2006-04-01

    Full Text Available Abstract Background Alternative splicing of pre-messenger RNA results in RNA variants with combinations of selected exons. It is one of the essential biological functions and regulatory components in higher eukaryotic cells. Some of these variants are detectable with the Affymetrix GeneChip® that uses multiple oligonucleotide probes (i.e. probe set, since the target sequences for the multiple probes are adjacent within each gene. Hybridization intensity from a probe correlates with abundance of the corresponding transcript. Although the multiple-probe feature in the current GeneChip® was designed to assess expression values of individual genes, it also measures transcriptional abundance for a sub-region of a gene sequence. This additional capacity motivated us to develop a method to predict alternative splicing, taking advance of extensive repositories of GeneChip® gene expression array data. Results We developed a two-step approach to predict alternative splicing from GeneChip® data. First, we clustered the probes from a probe set into pseudo-exons based on similarity of probe intensities and physical adjacency. A pseudo-exon is defined as a sequence in the gene within which multiple probes have comparable probe intensity values. Second, for each pseudo-exon, we assessed the statistical significance of the difference in probe intensity between two groups of samples. Differentially expressed pseudo-exons are predicted to be alternatively spliced. We applied our method to empirical data generated from GeneChip® Hu6800 arrays, which include 7129 probe sets and twenty probes per probe set. The dataset consists of sixty-nine medulloblastoma (27 metastatic and 42 non-metastatic samples and four cerebellum samples as normal controls. We predicted that 577 genes would be alternatively spliced when we compared normal cerebellum samples to medulloblastomas, and predicted that thirteen genes would be alternatively spliced when we compared metastatic

  17. MICB gene diversity and balancing selection on its promoter region in Yao population in southern China.

    Science.gov (United States)

    Chen, Xiang; Liu, Xuexiang; Wei, Xiaomou; Meng, Yuming; Liu, Limin; Qin, Shini; Liu, Yanyu; Dai, Shengming

    2016-12-01

    To comprehensively examine the MICB gene polymorphism and identify its differences in Chinese Yao population from other ethnic groups, we investigated the polymorphism in the 5'-upstream regulation region (5'-URR), coding region (exons 2-4), and the 3'-untranslated region (3'-UTR) of MICB gene by using PCR-SBT method in 125 healthy unrelated Yao individuals in Guangxi Zhuang Autonomous Region. Higher polymorphism was observed in the 5'-URR, nine single nucleotide polymorphisms (SNPs) and a two base pairs deletion at position -139/-138 were found in our study. Only five different variation sites, however, were detected in exons 2-4 and three were observed in the 3'-UTR. The minor allele frequencies of all variants were greater than 5%, except for rs3828916, rs3131639, rs45627734, rs113620316, rs779737471, and the variation at position +11803 in the 3'-UTR. The first nine SNPs of 5'-URR and rs1065075, rs1051788 of the coding region showed significant linkage disequilibrium with each other. Ten different MICB extended haplotypes (EH) encompassing the 5'-URR, exons 2-4, and 3'-UTR were found in this population, and the most frequent was EH1 (23.2%). We provided several evidences for balancing selection effect on the 5'-URR of MICB gene in Yao population. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  18. Tetracycline resistance genes persist in soil amended with cattle feces independently from chlortetracycline selection pressure

    NARCIS (Netherlands)

    Kyselkova, Martina; Kotrbova, Lucie; Bhumibhamon, Gamonsiri; Chronakova, Alica; Jirout, Jiri; Vrchotova, Nadezda; Schmitt, Heike; Elhottova, Dana

    Antibiotic residues and antibiotic resistance genes originating from animal waste represent environmental pollutants with possible human health consequences. In this study, we addressed the question whether chlortetracycline (CTC) residues in soils can act as selective pressure enhancing the

  19. Feature selection for high-dimensional integrated data

    KAUST Repository

    Zheng, Charles

    2012-04-26

    Motivated by the problem of identifying correlations between genes or features of two related biological systems, we propose a model of feature selection in which only a subset of the predictors Xt are dependent on the multidimensional variate Y, and the remainder of the predictors constitute a “noise set” Xu independent of Y. Using Monte Carlo simulations, we investigated the relative performance of two methods: thresholding and singular-value decomposition, in combination with stochastic optimization to determine “empirical bounds” on the small-sample accuracy of an asymptotic approximation. We demonstrate utility of the thresholding and SVD feature selection methods to with respect to a recent infant intestinal gene expression and metagenomics dataset.

  20. Feature selection for high-dimensional integrated data

    KAUST Repository

    Zheng, Charles; Schwartz, Scott; Chapkin, Robert S.; Carroll, Raymond J.; Ivanov, Ivan

    2012-01-01

    Motivated by the problem of identifying correlations between genes or features of two related biological systems, we propose a model of feature selection in which only a subset of the predictors Xt are dependent on the multidimensional variate Y, and the remainder of the predictors constitute a “noise set” Xu independent of Y. Using Monte Carlo simulations, we investigated the relative performance of two methods: thresholding and singular-value decomposition, in combination with stochastic optimization to determine “empirical bounds” on the small-sample accuracy of an asymptotic approximation. We demonstrate utility of the thresholding and SVD feature selection methods to with respect to a recent infant intestinal gene expression and metagenomics dataset.

  1. A primary study of high performance transgenic rice through maize UBI-1 promoter fusing selective maker gene

    International Nuclear Information System (INIS)

    Shen, J.; Cai, P.; Qing, F.

    2012-01-01

    Based on the expression vector pBI121, we successfully constructed a plant over-expression vector of Hspa4 gene fusing with selective maker gene (hygromycin-resistance gene) driven by the Ubi-1 promoter (pBI121-Ubi-Hpt-Hspa4, p121UHH). The plant expression vectors p121UHH and pCAMBIA1301-Ubi-Hspa4 (p1301UH) were transformed into the rice callus, mediated by Agrobacterium tumefaciens. We screened 17 p121UHH-positive transgenic plants and 15 p1301UH-positive transgenic plants by the hygromycin-resistance gene. The pick-up rate of the resistance callus was 51.7% and 42.5%, respectively, and the rate of regeneration for the resistance callus was 51.2% and 49.1%, respectively. The result of polymerase chain reaction (P CR) identification indicated that the pick-up rate of positive transgenic plants was 51.7% and 42.5% and the total transformation efficiency was 16.5% and 6.2%, and the former was 2.66 time s of the later. The results of the experiment indicate that the possibility of the appearance of false positive results in the fusing of a plant over-expression vector with a selective maker gene is much less. (author)

  2. Normalization and gene p-value estimation: issues in microarray data processing.

    Science.gov (United States)

    Fundel, Katrin; Küffner, Robert; Aigner, Thomas; Zimmer, Ralf

    2008-05-28

    Numerous methods exist for basic processing, e.g. normalization, of microarray gene expression data. These methods have an important effect on the final analysis outcome. Therefore, it is crucial to select methods appropriate for a given dataset in order to assure the validity and reliability of expression data analysis. Furthermore, biological interpretation requires expression values for genes, which are often represented by several spots or probe sets on a microarray. How to best integrate spot/probe set values into gene values has so far been a somewhat neglected problem. We present a case study comparing different between-array normalization methods with respect to the identification of differentially expressed genes. Our results show that it is feasible and necessary to use prior knowledge on gene expression measurements to select an adequate normalization method for the given data. Furthermore, we provide evidence that combining spot/probe set p-values into gene p-values for detecting differentially expressed genes has advantages compared to combining expression values for spots/probe sets into gene expression values. The comparison of different methods suggests to use Stouffer's method for this purpose. The study has been conducted on gene expression experiments investigating human joint cartilage samples of osteoarthritis related groups: a cDNA microarray (83 samples, four groups) and an Affymetrix (26 samples, two groups) data set. The apparently straight forward steps of gene expression data analysis, e.g. between-array normalization and detection of differentially regulated genes, can be accomplished by numerous different methods. We analyzed multiple methods and the possible effects and thereby demonstrate the importance of the single decisions taken during data processing. We give guidelines for evaluating normalization outcomes. An overview of these effects via appropriate measures and plots compared to prior knowledge is essential for the biological

  3. Increasing cocoa butter-like lipid production of Saccharomyces cerevisiae by expression of selected cocoa genes

    DEFF Research Database (Denmark)

    Wei, Yongjun; Gossing, Michael; Bergenholm, David

    2017-01-01

    for CB biosynthesis from the cocoa genome using a phylogenetic analysis approach. By expressing the selected cocoa genes in S. cerevisiae, we successfully increased total fatty acid production, TAG production and CBL production in some S. cerevisiae strains. The relative CBL content in three yeast...... higher level of CBL compared with the control strain. In summary, CBL production by S. cerevisiae were increased through expressing selected cocoa genes potentially involved in CB biosynthesis.......Cocoa butter (CB) extracted from cocoa beans mainly consists of three different kinds of triacylglycerols (TAGs), 1,3-dipalmitoyl-2-oleoyl-glycerol (POP, C16:0-C18:1-C16:0), 1-palmitoyl-3-stearoyl-2-oleoyl-glycerol(POS,C16:0C18:1-C18:0) and 1,3-distearoyl-2-oleoyl-glycerol (SOS, C18:0-C18:1-C18...

  4. A novel approach to select differential pathways associated with hypertrophic cardiomyopathy based on gene co‑expression analysis.

    Science.gov (United States)

    Chen, Xiao-Min; Feng, Ming-Jun; Shen, Cai-Jie; He, Bin; Du, Xian-Feng; Yu, Yi-Bo; Liu, Jing; Chu, Hui-Min

    2017-07-01

    The present study was designed to develop a novel method for identifying significant pathways associated with human hypertrophic cardiomyopathy (HCM), based on gene co‑expression analysis. The microarray dataset associated with HCM (E‑GEOD‑36961) was obtained from the European Molecular Biology Laboratory‑European Bioinformatics Institute database. Informative pathways were selected based on the Reactome pathway database and screening treatments. An empirical Bayes method was utilized to construct co‑expression networks for informative pathways, and a weight value was assigned to each pathway. Differential pathways were extracted based on weight threshold, which was calculated using a random model. In order to assess whether the co‑expression method was feasible, it was compared with traditional pathway enrichment analysis of differentially expressed genes, which were identified using the significance analysis of microarrays package. A total of 1,074 informative pathways were screened out for subsequent investigations and their weight values were also obtained. According to the threshold of weight value of 0.01057, 447 differential pathways, including folding of actin by chaperonin containing T‑complex protein 1 (CCT)/T‑complex protein 1 ring complex (TRiC), purine ribonucleoside monophosphate biosynthesis and ubiquinol biosynthesis, were obtained. Compared with traditional pathway enrichment analysis, the number of pathways obtained from the co‑expression approach was increased. The results of the present study demonstrated that this method may be useful to predict marker pathways for HCM. The pathways of folding of actin by CCT/TRiC and purine ribonucleoside monophosphate biosynthesis may provide evidence of the underlying molecular mechanisms of HCM, and offer novel therapeutic directions for HCM.

  5. Mapping and marker-assisted selection of a brown planthopper resistance gene bph2 in rice (Oryza sativa L.).

    Science.gov (United States)

    Sun, Li-Hong; Wang, Chun-Ming; Su, Chang-Chao; Liu, Yu-Qiang; Zhai, Hu-Qu; Wan, Jian-Min

    2006-08-01

    Nilaparvata lugens Stål (brown planthopper, BPH), is one of the major insect pests of rice (Oryza sativa L.) in the temperate rice-growing region. In this study, ASD7 harboring a BPH resistance gene bph2 was crossed to a susceptible cultivar C418, a japonica restorer line. BPH resistance was evaluated using 134 F2:3 lines derived from the cross between "ASD7" and "C418". SSR assay and linkage analysis were carried out to detect bph2. As a result, the resistant gene bph2 in ASD7 was successfully mapped between RM7102 and RM463 on the long arm of chromosome 12, with distances of 7.6 cM and 7.2 cM, respectively. Meanwhile, both phenotypic selection and marker-assisted selection (MAS) were conducted in the BC1F1 and BC2F1 populations. Selection efficiencies of RM7102 and RM463 were determined to be 89.9% and 91.2%, respectively. It would be very beneficial for BPH resistance improvement by using MAS of this gene.

  6. Discrete Biogeography Based Optimization for Feature Selection in Molecular Signatures.

    Science.gov (United States)

    Liu, Bo; Tian, Meihong; Zhang, Chunhua; Li, Xiangtao

    2015-04-01

    Biomarker discovery from high-dimensional data is a complex task in the development of efficient cancer diagnoses and classification. However, these data are usually redundant and noisy, and only a subset of them present distinct profiles for different classes of samples. Thus, selecting high discriminative genes from gene expression data has become increasingly interesting in the field of bioinformatics. In this paper, a discrete biogeography based optimization is proposed to select the good subset of informative gene relevant to the classification. In the proposed algorithm, firstly, the fisher-markov selector is used to choose fixed number of gene data. Secondly, to make biogeography based optimization suitable for the feature selection problem; discrete migration model and discrete mutation model are proposed to balance the exploration and exploitation ability. Then, discrete biogeography based optimization, as we called DBBO, is proposed by integrating discrete migration model and discrete mutation model. Finally, the DBBO method is used for feature selection, and three classifiers are used as the classifier with the 10 fold cross-validation method. In order to show the effective and efficiency of the algorithm, the proposed algorithm is tested on four breast cancer dataset benchmarks. Comparison with genetic algorithm, particle swarm optimization, differential evolution algorithm and hybrid biogeography based optimization, experimental results demonstrate that the proposed method is better or at least comparable with previous method from literature when considering the quality of the solutions obtained. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Pugalendhi Ganesh Kumar

    Full Text Available This study describes a novel approach to reducing the challenges of highly nonlinear multiclass gene expression values for cancer diagnosis. To build a fruitful system for cancer diagnosis, in this study, we introduced two levels of gene selection such as filtering and embedding for selection of potential genes and the most relevant genes associated with cancer, respectively. The filter procedure was implemented by developing a fuzzy rough set (FR-based method for redefining the criterion function of f-information (FI to identify the potential genes without discretizing the continuous gene expression values. The embedded procedure is implemented by means of a water swirl algorithm (WSA, which attempts to optimize the rule set and membership function required to classify samples using a fuzzy-rule-based multiclassification system (FRBMS. Two novel update equations are proposed in WSA, which have better exploration and exploitation abilities while designing a self-learning FRBMS. The efficiency of our new approach was evaluated on 13 multicategory and 9 binary datasets of cancer gene expression. Additionally, the performance of the proposed FRFI-WSA method in designing an FRBMS was compared with existing methods for gene selection and optimization such as genetic algorithm (GA, particle swarm optimization (PSO, and artificial bee colony algorithm (ABC on all the datasets. In the global cancer map with repeated measurements (GCM_RM dataset, the FRFI-WSA showed the smallest number of 16 most relevant genes associated with cancer using a minimal number of 26 compact rules with the highest classification accuracy (96.45%. In addition, the statistical validation used in this study revealed that the biological relevance of the most relevant genes associated with cancer and their linguistics detected by the proposed FRFI-WSA approach are better than those in the other methods. The simple interpretable rules with most relevant genes and effectively

  8. Mitogen activated protein kinases selectively regulate palytoxin-stimulated gene expression in mouse keratinocytes

    International Nuclear Information System (INIS)

    Zeliadt, Nicholette A.; Warmka, Janel K.; Wattenberg, Elizabeth V.

    2003-01-01

    We have been investigating how the novel skin tumor promoter palytoxin transmits signals through mitogen activated protein kinases (MAPKs). Palytoxin activates three major MAPKs, extracellular signal-regulated kinase (ERK), c-Jun N-terminal kinase (JNK), and p38, in a keratinocyte cell line derived from initiated mouse skin (308). We previously showed that palytoxin requires ERK to increase matrix metalloproteinase-13 (MMP-13) gene expression, an enzyme implicated in carcinogenesis. Diverse stimuli require JNK and p38 to increase MMP-13 gene expression, however. We therefore used the JNK and p38 inhibitors SP 600125 and SB 202190, respectively, to investigate the role of these MAPKs in palytoxin-induced MMP-13 gene expression. Surprisingly, palytoxin does not require JNK and p38 to increase MMP-13 gene expression. Accordingly, ERK activation, independent of palytoxin and in the absence of JNK and p38 activation, is sufficient to induce MMP-13 gene expression in 308 keratinocytes. Dexamethasone, a synthetic glucocorticoid that inhibits activator protein-1 (AP-1), blocked palytoxin-stimulated MMP-13 gene expression. Therefore, the AP-1 site present in the promoter of the MMP-13 gene appears to be functional and to play a key role in palytoxin-stimulated gene expression. Previous studies showed that palytoxin simulates an ERK-dependent selective increase in the c-Fos content of AP-1 complexes that bind to the promoter of the MMP-13 gene. JNK and p38 can also modulate c-Fos. Palytoxin does not require JNK or p38 to increase c-Fos binding, however. Altogether, these studies indicate that ERK plays a distinctly essential role in transmitting palytoxin-stimulated signals to specific nuclear targets in keratinocytes derived from initiated mouse skin

  9. Aquaporins in the wild: natural genetic diversity and selective pressure in the PIP gene family in five Neotropical tree species

    Directory of Open Access Journals (Sweden)

    Vendramin Giovanni G

    2010-06-01

    Full Text Available Abstract Background Tropical trees undergo severe stress through seasonal drought and flooding, and the ability of these species to respond may be a major factor in their survival in tropical ecosystems, particularly in relation to global climate change. Aquaporins are involved in the regulation of water flow and have been shown to be involved in drought response; they may therefore play a major adaptive role in these species. We describe genetic diversity in the PIP sub-family of the widespread gene family of Aquaporins in five Neotropical tree species covering four botanical families. Results PIP Aquaporin subfamily genes were isolated, and their DNA sequence polymorphisms characterised in natural populations. Sequence data were analysed with statistical tests of standard neutral equilibrium and demographic scenarios simulated to compare with the observed results. Chloroplast SSRs were also used to test demographic transitions. Most gene fragments are highly polymorphic and display signatures of balancing selection or bottlenecks; chloroplast SSR markers have significant statistics that do not conform to expectations for population bottlenecks. Although not incompatible with a purely demographic scenario, the combination of all tests tends to favour a selective interpretation of extant gene diversity. Conclusions Tropical tree PIP genes may generally undergo balancing selection, which may maintain high levels of genetic diversity at these loci. Genetic variation at PIP genes may represent a response to variable environmental conditions.

  10. An improved method for transformation of lettuce by Agrobacterium tumefaciens with a gene that confers freezing resistance

    Directory of Open Access Journals (Sweden)

    Pileggi Marcos

    2001-01-01

    Full Text Available An efficient method for constructing transgenic lettuce cultivars by Agrobacterium tumefaciens was described by Torres et al., 1993. In the present work, an improvement of the above procedure is described and applied to transform the cultivar Grand Rapids with a mutated P5CS gene. The major modifications were concerned with turning more practical the transformation and regeneration protocols. Also we tried to improve transformation steps by increasing injured area in explants and prolonging co-cultivation with Agrobacteria (in larger concentration. A more significant selective pressure was used against non-transformed plants and bacteria. In these work we were concerned to obtain T1 and T2 seeds. The P5CS gene codes for a delta¹-pyrroline-5-carboxylate synthetase, a bifunctional enzyme that catalyzes two steps of proline biosynthesis in plants (Zhang et al., 1995; Peng et al., 1996, while the mutated gene is insensitive to feedback inhibition by proline. The potential benefit of this gene is to confer water stress resistance (drought, salt, cold due to increased intracellular levels of proline that works like an osmoprotectant. In this work could obtain and characterize transgenic lettuce lineages which are resistant to freezing temperature.

  11. An Antibiotic Selection System For Protein Overproducing Bacteria

    DEFF Research Database (Denmark)

    Rennig, Maja; Nørholm, Morten

    2015-01-01

    Introduction: Protein overproduction is a major bottleneck for analyses of membrane proteins and for the construction of cell factories. Screening for optimized protein production can be very time consuming. In this study we show that the coupling of antibiotic resistance to poorly produced...... membrane proteins of Escherichia coli can be used as a fast and simple selection system for protein overproduction.Methods: We designed an expression plasmid encoding the gene of interest and an additional, inducible antibiotic resistance marker. Both genes were linked by a hairpin structure...... that translationally couples the genes. Consequently, high expressing gene variants also allow for higher production of the coupled antibiotic resistance marker. Therefore, high expressing gene variants in a library can be determined either by plating the expression library on selection plates or by growing...

  12. Island-Model Genomic Selection for Long-Term Genetic Improvement of Autogamous Crops.

    Science.gov (United States)

    Yabe, Shiori; Yamasaki, Masanori; Ebana, Kaworu; Hayashi, Takeshi; Iwata, Hiroyoshi

    2016-01-01

    Acceleration of genetic improvement of autogamous crops such as wheat and rice is necessary to increase cereal production in response to the global food crisis. Population and pedigree methods of breeding, which are based on inbred line selection, are used commonly in the genetic improvement of autogamous crops. These methods, however, produce a few novel combinations of genes in a breeding population. Recurrent selection promotes recombination among genes and produces novel combinations of genes in a breeding population, but it requires inaccurate single-plant evaluation for selection. Genomic selection (GS), which can predict genetic potential of individuals based on their marker genotype, might have high reliability of single-plant evaluation and might be effective in recurrent selection. To evaluate the efficiency of recurrent selection with GS, we conducted simulations using real marker genotype data of rice cultivars. Additionally, we introduced the concept of an "island model" inspired by evolutionary algorithms that might be useful to maintain genetic variation through the breeding process. We conducted GS simulations using real marker genotype data of rice cultivars to evaluate the efficiency of recurrent selection and the island model in an autogamous species. Results demonstrated the importance of producing novel combinations of genes through recurrent selection. An initial population derived from admixture of multiple bi-parental crosses showed larger genetic gains than a population derived from a single bi-parental cross in whole cycles, suggesting the importance of genetic variation in an initial population. The island-model GS better maintained genetic improvement in later generations than the other GS methods, suggesting that the island-model GS can utilize genetic variation in breeding and can retain alleles with small effects in the breeding population. The island-model GS will become a new breeding method that enhances the potential of genomic

  13. Island-Model Genomic Selection for Long-Term Genetic Improvement of Autogamous Crops.

    Directory of Open Access Journals (Sweden)

    Shiori Yabe

    Full Text Available Acceleration of genetic improvement of autogamous crops such as wheat and rice is necessary to increase cereal production in response to the global food crisis. Population and pedigree methods of breeding, which are based on inbred line selection, are used commonly in the genetic improvement of autogamous crops. These methods, however, produce a few novel combinations of genes in a breeding population. Recurrent selection promotes recombination among genes and produces novel combinations of genes in a breeding population, but it requires inaccurate single-plant evaluation for selection. Genomic selection (GS, which can predict genetic potential of individuals based on their marker genotype, might have high reliability of single-plant evaluation and might be effective in recurrent selection. To evaluate the efficiency of recurrent selection with GS, we conducted simulations using real marker genotype data of rice cultivars. Additionally, we introduced the concept of an "island model" inspired by evolutionary algorithms that might be useful to maintain genetic variation through the breeding process. We conducted GS simulations using real marker genotype data of rice cultivars to evaluate the efficiency of recurrent selection and the island model in an autogamous species. Results demonstrated the importance of producing novel combinations of genes through recurrent selection. An initial population derived from admixture of multiple bi-parental crosses showed larger genetic gains than a population derived from a single bi-parental cross in whole cycles, suggesting the importance of genetic variation in an initial population. The island-model GS better maintained genetic improvement in later generations than the other GS methods, suggesting that the island-model GS can utilize genetic variation in breeding and can retain alleles with small effects in the breeding population. The island-model GS will become a new breeding method that enhances the

  14. Alternative microbial methods: An overview and selection criteria.

    NARCIS (Netherlands)

    Jasson, V.; Jacxsens, L.; Luning, P.A.; Rajkovic, A.; Uyttendaele, M.

    2010-01-01

    This study provides an overview and criteria for the selection of a method, other than the reference method, for microbial analysis of foods. In a first part an overview of the general characteristics of rapid methods available, both for enumeration and detection, is given with reference to relevant

  15. Methods for selective functionalization and separation of carbon nanotubes

    Science.gov (United States)

    Strano, Michael S. (Inventor); Usrey, Monica (Inventor); Barone, Paul (Inventor); Dyke, Christopher A. (Inventor); Tour, James M. (Inventor); Kittrell, W. Carter (Inventor); Hauge, Robert H (Inventor); Smalley, Richard E. (Inventor); Marek, legal representative, Irene Marie (Inventor)

    2011-01-01

    The present invention is directed toward methods of selectively functionalizing carbon nanotubes of a specific type or range of types, based on their electronic properties, using diazonium chemistry. The present invention is also directed toward methods of separating carbon nanotubes into populations of specific types or range(s) of types via selective functionalization and electrophoresis, and also to the novel compositions generated by such separations.

  16. Development of an ELA-DRA gene typing method based on pyrosequencing technology.

    Science.gov (United States)

    Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

    2008-11-01

    The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.

  17. Advanced display object selection methods for enhancing user-computer productivity

    Science.gov (United States)

    Osga, Glenn A.

    1993-01-01

    The User-Interface Technology Branch at NCCOSC RDT&E Division has been conducting a series of studies to address the suitability of commercial off-the-shelf (COTS) graphic user-interface (GUI) methods for efficiency and performance in critical naval combat systems. This paper presents an advanced selection algorithm and method developed to increase user performance when making selections on tactical displays. The method has also been applied with considerable success to a variety of cursor and pointing tasks. Typical GUI's allow user selection by: (1) moving a cursor with a pointing device such as a mouse, trackball, joystick, touchscreen; and (2) placing the cursor on the object. Examples of GUI objects are the buttons, icons, folders, scroll bars, etc. used in many personal computer and workstation applications. This paper presents an improved method of selection and the theoretical basis for the significant performance gains achieved with various input devices tested. The method is applicable to all GUI styles and display sizes, and is particularly useful for selections on small screens such as notebook computers. Considering the amount of work-hours spent pointing and clicking across all styles of available graphic user-interfaces, the cost/benefit in applying this method to graphic user-interfaces is substantial, with the potential for increasing productivity across thousands of users and applications.

  18. Novel Harmonic Regularization Approach for Variable Selection in Cox’s Proportional Hazards Model

    Directory of Open Access Journals (Sweden)

    Ge-Jin Chu

    2014-01-01

    Full Text Available Variable selection is an important issue in regression and a number of variable selection methods have been proposed involving nonconvex penalty functions. In this paper, we investigate a novel harmonic regularization method, which can approximate nonconvex Lq  (1/2select key risk factors in the Cox’s proportional hazards model using microarray gene expression data. The harmonic regularization method can be efficiently solved using our proposed direct path seeking approach, which can produce solutions that closely approximate those for the convex loss function and the nonconvex regularization. Simulation results based on the artificial datasets and four real microarray gene expression datasets, such as real diffuse large B-cell lymphoma (DCBCL, the lung cancer, and the AML datasets, show that the harmonic regularization method can be more accurate for variable selection than existing Lasso series methods.

  19. Characterization of the bovine pregnancy-associated glycoprotein gene family – analysis of gene sequences, regulatory regions within the promoter and expression of selected genes

    Directory of Open Access Journals (Sweden)

    Walker Angela M

    2009-04-01

    Full Text Available Abstract Background The Pregnancy-associated glycoproteins (PAGs belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown that the PAG family segregates into 'ancient' and 'modern' groupings. Along with sequence differences between family members, there are clear distinctions in their spatio-temporal distribution and in their relative level of expression. In this report, 1 we performed an in silico analysis of the bovine genome to further characterize the PAG gene family, 2 we scrutinized proximal promoter sequences of the PAG genes to evaluate the evolution pressures operating on them and to identify putative regulatory regions, 3 we determined relative transcript abundance of selected PAGs during pregnancy and, 4 we performed preliminary characterization of the putative regulatory elements for one of the candidate PAGs, bovine (bo PAG-2. Results From our analysis of the bovine genome, we identified 18 distinct PAG genes and 14 pseudogenes. We observed that the first 500 base pairs upstream of the translational start site contained multiple regions that are conserved among all boPAGs. However, a preponderance of conserved regions, that harbor recognition sites for putative transcriptional factors (TFs, were found to be unique to the modern boPAG grouping, but not the ancient boPAGs. We gathered evidence by means of Q-PCR and screening of EST databases to show that boPAG-2 is the most abundant of all boPAG transcripts. Finally, we provided preliminary evidence for the role of ETS- and DDVL-related TFs in the regulation of the boPAG-2 gene. Conclusion PAGs represent a relatively large gene family in the bovine genome. The proximal promoter regions of these genes display differences in putative TF binding sites, likely contributing to observed

  20. A fuzzy logic based PROMETHEE method for material selection problems

    Directory of Open Access Journals (Sweden)

    Muhammet Gul

    2018-03-01

    Full Text Available Material selection is a complex problem in the design and development of products for diverse engineering applications. This paper presents a fuzzy PROMETHEE (Preference Ranking Organization Method for Enrichment Evaluation method based on trapezoidal fuzzy interval numbers that can be applied to the selection of materials for an automotive instrument panel. Also, it presents uniqueness in making a significant contribution to the literature in terms of the application of fuzzy decision-making approach to material selection problems. The method is illustrated, validated, and compared against three different fuzzy MCDM methods (fuzzy VIKOR, fuzzy TOPSIS, and fuzzy ELECTRE in terms of its ranking performance. Also, the relationships between the compared methods and the proposed scenarios for fuzzy PROMETHEE are evaluated via the Spearman’s correlation coefficient. Styrene Maleic Anhydride and Polypropylene are determined optionally as suitable materials for the automotive instrument panel case. We propose a generic fuzzy MCDM methodology that can be practically implemented to material selection problem. The main advantages of the methodology are consideration of the vagueness, uncertainty, and fuzziness to decision making environment.

  1. Signature of balancing selection at the MC1R gene in Kunming dog populations.

    Directory of Open Access Journals (Sweden)

    Guo-dong Wang

    Full Text Available Coat color in dog breeds is an excellent character for revealing the power of artificial selection, as it is extremely diverse and likely the result of recent domestication. Coat color is generated by melanocytes, which synthesize pheomelanin (a red or yellow pigment or eumelanin (a black or brown pigment through the pigment type-switching pathway, and is regulated by three genes in dogs: MC1R (melanocortin receptor 1, CBD103 (β-defensin 103, and ASIP (agouti-signaling protein precursor. The genotypes of these three gene loci in dog breeds are associated with coat color pattern. Here, we resequenced these three gene loci in two Kunming dog populations and analyzed these sequences using population genetic approaches to identify evolutionary patterns that have occurred at these loci during the recent domestication and breeding of the Kunming dog. The analysis showed that MC1R undergoes balancing selection in both Kunming dog populations, and that the Fst value for MC1R indicates significant genetic differentiation across the two populations. In contrast, similar results were not observed for CBD103 or ASIP. These results suggest that high heterozygosity and allelic differences at the MC1R locus may explain both the mixed color coat, of yellow and black, and the difference in coat colors in both Kunming dog populations.

  2. The strong selective sweep candidate gene ADRA2C does not explain domestication related changes in the stress response of chickens.

    Directory of Open Access Journals (Sweden)

    Magnus Elfwing

    Full Text Available Analysis of selective sweeps to pinpoint causative genomic regions involved in chicken domestication has revealed a strong selective sweep on chromosome 4 in layer chickens. The autoregulatory α-adrenergic receptor 2C (ADRA2C gene is the closest to the selective sweep and was proposed as an important gene in the domestication of layer chickens. The ADRA2C promoter region was also hypermethylated in comparison to the non-selected ancestor of all domesticated chicken breeds, the Red Junglefowl, further supporting its relevance. In mice the receptor is involved in the fight-or-flight response as it modulates epinephrine release from the adrenals. To investigate the involvement of ADRA2C in chicken domestication, we measured gene expression in the adrenals and radiolabeled receptor ligand in three brain regions comparing the domestic White Leghorn strain with the wild ancestor Red Junglefowl. In adrenals ADRA2C was twofold greater expressed than the related receptor gene ADRA2A, indicating that ADRA2C is the predominant modulator of epinephrine release but no strain differences were measured. In hypothalamus and amygdala, regions associated with the stress response, and in striatum, receptor binding pIC50 values ranged between 8.1-8.4, and the level was not influenced by the genotyped allele. Because chicken strains differ in morphology, physiology and behavior, differences attributed to a single gene may be lost in the noise caused by the heterogeneous genetic background. Therefore an F10 advanced intercross strain between White Leghorn and Red Junglefowl was used to investigate effects of ADRA2C alleles on fear related behaviors and fecundity. We did not find compelling genotype effects in open field, tonic immobility, aerial predator, associative learning or fecundity. Therefore we conclude that ADRA2C is probably not involved in the domestication of the stress response in chicken, and the strong selective sweep is probably caused by selection

  3. Identification of learning and memory genes in canine; promoter investigation and determining the selective pressure.

    Science.gov (United States)

    Seifi Moroudi, Reihane; Masoudi, Ali Akbar; Vaez Torshizi, Rasoul; Zandi, Mohammad

    2014-12-01

    One of the important behaviors of dogs is trainability which is affected by learning and memory genes. These kinds of the genes have not yet been identified in dogs. In the current research, these genes were found in animal models by mining the biological data and scientific literatures. The proteins of these genes were obtained from the UniProt database in dogs and humans. Not all homologous proteins perform similar functions, thus comparison of these proteins was studied in terms of protein families, domains, biological processes, molecular functions, and cellular location of metabolic pathways in Interpro, KEGG, Quick Go and Psort databases. The results showed that some of these proteins have the same performance in the rat or mouse, dog, and human. It is anticipated that the protein of these genes may be effective in learning and memory in dogs. Then, the expression pattern of the recognized genes was investigated in the dog hippocampus using the existing information in the GEO profile. The results showed that BDNF, TAC1 and CCK genes are expressed in the dog hippocampus, therefore, these genes could be strong candidates associated with learning and memory in dogs. Subsequently, due to the importance of the promoter regions in gene function, this region was investigated in the above genes. Analysis of the promoter indicated that the HNF-4 site of BDNF gene and the transcription start site of CCK gene is exposed to methylation. Phylogenetic analysis of protein sequences of these genes showed high similarity in each of these three genes among the studied species. The dN/dS ratio for BDNF, TAC1 and CCK genes indicates a purifying selection during the evolution of the genes.

  4. Ferritin gene organization: differences between plants and animals suggest possible kingdom-specific selective constraints.

    Science.gov (United States)

    Proudhon, D; Wei, J; Briat, J; Theil, E C

    1996-03-01

    exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.

  5. Degrees of separation as a statistical tool for evaluating candidate genes.

    Science.gov (United States)

    Nelson, Ronald M; Pettersson, Mats E

    2014-12-01

    Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

    Science.gov (United States)

    Hoffmann, Robert D; Palmgren, Michael

    2016-06-13

    Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.

  7. The impacts of drift and selection on genomic evolution in insects

    Directory of Open Access Journals (Sweden)

    K. Jun Tong

    2017-04-01

    Full Text Available Genomes evolve through a combination of mutation, drift, and selection, all of which act heterogeneously across genes and lineages. This leads to differences in branch-length patterns among gene trees. Genes that yield trees with the same branch-length patterns can be grouped together into clusters. Here, we propose a novel phylogenetic approach to explain the factors that influence the number and distribution of these gene-tree clusters. We apply our method to a genomic dataset from insects, an ancient and diverse group of organisms. We find some evidence that when drift is the dominant evolutionary process, each cluster tends to contain a large number of fast-evolving genes. In contrast, strong negative selection leads to many distinct clusters, each of which contains only a few slow-evolving genes. Our work, although preliminary in nature, illustrates the use of phylogenetic methods to shed light on the factors driving rate variation in genomic evolution.

  8. An Intelligent Method of Product Scheme Design Based on Product Gene

    Directory of Open Access Journals (Sweden)

    Qing Song Ai

    2013-01-01

    Full Text Available Nowadays, in order to have some featured products, many customers tend to buy customized products instead of buying common ones in supermarket. The manufacturing enterprises, with the purpose of improving their competitiveness, are focusing on providing customized products with high quality and low cost as well. At present, how to produce customized products rapidly and cheaply has been the key challenge to manufacturing enterprises. In this paper, an intelligent modeling approach applied to supporting the modeling of customized products is proposed, which may improve the efficiency during the product design process. Specifically, the product gene (PG method, which is an analogy of biological evolution in engineering area, is employed to model products in a new way. Based on product gene, we focus on the intelligent modeling method to generate product schemes rapidly and automatically. The process of our research includes three steps: (1 develop a product gene model for customized products; (2 find the obtainment and storage method for product gene; and (3 propose a specific genetic algorithm used for calculating the solution of customized product and generating new product schemes. Finally, a case study is applied to test the usefulness of our study.

  9. Factors of Selection of the Stock Allocation Method

    Directory of Open Access Journals (Sweden)

    Rohov Heorhii K.

    2014-03-01

    Full Text Available The article describes results of the author’s study of factors of making strategic decisions on selection of methods of stock allocation by public joint stock companies in Ukraine. The author used the Random forest mathematical apparatus of classification trees building and also informal methods. The article analyses the reasons that restrain public allocation of stock. It shows significant influence upon selection of a method of stock allocation of such factors as capital concentration, balance rate of corporate rights, sector of economy and significant participation of the institutes of common investment or the state in the authorised capital. The built hierarchical model of classification of factors of the issuing policy of joint stock companies finds logical justification in specific features of the institutional environment, however, it does not fit into the framework of the classical concept of the market economy. The model could be used both for formation of goals of corporate financial strategies and in the process of improvement of state regulation of activity of securities issuers. The prospect of further studies in this direction is identification of transformation of factors of selection of the stock allocation method under conditions of revival of the stock market.

  10. Proactive AP Selection Method Considering the Radio Interference Environment

    Science.gov (United States)

    Taenaka, Yuzo; Kashihara, Shigeru; Tsukamoto, Kazuya; Yamaguchi, Suguru; Oie, Yuji

    In the near future, wireless local area networks (WLANs) will overlap to provide continuous coverage over a wide area. In such ubiquitous WLANs, a mobile node (MN) moving freely between multiple access points (APs) requires not only permanent access to the Internet but also continuous communication quality during handover. In order to satisfy these requirements, an MN needs to (1) select an AP with better performance and (2) execute a handover seamlessly. To satisfy requirement (2), we proposed a seamless handover method in a previous study. Moreover, in order to achieve (1), the Received Signal Strength Indicator (RSSI) is usually employed to measure wireless link quality in a WLAN system. However, in a real environment, especially if APs are densely situated, it is difficult to always select an AP with better performance based on only the RSSI. This is because the RSSI alone cannot detect the degradation of communication quality due to radio interference. Moreover, it is important that AP selection is completed only on an MN, because we can assume that, in ubiquitous WLANs, various organizations or operators will manage APs. Hence, we cannot modify the APs for AP selection. To overcome these difficulties, in the present paper, we propose and implement a proactive AP selection method considering wireless link condition based on the number of frame retransmissions in addition to the RSSI. In the evaluation, we show that the proposed AP selection method can appropriately select an AP with good wireless link quality, i.e., high RSSI and low radio interference.

  11. Genome-Wide Specific Selection in Three Domestic Sheep Breeds.

    Directory of Open Access Journals (Sweden)

    Huihua Wang

    Full Text Available Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed.We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality and EDAR (associated with hair thickness were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9 were associated with pre-weaning gain in our previous genome-wide association study.Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.

  12. Genome-Wide Specific Selection in Three Domestic Sheep Breeds.

    Science.gov (United States)

    Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.

  13. Selection method of terrain matching area for TERCOM algorithm

    Science.gov (United States)

    Zhang, Qieqie; Zhao, Long

    2017-10-01

    The performance of terrain aided navigation is closely related to the selection of terrain matching area. The different matching algorithms have different adaptability to terrain. This paper mainly studies the adaptability to terrain of TERCOM algorithm, analyze the relation between terrain feature and terrain characteristic parameters by qualitative and quantitative methods, and then research the relation between matching probability and terrain characteristic parameters by the Monte Carlo method. After that, we propose a selection method of terrain matching area for TERCOM algorithm, and verify the method correctness with real terrain data by simulation experiment. Experimental results show that the matching area obtained by the method in this paper has the good navigation performance and the matching probability of TERCOM algorithm is great than 90%

  14. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

    Science.gov (United States)

    Zhou, Xionghui; Liu, Juan

    2014-01-01

    Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for

  15. Extreme MHC class I diversity in the sedge warbler (Acrocephalus schoenobaenus); selection patterns and allelic divergence suggest that different genes have different functions.

    Science.gov (United States)

    Biedrzycka, Aleksandra; O'Connor, Emily; Sebastian, Alvaro; Migalska, Magdalena; Radwan, Jacek; Zając, Tadeusz; Bielański, Wojciech; Solarz, Wojciech; Ćmiel, Adam; Westerdahl, Helena

    2017-07-05

    Recent work suggests that gene duplications may play an important role in the evolution of immunity genes. Passerine birds, and in particular Sylvioidea warblers, have highly duplicated major histocompatibility complex (MHC) genes, which are key in immunity, compared to other vertebrates. However, reasons for this high MHC gene copy number are yet unclear. High-throughput sequencing (HTS) allows MHC genotyping even in individuals with extremely duplicated genes. This HTS data can reveal evidence of selection, which may help to unravel the putative functions of different gene copies, i.e. neofunctionalization. We performed exhaustive genotyping of MHC class I in a Sylvioidea warbler, the sedge warbler, Acrocephalus schoenobaenus, using the Illumina MiSeq technique on individuals from a wild study population. The MHC diversity in 863 genotyped individuals by far exceeds that of any other bird species described to date. A single individual could carry up to 65 different alleles, a large proportion of which are expressed (transcribed). The MHC alleles were of three different lengths differing in evidence of selection, diversity and divergence within our study population. Alleles without any deletions and alleles containing a 6 bp deletion showed characteristics of classical MHC genes, with evidence of multiple sites subject to positive selection and high sequence divergence. In contrast, alleles containing a 3 bp deletion had no sites subject to positive selection and had low divergence. Our results suggest that sedge warbler MHC alleles that either have no deletion, or contain a 6 bp deletion, encode classical antigen presenting MHC molecules. In contrast, MHC alleles containing a 3 bp deletion may encode molecules with a different function. This study demonstrates that highly duplicated MHC genes can be characterised with HTS and that selection patterns can be useful for revealing neofunctionalization. Importantly, our results highlight the need to consider the

  16. Evaluating codon bias perspective in barbiturase gene using ...

    African Journals Online (AJOL)

    Abdullah

    2014-01-08

    Jan 8, 2014 ... along with codon usage was done to reveal dynamics of gene evolution and expression ... analysis is a potent approach for detecting mutations, selection methods and finding rationale of biased and unbiased gene changes and hence, evolutionary ... in the perception of the molecular basics plus potential.

  17. Spatial and temporal variation in selection of genes associated with pearl millet varietal quantitative traits in situ

    Directory of Open Access Journals (Sweden)

    Cedric Mariac

    2016-07-01

    Full Text Available Ongoing global climate changes imply new challenges for agriculture. Whether plants and crops can adapt to such rapid changes is still a widely debated question. We previously showed adaptation in the form of earlier flowering in pearl millet at the scale of a whole country over three decades. However, this analysis did not deal with variability of year to year selection. To understand and possibly manage plant and crop adaptation, we need more knowledge of how selection acts in situ. Is selection gradual, abrupt, and does it vary in space and over time? In the present study, we tracked the evolution of allele frequency in two genes associated with pearl millet phenotypic variation in situ. We sampled 17 populations of cultivated pearl millet over a period of two years. We tracked changes in allele frequencies in these populations by genotyping more than seven thousand individuals. We demonstrate that several allele frequencies changes are compatible with selection, by correcting allele frequency changes associated with genetic drift. We found marked variation in allele frequencies from year to year, suggesting a variable selection effect in space and over time. We estimated the strength of selection associated with variations in allele frequency. Our results suggest that the polymorphism maintained at the genes we studied is partially explained by the spatial and temporal variability of selection. In response to environmental changes, traditional pearl millet varieties could rapidly adapt thanks to this available functional variability.

  18. Selection of reference genes in different myocardial regions of an in vivo ischemia/reperfusion rat model for normalization of antioxidant gene expression

    Directory of Open Access Journals (Sweden)

    Vesentini Nicoletta

    2012-02-01

    Full Text Available Abstract Background Changes in cardiac gene expression due to myocardial injury are usually assessed in whole heart tissue. However, as the heart is a heterogeneous system, spatial and temporal heterogeneity is expected in gene expression. Results In an ischemia/reperfusion (I/R rat model we evaluated gene expression of mitochondrial and cytoplasmatic superoxide dismutase (MnSod, Cu-ZnSod and thioredoxin reductase (trxr1 upon short (4 h and long (72 h reperfusion times in the right ventricle (RV, and in the ischemic/reperfused (IRR and the remote region (RR of the left ventricle. Gene expression was assessed by Real-time reverse-transcription quantitative PCR (RT-qPCR. In order to select most stable reference genes suitable for normalization purposes, in each myocardial region we tested nine putative reference genes by geNorm analysis. The genes investigated were: Actin beta (actb, Glyceraldehyde-3-P-dehydrogenase (gapdh, Ribosomal protein L13A (rpl13a, Tyrosine 3-monooxygenase (ywhaz, Beta-glucuronidase (gusb, Hypoxanthine guanine Phosphoribosyltransferase 1 (hprt, TATA binding box protein (tbp, Hydroxymethylbilane synthase (hmbs, Polyadenylate-binding protein 1 (papbn1. According to our findings, most stable reference genes in the RV and RR were hmbs/hprt and hmbs/tbp/hprt respectively. In the IRR, six reference genes were recommended for normalization purposes; however, in view of experimental feasibility limitations, target gene expression could be normalized against the three most stable reference genes (ywhaz/pabp/hmbs without loss of sensitivity. In all cases MnSod and Cu-ZnSod expression decreased upon long reperfusion, the former in all myocardial regions and the latter in IRR alone. trxr1 expression did not vary. Conclusions This study provides a validation of reference genes in the RV and in the anterior and posterior wall of the LV of cardiac ischemia/reperfusion model and shows that gene expression should be assessed separately in

  19. The experiments and analysis of several selective video encryption methods

    Science.gov (United States)

    Zhang, Yue; Yang, Cheng; Wang, Lei

    2013-07-01

    This paper presents four methods for selective video encryption based on the MPEG-2 video compression,including the slices, the I-frames, the motion vectors, and the DCT coefficients. We use the AES encryption method for simulation experiment for the four methods on VS2010 Platform, and compare the video effects and the processing speed of each frame after the video encrypted. The encryption depth can be arbitrarily selected, and design the encryption depth by using the double limit counting method, so the accuracy can be increased.

  20. A model-based approach for identifying signatures of ancient balancing selection in genetic data.

    Science.gov (United States)

    DeGiorgio, Michael; Lohmueller, Kirk E; Nielsen, Rasmus

    2014-08-01

    While much effort has focused on detecting positive and negative directional selection in the human genome, relatively little work has been devoted to balancing selection. This lack of attention is likely due to the paucity of sophisticated methods for identifying sites under balancing selection. Here we develop two composite likelihood ratio tests for detecting balancing selection. Using simulations, we show that these methods outperform competing methods under a variety of assumptions and demographic models. We apply the new methods to whole-genome human data, and find a number of previously-identified loci with strong evidence of balancing selection, including several HLA genes. Additionally, we find evidence for many novel candidates, the strongest of which is FANK1, an imprinted gene that suppresses apoptosis, is expressed during meiosis in males, and displays marginal signs of segregation distortion. We hypothesize that balancing selection acts on this locus to stabilize the segregation distortion and negative fitness effects of the distorter allele. Thus, our methods are able to reproduce many previously-hypothesized signals of balancing selection, as well as discover novel interesting candidates.

  1. A model-based approach for identifying signatures of ancient balancing selection in genetic data.

    Directory of Open Access Journals (Sweden)

    Michael DeGiorgio

    2014-08-01

    Full Text Available While much effort has focused on detecting positive and negative directional selection in the human genome, relatively little work has been devoted to balancing selection. This lack of attention is likely due to the paucity of sophisticated methods for identifying sites under balancing selection. Here we develop two composite likelihood ratio tests for detecting balancing selection. Using simulations, we show that these methods outperform competing methods under a variety of assumptions and demographic models. We apply the new methods to whole-genome human data, and find a number of previously-identified loci with strong evidence of balancing selection, including several HLA genes. Additionally, we find evidence for many novel candidates, the strongest of which is FANK1, an imprinted gene that suppresses apoptosis, is expressed during meiosis in males, and displays marginal signs of segregation distortion. We hypothesize that balancing selection acts on this locus to stabilize the segregation distortion and negative fitness effects of the distorter allele. Thus, our methods are able to reproduce many previously-hypothesized signals of balancing selection, as well as discover novel interesting candidates.

  2. Production of recombinant Ig molecules from antigen-selected single B cells and restricted usage of Ig-gene segments by anti-D antibodies

    NARCIS (Netherlands)

    Dohmen, Serge E.; Mulder, Arend; Verhagen, Onno J. H. M.; Eijsink, Chantal; Franke-van Dijk, Marry E. I.; van der Schoot, C. Ellen

    2005-01-01

    The Ig-genes of the heavy chains in anti-D-specific hybridomas and Fab/scFv-fragments selected from phage-display libraries are restricted to a group of closely related genes (IGHV3s genes). We analyzed the Ig-gene repertoire in anti-D-specific B cells of two hyperimmunized donors using a completely

  3. Quantitative Methods for Software Selection and Evaluation

    National Research Council Canada - National Science Library

    Bandor, Michael S

    2006-01-01

    ... (the ability of the product to meet the need) and the cost. The method used for the analysis and selection activities can range from the use of basic intuition to counting the number of requirements fulfilled, or something...

  4. The effects of recombination, mutation and selection on the evolution of the Rp1 resistance genes in grasses.

    Science.gov (United States)

    Jouet, Agathe; McMullan, Mark; van Oosterhout, Cock

    2015-06-01

    Plant immune genes, or resistance genes, are involved in a co-evolutionary arms race with a diverse range of pathogens. In agronomically important grasses, such R genes have been extensively studied because of their role in pathogen resistance and in the breeding of resistant cultivars. In this study, we evaluate the importance of recombination, mutation and selection on the evolution of the R gene complex Rp1 of Sorghum, Triticum, Brachypodium, Oryza and Zea. Analyses show that recombination is widespread, and we detected 73 independent instances of sequence exchange, involving on average 1567 of 4692 nucleotides analysed (33.4%). We were able to date 24 interspecific recombination events and found that four occurred postspeciation, which suggests that genetic introgression took place between different grass species. Other interspecific events seemed to have been maintained over long evolutionary time, suggesting the presence of balancing selection. Significant positive selection (i.e. a relative excess of nonsynonymous substitutions (dN /dS >1)) was detected in 17-95 codons (0.42-2.02%). Recombination was significantly associated with areas with high levels of polymorphism but not with an elevated dN /dS ratio. Finally, phylogenetic analyses show that recombination results in a general overestimation of the divergence time (mean = 14.3%) and an alteration of the gene tree topology if the tree is not calibrated. Given that the statistical power to detect recombination is determined by the level of polymorphism of the amplicon as well as the number of sequences analysed, it is likely that many studies have underestimated the importance of recombination relative to the mutation rate. © 2015 John Wiley & Sons Ltd.

  5. [Establisment of a genetic transformation method of coffee (Coffea arabica cv. Catimor) and incorporation of bar gene for ammonium glufosinate resistance].

    Science.gov (United States)

    Fernández Da Silva, Rafael

    2003-01-01

    In order to establish a successful method of genetic transformation in Coffea arabica cv. Catimor, different conditions of generation and electroporation were evaluated on different plant tissues. Cell suspension system was improved using one hormone only (BA), obtaining high yields of primary and secondary somatic embryo production. For selection of viable and potentially transformed cells, MTT (1%) method and ammonium glufosinate concentration (1 mg/L in leaf, callus and embryos; and 5 mg/L in cells) were established. Different conditions were evaluated to electroporate different explants (embryogenic callus, vitroplants leaves, globular and torpedo embryos). The highest gus gene expression percentage by explant were found on enzymatic treated tissues at 375 V/cm in callus, and at 625 V/cm in leaves and embryos. Torpedo embryos cultured on liquid medium were the only type of tissue that could regenerate into plants, where secondary somatic embryos were obtained. Those embryos were positive to the gus gene histochemical test and to the gus and bar genes amplification on a PCR reaction.

  6. Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods

    Science.gov (United States)

    Väremo, Leif; Nielsen, Jens; Nookaew, Intawat

    2013-01-01

    Gene set analysis (GSA) is used to elucidate genome-wide data, in particular transcriptome data. A multitude of methods have been proposed for this step of the analysis, and many of them have been compared and evaluated. Unfortunately, there is no consolidated opinion regarding what methods should be preferred, and the variety of available GSA software and implementations pose a difficulty for the end-user who wants to try out different methods. To address this, we have developed the R package Piano that collects a range of GSA methods into the same system, for the benefit of the end-user. Further on we refine the GSA workflow by using modifications of the gene-level statistics. This enables us to divide the resulting gene set P-values into three classes, describing different aspects of gene expression directionality at gene set level. We use our fully implemented workflow to investigate the impact of the individual components of GSA by using microarray and RNA-seq data. The results show that the evaluated methods are globally similar and the major separation correlates well with our defined directionality classes. As a consequence of this, we suggest to use a consensus scoring approach, based on multiple GSA runs. In combination with the directionality classes, this constitutes a more thorough basis for an enriched biological interpretation. PMID:23444143

  7. Analysis of the robustness of network-based disease-gene prioritization methods reveals redundancy in the human interactome and functional diversity of disease-genes.

    Directory of Open Access Journals (Sweden)

    Emre Guney

    Full Text Available Complex biological systems usually pose a trade-off between robustness and fragility where a small number of perturbations can substantially disrupt the system. Although biological systems are robust against changes in many external and internal conditions, even a single mutation can perturb the system substantially, giving rise to a pathophenotype. Recent advances in identifying and analyzing the sequential variations beneath human disorders help to comprehend a systemic view of the mechanisms underlying various disease phenotypes. Network-based disease-gene prioritization methods rank the relevance of genes in a disease under the hypothesis that genes whose proteins interact with each other tend to exhibit similar phenotypes. In this study, we have tested the robustness of several network-based disease-gene prioritization methods with respect to the perturbations of the system using various disease phenotypes from the Online Mendelian Inheritance in Man database. These perturbations have been introduced either in the protein-protein interaction network or in the set of known disease-gene associations. As the network-based disease-gene prioritization methods are based on the connectivity between known disease-gene associations, we have further used these methods to categorize the pathophenotypes with respect to the recoverability of hidden disease-genes. Our results have suggested that, in general, disease-genes are connected through multiple paths in the human interactome. Moreover, even when these paths are disturbed, network-based prioritization can reveal hidden disease-gene associations in some pathophenotypes such as breast cancer, cardiomyopathy, diabetes, leukemia, parkinson disease and obesity to a greater extend compared to the rest of the pathophenotypes tested in this study. Gene Ontology (GO analysis highlighted the role of functional diversity for such diseases.

  8. Targets of balancing selection in the human genome

    DEFF Research Database (Denmark)

    Andrés, Aida M; Hubisz, Melissa J; Indap, Amit

    2009-01-01

    Balancing selection is potentially an important biological force for maintaining advantageous genetic diversity in populations, including variation that is responsible for long-term adaptation to the environment. By serving as a means to maintain genetic variation, it may be particularly relevant...... to maintaining phenotypic variation in natural populations. Nevertheless, its prevalence and specific targets in the human genome remain largely unknown. We have analyzed the patterns of diversity and divergence of 13,400 genes in two human populations using an unbiased single-nucleotide polymorphism data set......, a genome-wide approach, and a method that incorporates demography in neutrality tests. We identified an unbiased catalog of genes with signatures of long-term balancing selection, which includes immunity genes as well as genes encoding keratins and membrane channels; the catalog also shows enrichment...

  9. Selective sweep on human amylase genes postdates the split with Neanderthals

    Science.gov (United States)

    Inchley, Charlotte E.; Larbey, Cynthia D. A.; Shwan, Nzar A. A.; Pagani, Luca; Saag, Lauri; Antão, Tiago; Jacobs, Guy; Hudjashov, Georgi; Metspalu, Ene; Mitt, Mario; Eichstaedt, Christina A.; Malyarchuk, Boris; Derenko, Miroslava; Wee, Joseph; Abdullah, Syafiq; Ricaut, François-Xavier; Mormina, Maru; Mägi, Reedik; Villems, Richard; Metspalu, Mait; Jones, Martin K.; Armour, John A. L.; Kivisild, Toomas

    2016-01-01

    Humans have more copies of amylase genes than other primates. It is still poorly understood, however, when the copy number expansion occurred and whether its spread was enhanced by selection. Here we assess amylase copy numbers in a global sample of 480 high coverage genomes and find that regions flanking the amylase locus show notable depression of genetic diversity both in African and non-African populations. Analysis of genetic variation in these regions supports the model of an early selective sweep in the human lineage after the split of humans from Neanderthals which led to the fixation of multiple copies of AMY1 in place of a single copy. We find evidence of multiple secondary losses of copy number with the highest frequency (52%) of a deletion of AMY2A and associated low copy number of AMY1 in Northeast Siberian populations whose diet has been low in starch content. PMID:27853181

  10. Evaluation of biolistic gene transfer methods in vivo using non-invasive bioluminescent imaging techniques

    Directory of Open Access Journals (Sweden)

    Daniell Henry

    2011-06-01

    Full Text Available Abstract Background Gene therapy continues to hold great potential for treating many different types of disease and dysfunction. Safe and efficient techniques for gene transfer and expression in vivo are needed to enable gene therapeutic strategies to be effective in patients. Currently, the most commonly used methods employ replication-defective viral vectors for gene transfer, while physical gene transfer methods such as biolistic-mediated ("gene-gun" delivery to target tissues have not been as extensively explored. In the present study, we evaluated the efficacy of biolistic gene transfer techniques in vivo using non-invasive bioluminescent imaging (BLI methods. Results Plasmid DNA carrying the firefly luciferase (LUC reporter gene under the control of the human Cytomegalovirus (CMV promoter/enhancer was transfected into mouse skin and liver using biolistic methods. The plasmids were coupled to gold microspheres (1 μm diameter using different DNA Loading Ratios (DLRs, and "shot" into target tissues using a helium-driven gene gun. The optimal DLR was found to be in the range of 4-10. Bioluminescence was measured using an In Vivo Imaging System (IVIS-50 at various time-points following transfer. Biolistic gene transfer to mouse skin produced peak reporter gene expression one day after transfer. Expression remained detectable through four days, but declined to undetectable levels by six days following gene transfer. Maximum depth of tissue penetration following biolistic transfer to abdominal skin was 200-300 μm. Similarly, biolistic gene transfer to mouse liver in vivo also produced peak early expression followed by a decline over time. In contrast to skin, however, liver expression of the reporter gene was relatively stable 4-8 days post-biolistic gene transfer, and remained detectable for nearly two weeks. Conclusions The use of bioluminescence imaging techniques enabled efficient evaluation of reporter gene expression in vivo. Our results

  11. Fuzzy C-means method for clustering microarray data.

    Science.gov (United States)

    Dembélé, Doulaye; Kastner, Philippe

    2003-05-22

    Clustering analysis of data from DNA microarray hybridization studies is essential for identifying biologically relevant groups of genes. Partitional clustering methods such as K-means or self-organizing maps assign each gene to a single cluster. However, these methods do not provide information about the influence of a given gene for the overall shape of clusters. Here we apply a fuzzy partitioning method, Fuzzy C-means (FCM), to attribute cluster membership values to genes. A major problem in applying the FCM method for clustering microarray data is the choice of the fuzziness parameter m. We show that the commonly used value m = 2 is not appropriate for some data sets, and that optimal values for m vary widely from one data set to another. We propose an empirical method, based on the distribution of distances between genes in a given data set, to determine an adequate value for m. By setting threshold levels for the membership values, genes which are tigthly associated to a given cluster can be selected. Using a yeast cell cycle data set as an example, we show that this selection increases the overall biological significance of the genes within the cluster. Supplementary text and Matlab functions are available at http://www-igbmc.u-strasbg.fr/fcm/

  12. Genomic signatures of local directional selection in a high gene flow marine organism, the Atlantic cod (Gadus morhua)

    DEFF Research Database (Denmark)

    Eg Nielsen, Einar; Hansen, Jakob Hemmer; Poulsen, Nina Aagaard

    2009-01-01

    -associated single nucleotide polymorphisms (SNPs) for evidence of selection in local populations of Atlantic cod (Gadus morhua L.) across the species distribution. Results: Our global genome scan analysis identified eight outlier gene loci with very high statistical support, likely to be subject to directional...... selection in local demes, or closely linked to loci under selection. Likewise, on a regional south/north transect of central and eastern Atlantic populations, seven loci displayed strongly elevated levels of genetic differentiation. Selection patterns among populations appeared to be relatively widespread...

  13. Unraveling the effects of selection and demography on immune gene variation in free-ranging plains zebra (Equus quagga) populations.

    Science.gov (United States)

    Kamath, Pauline L; Getz, Wayne M

    2012-01-01

    Demography, migration and natural selection are predominant processes affecting the distribution of genetic variation among natural populations. Many studies use neutral genetic markers to make inferences about population history. However, the investigation of functional coding loci, which directly reflect fitness, is critical to our understanding of species' ecology and evolution. Immune genes, such as those of the Major Histocompatibility Complex (MHC), play an important role in pathogen recognition and provide a potent model system for studying selection. We contrasted diversity patterns of neutral data with MHC loci, ELA-DRA and -DQA, in two southern African plains zebra (Equus quagga) populations: Etosha National Park, Namibia, and Kruger National Park, South Africa. Results from neutrality tests, along with observations of elevated diversity and low differentiation across populations, supported previous genus-level evidence for balancing selection at these loci. Despite being low, MHC divergence across populations was significant and may be attributed to drift effects typical of geographically separated populations experiencing little to no gene flow, or alternatively to shifting allele frequency distributions driven by spatially variable and fluctuating pathogen communities. At the DRA, zebra exhibited geographic differentiation concordant with microsatellites and reduced levels of diversity in Etosha due to highly skewed allele frequencies that could not be explained by demography, suggestive of spatially heterogeneous selection and local adaptation. This study highlights the complexity in which selection affects immune gene diversity and warrants the need for further research on the ecological mechanisms shaping patterns of adaptive variation among natural populations.

  14. Whole genome detection of signature of positive selection in African cattle reveals selection for thermotolerance.

    Science.gov (United States)

    Taye, Mengistie; Lee, Wonseok; Caetano-Anolles, Kelsey; Dessie, Tadelle; Hanotte, Olivier; Mwai, Okeyo Ally; Kemp, Stephen; Cho, Seoae; Oh, Sung Jong; Lee, Hak-Kyo; Kim, Heebal

    2017-12-01

    As African indigenous cattle evolved in a hot tropical climate, they have developed an inherent thermotolerance; survival mechanisms include a light-colored and shiny coat, increased sweating, and cellular and molecular mechanisms to cope with high environmental temperature. Here, we report the positive selection signature of genes in African cattle breeds which contribute for their heat tolerance mechanisms. We compared the genomes of five indigenous African cattle breeds with the genomes of four commercial cattle breeds using cross-population composite likelihood ratio (XP-CLR) and cross-population extended haplotype homozygosity (XP-EHH) statistical methods. We identified 296 (XP-EHH) and 327 (XP-CLR) positively selected genes. Gene ontology analysis resulted in 41 biological process terms and six Kyoto Encyclopedia of Genes and Genomes pathways. Several genes and pathways were found to be involved in oxidative stress response, osmotic stress response, heat shock response, hair and skin properties, sweat gland development and sweating, feed intake and metabolism, and reproduction functions. The genes and pathways identified directly or indirectly contribute to the superior heat tolerance mechanisms in African cattle populations. The result will improve our understanding of the biological mechanisms of heat tolerance in African cattle breeds and opens an avenue for further study. © 2017 Japanese Society of Animal Science.

  15. Partial Sequencing of 16S rRNA Gene of Selected Staphylococcus aureus Isolates and its Antibiotic Resistance

    Directory of Open Access Journals (Sweden)

    Harsi Dewantari Kusumaningrum

    2016-08-01

    Full Text Available The choice of primer used in 16S rRNA sequencing for identification of Staphylococcus species found in food is important. This study aimed to characterize Staphylococcus aureus isolates by partial sequencing based on 16S rRNA gene employing primers 16sF, 63F or 1387R. The isolates were isolated from milk, egg dishes and chicken dishes and selected based on the presence of sea gene that responsible for formation of enterotoxin-A. Antibiotic susceptibility of the isolates towards six antibiotics was also tested. The use of 16sF resulted generally in higher identity percentage and query coverage compared to the sequencing by 63F or 1387R. BLAST results of all isolates, sequenced by 16sF, showed 99% homology to complete genome of four S. aureus strains, with different characteristics on enterotoxin production and antibiotic resistance. Considering that all isolates were carrying sea gene, indicated by the occurence of 120 bp amplicon after PCR amplification using primer SEA1/SEA2,  the isolates were most in agreeing to S. aureus subsp. aureus ST288. This study indicated that 4 out of 8 selected isolates were resistant towards streptomycin. The 16S rRNA gene sequencing using 16sF is useful for identification of S. aureus. However, additional analysis such as PCR employing specific gene target, should give a valuable supplementary information, when specific characteristic is expected.

  16. A simple and robust method for connecting small-molecule drugs using gene-expression signatures

    Directory of Open Access Journals (Sweden)

    Gant Timothy W

    2008-06-01

    Full Text Available Abstract Background Interaction of a drug or chemical with a biological system can result in a gene-expression profile or signature characteristic of the event. Using a suitably robust algorithm these signatures can potentially be used to connect molecules with similar pharmacological or toxicological properties by gene expression profile. Lamb et al first proposed the Connectivity Map [Lamb et al (2006, Science 313, 1929–1935] to make successful connections among small molecules, genes, and diseases using genomic signatures. Results Here we have built on the principles of the Connectivity Map to present a simpler and more robust method for the construction of reference gene-expression profiles and for the connection scoring scheme, which importantly allows the valuation of statistical significance of all the connections observed. We tested the new method with two randomly generated gene signatures and three experimentally derived gene signatures (for HDAC inhibitors, estrogens, and immunosuppressive drugs, respectively. Our testing with this method indicates that it achieves a higher level of specificity and sensitivity and so advances the original method. Conclusion The method presented here not only offers more principled statistical procedures for testing connections, but more importantly it provides effective safeguard against false connections at the same time achieving increased sensitivity. With its robust performance, the method has potential use in the drug development pipeline for the early recognition of pharmacological and toxicological properties in chemicals and new drug candidates, and also more broadly in other 'omics sciences.

  17. Selective in vivo radiosensitization by 5-fluorocytosine of human colorectal carcinoma cells transduced with the E. coli cytosine deaminase (CD) gene

    International Nuclear Information System (INIS)

    Gabel, M.; Kim, J.H.; Kolozsvary, A.; Khil, M.; Freytag, S.

    1998-01-01

    Purpose: The E. coli cytosine deaminase (CD) gene encodes an enzyme capable of converting the nontoxic prodrug 5-fluorocytosine (5-FC) to 5-fluorouracil (5-FU), a known radiosensitizer. Having previously shown that combined CD suicide gene therapy and radiation (RT) results in pronounced radiosensitization in vitro, we progressed to in vivo studies of combined therapy. Methods and Materials: WiDr human colon cancer cells were transduced in vitro with the CD gene and cells expressing CD were selected for use as xenografts in a nude mouse model. After administration of 5-FC, tumors received 10-30 Gy local field radiation (RT) and tumor growth delay was compared to control animals receiving either 5-FU, 5-FC, or RT alone. Results: Maximal growth delay was seen in mice treated with 5-FC for 6 consecutive days prior to RT. Combined treatment with 15 Gy radiation resulted in a dose-modifying factor (DMF) of 1.50, and a greater DMF was observed with higher doses of radiation. There was no appreciable toxicity using this new approach. In contrast, a similar treatment of combined 5-FU and radiation resulted in considerable toxicity and no appreciable radiosensitization. Conclusion: The present results show that combined suicide gene therapy and RT results in pronounced antitumor effect without any notable toxicity. This indicates that the CD gene may be useful in the development of novel treatment strategies combining radiation and gene therapy in the treatment of locally advanced cancers

  18. Determination of Selection Method in Genetic Algorithm for Land Suitability

    Directory of Open Access Journals (Sweden)

    Irfianti Asti Dwi

    2016-01-01

    Full Text Available Genetic Algoirthm is one alternative solution in the field of modeling optimization, automatic programming and machine learning. The purpose of the study was to compare some type of selection methods in Genetic Algorithm for land suitability. Contribution of this research applies the best method to develop region based horticultural commodities. This testing is done by comparing the three methods on the method of selection, the Roulette Wheel, Tournament Selection and Stochastic Universal Sampling. Parameters of the locations used in the test scenarios include Temperature = 27°C, Rainfall = 1200 mm, hummidity = 30%, Cluster fruit = 4, Crossover Probabiitiy (Pc = 0.6, Mutation Probabilty (Pm = 0.2 and Epoch = 10. The second test epoch incluides location parameters consist of Temperature = 30°C, Rainfall = 2000 mm, Humidity = 35%, Cluster fruit = 5, Crossover Probability (Pc = 0.7, Mutation Probability (Pm = 0.3 and Epoch 10. The conclusion of this study shows that the Roulette Wheel is the best method because it produces more stable and fitness value than the other two methods.

  19. Comparison of fuzzy AHP and fuzzy TODIM methods for landfill location selection.

    Science.gov (United States)

    Hanine, Mohamed; Boutkhoum, Omar; Tikniouine, Abdessadek; Agouti, Tarik

    2016-01-01

    Landfill location selection is a multi-criteria decision problem and has a strategic importance for many regions. The conventional methods for landfill location selection are insufficient in dealing with the vague or imprecise nature of linguistic assessment. To resolve this problem, fuzzy multi-criteria decision-making methods are proposed. The aim of this paper is to use fuzzy TODIM (the acronym for Interactive and Multi-criteria Decision Making in Portuguese) and the fuzzy analytic hierarchy process (AHP) methods for the selection of landfill location. The proposed methods have been applied to a landfill location selection problem in the region of Casablanca, Morocco. After determining the criteria affecting the landfill location decisions, fuzzy TODIM and fuzzy AHP methods are applied to the problem and results are presented. The comparisons of these two methods are also discussed.

  20. De novo clustering methods outperform reference-based methods for assigning 16S rRNA gene sequences to operational taxonomic units

    Directory of Open Access Journals (Sweden)

    Sarah L. Westcott

    2015-12-01

    Full Text Available Background. 16S rRNA gene sequences are routinely assigned to operational taxonomic units (OTUs that are then used to analyze complex microbial communities. A number of methods have been employed to carry out the assignment of 16S rRNA gene sequences to OTUs leading to confusion over which method is optimal. A recent study suggested that a clustering method should be selected based on its ability to generate stable OTU assignments that do not change as additional sequences are added to the dataset. In contrast, we contend that the quality of the OTU assignments, the ability of the method to properly represent the distances between the sequences, is more important.Methods. Our analysis implemented six de novo clustering algorithms including the single linkage, complete linkage, average linkage, abundance-based greedy clustering, distance-based greedy clustering, and Swarm and the open and closed-reference methods. Using two previously published datasets we used the Matthew’s Correlation Coefficient (MCC to assess the stability and quality of OTU assignments.Results. The stability of OTU assignments did not reflect the quality of the assignments. Depending on the dataset being analyzed, the average linkage and the distance and abundance-based greedy clustering methods generated OTUs that were more likely to represent the actual distances between sequences than the open and closed-reference methods. We also demonstrated that for the greedy algorithms VSEARCH produced assignments that were comparable to those produced by USEARCH making VSEARCH a viable free and open source alternative to USEARCH. Further interrogation of the reference-based methods indicated that when USEARCH or VSEARCH were used to identify the closest reference, the OTU assignments were sensitive to the order of the reference sequences because the reference sequences can be identical over the region being considered. More troubling was the observation that while both USEARCH and

  1. Polymorphisms of Selected DNA Repair Genes and Lung Cancer in Chromium Exposure.

    Science.gov (United States)

    Halasova, E; Matakova, T; Skerenova, M; Krutakova, M; Slovakova, P; Dzian, A; Javorkova, S; Pec, M; Kypusova, K; Hamzik, J

    2016-01-01

    Chromium is a well-known mutagen and carcinogen involved in lung cancer development. DNA repair genes play an important role in the elimination of genetic changes caused by chromium exposure. In the present study, we investigated the polymorphisms of the following DNA repair genes: XRCC3, participating in the homologous recombination repair, and hMLH1 and hMSH2, functioning in the mismatch repair. We focused on the risk the polymorphisms present in the development of lung cancer regarding the exposure to chromium. We analyzed 106 individuals; 45 patients exposed to chromium with diagnosed lung cancer and 61 healthy controls. Genotypes were determined by a PCR-RFLP method. We unravelled a potential for increased risk of lung cancer development in the hMLH1 (rs1800734) AA genotype in the recessive model. In conclusion, gene polymorphisms in the DNA repair genes underscores the risk of lung cancer development in chromium exposed individuals.

  2. Research on filter’s parameter selection based on PROMETHEE method

    Science.gov (United States)

    Zhu, Hui-min; Wang, Hang-yu; Sun, Shi-yan

    2018-03-01

    The selection of filter’s parameters in target recognition was studied in this paper. The PROMETHEE method was applied to the optimization problem of Gabor filter parameters decision, the correspondence model of the elemental relation between two methods was established. The author took the identification of military target as an example, problem about the filter’s parameter decision was simulated and calculated by PROMETHEE. The result showed that using PROMETHEE method for the selection of filter’s parameters was more scientific. The human disturbance caused by the experts method and empirical method could be avoided by this way. The method can provide reference for the parameter configuration scheme decision of the filter.

  3. Combining AHP and DEA Methods for Selecting a Project Manager

    Directory of Open Access Journals (Sweden)

    Baruch Keren

    2014-07-01

    Full Text Available A project manager has a major influence on the success or failure of the project. A good project manager can match between the strategy and objectives of the organization and the goals of the project. Therefore, the selection of the appropriate project manager is a key factor for the success of the project. A potential project manager is judged by his or her proven performance and personal qualifications. This paper proposes a method to calculate the weighted scores and the full rank of candidates for managing a project, and to select the best of those candidates. The proposed method combines specific methodologies: the Data Envelopment Analysis (DEA and the Analytical Hierarchical Process (AHP and uses DEA Ranking Methods to enhance selection.

  4. Geography, assortative mating, and the effects of sexual selection on speciation with gene flow.

    Science.gov (United States)

    Servedio, Maria R

    2016-01-01

    Theoretical and empirical research on the evolution of reproductive isolation have both indicated that the effects of sexual selection on speciation with gene flow are quite complex. As part of this special issue on the contributions of women to basic and applied evolutionary biology, I discuss my work on this question in the context of a broader assessment of the patterns of sexual selection that lead to, versus inhibit, the speciation process, as derived from theoretical research. In particular, I focus on how two factors, the geographic context of speciation and the mechanism leading to assortative mating, interact to alter the effect that sexual selection through mate choice has on speciation. I concentrate on two geographic contexts: sympatry and secondary contact between two geographically separated populations that are exchanging migrants and two mechanisms of assortative mating: phenotype matching and separate preferences and traits. I show that both of these factors must be considered for the effects of sexual selection on speciation to be inferred.

  5. On-the-fly selection of cell-specific enhancers, genes, miRNAs and proteins across the human body using SlideBase

    DEFF Research Database (Denmark)

    Ienasescu, Hans-Ioan; Li, Kang; Andersson, Robin

    2016-01-01

    Genomics consortia have produced large datasets profiling the expression of genes, micro-RNAs, enhancers and more across human tissues or cells. There is a need for intuitive tools to select subsets of such data that is the most relevant for specific studies. To this end, we present Slide...... for individual cell types/tissues, producing sets of genes, enhancers etc. which satisfy these constraints. Changes in slider settings result in simultaneous changes in the selected sets, updated in real time. SlideBase is linked to major databases from genomics consortia, including FANTOM, GTEx, The Human...

  6. Genomic scans for selective sweeps using SNP data

    DEFF Research Database (Denmark)

    Nielsen, Rasmus; Williamson, Scott; Kim, Yuseob

    2005-01-01

    of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence...

  7. A computational method based on the integration of heterogeneous networks for predicting disease-gene associations.

    Directory of Open Access Journals (Sweden)

    Xingli Guo

    Full Text Available The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities have shown their power in tackling the issue. In this paper, a novel systematic and global method that integrates two heterogeneous networks for prioritizing candidate disease-causing genes is provided, based on the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein interactions. In this method, the association score function between a query disease and a candidate gene is defined as the weighted sum of all the association scores between similar diseases and neighbouring genes. Moreover, the topological correlation of these two heterogeneous networks can be incorporated into the definition of the score function, and finally an iterative algorithm is designed for this issue. This method was tested with 10-fold cross-validation on all 1,126 diseases that have at least a known causal gene, and it ranked the correct gene as one of the top ten in 622 of all the 1,428 cases, significantly outperforming a state-of-the-art method called PRINCE. The results brought about by this method were applied to study three multi-factorial disorders: breast cancer, Alzheimer disease and diabetes mellitus type 2, and some suggestions of novel causal genes and candidate disease-causing subnetworks were provided for further investigation.

  8. Selective Inhibition of Histone Deacetylation in Melanoma Increases Targeted Gene Delivery by a Bacteriophage Viral Vector

    Directory of Open Access Journals (Sweden)

    Samuel Campbell

    2018-04-01

    Full Text Available The previously developed adeno-associated virus/phage (AAVP vector, a hybrid between M13 bacteriophage (phage viruses that infect bacteria only and human Adeno-Associated Virus (AAV, is a promising tool in targeted gene therapy against cancer. AAVP can be administered systemically and made tissue specific through the use of ligand-directed targeting. Cancer cells and tumor-associated blood vessels overexpress the αν integrin receptors, which are involved in tumor angiogenesis and tumor invasion. AAVP is targeted to these integrins via a double cyclic RGD4C ligand displayed on the phage capsid. Nevertheless, there remain significant host-defense hurdles to the use of AAVP in targeted gene delivery and subsequently in gene therapy. We previously reported that histone deacetylation in cancer constitutes a barrier to AAVP. Herein, to improve AAVP-mediated gene delivery to cancer cells, we combined the vector with selective adjuvant chemicals that inhibit specific histone deacetylases (HDAC. We examined the effects of the HDAC inhibitor C1A that mainly targets HDAC6 and compared this to sodium butyrate, a pan-HDAC inhibitor with broad spectrum HDAC inhibition. We tested the effects on melanoma, known for HDAC6 up-regulation, and compared this side by side with a normal human kidney HEK293 cell line. Varying concentrations were tested to determine cytotoxic levels as well as effects on AAVP gene delivery. We report that the HDAC inhibitor C1A increased AAVP-mediated transgene expression by up to ~9-fold. These findings indicate that selective HDAC inhibition is a promising adjuvant treatment for increasing the therapeutic value of AAVP.

  9. Reference gene selection for qRT-PCR assays in Stellera chamaejasme subjected to abiotic stresses and hormone treatments based on transcriptome datasets

    Directory of Open Access Journals (Sweden)

    Xin Liu

    2018-04-01

    Full Text Available Background Stellera chamaejasme Linn, an important poisonous plant of the China grassland, is toxic to humans and livestock. The rapid expansion of S. chamaejasme has greatly damaged the grassland ecology and, consequently, seriously endangered the development of animal husbandry. To draft efficient prevention and control measures, it has become more urgent to carry out research on its adaptive and expansion mechanisms in different unfavorable habitats at the genetic level. Quantitative real-time polymerase chain reaction (qRT-PCR is a widely used technique for studying gene expression at the transcript level; however, qRT-PCR requires reference genes (RGs as endogenous controls for data normalization and only through appropriate RG selection and qRT-PCR can we guarantee the reliability and robustness of expression studies and RNA-seq data analysis. Unfortunately, little research on the selection of RGs for gene expression data normalization in S. chamaejasme has been reported. Method In this study, 10 candidate RGs namely, 18S, 60S, CYP, GAPCP1, GAPDH2, EF1B, MDH, SAND, TUA1, and TUA6, were singled out from the transcriptome database of S. chamaejasme, and their expression stability under three abiotic stresses (drought, cold, and salt and three hormone treatments (abscisic acid, ABA; gibberellin, GA; ethephon, ETH were estimated with the programs geNorm, NormFinder, and BestKeeper. Result Our results showed that GAPCP1 and EF1B were the best combination for the three abiotic stresses, whereas TUA6 and SAND, TUA1 and CYP, GAPDH2 and 60S were the best choices for ABA, GA, and ETH treatment, respectively. Moreover, GAPCP1 and 60S were assessed to be the best combination for all samples, and 18S was the least stable RG for use as an internal control in all of the experimental subsets. The expression patterns of two target genes (P5CS2 and GI further verified that the RGs that we selected were suitable for gene expression normalization. Discussion

  10. Indicators for Monitoring Water, Sanitation, and Hygiene: A Systematic Review of Indicator Selection Methods

    Directory of Open Access Journals (Sweden)

    Stefanie Schwemlein

    2016-03-01

    Full Text Available Monitoring water, sanitation, and hygiene (WaSH is important to track progress, improve accountability, and demonstrate impacts of efforts to improve conditions and services, especially in low- and middle-income countries. Indicator selection methods enable robust monitoring of WaSH projects and conditions. However, selection methods are not always used and there are no commonly-used methods for selecting WaSH indicators. To address this gap, we conducted a systematic review of indicator selection methods used in WaSH-related fields. We present a summary of indicator selection methods for environment, international development, and water. We identified six methodological stages for selecting indicators for WaSH: define the purpose and scope; select a conceptual framework; search for candidate indicators; determine selection criteria; score indicators against criteria; and select a final suite of indicators. This summary of indicator selection methods provides a foundation for the critical assessment of existing methods. It can be used to inform future efforts to construct indicator sets in WaSH and related fields.

  11. Genetic association between selected cytokine genes and glioblastoma in the Han Chinese population

    International Nuclear Information System (INIS)

    Jin, Tianbo; Li, Xiaolan; Zhang, Jiayi; Wang, Hong; Geng, Tingting; Li, Gang; Gao, Guodong; Chen, Chao

    2013-01-01

    Glioblastoma (GBM) is the most malignant brain tumor. Many abnormal secretion and expression of cytokines have been found in GBM, initially speculated that the occurrence of GBM may be involved in these abnormal secretion of cytokines. This study aims to detect the association of cytokine genes with GBM. We selected seven tag single nucleotide polymorphisms (tSNPs) in six cytokine genes, which previously reported to be associated with brain tumors, and analyzed their association with GBM in a Han Chinese population using χ 2 test and genetic model analysis. We found two risk tSNPs and one protective tSNP. By χ 2 test, the rs1801275 in IL-4R showed an increased risk of GBM. In the genetic model analysis, the genotype “TC” of rs20541 in IL-13 gene showed an increased risk of GBM in over-dominant model (OR = 2.00; 95% CI, 1.13-3.54, p = 0.015); the genotype “CT” of rs1800871 in the IL-10 gene showed a decrease risk in the over-dominant model (OR = 0.57; 95% CI, 0.33 – 0.97; p = 0.037). The genotype “AG” of rs1801275 in the IL-4R gene showed an increase risk in over-dominant model (OR = 2.29; 95% CI, 1.20 - 4.35; p = 0.0081) We further analyzed whether the six cytokine genes have a different effect on the disease in gender specific population, and found that the allele “G” of rs2243248 in the IL-4 gene showed a decrease risk of GBM in female (OR = 0.35, 95% CI, 0.13 - 0.94, p = 0.0032), but the allele “T” showed a decrease risk in male (OR = 0.30, 95% CI, 0.17 - 0.53, p = 0.0032). Our findings, combined with previously reported results, suggest that cytokine genes have potential role in GBM development, which may be useful to early prognostics for GBM in the Han Chinese population

  12. How to Make a Dolphin: Molecular Signature of Positive Selection in Cetacean Genome.

    Directory of Open Access Journals (Sweden)

    Mariana F Nery

    Full Text Available Cetaceans are unique in being the only mammals completely adapted to an aquatic environment. This adaptation has required complex changes and sometimes a complete restructuring of physiology, behavior and morphology. Identifying genes that have been subjected to selection pressure during cetacean evolution would greatly enhance our knowledge of the ways in which genetic variation in this mammalian order has been shaped by natural selection. Here, we performed a genome-wide scan for positive selection in the dolphin lineage. We employed models of codon substitution that account for variation of selective pressure over branches on the tree and across sites in a sequence. We analyzed 7,859 nuclear-coding ortholog genes and using a series of likelihood ratio tests (LRTs, we identified 376 genes (4.8% with molecular signatures of positive selection in the dolphin lineage. We used the cow as the sister group and compared estimates of selection in the cetacean genome to this using the same methods. This allowed us to define which genes have been exclusively under positive selection in the dolphin lineage. The enrichment analysis found that the identified positively selected genes are significantly over-represented for three exclusive functional categories only in the dolphin lineage: segment specification, mesoderm development and system development. Of particular interest for cetacean adaptation to an aquatic life are the following GeneOntology targets under positive selection: genes related to kidney, heart, lung, eye, ear and nervous system development.

  13. An Expression of Periodic Phenomena of Fashion on Sexual Selection Model with Conformity Genes and Memes

    Science.gov (United States)

    Mutoh, Atsuko; Tokuhara, Shinya; Kanoh, Masayoshi; Oboshi, Tamon; Kato, Shohei; Itoh, Hidenori

    It is generally thought that living things have trends in their preferences. The mechanism of occurrence of another trends in successive periods is concerned in their conformity. According to social impact theory, the minority is always exists in the group. There is a possibility that the minority make the transition to the majority by conforming agents. Because of agent's promotion of their conform actions, the majority can make the transition. We proposed an evolutionary model with both genes and memes, and elucidated the interaction between genes and memes on sexual selection. In this paper, we propose an agent model for sexual selection imported the concept of conformity. Using this model we try an environment where male agents and female agents are existed, we find that periodic phenomena of fashion are expressed. And we report the influence of conformity and differentiation on the transition of their preferences.

  14. Differential gene expression from genome-wide microarray analyses distinguishes Lohmann Selected Leghorn and Lohmann Brown layers.

    Directory of Open Access Journals (Sweden)

    Christin Habig

    Full Text Available The Lohmann Selected Leghorn (LSL and Lohmann Brown (LB layer lines have been selected for high egg production since more than 50 years and belong to the worldwide leading commercial layer lines. The objectives of the present study were to characterize the molecular processes that are different among these two layer lines using whole genome RNA expression profiles. The hens were kept in the newly developed small group housing system Eurovent German with two different group sizes. Differential expression was observed for 6,276 microarray probes (FDR adjusted P-value <0.05 among the two layer lines LSL and LB. A 2-fold or greater change in gene expression was identified on 151 probe sets. In LSL, 72 of the 151 probe sets were up- and 79 of them were down-regulated. Gene ontology (GO enrichment analysis accounting for biological processes evinced 18 GO-terms for the 72 probe sets with higher expression in LSL, especially those taking part in immune system processes and membrane organization. A total of 32 enriched GO-terms were determined among the 79 down-regulated probe sets of LSL. Particularly, these terms included phosphorus metabolic processes and signaling pathways. In conclusion, the phenotypic differences among the two layer lines LSL and LB are clearly reflected in their gene expression profiles of the cerebrum. These novel findings provide clues for genes involved in economically important line characteristics of commercial laying hens.

  15. Selection and Validation of Reference Genes for Quantitative Real-Time PCR Normalization Under Ethanol Stress Conditions in Oenococcus oeni SD-2a

    Directory of Open Access Journals (Sweden)

    Shuai Peng

    2018-05-01

    Full Text Available The powerful Quantitative real-time PCR (RT-qPCR was widely used to assess gene expression levels, which requires the optimal reference genes used for normalization. Oenococcus oeni (O. oeni, as the one of most important microorganisms in wine industry and the most resistant lactic acid bacteria (LAB species to ethanol, has not been investigated regarding the selection of stable reference genes for RT-qPCR normalization under ethanol stress conditions. In this study, nine candidate reference genes (proC, dnaG, rpoA, ldhD, ddlA, rrs, gyrA, gyrB, and dpoIII were analyzed to determine the most stable reference genes for RT-qPCR in O. oeni SD-2a under different ethanol stress conditions (8, 12, and 16% (v/v ethanol. The transcript stabilities of these genes were evaluated using the algorithms geNorm, NormFinder, and BestKeeper. The results showed that dnaG and dpoIII were selected as the best reference genes across all experimental ethanol conditions. Considering single stress experimental modes, dpoIII and dnaG would be suitable to normalize expression level for 8% ethanol shock treatment, while the combination of gyrA, gyrB, and rrs would be suitable for 12% ethanol shock treatment. proC and gyrB revealed the most stable expression in 16% ethanol shock treatment. This study selected and validated for the first time the reference genes for RT-qPCR normalization in O. oeni SD-2a under ethanol stress conditions.

  16. Detection of virulence genes in Uropathogenic E. coli (UPEC strains by Multiplex-PCR method

    Directory of Open Access Journals (Sweden)

    Javad Mohammadi

    2017-06-01

    Full Text Available Background & Objectives: Urinary tract infection caused by E. coli is one of the most common illnesses in all age groups worldwide. Presence of virulence genes is a key factor in bacterial pathogens in uroepithelial cells. The present study was performed to detect iha, iroN, ompT genes in the Uropathogenic E.coli isolates from clinical samples using multiplex-PCR method in Kerman. Materials & Methods: In this descriptive cross-sectional study, 200 samples of patients with urinary tract infections in Kerman hospitals were collected. After biochemical and microbiological tests, all strains were tested with regard to the presence of iha, iroN, and ompT genes using multiplex-PCR method. Results: The results of Multiplex-PCR showed that all specimens had one, two, or three virulence genes simultaneously. The highest and lowest frequency distribution of genes was related to iha (56.7% and iroN (20% respectively. Conclusion: According to the prevalence of urinary tract infection in the community and distribution of resistance and virulence factors, the fast and accurate detection of the strains and virulence genes is necessary

  17. Reference gene selection for quantitative reverse transcription-polymerase chain reaction normalization during in vitro adventitious rooting in Eucalyptus globulus Labill.

    Science.gov (United States)

    de Almeida, Márcia R; Ruedell, Carolina M; Ricachenevsky, Felipe K; Sperotto, Raul A; Pasquali, Giancarlo; Fett-Neto, Arthur G

    2010-09-20

    Eucalyptus globulus and its hybrids are very important for the cellulose and paper industry mainly due to their low lignin content and frost resistance. However, rooting of cuttings of this species is recalcitrant and exogenous auxin application is often necessary for good root development. To date one of the most accurate methods available for gene expression analysis is quantitative reverse transcription-polymerase chain reaction (qPCR); however, reliable use of this technique requires reference genes for normalization. There is no single reference gene that can be regarded as universal for all experiments and biological materials. Thus, the identification of reliable reference genes must be done for every species and experimental approach. The present study aimed at identifying suitable control genes for normalization of gene expression associated with adventitious rooting in E. globulus microcuttings. By the use of two distinct algorithms, geNorm and NormFinder, we have assessed gene expression stability of eleven candidate reference genes in E. globulus: 18S, ACT2, EF2, EUC12, H2B, IDH, SAND, TIP41, TUA, UBI and 33380. The candidate reference genes were evaluated in microccuttings rooted in vitro, in presence or absence of auxin, along six time-points spanning the process of adventitious rooting. Overall, the stability profiles of these genes determined with each one of the algorithms were very similar. Slight differences were observed in the most stable pair of genes indicated by each program: IDH and SAND for geNorm, and H2B and TUA for NormFinder. Both programs identified UBI and 18S as the most variable genes. To validate these results and select the most suitable reference genes, the expression profile of the ARGONAUTE1 gene was evaluated in relation to the most stable candidate genes indicated by each algorithm. Our study showed that expression stability varied between putative reference genes tested in E. globulus. Based on the AGO1 relative expression

  18. Distinguishing between Selective Sweeps from Standing Variation and from a De Novo Mutation

    Science.gov (United States)

    Peter, Benjamin M.; Huerta-Sanchez, Emilia; Nielsen, Rasmus

    2012-01-01

    An outstanding question in human genetics has been the degree to which adaptation occurs from standing genetic variation or from de novo mutations. Here, we combine several common statistics used to detect selection in an Approximate Bayesian Computation (ABC) framework, with the goal of discriminating between models of selection and providing estimates of the age of selected alleles and the selection coefficients acting on them. We use simulations to assess the power and accuracy of our method and apply it to seven of the strongest sweeps currently known in humans. We identify two genes, ASPM and PSCA, that are most likely affected by selection on standing variation; and we find three genes, ADH1B, LCT, and EDAR, in which the adaptive alleles seem to have swept from a new mutation. We also confirm evidence of selection for one further gene, TRPV6. In one gene, G6PD, neither neutral models nor models of selective sweeps fit the data, presumably because this locus has been subject to balancing selection. PMID:23071458

  19. Distinguishing between selective sweeps from standing variation and from a de novo mutation.

    Directory of Open Access Journals (Sweden)

    Benjamin M Peter

    Full Text Available An outstanding question in human genetics has been the degree to which adaptation occurs from standing genetic variation or from de novo mutations. Here, we combine several common statistics used to detect selection in an Approximate Bayesian Computation (ABC framework, with the goal of discriminating between models of selection and providing estimates of the age of selected alleles and the selection coefficients acting on them. We use simulations to assess the power and accuracy of our method and apply it to seven of the strongest sweeps currently known in humans. We identify two genes, ASPM and PSCA, that are most likely affected by selection on standing variation; and we find three genes, ADH1B, LCT, and EDAR, in which the adaptive alleles seem to have swept from a new mutation. We also confirm evidence of selection for one further gene, TRPV6. In one gene, G6PD, neither neutral models nor models of selective sweeps fit the data, presumably because this locus has been subject to balancing selection.

  20. Balancing selection and recombination as evolutionary forces caused population genetic variations in golden pheasant MHC class I genes.

    Science.gov (United States)

    Zeng, Qian-Qian; He, Ke; Sun, Dan-Dan; Ma, Mei-Ying; Ge, Yun-Fa; Fang, Sheng-Guo; Wan, Qiu-Hong

    2016-02-18

    The major histocompatibility complex (MHC) genes are vital partners in the acquired immune processes of vertebrates. MHC diversity may be directly associated with population resistance to infectious pathogens. Here, we screened for polymorphisms in exons 2 and 3 of the IA1 and IA2 genes in 12 golden pheasant populations across the Chinese mainland to characterize their genetic variation levels, to understand the effects of historical positive selection and recombination in shaping class I diversity, and to investigate the genetic structure of wild golden pheasant populations. Among 339 individual pheasants, we identified 14 IA1 alleles in exon 2 (IA1-E2), 11 IA1-E3 alleles, 27 IA2-E2 alleles, and 28 IA2-E3 alleles. The non-synonymous substitution rate was significantly greater than the synonymous substitution rate at sequences in the IA2 gene encoding putative peptide-binding sites but not in the IA1 gene; we also found more positively selected sites in IA2 than in IA1. Frequent recombination events resulted in at least 9 recombinant IA2 alleles, in accordance with the intermingling pattern of the phylogenetic tree. Although some IA alleles are widely shared among studied populations, large variation occurs in the number of IA alleles across these populations. Allele frequency analysis across 2 IA loci showed low levels of genetic differentiation among populations on small geographic scales; however, significant genetic differentiation was observed between pheasants from the northern and southern regions of the Yangtze River. Both STRUCTURE analysis and F-statistic (F ST ) value comparison classified those populations into 2 major groups: the northern region of the Yangtze River (NYR) and the southern region of the Yangtze River (SYR). More extensive polymorphisms in IA2 than IA1 indicate that IA2 has undergone much stronger positive-selection pressure during evolution. Moreover, the recombination events detected between the genes and the intermingled phylogenetic