WorldWideScience

Sample records for validated all-qsar models

  1. Structural refinement and prediction of potential CCR2 antagonists through validated multi-QSAR modeling studies.

    Science.gov (United States)

    Amin, Sk Abdul; Adhikari, Nilanjan; Baidya, Sandip Kumar; Gayen, Shovanlal; Jha, Tarun

    2018-01-03

    Chemokines trigger numerous inflammatory responses and modulate the immune system. The interaction between monocyte chemoattractant protein-1 and chemokine receptor 2 (CCR2) may be the cause of atherosclerosis, obesity, and insulin resistance. However, CCR2 is also implicated in other inflammatory diseases such as rheumatoid arthritis, multiple sclerosis, asthma, and neuropathic pain. Therefore, there is a paramount importance of designing potent and selective CCR2 antagonists despite a number of drug candidates failed in clinical trials. In this article, 83 CCR2 antagonists by Jhonson and Jhonson Pharmaceuticals have been considered for robust validated multi-QSAR modeling studies to get an idea about the structural and pharmacophoric requirements for designing more potent CCR2 antagonists. All these QSAR models were validated and statistically reliable. Observations resulted from different modeling studies correlated and validated results of other ones. Finally, depending on these QSAR observations, some new molecules were proposed that may exhibit higher activity against CCR2.

  2. On various metrics used for validation of predictive QSAR models with applications in virtual screening and focused library design.

    Science.gov (United States)

    Roy, Kunal; Mitra, Indrani

    2011-07-01

    Quantitative structure-activity relationships (QSARs) have important applications in drug discovery research, environmental fate modeling, property prediction, etc. Validation has been recognized as a very important step for QSAR model development. As one of the important objectives of QSAR modeling is to predict activity/property/toxicity of new chemicals falling within the domain of applicability of the developed models and QSARs are being used for regulatory decisions, checking reliability of the models and confidence of their predictions is a very important aspect, which can be judged during the validation process. One prime application of a statistically significant QSAR model is virtual screening for molecules with improved potency based on the pharmacophoric features and the descriptors appearing in the QSAR model. Validated QSAR models may also be utilized for design of focused libraries which may be subsequently screened for the selection of hits. The present review focuses on various metrics used for validation of predictive QSAR models together with an overview of the application of QSAR models in the fields of virtual screening and focused library design for diverse series of compounds with citation of some recent examples.

  3. QSAR models for anti-malarial activity of 4-aminoquinolines.

    Science.gov (United States)

    Masand, Vijay H; Toropov, Andrey A; Toropova, Alla P; Mahajan, Devidas T

    2014-03-01

    In the present study, predictive quantitative structure - activity relationship (QSAR) models for anti-malarial activity of 4-aminoquinolines have been developed. CORAL, which is freely available on internet (http://www.insilico.eu/coral), has been used as a tool of QSAR analysis to establish statistically robust QSAR model of anti-malarial activity of 4-aminoquinolines. Six random splits into the visible sub-system of the training and invisible subsystem of validation were examined. Statistical qualities for these splits vary, but in all these cases, statistical quality of prediction for anti-malarial activity was quite good. The optimal SMILES-based descriptor was used to derive the single descriptor based QSAR model for a data set of 112 aminoquinolones. All the splits had r(2)> 0.85 and r(2)> 0.78 for subtraining and validation sets, respectively. The three parametric multilinear regression (MLR) QSAR model has Q(2) = 0.83, R(2) = 0.84 and F = 190.39. The anti-malarial activity has strong correlation with presence/absence of nitrogen and oxygen at a topological distance of six.

  4. QSAR Modeling: Where have you been? Where are you going to?

    Science.gov (United States)

    Cherkasov, Artem; Muratov, Eugene N.; Fourches, Denis; Varnek, Alexandre; Baskin, Igor I.; Cronin, Mark; Dearden, John; Gramatica, Paola; Martin, Yvonne C.; Todeschini, Roberto; Consonni, Viviana; Kuz'min, Victor E.; Cramer, Richard; Benigni, Romualdo; Yang, Chihae; Rathman, James; Terfloth, Lothar; Gasteiger, Johann; Richard, Ann; Tropsha, Alexander

    2014-01-01

    Quantitative Structure-Activity Relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss: (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists towards collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making. PMID:24351051

  5. Validity and validation of expert (Q)SAR systems.

    Science.gov (United States)

    Hulzebos, E; Sijm, D; Traas, T; Posthumus, R; Maslankiewicz, L

    2005-08-01

    At a recent workshop in Setubal (Portugal) principles were drafted to assess the suitability of (quantitative) structure-activity relationships ((Q)SARs) for assessing the hazards and risks of chemicals. In the present study we applied some of the Setubal principles to test the validity of three (Q)SAR expert systems and validate the results. These principles include a mechanistic basis, the availability of a training set and validation. ECOSAR, BIOWIN and DEREK for Windows have a mechanistic or empirical basis. ECOSAR has a training set for each QSAR. For half of the structural fragments the number of chemicals in the training set is >4. Based on structural fragments and log Kow, ECOSAR uses linear regression to predict ecotoxicity. Validating ECOSAR for three 'valid' classes results in predictivity of > or = 64%. BIOWIN uses (non-)linear regressions to predict the probability of biodegradability based on fragments and molecular weight. It has a large training set and predicts non-ready biodegradability well. DEREK for Windows predictions are supported by a mechanistic rationale and literature references. The structural alerts in this program have been developed with a training set of positive and negative toxicity data. However, to support the prediction only a limited number of chemicals in the training set is presented to the user. DEREK for Windows predicts effects by 'if-then' reasoning. The program predicts best for mutagenicity and carcinogenicity. Each structural fragment in ECOSAR and DEREK for Windows needs to be evaluated and validated separately.

  6. Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient.

    Science.gov (United States)

    Chirico, Nicola; Gramatica, Paola

    2011-09-26

    The main utility of QSAR models is their ability to predict activities/properties for new chemicals, and this external prediction ability is evaluated by means of various validation criteria. As a measure for such evaluation the OECD guidelines have proposed the predictive squared correlation coefficient Q(2)(F1) (Shi et al.). However, other validation criteria have been proposed by other authors: the Golbraikh-Tropsha method, r(2)(m) (Roy), Q(2)(F2) (Schüürmann et al.), Q(2)(F3) (Consonni et al.). In QSAR studies these measures are usually in accordance, though this is not always the case, thus doubts can arise when contradictory results are obtained. It is likely that none of the aforementioned criteria is the best in every situation, so a comparative study using simulated data sets is proposed here, using threshold values suggested by the proponents or those widely used in QSAR modeling. In addition, a different and simple external validation measure, the concordance correlation coefficient (CCC), is proposed and compared with other criteria. Huge data sets were used to study the general behavior of validation measures, and the concordance correlation coefficient was shown to be the most restrictive. On using simulated data sets of a more realistic size, it was found that CCC was broadly in agreement, about 96% of the time, with other validation measures in accepting models as predictive, and in almost all the examples it was the most precautionary. The proposed concordance correlation coefficient also works well on real data sets, where it seems to be more stable, and helps in making decisions when the validation measures are in conflict. Since it is conceptually simple, and given its stability and restrictiveness, we propose the concordance correlation coefficient as a complementary, or alternative, more prudent measure of a QSAR model to be externally predictive.

  7. Combinatorial QSAR modeling of chemical toxicants tested against Tetrahymena pyriformis.

    Science.gov (United States)

    Zhu, Hao; Tropsha, Alexander; Fourches, Denis; Varnek, Alexandre; Papa, Ester; Gramatica, Paola; Oberg, Tomas; Dao, Phuong; Cherkasov, Artem; Tetko, Igor V

    2008-04-01

    Selecting most rigorous quantitative structure-activity relationship (QSAR) approaches is of great importance in the development of robust and predictive models of chemical toxicity. To address this issue in a systematic way, we have formed an international virtual collaboratory consisting of six independent groups with shared interests in computational chemical toxicology. We have compiled an aqueous toxicity data set containing 983 unique compounds tested in the same laboratory over a decade against Tetrahymena pyriformis. A modeling set including 644 compounds was selected randomly from the original set and distributed to all groups that used their own QSAR tools for model development. The remaining 339 compounds in the original set (external set I) as well as 110 additional compounds (external set II) published recently by the same laboratory (after this computational study was already in progress) were used as two independent validation sets to assess the external predictive power of individual models. In total, our virtual collaboratory has developed 15 different types of QSAR models of aquatic toxicity for the training set. The internal prediction accuracy for the modeling set ranged from 0.76 to 0.93 as measured by the leave-one-out cross-validation correlation coefficient ( Q abs2). The prediction accuracy for the external validation sets I and II ranged from 0.71 to 0.85 (linear regression coefficient R absI2) and from 0.38 to 0.83 (linear regression coefficient R absII2), respectively. The use of an applicability domain threshold implemented in most models generally improved the external prediction accuracy but at the same time led to a decrease in chemical space coverage. Finally, several consensus models were developed by averaging the predicted aquatic toxicity for every compound using all 15 models, with or without taking into account their respective applicability domains. We find that consensus models afford higher prediction accuracy for the

  8. Predictive QSAR Models for the Toxicity of Disinfection Byproducts

    Directory of Open Access Journals (Sweden)

    Litang Qin

    2017-10-01

    Full Text Available Several hundred disinfection byproducts (DBPs in drinking water have been identified, and are known to have potentially adverse health effects. There are toxicological data gaps for most DBPs, and the predictive method may provide an effective way to address this. The development of an in-silico model of toxicology endpoints of DBPs is rarely studied. The main aim of the present study is to develop predictive quantitative structure–activity relationship (QSAR models for the reactive toxicities of 50 DBPs in the five bioassays of X-Microtox, GSH+, GSH−, DNA+ and DNA−. All-subset regression was used to select the optimal descriptors, and multiple linear-regression models were built. The developed QSAR models for five endpoints satisfied the internal and external validation criteria: coefficient of determination (R2 > 0.7, explained variance in leave-one-out prediction (Q2LOO and in leave-many-out prediction (Q2LMO > 0.6, variance explained in external prediction (Q2F1, Q2F2, and Q2F3 > 0.7, and concordance correlation coefficient (CCC > 0.85. The application domains and the meaning of the selective descriptors for the QSAR models were discussed. The obtained QSAR models can be used in predicting the toxicities of the 50 DBPs.

  9. Predictive QSAR Models for the Toxicity of Disinfection Byproducts.

    Science.gov (United States)

    Qin, Litang; Zhang, Xin; Chen, Yuhan; Mo, Lingyun; Zeng, Honghu; Liang, Yanpeng

    2017-10-09

    Several hundred disinfection byproducts (DBPs) in drinking water have been identified, and are known to have potentially adverse health effects. There are toxicological data gaps for most DBPs, and the predictive method may provide an effective way to address this. The development of an in-silico model of toxicology endpoints of DBPs is rarely studied. The main aim of the present study is to develop predictive quantitative structure-activity relationship (QSAR) models for the reactive toxicities of 50 DBPs in the five bioassays of X-Microtox, GSH+, GSH-, DNA+ and DNA-. All-subset regression was used to select the optimal descriptors, and multiple linear-regression models were built. The developed QSAR models for five endpoints satisfied the internal and external validation criteria: coefficient of determination ( R ²) > 0.7, explained variance in leave-one-out prediction ( Q ² LOO ) and in leave-many-out prediction ( Q ² LMO ) > 0.6, variance explained in external prediction ( Q ² F1 , Q ² F2 , and Q ² F3 ) > 0.7, and concordance correlation coefficient ( CCC ) > 0.85. The application domains and the meaning of the selective descriptors for the QSAR models were discussed. The obtained QSAR models can be used in predicting the toxicities of the 50 DBPs.

  10. Structural exploration for the refinement of anticancer matrix metalloproteinase-2 inhibitor designing approaches through robust validated multi-QSARs

    Science.gov (United States)

    Adhikari, Nilanjan; Amin, Sk. Abdul; Saha, Achintya; Jha, Tarun

    2018-03-01

    Matrix metalloproteinase-2 (MMP-2) is a promising pharmacological target for designing potential anticancer drugs. MMP-2 plays critical functions in apoptosis by cleaving the DNA repair enzyme namely poly (ADP-ribose) polymerase (PARP). Moreover, MMP-2 expression triggers the vascular endothelial growth factor (VEGF) having a positive influence on tumor size, invasion, and angiogenesis. Therefore, it is an urgent need to develop potential MMP-2 inhibitors without any toxicity but better pharmacokinetic property. In this article, robust validated multi-quantitative structure-activity relationship (QSAR) modeling approaches were attempted on a dataset of 222 MMP-2 inhibitors to explore the important structural and pharmacophoric requirements for higher MMP-2 inhibition. Different validated regression and classification-based QSARs, pharmacophore mapping and 3D-QSAR techniques were performed. These results were challenged and subjected to further validation to explain 24 in house MMP-2 inhibitors to judge the reliability of these models further. All these models were individually validated internally as well as externally and were supported and validated by each other. These results were further justified by molecular docking analysis. Modeling techniques adopted here not only helps to explore the necessary structural and pharmacophoric requirements but also for the overall validation and refinement techniques for designing potential MMP-2 inhibitors.

  11. An ensemble model of QSAR tools for regulatory risk assessment.

    Science.gov (United States)

    Pradeep, Prachi; Povinelli, Richard J; White, Shannon; Merrill, Stephen J

    2016-01-01

    Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa ( κ ): 0

  12. Experimental Errors in QSAR Modeling Sets: What We Can Do and What We Cannot Do.

    Science.gov (United States)

    Zhao, Linlin; Wang, Wenyi; Sedykh, Alexander; Zhu, Hao

    2017-06-30

    Numerous chemical data sets have become available for quantitative structure-activity relationship (QSAR) modeling studies. However, the quality of different data sources may be different based on the nature of experimental protocols. Therefore, potential experimental errors in the modeling sets may lead to the development of poor QSAR models and further affect the predictions of new compounds. In this study, we explored the relationship between the ratio of questionable data in the modeling sets, which was obtained by simulating experimental errors, and the QSAR modeling performance. To this end, we used eight data sets (four continuous endpoints and four categorical endpoints) that have been extensively curated both in-house and by our collaborators to create over 1800 various QSAR models. Each data set was duplicated to create several new modeling sets with different ratios of simulated experimental errors (i.e., randomizing the activities of part of the compounds) in the modeling process. A fivefold cross-validation process was used to evaluate the modeling performance, which deteriorates when the ratio of experimental errors increases. All of the resulting models were also used to predict external sets of new compounds, which were excluded at the beginning of the modeling process. The modeling results showed that the compounds with relatively large prediction errors in cross-validation processes are likely to be those with simulated experimental errors. However, after removing a certain number of compounds with large prediction errors in the cross-validation process, the external predictions of new compounds did not show improvement. Our conclusion is that the QSAR predictions, especially consensus predictions, can identify compounds with potential experimental errors. But removing those compounds by the cross-validation procedure is not a reasonable means to improve model predictivity due to overfitting.

  13. QSAR modelling using combined simple competitive learning networks and RBF neural networks.

    Science.gov (United States)

    Sheikhpour, R; Sarram, M A; Rezaeian, M; Sheikhpour, E

    2018-04-01

    The aim of this study was to propose a QSAR modelling approach based on the combination of simple competitive learning (SCL) networks with radial basis function (RBF) neural networks for predicting the biological activity of chemical compounds. The proposed QSAR method consisted of two phases. In the first phase, an SCL network was applied to determine the centres of an RBF neural network. In the second phase, the RBF neural network was used to predict the biological activity of various phenols and Rho kinase (ROCK) inhibitors. The predictive ability of the proposed QSAR models was evaluated and compared with other QSAR models using external validation. The results of this study showed that the proposed QSAR modelling approach leads to better performances than other models in predicting the biological activity of chemical compounds. This indicated the efficiency of simple competitive learning networks in determining the centres of RBF neural networks.

  14. Consensus hologram QSAR modeling for the prediction of human intestinal absorption.

    Science.gov (United States)

    Moda, Tiago L; Andricopulo, Adriano D

    2012-04-15

    Consistent in silico models for ADME properties are useful tools in early drug discovery. Here, we report the hologram QSAR modeling of human intestinal absorption using a dataset of 638 compounds with experimental data associated. The final validated models are consistent and robust for the consensus prediction of this important pharmacokinetic property and are suitable for virtual screening applications. Copyright © 2012 Elsevier Ltd. All rights reserved.

  15. QSAR Models for Thyroperoxidase Inhibition and Screening of U.S. and EU Chemical Inventories

    DEFF Research Database (Denmark)

    Abildgaard Rosenberg, Sine; D. Watt, Eric; Judson, Richard S.

    2017-01-01

    to QSAR1. Of the substances predicted within QSAR2’s applicability domain, 8,790 (19.3%) REACH substances and 7,166 (19.0%) U.S. EPA substances, respectively, were predicted to be TPO inhibitors. A case study on butyl hydroxyanisole (BHA), which is extensively used as an antioxidant, was included.......6% (SD = 4.6%) and 85.3%, respectively. The external validation test set was subsequently merged with the training set to constitute a larger training set totaling 1,519 chemicals for a second model, QSAR2, which underwent robust cross-validation with a balanced accuracy of 82.7% (SD = 2.2%). An analysis...... of QSAR2 identified the ten most discriminating structural features for TPO inhibition and non-inhibition, respectively. Both models were used to screen 72,524 REACH substances and 32,197 U.S. EPA substances, and QSAR2 with the expanded training set had an approximately 10% larger coverages compared...

  16. Validation of Quantitative Structure-Activity Relationship (QSAR Model for Photosensitizer Activity Prediction

    Directory of Open Access Journals (Sweden)

    Sharifuddin M. Zain

    2011-11-01

    Full Text Available Photodynamic therapy is a relatively new treatment method for cancer which utilizes a combination of oxygen, a photosensitizer and light to generate reactive singlet oxygen that eradicates tumors via direct cell-killing, vasculature damage and engagement of the immune system. Most of photosensitizers that are in clinical and pre-clinical assessments, or those that are already approved for clinical use, are mainly based on cyclic tetrapyrroles. In an attempt to discover new effective photosensitizers, we report the use of the quantitative structure-activity relationship (QSAR method to develop a model that could correlate the structural features of cyclic tetrapyrrole-based compounds with their photodynamic therapy (PDT activity. In this study, a set of 36 porphyrin derivatives was used in the model development where 24 of these compounds were in the training set and the remaining 12 compounds were in the test set. The development of the QSAR model involved the use of the multiple linear regression analysis (MLRA method. Based on the method, r2 value, r2 (CV value and r2 prediction value of 0.87, 0.71 and 0.70 were obtained. The QSAR model was also employed to predict the experimental compounds in an external test set. This external test set comprises 20 porphyrin-based compounds with experimental IC50 values ranging from 0.39 µM to 7.04 µM. Thus the model showed good correlative and predictive ability, with a predictive correlation coefficient (r2 prediction for external test set of 0.52. The developed QSAR model was used to discover some compounds as new lead photosensitizers from this external test set.

  17. Predicting chemically-induced skin reactions. Part I: QSAR models of skin sensitization and their application to identify potentially hazardous compounds

    Energy Technology Data Exchange (ETDEWEB)

    Alves, Vinicius M. [Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220 (Brazil); Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Muratov, Eugene [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical-Chemical Institute NAS of Ukraine, Odessa 65080 (Ukraine); Fourches, Denis [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Strickland, Judy; Kleinstreuer, Nicole [ILS/Contractor Supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709 (United States); Andrade, Carolina H. [Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220 (Brazil); Tropsha, Alexander, E-mail: alex_tropsha@unc.edu [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States)

    2015-04-15

    Repetitive exposure to a chemical agent can induce an immune reaction in inherently susceptible individuals that leads to skin sensitization. Although many chemicals have been reported as skin sensitizers, there have been very few rigorously validated QSAR models with defined applicability domains (AD) that were developed using a large group of chemically diverse compounds. In this study, we have aimed to compile, curate, and integrate the largest publicly available dataset related to chemically-induced skin sensitization, use this data to generate rigorously validated and QSAR models for skin sensitization, and employ these models as a virtual screening tool for identifying putative sensitizers among environmental chemicals. We followed best practices for model building and validation implemented with our predictive QSAR workflow using Random Forest modeling technique in combination with SiRMS and Dragon descriptors. The Correct Classification Rate (CCR) for QSAR models discriminating sensitizers from non-sensitizers was 71–88% when evaluated on several external validation sets, within a broad AD, with positive (for sensitizers) and negative (for non-sensitizers) predicted rates of 85% and 79% respectively. When compared to the skin sensitization module included in the OECD QSAR Toolbox as well as to the skin sensitization model in publicly available VEGA software, our models showed a significantly higher prediction accuracy for the same sets of external compounds as evaluated by Positive Predicted Rate, Negative Predicted Rate, and CCR. These models were applied to identify putative chemical hazards in the Scorecard database of possible skin or sense organ toxicants as primary candidates for experimental validation. - Highlights: • It was compiled the largest publicly-available skin sensitization dataset. • Predictive QSAR models were developed for skin sensitization. • Developed models have higher prediction accuracy than OECD QSAR Toolbox. • Putative

  18. Predicting chemically-induced skin reactions. Part I: QSAR models of skin sensitization and their application to identify potentially hazardous compounds

    International Nuclear Information System (INIS)

    Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander

    2015-01-01

    Repetitive exposure to a chemical agent can induce an immune reaction in inherently susceptible individuals that leads to skin sensitization. Although many chemicals have been reported as skin sensitizers, there have been very few rigorously validated QSAR models with defined applicability domains (AD) that were developed using a large group of chemically diverse compounds. In this study, we have aimed to compile, curate, and integrate the largest publicly available dataset related to chemically-induced skin sensitization, use this data to generate rigorously validated and QSAR models for skin sensitization, and employ these models as a virtual screening tool for identifying putative sensitizers among environmental chemicals. We followed best practices for model building and validation implemented with our predictive QSAR workflow using Random Forest modeling technique in combination with SiRMS and Dragon descriptors. The Correct Classification Rate (CCR) for QSAR models discriminating sensitizers from non-sensitizers was 71–88% when evaluated on several external validation sets, within a broad AD, with positive (for sensitizers) and negative (for non-sensitizers) predicted rates of 85% and 79% respectively. When compared to the skin sensitization module included in the OECD QSAR Toolbox as well as to the skin sensitization model in publicly available VEGA software, our models showed a significantly higher prediction accuracy for the same sets of external compounds as evaluated by Positive Predicted Rate, Negative Predicted Rate, and CCR. These models were applied to identify putative chemical hazards in the Scorecard database of possible skin or sense organ toxicants as primary candidates for experimental validation. - Highlights: • It was compiled the largest publicly-available skin sensitization dataset. • Predictive QSAR models were developed for skin sensitization. • Developed models have higher prediction accuracy than OECD QSAR Toolbox. • Putative

  19. Development of quantitative structure activity relationship (QSAR) model for disinfection byproduct (DBP) research: A review of methods and resources

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Baiyang, E-mail: poplar_chen@hotmail.com [Harbin Institute of Technology Shenzhen Graduate School, Shenzhen Key Laboratory of Water Resource Utilization and Environmental Pollution Control, Shenzhen 518055 (China); Zhang, Tian [Harbin Institute of Technology Shenzhen Graduate School, Shenzhen Key Laboratory of Water Resource Utilization and Environmental Pollution Control, Shenzhen 518055 (China); Bond, Tom [Department of Civil and Environmental Engineering, Imperial College, London SW7 2AZ (United Kingdom); Gan, Yiqun [Harbin Institute of Technology Shenzhen Graduate School, Shenzhen Key Laboratory of Water Resource Utilization and Environmental Pollution Control, Shenzhen 518055 (China)

    2015-12-15

    Quantitative structure–activity relationship (QSAR) models are tools for linking chemical activities with molecular structures and compositions. Due to the concern about the proliferating number of disinfection byproducts (DBPs) in water and the associated financial and technical burden, researchers have recently begun to develop QSAR models to investigate the toxicity, formation, property, and removal of DBPs. However, there are no standard procedures or best practices regarding how to develop QSAR models, which potentially limit their wide acceptance. In order to facilitate more frequent use of QSAR models in future DBP research, this article reviews the processes required for QSAR model development, summarizes recent trends in QSAR-DBP studies, and shares some important resources for QSAR development (e.g., free databases and QSAR programs). The paper follows the four steps of QSAR model development, i.e., data collection, descriptor filtration, algorithm selection, and model validation; and finishes by highlighting several research needs. Because QSAR models may have an important role in progressing our understanding of DBP issues, it is hoped that this paper will encourage their future use for this application.

  20. Development of quantitative structure activity relationship (QSAR) model for disinfection byproduct (DBP) research: A review of methods and resources

    International Nuclear Information System (INIS)

    Chen, Baiyang; Zhang, Tian; Bond, Tom; Gan, Yiqun

    2015-01-01

    Quantitative structure–activity relationship (QSAR) models are tools for linking chemical activities with molecular structures and compositions. Due to the concern about the proliferating number of disinfection byproducts (DBPs) in water and the associated financial and technical burden, researchers have recently begun to develop QSAR models to investigate the toxicity, formation, property, and removal of DBPs. However, there are no standard procedures or best practices regarding how to develop QSAR models, which potentially limit their wide acceptance. In order to facilitate more frequent use of QSAR models in future DBP research, this article reviews the processes required for QSAR model development, summarizes recent trends in QSAR-DBP studies, and shares some important resources for QSAR development (e.g., free databases and QSAR programs). The paper follows the four steps of QSAR model development, i.e., data collection, descriptor filtration, algorithm selection, and model validation; and finishes by highlighting several research needs. Because QSAR models may have an important role in progressing our understanding of DBP issues, it is hoped that this paper will encourage their future use for this application.

  1. QSAR models for the removal of organic micropollutants in four different river water matrices

    KAUST Repository

    Sudhakaran, Sairam

    2012-04-01

    Ozonation is an advanced water treatment process used to remove organic micropollutants (OMPs) such as pharmaceuticals and personal care products (PPCPs). In this study, Quantitative Structure Activity Relationship (QSAR) models, for ozonation and advanced oxidation process (AOP), were developed with percent-removal of OMPs by ozonation as the criterion variable. The models focused on PPCPs and pesticides elimination in bench-scale studies done within natural water matrices: Colorado River, Passaic River, Ohio River and Suwannee synthetic water. The OMPs removal for the different water matrices varied depending on the water quality conditions such as pH, DOC, alkalinity. The molecular descriptors used to define the OMPs physico-chemical properties range from one-dimensional (atom counts) to three-dimensional (quantum-chemical). Based on a statistical modeling approach using more than 40 molecular descriptors as predictors, descriptors influencing ozonation/AOP were chosen for inclusion in the QSAR models. The modeling approach was based on multiple linear regression (MLR). Also, a global model based on neural networks was created, compiling OMPs from all the four river water matrices. The chemically relevant molecular descriptors involved in the QSAR models were: energy difference between lowest unoccupied and highest occupied molecular orbital (E LUMO-E HOMO), electron-affinity (EA), number of halogen atoms (#X), number of ring atoms (#ring atoms), weakly polar component of the solvent accessible surface area (WPSA) and oxygen to carbon ratio (O/C). All the QSAR models resulted in a goodness-of-fit, R 2, greater than 0.8. Internal and external validations were performed on the models. © 2011 Elsevier Ltd.

  2. QSAR Studies on Andrographolide Derivatives as α-Glucosidase Inhibitors

    Directory of Open Access Journals (Sweden)

    Shaohui Cai

    2010-03-01

    Full Text Available Andrographolide derivatives were shown to inhibit α-glucosidase. To investigate the relationship between activities and structures of andrographolide derivatives, a training set was chosen from 25 andrographolide derivatives by the principal component analysis (PCA method, and a quantitative structure-activity relationship (QSAR was established by 2D and 3D QSAR methods. The cross-validation r2 (0.731 and standard error (0.225 illustrated that the 2D-QSAR model was able to identify the important molecular fragments and the cross-validation r2 (0.794 and standard error (0.127 demonstrated that the 3D-QSAR model was capable of exploring the spatial distribution of important fragments. The obtained results suggested that proposed combination of 2D and 3D QSAR models could be useful in predicting the α-glucosidase inhibiting activity of andrographolide derivatives.

  3. Does rational selection of training and test sets improve the outcome of QSAR modeling?

    Science.gov (United States)

    Martin, Todd M; Harten, Paul; Young, Douglas M; Muratov, Eugene N; Golbraikh, Alexander; Zhu, Hao; Tropsha, Alexander

    2012-10-22

    Prior to using a quantitative structure activity relationship (QSAR) model for external predictions, its predictive power should be established and validated. In the absence of a true external data set, the best way to validate the predictive ability of a model is to perform its statistical external validation. In statistical external validation, the overall data set is divided into training and test sets. Commonly, this splitting is performed using random division. Rational splitting methods can divide data sets into training and test sets in an intelligent fashion. The purpose of this study was to determine whether rational division methods lead to more predictive models compared to random division. A special data splitting procedure was used to facilitate the comparison between random and rational division methods. For each toxicity end point, the overall data set was divided into a modeling set (80% of the overall set) and an external evaluation set (20% of the overall set) using random division. The modeling set was then subdivided into a training set (80% of the modeling set) and a test set (20% of the modeling set) using rational division methods and by using random division. The Kennard-Stone, minimal test set dissimilarity, and sphere exclusion algorithms were used as the rational division methods. The hierarchical clustering, random forest, and k-nearest neighbor (kNN) methods were used to develop QSAR models based on the training sets. For kNN QSAR, multiple training and test sets were generated, and multiple QSAR models were built. The results of this study indicate that models based on rational division methods generate better statistical results for the test sets than models based on random division, but the predictive power of both types of models are comparable.

  4. QSAR as a random event: modeling of nanoparticles uptake in PaCa2 cancer cells.

    Science.gov (United States)

    Toropov, Andrey A; Toropova, Alla P; Puzyn, Tomasz; Benfenati, Emilio; Gini, Giuseppina; Leszczynska, Danuta; Leszczynski, Jerzy

    2013-06-01

    Quantitative structure-property/activity relationships (QSPRs/QSARs) are a tool to predict various endpoints for various substances. The "classic" QSPR/QSAR analysis is based on the representation of the molecular structure by the molecular graph. However, simplified molecular input-line entry system (SMILES) gradually becomes most popular representation of the molecular structure in the databases available on the Internet. Under such circumstances, the development of molecular descriptors calculated directly from SMILES becomes attractive alternative to "classic" descriptors. The CORAL software (http://www.insilico.eu/coral) is provider of SMILES-based optimal molecular descriptors which are aimed to correlate with various endpoints. We analyzed data set on nanoparticles uptake in PaCa2 pancreatic cancer cells. The data set includes 109 nanoparticles with the same core but different surface modifiers (small organic molecules). The concept of a QSAR as a random event is suggested in opposition to "classic" QSARs which are based on the only one distribution of available data into the training and the validation sets. In other words, five random splits into the "visible" training set and the "invisible" validation set were examined. The SMILES-based optimal descriptors (obtained by the Monte Carlo technique) for these splits are calculated with the CORAL software. The statistical quality of all these models is good. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. Sigma-2 receptor ligands QSAR model dataset

    Directory of Open Access Journals (Sweden)

    Antonio Rescifina

    2017-08-01

    Full Text Available The data have been obtained from the Sigma-2 Receptor Selective Ligands Database (S2RSLDB and refined according to the QSAR requirements. These data provide information about a set of 548 Sigma-2 (σ2 receptor ligands selective over Sigma-1 (σ1 receptor. The development of the QSAR model has been undertaken with the use of CORAL software using SMILES, molecular graphs and hybrid descriptors (SMILES and graph together. Data here reported include the regression for σ2 receptor pKi QSAR models. The QSAR model was also employed to predict the σ2 receptor pKi values of the FDA approved drugs that are herewith included.

  6. Proposição, validação e análise dos modelos que correlacionam estrutura química e atividade biológica Proposition, validation and analysis of QSAR models

    Directory of Open Access Journals (Sweden)

    Anderson Coser Gaudio

    2001-10-01

    Full Text Available The present paper aims to bring under discussion some theoretical and practical aspects about the proposition, validation and analysis of QSAR models based on multiple linear regression. A comprehensive approach for the derivation of extrathermodynamic equations is reviewed. Some examples of QSAR models published in the literature are analyzed and criticized.

  7. Predicting chemically-induced skin reactions. Part I: QSAR models of skin sensitization and their application to identify potentially hazardous compounds

    Science.gov (United States)

    Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander

    2015-01-01

    Repetitive exposure to a chemical agent can induce an immune reaction in inherently susceptible individuals that leads to skin sensitization. Although many chemicals have been reported as skin sensitizers, there have been very few rigorously validated QSAR models with defined applicability domains (AD) that were developed using a large group of chemically diverse compounds. In this study, we have aimed to compile, curate, and integrate the largest publicly available dataset related to chemically-induced skin sensitization, use this data to generate rigorously validated and QSAR models for skin sensitization, and employ these models as a virtual screening tool for identifying putative sensitizers among environmental chemicals. We followed best practices for model building and validation implemented with our predictive QSAR workflow using random forest modeling technique in combination with SiRMS and Dragon descriptors. The Correct Classification Rate (CCR) for QSAR models discriminating sensitizers from non-sensitizers were 71–88% when evaluated on several external validation sets, within a broad AD, with positive (for sensitizers) and negative (for non-sensitizers) predicted rates of 85% and 79% respectively. When compared to the skin sensitization module included in the OECD QSAR toolbox as well as to the skin sensitization model in publicly available VEGA software, our models showed a significantly higher prediction accuracy for the same sets of external compounds as evaluated by Positive Predicted Rate, Negative Predicted Rate, and CCR. These models were applied to identify putative chemical hazards in the ScoreCard database of possible skin or sense organ toxicants as primary candidates for experimental validation. PMID:25560674

  8. Does Rational Selection of Training and Test Sets Improve the Outcome of QSAR Modeling?

    Science.gov (United States)

    Prior to using a quantitative structure activity relationship (QSAR) model for external predictions, its predictive power should be established and validated. In the absence of a true external dataset, the best way to validate the predictive ability of a model is to perform its s...

  9. QSAR Modeling and Prediction of Drug-Drug Interactions.

    Science.gov (United States)

    Zakharov, Alexey V; Varlamova, Ekaterina V; Lagunin, Alexey A; Dmitriev, Alexander V; Muratov, Eugene N; Fourches, Denis; Kuz'min, Victor E; Poroikov, Vladimir V; Tropsha, Alexander; Nicklaus, Marc C

    2016-02-01

    Severe adverse drug reactions (ADRs) are the fourth leading cause of fatality in the U.S. with more than 100,000 deaths per year. As up to 30% of all ADRs are believed to be caused by drug-drug interactions (DDIs), typically mediated by cytochrome P450s, possibilities to predict DDIs from existing knowledge are important. We collected data from public sources on 1485, 2628, 4371, and 27,966 possible DDIs mediated by four cytochrome P450 isoforms 1A2, 2C9, 2D6, and 3A4 for 55, 73, 94, and 237 drugs, respectively. For each of these data sets, we developed and validated QSAR models for the prediction of DDIs. As a unique feature of our approach, the interacting drug pairs were represented as binary chemical mixtures in a 1:1 ratio. We used two types of chemical descriptors: quantitative neighborhoods of atoms (QNA) and simplex descriptors. Radial basis functions with self-consistent regression (RBF-SCR) and random forest (RF) were utilized to build QSAR models predicting the likelihood of DDIs for any pair of drug molecules. Our models showed balanced accuracy of 72-79% for the external test sets with a coverage of 81.36-100% when a conservative threshold for the model's applicability domain was applied. We generated virtually all possible binary combinations of marketed drugs and employed our models to identify drug pairs predicted to be instances of DDI. More than 4500 of these predicted DDIs that were not found in our training sets were confirmed by data from the DrugBank database.

  10. Estimation of the chemical-induced eye injury using a Weight-of-Evidence (WoE) battery of 21 artificial neural network (ANN) c-QSAR models (QSAR-21): part II: corrosion potential.

    Science.gov (United States)

    Verma, Rajeshwar P; Matthews, Edwin J

    2015-03-01

    This is part II of an in silico investigation of chemical-induced eye injury that was conducted at FDA's CFSAN. Serious eye damage caused by chemical (eye corrosion) is assessed using the rabbit Draize test, and this endpoint is an essential part of hazard identification and labeling of industrial and consumer products to ensure occupational and consumer safety. There is an urgent need to develop an alternative to the Draize test because EU's 7th amendment to the Cosmetic Directive (EC, 2003; 76/768/EEC) and recast Regulation now bans animal testing on all cosmetic product ingredients and EU's REACH Program limits animal testing for chemicals in commerce. Although in silico methods have been reported for eye irritation (reversible damage), QSARs specific for eye corrosion (irreversible damage) have not been published. This report describes the development of 21 ANN c-QSAR models (QSAR-21) for assessing eye corrosion potential of chemicals using a large and diverse CFSAN data set of 504 chemicals, ADMET Predictor's three sensitivity analyses and ANNE classification functionalities with 20% test set selection from seven different methods. QSAR-21 models were internally and externally validated and exhibited high predictive performance: average statistics for the training, verification, and external test sets of these models were 96/96/94% sensitivity and 91/91/90% specificity. Copyright © 2014 Elsevier Inc. All rights reserved.

  11. Transfer and Multi-task Learning in QSAR Modeling: Advances and Challenges

    Directory of Open Access Journals (Sweden)

    Rodolfo S. Simões

    2018-02-01

    Full Text Available Medicinal chemistry projects involve some steps aiming to develop a new drug, such as the analysis of biological targets related to a given disease, the discovery and the development of drug candidates for these targets, performing parallel biological tests to validate the drug effectiveness and side effects. Approaches as quantitative study of activity-structure relationships (QSAR involve the construction of predictive models that relate a set of descriptors of a chemical compound series and its biological activities with respect to one or more targets in the human body. Datasets used to perform QSAR analyses are generally characterized by a small number of samples and this makes them more complex to build accurate predictive models. In this context, transfer and multi-task learning techniques are very suitable since they take information from other QSAR models to the same biological target, reducing efforts and costs for generating new chemical compounds. Therefore, this review will present the main features of transfer and multi-task learning studies, as well as some applications and its potentiality in drug design projects.

  12. QSAR analysis of salicylamide isosteres with the use of quantum chemical molecular descriptors.

    Science.gov (United States)

    Dolezal, R; Van Damme, S; Bultinck, P; Waisser, K

    2009-02-01

    Quantitative relationships between the molecular structure and the biological activity of 49 isosteric salicylamide derivatives as potential antituberculotics with a new mechanism of action against three Mycobacterial strains were investigated. The molecular structures were represented by quantum chemical B3LYP/6-31G( *) based molecular descriptors. A resulting set of 220 molecular descriptors, including especially electronic properties, was statistically analyzed using multiple linear regression, resulting in acceptable and robust QSAR models. The best QSAR model was found for Mycobacterium tuberculosis (r(2)=0.92; q(2)=0.89), and somewhat less good QSAR models were found for Mycobacterium avium (r(2)=0.84; q(2)=0.78) and Mycobacterium kansasii (r(2)=0.80; q(2)=0.56). All QSAR models were cross-validated using the leave-10-out procedure.

  13. Towards interoperable and reproducible QSAR analyses: Exchange of datasets.

    Science.gov (United States)

    Spjuth, Ola; Willighagen, Egon L; Guha, Rajarshi; Eklund, Martin; Wikberg, Jarl Es

    2010-06-30

    QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join, extend, combine datasets and hence work collectively, but

  14. Towards interoperable and reproducible QSAR analyses: Exchange of datasets

    Directory of Open Access Journals (Sweden)

    Spjuth Ola

    2010-06-01

    Full Text Available Abstract Background QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. Results We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Conclusions Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join

  15. Development of an ecotoxicity QSAR model for the KAshinhou Tool for Ecotoxicity (KATE) system, March 2009 version.

    Science.gov (United States)

    Furuhama, A; Toida, T; Nishikawa, N; Aoki, Y; Yoshioka, Y; Shiraishi, H

    2010-07-01

    The KAshinhou Tool for Ecotoxicity (KATE) system, including ecotoxicity quantitative structure-activity relationship (QSAR) models, was developed by the Japanese National Institute for Environmental Studies (NIES) using the database of aquatic toxicity results gathered by the Japanese Ministry of the Environment and the US EPA fathead minnow database. In this system chemicals can be entered according to their one-dimensional structures and classified by substructure. The QSAR equations for predicting the toxicity of a chemical compound assume a linear correlation between its log P value and its aquatic toxicity. KATE uses a structural domain called C-judgement, defined by the substructures of specified functional groups in the QSAR models. Internal validation by the leave-one-out method confirms that the QSAR equations, with r(2 )> 0.7, RMSE 5, give acceptable q(2) values. Such external validation indicates that a group of chemicals with an in-domain of KATE C-judgements exhibits a lower root mean square error (RMSE). These findings demonstrate that the KATE system has the potential to enable chemicals to be categorised as potential hazards.

  16. QSAR models for anti-androgenic effect - a preliminary study

    DEFF Research Database (Denmark)

    Jensen, Gunde Egeskov; Nikolov, Nikolai Georgiev; Wedebye, Eva Bay

    2011-01-01

    Three modelling systems (MultiCase (R), LeadScope (R) and MDL (R) QSAR) were used for construction of androgenic receptor antagonist models. There were 923-942 chemicals in the training sets. The models were cross-validated (leave-groups-out) with concordances of 77-81%, specificity of 78...... of the model for a particular application, balance of training sets, domain definition, and cut-offs for prediction interpretation should also be taken into account. Different descriptors in the modelling systems are illustrated with hydroxyflutamide and dexamethasone as examples (a non-steroid and a steroid...

  17. Combretastatin A-4 based thiophene derivatives as antitumor agent: Development of structure activity correlation model using 3D-QSAR, pharmacophore and docking studies

    Directory of Open Access Journals (Sweden)

    Vijay K. Patel

    2017-12-01

    Full Text Available The structure and ligand based synergistic approach is being applied to design ligands more correctly. The present report discloses the combination of structure and ligand based tactics i.e., molecular docking, energetic based pharmacophore, pharmacophore and atom based 3D-QSAR modeling for the analysis of thiophene derivatives as anticancer agent. The main purpose of using structure and ligand based synergistic approach is to ascertain a correlation between structure and its biological activity. Thiophene derivatives have been found to possess cytotoxic activity in several cancer cell lines and its mechanism of action basically involves the binding to the colchicine site on β-tubulin. The structure based approach (molecular docking was performed on a series of thiophene derivatives. All the structures were docked to colchicine binding site of β tubulin for examining the binding affinity of compounds for antitumor activity. The pharmacophore and atom based 3D-QSAR modeling was accomplished on a series of thiophene (32 compounds analogues. Five-point common pharmacophore hypotheses (AAAAR.38 were selected for alignment of all compounds. The atom based 3D-QSAR models were developed by selection of 23 compounds as training set and 9 compounds as test set, demonstrated good partial least squares statistical results. The generated common pharmacophore hypothesis and 3D-QSAR models were validated further externally by measuring the activity of database compounds and assessing it with actual activity. The common pharmacophore hypothesis AAAAR.38 resulted in a 3D-QSAR model with excellent PLSs data for factor two characterized by the best predication coefficient Q2 (cross validated r2 (0.7213, regression R2 (0.8311, SD (0.3672, F (49.2, P (1.89E-08, RMSE (0.3864, Stability (0.8702, Pearson-r (0.8722. The results of these molecular modeling studies i.e., molecular docking, energetic based pharmacophore, pharmacophore and atom based 3D-QSAR modeling

  18. Quantitative structure-activity relationship (QSAR) for insecticides: development of predictive in vivo insecticide activity models.

    Science.gov (United States)

    Naik, P K; Singh, T; Singh, H

    2009-07-01

    Quantitative structure-activity relationship (QSAR) analyses were performed independently on data sets belonging to two groups of insecticides, namely the organophosphates and carbamates. Several types of descriptors including topological, spatial, thermodynamic, information content, lead likeness and E-state indices were used to derive quantitative relationships between insecticide activities and structural properties of chemicals. A systematic search approach based on missing value, zero value, simple correlation and multi-collinearity tests as well as the use of a genetic algorithm allowed the optimal selection of the descriptors used to generate the models. The QSAR models developed for both organophosphate and carbamate groups revealed good predictability with r(2) values of 0.949 and 0.838 as well as [image omitted] values of 0.890 and 0.765, respectively. In addition, a linear correlation was observed between the predicted and experimental LD(50) values for the test set data with r(2) of 0.871 and 0.788 for both the organophosphate and carbamate groups, indicating that the prediction accuracy of the QSAR models was acceptable. The models were also tested successfully from external validation criteria. QSAR models developed in this study should help further design of novel potent insecticides.

  19. Application of QSAR models in analysis of antibacterial activity of some benzimidazole derivatives against Sarcina lutea

    Directory of Open Access Journals (Sweden)

    Podunavac-Kuzmanović Sanja O.

    2013-01-01

    Full Text Available In the present paper, a quantitative structure activity relationship (QSAR has been carried out on a series of 2-methyl and 2-aminobenzimidazole derivatives to identify the lipophilicity requirements for their inhibitory activity against bacteria Sarcina lutea. The tested compounds displayed in vitro antibacterial activity and minimum inhibitory concentration (MIC was determined for all compounds. The partition coefficients of the studied compounds were measured by the shake flask method (log P and by theoretical calculation (Clog P. The relationships between lipophilicity descriptors and antibacterial activities were investigated and the mathematical models have been developed as a calibration models for predicting the inhibitory activity of this class of compounds. The models were validated by leave-one-out (LOO technique as well as by the calculation of statistical parameters for the established models. Therefore, QSAR analysis reveals that lipophilicity descriptor govern the inhibitory activity of benzimidazoles studied against Sarcina lutea.

  20. 4D-Fingerprint Categorical QSAR Models for Skin Sensitization Based on Classification Local Lymph Node Assay Measures

    Science.gov (United States)

    Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.

    2008-01-01

    Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934

  1. Determination and importance of temperature dependence of retention coefficient (RPHPLC) in QSAR model of nitrazepams' partition coefficient in bile acid micelles.

    Science.gov (United States)

    Posa, Mihalj; Pilipović, Ana; Lalić, Mladena; Popović, Jovan

    2011-02-15

    Linear dependence between temperature (t) and retention coefficient (k, reversed phase HPLC) of bile acids is obtained. Parameters (a, intercept and b, slope) of the linear function k=f(t) highly correlate with bile acids' structures. Investigated bile acids form linear congeneric groups on a principal component (calculated from k=f(t)) score plot that are in accordance with conformations of the hydroxyl and oxo groups in a bile acid steroid skeleton. Partition coefficient (K(p)) of nitrazepam in bile acids' micelles is investigated. Nitrazepam molecules incorporated in micelles show modified bioavailability (depo effect, higher permeability, etc.). Using multiple linear regression method QSAR models of nitrazepams' partition coefficient, K(p) are derived on the temperatures of 25°C and 37°C. For deriving linear regression models on both temperatures experimentally obtained lipophilicity parameters are included (PC1 from data k=f(t)) and in silico descriptors of the shape of a molecule while on the higher temperature molecular polarisation is introduced. This indicates the fact that the incorporation mechanism of nitrazepam in BA micelles changes on the higher temperatures. QSAR models are derived using partial least squares method as well. Experimental parameters k=f(t) are shown to be significant predictive variables. Both QSAR models are validated using cross validation and internal validation method. PLS models have slightly higher predictive capability than MLR models. Copyright © 2010 Elsevier B.V. All rights reserved.

  2. QSAR Analysis of 2-Amino or 2-Methyl-1-Substituted Benzimidazoles Against Pseudomonas aeruginosa

    Science.gov (United States)

    Podunavac-Kuzmanović, Sanja O.; Cvetković, Dragoljub D.; Barna, Dijana J.

    2009-01-01

    A set of benzimidazole derivatives were tested for their inhibitory activities against the Gram-negative bacterium Pseudomonas aeruginosa and minimum inhibitory concentrations were determined for all the compounds. Quantitative structure activity relationship (QSAR) analysis was applied to fourteen of the abovementioned derivatives using a combination of various physicochemical, steric, electronic, and structural molecular descriptors. A multiple linear regression (MLR) procedure was used to model the relationships between molecular descriptors and the antibacterial activity of the benzimidazole derivatives. The stepwise regression method was used to derive the most significant models as a calibration model for predicting the inhibitory activity of this class of molecules. The best QSAR models were further validated by a leave one out technique as well as by the calculation of statistical parameters for the established theoretical models. To confirm the predictive power of the models, an external set of molecules was used. High agreement between experimental and predicted inhibitory values, obtained in the validation procedure, indicated the good quality of the derived QSAR models. PMID:19468332

  3. A novel QSAR model of Salmonella mutagenicity and its application in the safety assessment of drug impurities

    International Nuclear Information System (INIS)

    Valencia, Antoni; Prous, Josep; Mora, Oscar; Sadrieh, Nakissa; Valerio, Luis G.

    2013-01-01

    As indicated in ICH M7 draft guidance, in silico predictive tools including statistically-based QSARs and expert analysis may be used as a computational assessment for bacterial mutagenicity for the qualification of impurities in pharmaceuticals. To address this need, we developed and validated a QSAR model to predict Salmonella t. mutagenicity (Ames assay outcome) of pharmaceutical impurities using Prous Institute's Symmetry℠, a new in silico solution for drug discovery and toxicity screening, and the Mold2 molecular descriptor package (FDA/NCTR). Data was sourced from public benchmark databases with known Ames assay mutagenicity outcomes for 7300 chemicals (57% mutagens). Of these data, 90% was used to train the model and the remaining 10% was set aside as a holdout set for validation. The model's applicability to drug impurities was tested using a FDA/CDER database of 951 structures, of which 94% were found within the model's applicability domain. The predictive performance of the model is acceptable for supporting regulatory decision-making with 84 ± 1% sensitivity, 81 ± 1% specificity, 83 ± 1% concordance and 79 ± 1% negative predictivity based on internal cross-validation, while the holdout dataset yielded 83% sensitivity, 77% specificity, 80% concordance and 78% negative predictivity. Given the importance of having confidence in negative predictions, an additional external validation of the model was also carried out, using marketed drugs known to be Ames-negative, and obtained 98% coverage and 81% specificity. Additionally, Ames mutagenicity data from FDA/CFSAN was used to create another data set of 1535 chemicals for external validation of the model, yielding 98% coverage, 73% sensitivity, 86% specificity, 81% concordance and 84% negative predictivity. - Highlights: • A new in silico QSAR model to predict Ames mutagenicity is described. • The model is extensively validated with chemicals from the FDA and the public domain. • Validation tests

  4. Modelling the effect of structural QSAR parameters on skin penetration using genetic programming

    International Nuclear Information System (INIS)

    Chung, K K; Do, D Q

    2010-01-01

    In order to model relationships between chemical structures and biological effects in quantitative structure–activity relationship (QSAR) data, an alternative technique of artificial intelligence computing—genetic programming (GP)—was investigated and compared to the traditional method—statistical. GP, with the primary advantage of generating mathematical equations, was employed to model QSAR data and to define the most important molecular descriptions in QSAR data. The models predicted by GP agreed with the statistical results, and the most predictive models of GP were significantly improved when compared to the statistical models using ANOVA. Recently, artificial intelligence techniques have been applied widely to analyse QSAR data. With the capability of generating mathematical equations, GP can be considered as an effective and efficient method for modelling QSAR data

  5. Effect of dissolved organic matter on pre-equilibrium passive sampling: A predictive QSAR modeling study.

    Science.gov (United States)

    Lin, Wei; Jiang, Ruifen; Shen, Yong; Xiong, Yaxin; Hu, Sizi; Xu, Jianqiao; Ouyang, Gangfeng

    2018-04-13

    Pre-equilibrium passive sampling is a simple and promising technique for studying sampling kinetics, which is crucial to determine the distribution, transfer and fate of hydrophobic organic compounds (HOCs) in environmental water and organisms. Environmental water samples contain complex matrices that complicate the traditional calibration process for obtaining the accurate rate constants. This study proposed a QSAR model to predict the sampling rate constants of HOCs (polycyclic aromatic hydrocarbons (PAHs), polychlorinated biphenyls (PCBs) and pesticides) in aqueous systems containing complex matrices. A homemade flow-through system was established to simulate an actual aqueous environment containing dissolved organic matter (DOM) i.e. humic acid (HA) and (2-Hydroxypropyl)-β-cyclodextrin (β-HPCD)), and to obtain the experimental rate constants. Then, a quantitative structure-activity relationship (QSAR) model using Genetic Algorithm-Multiple Linear Regression (GA-MLR) was found to correlate the experimental rate constants to the system state including physicochemical parameters of the HOCs and DOM which were calculated and selected as descriptors by Density Functional Theory (DFT) and Chem 3D. The experimental results showed that the rate constants significantly increased as the concentration of DOM increased, and the enhancement factors of 70-fold and 34-fold were observed for the HOCs in HA and β-HPCD, respectively. The established QSAR model was validated as credible (R Adj. 2 =0.862) and predictable (Q 2 =0.835) in estimating the rate constants of HOCs for complex aqueous sampling, and a probable mechanism was developed by comparison to the reported theoretical study. The present study established a QSAR model of passive sampling rate constants and calibrated the effect of DOM on the sampling kinetics. Copyright © 2018 Elsevier B.V. All rights reserved.

  6. Quantitative structure-activity relationship (QSAR) models for polycyclic aromatic hydrocarbons (PAHs) dissipation in rhizosphere based on molecular structure and effect size

    International Nuclear Information System (INIS)

    Ma Bin; Chen Huaihai; Xu Minmin; Hayat, Tahir; He Yan; Xu Jianming

    2010-01-01

    Rhizoremediation is a significant form of bioremediation for polycyclic aromatic hydrocarbons (PAHs). This study examined the role of molecular structure in determining the rhizosphere effect on PAHs dissipation. Effect size in meta-analysis was employed as activity dataset for building quantitative structure-activity relationship (QSAR) models and accumulative effect sizes of 16 PAHs were used for validation of these models. Based on the genetic algorithm combined with partial least square regression, models for comprehensive dataset, Poaceae dataset, and Fabaceae dataset were built. The results showed that information indices, calculated as information content of molecules based on the calculation of equivalence classes from the molecular graph, were the most important molecular structural indices for QSAR models of rhizosphere effect on PAHs dissipation. The QSAR model, based on the molecular structure indices and effect size, has potential to be used in studying and predicting the rhizosphere effect of PAHs dissipation. - Effect size based on meta-analysis was used for building PAHs dissipation quantitative structure-activity relationship (QSAR) models.

  7. Quantitative structure-activity relationship (QSAR) models for polycyclic aromatic hydrocarbons (PAHs) dissipation in rhizosphere based on molecular structure and effect size

    Energy Technology Data Exchange (ETDEWEB)

    Ma Bin; Chen Huaihai; Xu Minmin; Hayat, Tahir [Zhejiang Provincial Key Laboratory of Subtropical Soil and Plant Nutrition, College of Environmental and Natural Resource Sciences, Zhejiang University, Hangzhou 310029 (China); He Yan, E-mail: yhe2006@zju.edu.c [Zhejiang Provincial Key Laboratory of Subtropical Soil and Plant Nutrition, College of Environmental and Natural Resource Sciences, Zhejiang University, Hangzhou 310029 (China); Xu Jianming, E-mail: jmxu@zju.edu.c [Zhejiang Provincial Key Laboratory of Subtropical Soil and Plant Nutrition, College of Environmental and Natural Resource Sciences, Zhejiang University, Hangzhou 310029 (China)

    2010-08-15

    Rhizoremediation is a significant form of bioremediation for polycyclic aromatic hydrocarbons (PAHs). This study examined the role of molecular structure in determining the rhizosphere effect on PAHs dissipation. Effect size in meta-analysis was employed as activity dataset for building quantitative structure-activity relationship (QSAR) models and accumulative effect sizes of 16 PAHs were used for validation of these models. Based on the genetic algorithm combined with partial least square regression, models for comprehensive dataset, Poaceae dataset, and Fabaceae dataset were built. The results showed that information indices, calculated as information content of molecules based on the calculation of equivalence classes from the molecular graph, were the most important molecular structural indices for QSAR models of rhizosphere effect on PAHs dissipation. The QSAR model, based on the molecular structure indices and effect size, has potential to be used in studying and predicting the rhizosphere effect of PAHs dissipation. - Effect size based on meta-analysis was used for building PAHs dissipation quantitative structure-activity relationship (QSAR) models.

  8. Estimation of the chemical-induced eye injury using a weight-of-evidence (WoE) battery of 21 artificial neural network (ANN) c-QSAR models (QSAR-21): part I: irritation potential.

    Science.gov (United States)

    Verma, Rajeshwar P; Matthews, Edwin J

    2015-03-01

    Evaluation of potential chemical-induced eye injury through irritation and corrosion is required to ensure occupational and consumer safety for industrial, household and cosmetic ingredient chemicals. The historical method for evaluating eye irritant and corrosion potential of chemicals is the rabbit Draize test. However, the Draize test is controversial and its use is diminishing - the EU 7th Amendment to the Cosmetic Directive (76/768/EEC) and recast Regulation now bans marketing of new cosmetics having animal testing of their ingredients and requires non-animal alternative tests for safety assessments. Thus, in silico and/or in vitro tests are advocated. QSAR models for eye irritation have been reported for several small (congeneric) data sets; however, large global models have not been described. This report describes FDA/CFSAN's development of 21 ANN c-QSAR models (QSAR-21) to predict eye irritation using the ADMET Predictor program and a diverse training data set of 2928 chemicals. The 21 models had external (20% test set) and internal validation and average training/verification/test set statistics were: 88/88/85(%) sensitivity and 82/82/82(%) specificity, respectively. The new method utilized multiple artificial neural network (ANN) molecular descriptor selection functionalities to maximize the applicability domain of the battery. The eye irritation models will be used to provide information to fill the critical data gaps for the safety assessment of cosmetic ingredient chemicals. Copyright © 2014 Elsevier Inc. All rights reserved.

  9. Transfer and Multi-task Learning in QSAR Modeling: Advances and Challenges

    OpenAIRE

    Rodolfo S. Simões; Vinicius G. Maltarollo; Patricia R. Oliveira; Kathia M. Honorio; Kathia M. Honorio

    2018-01-01

    Medicinal chemistry projects involve some steps aiming to develop a new drug, such as the analysis of biological targets related to a given disease, the discovery and the development of drug candidates for these targets, performing parallel biological tests to validate the drug effectiveness and side effects. Approaches as quantitative study of activity-structure relationships (QSAR) involve the construction of predictive models that relate a set of descriptors of a chemical compound series a...

  10. Quasi-QSAR for mutagenic potential of multi-walled carbon-nanotubes.

    Science.gov (United States)

    Toropov, Andrey A; Toropova, Alla P

    2015-04-01

    Available on the Internet, the CORAL software (http://www.insilico.eu/coral) has been used to build up quasi-quantitative structure-activity relationships (quasi-QSAR) for prediction of mutagenic potential of multi-walled carbon-nanotubes (MWCNTs). In contrast with the previous models built up by CORAL which were based on representation of the molecular structure by simplified molecular input-line entry system (SMILES) the quasi-QSARs based on the representation of conditions (not on the molecular structure) such as concentration, presence (absence) S9 mix, the using (or without the using) of preincubation were encoded by so-called quasi-SMILES. The statistical characteristics of these models (quasi-QSARs) for three random splits into the visible training set and test set and invisible validation set are the following: (i) split 1: n=13, r(2)=0.8037, q(2)=0.7260, s=0.033, F=45 (training set); n=5, r(2)=0.9102, s=0.071 (test set); n=6, r(2)=0.7627, s=0.044 (validation set); (ii) split 2: n=13, r(2)=0.6446, q(2)=0.4733, s=0.045, F=20 (training set); n=5, r(2)=0.6785, s=0.054 (test set); n=6, r(2)=0.9593, s=0.032 (validation set); and (iii) n=14, r(2)=0.8087, q(2)=0.6975, s=0.026, F=51 (training set); n=5, r(2)=0.9453, s=0.074 (test set); n=5, r(2)=0.8951, s=0.052 (validation set). Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. A novel QSAR model of Salmonella mutagenicity and its application in the safety assessment of drug impurities

    Energy Technology Data Exchange (ETDEWEB)

    Valencia, Antoni; Prous, Josep; Mora, Oscar [Prous Institute for Biomedical Research, Rambla de Catalunya, 135, 3-2, Barcelona 08008 (Spain); Sadrieh, Nakissa [Office of Pharmaceutical Science, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002 (United States); Valerio, Luis G., E-mail: luis.valerio@fda.hhs.gov [Office of Pharmaceutical Science, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993-0002 (United States)

    2013-12-15

    As indicated in ICH M7 draft guidance, in silico predictive tools including statistically-based QSARs and expert analysis may be used as a computational assessment for bacterial mutagenicity for the qualification of impurities in pharmaceuticals. To address this need, we developed and validated a QSAR model to predict Salmonella t. mutagenicity (Ames assay outcome) of pharmaceutical impurities using Prous Institute's Symmetry℠, a new in silico solution for drug discovery and toxicity screening, and the Mold2 molecular descriptor package (FDA/NCTR). Data was sourced from public benchmark databases with known Ames assay mutagenicity outcomes for 7300 chemicals (57% mutagens). Of these data, 90% was used to train the model and the remaining 10% was set aside as a holdout set for validation. The model's applicability to drug impurities was tested using a FDA/CDER database of 951 structures, of which 94% were found within the model's applicability domain. The predictive performance of the model is acceptable for supporting regulatory decision-making with 84 ± 1% sensitivity, 81 ± 1% specificity, 83 ± 1% concordance and 79 ± 1% negative predictivity based on internal cross-validation, while the holdout dataset yielded 83% sensitivity, 77% specificity, 80% concordance and 78% negative predictivity. Given the importance of having confidence in negative predictions, an additional external validation of the model was also carried out, using marketed drugs known to be Ames-negative, and obtained 98% coverage and 81% specificity. Additionally, Ames mutagenicity data from FDA/CFSAN was used to create another data set of 1535 chemicals for external validation of the model, yielding 98% coverage, 73% sensitivity, 86% specificity, 81% concordance and 84% negative predictivity. - Highlights: • A new in silico QSAR model to predict Ames mutagenicity is described. • The model is extensively validated with chemicals from the FDA and the public domain.

  12. The great descriptor melting pot: mixing descriptors for the common good of QSAR models.

    Science.gov (United States)

    Tseng, Yufeng J; Hopfinger, Anton J; Esposito, Emilio Xavier

    2012-01-01

    The usefulness and utility of QSAR modeling depends heavily on the ability to estimate the values of molecular descriptors relevant to the endpoints of interest followed by an optimized selection of descriptors to form the best QSAR models from a representative set of the endpoints of interest. The performance of a QSAR model is directly related to its molecular descriptors. QSAR modeling, specifically model construction and optimization, has benefited from its ability to borrow from other unrelated fields, yet the molecular descriptors that form QSAR models have remained basically unchanged in both form and preferred usage. There are many types of endpoints that require multiple classes of descriptors (descriptors that encode 1D through multi-dimensional, 4D and above, content) needed to most fully capture the molecular features and interactions that contribute to the endpoint. The advantages of QSAR models constructed from multiple, and different, descriptor classes have been demonstrated in the exploration of markedly different, and principally biological systems and endpoints. Multiple examples of such QSAR applications using different descriptor sets are described and that examined. The take-home-message is that a major part of the future of QSAR analysis, and its application to modeling biological potency, ADME-Tox properties, general use in virtual screening applications, as well as its expanding use into new fields for building QSPR models, lies in developing strategies that combine and use 1D through nD molecular descriptors.

  13. Integration of QSAR and in vitro toxicology.

    Science.gov (United States)

    Barratt, M D

    1998-01-01

    The principles of quantitative structure-activity relationships (QSAR) are based on the premise that the properties of a chemical are implicit in its molecular structure. Therefore, if a mechanistic hypothesis can be proposed linking a group of related chemicals with a particular toxic end point, the hypothesis can be used to define relevant parameters to establish a QSAR. Ways in which QSAR and in vitro toxicology can complement each other in development of alternatives to live animal experiments are described and illustrated by examples from acute toxicological end points. Integration of QSAR and in vitro methods is examined in the context of assessing mechanistic competence and improving the design of in vitro assays and the development of prediction models. The nature of biological variability is explored together with its implications for the selection of sets of chemicals for test development, optimization, and validation. Methods are described to support the use of data from in vivo tests that do not meet today's stringent requirements of acceptability. Integration of QSAR and in vitro methods into strategic approaches for the replacement, reduction, and refinement of the use of animals is described with examples. PMID:9599692

  14. Improving the applicability of (Q)SARs for percutaneous penetration in regulatory risk assessment.

    Science.gov (United States)

    Bouwman, T; Cronin, M T D; Bessems, J G M; van de Sandt, J J M

    2008-04-01

    The new regulatory framework REACH (Registration, Evaluation, and Authorisation of Chemicals) foresees the use of non-testing approaches, such as read-across, chemical categories, structure-activity relationships (SARs) and quantitative structure-activity relationships (QSARs). Although information on skin absorption data are not a formal requirement under REACH, data on dermal absorption are an integral part of risk assessment of substances/products to which man is predominantly exposed via the dermal route. In this study, we assess the present applicability of publicly available QSARs on skin absorption for risk assessment purposes. We explicitly did not aim to give scientific judgments on individual QSARs. A total of 33 QSARs selected from the public domain were evaluated using the OECD (Organisation for Economic Co-operation and Development) Principles for the Validation of (Q)SAR Models. Additionally, several pragmatic criteria were formulated to select QSARs that are most suitable for their use in regulatory risk assessment. Based on these criteria, four QSARs were selected. The predictivity of these QSARs was evaluated by comparing their outcomes with experimentally derived skin absorption data (for 62 compounds). The predictivity was low for three of four QSARs, whereas one model gave reasonable predictions. Several suggestions are made to increase the applicability of QSARs for skin absorption for risk assessment purposes.

  15. QSAR modeling of toxicity of diverse organic chemicals to Daphnia magna using 2D and 3D descriptors

    International Nuclear Information System (INIS)

    Kar, Supratik; Roy, Kunal

    2010-01-01

    One of the major economic alternatives to experimental toxicity testing is the use of quantitative structure-activity relationships (QSARs) which are used in formulating regulatory decisions of environmental protection agencies. In this background, we have modeled a large diverse group of 297 chemicals for their toxicity to Daphnia magna using mechanistically interpretable descriptors. Three-dimensional (3D) (electronic and spatial) and two-dimensional (2D) (topological and information content indices) descriptors along with physicochemical parameter log K o/w (n-octanol/water partition coefficient) and structural descriptors were used as predictor variables. The QSAR models were developed by stepwise multiple linear regression (MLR), partial least squares (PLS), genetic function approximation (GFA), and genetic PLS (G/PLS). All the models were validated internally and externally. Among several models developed using different chemometric tools, the best model based on both internal and external validation characteristics was a PLS equation with 7 descriptors and three latent variables explaining 67.8% leave-one-out predicted variance and 74.1% external predicted variance. The PLS model suggests that higher lipophilicity and electrophilicity, less negative charge surface area and presence of ether linkage, hydrogen bond donor groups and acetylenic carbons are responsible for greater toxicity of chemicals. The developed model may be used for prediction of toxicity, safety and risk assessment of chemicals to achieve better ecotoxicological management and prevent adverse health consequences.

  16. QSAR modeling of toxicity of diverse organic chemicals to Daphnia magna using 2D and 3D descriptors

    Energy Technology Data Exchange (ETDEWEB)

    Kar, Supratik [Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Raja S C Mullick Road, Kolkata 700032 (India); Roy, Kunal, E-mail: kunalroy_in@yahoo.com [Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Raja S C Mullick Road, Kolkata 700032 (India)

    2010-05-15

    One of the major economic alternatives to experimental toxicity testing is the use of quantitative structure-activity relationships (QSARs) which are used in formulating regulatory decisions of environmental protection agencies. In this background, we have modeled a large diverse group of 297 chemicals for their toxicity to Daphnia magna using mechanistically interpretable descriptors. Three-dimensional (3D) (electronic and spatial) and two-dimensional (2D) (topological and information content indices) descriptors along with physicochemical parameter log K{sub o/w} (n-octanol/water partition coefficient) and structural descriptors were used as predictor variables. The QSAR models were developed by stepwise multiple linear regression (MLR), partial least squares (PLS), genetic function approximation (GFA), and genetic PLS (G/PLS). All the models were validated internally and externally. Among several models developed using different chemometric tools, the best model based on both internal and external validation characteristics was a PLS equation with 7 descriptors and three latent variables explaining 67.8% leave-one-out predicted variance and 74.1% external predicted variance. The PLS model suggests that higher lipophilicity and electrophilicity, less negative charge surface area and presence of ether linkage, hydrogen bond donor groups and acetylenic carbons are responsible for greater toxicity of chemicals. The developed model may be used for prediction of toxicity, safety and risk assessment of chemicals to achieve better ecotoxicological management and prevent adverse health consequences.

  17. QSAR models for prediction of chromatographic behavior of homologous Fab variants.

    Science.gov (United States)

    Robinson, Julie R; Karkov, Hanne S; Woo, James A; Krogh, Berit O; Cramer, Steven M

    2017-06-01

    While quantitative structure activity relationship (QSAR) models have been employed successfully for the prediction of small model protein chromatographic behavior, there have been few reports to date on the use of this methodology for larger, more complex proteins. Recently our group generated focused libraries of antibody Fab fragment variants with different combinations of surface hydrophobicities and electrostatic potentials, and demonstrated that the unique selectivities of multimodal resins can be exploited to separate these Fab variants. In this work, results from linear salt gradient experiments with these Fabs were employed to develop QSAR models for six chromatographic systems, including multimodal (Capto MMC, Nuvia cPrime, and two novel ligand prototypes), hydrophobic interaction chromatography (HIC; Capto Phenyl), and cation exchange (CEX; CM Sepharose FF) resins. The models utilized newly developed "local descriptors" to quantify changes around point mutations in the Fab libraries as well as novel cluster descriptors recently introduced by our group. Subsequent rounds of feature selection and linearized machine learning algorithms were used to generate robust, well-validated models with high training set correlations (R 2  > 0.70) that were well suited for predicting elution salt concentrations in the various systems. The developed models then were used to predict the retention of a deamidated Fab and isotype variants, with varying success. The results represent the first successful utilization of QSAR for the prediction of chromatographic behavior of complex proteins such as Fab fragments in multimodal chromatographic systems. The framework presented here can be employed to facilitate process development for the purification of biological products from product-related impurities by in silico screening of resin alternatives. Biotechnol. Bioeng. 2017;114: 1231-1240. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  18. Statistical molecular design of balanced compound libraries for QSAR modeling.

    Science.gov (United States)

    Linusson, A; Elofsson, M; Andersson, I E; Dahlgren, M K

    2010-01-01

    A fundamental step in preclinical drug development is the computation of quantitative structure-activity relationship (QSAR) models, i.e. models that link chemical features of compounds with activities towards a target macromolecule associated with the initiation or progression of a disease. QSAR models are computed by combining information on the physicochemical and structural features of a library of congeneric compounds, typically assembled from two or more building blocks, and biological data from one or more in vitro assays. Since the models provide information on features affecting the compounds' biological activity they can be used as guides for further optimization. However, in order for a QSAR model to be relevant to the targeted disease, and drug development in general, the compound library used must contain molecules with balanced variation of the features spanning the chemical space believed to be important for interaction with the biological target. In addition, the assays used must be robust and deliver high quality data that are directly related to the function of the biological target and the associated disease state. In this review, we discuss and exemplify the concept of statistical molecular design (SMD) in the selection of building blocks and final synthetic targets (i.e. compounds to synthesize) to generate information-rich, balanced libraries for biological testing and computation of QSAR models.

  19. Development of QSAR model for immunomodulatory activity of natural coumarinolignoids

    Directory of Open Access Journals (Sweden)

    Dharmendra K Yadav

    2010-07-01

    Full Text Available Dharmendra K Yadav, Abha Meena, Ankit Srivastava, D Chanda, Feroz Khan, SK ChattopadhyayMetabolic and Structural Biology Department, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, PO-CIMAP, IndiaAbstract: Immunomodulation is the process of alteration in immune response due to foreign intrusion of molecules inside the body. Along with the available drugs, a large number of herbal drugs are promoted in traditional Indian treatments, for their immunomodulating activity. Natural coumarinolignoids isolated from the seeds of Cleome viscose have been recognized as having hepatoprotective action and have recently been tested preclinically for their immunomodulatory activity affecting both cell-mediated and humoral immune response. To explore the immunomodulatory compound from derivatives of coumarinolignoids, a quantitative structure activity relationship (QSAR and molecular docking studies were performed. Theoretical results are in accord with the in vivo experimental data studied on Swiss albino mice. Immunostimulatory activity was predicted through QSAR model, developed by forward feed multiple linear regression method with leave-one-out approach. Relationship correlating measure of QSAR model was 99% (R2 = 0.99 and predictive accuracy was 96% (RCV2 = 0.96. QSAR studies indicate that dipole moment, steric energy, amide group count, lambda max (UV-visible, and molar refractivity correlates well with biological activity, while decrease in dipole moment, steric energy, and molar refractivity has negative correlation. Docking studies also showed strong binding affinity to immunomodulatory receptors.Keywords: coumarinolignoids, immunomodulation, docking, QSAR, regression model

  20. Exploring the QSAR's predictive truthfulness of the novel N-tuple discrete derivative indices on benchmark datasets.

    Science.gov (United States)

    Martínez-Santiago, O; Marrero-Ponce, Y; Vivas-Reyes, R; Rivera-Borroto, O M; Hurtado, E; Treto-Suarez, M A; Ramos, Y; Vergara-Murillo, F; Orozco-Ugarriza, M E; Martínez-López, Y

    2017-05-01

    Graph derivative indices (GDIs) have recently been defined over N-atoms (N = 2, 3 and 4) simultaneously, which are based on the concept of derivatives in discrete mathematics (finite difference), metaphorical to the derivative concept in classical mathematical analysis. These molecular descriptors (MDs) codify topo-chemical and topo-structural information based on the concept of the derivative of a molecular graph with respect to a given event (S) over duplex, triplex and quadruplex relations of atoms (vertices). These GDIs have been successfully applied in the description of physicochemical properties like reactivity, solubility and chemical shift, among others, and in several comparative quantitative structure activity/property relationship (QSAR/QSPR) studies. Although satisfactory results have been obtained in previous modelling studies with the aforementioned indices, it is necessary to develop new, more rigorous analysis to assess the true predictive performance of the novel structure codification. So, in the present paper, an assessment and statistical validation of the performance of these novel approaches in QSAR studies are executed, as well as a comparison with those of other QSAR procedures reported in the literature. To achieve the main aim of this research, QSARs were developed on eight chemical datasets widely used as benchmarks in the evaluation/validation of several QSAR methods and/or many different MDs (fundamentally 3D MDs). Three to seven variable QSAR models were built for each chemical dataset, according to the original dissection into training/test sets. The models were developed by using multiple linear regression (MLR) coupled with a genetic algorithm as the feature wrapper selection technique in the MobyDigs software. Each family of GDIs (for duplex, triplex and quadruplex) behaves similarly in all modelling, although there were some exceptions. However, when all families were used in combination, the results achieved were quantitatively

  1. Corrosion Inhibition of Q235A Steel in Acid Medium Using Isatin Derivatives: A Qsar Study

    International Nuclear Information System (INIS)

    Abdo M Al-Fakih; Madzlan Aziz; Abdo M Al-Fakih; Abdallah, H.H.; Hasmerya Maarof; Rosmahaida Jamaludin; Bishir Usman

    2016-01-01

    Quantitative Structure-Activity Relationship (QSAR) study was performed on 10 isatin derivatives which were reportedly used as corrosion inhibitors. Dragon software was used to calculate the molecular descriptors. Partial least square (PLS) method was used to run the regression analysis between the descriptors and the corrosion inhibition efficiencies (IE) of the inhibitors. A predictive QSAR model was developed with a correlation coefficient (r 2 cal ) of 0.9676. The model validity was assessed through internal and external validation. The results show that cross-validation regression coefficient (r 2 cv ) and prediction regression coefficient (r 2 pred ) are 0.8163 and 0.9189, respectively. The model was used to predict the IE for ten isatin derivatives. The results confirm a good stability and predictive ability of the model. Dragon-based descriptors provide a very good description of the corrosion inhibition properties of the inhibitors. The results of the QSAR study were found to be consistent with the experimental data. (author)

  2. 3D QSAR models built on structure-based alignments of Abl tyrosine kinase inhibitors.

    Science.gov (United States)

    Falchi, Federico; Manetti, Fabrizio; Carraro, Fabio; Naldini, Antonella; Maga, Giovanni; Crespan, Emmanuele; Schenone, Silvia; Bruno, Olga; Brullo, Chiara; Botta, Maurizio

    2009-06-01

    Quality QSAR: A combination of docking calculations and a statistical approach toward Abl inhibitors resulted in a 3D QSAR model, the analysis of which led to the identification of ligand portions important for affinity. New compounds designed on the basis of the model were found to have very good affinity for the target, providing further validation of the model itself.The X-ray crystallographic coordinates of the Abl tyrosine kinase domain in its active, inactive, and Src-like inactive conformations were used as targets to simulate the binding mode of a large series of pyrazolo[3,4-d]pyrimidines (known Abl inhibitors) by means of GOLD software. Receptor-based alignments provided by molecular docking calculations were submitted to a GRID-GOLPE protocol to generate 3D QSAR models. Analysis of the results showed that the models based on the inactive and Src-like inactive conformations had very poor statistical parameters, whereas the sole model based on the active conformation of Abl was characterized by significant internal and external predictive ability. Subsequent analysis of GOLPE PLS pseudo-coefficient contour plots of this model gave us a better understanding of the relationships between structure and affinity, providing suggestions for the next optimization process. On the basis of these results, new compounds were designed according to the hydrophobic and hydrogen bond donor and acceptor contours, and were found to have improved enzymatic and cellular activity with respect to parent compounds. Additional biological assays confirmed the important role of the selected compounds as inhibitors of cell proliferation in leukemia cells.

  3. Multiple QSAR models, pharmacophore pattern and molecular docking analysis for anticancer activity of α, β-unsaturated carbonyl-based compounds, oxime and oxime ether analogues

    Science.gov (United States)

    Masand, Vijay H.; El-Sayed, Nahed N. E.; Bambole, Mukesh U.; Quazi, Syed A.

    2018-04-01

    Multiple discrete quantitative structure-activity relationships (QSARs) models were constructed for the anticancer activity of α, β-unsaturated carbonyl-based compounds, oxime and oxime ether analogues with a variety of substituents like sbnd Br, sbnd OH, -OMe, etc. at different positions. A big pool of descriptors was considered for QSAR model building. Genetic algorithm (GA), available in QSARINS-Chem, was executed to choose optimum number and set of descriptors to create the multi-linear regression equations for a dataset of sixty-nine compounds. The newly developed five parametric models were subjected to exhaustive internal and external validation along with Y-scrambling using QSARINS-Chem, according to the OECD principles for QSAR model validation. The models were built using easily interpretable descriptors and accepted after confirming statistically robustness with high external predictive ability. The five parametric models were found to have R2 = 0.80 to 0.86, R2ex = 0.75 to 0.84, and CCCex = 0.85 to 0.90. The models indicate that frequency of nitrogen and oxygen atoms separated by five bonds from each other and internal electronic environment of the molecule have correlation with the anticancer activity.

  4. 2D QSAR studies of the inhibitory activity of a series of substituted purine derivatives against c-Src tyrosine kinase

    OpenAIRE

    Mukesh C. Sharma

    2016-01-01

    A series of 34 substituted purine analogues derivatives were subjected to quantitative structure-activity relationship analyses as inhibitors of c-Src tyrosine kinase. Partial least squares regression was applied to derive QSAR models, which were further validated for statistical significance by internal and external validation. The best QSAR model developed had a good predictive correlation coefficient (r2) of 0.8319, a significant cross-validated correlation coefficient (q2) of 0.7550, and ...

  5. Dependence of QSAR models on the selection of trial descriptor sets: a demonstration using nanotoxicity endpoints of decorated nanotubes.

    Science.gov (United States)

    Shao, Chi-Yu; Chen, Sing-Zuo; Su, Bo-Han; Tseng, Yufeng J; Esposito, Emilio Xavier; Hopfinger, Anton J

    2013-01-28

    Little attention has been given to the selection of trial descriptor sets when designing a QSAR analysis even though a great number of descriptor classes, and often a greater number of descriptors within a given class, are now available. This paper reports an effort to explore interrelationships between QSAR models and descriptor sets. Zhou and co-workers (Zhou et al., Nano Lett. 2008, 8 (3), 859-865) designed, synthesized, and tested a combinatorial library of 80 surface modified, that is decorated, multi-walled carbon nanotubes for their composite nanotoxicity using six endpoints all based on a common 0 to 100 activity scale. Each of the six endpoints for the 29 most nanotoxic decorated nanotubes were incorporated as the training set for this study. The study reported here includes trial descriptor sets for all possible combinations of MOE, VolSurf, and 4D-fingerprints (FP) descriptor classes, as well as including and excluding explicit spatial contributions from the nanotube. Optimized QSAR models were constructed from these multiple trial descriptor sets. It was found that (a) both the form and quality of the best QSAR models for each of the endpoints are distinct and (b) some endpoints are quite dependent upon 4D-FP descriptors of the entire nanotube-decorator complex. However, other endpoints yielded equally good models only using decorator descriptors with and without the decorator-only 4D-FP descriptors. Lastly, and most importantly, the quality, significance, and interpretation of a QSAR model were found to be critically dependent on the trial descriptor sets used within a given QSAR endpoint study.

  6. The Danish (Q)SAR Database Update Project

    DEFF Research Database (Denmark)

    Nikolov, Nikolai Georgiev; Dybdahl, Marianne; Abildgaard Rosenberg, Sine

    2013-01-01

    The Danish (Q)SAR Database is a collection of predictions from quantitative structure–activity relationship ((Q)SAR) models for over 70 environmental and human health-related endpoints (covering biodegradation, metabolism, allergy, irritation, endocrine disruption, teratogenicity, mutagenicity......, carcinogenicity and others), each of them available for 185,000 organic substances. The database has been available online since 2005 (http://qsar.food.dtu.dk). A major update project for the Danish (Q)SAR database is under way, with a new online release planned in the beginning of 2015. The updated version...... will contain more than 600,000 discrete organic structures and new, more precise predictions for all endpoints, derived by consensus algorithms from a number of state-of-the-art individual predictions. Copyright © 2013 Published by Elsevier Ireland Ltd....

  7. 5D-QSAR for spirocyclic sigma1 receptor ligands by Quasar receptor surface modeling.

    Science.gov (United States)

    Oberdorf, Christoph; Schmidt, Thomas J; Wünsch, Bernhard

    2010-07-01

    Based on a contiguous and structurally as well as biologically diverse set of 87 sigma(1) ligands, a 5D-QSAR study was conducted in which a quasi-atomistic receptor surface modeling approach (program package Quasar) was applied. The superposition of the ligands was performed with the tool Pharmacophore Elucidation (MOE-package), which takes all conformations of the ligands into account. This procedure led to four pharmacophoric structural elements with aromatic, hydrophobic, cationic and H-bond acceptor properties. Using the aligned structures a 3D-model of the ligand binding site of the sigma(1) receptor was obtained, whose general features are in good agreement with previous assumptions on the receptor structure, but revealed some novel insights since it represents the receptor surface in more detail. Thus, e.g., our model indicates the presence of an H-bond acceptor moiety in the binding site as counterpart to the ligands' cationic ammonium center, rather than a negatively charged carboxylate group. The presented QSAR model is statistically valid and represents the biological data of all tested compounds, including a test set of 21 ligands not used in the modeling process, with very good to excellent accuracy [q(2) (training set, n=66; leave 1/3 out) = 0.84, p(2) (test set, n=21)=0.64]. Moreover, the binding affinities of 13 further spirocyclic sigma(1) ligands were predicted with reasonable accuracy (mean deviation in pK(i) approximately 0.8). Thus, in addition to novel insights into the requirements for binding of spirocyclic piperidines to the sigma(1) receptor, the presented model can be used successfully in the rational design of new sigma(1) ligands. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.

  8. A QSAR Study of Environmental Estrogens Based on a Novel Variable Selection Method

    Directory of Open Access Journals (Sweden)

    Aiqian Zhang

    2012-05-01

    Full Text Available A large number of descriptors were employed to characterize the molecular structure of 53 natural, synthetic, and environmental chemicals which are suspected of disrupting endocrine functions by mimicking or antagonizing natural hormones and may thus pose a serious threat to the health of humans and wildlife. In this work, a robust quantitative structure-activity relationship (QSAR model with a novel variable selection method has been proposed for the effective estrogens. The variable selection method is based on variable interaction (VSMVI with leave-multiple-out cross validation (LMOCV to select the best subset. During variable selection, model construction and assessment, the Organization for Economic Co-operation and Development (OECD principles for regulation of QSAR acceptability were fully considered, such as using an unambiguous multiple-linear regression (MLR algorithm to build the model, using several validation methods to assessment the performance of the model, giving the define of applicability domain and analyzing the outliers with the results of molecular docking. The performance of the QSAR model indicates that the VSMVI is an effective, feasible and practical tool for rapid screening of the best subset from large molecular descriptors.

  9. Learning from Multiple Classifier Systems: Perspectives for Improving Decision Making of QSAR Models in Medicinal Chemistry.

    Science.gov (United States)

    Pham-The, Hai; Nam, Nguyen-Hai; Nga, Doan-Viet; Hai, Dang Thanh; Dieguez-Santana, Karel; Marrero-Poncee, Yovani; Castillo-Garit, Juan A; Casanola-Martin, Gerardo M; Le-Thi-Thu, Huong

    2018-02-09

    Quantitative Structure - Activity Relationship (QSAR) modeling has been widely used in medicinal chemistry and computational toxicology for many years. Today, as the amount of chemicals is increasing dramatically, QSAR methods have become pivotal for the purpose of handling the data, identifying a decision, and gathering useful information from data processing. The advances in this field have paved a way for numerous alternative approaches that require deep mathematics in order to enhance the learning capability of QSAR models. One of these directions is the use of Multiple Classifier Systems (MCSs) that potentially provide a means to exploit the advantages of manifold learning through decomposition frameworks, while improving generalization and predictive performance. In this paper, we presented MCS as a next generation of QSAR modeling techniques and discuss the chance to mining the vast number of models already published in the literature. We systematically revisited the theoretical frameworks of MCS as well as current advances in MCS application for QSAR practice. Furthermore, we illustrated our idea by describing ensemble approaches on modeling histone deacetylase (HDACs) inhibitors. We expect that our analysis would contribute to a better understanding about MCS application and its future perspectives for improving the decision making of QSAR models. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  10. Predicting chemically-induced skin reactions. Part II: QSAR models of skin permeability and the relationships between skin permeability and skin sensitization

    Energy Technology Data Exchange (ETDEWEB)

    Alves, Vinicius M. [Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220 (Brazil); Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Muratov, Eugene [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical–Chemical Institute NAS of Ukraine, Odessa 65080 (Ukraine); Fourches, Denis [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States); Strickland, Judy; Kleinstreuer, Nicole [ILS/Contractor supporting the NTP Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM), P.O. Box 13501, Research Triangle Park, NC 27709 (United States); Andrade, Carolina H. [Laboratory of Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-220 (Brazil); Tropsha, Alexander, E-mail: alex_tropsha@unc.edu [Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (United States)

    2015-04-15

    Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, we found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R{sup 2} = 0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q{sup 2}{sub ext} = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential. - Highlights: • It was compiled the largest publicly-available skin permeability dataset. • Predictive QSAR models were developed for skin permeability. • No concordance between skin

  11. Predicting chemically-induced skin reactions. Part II: QSAR models of skin permeability and the relationships between skin permeability and skin sensitization

    International Nuclear Information System (INIS)

    Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander

    2015-01-01

    Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, we found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R 2 = 0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q 2 ext = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential. - Highlights: • It was compiled the largest publicly-available skin permeability dataset. • Predictive QSAR models were developed for skin permeability. • No concordance between skin sensitization and

  12. Discovery of DPP IV inhibitors by pharmacophore modeling and QSAR analysis followed by in silico screening.

    Science.gov (United States)

    Al-Masri, Ihab M; Mohammad, Mohammad K; Taha, Mutasem O

    2008-11-01

    Dipeptidyl peptidase IV (DPP IV) deactivates the natural hypoglycemic incretin hormones. Inhibition of this enzyme should restore glucose homeostasis in diabetic patients making it an attractive target for the development of new antidiabetic drugs. With this in mind, the pharmacophoric space of DPP IV was explored using a set of 358 known inhibitors. Thereafter, genetic algorithm and multiple linear regression analysis were employed to select an optimal combination of pharmacophoric models and physicochemical descriptors that yield selfconsistent and predictive quantitative structure-activity relationships (QSAR) (r(2) (287)=0.74, F-statistic=44.5, r(2) (BS)=0.74, r(2) (LOO)=0.69, r(2) (PRESS) against 71 external testing inhibitors=0.51). Two orthogonal pharmacophores (of cross-correlation r(2)=0.23) emerged in the QSAR equation suggesting the existence of at least two distinct binding modes accessible to ligands within the DPP IV binding pocket. Docking experiments supported the binding modes suggested by QSAR/pharmacophore analyses. The validity of the QSAR equation and the associated pharmacophore models were established by the identification of new low-micromolar anti-DPP IV leads retrieved by in silico screening. One of our interesting potent anti-DPP IV hits is the fluoroquinolone gemifloxacin (IC(50)=1.12 muM). The fact that gemifloxacin was recently reported to potently inhibit the prodiabetic target glycogen synthase kinase 3beta (GSK-3beta) suggests that gemifloxacin is an excellent lead for the development of novel dual antidiabetic inhibitors against DPP IV and GSK-3beta.

  13. Using Toxicological Evidence from QSAR Models in Practice

    Science.gov (United States)

    The new generation of QSAR models provides supporting documentation in addition to the predicted toxicological value. Such information enables the toxicologist to explore the properties of chemical substances and to review and increase the reliability of toxicity predictions. Thi...

  14. Cross-validation pitfalls when selecting and assessing regression and classification models.

    Science.gov (United States)

    Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon

    2014-03-29

    We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.

  15. Rate constants of hydroxyl radical oxidation of polychlorinated biphenyls in the gas phase: A single−descriptor based QSAR and DFT study

    International Nuclear Information System (INIS)

    Yang, Zhihui; Luo, Shuang; Wei, Zongsu; Ye, Tiantian; Spinney, Richard; Chen, Dong; Xiao, Ruiyang

    2016-01-01

    The second‒order rate constants (k) of hydroxyl radical (·OH) with polychlorinated biphenyls (PCBs) in the gas phase are of scientific and regulatory importance for assessing their global distribution and fate in the atmosphere. Due to the limited number of measured k values, there is a need to model the k values for unknown PCBs congeners. In the present study, we developed a quantitative structure–activity relationship (QSAR) model with quantum chemical descriptors using a sequential approach, including correlation analysis, principal component analysis, multi−linear regression, validation, and estimation of applicability domain. The result indicates that the single descriptor, polarizability (α), plays an important role in determining the reactivity with a global standardized function of lnk = −0.054 × α ‒ 19.49 at 298 K. In order to validate the QSAR predicted k values and expand the current k value database for PCBs congeners, an independent method, density functional theory (DFT), was employed to calculate the kinetics and thermodynamics of the gas‒phase ·OH oxidation of 2,4′,5-trichlorobiphenyl (PCB31), 2,2′,4,4′-tetrachlorobiphenyl (PCB47), 2,3,4,5,6-pentachlorobiphenyl (PCB116), 3,3′,4,4′,5,5′-hexachlorobiphenyl (PCB169), and 2,3,3′,4,5,5′,6-heptachlorobiphenyl (PCB192) at 298 K at B3LYP/6–311++G**//B3LYP/6–31 + G** level of theory. The QSAR predicted and DFT calculated k values for ·OH oxidation of these PCB congeners exhibit excellent agreement with the experimental k values, indicating the robustness and predictive power of the single–descriptor based QSAR model we developed. - Highlights: • We developed a single−descriptor based QSAR model for ·OH oxidation of PCBs. • We independently validated the QSAR predicted k values of five PCB congeners with the DFT method. • The QSAR predicted and DFT calculated k for the five PCB congeners exhibit excellent agreement. - We developed a single

  16. QSAR models for oxidation of organic micropollutants in water based on ozone and hydroxyl radical rate constants and their chemical classification

    KAUST Repository

    Sudhakaran, Sairam

    2013-03-01

    Ozonation is an oxidation process for the removal of organic micropollutants (OMPs) from water and the chemical reaction is governed by second-order kinetics. An advanced oxidation process (AOP), wherein the hydroxyl radicals (OH radicals) are generated, is more effective in removing a wider range of OMPs from water than direct ozonation. Second-order rate constants (kOH and kO3) are good indices to estimate the oxidation efficiency, where higher rate constants indicate more rapid oxidation. In this study, quantitative structure activity relationships (QSAR) models for O3 and AOP processes were developed, and rate constants, kOH and kO3, were predicted based on target compound properties. The kO3 and kOH values ranged from 5 * 10-4 to 105 M-1s-1 and 0.04 to 18 * (109) M-1 s-1, respectively. Several molecular descriptors which potentially influence O3 and OH radical oxidation were identified and studied. The QSAR-defining descriptors were double bond equivalence (DBE), ionisation potential (IP), electron-affinity (EA) and weakly-polar component of solvent accessible surface area (WPSA), and the chemical and statistical significance of these descriptors was discussed. Multiple linear regression was used to build the QSAR models, resulting in high goodness-of-fit, r2 (>0.75). The models were validated by internal and external validation along with residual plots. © 2012 Elsevier Ltd.

  17. QSAR Modeling of COX -2 Inhibitory Activity of Some Dihydropyridine and Hydroquinoline Derivatives Using Multiple Linear Regression (MLR) Method.

    Science.gov (United States)

    Akbari, Somaye; Zebardast, Tannaz; Zarghi, Afshin; Hajimahdi, Zahra

    2017-01-01

    COX-2 inhibitory activities of some 1,4-dihydropyridine and 5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives were modeled by quantitative structure-activity relationship (QSAR) using stepwise-multiple linear regression (SW-MLR) method. The built model was robust and predictive with correlation coefficient (R 2 ) of 0.972 and 0.531 for training and test groups, respectively. The quality of the model was evaluated by leave-one-out (LOO) cross validation (LOO correlation coefficient (Q 2 ) of 0.943) and Y-randomization. We also employed a leverage approach for the defining of applicability domain of model. Based on QSAR models results, COX-2 inhibitory activity of selected data set had correlation with BEHm6 (highest eigenvalue n. 6 of Burden matrix/weighted by atomic masses), Mor03u (signal 03/unweighted) and IVDE (Mean information content on the vertex degree equality) descriptors which derived from their structures.

  18. The utility of QSARs in predicting acute fish toxicity of pesticide metabolites: A retrospective validation approach.

    Science.gov (United States)

    Burden, Natalie; Maynard, Samuel K; Weltje, Lennart; Wheeler, James R

    2016-10-01

    this non-testing approach, it can be concluded there is a strong data driven rationale for the applicability of QSAR models in the metabolite assessment scheme recommended by EFSA. As such there is value in further refining this approach, to improve the method and enable its future incorporation into regulatory guidance and practice. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  19. Towards Global QSAR Model Building for Acute Toxicity: Munro Database Case Study

    Directory of Open Access Journals (Sweden)

    Swapnil Chavan

    2014-10-01

    Full Text Available A series of 436 Munro database chemicals were studied with respect to their corresponding experimental LD50 values to investigate the possibility of establishing a global QSAR model for acute toxicity. Dragon molecular descriptors were used for the QSAR model development and genetic algorithms were used to select descriptors better correlated with toxicity data. Toxic values were discretized in a qualitative class on the basis of the Globally Harmonized Scheme: the 436 chemicals were divided into 3 classes based on their experimental LD50 values: highly toxic, intermediate toxic and low to non-toxic. The k-nearest neighbor (k-NN classification method was calibrated on 25 molecular descriptors and gave a non-error rate (NER equal to 0.66 and 0.57 for internal and external prediction sets, respectively. Even if the classification performances are not optimal, the subsequent analysis of the selected descriptors and their relationship with toxicity levels constitute a step towards the development of a global QSAR model for acute toxicity.

  20. QSAR study on the histamine (H3 receptor antagonists using the genetic algorithm: Multi parameter linear regression

    Directory of Open Access Journals (Sweden)

    Adimi Maryam

    2012-01-01

    Full Text Available A quantitative structure activity relationship (QSAR model has been produced for predicting antagonist potency of biphenyl derivatives as human histamine (H3 receptors. The molecular structures of the compounds are numerically represented by various kinds of molecular descriptors. The whole data set was divided into training and test sets. Genetic algorithm based multiple linear regression is used to select most statistically effective descriptors. The final QSAR model (N =24, R2=0.916, F = 51.771, Q2 LOO = 0.872, Q2 LGO = 0.847, Q2 BOOT = 0.857 was fully validated employing leaveone- out (LOO cross-validation approach, Fischer statistics (F, Yrandomisation test, and predictions based on the test data set. The test set presented an external prediction power of R2 test=0.855. In conclusion, the QSAR model generated can be used as a valuable tool for designing similar groups of new antagonists of histamine (H3 receptors.

  1. First molecular modeling report on novel arylpyrimidine kynurenine monooxygenase inhibitors through multi-QSAR analysis against Huntington's disease: A proposal to chemists!

    Science.gov (United States)

    Amin, Sk Abdul; Adhikari, Nilanjan; Jha, Tarun; Gayen, Shovanlal

    2016-12-01

    Huntington's disease (HD) is caused by mutation of huntingtin protein (mHtt) leading to neuronal cell death. The mHtt induced toxicity can be rescued by inhibiting the kynurenine monooxygenase (KMO) enzyme. Therefore, KMO is a promising drug target to address the neurodegenerative disorders such as Huntington's diseases. Fiftysix arylpyrimidine KMO inhibitors are structurally explored through regression and classification based multi-QSAR modeling, pharmacophore mapping and molecular docking approaches. Moreover, ten new compounds are proposed and validated through the modeling that may be effective in accelerating Huntington's disease drug discovery efforts. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. QSAR models for reproductive toxicity and endocrine disruption in regulatory use - a preliminary investigation

    DEFF Research Database (Denmark)

    Jensen, Gunde Egeskov; Niemela, J.R.; Wedebye, Eva Bay

    2008-01-01

    A special challenge in the new European Union chemicals legislation, Registration, Evaluation and Authorisation of Chemicals, will be the toxicological evaluation of chemicals for reproductive toxicity. Use of valid quantitative structure-activity relationships (QSARs) is a possibility under...

  3. Consensus QSAR model for identifying novel H5N1 inhibitors.

    Science.gov (United States)

    Sharma, Nitin; Yap, Chun Wei

    2012-08-01

    Due to the importance of neuraminidase in the pathogenesis of influenza virus infection, it has been regarded as the most important drug target for the treatment of influenza. Resistance to currently available drugs and new findings related to structure of the protein requires novel neuraminidase 1 (N1) inhibitors. In this study, a consensus QSAR model with defined applicability domain (AD) was developed using published N1 inhibitors. The consensus model was validated using an external validation set. The model achieved high sensitivity, specificity, and overall accuracy along with low false positive rate (FPR) and false discovery rate (FDR). The performance of model on the external validation set and training set were comparable, thus it was unlikely to be overfitted. The low FPR and low FDR will increase its accuracy in screening large chemical libraries. Screening of ZINC library resulted in 64,772 compounds as probable N1 inhibitors, while 173,674 compounds were defined to be outside the AD of the consensus model. The advantage of the current model is that it was developed using a large and diverse dataset and has a defined AD which prevents its use on compounds that it is not capable of predicting. The consensus model developed in this study is made available via the free software, PaDEL-DDPredictor.

  4. Insight into the structural requirement of substituted quinazolinone biphenyl acylsulfonamides derivatives as Angiotensin II AT1 receptor antagonist: 2D and 3D QSAR approach

    Directory of Open Access Journals (Sweden)

    Mukesh C. Sharma

    2014-01-01

    Full Text Available A series of 19 molecules substituted quinazolinone biphenyl acylsulfonamides derivatives displaying variable inhibition of Angiotensin II receptor AT1 activity were selected to develop models for establishing 2D and 3D QSAR. The compounds in the selected series were characterized by spatial, molecular and electro topological descriptors using QSAR module of Molecular Design Suite (VLife MDS™ 3.5. The best 2D QSAR model was selected, having correlation coefficient r2 (0.8056 and cross validated squared correlation coefficient q2 (0.6742 with external predictive ability of pred_r2 0.7583 coefficient of correlation of predicted data set (pred_r2se 0.2165. The results obtained from QSAR studies could be used in designing better Ang II activity among the congeners in future. The optimum QSAR model showed that the parameters SsssCHE index, SddCE-index, T_2_Cl_4, and SssNHE-index contributed in the model. 3D QSAR analysis by kNN-molecular field analysis approach developed based on principles of the k-nearest neighbor method combined with Genetic algorithms, stepwise forward variable selection approach; a leave-one-out cross-validated correlation coefficient (q2 of 0.6516 and a non-cross-validated correlation coefficient (r2 of 0.8316 and pred_r2 0.6954 were obtained. Contour maps using this approach showed that steric, electrostatic, and hydrophobic field effects dominantly determine binding affinities. The information rendered by 3D QSAR models may lead to a better understanding of structural requirements of Angiotensin II receptor and can help in the design of novel potent antihypertensive molecules.

  5. A primer on QSAR/QSPR modeling fundamental concepts

    CERN Document Server

    Roy, Kunal; Das, Rudra Narayan

    2015-01-01

    This brief goes back to basics and describes the Quantitative structure-activity/property relationships (QSARs/QSPRs) that represent predictive models derived from the application of statistical tools correlating biological activity (including therapeutic and toxic) and properties of chemicals (drugs/toxicants/environmental pollutants) with descriptors representative of molecular structure and/or properties. It explains how the sub-discipline of Cheminformatics is used for many applications such as risk assessment, toxicity prediction, property prediction and regulatory decisions apart from drug discovery and lead optimization. The authors also present, in basic terms, how QSARs and related chemometric tools are extensively involved in medicinal chemistry, environmental chemistry and agricultural chemistry for ranking of potential compounds and prioritizing experiments. At present, there is no standard or introductory publication available that introduces this important topic to students of chemistry and phar...

  6. Benchmarking Variable Selection in QSAR.

    Science.gov (United States)

    Eklund, Martin; Norinder, Ulf; Boyer, Scott; Carlsson, Lars

    2012-02-01

    Variable selection is important in QSAR modeling since it can improve model performance and transparency, as well as reduce the computational cost of model fitting and predictions. Which variable selection methods that perform well in QSAR settings is largely unknown. To address this question we, in a total of 1728 benchmarking experiments, rigorously investigated how eight variable selection methods affect the predictive performance and transparency of random forest models fitted to seven QSAR datasets covering different endpoints, descriptors sets, types of response variables, and number of chemical compounds. The results show that univariate variable selection methods are suboptimal and that the number of variables in the benchmarked datasets can be reduced with about 60 % without significant loss in model performance when using multivariate adaptive regression splines MARS and forward selection. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Prediction of rodent carcinogenic potential of naturally occurring chemicals in the human diet using high-throughput QSAR predictive modeling

    International Nuclear Information System (INIS)

    Valerio, Luis G.; Arvidson, Kirk B.; Chanderbhan, Ronald F.; Contrera, Joseph F.

    2007-01-01

    , comprised primarily of pharmaceutical, industrial and some natural products developed under an FDA-MDL cooperative research and development agreement (CRADA). The predictive performance for this group of dietary natural products and the control group was 97% sensitivity and 80% concordance. Specificity was marginal at 53%. This study finds that the in silico QSAR analysis employing this software's rodent carcinogenicity database is capable of identifying the rodent carcinogenic potential of naturally occurring organic molecules found in the human diet with a high degree of sensitivity. It is the first study to demonstrate successful QSAR predictive modeling of naturally occurring carcinogens found in the human diet using an external validation test. Further test validation of this software and expansion of the training data set for dietary chemicals will help to support the future use of such QSAR methods for screening and prioritizing the risk of dietary chemicals when actual animal data are inadequate, equivocal, or absent

  8. QSAR classification models for the prediction of endocrine disrupting activity of brominated flame retardants.

    Science.gov (United States)

    Kovarich, Simona; Papa, Ester; Gramatica, Paola

    2011-06-15

    The identification of potential endocrine disrupting (ED) chemicals is an important task for the scientific community due to their diffusion in the environment; the production and use of such compounds will be strictly regulated through the authorization process of the REACH regulation. To overcome the problem of insufficient experimental data, the quantitative structure-activity relationship (QSAR) approach is applied to predict the ED activity of new chemicals. In the present study QSAR classification models are developed, according to the OECD principles, to predict the ED potency for a class of emerging ubiquitary pollutants, viz. brominated flame retardants (BFRs). Different endpoints related to ED activity (i.e. aryl hydrocarbon receptor agonism and antagonism, estrogen receptor agonism and antagonism, androgen and progesterone receptor antagonism, T4-TTR competition, E2SULT inhibition) are modeled using the k-NN classification method. The best models are selected by maximizing the sensitivity and external predictive ability. We propose simple QSARs (based on few descriptors) characterized by internal stability, good predictive power and with a verified applicability domain. These models are simple tools that are applicable to screen BFRs in relation to their ED activity, and also to design safer alternatives, in agreement with the requirements of REACH regulation at the authorization step. Copyright © 2011 Elsevier B.V. All rights reserved.

  9. Quantitative structure activity relationship (QSAR) of piperine analogs for bacterial NorA efflux pump inhibitors.

    Science.gov (United States)

    Nargotra, Amit; Sharma, Sujata; Koul, Jawahir Lal; Sangwan, Pyare Lal; Khan, Inshad Ali; Kumar, Ashwani; Taneja, Subhash Chander; Koul, Surrinder

    2009-10-01

    Quantitative structure activity relationship (QSAR) analysis of piperine analogs as inhibitors of efflux pump NorA from Staphylococcus aureus has been performed in order to obtain a highly accurate model enabling prediction of inhibition of S. aureus NorA of new chemical entities from natural sources as well as synthetic ones. Algorithm based on genetic function approximation method of variable selection in Cerius2 was used to generate the model. Among several types of descriptors viz., topological, spatial, thermodynamic, information content and E-state indices that were considered in generating the QSAR model, three descriptors such as partial negative surface area of the compounds, area of the molecular shadow in the XZ plane and heat of formation of the molecules resulted in a statistically significant model with r(2)=0.962 and cross-validation parameter q(2)=0.917. The validation of the QSAR models was done by cross-validation, leave-25%-out and external test set prediction. The theoretical approach indicates that the increase in the exposed partial negative surface area increases the inhibitory activity of the compound against NorA whereas the area of the molecular shadow in the XZ plane is inversely proportional to the inhibitory activity. This model also explains the relationship of the heat of formation of the compound with the inhibitory activity. The model is not only able to predict the activity of new compounds but also explains the important regions in the molecules in quantitative manner.

  10. Predicting chemically-induced skin reactions. Part II: QSAR models of skin permeability and the relationships between skin permeability and skin sensitization

    Science.gov (United States)

    Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander

    2015-01-01

    Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, we found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R2=0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q2ext = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential. PMID:25560673

  11. QSAR models of human data can enrich or replace LLNA testing for human skin sensitization

    Science.gov (United States)

    Alves, Vinicius M.; Capuzzi, Stephen J.; Muratov, Eugene; Braga, Rodolpho C.; Thornton, Thomas; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander

    2016-01-01

    Skin sensitization is a major environmental and occupational health hazard. Although many chemicals have been evaluated in humans, there have been no efforts to model these data to date. We have compiled, curated, analyzed, and compared the available human and LLNA data. Using these data, we have developed reliable computational models and applied them for virtual screening of chemical libraries to identify putative skin sensitizers. The overall concordance between murine LLNA and human skin sensitization responses for a set of 135 unique chemicals was low (R = 28-43%), although several chemical classes had high concordance. We have succeeded to develop predictive QSAR models of all available human data with the external correct classification rate of 71%. A consensus model integrating concordant QSAR predictions and LLNA results afforded a higher CCR of 82% but at the expense of the reduced external dataset coverage (52%). We used the developed QSAR models for virtual screening of CosIng database and identified 1061 putative skin sensitizers; for seventeen of these compounds, we found published evidence of their skin sensitization effects. Models reported herein provide more accurate alternative to LLNA testing for human skin sensitization assessment across diverse chemical data. In addition, they can also be used to guide the structural optimization of toxic compounds to reduce their skin sensitization potential. PMID:28630595

  12. New free Danish online (Q)SAR predictions database with >600,000 substances

    DEFF Research Database (Denmark)

    Wedebye, Eva Bay; Dybdahl, Marianne; Reffstrup, Trine Klein

    Since 2005 the Danish (Q)SAR Database has been freely available on the Internet. It is a tool that allows single chemical substance profiling and screenings based on predicted hazard information. The database is also included in the OECD (Q)SAR Application Toolbox which is used worldwide...... by regulators and industry. A lot of progress in (Q)SAR model development, application and documentation has been made since the publication in 2005. A new and completely rebuild online (Q)SAR predictions database was therefore published in November 2015 at http://qsar.food.dtu.dk. The number of chemicals...... in the database has been expanded from 185,000 to >600,000. As far as possible all organic single constituent substances that were pre-registered under REACH have been included in the new structure set. The new Danish (Q)SAR Database includes estimates from more than 200 (Q)SARs covering a wide range of hazardous...

  13. QSAR models for reproductive toxicity and endocrine disruption in regulatory use – a preliminary investigation

    DEFF Research Database (Denmark)

    Jensen, Gunde Egeskov; Niemelä, Jay Russell; Wedebye, Eva Bay

    2008-01-01

    the new legislation. This article focuses on a screening exercise by use of our own and commercial QSAR models for identification of possible reproductive toxicants. Three QSAR models were used for reproductive toxicity for the endpoints teratogenic risk to humans (based on animal tests, clinical data...... for humans owing to possible developmental toxic effects: Xn (Harmful) and R63 (Possible risk of harm to the unborn child). The chemicals were also screened in three models for endocrine disruption....

  14. Prediction of octanol-air partition coefficients for polychlorinated biphenyls (PCBs) using 3D-QSAR models.

    Science.gov (United States)

    Chen, Ying; Cai, Xiaoyu; Jiang, Long; Li, Yu

    2016-02-01

    Based on the experimental data of octanol-air partition coefficients (KOA) for 19 polychlorinated biphenyl (PCB) congeners, two types of QSAR methods, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA), are used to establish 3D-QSAR models using the structural parameters as independent variables and using logKOA values as the dependent variable with the Sybyl software to predict the KOA values of the remaining 190 PCB congeners. The whole data set (19 compounds) was divided into a training set (15 compounds) for model generation and a test set (4 compounds) for model validation. As a result, the cross-validation correlation coefficient (q(2)) obtained by the CoMFA and CoMSIA models (shuffled 12 times) was in the range of 0.825-0.969 (>0.5), the correlation coefficient (r(2)) obtained was in the range of 0.957-1.000 (>0.9), and the SEP (standard error of prediction) of test set was within the range of 0.070-0.617, indicating that the models were robust and predictive. Randomly selected from a set of models, CoMFA analysis revealed that the corresponding percentages of the variance explained by steric and electrostatic fields were 23.9% and 76.1%, respectively, while CoMSIA analysis by steric, electrostatic and hydrophobic fields were 0.6%, 92.6%, and 6.8%, respectively. The electrostatic field was determined as a primary factor governing the logKOA. The correlation analysis of the relationship between the number of Cl atoms and the average logKOA values of PCBs indicated that logKOA values gradually increased as the number of Cl atoms increased. Simultaneously, related studies on PCB detection in the Arctic and Antarctic areas revealed that higher logKOA values indicate a stronger PCB migration ability. From CoMFA and CoMSIA contour maps, logKOA decreased when substituents possessed electropositive groups at the 2-, 3-, 3'-, 5- and 6- positions, which could reduce the PCB migration ability. These results are

  15. 2D-QSAR study of fullerene nanostructure derivatives as potent HIV-1 protease inhibitors

    Science.gov (United States)

    Barzegar, Abolfazl; Jafari Mousavi, Somaye; Hamidi, Hossein; Sadeghi, Mehdi

    2017-09-01

    The protease of human immunodeficiency virus1 (HIV-PR) is an essential enzyme for antiviral treatments. Carbon nanostructures of fullerene derivatives, have nanoscale dimension with a diameter comparable to the diameter of the active site of HIV-PR which would in turn inhibit HIV. In this research, two dimensional quantitative structure-activity relationships (2D-QSAR) of fullerene derivatives against HIV-PR activity were employed as a powerful tool for elucidation the relationships between structure and experimental observations. QSAR study of 49 fullerene derivatives was performed by employing stepwise-MLR, GAPLS-MLR, and PCA-MLR models for variable (descriptor) selection and model construction. QSAR models were obtained with higher ability to predict the activity of the fullerene derivatives against HIV-PR by a correlation coefficient (R2training) of 0.942, 0.89, and 0.87 as well as R2test values of 0.791, 0.67and 0.674 for stepwise-MLR, GAPLS-MLR, and PCA -MLR models, respectively. Leave-one-out cross-validated correlation coefficient (R2CV) and Y-randomization methods confirmed the models robustness. The descriptors indicated that the HIV-PR inhibition depends on the van der Waals volumes, polarizability, bond order between two atoms and electronegativities of fullerenes derivatives. 2D-QSAR simulation without needing receptor's active site geometry, resulted in useful descriptors mainly denoting ;C60 backbone-functional groups; and ;C60 functional groups; properties. Both properties in fullerene refer to the ligand fitness and improvement van der Waals interactions with HIV-PR active site. Therefore, the QSAR models can be used in the search for novel HIV-PR inhibitors based on fullerene derivatives.

  16. Hydroxyethylamine derivatives as HIV-1 protease inhibitors: a predictive QSAR modelling study based on Monte Carlo optimization.

    Science.gov (United States)

    Bhargava, S; Adhikari, N; Amin, S A; Das, K; Gayen, S; Jha, T

    2017-12-01

    Application of HIV-1 protease inhibitors (as an anti-HIV regimen) may serve as an attractive strategy for anti-HIV drug development. Several investigations suggest that there is a crucial need to develop a novel protease inhibitor with higher potency and reduced toxicity. Monte Carlo optimized QSAR study was performed on 200 hydroxyethylamine derivatives with antiprotease activity. Twenty-one QSAR models with good statistical qualities were developed from three different splits with various combinations of SMILES and GRAPH based descriptors. The best models from different splits were selected on the basis of statistically validated characteristics of the test set and have the following statistical parameters: r 2 = 0.806, Q 2 = 0.788 (split 1); r 2 = 0.842, Q 2 = 0.826 (split 2); r 2 = 0.774, Q 2 = 0.755 (split 3). The structural attributes obtained from the best models were analysed to understand the structural requirements of the selected series for HIV-1 protease inhibitory activity. On the basis of obtained structural attributes, 11 new compounds were designed, out of which five compounds were found to have better activity than the best active compound in the series.

  17. Activity Prediction of Schiff Base Compounds using Improved QSAR Models of Cinnamaldehyde Analogues and Derivatives

    Directory of Open Access Journals (Sweden)

    Hui Wang

    2015-10-01

    Full Text Available In past work, QSAR (quantitative structure-activity relationship models of cinnamaldehyde analogues and derivatives (CADs have been used to predict the activities of new chemicals based on their mass concentrations, but these approaches are not without shortcomings. Therefore, molar concentrations were used instead of mass concentrations to determine antifungal activity. New QSAR models of CADs against Aspergillus niger and Penicillium citrinum were established, and the molecular design of new CADs was performed. The antifungal properties of the designed CADs were tested, and the experimental Log AR values were in agreement with the predicted Log AR values. The results indicate that the improved QSAR models are more reliable and can be effectively used for CADs molecular design and prediction of the activity of CADs. These findings provide new insight into the development and utilization of cinnamaldehyde compounds.

  18. Predictive QSAR modelling of algal toxicity of ionic liquids and its interspecies correlation with Daphnia toxicity.

    Science.gov (United States)

    Roy, Kunal; Das, Rudra Narayan; Popelier, Paul L A

    2015-05-01

    Predictive toxicology using chemometric tools can be very useful in order to fill the data gaps for ionic liquids (ILs) with limited available experimental toxicity information, in view of their growing industrial uses. Though originally promoted as green chemicals, ILs have now been shown to possess considerable toxicity against different ecological endpoints. Against this background, quantitative structure-activity relationship (QSAR) models have been developed here for the toxicity of ILs against the green algae Scenedesmus vacuolatus using computed descriptors with definite physicochemical meaning. The final models emerged from E-state indices, extended topochemical atom (ETA) indices and quantum topological molecular similarity (QTMS) indices. The developed partial least squares models support the established mechanism of toxicity of ionic liquids in terms of a surfactant action of cations and chaotropic action of anions. The models have been developed within the guidelines of the Organization of Economic Co-operation and Development (OECD) for regulatory QSAR models, and they have been validated both internally and externally using multiple strategies and also tested for applicability domain. A preliminary attempt has also been made, for the first time, to develop interspecies quantitative toxicity-toxicity relationship (QTTR) models for the algal toxicity of ILs with Daphnia toxicity, which should be interesting while predicting toxicity of ILs for an endpoint when the data for the other are available.

  19. Identification of the Structural Features of Guanine Derivatives as MGMT Inhibitors Using 3D-QSAR Modeling Combined with Molecular Docking

    Directory of Open Access Journals (Sweden)

    Guohui Sun

    2016-06-01

    Full Text Available DNA repair enzyme O6-methylguanine-DNA methyltransferase (MGMT, which plays an important role in inducing drug resistance against alkylating agents that modify the O6 position of guanine in DNA, is an attractive target for anti-tumor chemotherapy. A series of MGMT inhibitors have been synthesized over the past decades to improve the chemotherapeutic effects of O6-alkylating agents. In the present study, we performed a three-dimensional quantitative structure activity relationship (3D-QSAR study on 97 guanine derivatives as MGMT inhibitors using comparative molecular field analysis (CoMFA and comparative molecular similarity indices analysis (CoMSIA methods. Three different alignment methods (ligand-based, DFT optimization-based and docking-based alignment were employed to develop reliable 3D-QSAR models. Statistical parameters derived from the models using the above three alignment methods showed that the ligand-based CoMFA (Qcv2 = 0.672 and Rncv2 = 0.997 and CoMSIA (Qcv2 = 0.703 and Rncv2 = 0.946 models were better than the other two alignment methods-based CoMFA and CoMSIA models. The two ligand-based models were further confirmed by an external test-set validation and a Y-randomization examination. The ligand-based CoMFA model (Qext2 = 0.691, Rpred2 = 0.738 and slope k = 0.91 was observed with acceptable external test-set validation values rather than the CoMSIA model (Qext2 = 0.307, Rpred2 = 0.4 and slope k = 0.719. Docking studies were carried out to predict the binding modes of the inhibitors with MGMT. The results indicated that the obtained binding interactions were consistent with the 3D contour maps. Overall, the combined results of the 3D-QSAR and the docking obtained in this study provide an insight into the understanding of the interactions between guanine derivatives and MGMT protein, which will assist in designing novel MGMT inhibitors with desired activity.

  20. Rational drug design for anti-cancer chemotherapy: multi-target QSAR models for the in silico discovery of anti-colorectal cancer agents.

    Science.gov (United States)

    Speck-Planche, Alejandro; Kleandrova, Valeria V; Luan, Feng; Cordeiro, M Natália D S

    2012-08-01

    The discovery of new and more potent anti-cancer agents constitutes one of the most active fields of research in chemotherapy. Colorectal cancer (CRC) is one of the most studied cancers because of its high prevalence and number of deaths. In the current pharmaceutical design of more efficient anti-CRC drugs, the use of methodologies based on Chemoinformatics has played a decisive role, including Quantitative-Structure-Activity Relationship (QSAR) techniques. However, until now, there is no methodology able to predict anti-CRC activity of compounds against more than one CRC cell line, which should constitute the principal goal. In an attempt to overcome this problem we develop here the first multi-target (mt) approach for the virtual screening and rational in silico discovery of anti-CRC agents against ten cell lines. Here, two mt-QSAR classification models were constructed using a large and heterogeneous database of compounds. The first model was based on linear discriminant analysis (mt-QSAR-LDA) employing fragment-based descriptors while the second model was obtained using artificial neural networks (mt-QSAR-ANN) with global 2D descriptors. Both models correctly classified more than 90% of active and inactive compounds in training and prediction sets. Some fragments were extracted from the molecules and their contributions to anti-CRC activity were calculated using mt-QSAR-LDA model. Several fragments were identified as potential substructural features responsible for the anti-CRC activity and new molecules designed from those fragments with positive contributions were suggested and correctly predicted by the two models as possible potent and versatile anti-CRC agents. Copyright © 2012 Elsevier Ltd. All rights reserved.

  1. Development of human biotransformation QSARs and application for PBT assessment refinement.

    Science.gov (United States)

    Papa, Ester; Sangion, Alessandro; Arnot, Jon A; Gramatica, Paola

    2018-02-01

    Toxicokinetics heavily influence chemical toxicity as the result of Absorption, Distribution, Metabolism (Biotransformation) and Elimination (ADME) processes. Biotransformation (metabolism) reactions can lead to detoxification or, in some cases, bioactivation of parent compounds to more toxic chemicals. Moreover, biotransformation has been recognized as a key process determining chemical half-life in an organism and is thus a key determinant for bioaccumulation assessment for many chemicals. This study addresses the development of QSAR models for the prediction of in vivo whole body human biotransformation (metabolism) half-lives measured or empirically-derived for over 1000 chemicals, mainly represented by pharmaceuticals. Models presented in this study meet regulatory standards for fitting, validation and applicability domain. These QSARs were used, in combination with literature models for the prediction of biotransformation half-lives in fish, to refine the screening of the potential PBT behaviour of over 1300 Pharmaceuticals and Personal Care Products (PPCPs). The refinement of the PBT screening allowed, among others, for the identification of PPCPs, which were predicted as PBTs on the basis of their chemical structure, but may be easily biotransformed. These compounds are of lower concern in comparison to potential PBTs characterized by large predicted biotransformation half-lives. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. An integrated QSAR-PBK/D modelling approach for predicting detoxification and DNA adduct formation of 18 acyclic food-borne α,β-unsaturated aldehydes

    Energy Technology Data Exchange (ETDEWEB)

    Kiwamoto, R., E-mail: reiko.kiwamoto@wur.nl; Spenkelink, A.; Rietjens, I.M.C.M.; Punt, A.

    2015-01-01

    Acyclic α,β-unsaturated aldehydes present in food raise a concern because the α,β-unsaturated aldehyde moiety is considered a structural alert for genotoxicity. However, controversy remains on whether in vivo at realistic dietary exposure DNA adduct formation is significant. The aim of the present study was to develop physiologically based kinetic/dynamic (PBK/D) models to examine dose-dependent detoxification and DNA adduct formation of a group of 18 food-borne acyclic α,β-unsaturated aldehydes without 2- or 3-alkylation, and with no more than one conjugated double bond. Parameters for the PBK/D models were obtained using quantitative structure–activity relationships (QSARs) defined with a training set of six selected aldehydes. Using the QSARs, PBK/D models for the other 12 aldehydes were defined. Results revealed that DNA adduct formation in the liver increases with decreasing bulkiness of the molecule especially due to less efficient detoxification. 2-Propenal (acrolein) was identified to induce the highest DNA adduct levels. At realistic dietary intake, the predicted DNA adduct levels for all aldehydes were two orders of magnitude lower than endogenous background levels observed in disease free human liver, suggesting that for all 18 aldehydes DNA adduct formation is negligible at the relevant levels of dietary intake. The present study provides a proof of principle for the use of QSAR-based PBK/D modelling to facilitate group evaluations and read-across in risk assessment. - Highlights: • Physiologically based in silico models were made for 18 α,β-unsaturated aldehydes. • Kinetic parameters were determined by in vitro incubations and a QSAR approach. • DNA adduct formation was negligible at levels relevant for dietary intake. • The use of QSAR-based PBK/D modelling facilitates group evaluations and read-across.

  3. Building up a QSAR model for toxicity toward Tetrahymena pyriformis by the Monte Carlo method: A case of benzene derivatives.

    Science.gov (United States)

    Toropova, Alla P; Schultz, Terry W; Toropov, Andrey A

    2016-03-01

    Data on toxicity toward Tetrahymena pyriformis is indicator of applicability of a substance in ecologic and pharmaceutical aspects. Quantitative structure-activity relationships (QSARs) between the molecular structure of benzene derivatives and toxicity toward T. pyriformis (expressed as the negative logarithms of the population growth inhibition dose, mmol/L) are established. The available data were randomly distributed three times into the visible training and calibration sets, and invisible validation sets. The statistical characteristics for the validation set are the following: r(2)=0.8179 and s=0.338 (first distribution); r(2)=0.8682 and s=0.341 (second distribution); r(2)=0.8435 and s=0.323 (third distribution). These models are built up using only information on the molecular structure: no data on physicochemical parameters, 3D features of the molecular structure and quantum mechanics descriptors are involved in the modeling process. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Organic Micropollutants Removal from Water by Oxidation and Other Processes:QSAR Models, Decision Support System and Hybrids of Processes

    KAUST Repository

    Sudhakaran, Sairam

    2013-08-01

    The presence of organic micropollutants (OMPs) in water is of great environmental concern. OMPs such as endocrine disruptors and certain pharmaceuticals have shown alarming effects on aquatic life. OMPs are included in the priority list of contaminants in several government directorate frameworks. The low levels of OMPs concentration (ng/L to μg/L) force the use of sophisticated analytical instruments. Although, the techniques to detect OMPs are progressing, the focus of current research is only on limited, important OMPs due to the high amount of time, cost and effort involved in analyzing them. Alternatively, quantitative structure activity relationship (QSAR) models help to screen processes and propose appropriate options without considerable experimental effort. QSAR models are well-established in regulatory bodies as a method to screen toxic chemicals. The goal of the present thesis was to develop QSAR models for OMPs removal by oxidation. Apart from the QSAR models, a decision support system (DSS) based on multi-criteria analysis (MCA) involving socio-economic-technical and sustainability aspects was developed. Also, hybrids of different water treatment processes were studied to propose a sustainable water treatment train for OMPs removal. In order to build the QSAR models, the ozone/hydroxyl radical rate constants or percent removals of the OMPs were compiled. Several software packages were used to 5 compute the chemical properties of OMPs and perform statistical analyses. For DSS, MCA was used since it allows the comparison of qualitative (non-monetary, non-metric) and quantitative criteria (e.g., costs). Quadrant plots were developed to study the hybrid of natural and advanced water treatment processes. The QSAR models satisfied both chemical and statistical criteria. The DSS resulted in natural treatment and ozonation as the preferred processes for OMPs removal. The QSAR models can be used as a screening tool for OMPs removal by oxidation. Moreover, the

  5. DAT/SERT Selectivity of Flexible GBR 12909 Analogs Modeled Using 3D-QSAR Methods

    Science.gov (United States)

    Gilbert, Kathleen M.; Boos, Terrence L.; Dersch, Christina M.; Greiner, Elisabeth; Jacobson, Arthur E.; Lewis, David; Matecka, Dorota; Prisinzano, Thomas E.; Zhang, Ying; Rothman, Richard B.; Rice, Kenner C.; Venanzi, Carol A.

    2007-01-01

    The dopamine reuptake inhibitor GBR 12909 (1-{2-[bis(4-fluorophenyl)methoxy]ethyl}-4-(3-phenylpropyl)piperazine, 1) and its analogs have been developed as tools to test the hypothesis that selective dopamine transporter (DAT) inhibitors will be useful therapeutics for cocaine addiction. This 3D-QSAR study focuses on the effect of substitutions in the phenylpropyl region of 1. CoMFA and CoMSIA techniques were used to determine a predictive and stable model for the DAT/serotonin transporter (SERT) selectivity (represented by pKi (DAT/SERT)) of a set of flexible analogs of 1, most of which have eight rotatable bonds. In the absence of a rigid analog to use as a 3D-QSAR template, six conformational families of analogs were constructed from six pairs of piperazine and piperidine template conformers identified by hierarchical clustering as representative molecular conformations. Three models stable to y-value scrambling were identified after a comprehensive CoMFA and CoMSIA survey with Region Focusing. Test set correlation validation led to an acceptable model, with q2 = 0.508, standard error of prediction = 0.601, two components, r2 = 0.685, standard error of estimate = 0.481, F value = 39, percent steric contribution = 65, and percent electrostatic contribution = 35. A CoMFA contour map identified areas of the molecule that affect pKi (DAT/SERT). This work outlines a protocol for deriving a stable and predictive model of the biological activity of a set of very flexible molecules. PMID:17127069

  6. QSARs for Plasma Protein Binding: Source Data and Predictions

    Data.gov (United States)

    U.S. Environmental Protection Agency — The dataset has all of the information used to create and evaluate 3 independent QSAR models for the fraction of a chemical unbound by plasma protein (Fub) for...

  7. A combined pharmacophore modeling, 3D-QSAR and molecular docking study of substituted bicyclo-[3.3.0]oct-2-enes as liver receptor homolog-1 (LRH-1) agonists

    Science.gov (United States)

    Lalit, Manisha; Gangwal, Rahul P.; Dhoke, Gaurao V.; Damre, Mangesh V.; Khandelwal, Kanchan; Sangamwar, Abhay T.

    2013-10-01

    A combined pharmacophore modelling, 3D-QSAR and molecular docking approach was employed to reveal structural and chemical features essential for the development of small molecules as LRH-1 agonists. The best HypoGen pharmacophore hypothesis (Hypo1) consists of one hydrogen-bond donor (HBD), two general hydrophobic (H), one hydrophobic aromatic (HYAr) and one hydrophobic aliphatic (HYA) feature. It has exhibited high correlation coefficient of 0.927, cost difference of 85.178 bit and low RMS value of 1.411. This pharmacophore hypothesis was cross-validated using test set, decoy set and Cat-Scramble methodology. Subsequently, validated pharmacophore hypothesis was used in the screening of small chemical databases. Further, 3D-QSAR models were developed based on the alignment obtained using substructure alignment. The best CoMFA and CoMSIA model has exhibited excellent rncv2 values of 0.991 and 0.987, and rcv2 values of 0.767 and 0.703, respectively. CoMFA predicted rpred2 of 0.87 and CoMSIA predicted rpred2 of 0.78 showed that the predicted values were in good agreement with the experimental values. Molecular docking analysis reveals that π-π interaction with His390 and hydrogen bond interaction with His390/Arg393 is essential for LRH-1 agonistic activity. The results from pharmacophore modelling, 3D-QSAR and molecular docking are complementary to each other and could serve as a powerful tool for the discovery of potent small molecules as LRH-1 agonists.

  8. Advantages and limitations of classic and 3D QSAR approaches in nano-QSAR studies based on biological activity of fullerene derivatives

    International Nuclear Information System (INIS)

    Jagiello, Karolina; Grzonkowska, Monika; Swirog, Marta; Ahmed, Lucky; Rasulev, Bakhtiyor; Avramopoulos, Aggelos; Papadopoulos, Manthos G.; Leszczynski, Jerzy; Puzyn, Tomasz

    2016-01-01

    In this contribution, the advantages and limitations of two computational techniques that can be used for the investigation of nanoparticles activity and toxicity: classic nano-QSAR (Quantitative Structure–Activity Relationships employed for nanomaterials) and 3D nano-QSAR (three-dimensional Quantitative Structure–Activity Relationships, such us Comparative Molecular Field Analysis, CoMFA/Comparative Molecular Similarity Indices Analysis, CoMSIA analysis employed for nanomaterials) have been briefly summarized. Both approaches were compared according to the selected criteria, including: efficiency, type of experimental data, class of nanomaterials, time required for calculations and computational cost, difficulties in the interpretation. Taking into account the advantages and limitations of each method, we provide the recommendations for nano-QSAR modellers and QSAR model users to be able to determine a proper and efficient methodology to investigate biological activity of nanoparticles in order to describe the underlying interactions in the most reliable and useful manner.

  9. Advantages and limitations of classic and 3D QSAR approaches in nano-QSAR studies based on biological activity of fullerene derivatives

    Energy Technology Data Exchange (ETDEWEB)

    Jagiello, Karolina; Grzonkowska, Monika; Swirog, Marta [University of Gdansk, Laboratory of Environmental Chemometrics, Faculty of Chemistry, Institute for Environmental and Human Health Protection (Poland); Ahmed, Lucky; Rasulev, Bakhtiyor [Jackson State University, Interdisciplinary Nanotoxicity Center, Department of Chemistry and Biochemistry (United States); Avramopoulos, Aggelos; Papadopoulos, Manthos G. [National Hellenic Research Foundation, Institute of Biology, Pharmaceutical Chemistry and Biotechnology (Greece); Leszczynski, Jerzy [Jackson State University, Interdisciplinary Nanotoxicity Center, Department of Chemistry and Biochemistry (United States); Puzyn, Tomasz, E-mail: t.puzyn@qsar.eu.org [University of Gdansk, Laboratory of Environmental Chemometrics, Faculty of Chemistry, Institute for Environmental and Human Health Protection (Poland)

    2016-09-15

    In this contribution, the advantages and limitations of two computational techniques that can be used for the investigation of nanoparticles activity and toxicity: classic nano-QSAR (Quantitative Structure–Activity Relationships employed for nanomaterials) and 3D nano-QSAR (three-dimensional Quantitative Structure–Activity Relationships, such us Comparative Molecular Field Analysis, CoMFA/Comparative Molecular Similarity Indices Analysis, CoMSIA analysis employed for nanomaterials) have been briefly summarized. Both approaches were compared according to the selected criteria, including: efficiency, type of experimental data, class of nanomaterials, time required for calculations and computational cost, difficulties in the interpretation. Taking into account the advantages and limitations of each method, we provide the recommendations for nano-QSAR modellers and QSAR model users to be able to determine a proper and efficient methodology to investigate biological activity of nanoparticles in order to describe the underlying interactions in the most reliable and useful manner.

  10. Benefits of statistical molecular design, covariance analysis, and reference models in QSAR: a case study on acetylcholinesterase

    Science.gov (United States)

    Andersson, C. David; Hillgren, J. Mikael; Lindgren, Cecilia; Qian, Weixing; Akfur, Christine; Berg, Lotta; Ekström, Fredrik; Linusson, Anna

    2015-03-01

    Scientific disciplines such as medicinal- and environmental chemistry, pharmacology, and toxicology deal with the questions related to the effects small organic compounds exhort on biological targets and the compounds' physicochemical properties responsible for these effects. A common strategy in this endeavor is to establish structure-activity relationships (SARs). The aim of this work was to illustrate benefits of performing a statistical molecular design (SMD) and proper statistical analysis of the molecules' properties before SAR and quantitative structure-activity relationship (QSAR) analysis. Our SMD followed by synthesis yielded a set of inhibitors of the enzyme acetylcholinesterase (AChE) that had very few inherent dependencies between the substructures in the molecules. If such dependencies exist, they cause severe errors in SAR interpretation and predictions by QSAR-models, and leave a set of molecules less suitable for future decision-making. In our study, SAR- and QSAR models could show which molecular sub-structures and physicochemical features that were advantageous for the AChE inhibition. Finally, the QSAR model was used for the prediction of the inhibition of AChE by an external prediction set of molecules. The accuracy of these predictions was asserted by statistical significance tests and by comparisons to simple but relevant reference models.

  11. 2D QSAR studies of the inhibitory activity of a series of substituted purine derivatives against c-Src tyrosine kinase

    Directory of Open Access Journals (Sweden)

    Mukesh C. Sharma

    2016-07-01

    Full Text Available A series of 34 substituted purine analogues derivatives were subjected to quantitative structure-activity relationship analyses as inhibitors of c-Src tyrosine kinase. Partial least squares regression was applied to derive QSAR models, which were further validated for statistical significance by internal and external validation. The best QSAR model developed had a good predictive correlation coefficient (r2 of 0.8319, a significant cross-validated correlation coefficient (q2 of 0.7550, and an r2 for the external test set (pred_r2 of 0.7983. It was developed from the PLS method with descriptors including the SsCH3E-index, H-Donor Count, T_2_Cl_3, and negative correlation with SsOHcount. The current study provides better insight into the future design of more potent c-Src tyrosine kinase inhibitors prior to synthesis.

  12. Docking and 3-D QSAR studies on indolyl aryl sulfones. Binding mode exploration at the HIV-1 reverse transcriptase non-nucleoside binding site and design of highly active N-(2-hydroxyethyl)carboxamide and N-(2-hydroxyethyl)carbohydrazide derivatives.

    Science.gov (United States)

    Ragno, Rino; Artico, Marino; De Martino, Gabriella; La Regina, Giuseppe; Coluccia, Antonio; Di Pasquali, Alessandra; Silvestri, Romano

    2005-01-13

    Three-dimensional quantitative structure-activity relationship (3-D QSAR) studies and docking simulations were developed on indolyl aryl sulfones (IASs), a class of novel HIV-1 non-nucleoside reverse transcriptase (RT) inhibitors (Silvestri, et al. J. Med. Chem. 2003, 46, 2482-2493) highly active against wild type and some clinically relevant resistant strains (Y181C, the double mutant K103N-Y181C, and the K103R-V179D-P225H strain, highly resistant to efavirenz). Predictive 3-D QSAR models using the combination of GRID and GOLPE programs were obtained using a receptor-based alignment by means of docking IASs into the non-nucleoside binding site (NNBS) of RT. The derived 3-D QSAR models showed conventional correlation (r(2)) and cross-validated (q(2)) coefficients values ranging from 0.79 to 0.93 and from 0.59 to 0.84, respectively. All described models were validated by an external test set compiled from previously reported pyrryl aryl sulfones (Artico, et al. J. Med. Chem. 1996, 39, 522-530). The most predictive 3-D QSAR model was then used to predict the activity of novel untested IASs. The synthesis of six designed derivatives (prediction set) allowed disclosure of new IASs endowed with high anti-HIV-1 activities.

  13. Development of QSAR models using artificial neural network analysis for risk assessment of repeated-dose, reproductive, and developmental toxicities of cosmetic ingredients.

    Science.gov (United States)

    Hisaki, Tomoka; Aiba Née Kaneko, Maki; Yamaguchi, Masahiko; Sasa, Hitoshi; Kouzuki, Hirokazu

    2015-04-01

    Use of laboratory animals for systemic toxicity testing is subject to strong ethical and regulatory constraints, but few alternatives are yet available. One possible approach to predict systemic toxicity of chemicals in the absence of experimental data is quantitative structure-activity relationship (QSAR) analysis. Here, we present QSAR models for prediction of maximum "no observed effect level" (NOEL) for repeated-dose, developmental and reproductive toxicities. NOEL values of 421 chemicals for repeated-dose toxicity, 315 for reproductive toxicity, and 156 for developmental toxicity were collected from Japan Existing Chemical Data Base (JECDB). Descriptors to predict toxicity were selected based on molecular orbital (MO) calculations, and QSAR models employing multiple independent descriptors as the input layer of an artificial neural network (ANN) were constructed to predict NOEL values. Robustness of the models was indicated by the root-mean-square (RMS) errors after 10-fold cross-validation (0.529 for repeated-dose, 0.508 for reproductive, and 0.558 for developmental toxicity). Evaluation of the models in terms of the percentages of predicted NOELs falling within factors of 2, 5 and 10 of the in-vivo-determined NOELs suggested that the model is applicable to both general chemicals and the subset of chemicals listed in International Nomenclature of Cosmetic Ingredients (INCI). Our results indicate that ANN models using in silico parameters have useful predictive performance, and should contribute to integrated risk assessment of systemic toxicity using a weight-of-evidence approach. Availability of predicted NOELs will allow calculation of the margin of safety, as recommended by the Scientific Committee on Consumer Safety (SCCS).

  14. Introducing Catastrophe-QSAR. Application on Modeling Molecular Mechanisms of Pyridinone Derivative-Type HIV Non-Nucleoside Reverse Transcriptase Inhibitors

    Directory of Open Access Journals (Sweden)

    Marius Lazea

    2011-12-01

    Full Text Available The classical method of quantitative structure-activity relationships (QSAR is enriched using non-linear models, as Thom’s polynomials allow either uni- or bi-variate structural parameters. In this context, catastrophe QSAR algorithms are applied to the anti-HIV-1 activity of pyridinone derivatives. This requires calculation of the so-called relative statistical power and of its minimum principle in various QSAR models. A new index, known as a statistical relative power, is constructed as an Euclidian measure for the combined ratio of the Pearson correlation to algebraic correlation, with normalized t-Student and the Fisher tests. First and second order inter-model paths are considered for mono-variate catastrophes, whereas for bi-variate catastrophes the direct minimum path is provided, allowing the QSAR models to be tested for predictive purposes. At this stage, the max-to-min hierarchies of the tested models allow the interaction mechanism to be identified using structural parameter succession and the typical catastrophes involved. Minimized differences between these catastrophe models in the common structurally influential domains that span both the trial and tested compounds identify the “optimal molecular structural domains” and the molecules with the best output with respect to the modeled activity, which in this case is human immunodeficiency virus type 1 HIV-1 inhibition. The best molecules are characterized by hydrophobic interactions with the HIV-1 p66 subunit protein, and they concur with those identified in other 3D-QSAR analyses. Moreover, the importance of aromatic ring stacking interactions for increasing the binding affinity of the inhibitor-reverse transcriptase ligand-substrate complex is highlighted.

  15. A stepwise approach for defining the applicability domain of SAR and QSAR models

    DEFF Research Database (Denmark)

    Dimitrov, Sabcho; Dimitrova, Gergana; Pavlov, Todor

    2005-01-01

    A stepwise approach for determining the model applicability domain is proposed. Four stages are applied to account for the diversity and complexity of the current SAR/QSAR models, reflecting their mechanistic rationality (including metabolic activation of chemicals) and transparency. General para...

  16. Investigations of Structural Requirements for BRD4 Inhibitors through Ligand- and Structure-Based 3D QSAR Approaches

    Directory of Open Access Journals (Sweden)

    Adeena Tahir

    2018-06-01

    Full Text Available The bromodomain containing protein 4 (BRD4 recognizes acetylated histone proteins and plays numerous roles in the progression of a wide range of cancers, due to which it is under intense investigation as a novel anti-cancer drug target. In the present study, we performed three-dimensional quantitative structure activity relationship (3D-QSAR molecular modeling on a series of 60 inhibitors of BRD4 protein using ligand- and structure-based alignment and different partial charges assignment methods by employing comparative molecular field analysis (CoMFA and comparative molecular similarity indices analysis (CoMSIA approaches. The developed models were validated using various statistical methods, including non-cross validated correlation coefficient (r2, leave-one-out (LOO cross validated correlation coefficient (q2, bootstrapping, and Fisher’s randomization test. The highly reliable and predictive CoMFA (q2 = 0.569, r2 = 0.979 and CoMSIA (q2 = 0.500, r2 = 0.982 models were obtained from a structure-based 3D-QSAR approach using Merck molecular force field (MMFF94. The best models demonstrate that electrostatic and steric fields play an important role in the biological activities of these compounds. Hence, based on the contour maps information, new compounds were designed, and their binding modes were elucidated in BRD4 protein’s active site. Further, the activities and physicochemical properties of the designed molecules were also predicted using the best 3D-QSAR models. We believe that predicted models will help us to understand the structural requirements of BRD4 protein inhibitors that belong to quinolinone and quinazolinone classes for the designing of better active compounds.

  17. Molecular modeling-driven approach for identification of Janus kinase 1 inhibitors through 3D-QSAR, docking and molecular dynamics simulations.

    Science.gov (United States)

    Itteboina, Ramesh; Ballu, Srilata; Sivan, Sree Kanth; Manga, Vijjulatha

    2017-10-01

    Janus kinase 1 (JAK 1) belongs to the JAK family of intracellular nonreceptor tyrosine kinase. JAK-signal transducer and activator of transcription (JAK-STAT) pathway mediate signaling by cytokines, which control survival, proliferation and differentiation of a variety of cells. Three-dimensional quantitative structure activity relationship (3 D-QSAR), molecular docking and molecular dynamics (MD) methods was carried out on a dataset of Janus kinase 1(JAK 1) inhibitors. Ligands were constructed and docked into the active site of protein using GLIDE 5.6. Best docked poses were selected after analysis for further 3 D-QSAR analysis using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) methodology. Employing 60 molecules in the training set, 3 D-QSAR models were generate that showed good statistical reliability, which is clearly observed in terms of r 2 ncv and q 2 loo values. The predictive ability of these models was determined using a test set of 25 molecules that gave acceptable predictive correlation (r 2 Pred ) values. The key amino acid residues were identified by means of molecular docking, and the stability and rationality of the derived molecular conformations were also validated by MD simulation. The good consonance between the docking results and CoMFA/CoMSIA contour maps provides helpful clues about the reasonable modification of molecules in order to design more efficient JAK 1 inhibitors. The developed models are expected to provide some directives for further synthesis of highly effective JAK 1 inhibitors.

  18. Common SAR Derived from Linear and Non-linear QSAR Studies on AChE Inhibitors used in the Treatment of Alzheimer's Disease.

    Science.gov (United States)

    Pulikkal, Babitha Pallikkara; Marunnan, Sahila Mohammed; Bandaru, Srinivas; Yadav, Mukesh; Nayarisseri, Anuraj; Sureshkumar, Sivanpillai

    2017-11-14

    Deficits in cholinergic neurotransmission due to the degeneration of cholinergic neurons in the brain are believed to be one of the major causes of the memory impairments associated with AD. Targeting acetyl cholinesterase (AChE) surfaced as a potential therapeutic target in the treatment of Alzheimer's disease. The present study is pursued to develop quantitative structure activity relationship (QSAR) models to determine chemical descriptors responsible for AChE activity. Two different sets of AChE inhibitors, dataset-I (30 compounds) and dataset-II (20 compounds) were investigated through MLR aided linear and SVM aided non-linear QSAR models. The obtained QSAR models were found statistically fit, stable and predictive on validation scales. These QSAR models were further investigated for their common structure-activity relationship in terms of overlapping molecular descriptors selection. Atomic mass weighted 3D Morse descriptors (MATS5m) and Radial Distribution Function (RDF045m) descriptors were found in common SAR for both the datasets. Electronegativity weighted (MATS5e, HATSe, and Mor17e) descriptors have also been identified in regulative roles towards endpoint values of dataset-I and dataset-II. The common SAR identified in these linear and non-linear QSAR models could be utilized to design novel inhibitors of AChE with improved biological activity. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  19. Electron-correlation based externally predictive QSARs for mutagenicity of nitrated-PAHs in Salmonella typhimurium TA100.

    Science.gov (United States)

    Reenu; Vikas

    2014-03-01

    In quantitative modeling, there are two major aspects that decide reliability and real external predictivity of a structure-activity relationship (SAR) based on quantum chemical descriptors. First, the information encoded in employed molecular descriptors, computed through a quantum-mechanical method, should be precisely estimated. The accuracy of the quantum-mechanical method, however, is dependent upon the amount of electron-correlation it incorporates. Second, the real external predictivity of a developed quantitative SAR (QSAR) should be validated employing an external prediction set. In this work, to analyze the role of electron-correlation, QSAR models are developed for a set of 51 ubiquitous pollutants, namely, nitrated monocyclic and polycyclic aromatic hydrocarbons (nitrated-AHs and PAHs) having mutagenic activity in TA100 strain of Salmonella typhimurium. The quality of the models, through state-of-the-art external validation procedures employing an external prediction set, is compared to the best models known in the literature for mutagenicity. The molecular descriptors whose electron-correlation contribution is analyzed include total energy, energy of HOMO and LUMO, and commonly employed electron-density based descriptors such as chemical hardness, chemical softness, absolute electronegativity and electrophilicity index. The electron-correlation based QSARs are also compared with those developed using quantum-mechanical descriptors computed with advanced semi-empirical (SE) methods such as PM6, PM7, RM1, and ab initio methods, namely, the Hartree-Fock (HF) and the density functional theory (DFT). The models, developed using electron-correlation contribution of the quantum-mechanical descriptors, are found to be not only reliable but also satisfactorily predictive when compared to the existing robust models. The robustness of the models based on descriptors computed through advanced SE methods, is also observed to be comparable to those developed with

  20. Reduced density gradient as a novel approach for estimating QSAR descriptors, and its application to 1, 4-dihydropyridine derivatives with potential antihypertensive effects.

    Science.gov (United States)

    Jardínez, Christiaan; Vela, Alberto; Cruz-Borbolla, Julián; Alvarez-Mendez, Rodrigo J; Alvarado-Rodríguez, José G

    2016-12-01

    The relationship between the chemical structure and biological activity (log IC 50 ) of 40 derivatives of 1,4-dihydropyridines (DHPs) was studied using density functional theory (DFT) and multiple linear regression analysis methods. With the aim of improving the quantitative structure-activity relationship (QSAR) model, the reduced density gradient s( r) of the optimized equilibrium geometries was used as a descriptor to include weak non-covalent interactions. The QSAR model highlights the correlation between the log IC 50 with highest molecular orbital energy (E HOMO ), molecular volume (V), partition coefficient (log P), non-covalent interactions NCI(H4-G) and the dual descriptor [Δf(r)]. The model yielded values of R 2 =79.57 and Q 2 =69.67 that were validated with the next four internal analytical validations DK=0.076, DQ=-0.006, R P =0.056, and R N =0.000, and the external validation Q 2 boot =64.26. The QSAR model found can be used to estimate biological activity with high reliability in new compounds based on a DHP series. Graphical abstract The good correlation between the log IC 50 with the NCI (H4-G) estimated by the reduced density gradient approach of the DHP derivatives.

  1. QSAR Study of Skin Sensitization Using Local Lymph Node Assay Data

    Directory of Open Access Journals (Sweden)

    Eugene Demchuk

    2004-01-01

    Full Text Available Abstract: Allergic Contact Dermatitis (ACD is a common work-related skin disease that often develops as a result of repetitive skin exposures to a sensitizing chemical agent. A variety of experimental tests have been suggested to assess the skin sensitization potential. We applied a method of Quantitative Structure-Activity Relationship (QSAR to relate measured and calculated physical-chemical properties of chemical compounds to their sensitization potential. Using statistical methods, each of these properties, called molecular descriptors, was tested for its propensity to predict the sensitization potential. A few of the most informative descriptors were subsequently selected to build a model of skin sensitization. In this work sensitization data for the murine Local Lymph Node Assay (LLNA were used. In principle, LLNA provides a standardized continuous scale suitable for quantitative assessment of skin sensitization. However, at present many LLNA results are still reported on a dichotomous scale, which is consistent with the scale of guinea pig tests, which were widely used in past years. Therefore, in this study only a dichotomous version of the LLNA data was used. To the statistical end, we relied on the logistic regression approach. This approach provides a statistical tool for investigating and predicting skin sensitization that is expressed only in categorical terms of activity and nonactivity. Based on the data of compounds used in this study, our results suggest a QSAR model of ACD that is based on the following descriptors: nDB (number of double bonds, C-003 (number of CHR3 molecular subfragments, GATS6M (autocorrelation coefficient and HATS6m (GETAWAY descriptor, although the relevance of the identified descriptors to the continuous ACD QSAR has yet to be shown. The proposed QSAR model gives a percentage of positively predicted responses of 83% on the training set of compounds, and in cross validation it correctly identifies 79% of

  2. Application of 4D-QSAR Studies to a Series of Raloxifene Analogs and Design of Potential Selective Estrogen Receptor Modulators

    Directory of Open Access Journals (Sweden)

    Carlos Rangel Rodrigues

    2012-06-01

    Full Text Available Four-dimensional quantitative structure-activity relationship (4D-QSAR analysis was applied on a series of 54 2-arylbenzothiophene derivatives, synthesized by Grese and coworkers, based on raloxifene (an estrogen receptor-alpha antagonist, and evaluated as ERa ligands and as inhibitors of estrogen-stimulated proliferation of MCF-7 breast cancer cells. The conformations of each analogue, sampled from a molecular dynamics simulation, were placed in a grid cell lattice according to three trial alignments, considering two grid cell sizes (1.0 and 2.0 Å. The QSAR equations, generated by a combined scheme of genetic algorithms (GA and partial least squares (PLS regression, were evaluated by “leave-one-out” cross-validation, using a training set of 41 compounds. External validation was performed using a test set of 13 compounds. The obtained 4D-QSAR models are in agreement with the proposed mechanism of action for raloxifene. This study allowed a quantitative prediction of compounds’ potency and supported the design of new raloxifene analogs.

  3. An approach to the interpretation of backpropagation neural network models in QSAR studies.

    Science.gov (United States)

    Baskin, I I; Ait, A O; Halberstam, N M; Palyulin, V A; Zefirov, N S

    2002-03-01

    An approach to the interpretation of backpropagation neural network models for quantitative structure-activity and structure-property relationships (QSAR/QSPR) studies is proposed. The method is based on analyzing the first and second moments of distribution of the values of the first and the second partial derivatives of neural network outputs with respect to inputs calculated at data points. The use of such statistics makes it possible not only to obtain actually the same characteristics as for the case of traditional "interpretable" statistical methods, such as the linear regression analysis, but also to reveal important additional information regarding the non-linear character of QSAR/QSPR relationships. The approach is illustrated by an example of interpreting a backpropagation neural network model for predicting position of the long-wave absorption band of cyane dyes.

  4. QSAR models for thiophene and imidazopyridine derivatives inhibitors of the Polo-Like Kinase 1.

    Science.gov (United States)

    Comelli, Nieves C; Duchowicz, Pablo R; Castro, Eduardo A

    2014-10-01

    The inhibitory activity of 103 thiophene and 33 imidazopyridine derivatives against Polo-Like Kinase 1 (PLK1) expressed as pIC50 (-logIC50) was predicted by QSAR modeling. Multivariate linear regression (MLR) was employed to model the relationship between 0D and 3D molecular descriptors and biological activities of molecules using the replacement method (MR) as variable selection tool. The 136 compounds were separated into several training and test sets. Two splitting approaches, distribution of biological data and structural diversity, and the statistical experimental design procedure D-optimal distance were applied to the dataset. The significance of the training set models was confirmed by statistically higher values of the internal leave one out cross-validated coefficient of determination (Q2) and external predictive coefficient of determination for the test set (Rtest2). The model developed from a training set, obtained with the D-optimal distance protocol and using 3D descriptor space along with activity values, separated chemical features that allowed to distinguish high and low pIC50 values reasonably well. Then, we verified that such model was sufficient to reliably and accurately predict the activity of external diverse structures. The model robustness was properly characterized by means of standard procedures and their applicability domain (AD) was analyzed by leverage method. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Parameters for Pyrethroid Insecticide QSAR and PBPK/PD Models for Human Risk Assessment

    Science.gov (United States)

    This pyrethroid insecticide parameter review is an extension of our interest in developing quantitative structure–activity relationship–physiologically based pharmacokinetic/pharmacodynamic (QSAR-PBPK/PD) models for assessing health risks, which interest started with the organoph...

  6. Application of 3D-QSAR in the rational design of receptor ligands and enzyme inhibitors.

    Science.gov (United States)

    Mor, Marco; Rivara, Silvia; Lodola, Alessio; Lorenzi, Simone; Bordi, Fabrizio; Plazzi, Pier Vincenzo; Spadoni, Gilberto; Bedini, Annalida; Duranti, Andrea; Tontini, Andrea; Tarzia, Giorgio

    2005-11-01

    Quantitative structure-activity relationships (QSARs) are frequently employed in medicinal chemistry projects, both to rationalize structure-activity relationships (SAR) for known series of compounds and to help in the design of innovative structures endowed with desired pharmacological actions. As a difference from the so-called structure-based drug design tools, they do not require the knowledge of the biological target structure, but are based on the comparison of drug structural features, thus being defined ligand-based drug design tools. In the 3D-QSAR approach, structural descriptors are calculated from molecular models of the ligands, as interaction fields within a three-dimensional (3D) lattice of points surrounding the ligand structure. These descriptors are collected in a large X matrix, which is submitted to multivariate analysis to look for correlations with biological activity. Like for other QSARs, the reliability and usefulness of the correlation models depends on the validity of the assumptions and on the quality of the data. A careful selection of compounds and pharmacological data can improve the application of 3D-QSAR analysis in drug design. Some examples of the application of CoMFA and CoMSIA approaches to the SAR study and design of receptor or enzyme ligands is described, pointing the attention to the fields of melatonin receptor ligands and FAAH inhibitors.

  7. OPERA: A free and open source QSAR tool for predicting physicochemical properties and environmental fate endpoints

    Science.gov (United States)

    Collecting the chemical structures and data for necessary QSAR modeling is facilitated by available public databases and open data. However, QSAR model performance is dependent on the quality of data and modeling methodology used. This study developed robust QSAR models for physi...

  8. a QSAR Study

    African Journals Online (AJOL)

    DK

    Une étude Relation Quantitative Structure- Activité (QSAR) a été réalisée pour évaluer la toxicité relative d'un mélange composé de ... of a substance to enter cells through the lipid ..... evaluations of regression based and classification QSARs,.

  9. QSAR models based on quantum topological molecular similarity.

    Science.gov (United States)

    Popelier, P L A; Smith, P J

    2006-07-01

    A new method called quantum topological molecular similarity (QTMS) was fairly recently proposed [J. Chem. Inf. Comp. Sc., 41, 2001, 764] to construct a variety of medicinal, ecological and physical organic QSAR/QSPRs. QTMS method uses quantum chemical topology (QCT) to define electronic descriptors drawn from modern ab initio wave functions of geometry-optimised molecules. It was shown that the current abundance of computing power can be utilised to inject realistic descriptors into QSAR/QSPRs. In this article we study seven datasets of medicinal interest : the dissociation constants (pK(a)) for a set of substituted imidazolines , the pK(a) of imidazoles , the ability of a set of indole derivatives to displace [(3)H] flunitrazepam from binding to bovine cortical membranes , the influenza inhibition constants for a set of benzimidazoles , the interaction constants for a set of amides and the enzyme liver alcohol dehydrogenase , the natriuretic activity of sulphonamide carbonic anhydrase inhibitors and the toxicity of a series of benzyl alcohols. A partial least square analysis in conjunction with a genetic algorithm delivered excellent models. They are also able to highlight the active site, of the ligand or the molecule whose structure determines the activity. The advantages and limitations of QTMS are discussed.

  10. Identification of putative estrogen receptor-mediated endocrine disrupting chemicals using QSAR- and structure-based virtual screening approaches

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Liying; Sedykh, Alexander; Tripathi, Ashutosh [Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC (United States); Zhu, Hao [The Rutgers Center for Computational and Integrative Biology, Rutgers University, Camden, NJ (United States); Department of Chemistry, Rutgers University, Camden, NJ (United States); Afantitis, Antreas; Mouchlis, Varnavas D.; Melagraki, Georgia [NovaMechanics Ltd., Nicosia (Cyprus); Rusyn, Ivan, E-mail: iir@unc.edu [Department of Environmental Sciences and Engineering, University of North Carolina, Chapel Hill, NC (United States); Tropsha, Alexander, E-mail: alex_tropsha@unc.edu [Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC (United States)

    2013-10-01

    Identification of endocrine disrupting chemicals is one of the important goals of environmental chemical hazard screening. We report on the development of validated in silico predictors of chemicals likely to cause estrogen receptor (ER)-mediated endocrine disruption to facilitate their prioritization for future screening. A database of relative binding affinity of a large number of ERα and/or ERβ ligands was assembled (546 for ERα and 137 for ERβ). Both single-task learning (STL) and multi-task learning (MTL) continuous quantitative structure–activity relationship (QSAR) models were developed for predicting ligand binding affinity to ERα or ERβ. High predictive accuracy was achieved for ERα binding affinity (MTL R{sup 2} = 0.71, STL R{sup 2} = 0.73). For ERβ binding affinity, MTL models were significantly more predictive (R{sup 2} = 0.53, p < 0.05) than STL models. In addition, docking studies were performed on a set of ER agonists/antagonists (67 agonists and 39 antagonists for ERα, 48 agonists and 32 antagonists for ERβ, supplemented by putative decoys/non-binders) using the following ER structures (in complexes with respective ligands) retrieved from the Protein Data Bank: ERα agonist (PDB ID: 1L2I), ERα antagonist (PDB ID: 3DT3), ERβ agonist (PDB ID: 2NV7), and ERβ antagonist (PDB ID: 1L2J). We found that all four ER conformations discriminated their corresponding ligands from presumed non-binders. Finally, both QSAR models and ER structures were employed in parallel to virtually screen several large libraries of environmental chemicals to derive a ligand- and structure-based prioritized list of putative estrogenic compounds to be used for in vitro and in vivo experimental validation. - Highlights: • This is the largest curated dataset inclusive of ERα and β (the latter is unique). • New methodology that for the first time affords acceptable ERβ models. • A combination of QSAR and docking enables prediction of affinity and function.

  11. Identification of putative estrogen receptor-mediated endocrine disrupting chemicals using QSAR- and structure-based virtual screening approaches

    International Nuclear Information System (INIS)

    Zhang, Liying; Sedykh, Alexander; Tripathi, Ashutosh; Zhu, Hao; Afantitis, Antreas; Mouchlis, Varnavas D.; Melagraki, Georgia; Rusyn, Ivan; Tropsha, Alexander

    2013-01-01

    Identification of endocrine disrupting chemicals is one of the important goals of environmental chemical hazard screening. We report on the development of validated in silico predictors of chemicals likely to cause estrogen receptor (ER)-mediated endocrine disruption to facilitate their prioritization for future screening. A database of relative binding affinity of a large number of ERα and/or ERβ ligands was assembled (546 for ERα and 137 for ERβ). Both single-task learning (STL) and multi-task learning (MTL) continuous quantitative structure–activity relationship (QSAR) models were developed for predicting ligand binding affinity to ERα or ERβ. High predictive accuracy was achieved for ERα binding affinity (MTL R 2 = 0.71, STL R 2 = 0.73). For ERβ binding affinity, MTL models were significantly more predictive (R 2 = 0.53, p < 0.05) than STL models. In addition, docking studies were performed on a set of ER agonists/antagonists (67 agonists and 39 antagonists for ERα, 48 agonists and 32 antagonists for ERβ, supplemented by putative decoys/non-binders) using the following ER structures (in complexes with respective ligands) retrieved from the Protein Data Bank: ERα agonist (PDB ID: 1L2I), ERα antagonist (PDB ID: 3DT3), ERβ agonist (PDB ID: 2NV7), and ERβ antagonist (PDB ID: 1L2J). We found that all four ER conformations discriminated their corresponding ligands from presumed non-binders. Finally, both QSAR models and ER structures were employed in parallel to virtually screen several large libraries of environmental chemicals to derive a ligand- and structure-based prioritized list of putative estrogenic compounds to be used for in vitro and in vivo experimental validation. - Highlights: • This is the largest curated dataset inclusive of ERα and β (the latter is unique). • New methodology that for the first time affords acceptable ERβ models. • A combination of QSAR and docking enables prediction of affinity and function. • The results

  12. Dataset of curcumin derivatives for QSAR modeling of anti cancer against P388 cell line

    Directory of Open Access Journals (Sweden)

    Yum Eryanti

    2016-12-01

    Full Text Available The dataset of curcumin derivatives consists of 45 compounds (Table 1 with their anti cancer biological activity (IC50 against P388 cell line. 45 curcumin derivatives were used in the model development where 30 of these compounds were in the training set and the remaining 15 compounds were in the test set. The development of the QSAR model involved the use of the multiple linear regression analysis (MLRA method. Based on the method, r2 value, r2 (CV value of 0.81, 0.67 were obtained. The QSAR model was also employed to predict the biological activity of compounds in the test set. Predictive correlation coefficient r2 values of 0.88 were obtained for the test set.

  13. QSAR models for the removal of organic micropollutants in four different river water matrices

    KAUST Repository

    Sudhakaran, Sairam; Calvin, James; Amy, Gary L.

    2012-01-01

    Ozonation is an advanced water treatment process used to remove organic micropollutants (OMPs) such as pharmaceuticals and personal care products (PPCPs). In this study, Quantitative Structure Activity Relationship (QSAR) models, for ozonation

  14. Quantitative structure activity relationships (QSAR) for binary mixtures at non-equitoxic ratios based on toxic ratios-effects curves.

    Science.gov (United States)

    Tian, Dayong; Lin, Zhifen; Yin, Daqiang

    2013-01-01

    The present study proposed a QSAR model to predict joint effects at non-equitoxic ratios for binary mixtures containing reactive toxicants, cyanogenic compounds and aldehydes. Toxicity of single and binary mixtures was measured by quantifying the decrease in light emission from the Photobacterium phosphoreum for 15 min. The joint effects of binary mixtures (TU sum) can thus be obtained. The results showed that the relationships between toxic ratios of the individual chemicals and their joint effects can be described by normal distribution function. Based on normal distribution equations, the joint effects of binary mixtures at non-equitoxic ratios ( [Formula: see text]) can be predicted quantitatively using the joint effects at equitoxic ratios ( [Formula: see text]). Combined with a QSAR model of [Formula: see text]in our previous work, a novel QSAR model can be proposed to predict the joint effects of mixtures at non-equitoxic ratios ( [Formula: see text]). The proposed model has been validated using additional mixtures other than the one used for the development of the model. Predicted and observed results were similar (p>0.05). This study provides an approach to the prediction of joint effects for binary mixtures at non-equitoxic ratios.

  15. Molecular docking and 3D-QSAR studies on inhibitors of DNA damage signaling enzyme human PARP-1.

    Science.gov (United States)

    Fatima, Sabiha; Bathini, Raju; Sivan, Sree Kanth; Manga, Vijjulatha

    2012-08-01

    Poly (ADP-ribose) polymerase-1 (PARP-1) operates in a DNA damage signaling network. Molecular docking and three dimensional-quantitative structure activity relationship (3D-QSAR) studies were performed on human PARP-1 inhibitors. Docked conformation obtained for each molecule was used as such for 3D-QSAR analysis. Molecules were divided into a training set and a test set randomly in four different ways, partial least square analysis was performed to obtain QSAR models using the comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA). Derived models showed good statistical reliability that is evident from their r², q²(loo) and r²(pred) values. To obtain a consensus for predictive ability from all the models, average regression coefficient r²(avg) was calculated. CoMFA and CoMSIA models showed a value of 0.930 and 0.936, respectively. Information obtained from the best 3D-QSAR model was applied for optimization of lead molecule and design of novel potential inhibitors.

  16. Classification of baseline toxicants for QSAR predictions to replace fish acute toxicity studies.

    Science.gov (United States)

    Nendza, Monika; Müller, Martin; Wenzel, Andrea

    2017-03-22

    Fish acute toxicity studies are required for environmental hazard and risk assessment of chemicals by national and international legislations such as REACH, the regulations of plant protection products and biocidal products, or the GHS (globally harmonised system) for classification and labelling of chemicals. Alternative methods like QSARs (quantitative structure-activity relationships) can replace many ecotoxicity tests. However, complete substitution of in vivo animal tests by in silico methods may not be realistic. For the so-called baseline toxicants, it is possible to predict the fish acute toxicity with sufficient accuracy from log K ow and, hence, valid QSARs can replace in vivo testing. In contrast, excess toxicants and chemicals not reliably classified as baseline toxicants require further in silico, in vitro or in vivo assessments. Thus, the critical task is to discriminate between baseline and excess toxicants. For fish acute toxicity, we derived a scheme based on structural alerts and physicochemical property thresholds to classify chemicals as either baseline toxicants (=predictable by QSARs) or as potential excess toxicants (=not predictable by baseline QSARs). The step-wise approach identifies baseline toxicants (true negatives) in a precautionary way to avoid false negative predictions. Therefore, a certain fraction of false positives can be tolerated, i.e. baseline toxicants without specific effects that may be tested instead of predicted. Application of the classification scheme to a new heterogeneous dataset for diverse fish species results in 40% baseline toxicants, 24% excess toxicants and 36% compounds not classified. Thus, we can conclude that replacing about half of the fish acute toxicity tests by QSAR predictions is realistic to be achieved in the short-term. The long-term goals are classification criteria also for further groups of toxicants and to replace as many in vivo fish acute toxicity tests as possible with valid QSAR

  17. Development of Predictive QSAR Models of 4-Thiazolidinones Antitrypanosomal Activity using Modern Machine Learning Algorithms.

    Science.gov (United States)

    Kryshchyshyn, Anna; Devinyak, Oleg; Kaminskyy, Danylo; Grellier, Philippe; Lesyk, Roman

    2017-11-14

    This paper presents novel QSAR models for the prediction of antitrypanosomal activity among thiazolidines and related heterocycles. The performance of four machine learning algorithms: Random Forest regression, Stochastic gradient boosting, Multivariate adaptive regression splines and Gaussian processes regression have been studied in order to reach better levels of predictivity. The results for Random Forest and Gaussian processes regression are comparable and outperform other studied methods. The preliminary descriptor selection with Boruta method improved the outcome of machine learning methods. The two novel QSAR-models developed with Random Forest and Gaussian processes regression algorithms have good predictive ability, which was proved by the external evaluation of the test set with corresponding Q 2 ext =0.812 and Q 2 ext =0.830. The obtained models can be used further for in silico screening of virtual libraries in the same chemical domain in order to find new antitrypanosomal agents. Thorough analysis of descriptors influence in the QSAR models and interpretation of their chemical meaning allows to highlight a number of structure-activity relationships. The presence of phenyl rings with electron-withdrawing atoms or groups in para-position, increased number of aromatic rings, high branching but short chains, high HOMO energy, and the introduction of 1-substituted 2-indolyl fragment into the molecular structure have been recognized as trypanocidal activity prerequisites. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. QSAR models for predicting in vivo aquatic toxicity of chlorinated alkanes to fish

    NARCIS (Netherlands)

    Zvinavashe, E.; Berg, H. van den; Soffers, A.E.M.F.; Vervoort, J.; Freidig, A.; Murk, A.J.; Rietjens, I.M.C.M.

    2008-01-01

    Quantitative structure-activity relationship (QSAR) models are expected to play a crucial role in reducing the number of animals to be used for toxicity testing resulting from the adoption of the new European Union chemical control system called Registration, Evaluation, and Authorization of

  19. Employing conformational analysis in the molecular modeling of agrochemicals: insights on QSAR parameters of 2,4-D

    Directory of Open Access Journals (Sweden)

    Matheus Puggina de Freitas

    2013-12-01

    Full Text Available A common practice to compute ligand conformations of compounds with various degrees of freedom to be used in molecular modeling (QSAR and docking studies is to perform a conformational distribution based on repeated random sampling, such as Monte-Carlo methods. Further calculations are often required. This short review describes some methods used for conformational analysis and the implications of using selected conformations in QSAR. A case study is developed for 2,4-dichlorophenoxyacetic acid (2,4-D, a widely used herbicide which binds to TIR1 ubiquitin ligase enzyme. The use of such an approach and semi-empirical calculations did not achieve all possible minima for 2,4-D. In addition, the conformations and respective energies obtained by the semi-empirical AM1 method do not match the calculated trends obtained by a high level DFT method. Similar findings were obtained for the carboxylate anion, which is the bioactive form. Finally, the crystal bioactive structure of 2,4-D was not found as a minimum when using Monte-Carlo/AM1 and is similarly populated with another conformer in implicit water solution according to optimization at the B3LYP/aug-cc-pVDZ level. Therefore, quantitative structure-activity relationship (QSAR methods based on three dimensional chemical structures are not fundamental to provide predictive models for 2,4-D congeners as TIR1 ubiquitin ligase ligands, since they do not necessarily reflect the bioactive conformation of this molecule. This probably extends to other systems.

  20. Efficient dynamic molecular simulation using QSAR model to know inhibition activity in breast cancer medicine

    Science.gov (United States)

    Zharifah, A.; Kusumowardani, E.; Saputro, A.; Sarwinda, D.

    2017-07-01

    According to data from GLOBOCAN (IARC) at 2012, breast cancer was the highest rated of new cancer case by 43.3 % (after controlled by age), with mortality rated as high as 12.9 %. Oncology is a major field which focusing on improving the development of drug and therapeutics cancer in pharmaceutical and biotechnology companies. Nowadays, many researchers lead to computational chemistry and bioinformatic for pharmacophore generation. A pharmacophore describes as a group of atoms in the molecule which is considered to be responsible for a pharmacological action. Prediction of biological function from chemical structure in silico modeling reduces the use of chemical reagents so the risk of environmental pollution decreased. In this research, we proposed QSAR model to analyze the composition of cancer drugs which assumed to be homogenous in character and treatment. Atomic interactions which analyzed are learned through parameters such as log p as descriptors hydrophobic, n_poinas descriptor contour strength and molecular structure, and also various concentrations inhibitor (micromolar and nanomolar) from NCBI drugs bank. The differences inhibitor activity was observed by the presence of IC 50 residues value from inhibitor substances at various concentration. Then, we got a general overview of the state of safety for drug stability seen from its IC 50 value. In our study, we also compared between micromolar and nanomolar inhibitor effect from QSAR model results. The QSAR model analysis shows that the drug concentration with nanomolar is better than micromolar, related with the content of inhibitor substances concentration. This QSAR model got the equation: Log 1/IC50 = (0.284) (±0.195) logP + (0.02) (±0.012) n_poin + (-0.005) (±0.083) Inhibition10.2nanoM + (0.1) (±0.079) Inhibition30.5nanoM + (-0.016) (±0.045) Inhibition91.5nanoM + (-2.572) (±1.570) (n = 13; r = 0.813; r2 = 0.660; s = 0.764; F = 2.720; q2 = 0.660).

  1. Categorical QSAR models for skin sensitization based on local lymph node assay measures and both ground and excited state 4D-fingerprint descriptors

    Science.gov (United States)

    Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Santos-Filho, Osvaldo A.; Esposito, Emilio X.; Hopfinger, Anton J.; Tseng, Yufeng J.

    2008-06-01

    In previous studies we have developed categorical QSAR models for predicting skin-sensitization potency based on 4D-fingerprint (4D-FP) descriptors and in vivo murine local lymph node assay (LLNA) measures. Only 4D-FP derived from the ground state (GMAX) structures of the molecules were used to build the QSAR models. In this study we have generated 4D-FP descriptors from the first excited state (EMAX) structures of the molecules. The GMAX, EMAX and the combined ground and excited state 4D-FP descriptors (GEMAX) were employed in building categorical QSAR models. Logistic regression (LR) and partial least square coupled logistic regression (PLS-CLR), found to be effective model building for the LLNA skin-sensitization measures in our previous studies, were used again in this study. This also permitted comparison of the prior ground state models to those involving first excited state 4D-FP descriptors. Three types of categorical QSAR models were constructed for each of the GMAX, EMAX and GEMAX datasets: a binary model (2-state), an ordinal model (3-state) and a binary-binary model (two-2-state). No significant differences exist among the LR 2-state model constructed for each of the three datasets. However, the PLS-CLR 3-state and 2-state models based on the EMAX and GEMAX datasets have higher predictivity than those constructed using only the GMAX dataset. These EMAX and GMAX categorical models are also more significant and predictive than corresponding models built in our previous QSAR studies of LLNA skin-sensitization measures.

  2. Integration of QSAR models for bioconcentration suitable for REACH

    Energy Technology Data Exchange (ETDEWEB)

    Gissi, Andrea [Laboratory of Chemistry and Environmental Toxicology, IRCCS - Istituto di Ricerche Farmacologiche “Mario Negri”, via Giuseppe La Masa 19, 20156 Milan (Italy); Dipartimento di Farmacia — Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, via Orabona 4, I-70125 Bari (Italy); Nicolotti, Orazio; Carotti, Angelo; Gadaleta, Domenico [Dipartimento di Farmacia — Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, via Orabona 4, I-70125 Bari (Italy); Lombardo, Anna [Laboratory of Chemistry and Environmental Toxicology, IRCCS - Istituto di Ricerche Farmacologiche “Mario Negri”, via Giuseppe La Masa 19, 20156 Milan (Italy); Benfenati, Emilio, E-mail: benfenati@marionegri.it [Laboratory of Chemistry and Environmental Toxicology, IRCCS - Istituto di Ricerche Farmacologiche “Mario Negri”, via Giuseppe La Masa 19, 20156 Milan (Italy)

    2013-07-01

    QSAR (Quantitative Structure Activity Relationship) models can be a valuable alternative method to replace or reduce animal test required by REACH. In particular, some endpoints such as bioconcentration factor (BCF) are easier to predict and many useful models have been already developed. In this paper we describe how to integrate two popular BCF models to obtain more reliable predictions. In particular, the herein presented integrated model relies on the predictions of two among the most used BCF models (CAESAR and Meylan), together with the Applicability Domain Index (ADI) provided by the software VEGA. Using a set of simple rules, the integrated model selects the most reliable and conservative predictions and discards possible outliers. In this way, for the prediction of the 851 compounds included in the ANTARES BCF dataset, the integrated model discloses a R{sup 2} (coefficient of determination) of 0.80, a RMSE (Root Mean Square Error) of 0.61 log units and a sensitivity of 76%, with a considerable improvement in respect to the CAESAR (R{sup 2} = 0.63; RMSE = 0.84 log units; sensitivity 55%) and Meylan (R{sup 2} = 0.66; RMSE = 0.77 log units; sensitivity 65%) without discarding too many predictions (118 out of 851). Importantly, considering solely the compounds within the new integrated ADI, the R{sup 2} increased to 0.92, and the sensitivity to 85%, with a RMSE of 0.44 log units. Finally, the use of properly set safety thresholds applied for monitoring the so called “suspicious” compounds, which are those chemicals predicted in proximity of the border normally accepted to discern non-bioaccumulative from bioaccumulative substances, permitted to obtain an integrated model with sensitivity equal to 100%. - Highlights: • Applying two independent QSAR models for bioconcentration factor increases the prediction. • The concordance of the models is an important component of the integration. • The measurement of the applicability domain improves the

  3. Integration of QSAR models for bioconcentration suitable for REACH

    International Nuclear Information System (INIS)

    Gissi, Andrea; Nicolotti, Orazio; Carotti, Angelo; Gadaleta, Domenico; Lombardo, Anna; Benfenati, Emilio

    2013-01-01

    QSAR (Quantitative Structure Activity Relationship) models can be a valuable alternative method to replace or reduce animal test required by REACH. In particular, some endpoints such as bioconcentration factor (BCF) are easier to predict and many useful models have been already developed. In this paper we describe how to integrate two popular BCF models to obtain more reliable predictions. In particular, the herein presented integrated model relies on the predictions of two among the most used BCF models (CAESAR and Meylan), together with the Applicability Domain Index (ADI) provided by the software VEGA. Using a set of simple rules, the integrated model selects the most reliable and conservative predictions and discards possible outliers. In this way, for the prediction of the 851 compounds included in the ANTARES BCF dataset, the integrated model discloses a R 2 (coefficient of determination) of 0.80, a RMSE (Root Mean Square Error) of 0.61 log units and a sensitivity of 76%, with a considerable improvement in respect to the CAESAR (R 2 = 0.63; RMSE = 0.84 log units; sensitivity 55%) and Meylan (R 2 = 0.66; RMSE = 0.77 log units; sensitivity 65%) without discarding too many predictions (118 out of 851). Importantly, considering solely the compounds within the new integrated ADI, the R 2 increased to 0.92, and the sensitivity to 85%, with a RMSE of 0.44 log units. Finally, the use of properly set safety thresholds applied for monitoring the so called “suspicious” compounds, which are those chemicals predicted in proximity of the border normally accepted to discern non-bioaccumulative from bioaccumulative substances, permitted to obtain an integrated model with sensitivity equal to 100%. - Highlights: • Applying two independent QSAR models for bioconcentration factor increases the prediction. • The concordance of the models is an important component of the integration. • The measurement of the applicability domain improves the prediction. • The use of a

  4. Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF studies.

    Science.gov (United States)

    Manoharan, Prabu; Vijayan, R S K; Ghoshal, Nanda

    2010-10-01

    The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of QSAR paradigm, and to rationalize FBDD using In silico approach, we propose a fragment based QSAR methodology referred here in as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 Hydroxy ethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off target activity values. A heat map constructed, based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenario, and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties, based on the defined objectives. MO-QSPR was implemented using Derringer and Suich desirability algorithm to identify the optimal level of independent variables (X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potentials of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study was reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.

  5. Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF studies

    Science.gov (United States)

    Manoharan, Prabu; Vijayan, R. S. K.; Ghoshal, Nanda

    2010-10-01

    The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of QSAR paradigm, and to rationalize FBDD using In silico approach, we propose a fragment based QSAR methodology referred here in as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 Hydroxy ethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off target activity values. A heat map constructed, based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenario, and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties, based on the defined objectives. MO-QSPR was implemented using Derringer and Suich desirability algorithm to identify the optimal level of independent variables ( X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potentials of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study was reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.

  6. Receptor-based 3D-QSAR in Drug Design: Methods and Applications in Kinase Studies.

    Science.gov (United States)

    Fang, Cheng; Xiao, Zhiyan

    2016-01-01

    Receptor-based 3D-QSAR strategy represents a superior integration of structure-based drug design (SBDD) and three-dimensional quantitative structure-activity relationship (3D-QSAR) analysis. It combines the accurate prediction of ligand poses by the SBDD approach with the good predictability and interpretability of statistical models derived from the 3D-QSAR approach. Extensive efforts have been devoted to the development of receptor-based 3D-QSAR methods and two alternative approaches have been exploited. One associates with computing the binding interactions between a receptor and a ligand to generate structure-based descriptors for QSAR analyses. The other concerns the application of various docking protocols to generate optimal ligand poses so as to provide reliable molecular alignments for the conventional 3D-QSAR operations. This review highlights new concepts and methodologies recently developed in the field of receptorbased 3D-QSAR, and in particular, covers its application in kinase studies.

  7. Public (Q)SAR Services, Integrated Modeling Environments, and Model Repositories on the Web: State of the Art and Perspectives for Future Development.

    Science.gov (United States)

    Tetko, Igor V; Maran, Uko; Tropsha, Alexander

    2017-03-01

    Thousands of (Quantitative) Structure-Activity Relationships (Q)SAR models have been described in peer-reviewed publications; however, this way of sharing seldom makes models available for the use by the research community outside of the developer's laboratory. Conversely, on-line models allow broad dissemination and application representing the most effective way of sharing the scientific knowledge. Approaches for sharing and providing on-line access to models range from web services created by individual users and laboratories to integrated modeling environments and model repositories. This emerging transition from the descriptive and informative, but "static", and for the most part, non-executable print format to interactive, transparent and functional delivery of "living" models is expected to have a transformative effect on modern experimental research in areas of scientific and regulatory use of (Q)SAR models. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Rational design of methicillin resistance staphylococcus aureus inhibitors through 3D-QSAR, molecular docking and molecular dynamics simulations.

    Science.gov (United States)

    Ballu, Srilata; Itteboina, Ramesh; Sivan, Sree Kanth; Manga, Vijjulatha

    2018-04-01

    Staphylococcus aureus is a gram positive bacterium. It is the leading cause of skin and respiratory infections, osteomyelitis, Ritter's disease, endocarditis, and bacteraemia in the developed world. We employed combined studies of 3D QSAR, molecular docking which are validated by molecular dynamics simulations and in silico ADME prediction have been performed on Isothiazoloquinolones inhibitors against methicillin resistance Staphylococcus aureus. Three-dimensional quantitative structure-activity relationship (3D-QSAR) study was applied using comparative molecular field analysis (CoMFA) with Q 2 of 0.578, R 2 of 0.988, and comparative molecular similarity indices analysis (CoMSIA) with Q 2 of 0.554, R 2 of 0.975. The predictive ability of these model was determined using a test set of molecules that gave acceptable predictive correlation (r 2 Pred) values 0.55 and 0.57 of CoMFA and CoMSIA respectively. Docking, simulations were employed to position the inhibitors into protein active site to find out the most probable binding mode and most reliable conformations. Developed models and Docking methods provide guidance to design molecules with enhanced activity. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Exploring possible mechanisms of action for the nanotoxicity and protein binding of decorated nanotubes: interpretation of physicochemical properties from optimal QSAR models.

    Science.gov (United States)

    Esposito, Emilio Xavier; Hopfinger, Anton J; Shao, Chi-Yu; Su, Bo-Han; Chen, Sing-Zuo; Tseng, Yufeng Jane

    2015-10-01

    Carbon nanotubes have become widely used in a variety of applications including biosensors and drug carriers. Therefore, the issue of carbon nanotube toxicity is increasingly an area of focus and concern. While previous studies have focused on the gross mechanisms of action relating to nanomaterials interacting with biological entities, this study proposes detailed mechanisms of action, relating to nanotoxicity, for a series of decorated (functionalized) carbon nanotube complexes based on previously reported QSAR models. Possible mechanisms of nanotoxicity for six endpoints (bovine serum albumin, carbonic anhydrase, chymotrypsin, hemoglobin along with cell viability and nitrogen oxide production) have been extracted from the corresponding optimized QSAR models. The molecular features relevant to each of the endpoint respective mechanism of action for the decorated nanotubes are also discussed. Based on the molecular information contained within the optimal QSAR models for each nanotoxicity endpoint, either the decorator attached to the nanotube is directly responsible for the expression of a particular activity, irrespective of the decorator's 3D-geometry and independent of the nanotube, or those decorators having structures that place the functional groups of the decorators as far as possible from the nanotube surface most strongly influence the biological activity. These molecular descriptors are further used to hypothesize specific interactions involved in the expression of each of the six biological endpoints. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Novel 1,4-naphthoquinone-based sulfonamides: Synthesis, QSAR, anticancer and antimalarial studies.

    Science.gov (United States)

    Pingaew, Ratchanok; Prachayasittikul, Veda; Worachartcheewan, Apilak; Nantasenamat, Chanin; Prachayasittikul, Supaluk; Ruchirawat, Somsak; Prachayasittikul, Virapong

    2015-10-20

    A novel series of 1,4-naphthoquinones (33-44) tethered by open and closed chain sulfonamide moieties were designed, synthesized and evaluated for their cytotoxic and antimalarial activities. All quinone-sulfonamide derivatives displayed a broad spectrum of cytotoxic activities against all of the tested cancer cell lines including HuCCA-1, HepG2, A549 and MOLT-3. Most quinones (33-36 and 38-43) exerted higher anticancer activity against HepG2 cell than that of the etoposide. The open chain analogs 36 and 42 were shown to be the most potent compounds. Notably, the restricted sulfonamide analog 38 with 6,7-dimethoxy groups exhibited the most potent antimalarial activity (IC₅₀ = 2.8 μM). Quantitative structure-activity relationships (QSAR) study was performed to reveal important chemical features governing the biological activities. Five constructed QSAR models provided acceptable predictive performance (Rcv 0.5647-0.9317 and RMSEcv 0.1231-0.2825). Four additional sets of structurally modified compounds were generated in silico (34a-34d, 36a-36k, 40a-40d and 42a-42k) in which their activities were predicted using the constructed QSAR models. A comprehensive discussion of the structure-activity relationships was made and a set of promising compounds (i.e., 33, 36, 38, 42, 36d, 36f, 42e, 42g and 42f) was suggested for further development as anticancer and antimalarial agents. Copyright © 2015 Elsevier Masson SAS. All rights reserved.

  11. QSAR studies for the acute toxicity of nitrobenzenes to the Tetrahymena pyriformis

    Directory of Open Access Journals (Sweden)

    Wang Dan-Dan

    2014-01-01

    Full Text Available Quantitative structure-activity relationship (QSAR models play a key role in finding the relationship between molecular structures and the toxicity of nitrobenzenes to Tetrahymena pyriformis. In this work, genetic algorithm, along with partial least square (GA-PLS was employed to select optimal subset of descriptors that have significant contribution to the toxicity of nitrobenzenes to Tetrahymena pyriformis. A set of five descriptors, namely G2, HOMT, G(Cl…Cl, Mor03v and MAXDP, was used for the prediction of the toxicity of 45 nitrobenzene derivatives and then were used to build the model by multiple linear regression (MLR method. It turned out that the built model, whose stability was confirmed using the leave-one-out validation and external validation test, showed high statistical significance (R2=0.963, Q2LOO=0.944. Moreover, Y-scrambling test indicated there was no chance correlation in this model.

  12. T-scale as a novel vector of topological descriptors for amino acids and its application in QSARs of peptides

    Science.gov (United States)

    Tian, Feifei; Zhou, Peng; Li, Zhiliang

    2007-03-01

    In this paper, a new topological descriptor T-scale is derived from principal component analysis (PCA) on the collected 67 kinds of structural and topological variables of 135 amino acids. Applying T-scale to three peptide panels as 58 angiotensin-converting enzyme (ACE) inhibitors, 20 thromboplastin inhibitors (TI) and 28 bovine lactoferricin-(17-31)-pentadecapeptides (LFB), the resulting QSAR models, constructed by partial least squares (PLS), are all superior to reference reports, with correlative coefficient r2 and cross-validated q2 of 0.845, 0.786; 0.996, 0.782 (0.988, 0.961); 0.760, 0.627, respectively.

  13. Sparse QSAR modelling methods for therapeutic and regenerative medicine

    Science.gov (United States)

    Winkler, David A.

    2018-02-01

    The quantitative structure-activity relationships method was popularized by Hansch and Fujita over 50 years ago. The usefulness of the method for drug design and development has been shown in the intervening years. As it was developed initially to elucidate which molecular properties modulated the relative potency of putative agrochemicals, and at a time when computing resources were scarce, there is much scope for applying modern mathematical methods to improve the QSAR method and to extending the general concept to the discovery and optimization of bioactive molecules and materials more broadly. I describe research over the past two decades where we have rebuilt the unit operations of the QSAR method using improved mathematical techniques, and have applied this valuable platform technology to new important areas of research and industry such as nanoscience, omics technologies, advanced materials, and regenerative medicine. This paper was presented as the 2017 ACS Herman Skolnik lecture.

  14. QSAR studies in the discovery of novel type-II diabetic therapies.

    Science.gov (United States)

    Abuhammad, Areej; Taha, Mutasem O

    2016-01-01

    Type-II diabetes mellitus (T2DM) is a complex chronic disease that represents a major therapeutic challenge. Despite extensive efforts in T2DM drug development, therapies remain unsatisfactory. Currently, there are many novel and important antidiabetic drug targets under investigation by many research groups worldwide. One of the main challenges to develop effective orally active hypoglycemic agents is off-target effects. Computational tools have impacted drug discovery at many levels. One of the earliest methods is quantitative structure-activity relationship (QSAR) studies. QSAR strategies help medicinal chemists understand the relationship between hypoglycemic activity and molecular properties. Hence, QSAR may hold promise in guiding the synthesis of specifically designed novel ligands that demonstrate high potency and target selectivity. This review aims to provide an overview of the QSAR strategies used to model antidiabetic agents. In particular, this review focuses on drug targets that raised recent scientific interest and/or led to successful antidiabetic agents in the market. Special emphasis has been made on studies that led to the identification of novel antidiabetic scaffolds. Computer-aided molecular design and discovery techniques like QSAR have a great potential in designing leads against complex diseases such as T2DM. Combined with other in silico techniques, QSAR can provide more useful and rational insights to facilitate the discovery of novel compounds. However, since T2DM is a complex disease that includes several faulty biological targets, multi-target QSAR studies are recommended in the future to achieve efficient antidiabetic therapies.

  15. Integrated QSAR study for inhibitors of Hedgehog Signal Pathway against multiple cell lines:a collaborative filtering method.

    Science.gov (United States)

    Gao, Jun; Che, Dongsheng; Zheng, Vincent W; Zhu, Ruixin; Liu, Qi

    2012-07-31

    The Hedgehog Signaling Pathway is one of signaling pathways that are very important to embryonic development. The participation of inhibitors in the Hedgehog Signal Pathway can control cell growth and death, and searching novel inhibitors to the functioning of the pathway are in a great demand. As the matter of fact, effective inhibitors could provide efficient therapies for a wide range of malignancies, and targeting such pathway in cells represents a promising new paradigm for cell growth and death control. Current research mainly focuses on the syntheses of the inhibitors of cyclopamine derivatives, which bind specifically to the Smo protein, and can be used for cancer therapy. While quantitatively structure-activity relationship (QSAR) studies have been performed for these compounds among different cell lines, none of them have achieved acceptable results in the prediction of activity values of new compounds. In this study, we proposed a novel collaborative QSAR model for inhibitors of the Hedgehog Signaling Pathway by integration the information from multiple cell lines. Such a model is expected to substantially improve the QSAR ability from single cell lines, and provide useful clues in developing clinically effective inhibitors and modifications of parent lead compounds for target on the Hedgehog Signaling Pathway. In this study, we have presented: (1) a collaborative QSAR model, which is used to integrate information among multiple cell lines to boost the QSAR results, rather than only a single cell line QSAR modeling. Our experiments have shown that the performance of our model is significantly better than single cell line QSAR methods; and (2) an efficient feature selection strategy under such collaborative environment, which can derive the commonly important features related to the entire given cell lines, while simultaneously showing their specific contributions to a specific cell-line. Based on feature selection results, we have proposed several

  16. QSAR studies of some side chain modified 7-chloro-4-aminoquinolines as antimalarial agents

    Directory of Open Access Journals (Sweden)

    Nitendra K. Sahu

    2014-11-01

    Full Text Available The quantitative structure–activity relationship (QSAR analyses were carried out for a series of new side chain modified 4-amino-7-chloroquinolines to find out the structural requirements of their antimalarial activities against both chloroquine sensitive (HB3 and resistant (Dd2 Plasmodium falciparum strain. The statistically significant best 2D QSAR models for Dd2, having correlation coefficient (r2 = 0.9188 and cross validated squared correlation coefficient (q2 = 0.8349 with external predictive ability (pred_r2 = 0.7258 and for HB3, having r2 = 0.9024, q2 = 0.8089 and pred_r2 = 0.7463 were developed by multiple linear regression coupled with genetic algorithm (GA–MLR and stepwise (SW–MLR forward algorithm, respectively. The results of the present study may be useful on the designing of more potent analogues as antimalarial agents.

  17. Adaptive Neuro-Fuzzy Inference System Applied QSAR with Quantum Chemical Descriptors for Predicting Radical Scavenging Activities of Carotenoids.

    Science.gov (United States)

    Jhin, Changho; Hwang, Keum Taek

    2015-01-01

    One of the physiological characteristics of carotenoids is their radical scavenging activity. In this study, the relationship between radical scavenging activities and quantum chemical descriptors of carotenoids was determined. Adaptive neuro-fuzzy inference system (ANFIS) applied quantitative structure-activity relationship models (QSAR) were also developed for predicting and comparing radical scavenging activities of carotenoids. Semi-empirical PM6 and PM7 quantum chemical calculations were done by MOPAC. Ionisation energies of neutral and monovalent cationic carotenoids and the product of chemical potentials of neutral and monovalent cationic carotenoids were significantly correlated with the radical scavenging activities, and consequently these descriptors were used as independent variables for the QSAR study. The ANFIS applied QSAR models were developed with two triangular-shaped input membership functions made for each of the independent variables and optimised by a backpropagation method. High prediction efficiencies were achieved by the ANFIS applied QSAR. The R-square values of the developed QSAR models with the variables calculated by PM6 and PM7 methods were 0.921 and 0.902, respectively. The results of this study demonstrated reliabilities of the selected quantum chemical descriptors and the significance of QSAR models.

  18. Adaptive Neuro-Fuzzy Inference System Applied QSAR with Quantum Chemical Descriptors for Predicting Radical Scavenging Activities of Carotenoids.

    Directory of Open Access Journals (Sweden)

    Changho Jhin

    Full Text Available One of the physiological characteristics of carotenoids is their radical scavenging activity. In this study, the relationship between radical scavenging activities and quantum chemical descriptors of carotenoids was determined. Adaptive neuro-fuzzy inference system (ANFIS applied quantitative structure-activity relationship models (QSAR were also developed for predicting and comparing radical scavenging activities of carotenoids. Semi-empirical PM6 and PM7 quantum chemical calculations were done by MOPAC. Ionisation energies of neutral and monovalent cationic carotenoids and the product of chemical potentials of neutral and monovalent cationic carotenoids were significantly correlated with the radical scavenging activities, and consequently these descriptors were used as independent variables for the QSAR study. The ANFIS applied QSAR models were developed with two triangular-shaped input membership functions made for each of the independent variables and optimised by a backpropagation method. High prediction efficiencies were achieved by the ANFIS applied QSAR. The R-square values of the developed QSAR models with the variables calculated by PM6 and PM7 methods were 0.921 and 0.902, respectively. The results of this study demonstrated reliabilities of the selected quantum chemical descriptors and the significance of QSAR models.

  19. QSAR study on the antimalarial activity of Plasmodium falciparum dihydroorotate dehydrogenase (PfDHODH) inhibitors.

    Science.gov (United States)

    Hou, X; Chen, X; Zhang, M; Yan, A

    2016-01-01

    Plasmodium falciparum, the most fatal parasite that causes malaria, is responsible for over one million deaths per year. P. falciparum dihydroorotate dehydrogenase (PfDHODH) has been validated as a promising drug development target for antimalarial therapy since it catalyzes the rate-limiting step for DNA and RNA biosynthesis. In this study, we investigated the quantitative structure-activity relationships (QSAR) of the antimalarial activity of PfDHODH inhibitors by generating four computational models using a multilinear regression (MLR) and a support vector machine (SVM) based on a dataset of 255 PfDHODH inhibitors. All the models display good prediction quality with a leave-one-out q(2) >0.66, a correlation coefficient (r) >0.85 on both training sets and test sets, and a mean square error (MSE) antimalarial activity. The models are capable of predicting inhibitors' antimalarial activity and the molecular descriptors for building the models could be helpful in the development of new antimalarial drugs.

  20. DemQSAR: predicting human volume of distribution and clearance of drugs.

    Science.gov (United States)

    Demir-Kavuk, Ozgur; Bentzien, Jörg; Muegge, Ingo; Knapp, Ernst-Walter

    2011-12-01

    In silico methods characterizing molecular compounds with respect to pharmacologically relevant properties can accelerate the identification of new drugs and reduce their development costs. Quantitative structure-activity/-property relationship (QSAR/QSPR) correlate structure and physico-chemical properties of molecular compounds with a specific functional activity/property under study. Typically a large number of molecular features are generated for the compounds. In many cases the number of generated features exceeds the number of molecular compounds with known property values that are available for learning. Machine learning methods tend to overfit the training data in such situations, i.e. the method adjusts to very specific features of the training data, which are not characteristic for the considered property. This problem can be alleviated by diminishing the influence of unimportant, redundant or even misleading features. A better strategy is to eliminate such features completely. Ideally, a molecular property can be described by a small number of features that are chemically interpretable. The purpose of the present contribution is to provide a predictive modeling approach, which combines feature generation, feature selection, model building and control of overtraining into a single application called DemQSAR. DemQSAR is used to predict human volume of distribution (VD(ss)) and human clearance (CL). To control overtraining, quadratic and linear regularization terms were employed. A recursive feature selection approach is used to reduce the number of descriptors. The prediction performance is as good as the best predictions reported in the recent literature. The example presented here demonstrates that DemQSAR can generate a model that uses very few features while maintaining high predictive power. A standalone DemQSAR Java application for model building of any user defined property as well as a web interface for the prediction of human VD(ss) and CL is

  1. Novel qsar combination forecast model for insect repellent coupling support vector regression and k-nearest-neighbor

    International Nuclear Information System (INIS)

    Wang, L.F.; Bai, L.Y.

    2013-01-01

    To improve the precision of quantitative structure-activity relationship (QSAR) modeling for aromatic carboxylic acid derivatives insect repellent, a novel nonlinear combination forecast model was proposed integrating support vector regression (SVR) and K-nearest neighbor (KNN): Firstly, search optimal kernel function and nonlinearly select molecular descriptors by the rule of minimum MSE value using SVR. Secondly, illuminate the effects of all descriptors on biological activity by multi-round enforcement resistance-selection. Thirdly, construct the sub-models with predicted values of different KNN. Then, get the optimal kernel and corresponding retained sub-models through subtle selection. Finally, make prediction with leave-one-out (LOO) method in the basis of reserved sub-models. Compared with previous widely used models, our work shows significant improvement in modeling performance, which demonstrates the superiority of the present combination forecast model. (author)

  2. OPERA models for predicting physicochemical properties and environmental fate endpoints.

    Science.gov (United States)

    Mansouri, Kamel; Grulke, Chris M; Judson, Richard S; Williams, Antony J

    2018-03-08

    The collection of chemical structure information and associated experimental data for quantitative structure-activity/property relationship (QSAR/QSPR) modeling is facilitated by an increasing number of public databases containing large amounts of useful data. However, the performance of QSAR models highly depends on the quality of the data and modeling methodology used. This study aims to develop robust QSAR/QSPR models for chemical properties of environmental interest that can be used for regulatory purposes. This study primarily uses data from the publicly available PHYSPROP database consisting of a set of 13 common physicochemical and environmental fate properties. These datasets have undergone extensive curation using an automated workflow to select only high-quality data, and the chemical structures were standardized prior to calculation of the molecular descriptors. The modeling procedure was developed based on the five Organization for Economic Cooperation and Development (OECD) principles for QSAR models. A weighted k-nearest neighbor approach was adopted using a minimum number of required descriptors calculated using PaDEL, an open-source software. The genetic algorithms selected only the most pertinent and mechanistically interpretable descriptors (2-15, with an average of 11 descriptors). The sizes of the modeled datasets varied from 150 chemicals for biodegradability half-life to 14,050 chemicals for logP, with an average of 3222 chemicals across all endpoints. The optimal models were built on randomly selected training sets (75%) and validated using fivefold cross-validation (CV) and test sets (25%). The CV Q 2 of the models varied from 0.72 to 0.95, with an average of 0.86 and an R 2 test value from 0.71 to 0.96, with an average of 0.82. Modeling and performance details are described in QSAR model reporting format and were validated by the European Commission's Joint Research Center to be OECD compliant. All models are freely available as an open

  3. Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR.

    Science.gov (United States)

    Winkler, David A; Le, Tu C

    2017-01-01

    Neural networks have generated valuable Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models for a wide variety of small molecules and materials properties. They have grown in sophistication and many of their initial problems have been overcome by modern mathematical techniques. QSAR studies have almost always used so-called "shallow" neural networks in which there is a single hidden layer between the input and output layers. Recently, a new and potentially paradigm-shifting type of neural network based on Deep Learning has appeared. Deep learning methods have generated impressive improvements in image and voice recognition, and are now being applied to QSAR and QSAR modelling. This paper describes the differences in approach between deep and shallow neural networks, compares their abilities to predict the properties of test sets for 15 large drug data sets (the kaggle set), discusses the results in terms of the Universal Approximation theorem for neural networks, and describes how DNN may ameliorate or remove troublesome "activity cliffs" in QSAR data sets. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Combinatorial Pharmacophore-Based 3D-QSAR Analysis and Virtual Screening of FGFR1 Inhibitors

    Directory of Open Access Journals (Sweden)

    Nannan Zhou

    2015-06-01

    Full Text Available The fibroblast growth factor/fibroblast growth factor receptor (FGF/FGFR signaling pathway plays crucial roles in cell proliferation, angiogenesis, migration, and survival. Aberration in FGFRs correlates with several malignancies and disorders. FGFRs have proved to be attractive targets for therapeutic intervention in cancer, and it is of high interest to find FGFR inhibitors with novel scaffolds. In this study, a combinatorial three-dimensional quantitative structure-activity relationship (3D-QSAR model was developed based on previously reported FGFR1 inhibitors with diverse structural skeletons. This model was evaluated for its prediction performance on a diverse test set containing 232 FGFR inhibitors, and it yielded a SD value of 0.75 pIC50 units from measured inhibition affinities and a Pearson’s correlation coefficient R2 of 0.53. This result suggests that the combinatorial 3D-QSAR model could be used to search for new FGFR1 hit structures and predict their potential activity. To further evaluate the performance of the model, a decoy set validation was used to measure the efficiency of the model by calculating EF (enrichment factor. Based on the combinatorial pharmacophore model, a virtual screening against SPECS database was performed. Nineteen novel active compounds were successfully identified, which provide new chemical starting points for further structural optimization of FGFR1 inhibitors.

  5. Characterization and validation of an in silico toxicology model to predict the mutagenic potential of drug impurities*

    Energy Technology Data Exchange (ETDEWEB)

    Valerio, Luis G., E-mail: luis.valerio@fda.hhs.gov [Science and Research Staff, Office of Pharmaceutical Science, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993–0002 (United States); Cross, Kevin P. [Leadscope, Inc., 1393 Dublin Road, Columbus, OH, 43215–1084 (United States)

    2012-05-01

    Control and minimization of human exposure to potential genotoxic impurities found in drug substances and products is an important part of preclinical safety assessments of new drug products. The FDA's 2008 draft guidance on genotoxic and carcinogenic impurities in drug substances and products allows use of computational quantitative structure–activity relationships (QSAR) to identify structural alerts for known and expected impurities present at levels below qualified thresholds. This study provides the information necessary to establish the practical use of a new in silico toxicology model for predicting Salmonella t. mutagenicity (Ames assay outcome) of drug impurities and other chemicals. We describe the model's chemical content and toxicity fingerprint in terms of compound space, molecular and structural toxicophores, and have rigorously tested its predictive power using both cross-validation and external validation experiments, as well as case studies. Consistent with desired regulatory use, the model performs with high sensitivity (81%) and high negative predictivity (81%) based on external validation with 2368 compounds foreign to the model and having known mutagenicity. A database of drug impurities was created from proprietary FDA submissions and the public literature which found significant overlap between the structural features of drug impurities and training set chemicals in the QSAR model. Overall, the model's predictive performance was found to be acceptable for screening drug impurities for Salmonella mutagenicity. -- Highlights: ► We characterize a new in silico model to predict mutagenicity of drug impurities. ► The model predicts Salmonella mutagenicity and will be useful for safety assessment. ► We examine toxicity fingerprints and toxicophores of this Ames assay model. ► We compare these attributes to those found in drug impurities known to FDA/CDER. ► We validate the model and find it has a desired predictive

  6. Characterization and validation of an in silico toxicology model to predict the mutagenic potential of drug impurities*

    International Nuclear Information System (INIS)

    Valerio, Luis G.; Cross, Kevin P.

    2012-01-01

    Control and minimization of human exposure to potential genotoxic impurities found in drug substances and products is an important part of preclinical safety assessments of new drug products. The FDA's 2008 draft guidance on genotoxic and carcinogenic impurities in drug substances and products allows use of computational quantitative structure–activity relationships (QSAR) to identify structural alerts for known and expected impurities present at levels below qualified thresholds. This study provides the information necessary to establish the practical use of a new in silico toxicology model for predicting Salmonella t. mutagenicity (Ames assay outcome) of drug impurities and other chemicals. We describe the model's chemical content and toxicity fingerprint in terms of compound space, molecular and structural toxicophores, and have rigorously tested its predictive power using both cross-validation and external validation experiments, as well as case studies. Consistent with desired regulatory use, the model performs with high sensitivity (81%) and high negative predictivity (81%) based on external validation with 2368 compounds foreign to the model and having known mutagenicity. A database of drug impurities was created from proprietary FDA submissions and the public literature which found significant overlap between the structural features of drug impurities and training set chemicals in the QSAR model. Overall, the model's predictive performance was found to be acceptable for screening drug impurities for Salmonella mutagenicity. -- Highlights: ► We characterize a new in silico model to predict mutagenicity of drug impurities. ► The model predicts Salmonella mutagenicity and will be useful for safety assessment. ► We examine toxicity fingerprints and toxicophores of this Ames assay model. ► We compare these attributes to those found in drug impurities known to FDA/CDER. ► We validate the model and find it has a desired predictive performance.

  7. QSAR, docking and ADMET studies of artemisinin derivatives for antimalarial activity targeting plasmepsin II, a hemoglobin-degrading enzyme from P. falciparum.

    Science.gov (United States)

    Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S

    2012-01-01

    This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.

  8. QSAR development and bioavailability determination: the toxicity of chloroanilines to the soil dwelling springtail Folsomia candida.

    Science.gov (United States)

    Giesen, Daniel; van Gestel, Cornelis A M

    2013-03-01

    Quantitative structure-activity relationships (QSARs) are an established tool in environmental risk assessment and a valuable alternative to the exhaustive use of test animals under REACH. In this study a QSAR was developed for the toxicity of a series of six chloroanilines to the soil-dwelling collembolan Folsomia candida in standardized natural LUFA2.2 soil. Toxicity endpoints incorporated in the QSAR were the concentrations causing 10% (EC10) and 50% (EC50) reduction in reproduction of F. candida. Toxicity was based on concentrations in interstitial water estimated from nominal concentrations in the soil and published soil-water partition coefficients. Estimated effect concentrations were negatively correlated with the lipophilicity of the compounds. Interstitial water concentrations for both the EC10 and EC50 for four compounds were determined by using solid-phase microextraction (SPME). Measured and estimated concentrations were comparable only for tetra- and pentachloroaniline. With decreasing chlorination the disparity between modelled and actual concentrations increased. Optimisation of the QSAR therefore could not be accomplished, showing the necessity to move from total soil to (bio)available concentration measurements. Copyright © 2012 Elsevier Ltd. All rights reserved.

  9. A Combined Pharmacophore Modeling, 3D QSAR and Virtual Screening Studies on Imidazopyridines as B-Raf Inhibitors

    Directory of Open Access Journals (Sweden)

    Huiding Xie

    2015-05-01

    Full Text Available B-Raf kinase is an important target in treatment of cancers. In order to design and find potent B-Raf inhibitors (BRIs, 3D pharmacophore models were created using the Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Database (GALAHAD. The best pharmacophore model obtained which was used in effective alignment of the data set contains two acceptor atoms, three donor atoms and three hydrophobes. In succession, comparative molecular field analysis (CoMFA and comparative molecular similarity indices analysis (CoMSIA were performed on 39 imidazopyridine BRIs to build three dimensional quantitative structure-activity relationship (3D QSAR models based on both pharmacophore and docking alignments. The CoMSIA model based on the pharmacophore alignment shows the best result (q2 = 0.621, r2pred = 0.885. This 3D QSAR approach provides significant insights that are useful for designing potent BRIs. In addition, the obtained best pharmacophore model was used for virtual screening against the NCI2000 database. The hit compounds were further filtered with molecular docking, and their biological activities were predicted using the CoMSIA model, and three potential BRIs with new skeletons were obtained.

  10. A Combined Pharmacophore Modeling, 3D QSAR and Virtual Screening Studies on Imidazopyridines as B-Raf Inhibitors.

    Science.gov (United States)

    Xie, Huiding; Chen, Lijun; Zhang, Jianqiang; Xie, Xiaoguang; Qiu, Kaixiong; Fu, Jijun

    2015-05-29

    B-Raf kinase is an important target in treatment of cancers. In order to design and find potent B-Raf inhibitors (BRIs), 3D pharmacophore models were created using the Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Database (GALAHAD). The best pharmacophore model obtained which was used in effective alignment of the data set contains two acceptor atoms, three donor atoms and three hydrophobes. In succession, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were performed on 39 imidazopyridine BRIs to build three dimensional quantitative structure-activity relationship (3D QSAR) models based on both pharmacophore and docking alignments. The CoMSIA model based on the pharmacophore alignment shows the best result (q(2) = 0.621, r(2)(pred) = 0.885). This 3D QSAR approach provides significant insights that are useful for designing potent BRIs. In addition, the obtained best pharmacophore model was used for virtual screening against the NCI2000 database. The hit compounds were further filtered with molecular docking, and their biological activities were predicted using the CoMSIA model, and three potential BRIs with new skeletons were obtained.

  11. HP-Lattice QSAR for dynein proteins: experimental proteomics (2D-electrophoresis, mass spectrometry) and theoretic study of a Leishmania infantum sequence.

    Science.gov (United States)

    Dea-Ayuela, María Auxiliadora; Pérez-Castillo, Yunierkis; Meneses-Marcel, Alfredo; Ubeira, Florencio M; Bolas-Fernández, Francisco; Chou, Kuo-Chen; González-Díaz, Humberto

    2008-08-15

    The toxicity and inefficacy of actual organic drugs against Leishmaniosis justify research projects to find new molecular targets in Leishmania species including Leishmania infantum (L. infantum) and Leishmaniamajor (L. major), both important pathogens. In this sense, quantitative structure-activity relationship (QSAR) methods, which are very useful in Bioorganic and Medicinal Chemistry to discover small-sized drugs, may help to identify not only new drugs but also new drug targets, if we apply them to proteins. Dyneins are important proteins of these parasites governing fundamental processes such as cilia and flagella motion, nuclear migration, organization of the mitotic splinde, and chromosome separation during mitosis. However, despite the interest for them as potential drug targets, so far there has been no report whatsoever on dyneins with QSAR techniques. To the best of our knowledge, we report here the first QSAR for dynein proteins. We used as input the Spectral Moments of a Markov matrix associated to the HP-Lattice Network of the protein sequence. The data contain 411 protein sequences of different species selected by ClustalX to develop a QSAR that correctly discriminates on average between 92.75% and 92.51% of dyneins and other proteins in four different train and cross-validation datasets. We also report a combined experimental and theoretic study of a new dynein sequence in order to illustrate the utility of the model to search for potential drug targets with a practical example. First, we carried out a 2D-electrophoresis analysis of L. infantum biological samples. Next, we excised from 2D-E gels one spot of interest belonging to an unknown protein or protein fragment in the region Mdata base with the highest similarity score to the MS of the protein isolated from L. infantum. We used the QSAR model to predict the new sequence as dynein with probability of 99.99% without relying upon alignment. In order to confirm the previous function annotation we

  12. Considerations of nano-QSAR/QSPR models for nanopesticide risk assessment within the European legislative framework.

    Science.gov (United States)

    Villaverde, Juan José; Sevilla-Morán, Beatriz; López-Goti, Carmen; Alonso-Prados, José Luis; Sandín-España, Pilar

    2018-09-01

    The European market for pesticides is currently legislated through the well-developed Regulation (EC) No. 1107/2009. This regulation promotes the competitiveness of European agriculture, recognizing the necessity of safe pesticides for human and animal health and the environment to protect crops against pests, diseases and weeds. In this sense, nanotechnology can provide a tremendous opportunity to achieve a more rational use of pesticides. However, the lack of information regarding nanopesticides and their fate and behavior in the environment and their effects on human and animal health is inhibiting rapid nanopesticide incorporation into European Union agriculture. This review analyzes the recent state of knowledge on nanopesticide risk assessment, highlighting the challenges that need to be overcame to accelerate the arrival of these new tools for plant protection to European agricultural professionals. Novel nano-Quantitative Structure-Activity/Structure-Property Relationship (nano-QSAR/QSPR) tools for risk assessment are analyzed, including modeling methods and validation procedures towards the potential of these computational instruments to meet the current requirements for authorization of nanoformulations. Future trends on these issues, of pressing importance within the context of the current European pesticide legislative framework, are also discussed. Standard protocols to make high-quality and well-described datasets for the series of related but differently sized nanoparticles/nanopesticides are required. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Prediction of drug-related cardiac adverse effects in humans--B: use of QSAR programs for early detection of drug-induced cardiac toxicities.

    Science.gov (United States)

    Frid, Anna A; Matthews, Edwin J

    2010-04-01

    This report describes the use of three quantitative structure-activity relationship (QSAR) programs to predict drug-related cardiac adverse effects (AEs), BioEpisteme, MC4PC, and Leadscope Predictive Data Miner. QSAR models were constructed for 9 cardiac AE clusters affecting Purkinje nerve fibers (arrhythmia, bradycardia, conduction disorder, electrocardiogram, palpitations, QT prolongation, rate rhythm composite, tachycardia, and Torsades de pointes) and 5 clusters affecting the heart muscle (coronary artery disorders, heart failure, myocardial disorders, myocardial infarction, and valve disorders). The models were based on a database of post-marketing AEs linked to 1632 chemical structures, and identical training data sets were configured for three QSAR programs. Model performance was optimized and shown to be affected by the ratio of the number of active to inactive drugs. Results revealed that the three programs were complementary and predictive performances using any single positive, consensus two positives, or consensus three positives were as follows, respectively: 70.7%, 91.7%, and 98.0% specificity; 74.7%, 47.2%, and 21.0% sensitivity; and 138.2, 206.3, and 144.2 chi(2). In addition, a prospective study using AE data from the U.S. Food and Drug Administration's (FDA's) MedWatch Program showed 82.4% specificity and 94.3% sensitivity. Furthermore, an external validation study of 18 drugs with serious cardiotoxicity not considered in the models had 88.9% sensitivity. Published by Elsevier Inc.

  14. Seleção de variáveis em QSAR Variable selection in QSAR

    Directory of Open Access Journals (Sweden)

    Márcia Miguel Castro Ferreira

    2002-05-01

    Full Text Available The process of building mathematical models in quantitative structure-activity relationship (QSAR studies is generally limited by the size of the dataset used to select variables from. For huge datasets, the task of selecting a given number of variables that produces the best linear model can be enormous, if not unfeasible. In this case, some methods can be used to separate good parameter combinations from the bad ones. In this paper three methodologies are analyzed: systematic search, genetic algorithm and chemometric methods. These methods have been exposed and discussed through practical examples.

  15. Exploring possible mechanisms of action for the nanotoxicity and protein binding of decorated nanotubes: interpretation of physicochemical properties from optimal QSAR models

    International Nuclear Information System (INIS)

    Esposito, Emilio Xavier; Hopfinger, Anton J.; Shao, Chi-Yu; Su, Bo-Han; Chen, Sing-Zuo; Tseng, Yufeng Jane

    2015-01-01

    Carbon nanotubes have become widely used in a variety of applications including biosensors and drug carriers. Therefore, the issue of carbon nanotube toxicity is increasingly an area of focus and concern. While previous studies have focused on the gross mechanisms of action relating to nanomaterials interacting with biological entities, this study proposes detailed mechanisms of action, relating to nanotoxicity, for a series of decorated (functionalized) carbon nanotube complexes based on previously reported QSAR models. Possible mechanisms of nanotoxicity for six endpoints (bovine serum albumin, carbonic anhydrase, chymotrypsin, hemoglobin along with cell viability and nitrogen oxide production) have been extracted from the corresponding optimized QSAR models. The molecular features relevant to each of the endpoint respective mechanism of action for the decorated nanotubes are also discussed. Based on the molecular information contained within the optimal QSAR models for each nanotoxicity endpoint, either the decorator attached to the nanotube is directly responsible for the expression of a particular activity, irrespective of the decorator's 3D-geometry and independent of the nanotube, or those decorators having structures that place the functional groups of the decorators as far as possible from the nanotube surface most strongly influence the biological activity. These molecular descriptors are further used to hypothesize specific interactions involved in the expression of each of the six biological endpoints. - Highlights: • Proposed toxicity mechanism of action for decorated nanotubes complexes • Discussion of the key molecular features for each endpoint's mechanism of action • Unique mechanisms of action for each of the six biological systems • Hypothesized mechanisms of action based on QSAR/QNAR predictive models

  16. Exploring possible mechanisms of action for the nanotoxicity and protein binding of decorated nanotubes: interpretation of physicochemical properties from optimal QSAR models

    Energy Technology Data Exchange (ETDEWEB)

    Esposito, Emilio Xavier, E-mail: emilio@exeResearch.com [exeResearch, LLC, 32 University Drive, East Lansing, MI 48823 (United States); The Chem21 Group, Inc., 1780 Wilson Drive, Lake Forest, IL 60045 (United States); Hopfinger, Anton J., E-mail: hopfingr@gmail.com [The Chem21 Group, Inc., 1780 Wilson Drive, Lake Forest, IL 60045 (United States); College of Pharmacy MSC09 5360, 1 University of New Mexico, Albuquerque, NM, 87131 (United States); Shao, Chi-Yu [Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei 106, Taiwan (China); Su, Bo-Han [Department of Computer Science and Information Engineering, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei 106, Taiwan (China); Chen, Sing-Zuo [Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei 106, Taiwan (China); Tseng, Yufeng Jane, E-mail: yjtseng@csie.ntu.edu.tw [Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei 106, Taiwan (China); Department of Computer Science and Information Engineering, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei 106, Taiwan (China)

    2015-10-01

    Carbon nanotubes have become widely used in a variety of applications including biosensors and drug carriers. Therefore, the issue of carbon nanotube toxicity is increasingly an area of focus and concern. While previous studies have focused on the gross mechanisms of action relating to nanomaterials interacting with biological entities, this study proposes detailed mechanisms of action, relating to nanotoxicity, for a series of decorated (functionalized) carbon nanotube complexes based on previously reported QSAR models. Possible mechanisms of nanotoxicity for six endpoints (bovine serum albumin, carbonic anhydrase, chymotrypsin, hemoglobin along with cell viability and nitrogen oxide production) have been extracted from the corresponding optimized QSAR models. The molecular features relevant to each of the endpoint respective mechanism of action for the decorated nanotubes are also discussed. Based on the molecular information contained within the optimal QSAR models for each nanotoxicity endpoint, either the decorator attached to the nanotube is directly responsible for the expression of a particular activity, irrespective of the decorator's 3D-geometry and independent of the nanotube, or those decorators having structures that place the functional groups of the decorators as far as possible from the nanotube surface most strongly influence the biological activity. These molecular descriptors are further used to hypothesize specific interactions involved in the expression of each of the six biological endpoints. - Highlights: • Proposed toxicity mechanism of action for decorated nanotubes complexes • Discussion of the key molecular features for each endpoint's mechanism of action • Unique mechanisms of action for each of the six biological systems • Hypothesized mechanisms of action based on QSAR/QNAR predictive models.

  17. Estimating the fates of organic contaminants in an aquifer using QSAR.

    Science.gov (United States)

    Lim, Seung Joo; Fox, Peter

    2013-01-01

    The quantitative structure activity relationship (QSAR) model, BIOWIN, was modified to more accurately estimate the fates of organic contaminants in an aquifer. The predictions from BIOWIN were modified to include oxidation and sorption effects. The predictive model therefore included the effects of sorption, biodegradation, and oxidation. A total of 35 organic compounds were used to validate the predictive model. The majority of the ratios of predicted half-life to measured half-life were within a factor of 2 and no ratio values were greater than a factor of 5. In addition, the accuracy of estimating the persistence of organic compounds in the sub-surface was superior when modified by the relative fraction adsorbed to the solid phase, 1/Rf, to that when modified by the remaining fraction of a given compound adsorbed to a solid, 1 - fs.

  18. Biochemical interpretation of quantitative structure-activity relationships (QSAR) for biodegradation of N-heterocycles: a complementary approach to predict biodegradability.

    Science.gov (United States)

    Philipp, Bodo; Hoff, Malte; Germa, Florence; Schink, Bernhard; Beimborn, Dieter; Mersch-Sundermann, Volker

    2007-02-15

    Prediction of the biodegradability of organic compounds is an ecologically desirable and economically feasible tool for estimating the environmental fate of chemicals. We combined quantitative structure-activity relationships (QSAR) with the systematic collection of biochemical knowledge to establish rules for the prediction of aerobic biodegradation of N-heterocycles. Validated biodegradation data of 194 N-heterocyclic compounds were analyzed using the MULTICASE-method which delivered two QSAR models based on 17 activating (OSAR 1) and on 16 inactivating molecular fragments (GSAR 2), which were statistically significantly linked to efficient or poor biodegradability, respectively. The percentages of correct classifications were over 99% for both models, and cross-validation resulted in 67.9% (GSAR 1) and 70.4% (OSAR 2) correct predictions. Biochemical interpretation of the activating and inactivating characteristics of the molecular fragments delivered plausible mechanistic interpretations and enabled us to establish the following biodegradation rules: (1) Target sites for amidohydrolases and for cytochrome P450 monooxygenases enhance biodegradation of nonaromatic N-heterocycles. (2) Target sites for molybdenum hydroxylases enhance biodegradation of aromatic N-heterocycles. (3) Target sites for hydratation by an urocanase-like mechanism enhance biodegradation of imidazoles. Our complementary approach represents a feasible strategy for generating concrete rules for the prediction of biodegradability of organic compounds.

  19. Relationship between soybean yield/quality and soil quality in a major soybean-producing area based on a 2D-QSAR model

    Science.gov (United States)

    Gao, Ming; Li, Shiwei

    2017-05-01

    Based on experimental data of the soybean yield and quality from 30 sampling points, a quantitative structure-activity relationship model (2D-QSAR) was established using the soil quality (elements, pH, organic matter content and cation exchange capacity) as independent variables and soybean yield or quality as the dependent variable, with SPSS software. During the modeling, the full data set (30 and 14 compounds) was divided into a training set (24 and 11 compounds) for model generation and a test set (6 and 3 compounds) for model validation. The R2 values of the resulting models and data were 0.826 and 0.808 for soybean yield and quality, respectively, and all regression coefficients were significant (P test set were 0.961 and 0.956, respectively, indicating that the models had a good predictive ability. Moreover, the Mo, Se, K, N and organic matter contents and the cation exchange capacity of soil had a positive effect on soybean production, and the B, Mo, Se, K and N contents and cation exchange coefficient had a positive effect on soybean quality. The results are instructive for enhancing soils to improve the yield and quality of soybean, and this method can also be used to study other crops or regions, providing a theoretical basis to improving the yield and quality of crops.

  20. QSAR studies of multidentate nitrogen ligands used in lanthanide and actinide extraction processes

    International Nuclear Information System (INIS)

    Drew, Michael G.B.; Hudson, Michael J.; Youngs, Tristan G.A.

    2004-01-01

    Quantitative structure activity relationships (QSARs) have been developed to optimise the choice of nitrogen heterocyclic molecules that can be used to separate the minor actinides such as americium(III) from europium(III) in the aqueous PUREX raffinate of nuclear waste. Experimental data on distribution coefficients and separation factors (SFs) for 47 such ligands have been obtained and show SF values ranging from 0.61 to 100. The ligands were divided into a training set of 36 molecules to develop the QSAR and a test set of 11 molecules to validate the QSAR. Over 1500 molecular descriptors were calculated for each heterocycle and the Genetic Algorithm was used to select the most appropriate for use in multiple regression equations. Equations were developed fitting the separation factors to 6-8 molecular descriptors which gave r 2 values of >0.8 for the training set and values of >0.7 for the test set, thus showing good predictive quality. The descriptors used in the equations were primarily electronic and steric. These equations can be used to predict the separation factors of nitrogen heterocycles not yet synthesised and/or tested and hence obtain the most efficient ligands for lanthanide and actinide separation

  1. Fragment-based quantitative structure-activity relationship (FB-QSAR) for fragment-based drug design.

    Science.gov (United States)

    Du, Qi-Shi; Huang, Ri-Bo; Wei, Yu-Tuo; Pang, Zong-Wen; Du, Li-Qin; Chou, Kuo-Chen

    2009-01-30

    In cooperation with the fragment-based design a new drug design method, the so-called "fragment-based quantitative structure-activity relationship" (FB-QSAR) is proposed. The essence of the new method is that the molecular framework in a family of drug candidates are divided into several fragments according to their substitutes being investigated. The bioactivities of molecules are correlated with the physicochemical properties of the molecular fragments through two sets of coefficients in the linear free energy equations. One coefficient set is for the physicochemical properties and the other for the weight factors of the molecular fragments. Meanwhile, an iterative double least square (IDLS) technique is developed to solve the two sets of coefficients in a training data set alternately and iteratively. The IDLS technique is a feedback procedure with machine learning ability. The standard Two-dimensional quantitative structure-activity relationship (2D-QSAR) is a special case, in the FB-QSAR, when the whole molecule is treated as one entity. The FB-QSAR approach can remarkably enhance the predictive power and provide more structural insights into rational drug design. As an example, the FB-QSAR is applied to build a predictive model of neuraminidase inhibitors for drug development against H5N1 influenza virus. (c) 2008 Wiley Periodicals, Inc.

  2. Novel chemical scaffolds of the tumor marker AKR1B10 inhibitors discovered by 3D QSAR pharmacophore modeling.

    Science.gov (United States)

    Kumar, Raj; Son, Minky; Bavi, Rohit; Lee, Yuno; Park, Chanin; Arulalapperumal, Venkatesh; Cao, Guang Ping; Kim, Hyong-ha; Suh, Jung-keun; Kim, Yong-seong; Kwon, Yong Jung; Lee, Keun Woo

    2015-08-01

    Recent evidence suggests that aldo-keto reductase family 1 B10 (AKR1B10) may be a potential diagnostic or prognostic marker of human tumors, and that AKR1B10 inhibitors offer a promising choice for treatment of many types of human cancers. The aim of this study was to identify novel chemical scaffolds of AKR1B10 inhibitors using in silico approaches. The 3D QSAR pharmacophore models were generated using HypoGen. A validated pharmacophore model was selected for virtual screening of 4 chemical databases. The best mapped compounds were assessed for their drug-like properties. The binding orientations of the resulting compounds were predicted by molecular docking. Density functional theory calculations were carried out using B3LYP. The stability of the protein-ligand complexes and the final binding modes of the hit compounds were analyzed using 10 ns molecular dynamics (MD) simulations. The best pharmacophore model (Hypo 1) showed the highest correlation coefficient (0.979), lowest total cost (102.89) and least RMSD value (0.59). Hypo 1 consisted of one hydrogen-bond acceptor, one hydrogen-bond donor, one ring aromatic and one hydrophobic feature. This model was validated by Fischer's randomization and 40 test set compounds. Virtual screening of chemical databases and the docking studies resulted in 30 representative compounds. Frontier orbital analysis confirmed that only 3 compounds had sufficiently low energy band gaps. MD simulations revealed the binding modes of the 3 hit compounds: all of them showed a large number of hydrogen bonds and hydrophobic interactions with the active site and specificity pocket residues of AKR1B10. Three compounds with new structural scaffolds have been identified, which have stronger binding affinities for AKR1B10 than known inhibitors.

  3. The discussion of descriptors for the QSAR model and molecular dynamics simulation of benzimidazole derivatives as corrosion inhibitors

    International Nuclear Information System (INIS)

    Li, Lu; Zhang, Xiuhui; Gong, Shida; Zhao, Hongxia; Bai, Yang; Li, Qianshu; Ji, Lin

    2015-01-01

    Graphical abstract: - Highlights: • Aromaticity is used as a descriptor in QSAR model to describe corrosion inhibition. • Improved calculation of I and A is correlated well with inhibition efficiencies. • Binding energies were calculated using a realistic corrosion environment. • Nonlinear QSAR model was built by support vector machine with radial basis function. • Six designed benzimidazole molecules are predicted with high inhibition efficiencies. - Abstract: The corrosion inhibition performances of 20 protonated benzimidazole derivatives were studied using theoretical methods. Nuclear Independent Chemical Shift (NICS), the measurement of aromaticity, demonstrated good correlation with inhibition efficiencies and was used as a descriptor. Binding energies were calculated on the basis of molecular dynamics simulations using a realistic corrosive environment. Some improved descriptors correlate well with experimental inhibition efficiencies. A reliable nonlinear quantitative structure–activity relationship model was constructed by a support vector machine approach. The correlation coefficient and root-mean-square error were 0.96 and 6.79%, respectively. Additionally, six new benzimidazole molecules were designed, and their inhibition efficiencies were predicted.

  4. 3D-QSAR and docking studies of flavonoids as potent Escherichia coli inhibitors

    Science.gov (United States)

    Fang, Yajing; Lu, Yulin; Zang, Xixi; Wu, Ting; Qi, XiaoJuan; Pan, Siyi; Xu, Xiaoyun

    2016-01-01

    Flavonoids are potential antibacterial agents. However, key substituents and mechanism for their antibacterial activity have not been fully investigated. The quantitative structure-activity relationship (QSAR) and molecular docking of flavonoids relating to potent anti-Escherichia coli agents were investigated. Comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were developed by using the pIC50 values of flavonoids. The cross-validated coefficient (q2) values for CoMFA (0.743) and for CoMSIA (0.708) were achieved, illustrating high predictive capabilities. Selected descriptors for the CoMFA model were ClogP (logarithm of the octanol/water partition coefficient), steric and electrostatic fields, while, ClogP, electrostatic and hydrogen bond donor fields were used for the CoMSIA model. Molecular docking results confirmed that half of the tested flavonoids inhibited DNA gyrase B (GyrB) by interacting with adenosine-triphosphate (ATP) pocket in a same orientation. Polymethoxyl flavones, flavonoid glycosides, isoflavonoids changed their orientation, resulting in a decrease of inhibitory activity. Moreover, docking results showed that 3-hydroxyl, 5-hydroxyl, 7-hydroxyl and 4-carbonyl groups were found to be crucial active substituents of flavonoids by interacting with key residues of GyrB, which were in agreement with the QSAR study results. These results provide valuable information for structure requirements of flavonoids as antibacterial agents. PMID:27049530

  5. QSAR and docking studies of anthraquinone derivatives by similarity cluster prediction.

    Science.gov (United States)

    Harsa, Alexandra M; Harsa, Teodora E; Diudea, Mircea V

    2016-01-01

    Forty anthraquinone derivatives have been downloaded from PubChem database and investigated in a quantitative structure-activity relationships (QSAR) study. The models describing log P and LD50 of this set were built up on the hypermolecule scheme that mimics the investigated receptor space; the models were validated by the leave-one-out procedure, in the external test set and in a new version of prediction by using similarity clusters. Molecular docking approach using Lamarckian Genetic Algorithm was made on this class of anthraquinones with respect to 3Q3B receptor. The best scored molecules in the docking assay were used as leaders in the similarity clustering procedure. It is demonstrated that the LD50 data of this set of anthraquinones are related to the binding energies of anthraquinone ligands to the 3Q3B receptor.

  6. Deciphering the Structural Requirements of Nucleoside Bisubstrate Analogues for Inhibition of MbtA in Mycobacterium tuberculosis: A FB-QSAR Study and Combinatorial Library Generation for Identifying Potential Hits.

    Science.gov (United States)

    Maganti, Lakshmi; Das, Sanjit Kumar; Mascarenhas, Nahren Manuel; Ghoshal, Nanda

    2011-10-01

    The re-emergence of tuberculosis infections, which are resistant to conventional drug therapy, has steadily risen in the last decade. Inhibitors of aryl acid adenylating enzyme known as MbtA, involved in siderophore biosynthesis in Mycobacterium tuberculosis, are being explored as potential antitubercular agents. The ability to identify fragments that interact with a biological target is a key step in fragment based drug design (FBDD). To expand the boundaries of quantitative structure activity relationship (QSAR) paradigm, we have proposed a Fragment Based QSAR methodology, referred here in as FB-QSAR, for deciphering the structural requirements of a series of nucleoside bisubstrate analogs for inhibition of MbtA, a key enzyme involved in siderophore biosynthetic pathway. For the development of FB-QSAR models, statistical techniques such as stepwise multiple linear regression (SMLR), genetic function approximation (GFA) and GFAspline were used. The predictive ability of the generated models was validated using different statistical metrics, and similarity-based coverage estimation was carried out to define applicability boundaries. To aid the creation of novel antituberculosis compounds, a bioisosteric database was enumerated using the combichem approach endorsed mining in a lead-like chemical space. The generated library was screened using an integrated in-silico approach and potential hits identified. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Comparison of Multiple Linear Regressions and Neural Networks based QSAR models for the design of new antitubercular compounds.

    Science.gov (United States)

    Ventura, Cristina; Latino, Diogo A R S; Martins, Filomena

    2013-01-01

    The performance of two QSAR methodologies, namely Multiple Linear Regressions (MLR) and Neural Networks (NN), towards the modeling and prediction of antitubercular activity was evaluated and compared. A data set of 173 potentially active compounds belonging to the hydrazide family and represented by 96 descriptors was analyzed. Models were built with Multiple Linear Regressions (MLR), single Feed-Forward Neural Networks (FFNNs), ensembles of FFNNs and Associative Neural Networks (AsNNs) using four different data sets and different types of descriptors. The predictive ability of the different techniques used were assessed and discussed on the basis of different validation criteria and results show in general a better performance of AsNNs in terms of learning ability and prediction of antitubercular behaviors when compared with all other methods. MLR have, however, the advantage of pinpointing the most relevant molecular characteristics responsible for the behavior of these compounds against Mycobacterium tuberculosis. The best results for the larger data set (94 compounds in training set and 18 in test set) were obtained with AsNNs using seven descriptors (R(2) of 0.874 and RMSE of 0.437 against R(2) of 0.845 and RMSE of 0.472 in MLRs, for test set). Counter-Propagation Neural Networks (CPNNs) were trained with the same data sets and descriptors. From the scrutiny of the weight levels in each CPNN and the information retrieved from MLRs, a rational design of potentially active compounds was attempted. Two new compounds were synthesized and tested against M. tuberculosis showing an activity close to that predicted by the majority of the models. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  8. Development of a QSAR model for binding of tripeptides and tripeptidomimetics to the human intestinal di-/tripeptide transporter hPEPT1

    DEFF Research Database (Denmark)

    Andersen, Rikke; Jørgensen, Flemming Steen; Olsen, Lars

    2006-01-01

    The aim of this study was to develop a three-dimensional quantitative structure-activity relationship (QSAR) model for binding of tripeptides and tripeptidomimetics to hPEPT1 based on a series of 25 diverse tripeptides....

  9. Environmental risk assessment of selected organic chemicals based on TOC test and QSAR estimation models.

    Science.gov (United States)

    Chi, Yulang; Zhang, Huanteng; Huang, Qiansheng; Lin, Yi; Ye, Guozhu; Zhu, Huimin; Dong, Sijun

    2018-02-01

    Environmental risks of organic chemicals have been greatly determined by their persistence, bioaccumulation, and toxicity (PBT) and physicochemical properties. Major regulations in different countries and regions identify chemicals according to their bioconcentration factor (BCF) and octanol-water partition coefficient (Kow), which frequently displays a substantial correlation with the sediment sorption coefficient (Koc). Half-life or degradability is crucial for the persistence evaluation of chemicals. Quantitative structure activity relationship (QSAR) estimation models are indispensable for predicting environmental fate and health effects in the absence of field- or laboratory-based data. In this study, 39 chemicals of high concern were chosen for half-life testing based on total organic carbon (TOC) degradation, and two widely accepted and highly used QSAR estimation models (i.e., EPI Suite and PBT Profiler) were adopted for environmental risk evaluation. The experimental results and estimated data, as well as the two model-based results were compared, based on the water solubility, Kow, Koc, BCF and half-life. Environmental risk assessment of the selected compounds was achieved by combining experimental data and estimation models. It was concluded that both EPI Suite and PBT Profiler were fairly accurate in measuring the physicochemical properties and degradation half-lives for water, soil, and sediment. However, the half-lives between the experimental and the estimated results were still not absolutely consistent. This suggests deficiencies of the prediction models in some ways, and the necessity to combine the experimental data and predicted results for the evaluation of environmental fate and risks of pollutants. Copyright © 2016. Published by Elsevier B.V.

  10. In Silico Exploration of 1,7-Diazacarbazole Analogs as Checkpoint Kinase 1 Inhibitors by Using 3D QSAR, Molecular Docking Study, and Molecular Dynamics Simulations

    Directory of Open Access Journals (Sweden)

    Xiaodong Gao

    2016-05-01

    Full Text Available Checkpoint kinase 1 (Chk1 is an important serine/threonine kinase with a self-protection function. The combination of Chk1 inhibitors and anti-cancer drugs can enhance the selectivity of tumor therapy. In this work, a set of 1,7-diazacarbazole analogs were identified as potent Chk1 inhibitors through a series of computer-aided drug design processes, including three-dimensional quantitative structure–activity relationship (3D-QSAR modeling, molecular docking, and molecular dynamics simulations. The optimal QSAR models showed significant cross-validated correlation q2 values (0.531, 0.726, fitted correlation r2 coefficients (higher than 0.90, and standard error of prediction (less than 0.250. These results suggested that the developed models possess good predictive ability. Moreover, molecular docking and molecular dynamics simulations were applied to highlight the important interactions between the ligand and the Chk1 receptor protein. This study shows that hydrogen bonding and electrostatic forces are key interactions that confer bioactivity.

  11. 3D-QSAR Studies on Barbituric Acid Derivatives as Urease Inhibitors and the Effect of Charges on the Quality of a Model.

    Science.gov (United States)

    Ul-Haq, Zaheer; Ashraf, Sajda; Al-Majid, Abdullah Mohammed; Barakat, Assem

    2016-04-30

    Urease enzyme (EC 3.5.1.5) has been determined as a virulence factor in pathogenic microorganisms that are accountable for the development of different diseases in humans and animals. In continuance of our earlier study on the helicobacter pylori urease inhibition by barbituric acid derivatives, 3D-QSAR (three dimensional quantitative structural activity relationship) advance studies were performed by Comparative Molecular Field Analysis (CoMFA) and Comparative Molecular Similarity Indices Analysis (CoMSIA) methods. Different partial charges were calculated to examine their consequences on the predictive ability of the developed models. The finest developed model for CoMFA and CoMSIA were achieved by using MMFF94 charges. The developed CoMFA model gives significant results with cross-validation (q²) value of 0.597 and correlation coefficients (r²) of 0.897. Moreover, five different fields i.e., steric, electrostatic, and hydrophobic, H-bond acceptor and H-bond donors were used to produce a CoMSIA model, with q² and r² of 0.602 and 0.98, respectively. The generated models were further validated by using an external test set. Both models display good predictive power with r²pred ≥ 0.8. The analysis of obtained CoMFA and CoMSIA contour maps provided detailed insight for the promising modification of the barbituric acid derivatives with an enhanced biological activity.

  12. 3D-QSAR Studies on Barbituric Acid Derivatives as Urease Inhibitors and the Effect of Charges on the Quality of a Model

    Directory of Open Access Journals (Sweden)

    Zaheer Ul-Haq

    2016-04-01

    Full Text Available Urease enzyme (EC 3.5.1.5 has been determined as a virulence factor in pathogenic microorganisms that are accountable for the development of different diseases in humans and animals. In continuance of our earlier study on the helicobacter pylori urease inhibition by barbituric acid derivatives, 3D-QSAR (three dimensional quantitative structural activity relationship advance studies were performed by Comparative Molecular Field Analysis (CoMFA and Comparative Molecular Similarity Indices Analysis (CoMSIA methods. Different partial charges were calculated to examine their consequences on the predictive ability of the developed models. The finest developed model for CoMFA and CoMSIA were achieved by using MMFF94 charges. The developed CoMFA model gives significant results with cross-validation (q2 value of 0.597 and correlation coefficients (r2 of 0.897. Moreover, five different fields i.e., steric, electrostatic, and hydrophobic, H-bond acceptor and H-bond donors were used to produce a CoMSIA model, with q2 and r2 of 0.602 and 0.98, respectively. The generated models were further validated by using an external test set. Both models display good predictive power with r2pred ≥ 0.8. The analysis of obtained CoMFA and CoMSIA contour maps provided detailed insight for the promising modification of the barbituric acid derivatives with an enhanced biological activity.

  13. Experimental and QSAR study on the surface activities of alkyl imidazoline surfactants

    Science.gov (United States)

    Kong, Xiangjun; Qian, Chengduo; Fan, Weiyu; Liang, Zupei

    2018-03-01

    15 alkyl imidazoline surfactants with different structures were synthesized and their critical micelle concentration (CMC) and surface tension under the CMC (σcmc) in aqueous solution were measured at 298 K. 54 kinds of molecular structure descriptors were selected as independent variables and the quantitative structure-activity relationship (QSAR) between surface activities of alkyl imidazoline and molecular structure were built through the genetic function approximation (GFA) method. Experimental results showed that the maximum surface excess of alkyl imidazoline molecules at the gas-liquid interface increased and the area occupied by each surfactant molecule and the free energies of micellization ΔGm decreased with increasing carbon number (NC) of the hydrophobic chain or decreasing hydrophilicity of counterions, which resulted in a CMC and σcmc decrease, while the log CMC and NC had a linear relationship and a negative correlation. The GFA-QSAR model, which was generated by a training set composed of 13 kinds of alkyl imidazoline though GFA method regression analysis, was highly correlated with predicted values and experimental values of the CMC. The correlation coefficient R was 0.9991, which means high prediction accuracy. The prediction error of 2 kinds of alkyl imidazoline CMCs in the Validation Set that quantitatively analyzed the influence of the alkyl imidazoline molecular structure on the CMC was less than 4%.

  14. Investigation of antigen-antibody interactions of sulfonamides with a monoclonal antibody in a fluorescence polarization immunoassay using 3D-QSAR models

    Science.gov (United States)

    A three-dimensional quantitative structure-activity relationship (3D-QSAR) model of sulfonamide analogs binding a monoclonal antibody (MAbSMR) produced against sulfamerazine was carried out by Distance Comparison (DISCOtech), comparative molecular field analysis (CoMFA), and comparative molecular si...

  15. Pharmacophore Modelling and 4D-QSAR Study of Ruthenium(II) Arene Complexes as Anticancer Agents (Inhibitors) by Electron Conformational- Genetic Algorithm Method.

    Science.gov (United States)

    Yavuz, Sevtap Caglar; Sabanci, Nazmiye; Saripinar, Emin

    2018-01-01

    The EC-GA method was employed in this study as a 4D-QSAR method, for the identification of the pharmacophore (Pha) of ruthenium(II) arene complex derivatives and quantitative prediction of activity. The arrangement of the computed geometric and electronic parameters for atoms and bonds of each compound occurring in a matrix is known as the electron-conformational matrix of congruity (ECMC). It contains the data from HF/3-21G level calculations. Compounds were represented by a group of conformers for each compound rather than a single conformation, known as fourth dimension to generate the model. ECMCs were compared within a certain range of tolerance values by using the EMRE program and the responsible pharmacophore group for ruthenium(II) arene complex derivatives was found. For selecting the sub-parameter which had the most effect on activity in the series and the calculation of theoretical activity values, the non-linear least square method and genetic algorithm which are included in the EMRE program were used. In addition, compounds were classified as the training and test set and the accuracy of the models was tested by cross-validation statistically. The model for training and test sets attained by the optimum 10 parameters gave highly satisfactory results with R2 training= 0.817, q 2=0.718 and SEtraining=0.066, q2 ext1 = 0.867, q2 ext2 = 0.849, q2 ext3 =0.895, ccctr = 0.895, ccctest = 0.930 and cccall = 0.905. Since there is no 4D-QSAR research on metal based organic complexes in the literature, this study is original and gives a powerful tool to the design of novel and selective ruthenium(II) arene complexes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  16. A comparative QSAR study on the estrogenic activities of persistent organic pollutants by PLS and SVM

    Directory of Open Access Journals (Sweden)

    Fei Li

    2015-11-01

    Full Text Available Quantitative structure-activity relationships (QSARs were determined using partial least square (PLS and support vector machine (SVM. The predicted values by the final QSAR models were in good agreement with the corresponding experimental values. Chemical estrogenic activities are related to atomic properties (atomic Sanderson electronegativities, van der Waals volumes and polarizabilities. Comparison of the results obtained from two models, the SVM method exhibited better overall performances. Besides, three PLS models were constructed for some specific families based on their chemical structures. These predictive models should be useful to rapidly identify potential estrogenic endocrine disrupting chemicals.

  17. QUANTITAVE STRUCTURE-ACTIVITY RELATIONSHIP ANALYSIS (QSAR OF ANTIMALARIAL 1,10-PHENANTHROLINE DERIVATIVES COMPOUNDS

    Directory of Open Access Journals (Sweden)

    Ruslin Hadanu

    2010-06-01

    Full Text Available Quantitative Electronic Structure-Activity Relationship (QSAR analysis of a series of 1,10-phenanthroline derivatives as antiplasmodial compounds have been conducted using atomic net charges (q, dipole moment (μ ELUMO, EHOMO, polarizability (α and log P as the descriptors. The descriptors were obtained from computational chemistry method using semi-empirical PM3. Antiplasmodial activities were taken as the activity of the drugs  against  chloroquine-resistant Plasmodium falciparum FCR3 strain and are presented as the value of ln (1/IC50 where IC50 is an effective concentration inhibiting 50% of the parasite growth. The best model of QSAR model was determine by multiple linear regression method and giving equation of QSAR: ln 1/IC50  =  3.732 + (5.098 qC5 + (7.051 qC7 + (36.696 qC9 + (41.467 qC11 -(135.497 qC12 + (0.332 μ -                    (0.170 α + (0.757 log P. The equation was significant on the 95% level with statistical parameters: n=16; r=0.987; r2= 0.975; SE=0.317;  Fcalc/Ftable = 15.337 and gave the PRESS=0.707. Its means that there were only a relatively few deviations between the experimental and theoretical data of antimalarial activity.   Keywords: QSAR, antimalarial, semi-empirical method, 1,10-phenanthroline.

  18. Toward the identification of a reliable 3D-QSAR model for the protein tyrosine phosphatase 1B inhibitors

    Science.gov (United States)

    Wang, Fangfang; Zhou, Bo

    2018-04-01

    Protein tyrosine phosphatase 1B (PTP1B) is an intracellular non-receptor phosphatase that is implicated in signal transduction of insulin and leptin pathways, thus PTP1B is considered as potential target for treating type II diabetes and obesity. The present article is an attempt to formulate the three-dimensional quantitative structure-activity relationship (3D-QSAR) modeling of a series of compounds possessing PTP1B inhibitory activities using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques. The optimum template ligand-based models are statistically significant with great CoMFA (R2cv = 0.600, R2pred = 0.6760) and CoMSIA (R2cv = 0.624, R2pred = 0.8068) values. Molecular docking was employed to elucidate the inhibitory mechanisms of this series of compounds against PTP1B. In addition, the CoMFA and CoMSIA field contour maps agree well with the structural characteristics of the binding pocket of PTP1B active site. The knowledge of structure-activity relationship and ligand-receptor interactions from 3D-QSAR model and molecular docking will be useful for better understanding the mechanism of ligand-receptor interaction and facilitating development of novel compounds as potent PTP1B inhibitors.

  19. The effects of characteristics of substituents on toxicity of the nitroaromatics: HiT QSAR study

    Science.gov (United States)

    Kuz'min, Victor E.; Muratov, Eugene N.; Artemenko, Anatoly G.; Gorb, Leonid; Qasim, Mohammad; Leszczynski, Jerzy

    2008-10-01

    The present study applies the Hierarchical Technology for Quantitative Structure-Activity Relationships (HiT QSAR) for (i) evaluation of the influence of the characteristics of 28 nitroaromatic compounds (some of which belong to a widely known class of explosives) as to their toxicity; (ii) prediction of toxicity for new nitroaromatic derivatives; (iii) analysis of the effects of substituents in nitroaromatic compounds on their toxicity in vivo. The 50% lethal dose concentration for rats (LD50) was used to develop the QSAR models based on simplex representation of molecular structure. The preliminary 1D QSAR results show that even the information on the composition of molecules reveals the main tendencies of changes in toxicity. The statistic characteristics for partial least squares 2D QSAR models are quite satisfactory ( R 2 = 0.96-0.98; Q 2 = 0.91-0.93; R 2 test = 0.89-0.92), which allows us to carry out the prediction of activity for 41 novel compounds designed by the application of new combinations of substituents represented in the training set. The comprehensive analysis of toxicity changes as a function of substituent position and nature was carried out. Molecular fragments that promote and interfere with toxicity were defined on the basis of the obtained models. It was shown that the mutual influence of substituents in the benzene ring plays a crucial role regarding toxicity. The influence of different substituents on toxicity can be mediated via different C-H fragments of the aromatic ring.

  20. Towards discovering dual functional inhibitors against both wild type and K103N mutant HIV-1 reverse transcriptases: molecular docking and QSAR studies on 4,1-benzoxazepinone analogues

    Science.gov (United States)

    Zhang, Zhenshan; Zheng, Mingyue; Du, Li; Shen, Jianhua; Luo, Xiaomin; Zhu, Weiliang; Jiang, Hualiang

    2006-05-01

    To find useful information for discovering dual functional inhibitors against both wild type (WT) and K103N mutant reverse transcriptases (RTs) of HIV-1, molecular docking and 3D-QSAR approaches were applied to a set of twenty-five 4,1-benzoxazepinone analogues of efavirenz (SUSTIVA®), some of them are active against the two RTs. 3D-QSAR models were constructed, based on their binding conformations determined by molecular docking, with r 2 cv values ranging from 0.656 to 0.834 for CoMFA and CoMSIA, respectively. The models were then validated to be highly predictive and extrapolative by inhibitors in two test sets with different molecular skeletons. Furthermore, CoMFA models were found to be well matched with the binding sites of both WT and K103N RTs. Finally, a reasonable pharmacophore model of 4,1-benzoxazepinones were established. The application of the model not only successfully differentiated the experimentally determined inhibitors from non-inhibitors, but also discovered two potent inhibitors from the compound database SPECS. On the basis of both the 3D-QSAR and pharmacophore models, new clues for discovering and designing potent dual functional drug leads against HIV-1 were proposed: (i) adopting positively charged aliphatic group at the cis-substituent of C3; (ii) reducing the electronic density at the position of O4; (iii) positioning a small branched aliphatic group at position of C5; (iv) using the negatively charged bulky substituents at position of C7.

  1. The applications of PCA in QSAR studies: A case study on CCR5 antagonists.

    Science.gov (United States)

    Yoo, ChangKyoo; Shahlaei, Mohsen

    2018-01-01

    Principal component analysis (PCA), as a well-known multivariate data analysis and data reduction technique, is an important and useful algebraic tool in drug design and discovery. PCA, in a typical quantitative structure-activity relationship (QSAR) study, analyzes an original data matrix in which molecules are described by several intercorrelated quantitative dependent variables (molecular descriptors). Although extensively applied, there is disparity in the literature with respect to the applications of PCA in the QSAR studies. This study investigates the different applications of PCA in QSAR studies using a dataset including CCR5 inhibitors. The different types of preprocessing are used to compare the PCA performances. The use of PC plots in the exploratory investigation of matrix of descriptors is described. This work is also proved PCA analysis to be a powerful technique for exploring complex datasets in QSAR studies for identification of outliers. This study shows that PCA is able to easily apply to the pool of calculated structural descriptors and also the extracted information can be used to help decide upon an appropriate harder model for further analysis. © 2017 John Wiley & Sons A/S.

  2. Molecular docking, MM/GBSA and 3D-QSAR studies on EGFR ...

    Indian Academy of Sciences (India)

    Abstract. Epidermal growth factor receptor (EGFR) is the first growth factor receptor proposed as a target ... Information rendered from 3D-QSAR model and sitemap analysis was used to ... skin making it a key target for anti-tumor strategy. ∗.

  3. Exploring QSARs of the interaction of flavonoids with GABA (A) receptor using MLR, ANN and SVM techniques.

    Science.gov (United States)

    Deeb, Omar; Shaik, Basheerulla; Agrawal, Vijay K

    2014-10-01

    Quantitative Structure-Activity Relationship (QSAR) models for binding affinity constants (log Ki) of 78 flavonoid ligands towards the benzodiazepine site of GABA (A) receptor complex were calculated using the machine learning methods: artificial neural network (ANN) and support vector machine (SVM) techniques. The models obtained were compared with those obtained using multiple linear regression (MLR) analysis. The descriptor selection and model building were performed with 10-fold cross-validation using the training data set. The SVM and MLR coefficient of determination values are 0.944 and 0.879, respectively, for the training set and are higher than those of ANN models. Though the SVM model shows improvement of training set fitting, the ANN model was superior to SVM and MLR in predicting the test set. Randomization test is employed to check the suitability of the models.

  4. Synthesis, biological evaluation, QSAR study and molecular docking of novel N-(4-amino carbonylpiperazinyl) (thio)phosphoramide derivatives as cholinesterase inhibitors.

    Science.gov (United States)

    Gholivand, Khodayar; Ebrahimi Valmoozi, Ali Asghar; Bonsaii, Mahyar

    2014-06-01

    Novel (thio)phosphoramidate derivatives based on piperidincarboxamide with the general formula of (NH2-C(O)-C5H9N)-P(X=O,S)R1R2 (1-5) and (NH2-C(O)-C5H9N)2-P(O)R (6-9) were synthesized and characterized by (31)P, (13)C, (1)H NMR, IR spectroscopy. Furthermore, the crystal structure of compound (NH2-C(O)-C5H9N)2-P(O)(OC6H5) (6) was investigated. The activities of derivatives on cholinesterases (ChE) were determined using a modified Ellman's method. Also the mixed-type mechanisms of these compounds were evaluated by Lineweaver-Burk plots. Molecular docking and quantitative structure-activity relationship (QSAR) were used to understand the relationship between molecular structural features and anti-ChE activity, and to predict the binding affinity of phosphoramido-piperidinecarboxamides (PAPCAs) to ChE receptors. From molecular docking analysis, noncovalent interactions especially hydrogen bonding as well as hydrophobic was found between PAPCAs and ChE. Based on the docking results, appropriate molecular structural parameters were adopted to develop a QSAR model. DFT-QSAR models for ChE enzymes demonstrated the importance of electrophilicity parameter in describing the anti-AChE and anti-BChE activities of the synthesized compounds. The correlation matrix of QSAR models and docking analysis confirmed that electrophilicity descriptor can control the influence of the hydrophobic properties of P=(O, S) and CO functional groups of PAPCA derivatives in the inhibition of human ChE enzymes. Copyright © 2014 Elsevier Inc. All rights reserved.

  5. QSAR analysis on Spodoptera litura antifeedant activities for flavone derivatives

    International Nuclear Information System (INIS)

    Duchowicz, Pablo R.; Goodarzi, Mohammad; Ocsachoque, Marco A.; Romanelli, Gustavo P.; Ortiz, Erlinda del V.; Autino, Juan C.; Bennardi, Daniel O.; Ruiz, Diego M.; Castro, Eduardo A.

    2009-01-01

    We establish useful models that relate experimentally measured biological activities of compounds to their molecular structure. The pED 50 feeding inhibition on Spodoptera litura species exhibited by aurones, chromones, 3-coumarones and flavones is analyzed in this work through the hypothesis encompassed in the Quantitative Structure-Activity Relationships (QSAR) Theory. This constitutes a first necessary computationally based step during the design of more bio-friendly repellents that could lead to insights for improving the insecticidal activities of the investigated compounds. After optimizing the molecular structure of each furane and pyrane benzoderivative with the semiempirical molecular orbitals method PM3, more than a thousand of constitutional, topological, geometrical and electronic descriptors are calculated and multiparametric linear regression models are established on the antifeedant potencies. The feature selection method employed in this study is the Replacement Method, which has proven to be successful in previous analyzes. We establish the QSAR both for the complete molecular set of compounds and also for each chemical class, so that acceptably describing the variation of the inhibitory activities from the knowledge of their structure and thus achieving useful predictive results. The main interest of developing trustful QSAR models is that these enable the prediction of compounds having no experimentally measured activities for any reason. Therefore, the structure-activity relationships are further employed for investigating the antifeedant activity on previously synthesized 2-,7-substituted benzopyranes, which do not pose any measured values on the biological expression. One of them, 2-(α-naphtyl)-4H-1-benzopyran-4-one, results in a promising structure to be experimentally analyzed as it has predicted pED 50 = 1.162.

  6. DFT/PCM, QTAIM, 1H NMR conformational studies and QSAR modeling of thirty-two anti-Leishmania amazonensis Morita-Baylis-Hillman Adducts

    Science.gov (United States)

    Filho, Edilson B. A.; Moraes, Ingrid A.; Weber, Karen C.; Rocha, Gerd B.; Vasconcellos, Mário L. A. A.

    2012-08-01

    Morita-Baylis-Hillman Adducts (MBHA) has been recently synthesized and bio-evaluated by our research group against Leishmania amazonensis, parasite that causes cutaneous and mucocutaneous leishmaniasis. We present here a theoretical conformational study of thirty-two leismanicidal MBHA by B3LYP/6-31+g(d) calculations with Polarized Continuum Model (PCM) to simulate water influence. Intramolecular Hydrogen Bonds (IHBs) indicated to control the most conformational preferences of MBHA. Quantum Theory Atoms in Molecules (QTAIM) calculations were able to characterize these interactions at Bond Critical Point level. Compounds presenting an unusual seven member IHB between NO2 group and hydroxyl moiety, supported by experimental spectroscopic data, showed a considerable improvement of biological activity (lower IC50 values). These results are in accordance to redox NO2 mechanism of action. Based on structural observations, some molecular descriptors were calculated and submitted to Quantitative Structure-Activity Relationship (QSAR) studies through the PLS Regression Method. These studies provided a model with good validation parameters values (R2 = 0.71, Q2 = 0.61 and Qext2 = 0.92).

  7. BCL::EMAS — Enantioselective Molecular Asymmetry Descriptor for 3D-QSAR

    Directory of Open Access Journals (Sweden)

    Mariusz Butkiewicz

    2012-08-01

    Full Text Available Stereochemistry is an important determinant of a molecule’s biological activity. Stereoisomers can have different degrees of efficacy or even opposing effects when interacting with a target protein. Stereochemistry is a molecular property difficult to represent in 2D-QSAR as it is an inherently three-dimensional phenomenon. A major drawback of most proposed descriptors for 3D-QSAR that encode stereochemistry is that they require a heuristic for defining all stereocenters and rank-ordering its substituents. Here we propose a novel 3D-QSAR descriptor termed Enantioselective Molecular ASymmetry (EMAS that is capable of distinguishing between enantiomers in the absence of such heuristics. The descriptor aims to measure the deviation from an overall symmetric shape of the molecule. A radial-distribution function (RDF determines a signed volume of tetrahedrons of all triplets of atoms and the molecule center. The descriptor can be enriched with atom-centric properties such as partial charge. This descriptor showed good predictability when tested with a dataset of thirty-one steroids commonly used to benchmark stereochemistry descriptors (r2 = 0.89, q2 = 0.78. Additionally, EMAS improved enrichment of 4.38 versus 3.94 without EMAS in a simulated virtual high-throughput screening (vHTS for inhibitors and substrates of cytochrome P450 (PUBCHEM AID891.

  8. A novel simple QSAR model for the prediction of anti-HIV activity using multiple linear regression analysis.

    Science.gov (United States)

    Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga

    2006-08-01

    A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.

  9. Flow network QSAR for the prediction of physicochemical properties by mapping an electrical resistance network onto a chemical reaction poset.

    Science.gov (United States)

    Ivanciuc, Ovidiu; Ivanciuc, Teodora; Klein, Douglas J

    2013-06-01

    Usual quantitative structure-activity relationship (QSAR) models are computed from unstructured input data, by using a vector of molecular descriptors for each chemical in the dataset. Another alternative is to consider the structural relationships between the chemical structures, such as molecular similarity, presence of certain substructures, or chemical transformations between compounds. We defined a class of network-QSAR models based on molecular networks induced by a sequence of substitution reactions on a chemical structure that generates a partially ordered set (or poset) oriented graph that may be used to predict various molecular properties with quantitative superstructure-activity relationships (QSSAR). The network-QSAR interpolation models defined on poset graphs, namely average poset, cluster expansion, and spline poset, were tested with success for the prediction of several physicochemical properties for diverse chemicals. We introduce the flow network QSAR, a new poset regression model in which the dataset of chemicals, represented as a reaction poset, is transformed into an oriented network of electrical resistances in which the current flow results in a potential at each node. The molecular property considered in the QSSAR model is represented as the electrical potential, and the value of this potential at a particular node is determined by the electrical resistances assigned to each edge and by a system of batteries. Each node with a known value for the molecular property is attached to a battery that sets the potential on that node to the value of the respective molecular property, and no external battery is attached to nodes from the prediction set, representing chemicals for which the values of the molecular property are not known or are intended to be predicted. The flow network QSAR algorithm determines the values of the molecular property for the prediction set of molecules by applying Ohm's law and Kirchhoff's current law to the poset

  10. 4D-QSAR: Perspectives in Drug Design

    Directory of Open Access Journals (Sweden)

    Carolina H. Andrade

    2010-05-01

    Full Text Available Drug design is a process driven by innovation and technological breakthroughs involving a combination of advanced experimental and computational methods. A broad variety of medicinal chemistry approaches can be used for the identification of hits, generation of leads, as well as to accelerate the optimization of leads into drug candidates. The quantitative structure–activity relationship (QSAR formalisms are among the most important strategies that can be applied for the successful design new molecules. This review provides a comprehensive review on the evolution and current status of 4D-QSAR, highlighting present challenges and new opportunities in drug design.

  11. A 3D QSAR pharmacophore model and quantum chemical structure--activity analysis of chloroquine(CQ)-resistance reversal.

    Science.gov (United States)

    Bhattacharjee, Apurba K; Kyle, Dennis E; Vennerstrom, Jonathan L; Milhous, Wilbur K

    2002-01-01

    Using CATALYST, a three-dimensional QSAR pharmacophore model for chloroquine(CQ)-resistance reversal was developed from a training set of 17 compounds. These included imipramine (1), desipramine (2), and 15 of their analogues (3-17), some of which fully reversed CQ-resistance, while others were without effect. The generated pharmacophore model indicates that two aromatic hydrophobic interaction sites on the tricyclic ring and a hydrogen bond acceptor (lipid) site at the side chain, preferably on a nitrogen atom, are necessary for potent activity. Stereoelectronic properties calculated by using AM1 semiempirical calculations were consistent with the model, particularly the electrostatic potential profiles characterized by a localized negative potential region by the side chain nitrogen atom and a large region covering the aromatic ring. The calculated data further revealed that aminoalkyl substitution at the N5-position of the heterocycle and a secondary or tertiary aliphatic aminoalkyl nitrogen atom with a two or three carbon bridge to the heteroaromatic nitrogen (N5) are required for potent "resistance reversal activity". Lowest energy conformers for 1-17 were determined and optimized to afford stereoelectronic properties such as molecular orbital energies, electrostatic potentials, atomic charges, proton affinities, octanol-water partition coefficients (log P), and structural parameters. For 1-17, fairly good correlation exists between resistance reversal activity and intrinsic basicity of the nitrogen atom at the tricyclic ring system, frontier orbital energies, and lipophilicity. Significantly, nine out of 11 of a group of structurally diverse CQ-resistance reversal agents mapped very well on the 3D QSAR pharmacophore model.

  12. QSAR models for oxidation of organic micropollutants in water based on ozone and hydroxyl radical rate constants and their chemical classification

    KAUST Repository

    Sudhakaran, Sairam; Amy, Gary L.

    2013-01-01

    . In this study, quantitative structure activity relationships (QSAR) models for O3 and AOP processes were developed, and rate constants, kOH and kO3, were predicted based on target compound properties. The kO3 and kOH values ranged from 5 * 10-4 to 105 M-1s-1

  13. An in silico skin absorption model for fragrance materials.

    Science.gov (United States)

    Shen, Jie; Kromidas, Lambros; Schultz, Terry; Bhatia, Sneha

    2014-12-01

    Fragrance materials are widely used in cosmetics and other consumer products. The Research Institute for Fragrance Materials (RIFM) evaluates the safety of these ingredients and skin absorption is an important parameter in refining systemic exposure. Currently, RIFM's safety assessment process assumes 100% skin absorption when experimental data are lacking. This 100% absorption default is not supportable and alternate default values were proposed. This study aims to develop and validate a practical skin absorption model (SAM) specific for fragrance material. It estimates skin absorption based on the methodology proposed by Kroes et al. SAM uses three default absorption values based on the maximum flux (J(max)) - namely, 10%, 40%, and 80%. J(max) may be calculated by using QSAR models that determine octanol/water partition coefficient (K(ow)), water solubility (S) and permeability coefficient (K(p)). Each of these QSAR models was refined and a semi-quantitative mechanistic model workflow is presented. SAM was validated with a large fragrance-focused data set containing 131 materials. All resulted in predicted values fitting the three-tiered absorption scenario based on Jmax ranges. This conservative SAM may be applied when fragrance material lack skin absorption data. Copyright © 2014 Elsevier Ltd. All rights reserved.

  14. Application of 3D-QSAR, Pharmacophore, and Molecular Docking in the Molecular Design of Diarylpyrimidine Derivatives as HIV-1 Nonnucleoside Reverse Transcriptase Inhibitors.

    Science.gov (United States)

    Liu, Genyan; Wang, Wenjie; Wan, Youlan; Ju, Xiulian; Gu, Shuangxi

    2018-05-11

    Diarylpyrimidines (DAPYs), acting as HIV-1 nonnucleoside reverse transcriptase inhibitors (NNRTIs), have been considered to be one of the most potent drug families in the fight against acquired immunodeficiency syndrome (AIDS). To better understand the structural requirements of HIV-1 NNRTIs, three-dimensional quantitative structure⁻activity relationship (3D-QSAR), pharmacophore, and molecular docking studies were performed on 52 DAPY analogues that were synthesized in our previous studies. The internal and external validation parameters indicated that the generated 3D-QSAR models, including comparative molecular field analysis (CoMFA, q 2 = 0.679, R 2 = 0.983, and r pred 2 = 0.884) and comparative molecular similarity indices analysis (CoMSIA, q 2 = 0.734, R 2 = 0.985, and r pred 2 = 0.891), exhibited good predictive abilities and significant statistical reliability. The docking results demonstrated that the phenyl ring at the C₄-position of the pyrimidine ring was better than the cycloalkanes for the activity, as the phenyl group was able to participate in π⁻π stacking interactions with the aromatic residues of the binding site, whereas the cycloalkanes were not. The pharmacophore model and 3D-QSAR contour maps provided significant insights into the key structural features of DAPYs that were responsible for the activity. On the basis of the obtained information, a series of novel DAPY analogues of HIV-1 NNRTIs with potentially higher predicted activity was designed. This work might provide useful information for guiding the rational design of potential HIV-1 NNRTI DAPYs.

  15. Autocorrelation descriptor improvements for QSAR: 2DA_Sign and 3DA_Sign

    Science.gov (United States)

    Sliwoski, Gregory; Mendenhall, Jeffrey; Meiler, Jens

    2016-03-01

    Quantitative structure-activity relationship (QSAR) is a branch of computer aided drug discovery that relates chemical structures to biological activity. Two well established and related QSAR descriptors are two- and three-dimensional autocorrelation (2DA and 3DA). These descriptors encode the relative position of atoms or atom properties by calculating the separation between atom pairs in terms of number of bonds (2DA) or Euclidean distance (3DA). The sums of all values computed for a given small molecule are collected in a histogram. Atom properties can be added with a coefficient that is the product of atom properties for each pair. This procedure can lead to information loss when signed atom properties are considered such as partial charge. For example, the product of two positive charges is indistinguishable from the product of two equivalent negative charges. In this paper, we present variations of 2DA and 3DA called 2DA_Sign and 3DA_Sign that avoid information loss by splitting unique sign pairs into individual histograms. We evaluate these variations with models trained on nine datasets spanning a range of drug target classes. Both 2DA_Sign and 3DA_Sign significantly increase model performance across all datasets when compared with traditional 2DA and 3DA. Lastly, we find that limiting 3DA_Sign to maximum atom pair distances of 6 Å instead of 12 Å further increases model performance, suggesting that conformational flexibility may hinder performance with longer 3DA descriptors. Consistent with this finding, limiting the number of bonds in 2DA_Sign from 11 to 5 fails to improve performance.

  16. QSAR, docking, dynamic simulation and quantum mechanics studies to explore the recognition properties of cholinesterase binding sites.

    Science.gov (United States)

    Correa-Basurto, J; Bello, M; Rosales-Hernández, M C; Hernández-Rodríguez, M; Nicolás-Vázquez, I; Rojo-Domínguez, A; Trujillo-Ferrara, J G; Miranda, René; Flores-Sandoval, C A

    2014-02-25

    A set of 84 known N-aryl-monosubstituted derivatives (42 amides: series 1 and 2, and 42 imides: series 3 an 4, from maleic and succinic anhydrides, respectively) that display inhibitory activity toward both acetylcholinesterase and butyrylcholinesterase (ChEs) was considered for Quantitative structure-activity relationship (QSAR) studies. These QSAR studies employed docking data from both ChEs that were previously submitted to molecular dynamics (MD) simulations. Donepezil and galanthamine stereoisomers were included to analyze their quantum mechanics properties and for validating the docking procedure. Quantum parameters such as frontier orbital energies, dipole moment, molecular volume, atomic charges, bond length and reactivity parameters were measured, as well as partition coefficients, molar refractivity and polarizability were also analyzed. In order to evaluate the obtained equations, four compounds: 1a (4-oxo-4-(phenylamino)butanoic acid), 2a ((2Z)-4-oxo-4-(phenylamino)but-2-enoic acid), 3a (2-phenylcyclopentane-1,3-dione) and 4a (2-phenylcyclopent-4-ene-1,3-dione) were employed as independent data set, using only equations with r(m(test))²>0.5. It was observed that residual values gave low value in almost all series, excepting in series 1 for compounds 3a and 4a, and in series 4 for compounds 1a, 2a and 3a, giving a low value for 4a. Consequently, equations seems to be specific according to the structure of the evaluated compound, that means, series 1 fits better for compound 1a, series 3 or 4 fits better for compounds 3a or 4a. Same behavior was observed in the butyrylcholinesterase (BChE). Therefore, obtained equations in this QSAR study could be employed to calculate the inhibition constant (Ki) value for compounds having a similar structure as N-aryl derivatives described here. The QSAR study showed that bond lengths, molecular electrostatic potential and frontier orbital energies are important in both ChE targets. Docking studies revealed that

  17. Evolution of the international workshops on quantitative structure-activity relationships (QSARs) in environmental toxicology.

    Science.gov (United States)

    Kaiser, K L E

    2007-01-01

    This presentation will review the evolution of the workshops from a scientific and personal perspective. From their modest beginning in 1983, the workshops have developed into larger international meetings, regularly held every two years. Their initial focus on the aquatic sphere soon expanded to include properties and effects on atmospheric and terrestrial species, including man. Concurrent with this broadening of their scientific scope, the workshops have become an important forum for the early dissemination of all aspects of qualitative and quantitative structure-activity research in ecotoxicology and human health effects. Over the last few decades, the field of quantitative structure/activity relationships (QSARs) has quickly emerged as a major scientific method in understanding the properties and effects of chemicals on the environment and human health. From substances that only affect cell membranes to those that bind strongly to a specific enzyme, QSARs provides insight into the biological effects and chemical and physical properties of substances. QSARs are useful for delineating the quantitative changes in biological effects resulting from minor but systematic variations of the structure of a compound with a specific mode of action. In addition, more holistic approaches are being devised that result in our ability to predict the effects of structurally unrelated compounds with (potentially) different modes of action. Research in QSAR environmental toxicology has led to many improvements in the manufacturing, use, and disposal of chemicals. Furthermore, it has led to national policies and international agreements, from use restrictions or outright bans of compounds, such as polychlorinated biphenyls (PCBs), mirex, and highly chlorinated pesticides (e.g. DDT, dieldrin) for the protection of avian predators, to alternatives for ozone-depleting compounds, to better waste treatment systems, to more powerful and specific acting drugs. Most of the recent advances

  18. Construction and analysis of a human hepatotoxicity database suitable for QSAR modeling using post-market safety data

    International Nuclear Information System (INIS)

    Zhu, Xiao; Kruhlak, Naomi L.

    2014-01-01

    Graphical abstract: - Abstract: Drug-induced liver injury (DILI) is one of the most common drug-induced adverse events (AEs) leading to life-threatening conditions such as acute liver failure. It has also been recognized as the single most common cause of safety-related post-market withdrawals or warnings. Efforts to develop new predictive methods to assess the likelihood of a drug being a hepatotoxicant have been challenging due to the complexity and idiosyncrasy of clinical manifestations of DILI. The FDA adverse event reporting system (AERS) contains post-market data that depict the morbidity of AEs. Here, we developed a scalable approach to construct a hepatotoxicity database using post-market data for the purpose of quantitative structure–activity relationship (QSAR) modeling. A set of 2029 unique and modelable drug entities with 13,555 drug-AE combinations was extracted from the AERS database using 37 hepatotoxicity-related query preferred terms (PTs). In order to determine the optimal classification scheme to partition positive from negative drugs, a manually-curated DILI calibration set composed of 105 negatives and 177 positives was developed based on the published literature. The final classification scheme combines hepatotoxicity-related PT data with supporting information that optimize the predictive performance across the calibration set. Data for other toxicological endpoints related to liver injury such as liver enzyme abnormalities, cholestasis, and bile duct disorders, were also extracted and classified. Collectively, these datasets can be used to generate a battery of QSAR models that assess a drug's potential to cause DILI

  19. QSAR models for describing the toxicological effects of ILs against Staphylococcus aureus based on norm indexes.

    Science.gov (United States)

    He, Wensi; Yan, Fangyou; Jia, Qingzhu; Xia, Shuqian; Wang, Qiang

    2018-03-01

    The hazardous potential of ionic liquids (ILs) is becoming an issue of great concern due to their important role in many industrial fields as green agents. The mathematical model for the toxicological effects of ILs is useful for the risk assessment and design of environmentally benign ILs. The objective of this work is to develop QSAR models to describe the minimal inhibitory concentration (MIC) and minimal bactericidal concentration (MBC) of ILs against Staphylococcus aureus (S. aureus). A total of 169 and 101 ILs with MICs and MBCs, respectively, are used to obtain multiple linear regression models based on matrix norm indexes. The norm indexes used in this work are proposed by our research group and they are first applied to estimate the antibacterial toxicity of these ILs against S. aureus. These two models precisely and reliably calculated the IL toxicities with a square of correlation coefficient (R 2 ) of 0.919 and a standard error of estimate (SE) of 0.341 (in log unit of mM) for pMIC, and an R 2 of 0.913 and SE of 0.282 for pMBC. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. A biology-based approach for quantitative structure-activity relationships (QSARs) in ecotoxicity.

    NARCIS (Netherlands)

    Jager, T.; Kooijman, S.A.L.M.

    2009-01-01

    Quantitative structure-activity relationships (QSARs) for ecotoxicity can be used to fill data gaps and limit toxicity testing on animals. QSAR development may additionally reveal mechanistic information based on observed patterns in the data. However, the use of descriptive summary statistics for

  1. QSAR models for prediction study of HIV protease inhibitors using support vector machines, neural networks and multiple linear regression

    Directory of Open Access Journals (Sweden)

    Rachid Darnag

    2017-02-01

    Full Text Available Support vector machines (SVM represent one of the most promising Machine Learning (ML tools that can be applied to develop a predictive quantitative structure–activity relationship (QSAR models using molecular descriptors. Multiple linear regression (MLR and artificial neural networks (ANNs were also utilized to construct quantitative linear and non linear models to compare with the results obtained by SVM. The prediction results are in good agreement with the experimental value of HIV activity; also, the results reveal the superiority of the SVM over MLR and ANN model. The contribution of each descriptor to the structure–activity relationships was evaluated.

  2. QSAR study of HCV NS5B polymerase inhibitors using the genetic algorithm-multiple linear regression (GA-MLR).

    Science.gov (United States)

    Rafiei, Hamid; Khanzadeh, Marziyeh; Mozaffari, Shahla; Bostanifar, Mohammad Hassan; Avval, Zhila Mohajeri; Aalizadeh, Reza; Pourbasheer, Eslam

    2016-01-01

    Quantitative structure-activity relationship (QSAR) study has been employed for predicting the inhibitory activities of the Hepatitis C virus (HCV) NS5B polymerase inhibitors . A data set consisted of 72 compounds was selected, and then different types of molecular descriptors were calculated. The whole data set was split into a training set (80 % of the dataset) and a test set (20 % of the dataset) using principle component analysis. The stepwise (SW) and the genetic algorithm (GA) techniques were used as variable selection tools. Multiple linear regression method was then used to linearly correlate the selected descriptors with inhibitory activities. Several validation technique including leave-one-out and leave-group-out cross-validation, Y-randomization method were used to evaluate the internal capability of the derived models. The external prediction ability of the derived models was further analyzed using modified r(2), concordance correlation coefficient values and Golbraikh and Tropsha acceptable model criteria's. Based on the derived results (GA-MLR), some new insights toward molecular structural requirements for obtaining better inhibitory activity were obtained.

  3. Combining QSAR Modeling and Text-Mining Techniques to Link Chemical Structures and Carcinogenic Modes of Action.

    Science.gov (United States)

    Papamokos, George; Silins, Ilona

    2016-01-01

    There is an increasing need for new reliable non-animal based methods to predict and test toxicity of chemicals. Quantitative structure-activity relationship (QSAR), a computer-based method linking chemical structures with biological activities, is used in predictive toxicology. In this study, we tested the approach to combine QSAR data with literature profiles of carcinogenic modes of action automatically generated by a text-mining tool. The aim was to generate data patterns to identify associations between chemical structures and biological mechanisms related to carcinogenesis. Using these two methods, individually and combined, we evaluated 96 rat carcinogens of the hematopoietic system, liver, lung, and skin. We found that skin and lung rat carcinogens were mainly mutagenic, while the group of carcinogens affecting the hematopoietic system and the liver also included a large proportion of non-mutagens. The automatic literature analysis showed that mutagenicity was a frequently reported endpoint in the literature of these carcinogens, however, less common endpoints such as immunosuppression and hormonal receptor-mediated effects were also found in connection with some of the carcinogens, results of potential importance for certain target organs. The combined approach, using QSAR and text-mining techniques, could be useful for identifying more detailed information on biological mechanisms and the relation with chemical structures. The method can be particularly useful in increasing the understanding of structure and activity relationships for non-mutagens.

  4. Combining QSAR Modeling and Text-Mining Techniques to Link Chemical Structures and Carcinogenic Modes of Action

    Science.gov (United States)

    Papamokos, George; Silins, Ilona

    2016-01-01

    There is an increasing need for new reliable non-animal based methods to predict and test toxicity of chemicals. Quantitative structure-activity relationship (QSAR), a computer-based method linking chemical structures with biological activities, is used in predictive toxicology. In this study, we tested the approach to combine QSAR data with literature profiles of carcinogenic modes of action automatically generated by a text-mining tool. The aim was to generate data patterns to identify associations between chemical structures and biological mechanisms related to carcinogenesis. Using these two methods, individually and combined, we evaluated 96 rat carcinogens of the hematopoietic system, liver, lung, and skin. We found that skin and lung rat carcinogens were mainly mutagenic, while the group of carcinogens affecting the hematopoietic system and the liver also included a large proportion of non-mutagens. The automatic literature analysis showed that mutagenicity was a frequently reported endpoint in the literature of these carcinogens, however, less common endpoints such as immunosuppression and hormonal receptor-mediated effects were also found in connection with some of the carcinogens, results of potential importance for certain target organs. The combined approach, using QSAR and text-mining techniques, could be useful for identifying more detailed information on biological mechanisms and the relation with chemical structures. The method can be particularly useful in increasing the understanding of structure and activity relationships for non-mutagens. PMID:27625608

  5. Identification of cytochrome P450 2D6 and 2C9 substrates and inhibitors by QSAR analysis

    DEFF Research Database (Denmark)

    Jónsdóttir, Svava Ósk; Ringsted, Tine; Nikolov, Nikolai G.

    2012-01-01

    This paper presents four new QSAR models for CYP2C9 and CYP2D6 substrate recognition and inhibitor identification based on human clinical data. The models were used to screen a large data set of environmental chemicals for CYP activity, and to analyze the frequency of CYP activity among these com......This paper presents four new QSAR models for CYP2C9 and CYP2D6 substrate recognition and inhibitor identification based on human clinical data. The models were used to screen a large data set of environmental chemicals for CYP activity, and to analyze the frequency of CYP activity among...... these compounds. A large fraction of these chemicals were found to be CYP active, and thus potentially capable of affecting human physiology. 20% of the compounds within applicability domain of the models were predicted to be CYP2C9 substrates, and 17% to be inhibitors. The corresponding numbers for CYP2D6 were 9...... of specific CYP activity. An overrepresentation was seen for poly-aromatic hydrocarbons (group of procarcinogens) among CYP2C9 active and mutagenic compounds compared to CYP2C9 inactive and mutagenic compounds. The mutagenicity was predicted with a QSAR model based on Ames in vitro test data....

  6. A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities.

    Science.gov (United States)

    Valizade Hasanloei, Mohammad Amin; Sheikhpour, Razieh; Sarram, Mehdi Agha; Sheikhpour, Elnaz; Sharifi, Hamdollah

    2018-02-01

    Quantitative structure-activity relationship (QSAR) is an effective computational technique for drug design that relates the chemical structures of compounds to their biological activities. Feature selection is an important step in QSAR based drug design to select the most relevant descriptors. One of the most popular feature selection methods for classification problems is Fisher score which aim is to minimize the within-class distance and maximize the between-class distance. In this study, the properties of Fisher criterion were extended for QSAR models to define the new distance metrics based on the continuous activity values of compounds with known activities. Then, a semi-supervised feature selection method was proposed based on the combination of Fisher and Laplacian criteria which exploits both compounds with known and unknown activities to select the relevant descriptors. To demonstrate the efficiency of the proposed semi-supervised feature selection method in selecting the relevant descriptors, we applied the method and other feature selection methods on three QSAR data sets such as serine/threonine-protein kinase PLK3 inhibitors, ROCK inhibitors and phenol compounds. The results demonstrated that the QSAR models built on the selected descriptors by the proposed semi-supervised method have better performance than other models. This indicates the efficiency of the proposed method in selecting the relevant descriptors using the compounds with known and unknown activities. The results of this study showed that the compounds with known and unknown activities can be helpful to improve the performance of the combined Fisher and Laplacian based feature selection methods.

  7. Exploring 2D and 3D QSARs of benzimidazole derivatives as transient receptor potential melastatin 8 (TRPM8 antagonists using MLR and kNN-MFA methodology

    Directory of Open Access Journals (Sweden)

    Kamlendra Singh Bhadoriya

    2016-09-01

    Full Text Available TRPM8 is now best known as a cold- and menthol-activated channel implicated in thermosensation. TRPM8 is specifically expressed in a subset of pain- and temperature-sensing neuron. TRPM8 plays a major role in the sensation of cold and cooling substances. TRPM8 is a potential new target for the treatment of painful conditions. Thus, TRPM8 antagonists represent a new, novel and potentially useful treatment strategy to treat various disease states such as urological disorders, asthma, COPD, prostate and colon cancers, and painful conditions related to cold, such as cold allodynia and cold hyperalgesia. Better tools such as potent and specific TRPM8 antagonists are mandatory as high unmet medical need for such progress. To achieve this objective quantitative structure–activity relationship (QSAR studies were carried out on a series of 25 benzimidazole-containing TRPM8 antagonists to investigate the structural requirements of their inhibitory activity against cTRPM8. The statistically significant best 2D-QSAR model having correlation coefficient r2 = 0.88 and cross-validated squared correlation coefficient q2 = 0.64 with external predictive ability of pred_r2 = 0.69 was developed by SW-MLR. The physico-chemical descriptors such as polarizabilityAHP, kappa2, XcompDipole, +vePotentialSurfaceArea, XKMostHydrophilic were found to show a significant correlation with biological activity in benzimidazole derivatives. Molecular field analysis was used to construct the best 3D-QSAR model using SW-kNN method, showing good correlative and predictive capabilities in terms of q2 = 0.81 and pred_r2 = 0.55. Developed kNN-MFA model highlighted the importance of shape of the molecules, i.e., steric & electrostatic descriptors at the grid points S_774 & E_1024 for TRPM8 receptor binding. These models (2D & 3D were found to yield reliable clues for further optimization of benzimidazole derivatives in the data set. The information rendered by 2D- and 3D-QSAR

  8. 3D-QSAR studies on CCR2B receptor antagonists: Insight into the structural requirements of (R-3-aminopyrrolidine series of molecules based on CoMFA/CoMSIA models

    Directory of Open Access Journals (Sweden)

    Swetha Gade

    2012-01-01

    Full Text Available Objective: Monocyte chemo attractant protein-1 (MCP-1 is a member of the CC-chemokine family and it selectively recruits leukocytes from the circulation to the site of inflammation through binding with the chemotactic cytokine receptor 2B (CCR2B. The recruitment and activation of selected populations of leukocytes is a key feature in a variety of inflammatory conditions. Thus MCP-1 receptor antagonist represents an attractive target for drug discovery. To understand the structural requirements that will lead to enhanced inhibitory potencies, we have carried out 3D-QSAR (quantitative structure-activity relationship studies on (R-3-aminopyrrolidine series of molecules as CCR2B receptor antagonists. Materials and Methods: Comparative molecular field analysis (CoMFA and comparative molecular similarity indices analysis (CoMSIA were performed on a series of (R-3-aminopyrrolidine derivatives as antagonists of CCR2B receptor with Sybyl 6.7v. Results: We have derived statistically significant model from 37 molecules and validated it against an external test set of 13 compounds. The CoMFA model yielded a leave one out r 2 (r 2 loo of 0.847, non-cross-validated r 2 (r 2 ncv of 0.977, F value of 267.930, and bootstrapped r 2 (r 2 bs of 0.988. We have derived the standard error of prediction value of 0.367, standard error of estimate 0.141, and a reliable external predictivity, with a predictive r 2 (r 2 pred of 0.673. While the CoMSIA model yielded an r 2 loo of 0.719, r 2 ncv of 0.964,F value of 135.666, r 2 bs of 0.975, standard error of prediction of 0.512, standard error of estimate of 0.180, and an external predictivity with an r 2 pred of 0.611. These validation tests not only revealed the robustness of the models but also demonstrated that for our models r 2 pred, based on the mean activity of test set compounds can accurately estimate external predictivity. Conclusion: The QSAR model gave satisfactory statistical results in terms of q 2 and r 2

  9. Evaluation on joint toxicity of chlorinated anilines and cadmium to Photobacterium phosphoreum and QSAR analysis

    Energy Technology Data Exchange (ETDEWEB)

    Jin, Hao, E-mail: realking163@163.com [School of Life and Chemistry, Jiangsu Second Normal University, Nanjing, Jiangsu 210013 (China); Wang, Chao; Shi, Jiaqi [State Key Laboratory of Pollution Control and Resources Reuse, School of Environment, Nanjing University, Nanjing, Jiangsu 210023 (China); Chen, Lei [School of Life and Chemistry, Jiangsu Second Normal University, Nanjing, Jiangsu 210013 (China)

    2014-08-30

    Highlights: • Cd has different effects on joint toxicity when in different concentrations. • The toxicity of most binary mixtures decreases when Cd concentration rises. • Different QSAR models are developed to predict the joint toxicity. • Descriptors in QSARs can help to elucidate the joint toxicity mechanism. • Van der Waals’ force or complexation may reduce the toxicity of mixtures. - Abstract: The individual IC{sub 50} (the concentrations causing a 50% inhibition of bioluminescence after 15 min exposure) of cadmium ion (Cd) and nine chlorinated anilines to Photobacterium phosphoreum (P. phosphoreum) were determined. In order to evaluate the combined effects of the nine chlorinated anilines and Cd, the toxicities of chlorinated anilines combined with different concentrations of Cd were determined, respectively. The results showed that the number of chlorinated anilines manifesting synergy with Cd decreased with the increasing Cd concentration, and the number manifesting antagonism decreased firstly and then increased. The joint toxicity of mixtures at low Cd concentration was weaker than that of most binary mixtures when combined with Cd at medium and high concentrations as indicated by TU{sub Total}. QSAR analysis showed that the single toxicity of chlorinated anilines was related to the energy of the lowest unoccupied molecular orbital (E{sub LUMO}). When combined with different concentrations of Cd, the toxicity was related to the energy difference (E{sub HOMO} − E{sub LUMO}) with different coefficients. Van der Waals’ force or the complexation between chlorinated anilines and Cd had an impact on the toxicity of combined systems, which could account for QSAR models with different physico-chemical descriptors.

  10. QSAR screening of 70,983 REACH substances for genotoxic carcinogenicity, mutagenicity and developmental toxicity in the ChemScreen project

    DEFF Research Database (Denmark)

    Wedebye, Eva Bay; Dybdahl, Marianne; Nikolov, Nikolai Georgiev

    2015-01-01

    The ChemScreen project aimed to develop a screening system for reproductive toxicity based on alternative methods. QSARs can, if adequate, contribute to the evaluation of chemical substances under REACH and may in some cases be applied instead of experimental testing to fill data gaps...... for information requirements. As no testing for reproductive effects should be performed in REACH on known genotoxic carcinogens or germ cell mutagens with appropriate risk management measures implemented, a QSAR pre-screen for 70,983 REACH substances was performed. Sixteen models and three decision algorithms...... were used to reach overall predictions of substances with potential effects with the following result: 6.5% genotoxic carcinogens, 16.3% mutagens, 11.5% developmental toxicants. These results are similar to findings in earlier QSAR and experimental studies of chemical inventories, and illustrate how...

  11. Evaluation and comparison of benchmark QSAR models to predict a relevant REACH endpoint: The bioconcentration factor (BCF)

    Energy Technology Data Exchange (ETDEWEB)

    Gissi, Andrea [Laboratory of Environmental Chemistry and Toxicology, IRCCS – Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156 Milano (Italy); Dipartimento di Farmacia – Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Via E. Orabona 4, 70125 Bari (Italy); Lombardo, Anna; Roncaglioni, Alessandra [Laboratory of Environmental Chemistry and Toxicology, IRCCS – Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156 Milano (Italy); Gadaleta, Domenico [Laboratory of Environmental Chemistry and Toxicology, IRCCS – Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156 Milano (Italy); Dipartimento di Farmacia – Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Via E. Orabona 4, 70125 Bari (Italy); Mangiatordi, Giuseppe Felice; Nicolotti, Orazio [Dipartimento di Farmacia – Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Via E. Orabona 4, 70125 Bari (Italy); Benfenati, Emilio, E-mail: emilio.benfenati@marionegri.it [Laboratory of Environmental Chemistry and Toxicology, IRCCS – Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156 Milano (Italy)

    2015-02-15

    The bioconcentration factor (BCF) is an important bioaccumulation hazard assessment metric in many regulatory contexts. Its assessment is required by the REACH regulation (Registration, Evaluation, Authorization and Restriction of Chemicals) and by CLP (Classification, Labeling and Packaging). We challenged nine well-known and widely used BCF QSAR models against 851 compounds stored in an ad-hoc created database. The goodness of the regression analysis was assessed by considering the determination coefficient (R{sup 2}) and the Root Mean Square Error (RMSE); Cooper's statistics and Matthew's Correlation Coefficient (MCC) were calculated for all the thresholds relevant for regulatory purposes (i.e. 100 L/kg for Chemical Safety Assessment; 500 L/kg for Classification and Labeling; 2000 and 5000 L/kg for Persistent, Bioaccumulative and Toxic (PBT) and very Persistent, very Bioaccumulative (vPvB) assessment) to assess the classification, with particular attention to the models' ability to control the occurrence of false negatives. As a first step, statistical analysis was performed for the predictions of the entire dataset; R{sup 2}>0.70 was obtained using CORAL, T.E.S.T. and EPISuite Arnot–Gobas models. As classifiers, ACD and log P-based equations were the best in terms of sensitivity, ranging from 0.75 to 0.94. External compound predictions were carried out for the models that had their own training sets. CORAL model returned the best performance (R{sup 2}{sub ext}=0.59), followed by the EPISuite Meylan model (R{sup 2}{sub ext}=0.58). The latter gave also the highest sensitivity on external compounds with values from 0.55 to 0.85, depending on the thresholds. Statistics were also compiled for compounds falling into the models Applicability Domain (AD), giving better performances. In this respect, VEGA CAESAR was the best model in terms of regression (R{sup 2}=0.94) and classification (average sensitivity>0.80). This model also showed the best

  12. Evaluation and comparison of benchmark QSAR models to predict a relevant REACH endpoint: The bioconcentration factor (BCF)

    International Nuclear Information System (INIS)

    Gissi, Andrea; Lombardo, Anna; Roncaglioni, Alessandra; Gadaleta, Domenico; Mangiatordi, Giuseppe Felice; Nicolotti, Orazio; Benfenati, Emilio

    2015-01-01

    The bioconcentration factor (BCF) is an important bioaccumulation hazard assessment metric in many regulatory contexts. Its assessment is required by the REACH regulation (Registration, Evaluation, Authorization and Restriction of Chemicals) and by CLP (Classification, Labeling and Packaging). We challenged nine well-known and widely used BCF QSAR models against 851 compounds stored in an ad-hoc created database. The goodness of the regression analysis was assessed by considering the determination coefficient (R 2 ) and the Root Mean Square Error (RMSE); Cooper's statistics and Matthew's Correlation Coefficient (MCC) were calculated for all the thresholds relevant for regulatory purposes (i.e. 100 L/kg for Chemical Safety Assessment; 500 L/kg for Classification and Labeling; 2000 and 5000 L/kg for Persistent, Bioaccumulative and Toxic (PBT) and very Persistent, very Bioaccumulative (vPvB) assessment) to assess the classification, with particular attention to the models' ability to control the occurrence of false negatives. As a first step, statistical analysis was performed for the predictions of the entire dataset; R 2 >0.70 was obtained using CORAL, T.E.S.T. and EPISuite Arnot–Gobas models. As classifiers, ACD and log P-based equations were the best in terms of sensitivity, ranging from 0.75 to 0.94. External compound predictions were carried out for the models that had their own training sets. CORAL model returned the best performance (R 2 ext =0.59), followed by the EPISuite Meylan model (R 2 ext =0.58). The latter gave also the highest sensitivity on external compounds with values from 0.55 to 0.85, depending on the thresholds. Statistics were also compiled for compounds falling into the models Applicability Domain (AD), giving better performances. In this respect, VEGA CAESAR was the best model in terms of regression (R 2 =0.94) and classification (average sensitivity>0.80). This model also showed the best regression (R 2 =0.85) and

  13. QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP ANALYSIS (QSAR OF VINCADIFFORMINE ANALOGUES AS THE ANTIPLASMODIAL COMPOUNDS OF THE CHLOROQUINOSENSIBLE STRAIN

    Directory of Open Access Journals (Sweden)

    Iqmal Tahir

    2010-06-01

    Full Text Available Quantitative Structure-Activity Relationship (QSAR analysis of vincadifformine analogs as an antimalarial drug has been conducted using atomic net charges (q, moment dipole (, LUMO (Lowest Unoccupied Molecular Orbital and HOMO (Highest Occupied Molecular Orbital energies, molecular mass (m as well as surface area (A as the predictors to their activity. Data of predictors are obtained from computational chemistry method using semi-empirical molecular orbital AM1 calculation. Antimalarial activities were taken as the activity of the drugs against chloroquine-sensitive Plasmodium falciparum (Nigerian Cell strain and were presented as the value of ln(1/IC50 where IC50 is an effective concentration inhibiting 50% of the parasite growth. The best QSAR model has been determined by multiple linier regression analysis giving QSAR equation: Log (1/IC50 = 9.602.qC1 -17.012.qC2 +6.084.qC3 -19.758.qC5 -6.517.qC6 +2.746.qC7 -6.795.qN +6.59.qC8 -0.190. -0.974.ELUMO +0.515.EHOMO -0.274. +0.029.A -1.673 (n = 16; r = 0.995; SD = 0.099; F = 2.682   Keywords: QSAR analysis, antimalaria, vincadifformine.

  14. Novel 3-Amino-6-chloro-7-(azol-2 or 5-yl-1,1-dioxo-1,4,2-benzodithiazine Derivatives with Anticancer Activity: Synthesis and QSAR Study

    Directory of Open Access Journals (Sweden)

    Aneta Pogorzelska

    2015-12-01

    Full Text Available A series of new 3-amino-6-chloro-7-(azol-2 or 5-yl-1,1-dioxo-1,4,2-benzodithiazine derivatives 5a–j have been synthesized and evaluated in vitro for their antiproliferative activity at the U.S. National Cancer Institute. The most active compound 5h showed significant cytotoxic effects against ovarian (OVCAR-3 and breast (MDA-MB-468 cancer (10% and 47% cancer cell death, respectively as well as a good selectivity toward prostate (DU-145, colon (SW-620 and renal (TK-10 cancer cell lines. To obtain a deeper insight into the structure-activity relationships of the new compounds 5a–j QSAR studies have been applied. Theoretical calculations allowed the identification of molecular descriptors belonging to the RDF (RDF055p and RDF145m in the MOLT-4 and UO-31 QSAR models, respectively and 3D-MorSE (Mor32m and Mor16e for MOLT-4 and UO-31 QSAR models descriptor classes. Based on these data, QSAR models with good robustness and predictive ability have been obtained.

  15. A combined QSAR and partial order ranking approach to risk assessment.

    Science.gov (United States)

    Carlsen, L

    2006-04-01

    QSAR generated data appear as an attractive alternative to experimental data as foreseen in the proposed new chemicals legislation REACH. A preliminary risk assessment for the aquatic environment can be based on few factors, i.e. the octanol-water partition coefficient (Kow), the vapour pressure (VP) and the potential biodegradability of the compound in combination with the predicted no-effect concentration (PNEC) and the actual tonnage in which the substance is produced. Application of partial order ranking, allowing simultaneous inclusion of several parameters leads to a mutual prioritisation of the investigated substances, the prioritisation possibly being further analysed through the concept of linear extensions and average ranks. The ranking uses endpoint values (log Kow and log VP) derived from strictly linear 'noise-deficient' QSAR models as input parameters. Biodegradation estimates were adopted from the BioWin module of the EPI Suite. The population growth impairment of Tetrahymena pyriformis was used as a surrogate for fish lethality.

  16. INTEGRATION OF QSAR AND SAR METHODS FOR THE MECHANISTIC INTERPRETATION OF PREDICTIVE MODELS FOR CARCINOGENICITY

    Directory of Open Access Journals (Sweden)

    Natalja Fjodorova

    2012-07-01

    Full Text Available The knowledge-based Toxtree expert system (SAR approach was integrated with the statistically based counter propagation artificial neural network (CP ANN model (QSAR approach to contribute to a better mechanistic understanding of a carcinogenicity model for non-congeneric chemicals using Dragon descriptors and carcinogenic potency for rats as a response. The transparency of the CP ANN algorithm was demonstrated using intrinsic mapping technique specifically Kohonen maps. Chemical structures were represented by Dragon descriptors that express the structural and electronic features of molecules such as their shape and electronic surrounding related to reactivity of molecules. It was illustrated how the descriptors are correlated with particular structural alerts (SAs for carcinogenicity with recognized mechanistic link to carcinogenic activity. Moreover, the Kohonen mapping technique enables one to examine the separation of carcinogens and non-carcinogens (for rats within a family of chemicals with a particular SA for carcinogenicity. The mechanistic interpretation of models is important for the evaluation of safety of chemicals.

  17. Quantitative Structure--Activity Relationship (QSAR) for the Oxidation of Trace Organic Contaminants by Sulfate Radical.

    Science.gov (United States)

    Xiao, Ruiyang; Ye, Tiantian; Wei, Zongsu; Luo, Shuang; Yang, Zhihui; Spinney, Richard

    2015-11-17

    The sulfate radical anion (SO4•–) based oxidation of trace organic contaminants (TrOCs) has recently received great attention due to its high reactivity and low selectivity. In this study, a meta-analysis was conducted to better understand the role of functional groups on the reactivity between SO4•– and TrOCs. The results indicate that compounds in which electron transfer and addition channels dominate tend to exhibit a faster second-order rate constants (kSO4•–) than that of H–atom abstraction, corroborating the SO4•– reactivity and mechanisms observed in the individual studies. Then, a quantitative structure activity relationship (QSAR) model was developed using a sequential approach with constitutional, geometrical, electrostatic, and quantum chemical descriptors. Two descriptors, ELUMO and EHOMO energy gap (ELUMO–EHOMO) and the ratio of oxygen atoms to carbon atoms (#O:C), were found to mechanistically and statistically affect kSO4•– to a great extent with the standardized QSAR model: ln kSO4•– = 26.8–3.97 × #O:C – 0.746 × (ELUMO–EHOMO). In addition, the correlation analysis indicates that there is no dominant reaction channel for SO4•– reactions with various structurally diverse compounds. Our QSAR model provides a robust predictive tool for estimating emerging micropollutants removal using SO4•– during wastewater treatment processes.

  18. The Three Dimensional Quantitative Structure Activity Relationships (3D-QSAR and Docking Studies of Curcumin Derivatives as Androgen Receptor Antagonists

    Directory of Open Access Journals (Sweden)

    Jing Yang

    2012-05-01

    Full Text Available Androgen receptor antagonists have been proved to be effective anti-prostate cancer agents. 3D-QSAR and Molecular docking methods were performed on curcumin derivatives as androgen receptor antagonists. The bioactive conformation was explored by docking the potent compound 29 into the binding site of AR. The constructed Comparative Molecular Field Analysis (CoMFA and Comparative Similarity Indices Analysis (CoMSIA models produced statistically significant results with the cross-validated correlation coefficients q2 of 0.658 and 0.567, non-cross-validated correlation coefficients r2 of 0.988 and 0.978, and predicted correction coefficients r2pred of 0.715 and 0.793, respectively. These results ensure the CoMFA and CoMSIA models as a tool to guide the design of novel potent AR antagonists. A set of 30 new analogs were proposed by utilizing the results revealed in the present study, and were predicted with potential activities in the developed models.

  19. 3D-QSAR comparative molecular field analysis on opioid receptor antagonists: pooling data from different studies.

    Science.gov (United States)

    Peng, Youyi; Keenan, Susan M; Zhang, Qiang; Kholodovych, Vladyslav; Welsh, William J

    2005-03-10

    Three-dimensional quantitative structure-activity relationship (3D-QSAR) models were constructed using comparative molecular field analysis (CoMFA) on a series of opioid receptor antagonists. To obtain statistically significant and robust CoMFA models, a sizable data set of naltrindole and naltrexone analogues was assembled by pooling biological and structural data from independent studies. A process of "leave one data set out", similar to the traditional "leave one out" cross-validation procedure employed in partial least squares (PLS) analysis, was utilized to study the feasibility of pooling data in the present case. These studies indicate that our approach yields statistically significant and highly predictive CoMFA models from the pooled data set of delta, mu, and kappa opioid receptor antagonists. All models showed excellent internal predictability and self-consistency: q(2) = 0.69/r(2) = 0.91 (delta), q(2) = 0.67/r(2) = 0.92 (mu), and q(2) = 0.60/r(2) = 0.96 (kappa). The CoMFA models were further validated using two separate test sets: one test set was selected randomly from the pooled data set, while the other test set was retrieved from other published sources. The overall excellent agreement between CoMFA-predicted and experimental binding affinities for a structurally diverse array of ligands across all three opioid receptor subtypes gives testimony to the superb predictive power of these models. CoMFA field analysis demonstrated that the variations in binding affinity of opioid antagonists are dominated by steric rather than electrostatic interactions with the three opioid receptor binding sites. The CoMFA steric-electrostatic contour maps corresponding to the delta, mu, and kappa opioid receptor subtypes reflected the characteristic similarities and differences in the familiar "message-address" concept of opioid receptor ligands. Structural modifications to increase selectivity for the delta over mu and kappa opioid receptors have been predicted on the

  20. Molecular docking, QSAR and ADMET based mining of natural compounds against prime targets of HIV.

    Science.gov (United States)

    Vora, Jaykant; Patel, Shivani; Sinha, Sonam; Sharma, Sonal; Srivastava, Anshu; Chhabria, Mahesh; Shrivastava, Neeta

    2018-01-07

    AIDS is one of the multifaceted diseases and this underlying complexity hampers its complete cure. The toxicity of existing drugs and emergence of multidrug-resistant virus makes the treatment worse. Development of effective, safe and low-cost anti-HIV drugs is among the top global priority. Exploration of natural resources may give ray of hope to develop new anti-HIV leads. Among the various therapeutic targets for HIV treatment, reverse transcriptase, protease, integrase, GP120, and ribonuclease are the prime focus. In the present study, we predicted potential plant-derived natural molecules for HIV treatment using computational approach, i.e. molecular docking, quantitative structure activity relationship (QSAR), and ADMET studies. Receptor-ligand binding studies were performed using three different software for precise prediction - Discovery studio 4.0, Schrodinger and Molegrow virtual docker. Docking scores revealed that Mulberrosides, Anolignans, Curcumin and Chebulic acid are promising candidates that bind with multi targets of HIV, while Neo-andrographolide, Nimbolide and Punigluconin were target-specific candidates. Subsequently, QSAR was performed using biologically proved compounds which predicted the biological activity of compounds. We identified Anolignans, Curcumin, Mulberrosides, Chebulic acid and Neo-andrographolide as potential natural molecules for HIV treatment from results of molecular docking and 3D-QSAR. In silico ADMET studies showed drug-likeness of these lead molecules. Structure similarities of identified lead molecules were compared with identified marketed drugs by superimposing both the molecules. Using in silico studies, we have identified few best fit molecules of natural origin against identified targets which may give new drugs to combat HIV infection after wet lab validation.

  1. Development of predictive pharmacophore model for in silico screening, and 3D QSAR CoMFA and CoMSIA studies for lead optimization, for designing of potent tumor necrosis factor alpha converting enzyme inhibitors

    Science.gov (United States)

    Murumkar, Prashant Revan; Zambre, Vishal Prakash; Yadav, Mange Ram

    2010-02-01

    A chemical feature-based pharmacophore model was developed for Tumor Necrosis Factor-α converting enzyme (TACE) inhibitors. A five point pharmacophore model having two hydrogen bond acceptors (A), one hydrogen bond donor (D) and two aromatic rings (R) with discrete geometries as pharmacophoric features was developed. The pharmacophore model so generated was then utilized for in silico screening of a database. The pharmacophore model so developed was validated by using four compounds having proven TACE inhibitory activity which were grafted into the database. These compounds mapped well onto the five listed pharmacophoric features. This validated pharmacophore model was also used for alignment of molecules in CoMFA and CoMSIA analysis. The contour maps of the CoMFA/CoMSIA models were utilized to provide structural insight for activity improvement of potential novel TACE inhibitors. The pharmacophore model so developed could be used for in silico screening of any commercial/in house database for identification of TACE inhibiting lead compounds, and the leads so identified could be optimized using the developed CoMSIA model. The present work highlights the tremendous potential of the two mutually complementary ligand-based drug designing techniques (i.e. pharmacophore mapping and 3D-QSAR analysis) using TACE inhibitors as prototype biologically active molecules.

  2. Ligand-based modeling of Akt3 lead to potent dual Akt1/Akt3 inhibitor.

    Science.gov (United States)

    Al-Sha'er, Mahmoud A; Taha, Mutasem O

    2018-02-13

    Akt1 and Akt3 are important serine/threonine-specific protein kinases involved in G2 phase required by cancer cells to maintain cell cycle and to prevent cell death. Accordingly, inhibitors of these kinases should have potent anti-cancer properties. This prompted us to use pharmacophore/QSAR modeling to identify optimal binding models and physicochemical descriptors that explain bioactivity variation within a set of 74 diverse Akt3 inhibitors. Two successful orthogonal pharmacophores were identified and further validated using receiver operating characteristic (ROC) curve analyses. The pharmacophoric models and associated QSAR equation were applied to screen the national cancer institute (NCI) list of compounds for new Akt3 inhibitors. Six hits showed significant experimental anti-Akt3 IC 50 values, out of which one compound exhibited dual low micromolar anti-Akt1 and anti-Akt3 inhibitory profiles. Copyright © 2018 Elsevier Inc. All rights reserved.

  3. (Q)SAR tools for priority setting: A case study with printed paper and board food contact material substances.

    Science.gov (United States)

    Van Bossuyt, Melissa; Van Hoeck, Els; Raitano, Giuseppa; Manganelli, Serena; Braeken, Els; Ates, Gamze; Vanhaecke, Tamara; Van Miert, Sabine; Benfenati, Emilio; Mertens, Birgit; Rogiers, Vera

    2017-04-01

    Over the last years, more stringent safety requirements for an increasing number of chemicals across many regulatory fields (e.g. industrial chemicals, pharmaceuticals, food, cosmetics, …) have triggered the need for an efficient screening strategy to prioritize the substances of highest concern. In this context, alternative methods such as in silico (i.e. computational) techniques gain more and more importance. In the current study, a new prioritization strategy for identifying potentially mutagenic substances was developed based on the combination of multiple (quantitative) structure-activity relationship ((Q)SAR) tools. Non-evaluated substances used in printed paper and board food contact materials (FCM) were selected for a case study. By applying our strategy, 106 out of the 1723 substances were assigned 'high priority' as they were predicted mutagenic by 4 different (Q)SAR models. Information provided within the models allowed to identify 53 substances for which Ames mutagenicity prediction already has in vitro Ames test results. For further prioritization, additional support could be obtained by applying local i.e. specific models, as demonstrated here for aromatic azo compounds, typically found in printed paper and board FCM. The strategy developed here can easily be applied to other groups of chemicals facing the same need for priority ranking. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Consensus models to predict endocrine disruption for all ...

    Science.gov (United States)

    Humans are potentially exposed to tens of thousands of man-made chemicals in the environment. It is well known that some environmental chemicals mimic natural hormones and thus have the potential to be endocrine disruptors. Most of these environmental chemicals have never been tested for their ability to disrupt the endocrine system, in particular, their ability to interact with the estrogen receptor. EPA needs tools to prioritize thousands of chemicals, for instance in the Endocrine Disruptor Screening Program (EDSP). Collaborative Estrogen Receptor Activity Prediction Project (CERAPP) was intended to be a demonstration of the use of predictive computational models on HTS data including ToxCast and Tox21 assays to prioritize a large chemical universe of 32464 unique structures for one specific molecular target – the estrogen receptor. CERAPP combined multiple computational models for prediction of estrogen receptor activity, and used the predicted results to build a unique consensus model. Models were developed in collaboration between 17 groups in the U.S. and Europe and applied to predict the common set of chemicals. Structure-based techniques such as docking and several QSAR modeling approaches were employed, mostly using a common training set of 1677 compounds provided by U.S. EPA, to build a total of 42 classification models and 8 regression models for binding, agonist and antagonist activity. All predictions were evaluated on ToxCast data and on an exte

  5. Molecular docking and 3D-QSAR studies on triazolinone and pyridazinone, non-nucleoside inhibitor of HIV-1 reverse transcriptase.

    Science.gov (United States)

    Sivan, Sree Kanth; Manga, Vijjulatha

    2010-06-01

    Nonnucleoside reverse transcriptase inhibitors (NNRTIs) are allosteric inhibitors of the HIV-1 reverse transcriptase. Recently a series of Triazolinone and Pyridazinone were reported as potent inhibitors of HIV-1 wild type reverse transcriptase. In the present study, docking and 3D quantitative structure activity relationship (3D QSAR) studies involving comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were performed on 31 molecules. Ligands were built and minimized using Tripos force field and applying Gasteiger-Hückel charges. These ligands were docked into protein active site using GLIDE 4.0. The docked poses were analyzed; the best docked poses were selected and aligned. CoMFA and CoMSIA fields were calculated using SYBYL6.9. The molecules were divided into training set and test set, a PLS analysis was performed and QSAR models were generated. The model showed good statistical reliability which is evident from the r2 nv, q2 loo and r2 pred values. The CoMFA model provides the most significant correlation of steric and electrostatic fields with biological activities. The CoMSIA model provides a correlation of steric, electrostatic, acceptor and hydrophobic fields with biological activities. The information rendered by 3D QSAR model initiated us to optimize the lead and design new potential inhibitors.

  6. Pyrid-2-yl and 2-CyanoPhenyl fused heterocyclic compounds as human P2X3 inhibitors: a combined approach based on homology modelling, docking and QSAR analysis.

    Science.gov (United States)

    Janardhan, Sridhara; Seth, Subhendu; Viswanadhan, Vellarkad N

    2014-02-01

    P2X receptors are hetero-oligomeric proteins that function as membrane ion channels and are gated by extracellular ATP. The hP2X[Formula: see text] subunit is a constituent of the channels on a subset of sensory neurons involved in pain signaling, where ATP released by damaged and inflamed tissue can initiate action potentials. Hence, the inhibition of ATP-activated P2X3 receptor is an exciting approach for the treatment of inflammatory and neuropathic pain. Recently, the crystal structures of zebrafish P2X4 (zP2X4) were obtained in closed, apo state (PDB ID: 3I5D) and ATP-bound, open state (PDB ID: 4DW1). These structures were used to develop a homology model of human P2X3 (hP2X3 in order to identify through docking studies, the binding modes of known P2X3 inhibitors and their key active site interactions, along with a pharmacophore-based 3D-QSAR model for a series of 136 Pyrid-2-yl and 2-CyanoPhenyl fused heterocyclic compounds. These 3D-QSAR models have been developed with different combinations of training and test set divisions obtained by random separation, Jarvis-Patrick clustering, K-means clustering and sphere exclusion methods. The best predictive 3D-QSAR model resulted in training set R2 of 0.75, internal test set Q2 of 0.74, Pearson-R value of 0.87 and root mean square error of 0.37. The information generated by the pharmacophore model and docking analyses using the homology model provides valuable clues to design novel potent hP2X3 inhibitors.

  7. Design and combinatorial library generation of 1H 1,4 benzodiazepine 2,5 diones as photosystem-II inhibitors: A public QSAR approach

    Directory of Open Access Journals (Sweden)

    Purusottam Banjare

    2017-09-01

    Full Text Available Exponential rise in the population around the word increased the demand of food grains/crops with limited expansion of the agricultural land. To meet the demand, generation of new herbicidal agents is of primary need for the manufacturing firm. In silico tool like QSAR is one of the regularly used in designing newer compounds along with wet experiment. Photosystem-II (PS-II regarded as one of the major target for the herbicidal agents. With this aim in the present study a series of 1H, 1,4 benzodiazepine 2,5-dione analogues as herbicidal (PS-II inhibitors agents were subjected to QSAR analysis using 2D PaDEL descriptors (open source. Two different splitting techniques namely, kennard stone based and k-means clustering splitting were used to divide the whole data set and GFA based on MAE criteria was used a statistical method to develop a model to investigate the physicochemical and structural requirement of potential PS-II inhibitors. All the models are statistically robust both internally and externally (Q2: 0.540–0.693, R2pred: 0.722–0.810. The activity is mostly affected by polarizabilities, electro negativities as well as substituents at the phenyl ring. Based on the results, a library of compounds was generated using SmiLib v2.0 tool (open source and better predicted inside applicability domain compounds were identified by applying three different applicability domain (AD approaches. Therefore the developed public QSAR models may be helpful for the scientific community for the further research.

  8. A new model of flavonoids affinity towards P-glycoprotein: genetic algorithm-support vector machine with features selected by a modified particle swarm optimization algorithm.

    Science.gov (United States)

    Cui, Ying; Chen, Qinggang; Li, Yaxiao; Tang, Ling

    2017-02-01

    Flavonoids exhibit a high affinity for the purified cytosolic NBD (C-terminal nucleotide-binding domain) of P-glycoprotein (P-gp). To explore the affinity of flavonoids for P-gp, quantitative structure-activity relationship (QSAR) models were developed using support vector machines (SVMs). A novel method coupling a modified particle swarm optimization algorithm with random mutation strategy and a genetic algorithm coupled with SVM was proposed to simultaneously optimize the kernel parameters of SVM and determine the subset of optimized features for the first time. Using DRAGON descriptors to represent compounds for QSAR, three subsets (training, prediction and external validation set) derived from the dataset were employed to investigate QSAR. With excluding of the outlier, the correlation coefficient (R 2 ) of the whole training set (training and prediction) was 0.924, and the R 2 of the external validation set was 0.941. The root-mean-square error (RMSE) of the whole training set was 0.0588; the RMSE of the cross-validation of the external validation set was 0.0443. The mean Q 2 value of leave-many-out cross-validation was 0.824. With more informations from results of randomization analysis and applicability domain, the proposed model is of good predictive ability, stability.

  9. Beyond the scope of Free-Wilson analysis: building interpretable QSAR models with machine learning algorithms.

    Science.gov (United States)

    Chen, Hongming; Carlsson, Lars; Eriksson, Mats; Varkonyi, Peter; Norinder, Ulf; Nilsson, Ingemar

    2013-06-24

    A novel methodology was developed to build Free-Wilson like local QSAR models by combining R-group signatures and the SVM algorithm. Unlike Free-Wilson analysis this method is able to make predictions for compounds with R-groups not present in a training set. Eleven public data sets were chosen as test cases for comparing the performance of our new method with several other traditional modeling strategies, including Free-Wilson analysis. Our results show that the R-group signature SVM models achieve better prediction accuracy compared with Free-Wilson analysis in general. Moreover, the predictions of R-group signature models are also comparable to the models using ECFP6 fingerprints and signatures for the whole compound. Most importantly, R-group contributions to the SVM model can be obtained by calculating the gradient for R-group signatures. For most of the studied data sets, a significant correlation with that of a corresponding Free-Wilson analysis is shown. These results suggest that the R-group contribution can be used to interpret bioactivity data and highlight that the R-group signature based SVM modeling method is as interpretable as Free-Wilson analysis. Hence the signature SVM model can be a useful modeling tool for any drug discovery project.

  10. First report on 3D-QSAR and molecular dynamics based docking studies of GCPII inhibitors for targeted drug delivery applications

    Science.gov (United States)

    Pandit, Amit; Sengupta, Sagnik; Krishnan, Mena Asha; Reddy, Ramesh B.; Sharma, Rajesh; Venkatesh, Chelvam

    2018-05-01

    Prostate Specific Membrane Antigen (PSMA) or Glutamate carboxypeptidase II (GCPII) has been identified as an important target in diagnosis and therapy of prostate cancer. Among several types of inhibitors, urea based inhibitors are the most common and widely employed in preclinical and clinical studies. Computational studies have been carried out to uncover active sites and interaction of PSMA inhibitors with the protein by modifying the core structure of the ligand. Analysis of the literature, however, show lack of 3-D quantitative structure activity relationship (QSAR) and molecular dynamics based molecular docking study to identify structural modifications responsible for better GCPII inhibitory activity. The present study aims to fulfil this gap by analysing well known PSMA inhibitors reported in the literature with known experimental PSMA inhibition constants. Also in order to validate the in silico study, a new GCPII inhibitor 7 was designed, synthesized and experimental PSMA enzyme inhibition was evaluated by using freshly isolated PSMA protein from human cancer cell line derived from lymph node, LNCaP. 3D-QSAR CoMFA models on 58 urea based GCPII inhibitors were generated, and the best correlation was obtained in Gast-Huck charge assigning method with q2, r2 and predictive r2 values as 0.592, 0.995 and 0.842 respectively. Moreover, steric, electrostatic, and hydrogen bond donor field contribution analysis provided best statistical values from CoMSIA model (q2, r2 and predictive r2 as 0.527, 0.981 and 0.713 respectively). Contour maps study revealed that electrostatic field contribution is the major factor for discovering better binding affinity ligands. Further molecular dynamic assisted molecular docking was also performed on GCPII receptor (PDB ID 4NGM) and most active GCPII inhibitor, DCIBzL. 4NGM co-crystallised ligand, JB7 was used to validate the docking procedure and the amino acid interactions present in JB7 are compared with DCIBzL. The results

  11. In silico modelling of permeation enhancement potency in Caco-2 monolayers based on molecular descriptors and random forest

    DEFF Research Database (Denmark)

    Welling, Søren Havelund; Clemmensen, Line Katrine Harder; Buckley, Stephen T.

    2015-01-01

    has been developed.The random forest-QSAR model was based upon Caco-2 data for 41 surfactant-like permeation enhancers from Whitehead et al. (2008) and molecular descriptors calculated from their structure.The QSAR model was validated by two test-sets: (i) an eleven compound experimental set with Caco......-2 data and (ii) nine compounds with Caco-2 data from literature. Feature contributions, a recent developed diagnostic tool, was applied to elucidate the contribution of individual molecular descriptors to the predicted potency. Feature contributions provided easy interpretable suggestions...

  12. Novel dimer based descriptors with solvational computation for QSAR study of oxadiazoylbenzoyl-ureas as novel insect-growth regulators.

    Science.gov (United States)

    Fan, Feng; Cheng, Jiagao; Li, Zhong; Xu, Xiaoyong; Qian, Xuhong

    2010-02-01

    Molecular aggregation state of bioactive compounds plays a key role in its bio-interactive procedure. In this article, based on the structure information of dimers, the simplest model of molecular aggregation state, and combined with solvational computation, total four descriptors (DeltaV, MR2, DeltaE(1), and DeltaE(2)) were calculated for QSAR study of a novel insect-growth regulator, N-(5-phenyl-1,3,4-oxadiazol-2-yl)-N'-benzoyl urea. Two QSAR models were constructed with r(2) = 0.671, q(2) = 0.516 and r(2) = 0.816, q(2) = 0.695, respectively. It implicates that the bioactivity may strongly depend on the characters of molecular aggregation state, especially on the dimeric transport ability from oil phase to water phase. Copyright 2009 Wiley Periodicals, Inc.

  13. CoMFA and CoMSIA 3D-QSAR studies on S(6)-(4-nitrobenzyl)mercaptopurine riboside (NBMPR) analogs as inhibitors of human equilibrative nucleoside transporter 1 (hENT1).

    Science.gov (United States)

    Gupte, Amol; Buolamwini, John K

    2009-01-15

    3D-QSAR (CoMFA and CoMSIA) studies were performed on human equlibrative nucleoside transporter (hENT1) inhibitors displaying K(i) values ranging from 10,000 to 0.7nM. Both CoMFA and CoMSIA analysis gave reliable models with q2 values >0.50 and r2 values >0.92. The models have been validated for their stability and robustness using group validation and bootstrapping techniques and for their predictive abilities using an external test set of nine compounds. The high predictive r2 values of the test set (0.72 for CoMFA model and 0.74 for CoMSIA model) reveals that the models can prove to be a useful tool for activity prediction of newly designed nucleoside transporter inhibitors. The CoMFA and CoMSIA contour maps identify features important for exhibiting good binding affinities at the transporter, and can thus serve as a useful guide for the design of potential equilibrative nucleoside transporter inhibitors.

  14. Hsp90 inhibitors, part 1: definition of 3-D QSAutogrid/R models as a tool for virtual screening.

    Science.gov (United States)

    Ballante, Flavio; Caroli, Antonia; Wickersham, Richard B; Ragno, Rino

    2014-03-24

    The multichaperone heat shock protein (Hsp) 90 complex mediates the maturation and stability of a variety of oncogenic signaling proteins. For this reason, Hsp90 has emerged as a promising target for anticancer drug development. Herein, we describe a complete computational procedure for building several 3-D QSAR models used as a ligand-based (LB) component of a comprehensive ligand-based (LB) and structure-based (SB) virtual screening (VS) protocol to identify novel molecular scaffolds of Hsp90 inhibitors. By the application of the 3-D QSAutogrid/R method, eight SB PLS 3-D QSAR models were generated, leading to a final multiprobe (MP) 3-D QSAR pharmacophoric model capable of recognizing the most significant chemical features for Hsp90 inhibition. Both the monoprobe and multiprobe models were optimized, cross-validated, and tested against an external test set. The obtained statistical results confirmed the models as robust and predictive to be used in a subsequent VS.

  15. Prediction of Acute Mammalian Toxicity Using QSAR Methods: A Case Study of Sulfur Mustard and Its Breakdown Products

    Directory of Open Access Journals (Sweden)

    John Wheeler

    2012-07-01

    Full Text Available Predicting toxicity quantitatively, using Quantitative Structure Activity Relationships (QSAR, has matured over recent years to the point that the predictions can be used to help identify missing comparison values in a substance’s database. In this manuscript we investigate using the lethal dose that kills fifty percent of a test population (the LD50 for determining relative toxicity of a number of substances. In general, the smaller the LD50 value, the more toxic the chemical, and the larger the LD50 value, the lower the toxicity. When systemic toxicity and other specific toxicity data are unavailable for the chemical(s of interest, during emergency responses, LD50 values may be employed to determine the relative toxicity of a series of chemicals. In the present study, a group of chemical warfare agents and their breakdown products have been evaluated using four available rat oral QSAR LD50 models. The QSAR analysis shows that the breakdown products of Sulfur Mustard (HD are predicted to be less toxic than the parent compound as well as other known breakdown products that have known toxicities. The QSAR estimated break down products LD50 values ranged from 299 mg/kg to 5,764 mg/kg. This evaluation allows for the ranking and toxicity estimation of compounds for which little toxicity information existed; thus leading to better risk decision making in the field.

  16. Imidazole derivatives as angiotensin II AT1 receptor blockers: Benchmarks, drug-like calculations and quantitative structure-activity relationships modeling

    Science.gov (United States)

    Alloui, Mebarka; Belaidi, Salah; Othmani, Hasna; Jaidane, Nejm-Eddine; Hochlaf, Majdi

    2018-03-01

    We performed benchmark studies on the molecular geometry, electron properties and vibrational analysis of imidazole using semi-empirical, density functional theory and post Hartree-Fock methods. These studies validated the use of AM1 for the treatment of larger systems. Then, we treated the structural, physical and chemical relationships for a series of imidazole derivatives acting as angiotensin II AT1 receptor blockers using AM1. QSAR studies were done for these imidazole derivatives using a combination of various physicochemical descriptors. A multiple linear regression procedure was used to design the relationships between molecular descriptor and the activity of imidazole derivatives. Results validate the derived QSAR model.

  17. MODEL QSAR SENYAWA FLUOROKUINOLON BARU SEBAGAI ZAT ANTIBAKTERI Salmonella thypimurium

    Directory of Open Access Journals (Sweden)

    Eva Vaulina

    2006-11-01

    Full Text Available Modelling of novel Fluoroquinolone derivates as antibacterial compund of Salmonella thypimurium was conducted. The research was done as an initial step in discovering some new Fluoroquinolone compounds which have higher activity to Salmonella thypimurium. There are 16 compunds that use as the material of the research and they already have antibacterial activity data that expressed in Minimal Inhibitory Concentration (MIC, mg/mL. Calculation was performed by semiempirical AM1 method. The QSAR model was determined by multilinear regression analysis, with Log MIC as dependent variable and the independent variables are atomic net charges of C5 (qC5 and C7 (qC7, dipole moment (m, polarizability (a, n-octanol-water coefficien partition (Log P, molecular weight (Mw, and surface area of van der Waals (AvdW. The relationship between Log MIC and the descriptors which performed by statistical analysis is:(Log MIC = -2.119 + 34.541(qC5 – 19.748(qC7 – 0.919polar + 1.170logP + 0.111(Mw – 0.003(Avdw, with n =16, r = 0.907, r2 = 0.822, SD = 0.288, F calc = 6.938, F table = 3.374 , F calc/F table = 2.056 and PRESS = 0.749. The research can obtain the new coumpounds that modified from compound number 16 (etil fluoroquinolone, MIC prediction = 0.0354 mg/mL, (etil fluoroquinlone fosfate, 2.84. 10-19mg/mL, and (isopropyl fluoroquinlone, 0.1085 mg/mL, and compound number 2 (m-nitro fluoroquinolone sulfonat, 1.32. 10-11mg/mL. This results can be suggested to synthesis step

  18. QSAR Methods to Screen Endocrine Disruptors

    Directory of Open Access Journals (Sweden)

    Nicola Porta

    2016-08-01

    Full Text Available The identification of endocrine disrupting chemicals (EDCs is one of the important goals of environmental chemical hazard screening. We report on in silico methods addressing toxicological studies about EDCs with a special focus on the application of QSAR models for screening purpose. Since Estrogen-like (ER activity has been extensively studied, the majority of the available models are based on ER-related endpoints. Some of these models are here reviewed and described. As example for their application, we screen an assembled dataset of candidate substitutes for some known EDCs belonging to the chemical classes of phthalates, bisphenols and parabens, selected considering their toxicological relevance and broad application, with the general aim of preliminary assessing their ED potential. The goal of the substitution processes is to advance inherently safer chemicals and products, consistent with the principles of green chemistry. Results suggest that the integration of a family of different models accounting for different endpoints can be a convenient way to describe ED as properly as possible and allow also both to increase the confidence of the predictions and to maximize the probability that most active compounds are correctly found.

  19. Pharmacophore generation, atom-based 3D-QSAR, molecular docking and molecular dynamics simulation studies on benzamide analogues as FtsZ inhibitors.

    Science.gov (United States)

    Tripathy, Swayansiddha; Azam, Mohammed Afzal; Jupudi, Srikanth; Sahu, Susanta Kumar

    2017-10-11

    FtsZ is an appealing target for the design of antimicrobial agent that can be used to defeat the multidrug-resistant bacterial pathogens. Pharmacophore modelling, molecular docking and molecular dynamics (MD) simulation studies were performed on a series of three-substituted benzamide derivatives. In the present study a five-featured pharmacophore model with one hydrogen bond acceptors, one hydrogen bond donors, one hydrophobic and two aromatic rings was developed using 97 molecules having MIC values ranging from .07 to 957 μM. A statistically significant 3D-QSAR model was obtained using this pharmacophore hypothesis with a good correlation coefficient (R 2  = .8319), cross validated coefficient (Q 2  = .6213) and a high Fisher ratio (F = 103.9) with three component PLS factor. A good correlation between experimental and predicted activity of the training (R 2  = .83) and test set (R 2  = .67) molecules were displayed by ADHRR.1682 model. The generated model was further validated by enrichment studies using the decoy test and MAE-based criteria to measure the efficiency of the model. The docking studies of all selected inhibitors in the active site of FtsZ protein showed crucial hydrogen bond interactions with Val 207, Asn 263, Leu 209, Gly 205 and Asn-299 residues. The binding free energies of these inhibitors were calculated by the molecular mechanics/generalized born surface area VSGB 2.0 method. Finally, a 15 ns MD simulation was done to confirm the stability of the 4DXD-ligand complex. On a wider scope, the prospect of present work provides insight in designing molecules with better selective FtsZ inhibitory potential.

  20. QSAR study of benzimidazole derivatives inhibition on escherichia ...

    African Journals Online (AJOL)

    The paper describes a quantitative structure-activity relationship (QSAR) study of IC50 values of benzimidazole derivatives on escherichia coli methionine aminopeptidase. The activity of the 32 inhibitors has been estimated by means of multiple linear regression (MLR) and artificial neural network (ANN) techniques.

  1. DESIGN OF LOW CYTOTOXICITY DIARYLANILINE DERIVATIVES BASED ON QSAR RESULTS: AN APPLICATION OF ARTIFICIAL NEURAL NETWORK MODELLING

    Directory of Open Access Journals (Sweden)

    Ihsanul Arief

    2016-11-01

    Full Text Available Study on cytotoxicity of diarylaniline derivatives by using quantitative structure-activity relationship (QSAR has been done. The structures and cytotoxicities of  diarylaniline derivatives were obtained from the literature. Calculation of molecular and electronic parameters was conducted using Austin Model 1 (AM1, Parameterized Model 3 (PM3, Hartree-Fock (HF, and density functional theory (DFT methods.  Artificial neural networks (ANN analysis used to produce the best equation with configuration of input data-hidden node-output data = 5-8-1, value of r2 = 0.913; PRESS = 0.069. The best equation used to design and predict new diarylaniline derivatives.  The result shows that compound N1-(4′-Cyanophenyl-5-(4″-cyanovinyl-2″,6″-dimethyl-phenoxy-4-dimethylether benzene-1,2-diamine is the best-proposed compound with cytotoxicity value (CC50 of 93.037 μM.

  2. Estudos de QSAR 3D para um conjunto de inibidores de butirilcolinesterase humana QSAR 3D studies of a series of human butyrylcholinesterase inhibitors

    Directory of Open Access Journals (Sweden)

    Humberto F. Freitas

    2009-01-01

    Full Text Available Alzheimer's disease (AD is considered the main cause of cognitive decline in adults. The available therapies for AD treatment seek to maintain the activity of cholinergic system through the inhibition of the enzyme acetylcholinesterase. However, butyrylcholinesterase (BuChE can be considered an alternative target for AD treatment. Aiming at developing new BuChE inhibitors, robust QSAR 3D models with high predictive power were developed. The best model presents a good fit (r²=0.82, q²=0.76, with two PCs and high predictive power (r²predict=0.88. Analysis of regression vector shows that steric properties have considerable importance to the inhibition of the BuChE.

  3. What if the number of nanotoxicity data is too small for developing predictive Nano-QSAR models? An alternative read-across based approach for filling data gaps.

    Science.gov (United States)

    Gajewicz, Agnieszka

    2017-06-22

    Over the past decade, computational nanotoxicology, in particular Quantitative Structure-Activity Relationship models (Nano-QSAR) that help in assessing the biological effects of nanomaterials, have received much attention. In effect, a solid basis for uncovering the relationships between the structure and property/activity of nanoparticles has been created. Nonetheless, six years after the first pioneering computational studies focusing on the investigation of nanotoxicity were commenced, these computational methods still suffer from many limitations. These are mainly related to the paucity of widely available, systematically varied, libraries of experimental data necessary for the development and validation of such models. This results in the still-low acceptance of these methods as valuable research tools for nanosafety and raises the query as to whether these methods could gain wide acceptance of regulatory bodies as alternatives for traditional in vitro methods. This study aimed to give an answer to the following question: How to remedy the paucity of experimental nanotoxicity data and thereby, overcome key roadblock that hinders the development of approaches for data-driven modeling of nanoparticle properties and toxicities? Here, a simple and transparent read-across algorithm for a pre-screening hazard assessment of nanomaterials that provides reasonably accurate results by making the best use of existing limited set of observations will be introduced.

  4. Synthesis, antifungal activity, and QSAR studies of 1,6-dihydropyrimidine derivatives

    Directory of Open Access Journals (Sweden)

    Chirag Rami

    2013-01-01

    Full Text Available Introduction: A practical synthesis of pyrimidinone would be very helpful for chemists because pyrimidinone is found in many bioactive natural products and exhibits a wide range of biological properties. The biological significance of pyrimidine derivatives has led us to the synthesis of substituted pyrimidine. Materials and Methods: With the aim of developing potential antimicrobials, new series of 5-cyano-6-oxo-1,6-dihydro-pyrimidine derivatives namely 2-(5-cyano-6-oxo-4-substituted (aryl-1,6-dihydropyrimidin-2-ylthio-N-substituted (phenyl acetamide (C1-C41 were synthesized and characterized by Fourier transform infrared spectroscopy (FTIR, mass analysis, and proton nuclear magnetic resonance ( 1 H NMR. All the compounds were screened for their antifungal activity against Candida albicans (MTCC, 227. Results and Discussion: Quantitative structure activity relationship (QSAR studies of a series of 1,6-dihydro-pyrimidine were carried out to study various structural requirements for fungal inhibition. Various lipophilic, electronic, geometric, and spatial descriptors were correlated with antifungal activity using genetic function approximation. Developed models were found predictive as indicated by their square of predictive regression values (r 2pred and their internal and external cross-validation. Study reveals that CHI_3_C, Molecular_SurfaceArea, and Jurs_DPSA_1 contributed significantly to the activity along with some electronic, geometric, and quantum mechanical descriptors. Conclusion: A careful analysis of the antifungal activity data of synthesized compounds revealed that electron withdrawing substitution on N-phenyl acetamide ring of 1,6-dihydropyrimidine moiety possess good activity.

  5. Modeling Chronic Toxicity: A Comparison of Experimental Variability With (QSAR/Read-Across Predictions

    Directory of Open Access Journals (Sweden)

    Christoph Helma

    2018-04-01

    Full Text Available This study compares the accuracy of (QSAR/read-across predictions with the experimental variability of chronic lowest-observed-adverse-effect levels (LOAELs from in vivo experiments. We could demonstrate that predictions of the lazy structure-activity relationships (lazar algorithm within the applicability domain of the training data have the same variability as the experimental training data. Predictions with a lower similarity threshold (i.e., a larger distance from the applicability domain are also significantly better than random guessing, but the errors to be expected are higher and a manual inspection of prediction results is highly recommended.

  6. A Combination of 3D-QSAR, Molecular Docking and Molecular Dynamics Simulation Studies of Benzimidazole-Quinolinone Derivatives as iNOS Inhibitors

    Directory of Open Access Journals (Sweden)

    Peixun Liu

    2012-09-01

    Full Text Available Inducible Nitric Oxide Synthase (iNOS has been involved in a variety of diseases, and thus it is interesting to discover and optimize new iNOS inhibitors. In previous studies, a series of benzimidazole-quinolinone derivatives with high inhibitory activity against human iNOS were discovered. In this work, three-dimensional quantitative structure-activity relationships (3D-QSAR, molecular docking and molecular dynamics (MD simulation approaches were applied to investigate the functionalities of active molecular interaction between these active ligands and iNOS. A QSAR model with R2 of 0.9356, Q2 of 0.8373 and Pearson-R value of 0.9406 was constructed, which presents a good predictive ability in both internal and external validation. Furthermore, a combined analysis incorporating the obtained model and the MD results indicates: (1 compounds with the proper-size hydrophobic substituents at position 3 in ring-C (R3 substituent, hydrophilic substituents near the X6 of ring-D and hydrophilic or H-bond acceptor groups at position 2 in ring-B show enhanced biological activities; (2 Met368, Trp366, Gly365, Tyr367, Phe363, Pro344, Gln257, Val346, Asn364, Met349, Thr370, Glu371 and Tyr485 are key amino acids in the active pocket, and activities of iNOS inhibitors are consistent with their capability to alter the position of these important residues, especially Glu371 and Thr370. The results provide a set of useful guidelines for the rational design of novel iNOS inhibitors.

  7. Hepatotoxicity evaluation of traditional Chinese medicines using a computational molecular model.

    Science.gov (United States)

    Zhao, Pan; Liu, Bin; Wang, Chunya

    2017-11-01

    Liver injury caused by traditional Chinese medicines (TCMs) is reported from many countries around the world. TCM hepatotoxicity has attracted worldwide concerns. This study aims to develop a more applicable and optimal tool to evaluate TCM hepatotoxicity. A quantitative structure-activity relationship (QSAR) analysis was performed based on published data and U.S. Food and Drug Administration's Liver Toxicity Knowledge Base (LTKB). Eleven herbal ingredients with proven liver toxicity in the literature were added into the dataset besides chemicals from LTKB. The finally generated QSAR model yielded a sensitivity of 83.8%, a specificity of 70.1%, and an accuracy of 80.2%. Among the externally tested 20 ingredients from TCMs, 14 hepatotoxic ingredients were all accurately identified by the QSAR model derived from the dataset containing natural hepatotoxins. Adding natural hepatotoxins into the dataset makes the QSAR model more applicable for TCM hepatotoxicity assessment, which provides a right direction in the methodology study for TCM safety evaluation. The generated QSAR model has the practical value to prioritize the hepatotoxicity risk of TCM compounds. Furthermore, an open-access international specialized database on TCM hepatotoxicity should be quickly established.

  8. Neural network-based QSAR and insecticide discovery: spinetoram

    Science.gov (United States)

    Sparks, Thomas C.; Crouse, Gary D.; Dripps, James E.; Anzeveno, Peter; Martynow, Jacek; DeAmicis, Carl V.; Gifford, James

    2008-06-01

    Improvements in the efficacy and spectrum of the spinosyns, novel fermentation derived insecticide, has long been a goal within Dow AgroSciences. As large and complex fermentation products identifying specific modifications to the spinosyns likely to result in improved activity was a difficult process, since most modifications decreased the activity. A variety of approaches were investigated to identify new synthetic directions for the spinosyn chemistry including several explorations of the quantitative structure activity relationships (QSAR) of spinosyns, which initially were unsuccessful. However, application of artificial neural networks (ANN) to the spinosyn QSAR problem identified new directions for improved activity in the chemistry, which subsequent synthesis and testing confirmed. The ANN-based analogs coupled with other information on substitution effects resulting from spinosyn structure activity relationships lead to the discovery of spinetoram (XDE-175). Launched in late 2007, spinetoram provides both improved efficacy and an expanded spectrum while maintaining the exceptional environmental and toxicological profile already established for the spinosyn chemistry.

  9. A QSAR study of integrase strand transfer inhibitors based on a large set of pyrimidine, pyrimidone, and pyridopyrazine carboxamide derivatives

    Science.gov (United States)

    de Campos, Luana Janaína; de Melo, Eduardo Borges

    2017-08-01

    In the present study, 199 compounds derived from pyrimidine, pyrimidone and pyridopyrazine carboxamides with inhibitory activity against HIV-1 integrase were modeled. Subsequently, a multivariate QSAR study was conducted with 54 molecules employed by Ordered Predictors Selection (OPS) and Partial Least Squares (PLS) for the selection of variables and model construction, respectively. Topological, electrotopological, geometric, and molecular descriptors were used. The selected real model was robust and free from chance correlation; in addition, it demonstrated favorable internal and external statistical quality. Once statistically validated, the training model was used to predict the activity of a second data set (n = 145). The root mean square deviation (RMSD) between observed and predicted values was 0.698. Although it is a value outside of the standards, only 15 (10.34%) of the samples exhibited higher residual values than 1 log unit, a result considered acceptable. Results of Williams and Euclidean applicability domains relative to the prediction showed that the predictions did not occur by extrapolation and that the model is representative of the chemical space of test compounds.

  10. Development and validation of a predictive risk model for all-cause mortality in type 2 diabetes.

    Science.gov (United States)

    Robinson, Tom E; Elley, C Raina; Kenealy, Tim; Drury, Paul L

    2015-06-01

    Type 2 diabetes is common and is associated with an approximate 80% increase in the rate of mortality. Management decisions may be assisted by an estimate of the patient's absolute risk of adverse outcomes, including death. This study aimed to derive a predictive risk model for all-cause mortality in type 2 diabetes. We used primary care data from a large national multi-ethnic cohort of patients with type 2 diabetes in New Zealand and linked mortality records to develop a predictive risk model for 5-year risk of mortality. We then validated this model using information from a separate cohort of patients with type 2 diabetes. 26,864 people were included in the development cohort with a median follow up time of 9.1 years. We developed three models initially using demographic information and then progressively more clinical detail. The final model, which also included markers of renal disease, proved to give best prediction of all-cause mortality with a C-statistic of 0.80 in the development cohort and 0.79 in the validation cohort (7610 people) and was well calibrated. Ethnicity was a major factor with hazard ratios of 1.37 for indigenous Maori, 0.41 for East Asian and 0.55 for Indo Asian compared with European (P<0.001). We have developed a model using information usually available in primary care that provides good assessment of patient's risk of death. Results are similar to models previously published from smaller cohorts in other countries and apply to a wider range of patient ethnic groups. Copyright © 2015. Published by Elsevier Ireland Ltd.

  11. Molecular Modeling Studies of 11β-Hydroxysteroid Dehydrogenase Type 1 Inhibitors through Receptor-Based 3D-QSAR and Molecular Dynamics Simulations.

    Science.gov (United States)

    Qian, Haiyan; Chen, Jiongjiong; Pan, Youlu; Chen, Jianzhong

    2016-09-19

    11β-Hydroxysteroid dehydrogenase type 1 (11β-HSD1) is a potential target for the treatment of numerous human disorders, such as diabetes, obesity, and metabolic syndrome. In this work, molecular modeling studies combining molecular docking, 3D-QSAR, MESP, MD simulations and free energy calculations were performed on pyridine amides and 1,2,4-triazolopyridines as 11β-HSD1 inhibitors to explore structure-activity relationships and structural requirement for the inhibitory activity. 3D-QSAR models, including CoMFA and CoMSIA, were developed from the conformations obtained by docking strategy. The derived pharmacophoric features were further supported by MESP and Mulliken charge analyses using density functional theory. In addition, MD simulations and free energy calculations were employed to determine the detailed binding process and to compare the binding modes of inhibitors with different bioactivities. The binding free energies calculated by MM/PBSA showed a good correlation with the experimental biological activities. Free energy analyses and per-residue energy decomposition indicated the van der Waals interaction would be the major driving force for the interactions between an inhibitor and 11β-HSD1. These unified results may provide that hydrogen bond interactions with Ser170 and Tyr183 are favorable for enhancing activity. Thr124, Ser170, Tyr177, Tyr183, Val227, and Val231 are the key amino acid residues in the binding pocket. The obtained results are expected to be valuable for the rational design of novel potent 11β-HSD1 inhibitors.

  12. Molecular Modeling Studies of 11β-Hydroxysteroid Dehydrogenase Type 1 Inhibitors through Receptor-Based 3D-QSAR and Molecular Dynamics Simulations

    Directory of Open Access Journals (Sweden)

    Haiyan Qian

    2016-09-01

    Full Text Available 11β-Hydroxysteroid dehydrogenase type 1 (11β-HSD1 is a potential target for the treatment of numerous human disorders, such as diabetes, obesity, and metabolic syndrome. In this work, molecular modeling studies combining molecular docking, 3D-QSAR, MESP, MD simulations and free energy calculations were performed on pyridine amides and 1,2,4-triazolopyridines as 11β-HSD1 inhibitors to explore structure-activity relationships and structural requirement for the inhibitory activity. 3D-QSAR models, including CoMFA and CoMSIA, were developed from the conformations obtained by docking strategy. The derived pharmacophoric features were further supported by MESP and Mulliken charge analyses using density functional theory. In addition, MD simulations and free energy calculations were employed to determine the detailed binding process and to compare the binding modes of inhibitors with different bioactivities. The binding free energies calculated by MM/PBSA showed a good correlation with the experimental biological activities. Free energy analyses and per-residue energy decomposition indicated the van der Waals interaction would be the major driving force for the interactions between an inhibitor and 11β-HSD1. These unified results may provide that hydrogen bond interactions with Ser170 and Tyr183 are favorable for enhancing activity. Thr124, Ser170, Tyr177, Tyr183, Val227, and Val231 are the key amino acid residues in the binding pocket. The obtained results are expected to be valuable for the rational design of novel potent 11β-HSD1 inhibitors.

  13. QSAR-Driven Design and Discovery of Novel Compounds With Antiplasmodial and Transmission Blocking Activities.

    Science.gov (United States)

    Lima, Marilia N N; Melo-Filho, Cleber C; Cassiano, Gustavo C; Neves, Bruno J; Alves, Vinicius M; Braga, Rodolpho C; Cravo, Pedro V L; Muratov, Eugene N; Calit, Juliana; Bargieri, Daniel Y; Costa, Fabio T M; Andrade, Carolina H

    2018-01-01

    Malaria is a life-threatening infectious disease caused by parasites of the genus Plasmodium , affecting more than 200 million people worldwide every year and leading to about a half million deaths. Malaria parasites of humans have evolved resistance to all current antimalarial drugs, urging for the discovery of new effective compounds. Given that the inhibition of deoxyuridine triphosphatase of Plasmodium falciparum ( Pf dUTPase) induces wrong insertions in plasmodial DNA and consequently leading the parasite to death, this enzyme is considered an attractive antimalarial drug target. Using a combi-QSAR (quantitative structure-activity relationship) approach followed by virtual screening and in vitro experimental evaluation, we report herein the discovery of novel chemical scaffolds with in vitro potency against asexual blood stages of both P. falciparum multidrug-resistant and sensitive strains and against sporogonic development of P. berghei . We developed 2D- and 3D-QSAR models using a series of nucleosides reported in the literature as Pf dUTPase inhibitors. The best models were combined in a consensus approach and used for virtual screening of the ChemBridge database, leading to the identification of five new virtual Pf dUTPase inhibitors. Further in vitro testing on P. falciparum multidrug-resistant (W2) and sensitive (3D7) parasites showed that compounds LabMol-144 and LabMol-146 demonstrated fair activity against both strains and presented good selectivity versus mammalian cells. In addition, LabMol-144 showed good in vitro inhibition of P. berghei ookinete formation, demonstrating that hit-to-lead optimization based on this compound may also lead to new antimalarials with transmission blocking activity.

  14. Towards cheminformatics-based estimation of drug therapeutic index: Predicting the protective index of anticonvulsants using a new quantitative structure-index relationship approach.

    Science.gov (United States)

    Chen, Shangying; Zhang, Peng; Liu, Xin; Qin, Chu; Tao, Lin; Zhang, Cheng; Yang, Sheng Yong; Chen, Yu Zong; Chui, Wai Keung

    2016-06-01

    The overall efficacy and safety profile of a new drug is partially evaluated by the therapeutic index in clinical studies and by the protective index (PI) in preclinical studies. In-silico predictive methods may facilitate the assessment of these indicators. Although QSAR and QSTR models can be used for predicting PI, their predictive capability has not been evaluated. To test this capability, we developed QSAR and QSTR models for predicting the activity and toxicity of anticonvulsants at accuracy levels above the literature-reported threshold (LT) of good QSAR models as tested by both the internal 5-fold cross validation and external validation method. These models showed significantly compromised PI predictive capability due to the cumulative errors of the QSAR and QSTR models. Therefore, in this investigation a new quantitative structure-index relationship (QSIR) model was devised and it showed improved PI predictive capability that superseded the LT of good QSAR models. The QSAR, QSTR and QSIR models were developed using support vector regression (SVR) method with the parameters optimized by using the greedy search method. The molecular descriptors relevant to the prediction of anticonvulsant activities, toxicities and PIs were analyzed by a recursive feature elimination method. The selected molecular descriptors are primarily associated with the drug-like, pharmacological and toxicological features and those used in the published anticonvulsant QSAR and QSTR models. This study suggested that QSIR is useful for estimating the therapeutic index of drug candidates. Copyright © 2016. Published by Elsevier Inc.

  15. Linear and non-linear quantitative structure-activity relationship models on indole substitution patterns as inhibitors of HIV-1 attachment.

    Science.gov (United States)

    Nirouei, Mahyar; Ghasemi, Ghasem; Abdolmaleki, Parviz; Tavakoli, Abdolreza; Shariati, Shahab

    2012-06-01

    The antiviral drugs that inhibit human immunodeficiency virus (HIV) entry to the target cells are already in different phases of clinical trials. They prevent viral entry and have a highly specific mechanism of action with a low toxicity profile. Few QSAR studies have been performed on this group of inhibitors. This study was performed to develop a quantitative structure-activity relationship (QSAR) model of the biological activity of indole glyoxamide derivatives as inhibitors of the interaction between HIV glycoprotein gp120 and host cell CD4 receptors. Forty different indole glyoxamide derivatives were selected as a sample set and geometrically optimized using Gaussian 98W. Different combinations of multiple linear regression (MLR), genetic algorithms (GA) and artificial neural networks (ANN) were then utilized to construct the QSAR models. These models were also utilized to select the most efficient subsets of descriptors in a cross-validation procedure for non-linear log (1/EC50) prediction. The results that were obtained using GA-ANN were compared with MLR-MLR and MLR-ANN models. A high predictive ability was observed for the MLR, MLR-ANN and GA-ANN models, with root mean sum square errors (RMSE) of 0.99, 0.91 and 0.67, respectively (N = 40). In summary, machine learning methods were highly effective in designing QSAR models when compared to statistical method.

  16. QSAR, molecular docking studies of thiophene and imidazopyridine derivatives as polo-like kinase 1 inhibitors

    Science.gov (United States)

    Cao, Shandong

    2012-08-01

    The purpose of the present study was to develop in silico models allowing for a reliable prediction of polo-like kinase inhibitors based on a large diverse dataset of 136 compounds. As an effective method, quantitative structure activity relationship (QSAR) was applied using the comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA). The proposed QSAR models showed reasonable predictivity of thiophene analogs (Rcv2=0.533, Rpred2=0.845) and included four molecular descriptors, namely IC3, RDF075m, Mor02m and R4e+. The optimal model for imidazopyridine derivatives (Rcv2=0.776, Rpred2=0.876) was shown to perform good in prediction accuracy, using GATS2m and BEHe1 descriptors. Analysis of the contour maps helped to identify structural requirements for the inhibitors and served as a basis for the design of the next generation of the inhibitor analogues. Docking studies were also employed to position the inhibitors into the polo-like kinase active site to determine the most probable binding mode. These studies may help to understand the factors influencing the binding affinity of chemicals and to develop alternative methods for prescreening and designing of polo-like kinase inhibitors.

  17. Binding affinity toward human prion protein of some anti-prion compounds - Assessment based on QSAR modeling, molecular docking and non-parametric ranking.

    Science.gov (United States)

    Kovačević, Strahinja; Karadžić, Milica; Podunavac-Kuzmanović, Sanja; Jevrić, Lidija

    2018-01-01

    The present study is based on the quantitative structure-activity relationship (QSAR) analysis of binding affinity toward human prion protein (huPrP C ) of quinacrine, pyridine dicarbonitrile, diphenylthiazole and diphenyloxazole analogs applying different linear and non-linear chemometric regression techniques, including univariate linear regression, multiple linear regression, partial least squares regression and artificial neural networks. The QSAR analysis distinguished molecular lipophilicity as an important factor that contributes to the binding affinity. Principal component analysis was used in order to reveal similarities or dissimilarities among the studied compounds. The analysis of in silico absorption, distribution, metabolism, excretion and toxicity (ADMET) parameters was conducted. The ranking of the studied analogs on the basis of their ADMET parameters was done applying the sum of ranking differences, as a relatively new chemometric method. The main aim of the study was to reveal the most important molecular features whose changes lead to the changes in the binding affinities of the studied compounds. Another point of view on the binding affinity of the most promising analogs was established by application of molecular docking analysis. The results of the molecular docking were proven to be in agreement with the experimental outcome. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. 2D-QSAR in hydroxamic acid derivatives as peptide deformylase inhibitors and antibacterial agents.

    Science.gov (United States)

    Gupta, Manish K; Mishra, Pradeep; Prathipati, Philip; Saxena, Anil K

    2002-12-01

    Peptide deformylase catalyzes the removal of N-formyl group from the N-formylmethionine of ribosome synthesized polypeptide in eubacteria. Quantitative structure-activity relationship (QSAR) studies have been carried out in a series of beta-sulfonyl and beta-sulfinyl hydroxamic acid derivatives for their PDF enzyme inhibitory and antibacterial activities against Escherichia coli DC2 and Moraxella catarrhalis RA21 which demonstrate that the PDF inhibitory activity in cell free and whole cell system increases with increase in molar refractivity and hydrophobicity. The comparison of the QSARs between the cell free and whole cell system indicate that the active binding sites in PDF isolated from E. coli and in M. catarrhalis RA21 are similar and the whole cell antibacterial activity is mainly due to the inhibition of PDF. Apart from this the QSARs on some matrixmetelloproteins (COL-1, COL-3, MAT and HME) and natural endopeptidase (NEP) indicate the possibilities of introducing selectivity in these hydroxamic acid derivatives for their PDF inhibitory activity.

  19. Evaluation and comparison of benchmark QSAR models to predict a relevant REACH endpoint: The bioconcentration factor (BCF).

    Science.gov (United States)

    Gissi, Andrea; Lombardo, Anna; Roncaglioni, Alessandra; Gadaleta, Domenico; Mangiatordi, Giuseppe Felice; Nicolotti, Orazio; Benfenati, Emilio

    2015-02-01

    The bioconcentration factor (BCF) is an important bioaccumulation hazard assessment metric in many regulatory contexts. Its assessment is required by the REACH regulation (Registration, Evaluation, Authorization and Restriction of Chemicals) and by CLP (Classification, Labeling and Packaging). We challenged nine well-known and widely used BCF QSAR models against 851 compounds stored in an ad-hoc created database. The goodness of the regression analysis was assessed by considering the determination coefficient (R(2)) and the Root Mean Square Error (RMSE); Cooper's statistics and Matthew's Correlation Coefficient (MCC) were calculated for all the thresholds relevant for regulatory purposes (i.e. 100L/kg for Chemical Safety Assessment; 500L/kg for Classification and Labeling; 2000 and 5000L/kg for Persistent, Bioaccumulative and Toxic (PBT) and very Persistent, very Bioaccumulative (vPvB) assessment) to assess the classification, with particular attention to the models' ability to control the occurrence of false negatives. As a first step, statistical analysis was performed for the predictions of the entire dataset; R(2)>0.70 was obtained using CORAL, T.E.S.T. and EPISuite Arnot-Gobas models. As classifiers, ACD and logP-based equations were the best in terms of sensitivity, ranging from 0.75 to 0.94. External compound predictions were carried out for the models that had their own training sets. CORAL model returned the best performance (R(2)ext=0.59), followed by the EPISuite Meylan model (R(2)ext=0.58). The latter gave also the highest sensitivity on external compounds with values from 0.55 to 0.85, depending on the thresholds. Statistics were also compiled for compounds falling into the models Applicability Domain (AD), giving better performances. In this respect, VEGA CAESAR was the best model in terms of regression (R(2)=0.94) and classification (average sensitivity>0.80). This model also showed the best regression (R(2)=0.85) and sensitivity (average>0.70) for

  20. Have artificial neural networks met expectations in drug discovery as implemented in QSAR framework?

    Science.gov (United States)

    Dobchev, Dimitar; Karelson, Mati

    2016-07-01

    Artificial neural networks (ANNs) are highly adaptive nonlinear optimization algorithms that have been applied in many diverse scientific endeavors, ranging from economics, engineering, physics, and chemistry to medical science. Notably, in the past two decades, ANNs have been used widely in the process of drug discovery. In this review, the authors discuss advantages and disadvantages of ANNs in drug discovery as incorporated into the quantitative structure-activity relationships (QSAR) framework. Furthermore, the authors examine the recent studies, which span over a broad area with various diseases in drug discovery. In addition, the authors attempt to answer the question about the expectations of the ANNs in drug discovery and discuss the trends in this field. The old pitfalls of overtraining and interpretability are still present with ANNs. However, despite these pitfalls, the authors believe that ANNs have likely met many of the expectations of researchers and are still considered as excellent tools for nonlinear data modeling in QSAR. It is likely that ANNs will continue to be used in drug development in the future.

  1. DFT and 3D-QSAR Studies of Anti-Cancer Agents m-(4-Morpholinoquinazolin-2-yl) Benzamide Derivatives for Novel Compounds Design

    Science.gov (United States)

    Zhao, Siqi; Zhang, Guanglong; Xia, Shuwei; Yu, Liangmin

    2018-06-01

    As a group of diversified frameworks, quinazolin derivatives displayed a broad field of biological functions, especially as anticancer. To investigate the quantitative structure-activity relationship, 3D-QSAR models were generated with 24 quinazolin scaffold molecules. The experimental and predicted pIC50 values for both training and test set compounds showed good correlation, which proved the robustness and reliability of the generated QSAR models. The most effective CoMFA and CoMSIA were obtained with correlation coefficient r 2 ncv of 1.00 (both) and leave-one-out coefficient q 2 of 0.61 and 0.59, respectively. The predictive abilities of CoMFA and CoMSIA were quite good with the predictive correlation coefficients ( r 2 pred ) of 0.97 and 0.91. In addition, the statistic results of CoMFA and CoMSIA were used to design new quinazolin molecules.

  2. eTOXlab, an open source modeling framework for implementing predictive models in production environments.

    Science.gov (United States)

    Carrió, Pau; López, Oriol; Sanz, Ferran; Pastor, Manuel

    2015-01-01

    Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical industry) for the early assessment of the biological properties of new compounds. However, most of the tools currently available for developing QSAR models are not well suited for supporting the whole QSAR model life cycle in production environments. We have developed eTOXlab; an open source modeling framework designed to be used at the core of a self-contained virtual machine that can be easily deployed in production environments, providing predictions as web services. eTOXlab consists on a collection of object-oriented Python modules with methods mapping common tasks of standard modeling workflows. This framework allows building and validating QSAR models as well as predicting the properties of new compounds using either a command line interface or a graphic user interface (GUI). Simple models can be easily generated by setting a few parameters, while more complex models can be implemented by overriding pieces of the original source code. eTOXlab benefits from the object-oriented capabilities of Python for providing high flexibility: any model implemented using eTOXlab inherits the features implemented in the parent model, like common tools and services or the automatic exposure of the models as prediction web services. The particular eTOXlab architecture as a self-contained, portable prediction engine allows building models with confidential information within corporate facilities, which can be safely exported and used for prediction without disclosing the structures of the training series. The software presented here provides full support to the specific needs of users that want to develop, use and maintain predictive models in corporate environments. The technologies used by e

  3. In silico modeling to predict drug-induced phospholipidosis

    International Nuclear Information System (INIS)

    Choi, Sydney S.; Kim, Jae S.; Valerio, Luis G.; Sadrieh, Nakissa

    2013-01-01

    Drug-induced phospholipidosis (DIPL) is a preclinical finding during pharmaceutical drug development that has implications on the course of drug development and regulatory safety review. A principal characteristic of drugs inducing DIPL is known to be a cationic amphiphilic structure. This provides evidence for a structure-based explanation and opportunity to analyze properties and structures of drugs with the histopathologic findings for DIPL. In previous work from the FDA, in silico quantitative structure–activity relationship (QSAR) modeling using machine learning approaches has shown promise with a large dataset of drugs but included unconfirmed data as well. In this study, we report the construction and validation of a battery of complementary in silico QSAR models using the FDA's updated database on phospholipidosis, new algorithms and predictive technologies, and in particular, we address high performance with a high-confidence dataset. The results of our modeling for DIPL include rigorous external validation tests showing 80–81% concordance. Furthermore, the predictive performance characteristics include models with high sensitivity and specificity, in most cases above ≥ 80% leading to desired high negative and positive predictivity. These models are intended to be utilized for regulatory toxicology applied science needs in screening new drugs for DIPL. - Highlights: • New in silico models for predicting drug-induced phospholipidosis (DIPL) are described. • The training set data in the models is derived from the FDA's phospholipidosis database. • We find excellent predictivity values of the models based on external validation. • The models can support drug screening and regulatory decision-making on DIPL

  4. Designing of phenol-based β-carbonic anhydrase1 inhibitors through QSAR, molecular docking, and MD simulation approach.

    Science.gov (United States)

    Ahamad, Shahzaib; Hassan, Md Imtaiyaz; Dwivedi, Neeraja

    2018-05-01

    Tuberculosis (Tb) is an airborne infectious disease caused by Mycobacterium tuberculosis. Beta-carbonic anhydrase 1 ( β-CA1 ) has emerged as one of the potential targets for new antitubercular drug development. In this work, three-dimensional quantitative structure-activity relationships (3D-QSAR), molecular docking, and molecular dynamics (MD) simulation approaches were performed on a series of natural and synthetic phenol-based β-CA1 inhibitors. The developed 3D-QSAR model ( r 2  = 0.94, q 2  = 0.86, and pred_r 2  = 0.74) indicated that the steric and electrostatic factors are important parameters to modulate the bioactivity of phenolic compounds. Based on this indication, we designed 72 new phenolic inhibitors, out of which two compounds (D25 and D50) effectively stabilized β-CA1 receptor and, thus, are potential candidates for new generation antitubercular drug discovery program.

  5. QSAR Classification of ToxCast and Tox21 Chemicals on the Basis of Estrogen Receptor Assays (FutureToxII)

    Science.gov (United States)

    The ToxCast and Tox21 programs have tested ~8,200 chemicals in a broad screening panel of in vitro high-throughput screening (HTS) assays for estrogen receptor (ER) agonist and antagonist activity. The present work uses this large in vitro data set to develop in silico QSAR model...

  6. Combining molecular docking and QSAR studies for modeling the anti-tyrosinase activity of aromatic heterocycle thiosemicarbazone analogues

    Science.gov (United States)

    Dong, Huanhuan; Liu, Jing; Liu, Xiaoru; Yu, Yanying; Cao, Shuwen

    2018-01-01

    A collection of thirty-six aromatic heterocycle thiosemicarbazone analogues presented a broad span of anti-tyrosinase activities were designed and obtained. A robust and reliable two-dimensional quantitative structure-activity relationship model, as evidenced by the high q2 and r2 values (0.848 and 0.893, respectively), was gained based on the analogues to predict the quantitative chemical-biological relationship and the new modifier direction. Inhibitory activities of the compounds were found to greatly depend on molecular shape and orbital energy. Substituents brought out large ovality and high highest-occupied molecular orbital energy values helped to improve the activity of these analogues. The molecular docking results provided visual evidence for QSAR analysis and inhibition mechanism. Based on these, two novel tyrosinase inhibitors O04 and O05 with predicted IC50 of 0.5384 and 0.8752 nM were designed and suggested for further research.

  7. Modelling the effect of mixture components on permeation through skin.

    Science.gov (United States)

    Ghafourian, T; Samaras, E G; Brooks, J D; Riviere, J E

    2010-10-15

    A vehicle influences the concentration of penetrant within the membrane, affecting its diffusivity in the skin and rate of transport. Despite the huge amount of effort made for the understanding and modelling of the skin absorption of chemicals, a reliable estimation of the skin penetration potential from formulations remains a challenging objective. In this investigation, quantitative structure-activity relationship (QSAR) was employed to relate the skin permeation of compounds to the chemical properties of the mixture ingredients and the molecular structures of the penetrants. The skin permeability dataset consisted of permeability coefficients of 12 different penetrants each blended in 24 different solvent mixtures measured from finite-dose diffusion cell studies using porcine skin. Stepwise regression analysis resulted in a QSAR employing two penetrant descriptors and one solvent property. The penetrant descriptors were octanol/water partition coefficient, logP and the ninth order path molecular connectivity index, and the solvent property was the difference between boiling and melting points. The negative relationship between skin permeability coefficient and logP was attributed to the fact that most of the drugs in this particular dataset are extremely lipophilic in comparison with the compounds in the common skin permeability datasets used in QSAR. The findings show that compounds formulated in vehicles with small boiling and melting point gaps will be expected to have higher permeation through skin. The QSAR was validated internally, using a leave-many-out procedure, giving a mean absolute error of 0.396. The chemical space of the dataset was compared with that of the known skin permeability datasets and gaps were identified for future skin permeability measurements. Copyright 2010 Elsevier B.V. All rights reserved.

  8. synthesis, screening and qsar studies of 3-benzoyl-2-oxo/thioxo

    African Journals Online (AJOL)

    a

    divided into training and test sets. ... attention owing to their diverse range of biological properties such as calcium channel modulator [1] ... QSAR studies of antimicrobial activity represent an emerging and exceptionally important topic in the ...

  9. Molecular Modeling Studies of Thiophenyl C-Aryl Glucoside SGLT2 Inhibitors as Potential Antidiabetic Agents

    Directory of Open Access Journals (Sweden)

    Mukesh C. Sharma

    2014-01-01

    Full Text Available A QSAR study on thiophenyl derivatives as SGLT2 inhibitors as potential antidiabetic agents was performed with thirty-three compounds. Comparison of the obtained results indicated the superiority of the genetic algorithm over the simulated annealing and stepwise forward-backward variable method for feature selection. The best 2D QSAR model showed satisfactory statistical parameters for the data set (r2=0.8499, q2=0.8267, and pred_r2=0.7729 with four descriptors describing the nature of substituent groups and the environment of the substitution site. Evaluation of the model implied that electron-rich substitution position improves the inhibitory activity. The good predictive 3D-QSAR models by k-nearest neighbor (kNN method for molecular field analysis (MFA have cross-validated coefficient q2 value of 0.7663 and predicted r2 value of 0.7386. The results have showed that thiophenyl groups are necessary for activity and halogen, bulky, and less bulky groups in thiophenyl nucleus enhanced the biological activity. These studies are promising for the development of novel SGLT2 inhibitor, which may have potent antidiabetic activity.

  10. Prediction of Placental Barrier Permeability: A Model Based on Partial Least Squares Variable Selection Procedure

    Directory of Open Access Journals (Sweden)

    Yong-Hong Zhang

    2015-05-01

    Full Text Available Assessing the human placental barrier permeability of drugs is very important to guarantee drug safety during pregnancy. Quantitative structure–activity relationship (QSAR method was used as an effective assessing tool for the placental transfer study of drugs, while in vitro human placental perfusion is the most widely used method. In this study, the partial least squares (PLS variable selection and modeling procedure was used to pick out optimal descriptors from a pool of 620 descriptors of 65 compounds and to simultaneously develop a QSAR model between the descriptors and the placental barrier permeability expressed by the clearance indices (CI. The model was subjected to internal validation by cross-validation and y-randomization and to external validation by predicting CI values of 19 compounds. It was shown that the model developed is robust and has a good predictive potential (r2 = 0.9064, RMSE = 0.09, q2 = 0.7323, rp2 = 0.7656, RMSP = 0.14. The mechanistic interpretation of the final model was given by the high variable importance in projection values of descriptors. Using PLS procedure, we can rapidly and effectively select optimal descriptors and thus construct a model with good stability and predictability. This analysis can provide an effective tool for the high-throughput screening of the placental barrier permeability of drugs.

  11. Identification of phototransformation products of thalidomide and mixture toxicity assessment: an experimental and quantitative structural activity relationships (QSAR) approach.

    Science.gov (United States)

    Mahmoud, Waleed M M; Toolaram, Anju P; Menz, Jakob; Leder, Christoph; Schneider, Mandy; Kümmerer, Klaus

    2014-02-01

    The fate of thalidomide (TD) was investigated after irradiation with a medium-pressure Hg-lamp. The primary elimination of TD was monitored and structures of phototransformation products (PTPs) were assessed by LC-UV-FL-MS/MS. Environmentally relevant properties of TD and its PTPs as well as hydrolysis products (HTPs) were predicted using in silico QSAR models. Mutagenicity of TD and its PTPs was investigated in the Ames microplate format (MPF) aqua assay (Xenometrix, AG). Furthermore, a modified luminescent bacteria test (kinetic luminescent bacteria test (kinetic LBT)), using the luminescent bacteria species Vibrio fischeri, was applied for the initial screening of environmental toxicity. Additionally, toxicity of phthalimide, one of the identified PTPs, was investigated separately in the kinetic LBT. The UV irradiation eliminated TD itself without complete mineralization and led to the formation of several PTPs. TD and its PTPs did not exhibit mutagenic response in the Salmonella typhimurium strains TA 98, and TA 100 with and without metabolic activation. In contrast, QSAR analysis of PTPs and HTPs provided evidence for mutagenicity, genotoxicity and carcinogenicity using additional endpoints in silico software. QSAR analysis of different ecotoxicological endpoints, such as acute toxicity towards V. fischeri, provided positive alerts for several identified PTPs and HTPs. This was partially confirmed by the results of the kinetic LBT, in which a steady increase of acute and chronic toxicity during the UV-treatment procedure was observed for the photolytic mixtures at the highest tested concentration. Moreover, the number of PTPs within the reaction mixture that might be responsible for the toxification of TD during UV-treatment was successfully narrowed down by correlating the formation kinetics of PTPs with QSAR predictions and experimental toxicity data. Beyond that, further analysis of the commercially available PTP phthalimide indicated that transformation of

  12. The Human Glioblastoma Cell Culture Resource: Validated Cell Models Representing All Molecular Subtypes

    Directory of Open Access Journals (Sweden)

    Yuan Xie

    2015-10-01

    Full Text Available Glioblastoma (GBM is the most frequent and malignant form of primary brain tumor. GBM is essentially incurable and its resistance to therapy is attributed to a subpopulation of cells called glioma stem cells (GSCs. To meet the present shortage of relevant GBM cell (GC lines we developed a library of annotated and validated cell lines derived from surgical samples of GBM patients, maintained under conditions to preserve GSC characteristics. This collection, which we call the Human Glioblastoma Cell Culture (HGCC resource, consists of a biobank of 48 GC lines and an associated database containing high-resolution molecular data. We demonstrate that the HGCC lines are tumorigenic, harbor genomic lesions characteristic of GBMs, and represent all four transcriptional subtypes. The HGCC panel provides an open resource for in vitro and in vivo modeling of a large part of GBM diversity useful to both basic and translational GBM research.

  13. The use of QSAR methods for determination of n-octanol/water partition coefficient using the example of hydroxyester HE-1

    Science.gov (United States)

    Guziałowska-Tic, Joanna

    2017-10-01

    According to the Directive of the European Parliament and of the Council concerning the protection of animals used for scientific purposes, the number of experiments involving the use of animals needs to be reduced. The methods which can replace animal testing include computational prediction methods, for instance, the quantitative structure-activity relationships (QSAR). These methods are designed to find a cohesive relationship between differences in the values of the properties of molecules and the biological activity of a series of test compounds. This paper compares the results of the author's own results of examination on the n-octanol/water coefficient for the hydroxyester HE-1 with those generated by means of three models: Kowwin, MlogP, AlogP. The test results indicate that, in the case of molecular similarity, the highest determination coefficient was obtained for the model MlogP and the lowest root-mean square error was obtained for the Kowwin method. When comparing the mean logP value obtained using the QSAR models with the value resulting from the author's own experiments, it was observed that the best conformity was that recorded for the model AlogP, where relative error was 15.2%.

  14. The use of QSAR methods for determination of n-octanol/water partition coefficient using the example of hydroxyester HE-1

    Directory of Open Access Journals (Sweden)

    Guziałowska-Tic Joanna

    2017-01-01

    Full Text Available According to the Directive of the European Parliament and of the Council concerning the protection of animals used for scientific purposes, the number of experiments involving the use of animals needs to be reduced. The methods which can replace animal testing include computational prediction methods, for instance, the quantitative structure-activity relationships (QSAR. These methods are designed to find a cohesive relationship between differences in the values of the properties of molecules and the biological activity of a series of test compounds. This paper compares the results of the author's own results of examination on the n-octanol/water coefficient for the hydroxyester HE-1 with those generated by means of three models: Kowwin, MlogP, AlogP. The test results indicate that, in the case of molecular similarity, the highest determination coefficient was obtained for the model MlogP and the lowest root-mean square error was obtained for the Kowwin method. When comparing the mean logP value obtained using the QSAR models with the value resulting from the author's own experiments, it was observed that the best conformity was that recorded for the model AlogP, where relative error was 15.2%.

  15. Understanding the Molecular Determinant of Reversible Human Monoamine Oxidase B Inhibitors Containing 2H-Chromen-2-One Core: Structure-Based and Ligand-Based Derived Three-Dimensional Quantitative Structure-Activity Relationships Predictive Models.

    Science.gov (United States)

    Mladenović, Milan; Patsilinakos, Alexandros; Pirolli, Adele; Sabatino, Manuela; Ragno, Rino

    2017-04-24

    Monoamine oxidase B (MAO B) catalyzes the oxidative deamination of aryalkylamines neurotransmitters with concomitant reduction of oxygen to hydrogen peroxide. Consequently, the enzyme's malfunction can induce oxidative damage to mitochondrial DNA and mediates development of Parkinson's disease. Thus, MAO B emerges as a promising target for developing pharmaceuticals potentially useful to treat this vicious neurodegenerative condition. Aiming to contribute to the development of drugs with the reversible mechanism of MAO B inhibition only, herein, an extended in silico-in vitro procedure for the selection of novel MAO B inhibitors is demonstrated, including the following: (1) definition of optimized and validated structure-based three-dimensional (3-D) quantitative structure-activity relationships (QSAR) models derived from available cocrystallized inhibitor-MAO B complexes; (2) elaboration of SAR features for either irreversible or reversible MAO B inhibitors to characterize and improve coumarin-based inhibitor activity (Protein Data Bank ID: 2V61 ) as the most potent reversible lead compound; (3) definition of structure-based (SB) and ligand-based (LB) alignment rule assessments by which virtually any untested potential MAO B inhibitor might be evaluated; (4) predictive ability validation of the best 3-D QSAR model through SB/LB modeling of four coumarin-based external test sets (267 compounds); (5) design and SB/LB alignment of novel coumarin-based scaffolds experimentally validated through synthesis and biological evaluation in vitro. Due to the wide range of molecular diversity within the 3-D QSAR training set and derived features, the selected N probe-derived 3-D QSAR model proves to be a valuable tool for virtual screening (VS) of novel MAO B inhibitors and a platform for design, synthesis and evaluation of novel active structures. Accordingly, six highly active and selective MAO B inhibitors (picomolar to low nanomolar range of activity) were disclosed as a

  16. 3D-QSAR (CoMFA, CoMSIA), molecular docking and molecular dynamics simulations study of 6-aryl-5-cyano-pyrimidine derivatives to explore the structure requirements of LSD1 inhibitors.

    Science.gov (United States)

    Ding, Lina; Wang, Zhi-Zheng; Sun, Xu-Dong; Yang, Jing; Ma, Chao-Ya; Li, Wen; Liu, Hong-Min

    2017-08-01

    Recently, Histone Lysine Specific Demethylase 1 (LSD1) was regarded as a promising anticancer target for the novel drug discovery. And several small molecules as LSD1 inhibitors in different structures have been reported. In this work, we carried out a molecular modeling study on the 6-aryl-5-cyano-pyrimidine fragment LSD1 inhibitors using three-dimensional quantitative structure-activity relationship (3D-QSAR), molecular docking and molecular dynamics simulations. Comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were used to generate 3D-QSAR models. The results show that the best CoMFA model has q 2 =0.802, r 2 ncv =0.979, and the best CoMSIA model has q 2 =0.799, r 2 ncv =0.982. The electrostatic, hydrophobic and H-bond donor fields play important roles in the models. Molecular docking studies predict the binding mode and the interactions between the ligand and the receptor protein. Molecular dynamics simulations results reveal that the complex of the ligand and the receptor protein are stable at 300K. All the results can provide us more useful information for our further drug design. Copyright © 2017. Published by Elsevier Ltd.

  17. Ground-water models: Validate or invalidate

    Science.gov (United States)

    Bredehoeft, J.D.; Konikow, Leonard F.

    1993-01-01

    The word validation has a clear meaning to both the scientific community and the general public. Within the scientific community the validation of scientific theory has been the subject of philosophical debate. The philosopher of science, Karl Popper, argued that scientific theory cannot be validated, only invalidated. Popper’s view is not the only opinion in this debate; however, many scientists today agree with Popper (including the authors). To the general public, proclaiming that a ground-water model is validated carries with it an aura of correctness that we do not believe many of us who model would claim. We can place all the caveats we wish, but the public has its own understanding of what the word implies. Using the word valid with respect to models misleads the public; verification carries with it similar connotations as far as the public is concerned. Our point is this: using the terms validation and verification are misleading, at best. These terms should be abandoned by the ground-water community.

  18. Exploring pyrazolo[3,4-d]pyrimidine phosphodiesterase 1 (PDE1) inhibitors: a predictive approach combining comparative validated multiple molecular modelling techniques.

    Science.gov (United States)

    Amin, Sk Abdul; Bhargava, Sonam; Adhikari, Nilanjan; Gayen, Shovanlal; Jha, Tarun

    2018-02-01

    Phosphodiesterase 1 (PDE1) is a potential target for a number of neurodegenerative disorders such as Schizophrenia, Parkinson's and Alzheimer's diseases. A number of pyrazolo[3,4-d]pyrimidine PDE1 inhibitors were subjected to different molecular modelling techniques [such as regression-based quantitative structure-activity relationship (QSAR): multiple linear regression, support vector machine and artificial neural network; classification-based QSAR: Bayesian modelling and Recursive partitioning; Monte Carlo based QSAR; Open3DQSAR; pharmacophore mapping and molecular docking analyses] to get a detailed knowledge about the physicochemical and structural requirements for higher inhibitory activity. The planarity of the pyrimidinone ring plays an important role for PDE1 inhibition. The N-methylated function at the 5th position of the pyrazolo[3,4-d]pyrimidine core is required for interacting with the PDE1 enzyme. The cyclopentyl ring fused with the parent scaffold is necessary for PDE1 binding potency. The phenylamino substitution at 3rd position is crucial for PDE1 inhibition. The N2-substitution at the pyrazole moiety is important for PDE1 inhibition compared to the N1-substituted analogues. Moreover, the p-substituted benzyl side chain at N2-position helps to enhance the PDE1 inhibitory profile. Depending on these observations, some new molecules are predicted that may possess better PDE1 inhibition.

  19. Groundwater Model Validation

    Energy Technology Data Exchange (ETDEWEB)

    Ahmed E. Hassan

    2006-01-24

    Models have an inherent uncertainty. The difficulty in fully characterizing the subsurface environment makes uncertainty an integral component of groundwater flow and transport models, which dictates the need for continuous monitoring and improvement. Building and sustaining confidence in closure decisions and monitoring networks based on models of subsurface conditions require developing confidence in the models through an iterative process. The definition of model validation is postulated as a confidence building and long-term iterative process (Hassan, 2004a). Model validation should be viewed as a process not an end result. Following Hassan (2004b), an approach is proposed for the validation process of stochastic groundwater models. The approach is briefly summarized herein and detailed analyses of acceptance criteria for stochastic realizations and of using validation data to reduce input parameter uncertainty are presented and applied to two case studies. During the validation process for stochastic models, a question arises as to the sufficiency of the number of acceptable model realizations (in terms of conformity with validation data). Using a hierarchical approach to make this determination is proposed. This approach is based on computing five measures or metrics and following a decision tree to determine if a sufficient number of realizations attain satisfactory scores regarding how they represent the field data used for calibration (old) and used for validation (new). The first two of these measures are applied to hypothetical scenarios using the first case study and assuming field data consistent with the model or significantly different from the model results. In both cases it is shown how the two measures would lead to the appropriate decision about the model performance. Standard statistical tests are used to evaluate these measures with the results indicating they are appropriate measures for evaluating model realizations. The use of validation

  20. From basic physics to mechanisms of toxicity: the ``liquid drop'' approach applied to develop predictive classification models for toxicity of metal oxide nanoparticles

    Science.gov (United States)

    Sizochenko, Natalia; Rasulev, Bakhtiyor; Gajewicz, Agnieszka; Kuz'min, Victor; Puzyn, Tomasz; Leszczynski, Jerzy

    2014-10-01

    Many metal oxide nanoparticles are able to cause persistent stress to live organisms, including humans, when discharged to the environment. To understand the mechanism of metal oxide nanoparticles' toxicity and reduce the number of experiments, the development of predictive toxicity models is important. In this study, performed on a series of nanoparticles, the comparative quantitative-structure activity relationship (nano-QSAR) analyses of their toxicity towards E. coli and HaCaT cells were established. A new approach for representation of nanoparticles' structure is presented. For description of the supramolecular structure of nanoparticles the ``liquid drop'' model was applied. It is expected that a novel, proposed approach could be of general use for predictions related to nanomaterials. In addition, in our study fragmental simplex descriptors and several ligand-metal binding characteristics were calculated. The developed nano-QSAR models were validated and reliably predict the toxicity of all studied metal oxide nanoparticles. Based on the comparative analysis of contributed properties in both models the LDM-based descriptors were revealed to have an almost similar level of contribution to toxicity in both cases, while other parameters (van der Waals interactions, electronegativity and metal-ligand binding characteristics) have unequal contribution levels. In addition, the models developed here suggest different mechanisms of nanotoxicity for these two types of cells.Many metal oxide nanoparticles are able to cause persistent stress to live organisms, including humans, when discharged to the environment. To understand the mechanism of metal oxide nanoparticles' toxicity and reduce the number of experiments, the development of predictive toxicity models is important. In this study, performed on a series of nanoparticles, the comparative quantitative-structure activity relationship (nano-QSAR) analyses of their toxicity towards E. coli and HaCaT cells were

  1. Statistical validation of normal tissue complication probability models.

    Science.gov (United States)

    Xu, Cheng-Jian; van der Schaaf, Arjen; Van't Veld, Aart A; Langendijk, Johannes A; Schilstra, Cornelis

    2012-09-01

    To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. A penalized regression method, LASSO (least absolute shrinkage and selection operator), was used to build NTCP models for xerostomia after radiation therapy treatment of head-and-neck cancer. Model assessment was based on the likelihood function and the area under the receiver operating characteristic curve. Repeated double cross-validation showed the uncertainty and instability of the NTCP models and indicated that the statistical significance of model performance can be obtained by permutation testing. Repeated double cross-validation and permutation tests are recommended to validate NTCP models before clinical use. Copyright © 2012 Elsevier Inc. All rights reserved.

  2. Mathematical modeling of tetrahydroimidazole benzodiazepine-1-one derivatives as an anti HIV agent

    Science.gov (United States)

    Ojha, Lokendra Kumar

    2017-07-01

    The goal of the present work is the study of drug receptor interaction via QSAR (Quantitative Structure-Activity Relationship) analysis for 89 set of TIBO (Tetrahydroimidazole Benzodiazepine-1-one) derivatives. MLR (Multiple Linear Regression) method is utilized to generate predictive models of quantitative structure-activity relationships between a set of molecular descriptors and biological activity (IC50). The best QSAR model was selected having a correlation coefficient (r) of 0.9299 and Standard Error of Estimation (SEE) of 0.5022, Fisher Ratio (F) of 159.822 and Quality factor (Q) of 1.852. This model is statistically significant and strongly favours the substitution of sulphur atom, IS i.e. indicator parameter for -Z position of the TIBO derivatives. Two other parameter logP (octanol-water partition coefficient) and SAG (Surface Area Grid) also played a vital role in the generation of best QSAR model. All three descriptor shows very good stability towards data variation in leave-one-out (LOO).

  3. Evaluation of Novel Dual Acetyl- and Butyrylcholinesterase Inhibitors as Potential Anti-Alzheimer’s Disease Agents Using Pharmacophore, 3D-QSAR, and Molecular Docking Approaches

    Directory of Open Access Journals (Sweden)

    Xiaocong Pang

    2017-07-01

    Full Text Available DL0410, containing biphenyl and piperidine skeletons, was identified as an acetylcholinesterase (AChE and butyrylcholinesterase (BuChE inhibitor through high-throughput screening assays, and further studies affirmed its efficacy and safety for Alzheimer’s disease treatment. In our study, a series of novel DL0410 derivatives were evaluated for inhibitory activities towards AChE and BuChE. Among these derivatives, compounds 6-1 and 7-6 showed stronger AChE and BuChE inhibitory activities than DL0410. Then, pharmacophore modeling and three-dimensional quantitative structure activity relationship (3D-QSAR models were performed. The R2 of AChE and BuChE 3D-QSAR models for training set were found to be 0.925 and 0.883, while that of the test set were 0.850 and 0.881, respectively. Next, molecular docking methods were utilized to explore the putative binding modes. Compounds 6-1 and 7-6 could interact with the amino acid residues in the catalytic anionic site (CAS and peripheral anionic site (PAS of AChE/BuChE, which was similar with DL0410. Kinetics studies also suggested that the three compounds were all mixed-types of inhibitors. In addition, compound 6-1 showed better absorption and blood brain barrier permeability. These studies provide better insight into the inhibitory behaviors of DL0410 derivatives, which is beneficial for rational design of AChE and BuChE inhibitors in the future.

  4. Bond-based linear indices in QSAR: computational discovery of novel anti-trichomonal compounds

    Science.gov (United States)

    Marrero-Ponce, Yovani; Meneses-Marcel, Alfredo; Rivera-Borroto, Oscar M.; García-Domenech, Ramón; De Julián-Ortiz, Jesus Vicente; Montero, Alina; Escario, José Antonio; Barrio, Alicia Gómez; Pereira, David Montero; Nogal, Juan José; Grau, Ricardo; Torrens, Francisco; Vogel, Christian; Arán, Vicente J.

    2008-08-01

    Trichomonas vaginalis ( Tv) is the causative agent of the most common, non-viral, sexually transmitted disease in women and men worldwide. Since 1959, metronidazole (MTZ) has been the drug of choice in the systemic treatment of trichomoniasis. However, resistance to MTZ in some patients and the great cost associated with the development of new trichomonacidals make necessary the development of computational methods that shorten the drug discovery pipeline. Toward this end, bond-based linear indices, new TOMOCOMD-CARDD molecular descriptors, and linear discriminant analysis were used to discover novel trichomonacidal chemicals. The obtained models, using non-stochastic and stochastic indices, are able to classify correctly 89.01% (87.50%) and 82.42% (84.38%) of the chemicals in the training (test) sets, respectively. These results validate the models for their use in the ligand-based virtual screening. In addition, they show large Matthews' correlation coefficients ( C) of 0.78 (0.71) and 0.65 (0.65) for the training (test) sets, correspondingly. The result of predictions on the 10% full-out cross-validation test also evidences the robustness of the obtained models. Later, both models are applied to the virtual screening of 12 compounds already proved against Tv. As a result, they correctly classify 10 out of 12 (83.33%) and 9 out of 12 (75.00%) of the chemicals, respectively; which is the most important criterion for validating the models. Besides, these classification functions are applied to a library of seven chemicals in order to find novel antitrichomonal agents. These compounds are synthesized and tested for in vitro activity against Tv. As a result, experimental observations approached to theoretical predictions, since it was obtained a correct classification of 85.71% (6 out of 7) of the chemicals. Moreover, out of the seven compounds that are screened, synthesized and biologically assayed, six compounds (VA7-34, VA7-35, VA7-37, VA7-38, VA7-68, VA7-70) show

  5. Model Validation Status Review

    International Nuclear Information System (INIS)

    E.L. Hardin

    2001-01-01

    The primary objective for the Model Validation Status Review was to perform a one-time evaluation of model validation associated with the analysis/model reports (AMRs) containing model input to total-system performance assessment (TSPA) for the Yucca Mountain site recommendation (SR). This review was performed in response to Corrective Action Request BSC-01-C-01 (Clark 2001, Krisha 2001) pursuant to Quality Assurance review findings of an adverse trend in model validation deficiency. The review findings in this report provide the following information which defines the extent of model validation deficiency and the corrective action needed: (1) AMRs that contain or support models are identified, and conversely, for each model the supporting documentation is identified. (2) The use for each model is determined based on whether the output is used directly for TSPA-SR, or for screening (exclusion) of features, events, and processes (FEPs), and the nature of the model output. (3) Two approaches are used to evaluate the extent to which the validation for each model is compliant with AP-3.10Q (Analyses and Models). The approaches differ in regard to whether model validation is achieved within individual AMRs as originally intended, or whether model validation could be readily achieved by incorporating information from other sources. (4) Recommendations are presented for changes to the AMRs, and additional model development activities or data collection, that will remedy model validation review findings, in support of licensing activities. The Model Validation Status Review emphasized those AMRs that support TSPA-SR (CRWMS M and O 2000bl and 2000bm). A series of workshops and teleconferences was held to discuss and integrate the review findings. The review encompassed 125 AMRs (Table 1) plus certain other supporting documents and data needed to assess model validity. The AMRs were grouped in 21 model areas representing the modeling of processes affecting the natural and

  6. Model Validation Status Review

    Energy Technology Data Exchange (ETDEWEB)

    E.L. Hardin

    2001-11-28

    The primary objective for the Model Validation Status Review was to perform a one-time evaluation of model validation associated with the analysis/model reports (AMRs) containing model input to total-system performance assessment (TSPA) for the Yucca Mountain site recommendation (SR). This review was performed in response to Corrective Action Request BSC-01-C-01 (Clark 2001, Krisha 2001) pursuant to Quality Assurance review findings of an adverse trend in model validation deficiency. The review findings in this report provide the following information which defines the extent of model validation deficiency and the corrective action needed: (1) AMRs that contain or support models are identified, and conversely, for each model the supporting documentation is identified. (2) The use for each model is determined based on whether the output is used directly for TSPA-SR, or for screening (exclusion) of features, events, and processes (FEPs), and the nature of the model output. (3) Two approaches are used to evaluate the extent to which the validation for each model is compliant with AP-3.10Q (Analyses and Models). The approaches differ in regard to whether model validation is achieved within individual AMRs as originally intended, or whether model validation could be readily achieved by incorporating information from other sources. (4) Recommendations are presented for changes to the AMRs, and additional model development activities or data collection, that will remedy model validation review findings, in support of licensing activities. The Model Validation Status Review emphasized those AMRs that support TSPA-SR (CRWMS M&O 2000bl and 2000bm). A series of workshops and teleconferences was held to discuss and integrate the review findings. The review encompassed 125 AMRs (Table 1) plus certain other supporting documents and data needed to assess model validity. The AMRs were grouped in 21 model areas representing the modeling of processes affecting the natural and

  7. Discrete Fourier Transform-Based Multivariate Image Analysis: Application to Modeling of Aromatase Inhibitory Activity.

    Science.gov (United States)

    Barigye, Stephen J; Freitas, Matheus P; Ausina, Priscila; Zancan, Patricia; Sola-Penna, Mauro; Castillo-Garit, Juan A

    2018-02-12

    We recently generalized the formerly alignment-dependent multivariate image analysis applied to quantitative structure-activity relationships (MIA-QSAR) method through the application of the discrete Fourier transform (DFT), allowing for its application to noncongruent and structurally diverse chemical compound data sets. Here we report the first practical application of this method in the screening of molecular entities of therapeutic interest, with human aromatase inhibitory activity as the case study. We developed an ensemble classification model based on the two-dimensional (2D) DFT MIA-QSAR descriptors, with which we screened the NCI Diversity Set V (1593 compounds) and obtained 34 chemical compounds with possible aromatase inhibitory activity. These compounds were docked into the aromatase active site, and the 10 most promising compounds were selected for in vitro experimental validation. Of these compounds, 7419 (nonsteroidal) and 89 201 (steroidal) demonstrated satisfactory antiproliferative and aromatase inhibitory activities. The obtained results suggest that the 2D-DFT MIA-QSAR method may be useful in ligand-based virtual screening of new molecular entities of therapeutic utility.

  8. Improving the applicability of (Q)SARs for percutaneous penetration in regulatory risk assessment.

    NARCIS (Netherlands)

    Bouwman, T.; Cronin, M.T.; Bessems, J.G.; Sandt, J.J. van de

    2008-01-01

    The new regulatory framework REACH (Registration, Evaluation, and Authorisation of Chemicals) foresees the use of non-testing approaches, such as read-across, chemical categories, structure-activity relationships (SARs) and quantitative structure-activity relationships (QSARs). Although information

  9. 3D-QSAR, molecular docking, and molecular dynamic simulations for prediction of new Hsp90 inhibitors based on isoxazole scaffold.

    Science.gov (United States)

    Abbasi, Maryam; Sadeghi-Aliabadi, Hojjat; Amanlou, Massoud

    2018-05-01

    Heat shock protein 90(Hsp90), as a molecular chaperone, play a crucial role in folding and proper function of many proteins. Hsp90 inhibitors containing isoxazole scaffold are currently being used in the treatment of cancer as tumor suppressers. Here in the present studies, new compounds based on isoxazole scaffold were predicted using a combination of molecular modeling techniques including three-dimensional quantitative structure-activity relationship (3D-QSAR), molecular docking and molecular dynamic (MD) simulations. Comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were also done. The steric and electrostatic contour map of CoMFA and CoMSIA were created. Hydrophobic, hydrogen bond donor and acceptor of CoMSIA model also were generated, and new compounds were predicted by CoMFA and CoMSIA contour maps. To investigate the binding modes of the predicted compounds in the active site of Hsp90, a molecular docking simulation was carried out. MD simulations were also conducted to evaluate the obtained results on the best predicted compound and the best reported Hsp90 inhibitors in the 3D-QSAR model. Findings indicate that the predicted ligands were stable in the active site of Hsp90.

  10. A physically interpretable quantum-theoretic QSAR for some carbonic anhydrase inhibitors with diverse aromatic rings, obtained by a new QSAR procedure.

    Science.gov (United States)

    Clare, Brian W; Supuran, Claudiu T

    2005-03-15

    A QSAR based almost entirely on quantum theoretically calculated descriptors has been developed for a large and heterogeneous group of aromatic and heteroaromatic carbonic anhydrase inhibitors, using orbital energies, nodal angles, atomic charges, and some other intuitively appealing descriptors. Most calculations have been done at the B3LYP/6-31G* level of theory. For the first time we have treated five-membered rings by the same means that we have used for benzene rings in the past. Our flip regression technique has been expanded to encompass automatic variable selection. The statistical quality of the results, while not equal to those we have had with benzene derivatives, is very good considering the noncongeneric nature of the compounds. The most significant correlation was with charge on the atoms of the sulfonamide group, followed by the nodal orientation and the solvation energy calculated by COSMO and the charge polarization of the molecule calculated as the mean absolute Mulliken charge over all atoms.

  11. Validation of HEDR models

    International Nuclear Information System (INIS)

    Napier, B.A.; Simpson, J.C.; Eslinger, P.W.; Ramsdell, J.V. Jr.; Thiede, M.E.; Walters, W.H.

    1994-05-01

    The Hanford Environmental Dose Reconstruction (HEDR) Project has developed a set of computer models for estimating the possible radiation doses that individuals may have received from past Hanford Site operations. This document describes the validation of these models. In the HEDR Project, the model validation exercise consisted of comparing computational model estimates with limited historical field measurements and experimental measurements that are independent of those used to develop the models. The results of any one test do not mean that a model is valid. Rather, the collection of tests together provide a level of confidence that the HEDR models are valid

  12. 3D-QSAR Investigation of Synthetic Antioxidant Chromone Derivatives by Molecular Field Analysis

    Directory of Open Access Journals (Sweden)

    Jiraporn Ungwitayatorn

    2008-02-01

    Full Text Available A series of 7-hydroxy, 8-hydroxy and 7,8-dihydroxy synthetic chromone derivatives was evaluated for their DPPH free radical scavenging activities. A training set of 30 synthetic chromone derivatives was subject to three-dimensional quantitative structure-activity relationship (3D-QSAR studies using molecular field analysis (MFA. The substitutional requirements for favorable antioxidant activity were investigated and a predictive model that could be used for the design of novel antioxidants was derived. Regression analysis was carried out using genetic partial least squares (G/PLS method. A highly predictive and statistically significant model was generated. The predictive ability of the developed model was assessed using a test set of 5 compounds (r2pred = 0.924. The analyzed MFA model demonstrated a good fit, having r2 value of 0.868 and crossvalidated coefficient r2cv value of 0.771.

  13. QSAR, QSPR and QSRR in Terms of 3-D-MoRSE Descriptors for In Silico Screening of Clofibric Acid Analogues.

    Science.gov (United States)

    Di Tullio, Maurizio; Maccallini, Cristina; Ammazzalorso, Alessandra; Giampietro, Letizia; Amoroso, Rosa; De Filippis, Barbara; Fantacuzzi, Marialuigia; Wiczling, Paweł; Kaliszan, Roman

    2012-07-01

    A series of 27 analogues of clofibric acid, mostly heteroarylalkanoic derivatives, have been analyzed by a novel high-throughput reversed-phase HPLC method employing combined gradient of eluent's pH and organic modifier content. The such determined hydrophobicity (lipophilicity) parameters, log kw , and acidity constants, pKa , were subjected to multiple regression analysis to get a QSRR (Quantitative StructureRetention Relationships) and a QSPR (Quantitative Structure-Property Relationships) equation, respectively, describing these pharmacokinetics-determining physicochemical parameters in terms of the calculation chemistry derived structural descriptors. The previously determined in vitro log EC50 values - transactivation activity towards PPARα (human Peroxisome Proliferator-Activated Receptor α) - have also been described in a QSAR (Quantitative StructureActivity Relationships) equation in terms of the 3-D-MoRSE descriptors (3D-Molecule Representation of Structures based on Electron diffraction descriptors). The QSAR model derived can serve for an a priori prediction of bioactivity in vitro of any designed analogue, whereas the QSRR and the QSPR models can be used to evaluate lipophilicity and acidity, respectively, of the compounds, and hence to rational guide selection of structures of proper pharmacokinetics. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. MolProbity: all-atom structure validation for macromolecular crystallography

    International Nuclear Information System (INIS)

    Chen, Vincent B.; Arendall, W. Bryan III; Headd, Jeffrey J.; Keedy, Daniel A.; Immormino, Robert M.; Kapral, Gary J.; Murray, Laura W.; Richardson, Jane S.; Richardson, David C.

    2010-01-01

    MolProbity structure validation will diagnose most local errors in macromolecular crystal structures and help to guide their correction. MolProbity is a structure-validation web service that provides broad-spectrum solidly based evaluation of model quality at both the global and local levels for both proteins and nucleic acids. It relies heavily on the power and sensitivity provided by optimized hydrogen placement and all-atom contact analysis, complemented by updated versions of covalent-geometry and torsion-angle criteria. Some of the local corrections can be performed automatically in MolProbity and all of the diagnostics are presented in chart and graphical forms that help guide manual rebuilding. X-ray crystallography provides a wealth of biologically important molecular data in the form of atomic three-dimensional structures of proteins, nucleic acids and increasingly large complexes in multiple forms and states. Advances in automation, in everything from crystallization to data collection to phasing to model building to refinement, have made solving a structure using crystallography easier than ever. However, despite these improvements, local errors that can affect biological interpretation are widespread at low resolution and even high-resolution structures nearly all contain at least a few local errors such as Ramachandran outliers, flipped branched protein side chains and incorrect sugar puckers. It is critical both for the crystallographer and for the end user that there are easy and reliable methods to diagnose and correct these sorts of errors in structures. MolProbity is the authors’ contribution to helping solve this problem and this article reviews its general capabilities, reports on recent enhancements and usage, and presents evidence that the resulting improvements are now beneficially affecting the global database

  15. Synthesis, characterization, crystal structures, QSAR study and antibacterial activities of organotin bisphosphoramidates

    Czech Academy of Sciences Publication Activity Database

    Gholivand, K.; Valmoozi, A.A.E.; Gholami, A.; Dušek, Michal; Eigner, Václav; Abolghasemi, S.

    2016-01-01

    Roč. 806, Mar (2016), s. 33-44 ISSN 0022-328X R&D Projects: GA ČR GA15-12653S Institutional support: RVO:68378271 Keywords : bisphosphoramidate * organotin compounds * crystal structure * antibacterial activity * QSAR Subject RIV: BM - Solid Matter Physics ; Magnetism Impact factor: 2.184, year: 2016

  16. Three-dimensional all-speed CFD code for safety analysis of nuclear reactor containment: Status of GASFLOW parallelization, model development, validation and application

    Energy Technology Data Exchange (ETDEWEB)

    Xiao, Jianjun, E-mail: jianjun.xiao@kit.edu [Institute of Nuclear and Energy Technologies, Karlsruhe Institute of Technology, P.O. Box 3640, 76021 Karlsruhe (Germany); Travis, John R., E-mail: jack_travis@comcast.com [Engineering and Scientific Software Inc., 3010 Old Pecos Trail, Santa Fe, NM 87505 (United States); Royl, Peter, E-mail: peter.royl@partner.kit.edu [Institute of Nuclear and Energy Technologies, Karlsruhe Institute of Technology, P.O. Box 3640, 76021 Karlsruhe (Germany); Necker, Gottfried, E-mail: gottfried.necker@partner.kit.edu [Institute of Nuclear and Energy Technologies, Karlsruhe Institute of Technology, P.O. Box 3640, 76021 Karlsruhe (Germany); Svishchev, Anatoly, E-mail: anatoly.svishchev@kit.edu [Institute of Nuclear and Energy Technologies, Karlsruhe Institute of Technology, P.O. Box 3640, 76021 Karlsruhe (Germany); Jordan, Thomas, E-mail: thomas.jordan@kit.edu [Institute of Nuclear and Energy Technologies, Karlsruhe Institute of Technology, P.O. Box 3640, 76021 Karlsruhe (Germany)

    2016-05-15

    Highlights: • 3-D scalable semi-implicit pressure-based CFD code for containment safety analysis. • Robust solution algorithm valid for all-speed flows. • Well validated and widely used CFD code for hydrogen safety analysis. • Code applied in various types of nuclear reactor containments. • Parallelization enables high-fidelity models in large scale containment simulations. - Abstract: GASFLOW is a three dimensional semi-implicit all-speed CFD code which can be used to predict fluid dynamics, chemical kinetics, heat and mass transfer, aerosol transportation and other related phenomena involved in postulated accidents in nuclear reactor containments. The main purpose of the paper is to give a brief review on recent GASFLOW code development, validations and applications in the field of nuclear safety. GASFLOW code has been well validated by international experimental benchmarks, and has been widely applied to hydrogen safety analysis in various types of nuclear power plants in European and Asian countries, which have been summarized in this paper. Furthermore, four benchmark tests of a lid-driven cavity flow, low Mach number jet flow, 1-D shock tube and supersonic flow over a forward-facing step are presented in order to demonstrate the accuracy and wide-ranging capability of ICE’d ALE solution algorithm for all-speed flows. GASFLOW has been successfully parallelized using the paradigms of Message Passing Interface (MPI) and domain decomposition. The parallel version, GASFLOW-MPI, adds great value to large scale containment simulations by enabling high-fidelity models, including more geometric details and more complex physics. It will be helpful for the nuclear safety engineers to better understand the hydrogen safety related physical phenomena during the severe accident, to optimize the design of the hydrogen risk mitigation systems and to fulfill the licensing requirements by the nuclear regulatory authorities. GASFLOW-MPI is targeting a high

  17. Synthesis, Structural Studies and Biological Evaluation of Connections of Thiosemicarbazide, 1,2,4-Triazole and 1,3,4-Thiadiazole with Palmitic Acid.

    Science.gov (United States)

    Jóźwiak, Michał; Stępień, Karolina; Wrzosek, Małgorzata; Olejarz, Wioletta; Kubiak-Tomaszewska, Grażyna; Filipowska, Anna; Filipowski, Wojciech; Struga, Marta

    2018-04-03

    Thirty new derivatives of palmitic acid were efficiently synthesized. All obtained compounds can be divided into three groups of derivatives: Thiosemicarbazides (compounds 1 - 10 ), 1,2,4-triazoles (compounds 1a - 10a ) and 1,3,4-thiadiazoles (compounds 1b - 10b ) moieties. ¹H-NMR, 13 C-NMR and MS methods were used to confirm the structure of derivatives. All obtained compounds were tested in vitro against a number of microorganisms, including Gram-positive cocci, Gram-negative rods and Candida albicans . Compounds 4 , 5 , 6 , 8 showed significant inhibition against C. albicans . The range of MIC values was 50-1.56 μg/mL. The halogen atom, especially at the 3rd position of the phenyl group was significantly important for antifungal activity. The biological activity against Candida albicans and selected molecular descriptors were used as a basis for QSAR models, that have been determined by means of multiple linear regression. The models have been validated by means of the Leave-One-Out Cross Validation. The obtained QSAR models were characterized by high determination coefficients and good prediction power.

  18. 3D QSAR Studies of DAMNI Analogs as Possible Non-nucleoside Reverse Transcriptase Inhibitors

    Directory of Open Access Journals (Sweden)

    S. Ganguly

    2008-01-01

    Full Text Available The non-nucleoside inhibitors of HIV-1-reverse transcriptase (NNRTIs are an important class of drugs employed in antiviral therapy. Recently, a novel family of NNRTIs commonly referred to as 1-[2-diarylmethoxy] ethyl 2-methyl-5-nitroimidazoles (DAMNI derivatives have been discovered. The 3D-QSAR studies on DAMNI derivatives as NNRTIs was performed by comparative molecular field analysis (CoMFA and comparative molecular similarity indices analysis (CoMSIA methods to determine the factors required for the activity of these compounds. The global minimum energy conformer of the template molecule 15, the most active molecule of the series, was obtained by simulated annealing method and used to build the structures of the molecules in the dataset. The combination of steric and electrostatic fields in CoMSIA gave the best results with cross-validated and conventional correlation coefficients of 0.654 and 0.928 respectively. The predictive ability of CoMFA and CoMSIA were determined using a test set of ten DAMNI derivatives giving predictive correlation coefficients of 0.92 and 0.98 respectively indicating good predictive power. Further, the robustness of the models was verified by bootstrapping analysis. The information obtained from CoMFA and CoMSIA 3D contour maps may be of utility in the design of more potent DAMNI analogs as NNRTIs in future.

  19. A nonlinear QSAR study using oscillating search and SVM as an efficient algorithm to model the inhibition of reverse transcriptase by HEPT derivatives

    International Nuclear Information System (INIS)

    Ferkous, F.; Saihi, Y.

    2018-01-01

    Quantitative structure-activity relationships were constructed for 107 inhibitors of HIV-1 reverse transcriptase that are derivatives of 1-[(2-hydroxyethoxy)methyl]-6-(phenylthio)thymine (HEPT). A combination of a support vector machine (SVM) and oscillating search (OS) algorithms for feature selection was adopted to select the most appropriate descriptors. The application was optimized to obtain an SVM model to predict the biological activity EC50 of the HEPT derivatives with a minimum number of descriptors (SpMax4 B h (e) MLOGP MATS5m) and high values of R2 and Q2 (0.8662, 0.8769). The statistical results showed good correlation between the activity and three best descriptors were included in the best SVM model. The values of R2 and Q2 confirmed the stability and good predictive ability of the model. The SVM technique was adequate to produce an effective QSAR model and outperformed those in the literature and the predictive stages for the inhibitory activity of reverse transcriptase by HEPT derivatives. (author)

  20. QSAR study of benzimidazole derivatives inhibition on escherichia coli methionine Aminopeptidase

    Directory of Open Access Journals (Sweden)

    Zahra Garkani-Nejad

    2010-06-01

    Full Text Available The paper describes a quantitative structure-activity relationship (QSAR study of IC50 values of benzimidazole derivatives on escherichia coli methionine aminopeptidase. The activity of the 32 inhibitors has been estimated by means of multiple linear regression (MLR and artificial neural network (ANN techniques. The results obtained using the MLR method indicate that the activity of derivatives of benzimidazoles on CoII-loaded escherichia coli methionine aminopeptidase depend on different parameters containing topological descriptors, Burden eigen values, 3D MoRSE descriptors and 2D autocorrelation descriptors. The best artificial neural network model is a fully-connected, feed forward back propagation network with a 5-4-1 architecture. Standard error for the training set using this network was 0.193 with correlation coefficient 0.996 and for the prediction set standard error was 1.41 with correlation coefficient 0.802. Comparison of the quality of the ANN with different MLR models showed that ANN has a better predictive power.

  1. Building on a solid foundation: SAR and QSAR as a fundamental strategy to reduce animal testing.

    Science.gov (United States)

    Sullivan, K M; Manuppello, J R; Willett, C E

    2014-01-01

    The development of more efficient, ethical, and effective means of assessing the effects of chemicals on human health and the environment was a lifetime goal of Gilman Veith. His work has provided the foundation for the use of chemical structure for informing toxicological assessment by regulatory agencies the world over. Veith's scientific work influenced the early development of the SAR models in use today at the US Environmental Protection Agency. He was the driving force behind the Organisation for Economic Co-operation and Development QSAR Toolbox. Veith was one of a few early pioneers whose vision led to the linkage of chemical structure and biological activity as a means of predicting adverse apical outcomes (known as a mode of action, or an adverse outcome pathway approach), and he understood at an early stage the power that could be harnessed when combining computational and mechanistic biological approaches as a means of avoiding animal testing. Through the International QSAR Foundation he organized like-minded experts to develop non-animal methods and frameworks for the assessment of chemical hazard and risk for the benefit of public and environmental health. Avoiding animal testing was Gil's passion, and his work helped to initiate the paradigm shift in toxicology that is now rendering this feasible.

  2. The influence of R and S configurations of a series of amphetamine derivatives on quantitative structure–activity relationship models

    International Nuclear Information System (INIS)

    Fresqui, Maíra A.C.; Ferreira, Márcia M.C.; Trsic, Milan

    2013-01-01

    Highlights: ► The QSAR model is not dependent of ligand conformation. ► Amphetamines were analyzed by quantum chemical, steric and hydrophobic descriptors. ► CHELPG atomic charges on the benzene ring are one of the most important descriptors. ► The PLS models built were extensively validated. ► Manual docking supports the QSAR results by pi–pi stacking interactions. - Abstract: Chiral molecules need special attention in drug design. In this sense, the R and S configurations of a series of thirty-four amphetamines were evaluated by quantitative structure–activity relationship (QSAR). This class of compounds has antidepressant, anti-Parkinson and anti-Alzheimer effects against the enzyme monoamine oxidase A (MAO A). A set of thirty-eight descriptors, including electronic, steric and hydrophobic ones, were calculated. Variable selection was performed through the correlation coefficients followed by the ordered predictor selection (OPS) algorithm. Six descriptors (CHELPG atomic charges C3, C4 and C5, electrophilicity, molecular surface area and log P) were selected for both configurations and a satisfactory model was obtained by PLS regression with three latent variables with R 2 = 0.73 and Q 2 = 0.60, with external predictability Q 2 = 0.68, and R 2 = 0.76 and Q 2 = 0.67 with external predictability Q 2 = 0.50, for R and S configurations, respectively. To confirm the robustness of each model, leave-N-out cross validation (LNO) was carried out and the y-randomization test was used to check if these models present chance correlation. Moreover, both automated or a manual molecular docking indicate that the reaction of ligands with the enzyme occurs via pi–pi stacking interaction with Tyr407, inclined face-to-face interaction with Tyr444, while aromatic hydrogen–hydrogen interactions with Tyr197 are preferable for R instead of S configurations.

  3. Insights on Cytochrome P450 Enzymes and Inhibitors Obtained Through QSAR Studies

    Directory of Open Access Journals (Sweden)

    Maryam Foroozesh

    2012-08-01

    Full Text Available The cytochrome P450 (CYP superfamily of heme enzymes play an important role in the metabolism of a large number of endogenous and exogenous compounds, including most of the drugs currently on the market. Inhibitors of CYP enzymes have important roles in the treatment of several disease conditions such as numerous cancers and fungal infections in addition to their critical role in drug-drug interactions. Structure activity relationships (SAR, and three-dimensional quantitative structure activity relationships (3D-QSAR represent important tools in understanding the interactions of the inhibitors with the active sites of the CYP enzymes. A comprehensive account of the QSAR studies on the major human CYPs 1A1, 1A2, 1B1, 2A6, 2B6, 2C9, 2C19, 2D6, 2E1, 3A4 and a few other CYPs are detailed in this review which will provide us with an insight into the individual/common characteristics of the active sites of these enzymes and the enzyme-inhibitor interactions.

  4. Docking based 3d-QSAR studies applied at the BRAF inhibitors to understand the binding mechanism

    International Nuclear Information System (INIS)

    Mahmood, U.; Haq, Z.U.

    2011-01-01

    BRAF is a great therapeutic target in a wide variety of human cancers. It is the member of Ras Activating Factor (RAF) family of serine/throenine kinase. The mutated form of the BRAF has diverted all the attention towards itself because of increase severity and elevated kinase activity. The RAF signal transduction cascade is a conserved protein pathway that is involved in cell cycle progression and apoptosis. The ERK regulates phosphorylation of different proteins either in cytosol or in nucleus but disorders in ERK signaling pathway cause mutation in BRAF. This cascade in these cells may provide selection of mutated BRAF in which valine is substituted with glutamatic acid at position 600. This mutation occurs in activation loop. A number of inhibitors reported to target different members of RAF, some of them have potential to target the BRAF as well. Major reason for failure of previously reported inhibitors was due to the highly conserved sequence and confirmation of catalytic cleft which is always a center of consideration for binding of inhibitors to suppress the kinase activity. This is the first attempt to study and understand the BARF inhibitors - protein interactions in detail by utilizing 3D-QSAR and molecular docking techniques. Most reliable techniques of 3D QSAR i.e Comparative Molecular Field Analysis (CoMFA) and Comparative Molecular Similarity Indices Analysis (CoMSIA) were applied for three different data sets. The data sets selected for better evaluation of BRAF inhibitors belongs to 2, 6-Disubstituted Pyrazine, Pyridoimidazolones and its derivatives. Our models would offer help to better understand the structure-activity relationships that exist for these classes of compounds and also facilitate the design of novel inhibitors with good chemical diversity. (Author)

  5. A proposed best practice model validation framework for banks

    Directory of Open Access Journals (Sweden)

    Pieter J. (Riaan de Jongh

    2017-06-01

    Full Text Available Background: With the increasing use of complex quantitative models in applications throughout the financial world, model risk has become a major concern. The credit crisis of 2008–2009 provoked added concern about the use of models in finance. Measuring and managing model risk has subsequently come under scrutiny from regulators, supervisors, banks and other financial institutions. Regulatory guidance indicates that meticulous monitoring of all phases of model development and implementation is required to mitigate this risk. Considerable resources must be mobilised for this purpose. The exercise must embrace model development, assembly, implementation, validation and effective governance. Setting: Model validation practices are generally patchy, disparate and sometimes contradictory, and although the Basel Accord and some regulatory authorities have attempted to establish guiding principles, no definite set of global standards exists. Aim: Assessing the available literature for the best validation practices. Methods: This comprehensive literature study provided a background to the complexities of effective model management and focussed on model validation as a component of model risk management. Results: We propose a coherent ‘best practice’ framework for model validation. Scorecard tools are also presented to evaluate if the proposed best practice model validation framework has been adequately assembled and implemented. Conclusion: The proposed best practice model validation framework is designed to assist firms in the construction of an effective, robust and fully compliant model validation programme and comprises three principal elements: model validation governance, policy and process.

  6. MIANN models in medicinal, physical and organic chemistry.

    Science.gov (United States)

    González-Díaz, Humberto; Arrasate, Sonia; Sotomayor, Nuria; Lete, Esther; Munteanu, Cristian R; Pazos, Alejandro; Besada-Porto, Lina; Ruso, Juan M

    2013-01-01

    Reducing costs in terms of time, animal sacrifice, and material resources with computational methods has become a promising goal in Medicinal, Biological, Physical and Organic Chemistry. There are many computational techniques that can be used in this sense. In any case, almost all these methods focus on few fundamental aspects including: type (1) methods to quantify the molecular structure, type (2) methods to link the structure with the biological activity, and others. In particular, MARCH-INSIDE (MI), acronym for Markov Chain Invariants for Networks Simulation and Design, is a well-known method for QSAR analysis useful in step (1). In addition, the bio-inspired Artificial-Intelligence (AI) algorithms called Artificial Neural Networks (ANNs) are among the most powerful type (2) methods. We can combine MI with ANNs in order to seek QSAR models, a strategy which is called herein MIANN (MI & ANN models). One of the first applications of the MIANN strategy was in the development of new QSAR models for drug discovery. MIANN strategy has been expanded to the QSAR study of proteins, protein-drug interactions, and protein-protein interaction networks. In this paper, we review for the first time many interesting aspects of the MIANN strategy including theoretical basis, implementation in web servers, and examples of applications in Medicinal and Biological chemistry. We also report new applications of the MIANN strategy in Medicinal chemistry and the first examples in Physical and Organic Chemistry, as well. In so doing, we developed new MIANN models for several self-assembly physicochemical properties of surfactants and large reaction networks in organic synthesis. In some of the new examples we also present experimental results which were not published up to date.

  7. Validating EHR clinical models using ontology patterns.

    Science.gov (United States)

    Martínez-Costa, Catalina; Schulz, Stefan

    2017-12-01

    Clinical models are artefacts that specify how information is structured in electronic health records (EHRs). However, the makeup of clinical models is not guided by any formal constraint beyond a semantically vague information model. We address this gap by advocating ontology design patterns as a mechanism that makes the semantics of clinical models explicit. This paper demonstrates how ontology design patterns can validate existing clinical models using SHACL. Based on the Clinical Information Modelling Initiative (CIMI), we show how ontology patterns detect both modeling and terminology binding errors in CIMI models. SHACL, a W3C constraint language for the validation of RDF graphs, builds on the concept of "Shape", a description of data in terms of expected cardinalities, datatypes and other restrictions. SHACL, as opposed to OWL, subscribes to the Closed World Assumption (CWA) and is therefore more suitable for the validation of clinical models. We have demonstrated the feasibility of the approach by manually describing the correspondences between six CIMI clinical models represented in RDF and two SHACL ontology design patterns. Using a Java-based SHACL implementation, we found at least eleven modeling and binding errors within these CIMI models. This demonstrates the usefulness of ontology design patterns not only as a modeling tool but also as a tool for validation. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Quantitative structure–activity relationship model for amino acids as corrosion inhibitors based on the support vector machine and molecular design

    International Nuclear Information System (INIS)

    Zhao, Hongxia; Zhang, Xiuhui; Ji, Lin; Hu, Haixiang; Li, Qianshu

    2014-01-01

    Highlights: • Nonlinear quantitative structure–activity relationship (QSAR) model was built by the support vector machine. • Descriptors for QSAR model were selected by principal component analysis. • Binding energy was taken as one of the descriptors for QSAR model. • Acidic solution and protonation of the inhibitor were considered. - Abstract: The inhibition performance of nineteen amino acids was studied by theoretical methods. The affection of acidic solution and protonation of inhibitor were considered in molecular dynamics simulation and the results indicated that the protonated amino-group was not adsorbed on Fe (1 1 0) surface. Additionally, a nonlinear quantitative structure–activity relationship (QSAR) model was built by the support vector machine. The correlation coefficient was 0.97 and the root mean square error, the differences between predicted and experimental inhibition efficiencies (%), was 1.48. Furthermore, five new amino acids were theoretically designed and their inhibition efficiencies were predicted by the built QSAR model

  9. Skin sensitization: Modeling based on skin metabolism simulation and formation of protein conjugates

    DEFF Research Database (Denmark)

    Dimitrov, Sabcho; Low, Lawrence; Patlewicz, Grace

    2005-01-01

    alerting groups, three-dimensional (3D)-QSARs were developed to describe the multiplicity of physicochemical, steric, and electronic parameters. These 3D-QSARs, so-called pattern recognition-type models, were applied each time a latent alerting group was identified in a parent chemical or its generated...... in the model building. The TIssue MEtabolism Simulator (TIMES) software was used to integrate a skin metabolism simulator and 3D-QSARs to evaluate the reactivity of chemicals thus predicting their likely skin sensitization potency....

  10. Probing the Hypothesis of SAR Continuity Restoration by the Removal of Activity Cliffs Generators in QSAR.

    Science.gov (United States)

    Cruz-Monteagudo, Maykel; Medina-Franco, José L; Perera-Sardiña, Yunier; Borges, Fernanda; Tejera, Eduardo; Paz-Y-Miño, Cesar; Pérez-Castillo, Yunierkis; Sánchez-Rodríguez, Aminael; Contreras-Posada, Zuleidys; Cordeiro, M Natália D S

    2016-01-01

    In this work we report the first attempt to study the effect of activity cliffs over the generalization ability of machine learning (ML) based QSAR classifiers, using as study case a previously reported diverse and noisy dataset focused on drug induced liver injury (DILI) and more than 40 ML classification algorithms. Here, the hypothesis of structure-activity relationship (SAR) continuity restoration by activity cliffs removal is tested as a potential solution to overcome such limitation. Previously, a parallelism was established between activity cliffs generators (ACGs) and instances that should be misclassified (ISMs), a related concept from the field of machine learning. Based on this concept we comparatively studied the classification performance of multiple machine learning classifiers as well as the consensus classifier derived from predictive classifiers obtained from training sets including or excluding ACGs. The influence of the removal of ACGs from the training set over the virtual screening performance was also studied for the respective consensus classifiers algorithms. In general terms, the removal of the ACGs from the training process slightly decreased the overall accuracy of the ML classifiers and multi-classifiers, improving their sensitivity (the weakest feature of ML classifiers trained with ACGs) but decreasing their specificity. Although these results do not support a positive effect of the removal of ACGs over the classification performance of ML classifiers, the "balancing effect" of ACG removal demonstrated to positively influence the virtual screening performance of multi-classifiers based on valid base ML classifiers. Specially, the early recognition ability was significantly favored after ACGs removal. The results presented and discussed in this work represent the first step towards the application of a remedial solution to the activity cliffs problem in QSAR studies.

  11. QSTR with extended topochemical atom (ETA) indices. 14. QSAR modeling of toxicity of aromatic aldehydes to Tetrahymena pyriformis

    Energy Technology Data Exchange (ETDEWEB)

    Roy, Kunal, E-mail: kunalroy_in@yahoo.com [Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700 032 (India); Das, Rudra Narayan [Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700 032 (India)

    2010-11-15

    Aldehydes are a toxic class of chemicals causing severe health hazards. In this background, quantitative structure-toxicity relationship (QSTR) models have been developed in the present study using Extended Topochemical Atom (ETA) indices for a large group of 77 aromatic aldehydes for their acute toxicity against the protozoan ciliate Tetrahymena pyriformis. The ETA models have been compared with those developed using various non-ETA topological indices. Attempt was also made to include the n-octanol/water partition coefficient (log K{sub o/w}) as an additional descriptor considering the importance of hydrophobicity in toxicity prediction. Thirty different models were developed using different chemometric tools. All the models have been validated using internal validation and external validation techniques. The statistical quality of the ETA models was found to be comparable to that of the non-ETA models. The ETA models have shown the important effects of steric bulk, lipophilicity, presence of electronegative atom containing substituents and functionality of the aldehydic oxygen to the toxicity of the aldehydes. The best ETA model (without using log K{sub o/w}) shows encouraging statistical quality (Q{sub int}{sup 2}=0.709,Q{sub ext}{sup 2}=0.744). It is interesting to note that some of the topological models reported here are better in statistical quality than previously reported models using quantum chemical descriptors.

  12. Validation of the Colorado Retinopathy of Prematurity Screening Model.

    Science.gov (United States)

    McCourt, Emily A; Ying, Gui-Shuang; Lynch, Anne M; Palestine, Alan G; Wagner, Brandie D; Wymore, Erica; Tomlinson, Lauren A; Binenbaum, Gil

    2018-04-01

    The Colorado Retinopathy of Prematurity (CO-ROP) model uses birth weight, gestational age, and weight gain at the first month of life (WG-28) to predict risk of severe retinopathy of prematurity (ROP). In previous validation studies, the model performed very well, predicting virtually all cases of severe ROP and potentially reducing the number of infants who need ROP examinations, warranting validation in a larger, more diverse population. To validate the performance of the CO-ROP model in a large multicenter cohort. This study is a secondary analysis of data from the Postnatal Growth and Retinopathy of Prematurity (G-ROP) Study, a retrospective multicenter cohort study conducted in 29 hospitals in the United States and Canada between January 2006 and June 2012 of 6351 premature infants who received ROP examinations. Sensitivity and specificity for severe (early treatment of ROP [ETROP] type 1 or 2) ROP, and reduction in infants receiving examinations. The CO-ROP model was applied to the infants in the G-ROP data set with all 3 data points (infants would have received examinations if they met all 3 criteria: birth weight, large validation cohort. The model requires all 3 criteria to be met to signal a need for examinations, but some infants with a birth weight or gestational age above the thresholds developed severe ROP. Most of these infants who were not detected by the CO-ROP model had obvious deviation in expected weight trajectories or nonphysiologic weight gain. These findings suggest that the CO-ROP model needs to be revised before considering implementation into clinical practice.

  13. A Quantitative Structure-Activity Relationships (QSAR Study of Piperine Based Derivatives with Leishmanicidal Activity

    Directory of Open Access Journals (Sweden)

    Edilson Beserra Alencar Filho

    2017-04-01

    Full Text Available Leishmaniasis is a parasitic disease which represents a serious public health problem in developing countries. It is considered a neglected tropical disease, for which there is little initiative in the search for therapeutic alternatives by pharmaceutical industry. Natural products remain a great source of inspiration for obtaining bioactive molecules. In 2010, Singh and co-workers published the synthesis and in vitro biological activity of piperoyl-aminoacid conjugates, as well as of piperine, against cellular cultures of Leishmania donovani. The piperine is an alkaloid isolated from Piper nigrum that has many activities described in the literature. In this work, we present a Quantitative Structure-Activity Study of piperine derivatives tested by Singh and co-workers, aiming to highlight important molecular features for leishmanicidal activity, obtaining a mathematical model to predict the activity of new analogs. Compounds were submitted to a geometry optimization computational procedure at semiempirical level of quantum theory. Molecular descriptors for the set of compounds were calculated by E-Dragon online plataform, followed by a variable selection procedure using Ordered Predictors Selection algorithm. Validation parameters obtained showed that a good QSAR model, based on multiple linear regression, was obtained (R2 = 0.85; Q2 = 0.69, and the following conclusions regarding the structure-activity relationship were elucidated: Compounds with electronegative atoms on different substituent groups of analogs, absence of unsaturation on lateral chain, presence of ester instead of carboxyl, and large volumes (due the presence of additional aromatic rings trends to increase the activity against promastigote forms of leishmania. DOI: http://dx.doi.org/10.17807/orbital.v9i1.893

  14. Analytical models approximating individual processes: a validation method.

    Science.gov (United States)

    Favier, C; Degallier, N; Menkès, C E

    2010-12-01

    Upscaling population models from fine to coarse resolutions, in space, time and/or level of description, allows the derivation of fast and tractable models based on a thorough knowledge of individual processes. The validity of such approximations is generally tested only on a limited range of parameter sets. A more general validation test, over a range of parameters, is proposed; this would estimate the error induced by the approximation, using the original model's stochastic variability as a reference. This method is illustrated by three examples taken from the field of epidemics transmitted by vectors that bite in a temporally cyclical pattern, that illustrate the use of the method: to estimate if an approximation over- or under-fits the original model; to invalidate an approximation; to rank possible approximations for their qualities. As a result, the application of the validation method to this field emphasizes the need to account for the vectors' biology in epidemic prediction models and to validate these against finer scale models. Copyright © 2010 Elsevier Inc. All rights reserved.

  15. In silico environmental chemical science: properties and processes from statistical and computational modelling

    Energy Technology Data Exchange (ETDEWEB)

    Tratnyek, P. G.; Bylaska, Eric J.; Weber, Eric J.

    2017-01-01

    Quantitative structure–activity relationships (QSARs) have long been used in the environmental sciences. More recently, molecular modeling and chemoinformatic methods have become widespread. These methods have the potential to expand and accelerate advances in environmental chemistry because they complement observational and experimental data with “in silico” results and analysis. The opportunities and challenges that arise at the intersection between statistical and theoretical in silico methods are most apparent in the context of properties that determine the environmental fate and effects of chemical contaminants (degradation rate constants, partition coefficients, toxicities, etc.). The main example of this is the calibration of QSARs using descriptor variable data calculated from molecular modeling, which can make QSARs more useful for predicting property data that are unavailable, but also can make them more powerful tools for diagnosis of fate determining pathways and mechanisms. Emerging opportunities for “in silico environmental chemical science” are to move beyond the calculation of specific chemical properties using statistical models and toward more fully in silico models, prediction of transformation pathways and products, incorporation of environmental factors into model predictions, integration of databases and predictive models into more comprehensive and efficient tools for exposure assessment, and extending the applicability of all the above from chemicals to biologicals and materials.

  16. In silico environmental chemical science: properties and processes from statistical and computational modelling.

    Science.gov (United States)

    Tratnyek, Paul G; Bylaska, Eric J; Weber, Eric J

    2017-03-22

    Quantitative structure-activity relationships (QSARs) have long been used in the environmental sciences. More recently, molecular modeling and chemoinformatic methods have become widespread. These methods have the potential to expand and accelerate advances in environmental chemistry because they complement observational and experimental data with "in silico" results and analysis. The opportunities and challenges that arise at the intersection between statistical and theoretical in silico methods are most apparent in the context of properties that determine the environmental fate and effects of chemical contaminants (degradation rate constants, partition coefficients, toxicities, etc.). The main example of this is the calibration of QSARs using descriptor variable data calculated from molecular modeling, which can make QSARs more useful for predicting property data that are unavailable, but also can make them more powerful tools for diagnosis of fate determining pathways and mechanisms. Emerging opportunities for "in silico environmental chemical science" are to move beyond the calculation of specific chemical properties using statistical models and toward more fully in silico models, prediction of transformation pathways and products, incorporation of environmental factors into model predictions, integration of databases and predictive models into more comprehensive and efficient tools for exposure assessment, and extending the applicability of all the above from chemicals to biologicals and materials.

  17. Synthesis, Structural Studies and Biological Evaluation of Connections of Thiosemicarbazide, 1,2,4-Triazole and 1,3,4-Thiadiazole with Palmitic Acid

    Directory of Open Access Journals (Sweden)

    Michał Jóźwiak

    2018-04-01

    Full Text Available Thirty new derivatives of palmitic acid were efficiently synthesized. All obtained compounds can be divided into three groups of derivatives: Thiosemicarbazides (compounds 1–10, 1,2,4-triazoles (compounds 1a–10a and 1,3,4-thiadiazoles (compounds 1b–10b moieties. 1H-NMR, 13C-NMR and MS methods were used to confirm the structure of derivatives. All obtained compounds were tested in vitro against a number of microorganisms, including Gram-positive cocci, Gram-negative rods and Candida albicans. Compounds 4, 5, 6, 8 showed significant inhibition against C. albicans. The range of MIC values was 50–1.56 μg/mL. The halogen atom, especially at the 3rd position of the phenyl group was significantly important for antifungal activity. The biological activity against Candida albicans and selected molecular descriptors were used as a basis for QSAR models, that have been determined by means of multiple linear regression. The models have been validated by means of the Leave-One-Out Cross Validation. The obtained QSAR models were characterized by high determination coefficients and good prediction power.

  18. QSAR development and profiling of 72,524 REACH substances for PXR activation and CYP3A4 induction

    DEFF Research Database (Denmark)

    Abildgaard Rosenberg, Sine; Xia, M.; Huang, R.

    2017-01-01

    ,524 substances pre-registered under the EU chemicals regulation, REACH, and the models could predict 52.5% to 71.9% of the substances within their respective applicability domains. These predictions can, for example, be used for priority setting and in weight-of-evidence assessments of chemicals. Statistical...... analyses of the experimental drug dataset and the QSAR-predicted set of REACH substances were performed to identify similarities and differences in frequencies of overlapping positive results for PXR binding, PXR activation and CYP3A4 induction between the two datasets....

  19. Structure-activity relationships between sterols and their thermal stability in oil matrix.

    Science.gov (United States)

    Hu, Yinzhou; Xu, Junli; Huang, Weisu; Zhao, Yajing; Li, Maiquan; Wang, Mengmeng; Zheng, Lufei; Lu, Baiyi

    2018-08-30

    Structure-activity relationships between 20 sterols and their thermal stabilities were studied in a model oil system. All sterol degradations were found to be consistent with a first-order kinetic model with determination of coefficient (R 2 ) higher than 0.9444. The number of double bonds in the sterol structure was negatively correlated with the thermal stability of sterol, whereas the length of the branch chain was positively correlated with the thermal stability of sterol. A quantitative structure-activity relationship (QSAR) model to predict thermal stability of sterol was developed by using partial least squares regression (PLSR) combined with genetic algorithm (GA). A regression model was built with R 2 of 0.806. Almost all sterol degradation constants can be predicted accurately with R 2 of cross-validation equals to 0.680. Four important variables were selected in optimal QSAR model and the selected variables were observed to be related with information indices, RDF descriptors, and 3D-MoRSE descriptors. Copyright © 2018 Elsevier Ltd. All rights reserved.

  20. Validation of simulation models

    DEFF Research Database (Denmark)

    Rehman, Muniza; Pedersen, Stig Andur

    2012-01-01

    In philosophy of science, the interest for computational models and simulations has increased heavily during the past decades. Different positions regarding the validity of models have emerged but the views have not succeeded in capturing the diversity of validation methods. The wide variety...

  1. QSAR Study on Caffeine Derivatives Docked on Poly(ARNA Polymerase Protein Cid1

    Directory of Open Access Journals (Sweden)

    Teodora E. Harsa

    2016-06-01

    Full Text Available Caffeine is the most commonly ingested alkylxantine and is recognized as a psycho-stimulant. It improves some aspects of cognitive performance, however it reduces the cerebral blood flow both in animals and humans. In this paper a QSAR study on caffeine derivatives, docked on the Poly(ARNA polymerase protein cid1, is reported. A set of forty caffeine derivatives, downloaded from PubChem, was modeled, within the hypermolecule strategy; the predicted activity was LD50 and prediction was done on similarity clusters with the leaders chosen as the best docked ligands on the Poly(ARNA polymerase protein cid1. It was concluded that LD50 of the studied caffeines is not influenced by their binding to the target protein. This work is licensed under a Creative Commons Attribution 4.0 International License.

  2. The influence of R and S configurations of a series of amphetamine derivatives on quantitative structure-activity relationship models

    Energy Technology Data Exchange (ETDEWEB)

    Fresqui, Maira A.C., E-mail: maira@iqsc.usp.br [Institute of Chemistry of Sao Carlos, University of Sao Paulo, Av. Trabalhador Sao-carlense, 400, POB 780, 13560-970 Sao Carlos, SP (Brazil); Ferreira, Marcia M.C., E-mail: marcia@iqm.unicamp.br [Institute of Chemistry, University of Campinas - UNICAMP, POB 6154, 13083-970 Campinas, SP (Brazil); Trsic, Milan, E-mail: cra612@gmail.com [Institute of Chemistry of Sao Carlos, University of Sao Paulo, Av. Trabalhador Sao-carlense, 400, POB 780, 13560-970 Sao Carlos, SP (Brazil)

    2013-01-08

    Highlights: Black-Right-Pointing-Pointer The QSAR model is not dependent of ligand conformation. Black-Right-Pointing-Pointer Amphetamines were analyzed by quantum chemical, steric and hydrophobic descriptors. Black-Right-Pointing-Pointer CHELPG atomic charges on the benzene ring are one of the most important descriptors. Black-Right-Pointing-Pointer The PLS models built were extensively validated. Black-Right-Pointing-Pointer Manual docking supports the QSAR results by pi-pi stacking interactions. - Abstract: Chiral molecules need special attention in drug design. In this sense, the R and S configurations of a series of thirty-four amphetamines were evaluated by quantitative structure-activity relationship (QSAR). This class of compounds has antidepressant, anti-Parkinson and anti-Alzheimer effects against the enzyme monoamine oxidase A (MAO A). A set of thirty-eight descriptors, including electronic, steric and hydrophobic ones, were calculated. Variable selection was performed through the correlation coefficients followed by the ordered predictor selection (OPS) algorithm. Six descriptors (CHELPG atomic charges C3, C4 and C5, electrophilicity, molecular surface area and log P) were selected for both configurations and a satisfactory model was obtained by PLS regression with three latent variables with R{sup 2} = 0.73 and Q{sup 2} = 0.60, with external predictability Q{sup 2} = 0.68, and R{sup 2} = 0.76 and Q{sup 2} = 0.67 with external predictability Q{sup 2} = 0.50, for R and S configurations, respectively. To confirm the robustness of each model, leave-N-out cross validation (LNO) was carried out and the y-randomization test was used to check if these models present chance correlation. Moreover, both automated or a manual molecular docking indicate that the reaction of ligands with the enzyme occurs via pi-pi stacking interaction with Tyr407, inclined face-to-face interaction with Tyr444, while aromatic hydrogen-hydrogen interactions with Tyr197 are preferable

  3. Synthesis, QSAR, and Molecular Dynamics Simulation of Amidino-substituted Benzimidazoles as Dipeptidyl Peptidase III Inhibitors.

    Science.gov (United States)

    Rastija, Vesna; Agić, Dejan; Tomiš, Sanja; Nikolič, Sonja; Hranjec, Marijana; Grace, Karminski-Zamola; Abramić, Marija

    2015-01-01

    A molecular modeling study is performed on series of benzimidazol-based inhibitors of human dipeptidyl peptidase III (DPP III). An eight novel compounds were synthesized in excellent yields using green chemistry approach. This study is aimed to elucidate the structural features of benzimidazole derivatives required for antagonism of human DPP III activity using Quantitative Structure-Activity Relationship (QSAR) analysis, and to understand the mechanism of one of the most potent inhibitor binding into the active site of this enzyme, by molecular dynamics (MD) simulations. The best model obtained includes S3K and RDF045m descriptors which have explained 89.4 % of inhibitory activity. Depicted moiety for strong inhibition activity matches to the structure of most potent compound. MD simulation has revealed importance of imidazolinyl and phenyl groups in the mechanism of binding into the active site of human DPP III.

  4. Quantitative structure-activity relationship modeling of the toxicity of organothiophosphate pesticides to Daphnia magna and Cyprinus carpio

    NARCIS (Netherlands)

    Zvinavashe, E.; Du, T.; Griff, T.; Berg, van den J.H.J.; Soffers, A.E.M.F.; Vervoort, J.J.M.; Murk, A.J.; Rietjens, I.

    2009-01-01

    Within the REACH regulatory framework in the EU, quantitative structure-activity relationships (QSAR) models are expected to help reduce the number of animals used for experimental testing. The objective of this study was to develop QSAR models to describe the acute toxicity of organothiophosphate

  5. Test-driven verification/validation of model transformations

    Institute of Scientific and Technical Information of China (English)

    László LENGYEL; Hassan CHARAF

    2015-01-01

    Why is it important to verify/validate model transformations? The motivation is to improve the quality of the trans-formations, and therefore the quality of the generated software artifacts. Verified/validated model transformations make it possible to ensure certain properties of the generated software artifacts. In this way, verification/validation methods can guarantee different requirements stated by the actual domain against the generated/modified/optimized software products. For example, a verified/ validated model transformation can ensure the preservation of certain properties during the model-to-model transformation. This paper emphasizes the necessity of methods that make model transformation verified/validated, discusses the different scenarios of model transformation verification and validation, and introduces the principles of a novel test-driven method for verifying/ validating model transformations. We provide a solution that makes it possible to automatically generate test input models for model transformations. Furthermore, we collect and discuss the actual open issues in the field of verification/validation of model transformations.

  6. Publicly available models to predict normal boiling point of organic compounds

    International Nuclear Information System (INIS)

    Oprisiu, Ioana; Marcou, Gilles; Horvath, Dragos; Brunel, Damien Bernard; Rivollet, Fabien; Varnek, Alexandre

    2013-01-01

    Quantitative structure–property models to predict the normal boiling point (T b ) of organic compounds were developed using non-linear ASNNs (associative neural networks) as well as multiple linear regression – ISIDA-MLR and SQS (stochastic QSAR sampler). Models were built on a diverse set of 2098 organic compounds with T b varying in the range of 185–491 K. In ISIDA-MLR and ASNN calculations, fragment descriptors were used, whereas fragment, FPTs (fuzzy pharmacophore triplets), and ChemAxon descriptors were employed in SQS models. Prediction quality of the models has been assessed in 5-fold cross validation. Obtained models were implemented in the on-line ISIDA predictor at (http://infochim.u-strasbg.fr/webserv/VSEngine.html)

  7. QSARs for phenols and phenolates: oxidation potential as a predictor of reaction rate constants with photochemically produced oxidants.

    Science.gov (United States)

    Arnold, William A; Oueis, Yan; O'Connor, Meghan; Rinaman, Johanna E; Taggart, Miranda G; McCarthy, Rachel E; Foster, Kimberley A; Latch, Douglas E

    2017-03-22

    Quantitative structure-activity relationships (QSARs) for prediction of the reaction rate constants of phenols and phenolates with three photochemically produced oxidants, singlet oxygen, carbonate radical, and triplet excited state sensitizers/organic matter, are developed. The predictive variable is the one-electron oxidation potential (E 1 ), which is calculated for each species using density functional theory. The reaction rate constants are obtained from the literature, and for singlet oxygen, are augmented with new experimental data. Calculated E 1 values have a mean unsigned error compared to literature values of 0.04-0.06 V. For singlet oxygen, a single linear QSAR that includes both phenols and phenolates is developed that predicts experimental rate constants, on average, to within a factor of three. Predictions for only 6 out of 87 compounds are off by more than a factor of 10. A more limited data set for carbonate radical reactions with phenols and phenolates also gives a single linear QSAR with prediction of rate constant being accurate to within a factor of three. The data for the reactions of phenols with triplet state sensitizers demonstrate that two sensitizers, 2-acetonaphthone and methylene blue, most closely predict the reactivity trend of triplet excited state organic matter with phenols. Using sensitizers with stronger reduction potentials could lead to overestimation of rate constants and thus underestimation of phenolic pollutant persistence.

  8. A Conceptual Framework for Predicting the Toxicity of Reactive Chemicals: Modeling Soft Electrophilicity

    Science.gov (United States)

    Although the literature is replete with QSAR models developed for many toxic effects caused by reversible chemical interactions, the development of QSARs for the toxic effects of reactive chemicals lacks a consistent approach. While limitations exit, an appropriate starting-point...

  9. hERG blocking potential of acids and zwitterions characterized by three thresholds for acidity, size and reactivity

    DEFF Research Database (Denmark)

    Nikolov, Nikolai Georgiev; Dybdahl, Marianne; Jonsdottir, Svava Osk

    2014-01-01

    with a concordance of 91% by a decision tree based on the rule. Two external validations were performed with sets of 35 and 48 observations, respectively, both showing concordances of 91%. In addition, a global QSAR model of hERG blocking was constructed based on a large diverse training set of 1374 chemicals...... covering all ionization classes, externally validated showing high predictivity and compared to the decision tree. The decision tree was found to be superior for the acids and zwitterionic ampholytes classes....

  10. Pharmacokinetic modeling of gentamicin in treatment of infective endocarditis: Model development and validation of existing models

    Science.gov (United States)

    van der Wijk, Lars; Proost, Johannes H.; Sinha, Bhanu; Touw, Daan J.

    2017-01-01

    Gentamicin shows large variations in half-life and volume of distribution (Vd) within and between individuals. Thus, monitoring and accurately predicting serum levels are required to optimize effectiveness and minimize toxicity. Currently, two population pharmacokinetic models are applied for predicting gentamicin doses in adults. For endocarditis patients the optimal model is unknown. We aimed at: 1) creating an optimal model for endocarditis patients; and 2) assessing whether the endocarditis and existing models can accurately predict serum levels. We performed a retrospective observational two-cohort study: one cohort to parameterize the endocarditis model by iterative two-stage Bayesian analysis, and a second cohort to validate and compare all three models. The Akaike Information Criterion and the weighted sum of squares of the residuals divided by the degrees of freedom were used to select the endocarditis model. Median Prediction Error (MDPE) and Median Absolute Prediction Error (MDAPE) were used to test all models with the validation dataset. We built the endocarditis model based on data from the modeling cohort (65 patients) with a fixed 0.277 L/h/70kg metabolic clearance, 0.698 (±0.358) renal clearance as fraction of creatinine clearance, and Vd 0.312 (±0.076) L/kg corrected lean body mass. External validation with data from 14 validation cohort patients showed a similar predictive power of the endocarditis model (MDPE -1.77%, MDAPE 4.68%) as compared to the intensive-care (MDPE -1.33%, MDAPE 4.37%) and standard (MDPE -0.90%, MDAPE 4.82%) models. All models acceptably predicted pharmacokinetic parameters for gentamicin in endocarditis patients. However, these patients appear to have an increased Vd, similar to intensive care patients. Vd mainly determines the height of peak serum levels, which in turn correlate with bactericidal activity. In order to maintain simplicity, we advise to use the existing intensive-care model in clinical practice to avoid

  11. Pharmacokinetic modeling of gentamicin in treatment of infective endocarditis: Model development and validation of existing models.

    Directory of Open Access Journals (Sweden)

    Anna Gomes

    Full Text Available Gentamicin shows large variations in half-life and volume of distribution (Vd within and between individuals. Thus, monitoring and accurately predicting serum levels are required to optimize effectiveness and minimize toxicity. Currently, two population pharmacokinetic models are applied for predicting gentamicin doses in adults. For endocarditis patients the optimal model is unknown. We aimed at: 1 creating an optimal model for endocarditis patients; and 2 assessing whether the endocarditis and existing models can accurately predict serum levels. We performed a retrospective observational two-cohort study: one cohort to parameterize the endocarditis model by iterative two-stage Bayesian analysis, and a second cohort to validate and compare all three models. The Akaike Information Criterion and the weighted sum of squares of the residuals divided by the degrees of freedom were used to select the endocarditis model. Median Prediction Error (MDPE and Median Absolute Prediction Error (MDAPE were used to test all models with the validation dataset. We built the endocarditis model based on data from the modeling cohort (65 patients with a fixed 0.277 L/h/70kg metabolic clearance, 0.698 (±0.358 renal clearance as fraction of creatinine clearance, and Vd 0.312 (±0.076 L/kg corrected lean body mass. External validation with data from 14 validation cohort patients showed a similar predictive power of the endocarditis model (MDPE -1.77%, MDAPE 4.68% as compared to the intensive-care (MDPE -1.33%, MDAPE 4.37% and standard (MDPE -0.90%, MDAPE 4.82% models. All models acceptably predicted pharmacokinetic parameters for gentamicin in endocarditis patients. However, these patients appear to have an increased Vd, similar to intensive care patients. Vd mainly determines the height of peak serum levels, which in turn correlate with bactericidal activity. In order to maintain simplicity, we advise to use the existing intensive-care model in clinical practice to

  12. Concepts of Model Verification and Validation

    International Nuclear Information System (INIS)

    Thacker, B.H.; Doebling, S.W.; Hemez, F.M.; Anderson, M.C.; Pepin, J.E.; Rodriguez, E.A.

    2004-01-01

    VandV for all safety-related nuclear facility design, analyses, and operations. In fact, DNFSB 2002-1 recommends to the DOE and National Nuclear Security Administration (NNSA) that a VandV process be performed for all safety related software and analysis. Model verification and validation are the primary processes for quantifying and building credibility in numerical models. Verification is the process of determining that a model implementation accurately represents the developer's conceptual description of the model and its solution. Validation is the process of determining the degree to which a model is an accurate representation of the real world from the perspective of the intended uses of the model. Both verification and validation are processes that accumulate evidence of a model's correctness or accuracy for a specific scenario; thus, VandV cannot prove that a model is correct and accurate for all possible scenarios, but, rather, it can provide evidence that the model is sufficiently accurate for its intended use. Model VandV is fundamentally different from software VandV. Code developers developing computer programs perform software VandV to ensure code correctness, reliability, and robustness. In model VandV, the end product is a predictive model based on fundamental physics of the problem being solved. In all applications of practical interest, the calculations involved in obtaining solutions with the model require a computer code, e.g., finite element or finite difference analysis. Therefore, engineers seeking to develop credible predictive models critically need model VandV guidelines and procedures. The expected outcome of the model VandV process is the quantified level of agreement between experimental data and model prediction, as well as the predictive accuracy of the model. This report attempts to describe the general philosophy, definitions, concepts, and processes for conducting a successful VandV program. This objective is motivated by the need for

  13. Advanced training simulator models. Implementation and validation

    International Nuclear Information System (INIS)

    Borkowsky, Jeffrey; Judd, Jerry; Belblidia, Lotfi; O'farrell, David; Andersen, Peter

    2008-01-01

    Modern training simulators are required to replicate plant data for both thermal-hydraulic and neutronic response. Replication is required such that reactivity manipulation on the simulator properly trains the operator for reactivity manipulation at the plant. This paper discusses advanced models which perform this function in real-time using the coupled code system THOR/S3R. This code system models the all fluids systems in detail using an advanced, two-phase thermal-hydraulic a model. The nuclear core is modeled using an advanced, three-dimensional nodal method and also by using cycle-specific nuclear data. These models are configured to run interactively from a graphical instructor station or handware operation panels. The simulator models are theoretically rigorous and are expected to replicate the physics of the plant. However, to verify replication, the models must be independently assessed. Plant data is the preferred validation method, but plant data is often not available for many important training scenarios. In the absence of data, validation may be obtained by slower-than-real-time transient analysis. This analysis can be performed by coupling a safety analysis code and a core design code. Such a coupling exists between the codes RELAP5 and SIMULATE-3K (S3K). RELAP5/S3K is used to validate the real-time model for several postulated plant events. (author)

  14. The concept of validation of numerical models for consequence analysis

    International Nuclear Information System (INIS)

    Borg, Audun; Paulsen Husted, Bjarne; Njå, Ove

    2014-01-01

    Numerical models such as computational fluid dynamics (CFD) models are increasingly used in life safety studies and other types of analyses to calculate the effects of fire and explosions. The validity of these models is usually established by benchmark testing. This is done to quantitatively measure the agreement between the predictions provided by the model and the real world represented by observations in experiments. This approach assumes that all variables in the real world relevant for the specific study are adequately measured in the experiments and in the predictions made by the model. In this paper the various definitions of validation for CFD models used for hazard prediction are investigated to assess their implication for consequence analysis in a design phase. In other words, how is uncertainty in the prediction of future events reflected in the validation process? The sources of uncertainty are viewed from the perspective of the safety engineer. An example of the use of a CFD model is included to illustrate the assumptions the analyst must make and how these affect the prediction made by the model. The assessments presented in this paper are based on a review of standards and best practice guides for CFD modeling and the documentation from two existing CFD programs. Our main thrust has been to assess how validation work is performed and communicated in practice. We conclude that the concept of validation adopted for numerical models is adequate in terms of model performance. However, it does not address the main sources of uncertainty from the perspective of the safety engineer. Uncertainty in the input quantities describing future events, which are determined by the model user, outweighs the inaccuracies in the model as reported in validation studies. - Highlights: • Examine the basic concept of validation applied to models for consequence analysis. • Review standards and guides for validation of numerical models. • Comparison of the validation

  15. QSAR pre-screen of 70,983 substances for genotoxic carcinogenicity, mutagenicity and developmental toxicity in the EU FP7 project ChemScreen

    DEFF Research Database (Denmark)

    Wedebye, Eva Bay; Dybdahl, Marianne; Nikolov, Nikolai Georgiev

    2014-01-01

    be performed in REACH on known genotoxic carcinogens or germ cell mutagens with appropriate risk management measures implemented, a QSAR pre-screen for genotoxic carcinogenicity, germ cell mutagenicity and (limited) developmental toxicity was included in the project. Predictions for estrogenic and anti...... algorithms were applied to combine the predictions from the individual models to reach overall predictions for genotoxic carcinogenicity, germ cell mutagenicity and developmental toxicity. Furthermore, the full list of REACH pre-registered substances (143,835) was searched for substances containing certain...

  16. A Validated Clinical Risk Prediction Model for Lung Cancer in Smokers of All Ages and Exposure Types

    DEFF Research Database (Denmark)

    Markaki, Maria; Tsamardinos, Ioannis; Langhammer, Arnulf

    2018-01-01

    Lung cancer causes >1·6 million deaths annually, with early diagnosis being paramount to effective treatment. Here we present a validated risk assessment model for lung cancer screening. The prospective HUNT2 population study in Norway examined 65,237 people aged >20years in 1995-97. After a median...

  17. QSAR analysis for nano-sized layered manganese-calcium oxide in water oxidation: An application of chemometric methods in artificial photosynthesis.

    Science.gov (United States)

    Shahbazy, Mohammad; Kompany-Zareh, Mohsen; Najafpour, Mohammad Mahdi

    2015-11-01

    Water oxidation is among the most important reactions in artificial photosynthesis, and nano-sized layered manganese-calcium oxides are efficient catalysts toward this reaction. Herein, a quantitative structure-activity relationship (QSAR) model was constructed to predict the catalytic activities of twenty manganese-calcium oxides toward water oxidation using multiple linear regression (MLR) and genetic algorithm (GA) for multivariate calibration and feature selection, respectively. Although there are eight controlled parameters during synthesizing of the desired catalysts including ripening time, temperature, manganese content, calcium content, potassium content, the ratio of calcium:manganese, the average manganese oxidation state and the surface of catalyst, by using GA only three of them (potassium content, the ratio of calcium:manganese and the average manganese oxidation state) were selected as the most effective parameters on catalytic activities of these compounds. The model's accuracy criteria such as R(2)test and Q(2)test in order to predict catalytic rate for external test set experiments; were equal to 0.941 and 0.906, respectively. Therefore, model reveals acceptable capability to anticipate the catalytic activity. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Beyond Modeling: All-Atom Olfactory Receptor Model Simulations

    Directory of Open Access Journals (Sweden)

    Peter C Lai

    2012-05-01

    Full Text Available Olfactory receptors (ORs are a type of GTP-binding protein-coupled receptor (GPCR. These receptors are responsible for mediating the sense of smell through their interaction with odor ligands. OR-odorant interactions marks the first step in the process that leads to olfaction. Computational studies on model OR structures can validate experimental functional studies as well as generate focused and novel hypotheses for further bench investigation by providing a view of these interactions at the molecular level. Here we have shown the specific advantages of simulating the dynamic environment that is associated with OR-odorant interactions. We present a rigorous methodology that ranges from the creation of a computationally-derived model of an olfactory receptor to simulating the interactions between an OR and an odorant molecule. Given the ubiquitous occurrence of GPCRs in the membranes of cells, we anticipate that our OR-developed methodology will serve as a model for the computational structural biology of all GPCRs.

  19. IN-DRIFT MICROBIAL COMMUNITIES MODEL VALIDATION CALCULATIONS

    Energy Technology Data Exchange (ETDEWEB)

    D.M. Jolley

    2001-12-18

    The objective and scope of this calculation is to create the appropriate parameter input for MING 1.0 (CSCI 30018 V1.0, CRWMS M&O 1998b) that will allow the testing of the results from the MING software code with both scientific measurements of microbial populations at the site and laboratory and with natural analogs to the site. This set of calculations provides results that will be used in model validation for the ''In-Drift Microbial Communities'' model (CRWMS M&O 2000) which is part of the Engineered Barrier System Department (EBS) process modeling effort that eventually will feed future Total System Performance Assessment (TSPA) models. This calculation is being produced to replace MING model validation output that is effected by the supersession of DTN M09909SPAMINGl.003 using its replacement DTN M00106SPAIDMO 1.034 so that the calculations currently found in the ''In-Drift Microbial Communities'' AMR (CRWMS M&O 2000) will be brought up to date. This set of calculations replaces the calculations contained in sections 6.7.2, 6.7.3 and Attachment I of CRWMS M&O (2000) As all of these calculations are created explicitly for model validation, the data qualification status of all inputs can be considered corroborative in accordance with AP-3.15Q. This work activity has been evaluated in accordance with the AP-2.21 procedure, ''Quality Determinations and Planning for Scientific, Engineering, and Regulatory Compliance Activities'', and is subject to QA controls (BSC 2001). The calculation is developed in accordance with the AP-3.12 procedure, Calculations, and prepared in accordance with the ''Technical Work Plan For EBS Department Modeling FY 01 Work Activities'' (BSC 200 1) which includes controls for the management of electronic data.

  20. In-Drift Microbial Communities Model Validation Calculations

    Energy Technology Data Exchange (ETDEWEB)

    D. M. Jolley

    2001-09-24

    The objective and scope of this calculation is to create the appropriate parameter input for MING 1.0 (CSCI 30018 V1.0, CRWMS M&O 1998b) that will allow the testing of the results from the MING software code with both scientific measurements of microbial populations at the site and laboratory and with natural analogs to the site. This set of calculations provides results that will be used in model validation for the ''In-Drift Microbial Communities'' model (CRWMS M&O 2000) which is part of the Engineered Barrier System Department (EBS) process modeling effort that eventually will feed future Total System Performance Assessment (TSPA) models. This calculation is being produced to replace MING model validation output that is effected by the supersession of DTN MO9909SPAMING1.003 using its replacement DTN MO0106SPAIDM01.034 so that the calculations currently found in the ''In-Drift Microbial Communities'' AMR (CRWMS M&O 2000) will be brought up to date. This set of calculations replaces the calculations contained in sections 6.7.2, 6.7.3 and Attachment I of CRWMS M&O (2000) As all of these calculations are created explicitly for model validation, the data qualification status of all inputs can be considered corroborative in accordance with AP-3.15Q. This work activity has been evaluated in accordance with the AP-2.21 procedure, ''Quality Determinations and Planning for Scientific, Engineering, and Regulatory Compliance Activities'', and is subject to QA controls (BSC 2001). The calculation is developed in accordance with the AP-3.12 procedure, Calculations, and prepared in accordance with the ''Technical Work Plan For EBS Department Modeling FY 01 Work Activities'' (BSC 2001) which includes controls for the management of electronic data.

  1. In-Drift Microbial Communities Model Validation Calculation

    Energy Technology Data Exchange (ETDEWEB)

    D. M. Jolley

    2001-10-31

    The objective and scope of this calculation is to create the appropriate parameter input for MING 1.0 (CSCI 30018 V1.0, CRWMS M&O 1998b) that will allow the testing of the results from the MING software code with both scientific measurements of microbial populations at the site and laboratory and with natural analogs to the site. This set of calculations provides results that will be used in model validation for the ''In-Drift Microbial Communities'' model (CRWMS M&O 2000) which is part of the Engineered Barrier System Department (EBS) process modeling effort that eventually will feed future Total System Performance Assessment (TSPA) models. This calculation is being produced to replace MING model validation output that is effected by the supersession of DTN MO9909SPAMING1.003 using its replacement DTN MO0106SPAIDM01.034 so that the calculations currently found in the ''In-Drift Microbial Communities'' AMR (CRWMS M&O 2000) will be brought up to date. This set of calculations replaces the calculations contained in sections 6.7.2, 6.7.3 and Attachment I of CRWMS M&O (2000) As all of these calculations are created explicitly for model validation, the data qualification status of all inputs can be considered corroborative in accordance with AP-3.15Q. This work activity has been evaluated in accordance with the AP-2.21 procedure, ''Quality Determinations and Planning for Scientific, Engineering, and Regulatory Compliance Activities'', and is subject to QA controls (BSC 2001). The calculation is developed in accordance with the AP-3.12 procedure, Calculations, and prepared in accordance with the ''Technical Work Plan For EBS Department Modeling FY 01 Work Activities'' (BSC 2001) which includes controls for the management of electronic data.

  2. In-Drift Microbial Communities Model Validation Calculations

    International Nuclear Information System (INIS)

    Jolley, D.M.

    2001-01-01

    The objective and scope of this calculation is to create the appropriate parameter input for MING 1.0 (CSCI 30018 V1.0, CRWMS MandO 1998b) that will allow the testing of the results from the MING software code with both scientific measurements of microbial populations at the site and laboratory and with natural analogs to the site. This set of calculations provides results that will be used in model validation for the ''In-Drift Microbial Communities'' model (CRWMS MandO 2000) which is part of the Engineered Barrier System Department (EBS) process modeling effort that eventually will feed future Total System Performance Assessment (TSPA) models. This calculation is being produced to replace MING model validation output that is effected by the supersession of DTN MO9909SPAMING1.003 using its replacement DTN MO0106SPAIDM01.034 so that the calculations currently found in the ''In-Drift Microbial Communities'' AMR (CRWMS MandO 2000) will be brought up to date. This set of calculations replaces the calculations contained in sections 6.7.2, 6.7.3 and Attachment I of CRWMS MandO (2000) As all of these calculations are created explicitly for model validation, the data qualification status of all inputs can be considered corroborative in accordance with AP-3.15Q. This work activity has been evaluated in accordance with the AP-2.21 procedure, ''Quality Determinations and Planning for Scientific, Engineering, and Regulatory Compliance Activities'', and is subject to QA controls (BSC 2001). The calculation is developed in accordance with the AP-3.12 procedure, Calculations, and prepared in accordance with the ''Technical Work Plan For EBS Department Modeling FY 01 Work Activities'' (BSC 2001) which includes controls for the management of electronic data

  3. IN-DRIFT MICROBIAL COMMUNITIES MODEL VALIDATION CALCULATIONS

    International Nuclear Information System (INIS)

    D.M. Jolley

    2001-01-01

    The objective and scope of this calculation is to create the appropriate parameter input for MING 1.0 (CSCI 30018 V1.0, CRWMS M andO 1998b) that will allow the testing of the results from the MING software code with both scientific measurements of microbial populations at the site and laboratory and with natural analogs to the site. This set of calculations provides results that will be used in model validation for the ''In-Drift Microbial Communities'' model (CRWMS M andO 2000) which is part of the Engineered Barrier System Department (EBS) process modeling effort that eventually will feed future Total System Performance Assessment (TSPA) models. This calculation is being produced to replace MING model validation output that is effected by the supersession of DTN M09909SPAMINGl.003 using its replacement DTN M00106SPAIDMO 1.034 so that the calculations currently found in the ''In-Drift Microbial Communities'' AMR (CRWMS M andO 2000) will be brought up to date. This set of calculations replaces the calculations contained in sections 6.7.2, 6.7.3 and Attachment I of CRWMS M andO (2000) As all of these calculations are created explicitly for model validation, the data qualification status of all inputs can be considered corroborative in accordance with AP-3.15Q. This work activity has been evaluated in accordance with the AP-2.21 procedure, ''Quality Determinations and Planning for Scientific, Engineering, and Regulatory Compliance Activities'', and is subject to QA controls (BSC 2001). The calculation is developed in accordance with the AP-3.12 procedure, Calculations, and prepared in accordance with the ''Technical Work Plan For EBS Department Modeling FY 01 Work Activities'' (BSC 200 1) which includes controls for the management of electronic data

  4. Model validation and calibration based on component functions of model output

    International Nuclear Information System (INIS)

    Wu, Danqing; Lu, Zhenzhou; Wang, Yanping; Cheng, Lei

    2015-01-01

    The target in this work is to validate the component functions of model output between physical observation and computational model with the area metric. Based on the theory of high dimensional model representations (HDMR) of independent input variables, conditional expectations are component functions of model output, and the conditional expectations reflect partial information of model output. Therefore, the model validation of conditional expectations tells the discrepancy between the partial information of the computational model output and that of the observations. Then a calibration of the conditional expectations is carried out to reduce the value of model validation metric. After that, a recalculation of the model validation metric of model output is taken with the calibrated model parameters, and the result shows that a reduction of the discrepancy in the conditional expectations can help decrease the difference in model output. At last, several examples are employed to demonstrate the rationality and necessity of the methodology in case of both single validation site and multiple validation sites. - Highlights: • A validation metric of conditional expectations of model output is proposed. • HDRM explains the relationship of conditional expectations and model output. • An improved approach of parameter calibration updates the computational models. • Validation and calibration process are applied at single site and multiple sites. • Validation and calibration process show a superiority than existing methods

  5. Validation analysis of probabilistic models of dietary exposure to food additives.

    Science.gov (United States)

    Gilsenan, M B; Thompson, R L; Lambe, J; Gibney, M J

    2003-10-01

    The validity of a range of simple conceptual models designed specifically for the estimation of food additive intakes using probabilistic analysis was assessed. Modelled intake estimates that fell below traditional conservative point estimates of intake and above 'true' additive intakes (calculated from a reference database at brand level) were considered to be in a valid region. Models were developed for 10 food additives by combining food intake data, the probability of an additive being present in a food group and additive concentration data. Food intake and additive concentration data were entered as raw data or as a lognormal distribution, and the probability of an additive being present was entered based on the per cent brands or the per cent eating occasions within a food group that contained an additive. Since the three model components assumed two possible modes of input, the validity of eight (2(3)) model combinations was assessed. All model inputs were derived from the reference database. An iterative approach was employed in which the validity of individual model components was assessed first, followed by validation of full conceptual models. While the distribution of intake estimates from models fell below conservative intakes, which assume that the additive is present at maximum permitted levels (MPLs) in all foods in which it is permitted, intake estimates were not consistently above 'true' intakes. These analyses indicate the need for more complex models for the estimation of food additive intakes using probabilistic analysis. Such models should incorporate information on market share and/or brand loyalty.

  6. HEDR model validation plan

    International Nuclear Information System (INIS)

    Napier, B.A.; Gilbert, R.O.; Simpson, J.C.; Ramsdell, J.V. Jr.; Thiede, M.E.; Walters, W.H.

    1993-06-01

    The Hanford Environmental Dose Reconstruction (HEDR) Project has developed a set of computational ''tools'' for estimating the possible radiation dose that individuals may have received from past Hanford Site operations. This document describes the planned activities to ''validate'' these tools. In the sense of the HEDR Project, ''validation'' is a process carried out by comparing computational model predictions with field observations and experimental measurements that are independent of those used to develop the model

  7. Validation of models with multivariate output

    International Nuclear Information System (INIS)

    Rebba, Ramesh; Mahadevan, Sankaran

    2006-01-01

    This paper develops metrics for validating computational models with experimental data, considering uncertainties in both. A computational model may generate multiple response quantities and the validation experiment might yield corresponding measured values. Alternatively, a single response quantity may be predicted and observed at different spatial and temporal points. Model validation in such cases involves comparison of multiple correlated quantities. Multiple univariate comparisons may give conflicting inferences. Therefore, aggregate validation metrics are developed in this paper. Both classical and Bayesian hypothesis testing are investigated for this purpose, using multivariate analysis. Since, commonly used statistical significance tests are based on normality assumptions, appropriate transformations are investigated in the case of non-normal data. The methodology is implemented to validate an empirical model for energy dissipation in lap joints under dynamic loading

  8. QSAR models for predicting octanol/water and organic carbon/water partition coefficients of polychlorinated biphenyls.

    Science.gov (United States)

    Yu, S; Gao, S; Gan, Y; Zhang, Y; Ruan, X; Wang, Y; Yang, L; Shi, J

    2016-04-01

    Quantitative structure-property relationship modelling can be a valuable alternative method to replace or reduce experimental testing. In particular, some endpoints such as octanol-water (KOW) and organic carbon-water (KOC) partition coefficients of polychlorinated biphenyls (PCBs) are easier to predict and various models have been already developed. In this paper, two different methods, which are multiple linear regression based on the descriptors generated using Dragon software and hologram quantitative structure-activity relationships, were employed to predict suspended particulate matter (SPM) derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of 209 PCBs. The predictive ability of the derived models was validated using a test set. The performances of all these models were compared with EPI Suite™ software. The results indicated that the proposed models were robust and satisfactory, and could provide feasible and promising tools for the rapid assessment of the SPM derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of PCBs.

  9. 3D-QSAR studies and molecular docking on [5-(4-amino-1 H-benzoimidazol-2-yl)-furan-2-yl]-phosphonic acid derivatives as fructose-1,6-biphophatase inhibitors

    Science.gov (United States)

    Lan, Ping; Xie, Mei-Qi; Yao, Yue-Mei; Chen, Wan-Na; Chen, Wei-Min

    2010-12-01

    Fructose-1,6-biphophatase has been regarded as a novel therapeutic target for the treatment of type 2 diabetes mellitus (T2DM). 3D-QSAR and docking studies were performed on a series of [5-(4-amino-1 H-benzoimidazol-2-yl)-furan-2-yl]-phosphonic acid derivatives as fructose-1,6-biphophatase inhibitors. The CoMFA and CoMSIA models using thirty-seven molecules in the training set gave r cv 2 values of 0.614 and 0.598, r 2 values of 0.950 and 0.928, respectively. The external validation indicated that our CoMFA and CoMSIA models possessed high predictive powers with r 0 2 values of 0.994 and 0.994, r m 2 values of 0.751 and 0.690, respectively. Molecular docking studies revealed that a phosphonic group was essential for binding to the receptor, and some key features were also identified. A set of forty new analogues were designed by utilizing the results revealed in the present study, and were predicted with significantly improved potencies in the developed models. The findings can be quite useful to aid the designing of new fructose-1,6-biphophatase inhibitors with improved biological response.

  10. Making Validated Educational Models Central in Preschool Standards.

    Science.gov (United States)

    Schweinhart, Lawrence J.

    This paper presents some ideas to preschool educators and policy makers about how to make validated educational models central in standards for preschool education and care programs that are available to all 3- and 4-year-olds. Defining an educational model as a coherent body of program practices, curriculum content, program and child, and teacher…

  11. Model Validation in Ontology Based Transformations

    Directory of Open Access Journals (Sweden)

    Jesús M. Almendros-Jiménez

    2012-10-01

    Full Text Available Model Driven Engineering (MDE is an emerging approach of software engineering. MDE emphasizes the construction of models from which the implementation should be derived by applying model transformations. The Ontology Definition Meta-model (ODM has been proposed as a profile for UML models of the Web Ontology Language (OWL. In this context, transformations of UML models can be mapped into ODM/OWL transformations. On the other hand, model validation is a crucial task in model transformation. Meta-modeling permits to give a syntactic structure to source and target models. However, semantic requirements have to be imposed on source and target models. A given transformation will be sound when source and target models fulfill the syntactic and semantic requirements. In this paper, we present an approach for model validation in ODM based transformations. Adopting a logic programming based transformational approach we will show how it is possible to transform and validate models. Properties to be validated range from structural and semantic requirements of models (pre and post conditions to properties of the transformation (invariants. The approach has been applied to a well-known example of model transformation: the Entity-Relationship (ER to Relational Model (RM transformation.

  12. Novel Uses of In Vitro Data to Develop Quantitative Biological Activity Relationship Models for in Vivo Carcinogenicity Prediction.

    Science.gov (United States)

    Pradeep, Prachi; Povinelli, Richard J; Merrill, Stephen J; Bozdag, Serdar; Sem, Daniel S

    2015-04-01

    The availability of large in vitro datasets enables better insight into the mode of action of chemicals and better identification of potential mechanism(s) of toxicity. Several studies have shown that not all in vitro assays can contribute as equal predictors of in vivo carcinogenicity for development of hybrid Quantitative Structure Activity Relationship (QSAR) models. We propose two novel approaches for the use of mechanistically relevant in vitro assay data in the identification of relevant biological descriptors and development of Quantitative Biological Activity Relationship (QBAR) models for carcinogenicity prediction. We demonstrate that in vitro assay data can be used to develop QBAR models for in vivo carcinogenicity prediction via two case studies corroborated with firm scientific rationale. The case studies demonstrate the similarities between QBAR and QSAR modeling in: (i) the selection of relevant descriptors to be used in the machine learning algorithm, and (ii) the development of a computational model that maps chemical or biological descriptors to a toxic endpoint. The results of both the case studies show: (i) improved accuracy and sensitivity which is especially desirable under regulatory requirements, and (ii) overall adherence with the OECD/REACH guidelines. Such mechanism based models can be used along with QSAR models for prediction of mechanistically complex toxic endpoints. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. A QSAR/QSTR Study on the Environmental Health Impact by the Rocket Fuel 1,1-Dimethyl Hydrazine and its Transformation Products

    Directory of Open Access Journals (Sweden)

    Lars Carlsen

    2008-01-01

    Full Text Available QSAR/QSTR modelling constitutes an attractive approach to preliminary assessment of the impact on environmental health by a primary pollutant and the suite of transformation products that may be persistent in and toxic to the environment. The present paper studies the impact on environmental health by residuals of the rocket fuel 1,1-dimethyl hydrazine (heptyl and its transformation products. The transformation products, comprising a variety of nitrogen containing compounds are suggested all to possess a significant migration potential. In all cases the compounds were found being rapidly biodegradable. However, unexpected low microbial activity may cause significant changes. None of the studied compounds appear to be bioaccumulating. Apart from substances with an intact hydrazine structure or hydrazone structure the transformation products in general display rather low environmental toxicities. Thus, it is concluded that apparently further attention should be given to tri- and tetramethyl hydrazine and 1-formyl 2,2-dimethyl hydrazine as well as to the hydrazones of formaldehyde and acetaldehyde as these five compounds may contribute to the overall environmental toxicity of residual rocket fuel and its transformation products.

  14. Materials Informatics: Statistical Modeling in Material Science.

    Science.gov (United States)

    Yosipof, Abraham; Shimanovich, Klimentiy; Senderowitz, Hanoch

    2016-12-01

    Material informatics is engaged with the application of informatic principles to materials science in order to assist in the discovery and development of new materials. Central to the field is the application of data mining techniques and in particular machine learning approaches, often referred to as Quantitative Structure Activity Relationship (QSAR) modeling, to derive predictive models for a variety of materials-related "activities". Such models can accelerate the development of new materials with favorable properties and provide insight into the factors governing these properties. Here we provide a comparison between medicinal chemistry/drug design and materials-related QSAR modeling and highlight the importance of developing new, materials-specific descriptors. We survey some of the most recent QSAR models developed in materials science with focus on energetic materials and on solar cells. Finally we present new examples of material-informatic analyses of solar cells libraries produced from metal oxides using combinatorial material synthesis. Different analyses lead to interesting physical insights as well as to the design of new cells with potentially improved photovoltaic parameters. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. Preliminary validation of a Monte Carlo model for IMRT fields

    International Nuclear Information System (INIS)

    Wright, Tracy; Lye, Jessica; Mohammadi, Mohammad

    2011-01-01

    Full text: A Monte Carlo model of an Elekta linac, validated for medium to large (10-30 cm) symmetric fields, has been investigated for small, irregular and asymmetric fields suitable for IMRT treatments. The model has been validated with field segments using radiochromic film in solid water. The modelled positions of the multileaf collimator (MLC) leaves have been validated using EBT film, In the model, electrons with a narrow energy spectrum are incident on the target and all components of the linac head are included. The MLC is modelled using the EGSnrc MLCE component module. For the validation, a number of single complex IMRT segments with dimensions approximately 1-8 cm were delivered to film in solid water (see Fig, I), The same segments were modelled using EGSnrc by adjusting the MLC leaf positions in the model validated for 10 cm symmetric fields. Dose distributions along the centre of each MLC leaf as determined by both methods were compared. A picket fence test was also performed to confirm the MLC leaf positions. 95% of the points in the modelled dose distribution along the leaf axis agree with the film measurement to within 1%/1 mm for dose difference and distance to agreement. Areas of most deviation occur in the penumbra region. A system has been developed to calculate the MLC leaf positions in the model for any planned field size.

  16. Grid-based Continual Analysis of Molecular Interior for Drug Discovery, QSAR and QSPR.

    Science.gov (United States)

    Potemkin, Andrey V; Grishina, Maria A; Potemkin, Vladimir A

    2017-01-01

    In 1979, R.D.Cramer and M.Milne made a first realization of 3D comparison of molecules by aligning them in space and by mapping their molecular fields to a 3D grid. Further, this approach was developed as the DYLOMMS (Dynamic Lattice- Oriented Molecular Modelling System) approach. In 1984, H.Wold and S.Wold proposed the use of partial least squares (PLS) analysis, instead of principal component analysis, to correlate the field values with biological activities. Then, in 1988, the method which was called CoMFA (Comparative Molecular Field Analysis) was introduced and the appropriate software became commercially available. Since 1988, a lot of 3D QSAR methods, algorithms and their modifications are introduced for solving of virtual drug discovery problems (e.g., CoMSIA, CoMMA, HINT, HASL, GOLPE, GRID, PARM, Raptor, BiS, CiS, ConGO,). All the methods can be divided into two groups (classes):1. Methods studying the exterior of molecules; 2) Methods studying the interior of molecules. A series of grid-based computational technologies for Continual Molecular Interior analysis (CoMIn) are invented in the current paper. The grid-based analysis is fulfilled by means of a lattice construction analogously to many other grid-based methods. The further continual elucidation of molecular structure is performed in various ways. (i) In terms of intermolecular interactions potentials. This can be represented as a superposition of Coulomb, Van der Waals interactions and hydrogen bonds. All the potentials are well known continual functions and their values can be determined in all lattice points for a molecule. (ii) In the terms of quantum functions such as electron density distribution, Laplacian and Hamiltonian of electron density distribution, potential energy distribution, the highest occupied and the lowest unoccupied molecular orbitals distribution and their superposition. To reduce time of calculations using quantum methods based on the first principles, an original quantum

  17. Systematic validation of non-equilibrium thermochemical models using Bayesian inference

    KAUST Repository

    Miki, Kenji

    2015-10-01

    © 2015 Elsevier Inc. The validation process proposed by Babuška et al. [1] is applied to thermochemical models describing post-shock flow conditions. In this validation approach, experimental data is involved only in the calibration of the models, and the decision process is based on quantities of interest (QoIs) predicted on scenarios that are not necessarily amenable experimentally. Moreover, uncertainties present in the experimental data, as well as those resulting from an incomplete physical model description, are propagated to the QoIs. We investigate four commonly used thermochemical models: a one-temperature model (which assumes thermal equilibrium among all inner modes), and two-temperature models developed by Macheret et al. [2], Marrone and Treanor [3], and Park [4]. Up to 16 uncertain parameters are estimated using Bayesian updating based on the latest absolute volumetric radiance data collected at the Electric Arc Shock Tube (EAST) installed inside the NASA Ames Research Center. Following the solution of the inverse problems, the forward problems are solved in order to predict the radiative heat flux, QoI, and examine the validity of these models. Our results show that all four models are invalid, but for different reasons: the one-temperature model simply fails to reproduce the data while the two-temperature models exhibit unacceptably large uncertainties in the QoI predictions.

  18. Discovery of Antibiotics-derived Polymers for Gene Delivery using Combinatorial Synthesis and Cheminformatics Modeling

    Science.gov (United States)

    Potta, Thrimoorthy; Zhen, Zhuo; Grandhi, Taraka Sai Pavan; Christensen, Matthew D.; Ramos, James; Breneman, Curt M.; Rege, Kaushal

    2014-01-01

    We describe the combinatorial synthesis and cheminformatics modeling of aminoglycoside antibiotics-derived polymers for transgene delivery and expression. Fifty-six polymers were synthesized by polymerizing aminoglycosides with diglycidyl ether cross-linkers. Parallel screening resulted in identification of several lead polymers that resulted in high transgene expression levels in cells. The role of polymer physicochemical properties in determining efficacy of transgene expression was investigated using Quantitative Structure-Activity Relationship (QSAR) cheminformatics models based on Support Vector Regression (SVR) and ‘building block’ polymer structures. The QSAR model exhibited high predictive ability, and investigation of descriptors in the model, using molecular visualization and correlation plots, indicated that physicochemical attributes related to both, aminoglycosides and diglycidyl ethers facilitated transgene expression. This work synergistically combines combinatorial synthesis and parallel screening with cheminformatics-based QSAR models for discovery and physicochemical elucidation of effective antibiotics-derived polymers for transgene delivery in medicine and biotechnology. PMID:24331709

  19. Assessing Discriminative Performance at External Validation of Clinical Prediction Models.

    Directory of Open Access Journals (Sweden)

    Daan Nieboer

    Full Text Available External validation studies are essential to study the generalizability of prediction models. Recently a permutation test, focusing on discrimination as quantified by the c-statistic, was proposed to judge whether a prediction model is transportable to a new setting. We aimed to evaluate this test and compare it to previously proposed procedures to judge any changes in c-statistic from development to external validation setting.We compared the use of the permutation test to the use of benchmark values of the c-statistic following from a previously proposed framework to judge transportability of a prediction model. In a simulation study we developed a prediction model with logistic regression on a development set and validated them in the validation set. We concentrated on two scenarios: 1 the case-mix was more heterogeneous and predictor effects were weaker in the validation set compared to the development set, and 2 the case-mix was less heterogeneous in the validation set and predictor effects were identical in the validation and development set. Furthermore we illustrated the methods in a case study using 15 datasets of patients suffering from traumatic brain injury.The permutation test indicated that the validation and development set were homogenous in scenario 1 (in almost all simulated samples and heterogeneous in scenario 2 (in 17%-39% of simulated samples. Previously proposed benchmark values of the c-statistic and the standard deviation of the linear predictors correctly pointed at the more heterogeneous case-mix in scenario 1 and the less heterogeneous case-mix in scenario 2.The recently proposed permutation test may provide misleading results when externally validating prediction models in the presence of case-mix differences between the development and validation population. To correctly interpret the c-statistic found at external validation it is crucial to disentangle case-mix differences from incorrect regression coefficients.

  20. Molecular determinants of juvenile hormone action as revealed by 3D QSAR analysis in Drosophila.

    Directory of Open Access Journals (Sweden)

    Denisa Liszeková

    Full Text Available BACKGROUND: Postembryonic development, including metamorphosis, of many animals is under control of hormones. In Drosophila and other insects these developmental transitions are regulated by the coordinate action of two principal hormones, the steroid ecdysone and the sesquiterpenoid juvenile hormone (JH. While the mode of ecdysone action is relatively well understood, the molecular mode of JH action remains elusive. METHODOLOGY/PRINCIPAL FINDINGS: To gain more insights into the molecular mechanism of JH action, we have tested the biological activity of 86 structurally diverse JH agonists in Drosophila melanogaster. The results were evaluated using 3D QSAR analyses involving CoMFA and CoMSIA procedures. Using this approach we have generated both computer-aided and species-specific pharmacophore fingerprints of JH and its agonists, which revealed that the most active compounds must possess an electronegative atom (oxygen or nitrogen at both ends of the molecule. When either of these electronegative atoms are replaced by carbon or the distance between them is shorter than 11.5 A or longer than 13.5 A, their biological activity is dramatically decreased. The presence of an electron-deficient moiety in the middle of the JH agonist is also essential for high activity. CONCLUSIONS/SIGNIFICANCE: The information from 3D QSAR provides guidelines and mechanistic scope for identification of steric and electrostatic properties as well as donor and acceptor hydrogen-bonding that are important features of the ligand-binding cavity of a JH target protein. In order to refine the pharmacophore analysis and evaluate the outcomes of the CoMFA and CoMSIA study we used pseudoreceptor modeling software PrGen to generate a putative binding site surrogate that is composed of eight amino acid residues corresponding to the defined molecular interactions.

  1. Comparative analysis of pharmaceuticals versus industrial chemicals acute aquatic toxicity classification according to the United Nations classification system for chemicals. Assessment of the (Q)SAR predictability of pharmaceuticals acute aquatic toxicity and their predominant acute toxic mode-of-action

    DEFF Research Database (Denmark)

    Sanderson, Hans; Thomsen, Marianne

    2009-01-01

    data. Pharmaceuticals were found to be more frequent than industrial chemicals in GHS category III. Acute toxicity was predictable (>92%) using a generic (Q)SAR ((Quantitative) Structure Activity Relationship) suggesting a narcotic MOA. Analysis of model prediction error suggests that 68...

  2. Estimation of the applicability domain of kernel-based machine learning models for virtual screening

    Directory of Open Access Journals (Sweden)

    Fechner Nikolas

    2010-03-01

    Full Text Available Abstract Background The virtual screening of large compound databases is an important application of structural-activity relationship models. Due to the high structural diversity of these data sets, it is impossible for machine learning based QSAR models, which rely on a specific training set, to give reliable results for all compounds. Thus, it is important to consider the subset of the chemical space in which the model is applicable. The approaches to this problem that have been published so far mostly use vectorial descriptor representations to define this domain of applicability of the model. Unfortunately, these cannot be extended easily to structured kernel-based machine learning models. For this reason, we propose three approaches to estimate the domain of applicability of a kernel-based QSAR model. Results We evaluated three kernel-based applicability domain estimations using three different structured kernels on three virtual screening tasks. Each experiment consisted of the training of a kernel-based QSAR model using support vector regression and the ranking of a disjoint screening data set according to the predicted activity. For each prediction, the applicability of the model for the respective compound is quantitatively described using a score obtained by an applicability domain formulation. The suitability of the applicability domain estimation is evaluated by comparing the model performance on the subsets of the screening data sets obtained by different thresholds for the applicability scores. This comparison indicates that it is possible to separate the part of the chemspace, in which the model gives reliable predictions, from the part consisting of structures too dissimilar to the training set to apply the model successfully. A closer inspection reveals that the virtual screening performance of the model is considerably improved if half of the molecules, those with the lowest applicability scores, are omitted from the screening

  3. Estimation of the applicability domain of kernel-based machine learning models for virtual screening.

    Science.gov (United States)

    Fechner, Nikolas; Jahn, Andreas; Hinselmann, Georg; Zell, Andreas

    2010-03-11

    The virtual screening of large compound databases is an important application of structural-activity relationship models. Due to the high structural diversity of these data sets, it is impossible for machine learning based QSAR models, which rely on a specific training set, to give reliable results for all compounds. Thus, it is important to consider the subset of the chemical space in which the model is applicable. The approaches to this problem that have been published so far mostly use vectorial descriptor representations to define this domain of applicability of the model. Unfortunately, these cannot be extended easily to structured kernel-based machine learning models. For this reason, we propose three approaches to estimate the domain of applicability of a kernel-based QSAR model. We evaluated three kernel-based applicability domain estimations using three different structured kernels on three virtual screening tasks. Each experiment consisted of the training of a kernel-based QSAR model using support vector regression and the ranking of a disjoint screening data set according to the predicted activity. For each prediction, the applicability of the model for the respective compound is quantitatively described using a score obtained by an applicability domain formulation. The suitability of the applicability domain estimation is evaluated by comparing the model performance on the subsets of the screening data sets obtained by different thresholds for the applicability scores. This comparison indicates that it is possible to separate the part of the chemspace, in which the model gives reliable predictions, from the part consisting of structures too dissimilar to the training set to apply the model successfully. A closer inspection reveals that the virtual screening performance of the model is considerably improved if half of the molecules, those with the lowest applicability scores, are omitted from the screening. The proposed applicability domain formulations

  4. System Advisor Model: Flat Plate Photovoltaic Performance Modeling Validation Report

    Energy Technology Data Exchange (ETDEWEB)

    Freeman, Janine [National Renewable Energy Lab. (NREL), Golden, CO (United States); Whitmore, Jonathan [National Renewable Energy Lab. (NREL), Golden, CO (United States); Kaffine, Leah [National Renewable Energy Lab. (NREL), Golden, CO (United States); Blair, Nate [National Renewable Energy Lab. (NREL), Golden, CO (United States); Dobos, Aron P. [National Renewable Energy Lab. (NREL), Golden, CO (United States)

    2013-12-01

    The System Advisor Model (SAM) is a free software tool that performs detailed analysis of both system performance and system financing for a variety of renewable energy technologies. This report provides detailed validation of the SAM flat plate photovoltaic performance model by comparing SAM-modeled PV system generation data to actual measured production data for nine PV systems ranging from 75 kW to greater than 25 MW in size. The results show strong agreement between SAM predictions and field data, with annualized prediction error below 3% for all fixed tilt cases and below 8% for all one axis tracked cases. The analysis concludes that snow cover and system outages are the primary sources of disagreement, and other deviations resulting from seasonal biases in the irradiation models and one axis tracking issues are discussed in detail.

  5. Benchmark validation of statistical models: Application to mediation analysis of imagery and memory.

    Science.gov (United States)

    MacKinnon, David P; Valente, Matthew J; Wurpts, Ingrid C

    2018-03-29

    This article describes benchmark validation, an approach to validating a statistical model. According to benchmark validation, a valid model generates estimates and research conclusions consistent with a known substantive effect. Three types of benchmark validation-(a) benchmark value, (b) benchmark estimate, and (c) benchmark effect-are described and illustrated with examples. Benchmark validation methods are especially useful for statistical models with assumptions that are untestable or very difficult to test. Benchmark effect validation methods were applied to evaluate statistical mediation analysis in eight studies using the established effect that increasing mental imagery improves recall of words. Statistical mediation analysis led to conclusions about mediation that were consistent with established theory that increased imagery leads to increased word recall. Benchmark validation based on established substantive theory is discussed as a general way to investigate characteristics of statistical models and a complement to mathematical proof and statistical simulation. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  6. QSAR Study of Insecticides of Phthalamide Derivatives Using Multiple Linear Regression and Artificial Neural Network Methods

    Directory of Open Access Journals (Sweden)

    Adi Syahputra

    2014-03-01

    Full Text Available Quantitative structure activity relationship (QSAR for 21 insecticides of phthalamides containing hydrazone (PCH was studied using multiple linear regression (MLR, principle component regression (PCR and artificial neural network (ANN. Five descriptors were included in the model for MLR and ANN analysis, and five latent variables obtained from principle component analysis (PCA were used in PCR analysis. Calculation of descriptors was performed using semi-empirical PM6 method. ANN analysis was found to be superior statistical technique compared to the other methods and gave a good correlation between descriptors and activity (r2 = 0.84. Based on the obtained model, we have successfully designed some new insecticides with higher predicted activity than those of previously synthesized compounds, e.g.2-(decalinecarbamoyl-5-chloro-N’-((5-methylthiophen-2-ylmethylene benzohydrazide, 2-(decalinecarbamoyl-5-chloro-N’-((thiophen-2-yl-methylene benzohydrazide and 2-(decaline carbamoyl-N’-(4-fluorobenzylidene-5-chlorobenzohydrazide with predicted log LC50 of 1.640, 1.672, and 1.769 respectively.

  7. A broad view of model validation

    International Nuclear Information System (INIS)

    Tsang, C.F.

    1989-10-01

    The safety assessment of a nuclear waste repository requires the use of models. Such models need to be validated to ensure, as much as possible, that they are a good representation of the actual processes occurring in the real system. In this paper we attempt to take a broad view by reviewing step by step the modeling process and bringing out the need to validating every step of this process. This model validation includes not only comparison of modeling results with data from selected experiments, but also evaluation of procedures for the construction of conceptual models and calculational models as well as methodologies for studying data and parameter correlation. The need for advancing basic scientific knowledge in related fields, for multiple assessment groups, and for presenting our modeling efforts in open literature to public scrutiny is also emphasized. 16 refs

  8. Eco-friendly synthesis, in vitro anti-proliferative evaluation, and 3D-QSAR analysis of a novel series of monocationic 2-aryl/heteroaryl-substituted 6-(2-imidazolinyl)benzothiazole mesylates.

    Science.gov (United States)

    Racané, Livio; Ptiček, Lucija; Sedić, Mirela; Grbčić, Petra; Kraljević Pavelić, Sandra; Bertoša, Branimir; Sović, Irena; Karminski-Zamola, Grace

    2018-04-17

    Herein, we describe the synthesis of twenty-one novel water-soluble monocationic 2-aryl/heteroaryl-substituted 6-(2-imidazolinyl)benzothiazole mesylates 3a-3u and present the results of their anti-proliferative assays. Efficient syntheses were achieved by three complementary simple two-step synthetic protocols based on the condensation reaction of aryl/heteroaryl carbaldehydes or carboxylic acid. We developed an eco-friendly synthetic protocol using glycerol as green solvent, particularly appropriate for the condensation of thermally and acid-sensitive heterocycles such as furan, benzofuran, pyrrole, and indole. Screening of anti-proliferative activity was performed on four human tumour cell lines in vitro including pancreatic cancer (CFPAC-1), metastatic colon cancer (SW620), hepatocellular carcinoma (HepG2), and cervical cancer (HeLa), as well as in normal human fibroblast cell lines. All tested compounds showed strong to moderate anti-proliferative activity on tested cell lines depending on the structure containing aryl/heteroaryl moiety coupled to 6-(2-imidazolinyl)benzothiazole moiety. The most potent cytostatic effects on all tested cell lines with [Formula: see text] values ranging from 0.1 to 3.70 [Formula: see text] were observed for benzothiazoles substituted with naphthalene-2-yl 3c, benzofuran-2-yl 3e, indole-3-yl 3j, indole-2-yl 3k, quinoline-2-yl 3s, and quinoline-3-yl 3t and derivatives substituted with phenyl 3a, naphthalene-1-yl 3b, benzothiazole-2-yl 3g, benzothiazole-6-yl 3h, N-methylindole-3-yl 3l, benzimidazole-2-yl 3n, benzimidazole-5(6)-yl 3o, and quinolone-4-yl 3u with [Formula: see text] values ranging from 1.1 to 29.1 [Formula: see text]. Based on obtained anti-proliferative activities, 3D-QSAR models for five cell lines were derived. Molecular volume, molecular surface, the sum of hydrophobic surface areas, molecular mass, and possibility of making dispersion forces were identified by QSAR analyses as molecular properties that are

  9. Cross validation for the classical model of structured expert judgment

    International Nuclear Information System (INIS)

    Colson, Abigail R.; Cooke, Roger M.

    2017-01-01

    We update the 2008 TU Delft structured expert judgment database with data from 33 professionally contracted Classical Model studies conducted between 2006 and March 2015 to evaluate its performance relative to other expert aggregation models. We briefly review alternative mathematical aggregation schemes, including harmonic weighting, before focusing on linear pooling of expert judgments with equal weights and performance-based weights. Performance weighting outperforms equal weighting in all but 1 of the 33 studies in-sample. True out-of-sample validation is rarely possible for Classical Model studies, and cross validation techniques that split calibration questions into a training and test set are used instead. Performance weighting incurs an “out-of-sample penalty” and its statistical accuracy out-of-sample is lower than that of equal weighting. However, as a function of training set size, the statistical accuracy of performance-based combinations reaches 75% of the equal weight value when the training set includes 80% of calibration variables. At this point the training set is sufficiently powerful to resolve differences in individual expert performance. The information of performance-based combinations is double that of equal weighting when the training set is at least 50% of the set of calibration variables. Previous out-of-sample validation work used a Total Out-of-Sample Validity Index based on all splits of the calibration questions into training and test subsets, which is expensive to compute and includes small training sets of dubious value. As an alternative, we propose an Out-of-Sample Validity Index based on averaging the product of statistical accuracy and information over all training sets sized at 80% of the calibration set. Performance weighting outperforms equal weighting on this Out-of-Sample Validity Index in 26 of the 33 post-2006 studies; the probability of 26 or more successes on 33 trials if there were no difference between performance

  10. Glycogen synthase kinase-3 inhibition by 3-anilino-4-phenylmaleimides: insights from 3D-QSAR and docking

    Science.gov (United States)

    Prasanna, Sivaprakasam; Daga, Pankaj R.; Xie, Aihua; Doerksen, Robert J.

    2009-02-01

    Glycogen synthase kinase-3, a serine/threonine kinase, has been implicated in a wide variety of pathological conditions such as diabetes, Alzheimer's disease, stroke, bipolar disorder, malaria and cancer. Herein we report 3D-QSAR analyses using CoMFA and CoMSIA and molecular docking studies on 3-anilino-4-phenylmaleimides as GSK-3α inhibitors, in order to better understand the mechanism of action and structure-activity relationship of these compounds. Comparison of the active site residues of GSK-3α and GSK-3β isoforms shows that all the key amino acids involved in polar interactions with the maleimides for the β isoform are the same in the α isoform, except that Asp133 in the β isoform is replaced by Glu196 in the α isoform. We prepared a homology model for GSK-3α, and showed that the change from Asp to Glu should not affect maleimide binding significantly. Docking studies revealed the binding poses of three subclasses of these ligands, namely anilino, N-methylanilino and indoline derivatives, within the active site of the β isoform, and helped to explain the difference in their inhibitory activity.

  11. An independent verification and validation of the Future Theater Level Model conceptual model

    Energy Technology Data Exchange (ETDEWEB)

    Hartley, D.S. III; Kruse, K.L.; Martellaro, A.J.; Packard, S.L.; Thomas, B. Jr.; Turley, V.K.

    1994-08-01

    This report describes the methodology and results of independent verification and validation performed on a combat model in its design stage. The combat model is the Future Theater Level Model (FTLM), under development by The Joint Staff/J-8. J-8 has undertaken its development to provide an analysis tool that addresses the uncertainties of combat more directly than previous models and yields more rapid study results. The methodology adopted for this verification and validation consisted of document analyses. Included were detailed examination of the FTLM design documents (at all stages of development), the FTLM Mission Needs Statement, and selected documentation for other theater level combat models. These documents were compared to assess the FTLM as to its design stage, its purpose as an analytical combat model, and its capabilities as specified in the Mission Needs Statement. The conceptual design passed those tests. The recommendations included specific modifications as well as a recommendation for continued development. The methodology is significant because independent verification and validation have not been previously reported as being performed on a combat model in its design stage. The results are significant because The Joint Staff/J-8 will be using the recommendations from this study in determining whether to proceed with develop of the model.

  12. 2D-QSAR and 3D-QSAR/CoMSIA Studies on a Series of (R-2-((2-(1H-Indol-2-ylethylamino-1-Phenylethan-1-ol with Human β3-Adrenergic Activity

    Directory of Open Access Journals (Sweden)

    Gastón Apablaza

    2017-03-01

    Full Text Available The β3 adrenergic receptor is raising as an important drug target for the treatment of pathologies such as diabetes, obesity, depression, and cardiac diseases among others. Several attempts to obtain selective and high affinity ligands have been made. Currently, Mirabegron is the only available drug on the market that targets this receptor approved for the treatment of overactive bladder. However, the FDA (Food and Drug Administration in USA and the MHRA (Medicines and Healthcare products Regulatory Agency in UK have made reports of potentially life-threatening side effects associated with the administration of Mirabegron, casting doubts on the continuity of this compound. Therefore, it is of utmost importance to gather information for the rational design and synthesis of new β3 adrenergic ligands. Herein, we present the first combined 2D-QSAR (two-dimensional Quantitative Structure-Activity Relationship and 3D-QSAR/CoMSIA (three-dimensional Quantitative Structure-Activity Relationship/Comparative Molecular Similarity Index Analysis study on a series of potent β3 adrenergic agonists of indole-alkylamine structure. We found a series of changes that can be made in the steric, hydrogen-bond donor and acceptor, lipophilicity and molar refractivity properties of the compounds to generate new promising molecules. Finally, based on our analysis, a summary and a regiospecific description of the requirements for improving β3 adrenergic activity is given.

  13. Model performance analysis and model validation in logistic regression

    Directory of Open Access Journals (Sweden)

    Rosa Arboretti Giancristofaro

    2007-10-01

    Full Text Available In this paper a new model validation procedure for a logistic regression model is presented. At first, we illustrate a brief review of different techniques of model validation. Next, we define a number of properties required for a model to be considered "good", and a number of quantitative performance measures. Lastly, we describe a methodology for the assessment of the performance of a given model by using an example taken from a management study.

  14. Calibration and validation of coarse-grained models of atomic systems: application to semiconductor manufacturing

    Science.gov (United States)

    Farrell, Kathryn; Oden, J. Tinsley

    2014-07-01

    Coarse-grained models of atomic systems, created by aggregating groups of atoms into molecules to reduce the number of degrees of freedom, have been used for decades in important scientific and technological applications. In recent years, interest in developing a more rigorous theory for coarse graining and in assessing the predictivity of coarse-grained models has arisen. In this work, Bayesian methods for the calibration and validation of coarse-grained models of atomistic systems in thermodynamic equilibrium are developed. For specificity, only configurational models of systems in canonical ensembles are considered. Among major challenges in validating coarse-grained models are (1) the development of validation processes that lead to information essential in establishing confidence in the model's ability predict key quantities of interest and (2), above all, the determination of the coarse-grained model itself; that is, the characterization of the molecular architecture, the choice of interaction potentials and thus parameters, which best fit available data. The all-atom model is treated as the "ground truth," and it provides the basis with respect to which properties of the coarse-grained model are compared. This base all-atom model is characterized by an appropriate statistical mechanics framework in this work by canonical ensembles involving only configurational energies. The all-atom model thus supplies data for Bayesian calibration and validation methods for the molecular model. To address the first challenge, we develop priors based on the maximum entropy principle and likelihood functions based on Gaussian approximations of the uncertainties in the parameter-to-observation error. To address challenge (2), we introduce the notion of model plausibilities as a means for model selection. This methodology provides a powerful approach toward constructing coarse-grained models which are most plausible for given all-atom data. We demonstrate the theory and

  15. Establishing model credibility involves more than validation

    International Nuclear Information System (INIS)

    Kirchner, T.

    1991-01-01

    One widely used definition of validation is that the quantitative test of the performance of a model through the comparison of model predictions to independent sets of observations from the system being simulated. The ability to show that the model predictions compare well with observations is often thought to be the most rigorous test that can be used to establish credibility for a model in the scientific community. However, such tests are only part of the process used to establish credibility, and in some cases may be either unnecessary or misleading. Naylor and Finger extended the concept of validation to include the establishment of validity for the postulates embodied in the model and the test of assumptions used to select postulates for the model. Validity of postulates is established through concurrence by experts in the field of study that the mathematical or conceptual model contains the structural components and mathematical relationships necessary to adequately represent the system with respect to the goals for the model. This extended definition of validation provides for consideration of the structure of the model, not just its performance, in establishing credibility. Evaluation of a simulation model should establish the correctness of the code and the efficacy of the model within its domain of applicability. (24 refs., 6 figs.)

  16. Towards policy relevant environmental modeling: contextual validity and pragmatic models

    Science.gov (United States)

    Miles, Scott B.

    2000-01-01

    "What makes for a good model?" In various forms, this question is a question that, undoubtedly, many people, businesses, and institutions ponder with regards to their particular domain of modeling. One particular domain that is wrestling with this question is the multidisciplinary field of environmental modeling. Examples of environmental models range from models of contaminated ground water flow to the economic impact of natural disasters, such as earthquakes. One of the distinguishing claims of the field is the relevancy of environmental modeling to policy and environment-related decision-making in general. A pervasive view by both scientists and decision-makers is that a "good" model is one that is an accurate predictor. Thus, determining whether a model is "accurate" or "correct" is done by comparing model output to empirical observations. The expected outcome of this process, usually referred to as "validation" or "ground truthing," is a stamp on the model in question of "valid" or "not valid" that serves to indicate whether or not the model will be reliable before it is put into service in a decision-making context. In this paper, I begin by elaborating on the prevailing view of model validation and why this view must change. Drawing from concepts coming out of the studies of science and technology, I go on to propose a contextual view of validity that can overcome the problems associated with "ground truthing" models as an indicator of model goodness. The problem of how we talk about and determine model validity has much to do about how we perceive the utility of environmental models. In the remainder of the paper, I argue that we should adopt ideas of pragmatism in judging what makes for a good model and, in turn, developing good models. From such a perspective of model goodness, good environmental models should facilitate communication, convey—not bury or "eliminate"—uncertainties, and, thus, afford the active building of consensus decisions, instead

  17. QSAR studies of the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by multiple linear regression (MLR) and support vector machine (SVM).

    Science.gov (United States)

    Qin, Zijian; Wang, Maolin; Yan, Aixia

    2017-07-01

    In this study, quantitative structure-activity relationship (QSAR) models using various descriptor sets and training/test set selection methods were explored to predict the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by using a multiple linear regression (MLR) and a support vector machine (SVM) method. 512 HCV NS3/4A protease inhibitors and their IC 50 values which were determined by the same FRET assay were collected from the reported literature to build a dataset. All the inhibitors were represented with selected nine global and 12 2D property-weighted autocorrelation descriptors calculated from the program CORINA Symphony. The dataset was divided into a training set and a test set by a random and a Kohonen's self-organizing map (SOM) method. The correlation coefficients (r 2 ) of training sets and test sets were 0.75 and 0.72 for the best MLR model, 0.87 and 0.85 for the best SVM model, respectively. In addition, a series of sub-dataset models were also developed. The performances of all the best sub-dataset models were better than those of the whole dataset models. We believe that the combination of the best sub- and whole dataset SVM models can be used as reliable lead designing tools for new NS3/4A protease inhibitors scaffolds in a drug discovery pipeline. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. The Interplay between QSAR/QSPR Studiesand Partial Order Ranking and Formal Concept Analyses

    Directory of Open Access Journals (Sweden)

    Lars Carlsen

    2009-04-01

    Full Text Available The often observed scarcity of physical-chemical and well as toxicological data hampers the assessment of potentially hazardous chemicals released to the environment. In such cases Quantitative Structure-Activity Relationships/Quantitative Structure-Property Relationships (QSAR/QSPR constitute an obvious alternative for rapidly, effectively and inexpensively generatng missing experimental values. However, typically further treatment of the data appears necessary, e.g., to elucidate the possible relations between the single compounds as well as implications and associations between the various parameters used for the combined characterization of the compounds under investigation. In the present paper the application of QSAR/QSPR in combination with Partial Order Ranking (POR methodologies will be reviewed and new aspects using Formal Concept Analysis (FCA will be introduced. Where POR constitutes an attractive method for, e.g., prioritizing a series of chemical substances based on a simultaneous inclusion of a range of parameters, FCA gives important information on the implications associations between the parameters. The combined approach thus constitutes an attractive method to a preliminary assessment of the impact on environmental and human health by primary pollutants or possibly by a primary pollutant well as a possible suite of transformation subsequent products that may be both persistent in and bioaccumulating and toxic.The present review focus on the environmental – and human health impact by residuals of the rocket fuel 1,1-dimethyl- hydrazine (heptyl and its transformation products as an illustrative example.

  19. Development and validation of a mass casualty conceptual model.

    Science.gov (United States)

    Culley, Joan M; Effken, Judith A

    2010-03-01

    To develop and validate a conceptual model that provides a framework for the development and evaluation of information systems for mass casualty events. The model was designed based on extant literature and existing theoretical models. A purposeful sample of 18 experts validated the model. Open-ended questions, as well as a 7-point Likert scale, were used to measure expert consensus on the importance of each construct and its relationship in the model and the usefulness of the model to future research. Computer-mediated applications were used to facilitate a modified Delphi technique through which a panel of experts provided validation for the conceptual model. Rounds of questions continued until consensus was reached, as measured by an interquartile range (no more than 1 scale point for each item); stability (change in the distribution of responses less than 15% between rounds); and percent agreement (70% or greater) for indicator questions. Two rounds of the Delphi process were needed to satisfy the criteria for consensus or stability related to the constructs, relationships, and indicators in the model. The panel reached consensus or sufficient stability to retain all 10 constructs, 9 relationships, and 39 of 44 indicators. Experts viewed the model as useful (mean of 5.3 on a 7-point scale). Validation of the model provides the first step in understanding the context in which mass casualty events take place and identifying variables that impact outcomes of care. This study provides a foundation for understanding the complexity of mass casualty care, the roles that nurses play in mass casualty events, and factors that must be considered in designing and evaluating information-communication systems to support effective triage under these conditions.

  20. Design, synthesis, α-glucosidase inhibitory activity, molecular docking and QSAR studies of benzimidazole derivatives

    Science.gov (United States)

    Dinparast, Leila; Valizadeh, Hassan; Bahadori, Mir Babak; Soltani, Somaieh; Asghari, Behvar; Rashidi, Mohammad-Reza

    2016-06-01

    In this study the green, one-pot, solvent-free and selective synthesis of benzimidazole derivatives is reported. The reactions were catalyzed by ZnO/MgO containing ZnO nanoparticles as a highly effective, non-toxic and environmentally friendly catalyst. The structure of synthesized benzimidazoles was characterized using spectroscopic technics (FT-IR, 1HNMR, 13CNMR). Synthesized compounds were evaluated for their α-glucosidase inhibitory potential. Compounds 3c, 3e, 3l and 4n were potent inhibitors with IC50 values ranging from 60.7 to 168.4 μM. In silico studies were performed to explore the binding modes and interactions between enzyme and synthesized benzimidazoles. Developed linear QSAR model based on density and molecular weight could predict bioactivity of newly synthesized compounds well. Molecular docking studies revealed the availability of some hydrophobic interactions. In addition, the bioactivity of most potent compounds had good correlation with estimated free energy of binding (ΔGbinding) which was calculated according to docked best conformations.

  1. Statistical Validation of Normal Tissue Complication Probability Models

    Energy Technology Data Exchange (ETDEWEB)

    Xu Chengjian, E-mail: c.j.xu@umcg.nl [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands); Schaaf, Arjen van der; Veld, Aart A. van' t; Langendijk, Johannes A. [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands); Schilstra, Cornelis [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands); Radiotherapy Institute Friesland, Leeuwarden (Netherlands)

    2012-09-01

    Purpose: To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. Methods and Materials: A penalized regression method, LASSO (least absolute shrinkage and selection operator), was used to build NTCP models for xerostomia after radiation therapy treatment of head-and-neck cancer. Model assessment was based on the likelihood function and the area under the receiver operating characteristic curve. Results: Repeated double cross-validation showed the uncertainty and instability of the NTCP models and indicated that the statistical significance of model performance can be obtained by permutation testing. Conclusion: Repeated double cross-validation and permutation tests are recommended to validate NTCP models before clinical use.

  2. Validation by simulation of a clinical trial model using the standardized mean and variance criteria.

    Science.gov (United States)

    Abbas, Ismail; Rovira, Joan; Casanovas, Josep

    2006-12-01

    To develop and validate a model of a clinical trial that evaluates the changes in cholesterol level as a surrogate marker for lipodystrophy in HIV subjects under alternative antiretroviral regimes, i.e., treatment with Protease Inhibitors vs. a combination of nevirapine and other antiretroviral drugs. Five simulation models were developed based on different assumptions, on treatment variability and pattern of cholesterol reduction over time. The last recorded cholesterol level, the difference from the baseline, the average difference from the baseline and level evolution, are the considered endpoints. Specific validation criteria based on a 10% minus or plus standardized distance in means and variances were used to compare the real and the simulated data. The validity criterion was met by all models for considered endpoints. However, only two models met the validity criterion when all endpoints were considered. The model based on the assumption that within-subjects variability of cholesterol levels changes over time is the one that minimizes the validity criterion, standardized distance equal to or less than 1% minus or plus. Simulation is a useful technique for calibration, estimation, and evaluation of models, which allows us to relax the often overly restrictive assumptions regarding parameters required by analytical approaches. The validity criterion can also be used to select the preferred model for design optimization, until additional data are obtained allowing an external validation of the model.

  3. Validation process of simulation model

    International Nuclear Information System (INIS)

    San Isidro, M. J.

    1998-01-01

    It is presented a methodology on empirical validation about any detailed simulation model. This king of validation it is always related with an experimental case. The empirical validation has a residual sense, because the conclusions are based on comparisons between simulated outputs and experimental measurements. This methodology will guide us to detect the fails of the simulation model. Furthermore, it can be used a guide in the design of posterior experiments. Three steps can be well differentiated: Sensitivity analysis. It can be made with a DSA, differential sensitivity analysis, and with a MCSA, Monte-Carlo sensitivity analysis. Looking the optimal domains of the input parameters. It has been developed a procedure based on the Monte-Carlo methods and Cluster techniques, to find the optimal domains of these parameters. Residual analysis. This analysis has been made on the time domain and on the frequency domain, it has been used the correlation analysis and spectral analysis. As application of this methodology, it is presented the validation carried out on a thermal simulation model on buildings, Esp., studying the behavior of building components on a Test Cell of LECE of CIEMAT. (Author) 17 refs

  4. 2D MI-DRAGON: a new predictor for protein-ligands interactions and theoretic-experimental studies of US FDA drug-target network, oxoisoaporphine inhibitors for MAO-A and human parasite proteins.

    Science.gov (United States)

    Prado-Prado, Francisco; García-Mera, Xerardo; Escobar, Manuel; Sobarzo-Sánchez, Eduardo; Yañez, Matilde; Riera-Fernandez, Pablo; González-Díaz, Humberto

    2011-12-01

    There are many pairs of possible Drug-Proteins Interactions that may take place or not (DPIs/nDPIs) between drugs with high affinity/non-affinity for different proteins. This fact makes expensive in terms of time and resources, for instance, the determination of all possible ligands-protein interactions for a single drug. In this sense, we can use Quantitative Structure-Activity Relationships (QSAR) models to carry out rational DPIs prediction. Unfortunately, almost all QSAR models predict activity against only one target. To solve this problem we can develop multi-target QSAR (mt-QSAR) models. In this work, we introduce the technique 2D MI-DRAGON a new predictor for DPIs based on two different well-known software. We use the software MARCH-INSIDE (MI) to calculate 3D structural parameters for targets and the software DRAGON was used to calculated 2D molecular descriptors all drugs showing known DPIs present in the Drug Bank (US FDA benchmark dataset). Both classes of parameters were used as input of different Artificial Neural Network (ANN) algorithms to seek an accurate non-linear mt-QSAR predictor. The best ANN model found is a Multi-Layer Perceptron (MLP) with profile MLP 21:21-31-1:1. This MLP classifies correctly 303 out of 339 DPIs (Sensitivity = 89.38%) and 480 out of 510 nDPIs (Specificity = 94.12%), corresponding to training Accuracy = 92.23%. The validation of the model was carried out by means of external predicting series with Sensitivity = 92.18% (625/678 DPIs; Specificity = 90.12% (730/780 nDPIs) and Accuracy = 91.06%. 2D MI-DRAGON offers a good opportunity for fast-track calculation of all possible DPIs of one drug enabling us to re-construct large drug-target or DPIs Complex Networks (CNs). For instance, we reconstructed the CN of the US FDA benchmark dataset with 855 nodes 519 drugs+336 targets). We predicted CN with similar topology (observed and predicted values of average distance are equal to 6.7 vs. 6.6). These CNs can be used to explore

  5. SWAT application in intensive irrigation systems: Model modification, calibration and validation

    Science.gov (United States)

    Dechmi, Farida; Burguete, Javier; Skhiri, Ahmed

    2012-11-01

    SummaryThe Soil and Water Assessment Tool (SWAT) is a well established, distributed, eco-hydrologic model. However, using the study case of an agricultural intensive irrigated watershed, it was shown that all the model versions are not able to appropriately reproduce the total streamflow in such system when the irrigation source is outside the watershed. The objective of this study was to modify the SWAT2005 version for correctly simulating the main hydrological processes. Crop yield, total streamflow, total suspended sediment (TSS) losses and phosphorus load calibration and validation were performed using field survey information and water quantity and quality data recorded during 2008 and 2009 years in Del Reguero irrigated watershed in Spain. The goodness of the calibration and validation results was assessed using five statistical measures, including the Nash-Sutcliffe efficiency (NSE). Results indicated that the average annual crop yield and actual evapotranspiration estimations were quite satisfactory. On a monthly basis, the values of NSE were 0.90 (calibration) and 0.80 (validation) indicating that the modified model could reproduce accurately the observed streamflow. The TSS losses were also satisfactorily estimated (NSE = 0.72 and 0.52 for the calibration and validation steps). The monthly temporal patterns and all the statistical parameters indicated that the modified SWAT-IRRIG model adequately predicted the total phosphorus (TP) loading. Therefore, the model could be used to assess the impacts of different best management practices on nonpoint phosphorus losses in irrigated systems.

  6. One Size Fits All? The Validity of a Composite Poverty Index Across Urban and Rural Households in South Africa.

    Science.gov (United States)

    Steinert, Janina Isabel; Cluver, Lucie Dale; Melendez-Torres, G J; Vollmer, Sebastian

    2018-01-01

    Composite indices have been prominently used in poverty research. However, validity of these indices remains subject to debate. This paper examines the validity of a common type of composite poverty indices using data from a cross-sectional survey of 2477 households in urban and rural KwaZulu-Natal, South Africa. Multiple-group comparisons in structural equation modelling were employed for testing differences in the measurement model across urban and rural groups. The analysis revealed substantial variations between urban and rural respondents both in the conceptualisation of poverty as well as in the weights and importance assigned to individual poverty indicators. The validity of a 'one size fits all' measurement model can therefore not be confirmed. In consequence, it becomes virtually impossible to determine a household's poverty level relative to the full sample. Findings from our analysis have important practical implications in nuancing how we can sensitively use composite poverty indices to identify poor people.

  7. All Validity Is Construct Validity. Or Is It?

    Science.gov (United States)

    Kane, Michael

    2012-01-01

    Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…

  8. Predictive Simulation of Material Failure Using Peridynamics -- Advanced Constitutive Modeling, Verification and Validation

    Science.gov (United States)

    2016-03-31

    AFRL-AFOSR-VA-TR-2016-0309 Predictive simulation of material failure using peridynamics- advanced constitutive modeling, verification , and validation... Self -explanatory. 8. PERFORMING ORGANIZATION REPORT NUMBER. Enter all unique alphanumeric report numbers assigned by the performing organization, e.g...for public release. Predictive simulation of material failure using peridynamics-advanced constitutive modeling, verification , and validation John T

  9. The effect of leverage and/or influential on structure-activity relationships.

    Science.gov (United States)

    Bolboacă, Sorana D; Jäntschi, Lorentz

    2013-05-01

    In the spirit of reporting valid and reliable Quantitative Structure-Activity Relationship (QSAR) models, the aim of our research was to assess how the leverage (analysis with Hat matrix, h(i)) and the influential (analysis with Cook's distance, D(i)) of QSAR models may reflect the models reliability and their characteristics. The datasets included in this research were collected from previously published papers. Seven datasets which accomplished the imposed inclusion criteria were analyzed. Three models were obtained for each dataset (full-model, h(i)-model and D(i)-model) and several statistical validation criteria were applied to the models. In 5 out of 7 sets the correlation coefficient increased when compounds with either h(i) or D(i) higher than the threshold were removed. Withdrawn compounds varied from 2 to 4 for h(i)-models and from 1 to 13 for D(i)-models. Validation statistics showed that D(i)-models possess systematically better agreement than both full-models and h(i)-models. Removal of influential compounds from training set significantly improves the model and is recommended to be conducted in the process of quantitative structure-activity relationships developing. Cook's distance approach should be combined with hat matrix analysis in order to identify the compounds candidates for removal.

  10. Implementation and automated validation of the minimal Z' model in FeynRules

    International Nuclear Information System (INIS)

    Basso, L.; Christensen, N.D.; Duhr, C.; Fuks, B.; Speckner, C.

    2012-01-01

    We describe the implementation of a well-known class of U(1) gauge models, the 'minimal' Z' models, in FeynRules. We also describe a new automated validation tool for FeynRules models which is controlled by a web interface and allows the user to run a complete set of 2 → 2 processes on different matrix element generators, different gauges, and compare between them all. If existing, the comparison with independent implementations is also possible. This tool has been used to validate our implementation of the 'minimal' Z' models. (authors)

  11. Validation of Hydrodynamic Numerical Model of a Pitching Wave Energy Converter

    DEFF Research Database (Denmark)

    López, Maria del Pilar Heras; Thomas, Sarah; Kramer, Morten Mejlhede

    2017-01-01

    Validation of numerical model is essential in the development of new technologies. Commercial software and codes available simulating wave energy converters (WECs) have not been proved to work for all the available and upcoming technologies yet. The present paper presents the first stages...... of the validation process of a hydrodynamic numerical model for a pitching wave energy converter. The development of dry tests, wave flume and wave basin experiments are going to be explained, lessons learned shared and results presented....

  12. Highly predictive hologram QSAR models of nitrile-containing cruzain inhibitors.

    Science.gov (United States)

    Silva, Daniel Gedder; Rocha, Josmar Rodrigues; Sartori, Geraldo Rodrigues; Montanari, Carlos Alberto

    2017-11-01

    The HQSAR, molecular docking, and ROCS were applied to a data-set of 57 cruzain inhibitors. The best HQSAR model (q 2  = .70, r 2  = .95, [Formula: see text] = .62, [Formula: see text] = .09 and [Formula: see text] = .26), employing well-balanced, diverse training (40) and test (17) sets, was obtained using atoms (A), bonds (B), and hydrogen (H) as fragment distinctions and 6-9 as fragment sizes. This model was then used to predict the unknown potencies of 121 compounds (the V1 database), giving rise to a satisfactory predictive r 2 value of .65 (external validation). By employing an extra external data-set comprising 1223 compounds (the V3 database) either retrieved from the ChEMBL or CDD databases, an overall ROC AUC score well over .70 was obtained. The contribution maps obtained with the best HQSAR model (model 3.4) are in agreement with the predicted binding mode and with the biological potencies of the studied compounds. We also screened these compounds using the ROCS method, a Gaussian-shape volume filter able to identify quickly the shapes that match a query molecule. The area under the curve (AUC) obtained with the ROC curves (ROC AUC) was .72, indicating that the method was very efficient in distinguishing between active and inactive cruzain inhibitors. These set of information guided us to propose novel cruzain inhibitors to be synthesized. Then, the best HQSAR model obtained was used to predict the pIC 50 values of these new compounds. Some compounds identified using this method have shown calculated potencies higher than those which have originated them.

  13. Cross-validation of an employee safety climate model in Malaysia.

    Science.gov (United States)

    Bahari, Siti Fatimah; Clarke, Sharon

    2013-06-01

    Whilst substantial research has investigated the nature of safety climate, and its importance as a leading indicator of organisational safety, much of this research has been conducted with Western industrial samples. The current study focuses on the cross-validation of a safety climate model in the non-Western industrial context of Malaysian manufacturing. The first-order factorial validity of Cheyne et al.'s (1998) [Cheyne, A., Cox, S., Oliver, A., Tomas, J.M., 1998. Modelling safety climate in the prediction of levels of safety activity. Work and Stress, 12(3), 255-271] model was tested, using confirmatory factor analysis, in a Malaysian sample. Results showed that the model fit indices were below accepted levels, indicating that the original Cheyne et al. (1998) safety climate model was not supported. An alternative three-factor model was developed using exploratory factor analysis. Although these findings are not consistent with previously reported cross-validation studies, we argue that previous studies have focused on validation across Western samples, and that the current study demonstrates the need to take account of cultural factors in the development of safety climate models intended for use in non-Western contexts. The results have important implications for the transferability of existing safety climate models across cultures (for example, in global organisations) and highlight the need for future research to examine cross-cultural issues in relation to safety climate. Copyright © 2013 National Safety Council and Elsevier Ltd. All rights reserved.

  14. Advances in the replacement and enhanced replacement method in QSAR and QSPR theories.

    Science.gov (United States)

    Mercader, Andrew G; Duchowicz, Pablo R; Fernández, Francisco M; Castro, Eduardo A

    2011-07-25

    The selection of an optimal set of molecular descriptors from a much greater pool of such regression variables is a crucial step in the development of QSAR and QSPR models. The aim of this work is to further improve this important selection process. For this reason three different alternatives for the initial steps of our recently developed enhanced replacement method (ERM) and replacement method (RM) are proposed. These approaches had previously proven to yield near optimal results with a much smaller number of linear regressions than the full search. The algorithms were tested on four different experimental data sets, formed by collections of 116, 200, 78, and 100 experimental records from different compounds and 1268, 1338, 1187, and 1306 molecular descriptors, respectively. The comparisons showed that one of the new alternatives further improves the ERM, which has shown to be superior to genetic algorithms for the selection of an optimal set of molecular descriptors from a much greater pool. The new proposed alternative also improves the simpler and the lower computational demand algorithm RM.

  15. Statistical Validation of Engineering and Scientific Models: Background

    International Nuclear Information System (INIS)

    Hills, Richard G.; Trucano, Timothy G.

    1999-01-01

    A tutorial is presented discussing the basic issues associated with propagation of uncertainty analysis and statistical validation of engineering and scientific models. The propagation of uncertainty tutorial illustrates the use of the sensitivity method and the Monte Carlo method to evaluate the uncertainty in predictions for linear and nonlinear models. Four example applications are presented; a linear model, a model for the behavior of a damped spring-mass system, a transient thermal conduction model, and a nonlinear transient convective-diffusive model based on Burger's equation. Correlated and uncorrelated model input parameters are considered. The model validation tutorial builds on the material presented in the propagation of uncertainty tutoriaI and uses the damp spring-mass system as the example application. The validation tutorial illustrates several concepts associated with the application of statistical inference to test model predictions against experimental observations. Several validation methods are presented including error band based, multivariate, sum of squares of residuals, and optimization methods. After completion of the tutorial, a survey of statistical model validation literature is presented and recommendations for future work are made

  16. 2D-QSAR and 3D-QSAR/CoMSIA Studies on a Series of (R)-2-((2-(1H-Indol-2-yl)ethyl)amino)-1-Phenylethan-1-ol with Human β₃-Adrenergic Activity.

    Science.gov (United States)

    Apablaza, Gastón; Montoya, Luisa; Morales-Verdejo, Cesar; Mellado, Marco; Cuellar, Mauricio; Lagos, Carlos F; Soto-Delgado, Jorge; Chung, Hery; Pessoa-Mahana, Carlos David; Mella, Jaime

    2017-03-05

    The β₃ adrenergic receptor is raising as an important drug target for the treatment of pathologies such as diabetes, obesity, depression, and cardiac diseases among others. Several attempts to obtain selective and high affinity ligands have been made. Currently, Mirabegron is the only available drug on the market that targets this receptor approved for the treatment of overactive bladder. However, the FDA (Food and Drug Administration) in USA and the MHRA (Medicines and Healthcare products Regulatory Agency) in UK have made reports of potentially life-threatening side effects associated with the administration of Mirabegron, casting doubts on the continuity of this compound. Therefore, it is of utmost importance to gather information for the rational design and synthesis of new β₃ adrenergic ligands. Herein, we present the first combined 2D-QSAR (two-dimensional Quantitative Structure-Activity Relationship) and 3D-QSAR/CoMSIA (three-dimensional Quantitative Structure-Activity Relationship/Comparative Molecular Similarity Index Analysis) study on a series of potent β₃ adrenergic agonists of indole-alkylamine structure. We found a series of changes that can be made in the steric, hydrogen-bond donor and acceptor, lipophilicity and molar refractivity properties of the compounds to generate new promising molecules. Finally, based on our analysis, a summary and a regiospecific description of the requirements for improving β₃ adrenergic activity is given.

  17. All the mathematics in the world: logical validity and classical set theory

    Directory of Open Access Journals (Sweden)

    David Charles McCarty

    2017-12-01

    Full Text Available A recognizable topological model construction shows that any consistent principles of classical set theory, including the validity of the law of the excluded third, together with a standard class theory, do not suffice to demonstrate the general validity of the law of the excluded third. This result calls into question the classical mathematician's ability to offer solid justifications for the logical principles he or she favors.

  18. Model Validation Using Coordinate Distance with Performance Sensitivity

    Directory of Open Access Journals (Sweden)

    Jiann-Shiun Lew

    2008-01-01

    Full Text Available This paper presents an innovative approach to model validation for a structure with significant parameter variations. Model uncertainty of the structural dynamics is quantified with the use of a singular value decomposition technique to extract the principal components of parameter change, and an interval model is generated to represent the system with parameter uncertainty. The coordinate vector, corresponding to the identified principal directions, of the validation system is computed. The coordinate distance between the validation system and the identified interval model is used as a metric for model validation. A beam structure with an attached subsystem, which has significant parameter uncertainty, is used to demonstrate the proposed approach.

  19. Some considerations for validation of repository performance assessment models

    International Nuclear Information System (INIS)

    Eisenberg, N.

    1991-01-01

    Validation is an important aspect of the regulatory uses of performance assessment. A substantial body of literature exists indicating the manner in which validation of models is usually pursued. Because performance models for a nuclear waste repository cannot be tested over the long time periods for which the model must make predictions, the usual avenue for model validation is precluded. Further impediments to model validation include a lack of fundamental scientific theory to describe important aspects of repository performance and an inability to easily deduce the complex, intricate structures characteristic of a natural system. A successful strategy for validation must attempt to resolve these difficulties in a direct fashion. Although some procedural aspects will be important, the main reliance of validation should be on scientific substance and logical rigor. The level of validation needed will be mandated, in part, by the uses to which these models are put, rather than by the ideal of validation of a scientific theory. Because of the importance of the validation of performance assessment models, the NRC staff has engaged in a program of research and international cooperation to seek progress in this important area. 2 figs., 16 refs

  20. Particle swarm optimization and genetic algorithm as feature selection techniques for the QSAR modeling of imidazo[1,5-a]pyrido[3,2-e]pyrazines, inhibitors of phosphodiesterase 10A.

    Science.gov (United States)

    Goodarzi, Mohammad; Saeys, Wouter; Deeb, Omar; Pieters, Sigrid; Vander Heyden, Yvan

    2013-12-01

    Quantitative structure-activity relationship (QSAR) modeling was performed for imidazo[1,5-a]pyrido[3,2-e]pyrazines, which constitute a class of phosphodiesterase 10A inhibitors. Particle swarm optimization (PSO) and genetic algorithm (GA) were used as feature selection techniques to find the most reliable molecular descriptors from a large pool. Modeling of the relationship between the selected descriptors and the pIC50 activity data was achieved by linear [multiple linear regression (MLR)] and non-linear [locally weighted regression (LWR) based on both Euclidean (E) and Mahalanobis (M) distances] methods. In addition, a stepwise MLR model was built using only a limited number of quantum chemical descriptors, selected because of their correlation with the pIC50 . The model was not found interesting. It was concluded that the LWR model, based on the Euclidean distance, applied on the descriptors selected by PSO has the best prediction ability. However, some other models behaved similarly. The root-mean-squared errors of prediction (RMSEP) for the test sets obtained by PSO/MLR, GA/MLR, PSO/LWRE, PSO/LWRM, GA/LWRE, and GA/LWRM models were 0.333, 0.394, 0.313, 0.333, 0.421, and 0.424, respectively. The PSO-selected descriptors resulted in the best prediction models, both linear and non-linear. © 2013 John Wiley & Sons A/S.

  1. SDG and qualitative trend based model multiple scale validation

    Science.gov (United States)

    Gao, Dong; Xu, Xin; Yin, Jianjin; Zhang, Hongyu; Zhang, Beike

    2017-09-01

    Verification, Validation and Accreditation (VV&A) is key technology of simulation and modelling. For the traditional model validation methods, the completeness is weak; it is carried out in one scale; it depends on human experience. The SDG (Signed Directed Graph) and qualitative trend based multiple scale validation is proposed. First the SDG model is built and qualitative trends are added to the model. And then complete testing scenarios are produced by positive inference. The multiple scale validation is carried out by comparing the testing scenarios with outputs of simulation model in different scales. Finally, the effectiveness is proved by carrying out validation for a reactor model.

  2. Assessment and improvement of biotransfer models to cow's milk and beef used in exposure assessment tools for organic pollutants.

    Science.gov (United States)

    Takaki, Koki; Wade, Andrew J; Collins, Chris D

    2015-11-01

    The aim of this study was to assess and improve the accuracy of biotransfer models for the organic pollutants (PCBs, PCDD/Fs, PBDEs, PFCAs, and pesticides) into cow's milk and beef used in human exposure assessment. Metabolic rate in cattle is known as a key parameter for this biotransfer, however few experimental data and no simulation methods are currently available. In this research, metabolic rate was estimated using existing QSAR biodegradation models of microorganisms (BioWIN) and fish (EPI-HL and IFS-HL). This simulated metabolic rate was then incorporated into the mechanistic cattle biotransfer models (RAIDAR, ACC-HUMAN, OMEGA, and CKow). The goodness of fit tests showed that RAIDAR, ACC-HUMAN, OMEGA model performances were significantly improved using either of the QSARs when comparing the new model outputs to observed data. The CKow model is the only one that separates the processes in the gut and liver. This model showed the lowest residual error of all the models tested when the BioWIN model was used to represent the ruminant metabolic process in the gut and the two fish QSARs were used to represent the metabolic process in the liver. Our testing included EUSES and CalTOX which are KOW-regression models that are widely used in regulatory assessment. New regressions based on the simulated rate of the two metabolic processes are also proposed as an alternative to KOW-regression models for a screening risk assessment. The modified CKow model is more physiologically realistic, but has equivalent usability to existing KOW-regression models for estimating cattle biotransfer of organic pollutants. Copyright © 2015. Published by Elsevier Ltd.

  3. Explicit validation of a surface shortwave radiation balance model over snow-covered complex terrain

    Science.gov (United States)

    Helbig, N.; Löwe, H.; Mayer, B.; Lehning, M.

    2010-09-01

    A model that computes the surface radiation balance for all sky conditions in complex terrain is presented. The spatial distribution of direct and diffuse sky radiation is determined from observations of incident global radiation, air temperature, and relative humidity at a single measurement location. Incident radiation under cloudless sky is spatially derived from a parameterization of the atmospheric transmittance. Direct and diffuse sky radiation for all sky conditions are obtained by decomposing the measured global radiation value. Spatial incident radiation values under all atmospheric conditions are computed by adjusting the spatial radiation values obtained from the parametric model with the radiation components obtained from the decomposition model at the measurement site. Topographic influences such as shading are accounted for. The radiosity approach is used to compute anisotropic terrain reflected radiation. Validations of the shortwave radiation balance model are presented in detail for a day with cloudless sky. For a day with overcast sky a first validation is presented. Validation of a section of the horizon line as well as of individual radiation components is performed with high-quality measurements. A new measurement setup was designed to determine terrain reflected radiation. There is good agreement between the measurements and the modeled terrain reflected radiation values as well as with incident radiation values. A comparison of the model with a fully three-dimensional radiative transfer Monte Carlo model is presented. That validation reveals a good agreement between modeled radiation values.

  4. Design of experiments in medical physics: Application to the AAA beam model validation.

    Science.gov (United States)

    Dufreneix, S; Legrand, C; Di Bartolo, C; Bremaud, M; Mesgouez, J; Tiplica, T; Autret, D

    2017-09-01

    The purpose of this study is to evaluate the usefulness of the design of experiments in the analysis of multiparametric problems related to the quality assurance in radiotherapy. The main motivation is to use this statistical method to optimize the quality assurance processes in the validation of beam models. Considering the Varian Eclipse system, eight parameters with several levels were selected: energy, MLC, depth, X, Y 1 and Y 2 jaw dimensions, wedge and wedge jaw. A Taguchi table was used to define 72 validation tests. Measurements were conducted in water using a CC04 on a TrueBeam STx, a TrueBeam Tx, a Trilogy and a 2300IX accelerator matched by the vendor. Dose was computed using the AAA algorithm. The same raw data was used for all accelerators during the beam modelling. The mean difference between computed and measured doses was 0.1±0.5% for all beams and all accelerators with a maximum difference of 2.4% (under the 3% tolerance level). For all beams, the measured doses were within 0.6% for all accelerators. The energy was found to be an influencing parameter but the deviations observed were smaller than 1% and not considered clinically significant. Designs of experiment can help define the optimal measurement set to validate a beam model. The proposed method can be used to identify the prognostic factors of dose accuracy. The beam models were validated for the 4 accelerators which were found dosimetrically equivalent even though the accelerator characteristics differ. Copyright © 2017 Associazione Italiana di Fisica Medica. Published by Elsevier Ltd. All rights reserved.

  5. Animal models of binge drinking, current challenges to improve face validity.

    Science.gov (United States)

    Jeanblanc, Jérôme; Rolland, Benjamin; Gierski, Fabien; Martinetti, Margaret P; Naassila, Mickael

    2018-05-05

    Binge drinking (BD), i.e., consuming a large amount of alcohol in a short period of time, is an increasing public health issue. Though no clear definition has been adopted worldwide the speed of drinking seems to be a keystone of this behavior. Developing relevant animal models of BD is a priority for gaining a better characterization of the neurobiological and psychobiological mechanisms underlying this dangerous and harmful behavior. Until recently, preclinical research on BD has been conducted mostly using forced administration of alcohol, but more recent studies used scheduled access to alcohol, to model more voluntary excessive intakes, and to achieve signs of intoxications that mimic the human behavior. The main challenges for future research are discussed regarding the need of good face validity, construct validity and predictive validity of animal models of BD. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. A comprehensive validation toolbox for regional ocean models - Outline, implementation and application to the Baltic Sea

    Science.gov (United States)

    Jandt, Simon; Laagemaa, Priidik; Janssen, Frank

    2014-05-01

    The systematic and objective comparison between output from a numerical ocean model and a set of observations, called validation in the context of this presentation, is a beneficial activity at several stages, starting from early steps in model development and ending at the quality control of model based products delivered to customers. Even though the importance of this kind of validation work is widely acknowledged it is often not among the most popular tasks in ocean modelling. In order to ease the validation work a comprehensive toolbox has been developed in the framework of the MyOcean-2 project. The objective of this toolbox is to carry out validation integrating different data sources, e.g. time-series at stations, vertical profiles, surface fields or along track satellite data, with one single program call. The validation toolbox, implemented in MATLAB, features all parts of the validation process - ranging from read-in procedures of datasets to the graphical and numerical output of statistical metrics of the comparison. The basic idea is to have only one well-defined validation schedule for all applications, in which all parts of the validation process are executed. Each part, e.g. read-in procedures, forms a module in which all available functions of this particular part are collected. The interface between the functions, the module and the validation schedule is highly standardized. Functions of a module are set up for certain validation tasks, new functions can be implemented into the appropriate module without affecting the functionality of the toolbox. The functions are assigned for each validation task in user specific settings, which are externally stored in so-called namelists and gather all information of the used datasets as well as paths and metadata. In the framework of the MyOcean-2 project the toolbox is frequently used to validate the forecast products of the Baltic Sea Marine Forecasting Centre. Hereby the performance of any new product

  7. Tracer travel time and model validation

    International Nuclear Information System (INIS)

    Tsang, Chin-Fu.

    1988-01-01

    The performance assessment of a nuclear waste repository demands much more in comparison to the safety evaluation of any civil constructions such as dams, or the resource evaluation of a petroleum or geothermal reservoir. It involves the estimation of low probability (low concentration) of radionuclide transport extrapolated 1000's of years into the future. Thus models used to make these estimates need to be carefully validated. A number of recent efforts have been devoted to the study of this problem. Some general comments on model validation were given by Tsang. The present paper discusses some issues of validation in regards to radionuclide transport. 5 refs

  8. Model validation: a systemic and systematic approach

    International Nuclear Information System (INIS)

    Sheng, G.; Elzas, M.S.; Cronhjort, B.T.

    1993-01-01

    The term 'validation' is used ubiquitously in association with the modelling activities of numerous disciplines including social, political natural, physical sciences, and engineering. There is however, a wide range of definitions which give rise to very different interpretations of what activities the process involves. Analyses of results from the present large international effort in modelling radioactive waste disposal systems illustrate the urgent need to develop a common approach to model validation. Some possible explanations are offered to account for the present state of affairs. The methodology developed treats model validation and code verification in a systematic fashion. In fact, this approach may be regarded as a comprehensive framework to assess the adequacy of any simulation study. (author)

  9. Validation of the hdm models forcrack initiation and development, rutting and roughness of the pavement

    Directory of Open Access Journals (Sweden)

    Ognjenović Slobodan

    2017-01-01

    Full Text Available Worldwide practice recommends validation of the HDM models with some other software that can be used for comparison of the forecasting results. The program package MATLAB is used in this case, as it enables for modelling of all the HDM models. A statistic validation of the results of the forecasts concerning the condition of the pavements in HDM with the on-field measuring results was also performed. This paper shall present the results of the validation of the coefficients of calibration of the deterioration models in HDM 4 on the Macedonian highways.

  10. Principles of Proper Validation

    DEFF Research Database (Denmark)

    Esbensen, Kim; Geladi, Paul

    2010-01-01

    to suffer from the same deficiencies. The PPV are universal and can be applied to all situations in which the assessment of performance is desired: prediction-, classification-, time series forecasting-, modeling validation. The key element of PPV is the Theory of Sampling (TOS), which allow insight......) is critically necessary for the inclusion of the sampling errors incurred in all 'future' situations in which the validated model must perform. Logically, therefore, all one data set re-sampling approaches for validation, especially cross-validation and leverage-corrected validation, should be terminated...

  11. Flavonoids as Vasorelaxant Agents: Synthesis, Biological Evaluation and Quantitative Structure Activities Relationship (QSAR Studies

    Directory of Open Access Journals (Sweden)

    Yongzhou Hu

    2011-09-01

    Full Text Available A series of 2-(2-diethylamino-ethoxychalcone and 6-prenyl(or its isomers-flavanones 10a,b and 11a–g were synthesized and evaluated for their vasorelaxant activities against rat aorta rings pretreated with 1 μM phenylephrine (PE. Several compounds showed potent vasorelaxant activities. Compound 10a (EC50 = 7.6 μM, Emax = 93.1%, the most potent one, would be a promising structural template for development of novel and more efficient vasodilators. Further, 2D-QSAR analysis of compounds 10a,b and 11c-e as well as thirty previously synthesized flavonoids 1-3 and 12-38 using Enhanced Replacement Method-Multiple Linear Regression (ERM-MLR was further performed based on an optimal set of molecular descriptors (H5m, SIC2, DISPe, Mor03u and L3m, leading to a reliable model with good predictive ability (Rtrain2 = 0.839, Qloo2 = 0.733 and Rtest2 = 0.804. The results provide good insights into the structure- activity relationships of the target compounds.

  12. Validity test and its consistency in the construction of patient loyalty model

    Science.gov (United States)

    Yanuar, Ferra

    2016-04-01

    The main objective of this present study is to demonstrate the estimation of validity values and its consistency based on structural equation model. The method of estimation was then implemented to an empirical data in case of the construction the patient loyalty model. In the hypothesis model, service quality, patient satisfaction and patient loyalty were determined simultaneously, each factor were measured by any indicator variables. The respondents involved in this study were the patients who ever got healthcare at Puskesmas in Padang, West Sumatera. All 394 respondents who had complete information were included in the analysis. This study found that each construct; service quality, patient satisfaction and patient loyalty were valid. It means that all hypothesized indicator variables were significant to measure their corresponding latent variable. Service quality is the most measured by tangible, patient satisfaction is the most mesured by satisfied on service and patient loyalty is the most measured by good service quality. Meanwhile in structural equation, this study found that patient loyalty was affected by patient satisfaction positively and directly. Service quality affected patient loyalty indirectly with patient satisfaction as mediator variable between both latent variables. Both structural equations were also valid. This study also proved that validity values which obtained here were also consistence based on simulation study using bootstrap approach.

  13. Validation of models in an imaging infrared simulation

    CSIR Research Space (South Africa)

    Willers, C

    2007-10-01

    Full Text Available threeprocessesfortransformingtheinformationbetweentheentities. Reality/ Problem Entity Conceptual Model Computerized Model Model Validation ModelVerification Model Qualification Computer Implementation Analysisand Modelling Simulationand Experimentation “Substantiationthata....C.Refsgaard ,ModellingGuidelines-terminology andguidingprinciples, AdvancesinWaterResources, Vol27,No1,January2004,?pp.71-82(12),Elsevier. et.al. [5]N.Oreskes,et.al.,Verification,Validation,andConfirmationof NumericalModelsintheEarthSciences,Science,Vol263, Number...

  14. Selection, calibration, and validation of models of tumor growth.

    Science.gov (United States)

    Lima, E A B F; Oden, J T; Hormuth, D A; Yankeelov, T E; Almeida, R C

    2016-11-01

    This paper presents general approaches for addressing some of the most important issues in predictive computational oncology concerned with developing classes of predictive models of tumor growth. First, the process of developing mathematical models of vascular tumors evolving in the complex, heterogeneous, macroenvironment of living tissue; second, the selection of the most plausible models among these classes, given relevant observational data; third, the statistical calibration and validation of models in these classes, and finally, the prediction of key Quantities of Interest (QOIs) relevant to patient survival and the effect of various therapies. The most challenging aspects of this endeavor is that all of these issues often involve confounding uncertainties: in observational data, in model parameters, in model selection, and in the features targeted in the prediction. Our approach can be referred to as "model agnostic" in that no single model is advocated; rather, a general approach that explores powerful mixture-theory representations of tissue behavior while accounting for a range of relevant biological factors is presented, which leads to many potentially predictive models. Then representative classes are identified which provide a starting point for the implementation of OPAL, the Occam Plausibility Algorithm (OPAL) which enables the modeler to select the most plausible models (for given data) and to determine if the model is a valid tool for predicting tumor growth and morphology ( in vivo ). All of these approaches account for uncertainties in the model, the observational data, the model parameters, and the target QOI. We demonstrate these processes by comparing a list of models for tumor growth, including reaction-diffusion models, phase-fields models, and models with and without mechanical deformation effects, for glioma growth measured in murine experiments. Examples are provided that exhibit quite acceptable predictions of tumor growth in laboratory

  15. A Component-Based Modeling and Validation Method for PLC Systems

    Directory of Open Access Journals (Sweden)

    Rui Wang

    2014-05-01

    Full Text Available Programmable logic controllers (PLCs are complex embedded systems that are widely used in industry. This paper presents a component-based modeling and validation method for PLC systems using the behavior-interaction-priority (BIP framework. We designed a general system architecture and a component library for a type of device control system. The control software and hardware of the environment were all modeled as BIP components. System requirements were formalized as monitors. Simulation was carried out to validate the system model. A realistic example from industry of the gates control system was employed to illustrate our strategies. We found a couple of design errors during the simulation, which helped us to improve the dependability of the original systems. The results of experiment demonstrated the effectiveness of our approach.

  16. Quantitative structure-activity relationships (QSAR) of 4-amino-2,6-diarylpyrimidine-5-carbonitriles with anti-inflammatory activity

    Energy Technology Data Exchange (ETDEWEB)

    Silva, Joao Bosco P. da; Ramos, Mozart N.; Barros Neto, Benicio de [Universidade Federal de Pernambuco (UFPE), Recife, PE (Brazil). Dept. de Quimica Fundamental]. E-mail: mramos@ufpe.br; Melo, Sebastiao Jose de [Universidade Federal de Pernambuco (UFPE), Recife, PE (Brazil). Dept. de Antibioticos]. E-mail: melosebastiao@yahoo.com.br; Falcao, Emerson Peter da Silva [Universidade Federal de Pernambuco (UFPE), Recife, PE (Brazil). Centro Academico de Vitoria de Santo Antao; Catanho, Maria Teresa J. de Almeida [Universidade Federal de Pernambuco (UFPE), Recife, PE (Brazil). Dept. de Biofisica e Radiobiologia

    2008-07-01

    The experimental anti-inflammatory activities of eight 4-amino-2,6-diarylpyrimidine-5- carbonitriles were subjected to a QSAR analysis based on results from B3LYP/6-31G(d,p) and AM1 electronic structure calculations. Principal component analyses and regressions based on these data indicate that potentially more active compounds should have low dipole moment and partition coefficient values and also be affected by the values of the charges of the carbon atoms through which the two aromatic rings are bonded to the pyrimidinic ring. Two new molecules were predicted to be at least as active as those with the highest activities used in the model building stage. One of them, having a methoxy group attached to one of the aromatic rings, was predicted to have an anti-inflammatory activity value of 52.3%. This molecule was synthesized and its experimental activity was found to be 52.8%, in agreement with the AM1 theoretical prediction. This value is 5% higher than the largest value used for modeling. (author)

  17. Quantitative structure-activity relationships (QSAR) of 4-amino-2,6-diarylpyrimidine-5-carbonitriles with anti-inflammatory activity

    International Nuclear Information System (INIS)

    Silva, Joao Bosco P. da; Ramos, Mozart N.; Barros Neto, Benicio de; Melo, Sebastiao Jose de; Falcao, Emerson Peter da Silva; Catanho, Maria Teresa J. de Almeida

    2008-01-01

    The experimental anti-inflammatory activities of eight 4-amino-2,6-diarylpyrimidine-5- carbonitriles were subjected to a QSAR analysis based on results from B3LYP/6-31G(d,p) and AM1 electronic structure calculations. Principal component analyses and regressions based on these data indicate that potentially more active compounds should have low dipole moment and partition coefficient values and also be affected by the values of the charges of the carbon atoms through which the two aromatic rings are bonded to the pyrimidinic ring. Two new molecules were predicted to be at least as active as those with the highest activities used in the model building stage. One of them, having a methoxy group attached to one of the aromatic rings, was predicted to have an anti-inflammatory activity value of 52.3%. This molecule was synthesized and its experimental activity was found to be 52.8%, in agreement with the AM1 theoretical prediction. This value is 5% higher than the largest value used for modeling. (author)

  18. Pyridones as NNRTIs against HIV-1 mutants: 3D-QSAR and protein informatics

    Science.gov (United States)

    Debnath, Utsab; Verma, Saroj; Jain, Surabhi; Katti, Setu B.; Prabhakar, Yenamandra S.

    2013-07-01

    CoMFA and CoMSIA based 3D-QSAR of HIV-1 RT wild and mutant (K103, Y181C, and Y188L) inhibitory activities of 4-benzyl/benzoyl pyridin-2-ones followed by protein informatics of corresponding non-nucleoside inhibitors' binding pockets from pdbs 2BAN, 3MED, 1JKH, and 2YNF were analysed to discover consensus features of the compounds for broad-spectrum activity. The CoMFA/CoMSIA models indicated that compounds with groups which lend steric-cum-electropositive fields in the vicinity of C5, hydrophobic field in the vicinity of C3 of pyridone region and steric field in aryl region produce broad-spectrum anti-HIV-1 RT activity. Also, a linker rendering electronegative field between pyridone and aryl moieties is common requirement for the activities. The protein informatics showed considerable alteration in residues 181 and 188 characteristics on mutation. Also, mutants' isoelectric points shifted in acidic direction. The study offered fresh avenues for broad-spectrum anti-HIV-1 agents through designing new molecules seeded with groups satisfying common molecular fields and concerns of mutating residues.

  19. Assessing Religious Orientations: Replication and Validation of the Commitment-Reflectivity Circumplex (CRC Model

    Directory of Open Access Journals (Sweden)

    Steven L. Isaak

    2017-09-01

    Full Text Available The Commitment-Reflectivity Circumplex (CRC model is a structural model of religious orientation that was designed to help organize and clarify measurement of foundational aspect of religiousness. The current study successfully replicated the CRC model using multidimensional scaling, and further evaluated the reliability, structure, and validity of their measures in both a university student sample (Study 1 and a nationally representative sample (Study 2. All 10 subscales of the Circumplex Religious Orientation Inventory (CROI demonstrated good reliability across both samples. A two-week test-retest of the CROI showed that the subscales are stable over time. A confirmatory factor analysis of the CROI in the representative adult sample demonstrated good model fit. Finally, the CROI’s validity was examined in relation to the Intrinsic, Extrinsic and Quest measures. Overall, the CROI appears to clarify much of the ambiguity inherent in the established scales by breaking down what were very broad orientations into very specific suborientations. The results suggest that the CRC model is applicable for diverse populations of adults. In addition, the CROI appears to be construct valid with good structural and psychometric properties across all 10 subscales.

  20. Feature Extraction for Structural Dynamics Model Validation

    Energy Technology Data Exchange (ETDEWEB)

    Farrar, Charles [Los Alamos National Laboratory; Nishio, Mayuko [Yokohama University; Hemez, Francois [Los Alamos National Laboratory; Stull, Chris [Los Alamos National Laboratory; Park, Gyuhae [Chonnam Univesity; Cornwell, Phil [Rose-Hulman Institute of Technology; Figueiredo, Eloi [Universidade Lusófona; Luscher, D. J. [Los Alamos National Laboratory; Worden, Keith [University of Sheffield

    2016-01-13

    As structural dynamics becomes increasingly non-modal, stochastic and nonlinear, finite element model-updating technology must adopt the broader notions of model validation and uncertainty quantification. For example, particular re-sampling procedures must be implemented to propagate uncertainty through a forward calculation, and non-modal features must be defined to analyze nonlinear data sets. The latter topic is the focus of this report, but first, some more general comments regarding the concept of model validation will be discussed.

  1. Modelling and validation of electromechanical shock absorbers

    Science.gov (United States)

    Tonoli, Andrea; Amati, Nicola; Girardello Detoni, Joaquim; Galluzzi, Renato; Gasparin, Enrico

    2013-08-01

    Electromechanical vehicle suspension systems represent a promising substitute to conventional hydraulic solutions. However, the design of electromechanical devices that are able to supply high damping forces without exceeding geometric dimension and mass constraints is a difficult task. All these challenges meet in off-road vehicle suspension systems, where the power density of the dampers is a crucial parameter. In this context, the present paper outlines a particular shock absorber configuration where a suitable electric machine and a transmission mechanism are utilised to meet off-road vehicle requirements. A dynamic model is used to represent the device. Subsequently, experimental tests are performed on an actual prototype to verify the functionality of the damper and validate the proposed model.

  2. Validation of ASTEC core degradation and containment models

    International Nuclear Information System (INIS)

    Kruse, Philipp; Brähler, Thimo; Koch, Marco K.

    2014-01-01

    Ruhr-Universitaet Bochum performed in a German funded project validation of in-vessel and containment models of the integral code ASTEC V2, jointly developed by IRSN (France) and GRS (Germany). In this paper selected results of this validation are presented. In the in-vessel part, the main point of interest was the validation of the code capability concerning cladding oxidation and hydrogen generation. The ASTEC calculations of QUENCH experiments QUENCH-03 and QUENCH-11 show satisfactory results, despite of some necessary adjustments in the input deck. Furthermore, the oxidation models based on the Cathcart–Pawel and Urbanic–Heidrick correlations are not suitable for higher temperatures while the ASTEC model BEST-FIT based on the Prater–Courtright approach at high temperature gives reliable enough results. One part of the containment model validation was the assessment of three hydrogen combustion models of ASTEC against the experiment BMC Ix9. The simulation results of these models differ from each other and therefore the quality of the simulations depends on the characteristic of each model. Accordingly, the CPA FRONT model, corresponding to the simplest necessary input parameters, provides the best agreement to the experimental data

  3. Predicting Error Bars for QSAR Models

    International Nuclear Information System (INIS)

    Schroeter, Timon; Schwaighofer, Anton; Mika, Sebastian; Ter Laak, Antonius; Suelzle, Detlev; Ganzer, Ursula; Heinrich, Nikolaus; Mueller, Klaus-Robert

    2007-01-01

    Unfavorable physicochemical properties often cause drug failures. It is therefore important to take lipophilicity and water solubility into account early on in lead discovery. This study presents log D 7 models built using Gaussian Process regression, Support Vector Machines, decision trees and ridge regression algorithms based on 14556 drug discovery compounds of Bayer Schering Pharma. A blind test was conducted using 7013 new measurements from the last months. We also present independent evaluations using public data. Apart from accuracy, we discuss the quality of error bars that can be computed by Gaussian Process models, and ensemble and distance based techniques for the other modelling approaches

  4. A discussion on validation of hydrogeological models

    International Nuclear Information System (INIS)

    Carrera, J.; Mousavi, S.F.; Usunoff, E.J.; Sanchez-Vila, X.; Galarza, G.

    1993-01-01

    Groundwater flow and solute transport are often driven by heterogeneities that elude easy identification. It is also difficult to select and describe the physico-chemical processes controlling solute behaviour. As a result, definition of a conceptual model involves numerous assumptions both on the selection of processes and on the representation of their spatial variability. Validating a numerical model by comparing its predictions with actual measurements may not be sufficient for evaluating whether or not it provides a good representation of 'reality'. Predictions will be close to measurements, regardless of model validity, if these are taken from experiments that stress well-calibrated model modes. On the other hand, predictions will be far from measurements when model parameters are very uncertain, even if the model is indeed a very good representation of the real system. Hence, we contend that 'classical' validation of hydrogeological models is not possible. Rather, models should be viewed as theories about the real system. We propose to follow a rigorous modeling approach in which different sources of uncertainty are explicitly recognized. The application of one such approach is illustrated by modeling a laboratory uranium tracer test performed on fresh granite, which was used as Test Case 1b in INTRAVAL. (author)

  5. Verification and validation for waste disposal models

    International Nuclear Information System (INIS)

    1987-07-01

    A set of evaluation criteria has been developed to assess the suitability of current verification and validation techniques for waste disposal methods. A survey of current practices and techniques was undertaken and evaluated using these criteria with the items most relevant to waste disposal models being identified. Recommendations regarding the most suitable verification and validation practices for nuclear waste disposal modelling software have been made

  6. Structural insights of Staphylococcus aureus FtsZ inhibitors through molecular docking, 3D-QSAR and molecular dynamics simulations.

    Science.gov (United States)

    Ballu, Srilata; Itteboina, Ramesh; Sivan, Sree Kanth; Manga, Vijjulatha

    2018-02-01

    Filamentous temperature-sensitive protein Z (FtsZ) is a protein encoded by the FtsZ gene that assembles into a Z-ring at the future site of the septum of bacterial cell division. Structurally, FtsZ is a homolog of eukaryotic tubulin but has low sequence similarity; this makes it possible to obtain FtsZ inhibitors without affecting the eukaryotic cell division. Computational studies were performed on a series of substituted 3-arylalkoxybenzamide derivatives reported as inhibitors of FtsZ activity in Staphylococcus aureus. Quantitative structure-activity relationship models (QSAR) models generated showed good statistical reliability, which is evident from r 2 ncv and r 2 loo values. The predictive ability of these models was determined and an acceptable predictive correlation (r 2 Pred ) values were obtained. Finally, we performed molecular dynamics simulations in order to examine the stability of protein-ligand interactions. This facilitated us to compare free binding energies of cocrystal ligand and newly designed molecule B1. The good concordance between the docking results and comparative molecular field analysis (CoMFA)/comparative molecular similarity indices analysis (CoMSIA) contour maps afforded obliging clues for the rational modification of molecules to design more potent FtsZ inhibitors.

  7. Novel approach for efficient predictions properties of large pool of nanomaterials based on limited set of species: nano-read-across

    International Nuclear Information System (INIS)

    Gajewicz, Agnieszka; Puzyn, Tomasz; Cronin, Mark T.D; Rasulev, Bakhtiyor; Leszczynski, Jerzy

    2015-01-01

    Creating suitable chemical categories and developing read-across methods, supported by quantum mechanical calculations, can be an effective solution to solving key problems related to current scarcity of data on the toxicity of various nanoparticles. This study has demonstrated that by applying a nano-read-across, the cytotoxicity of nano-sized metal oxides could be estimated with a similar level of accuracy as provided by quantitative structure-activity relationship for nanomaterials (nano-QSAR model(s)). The method presented is a suitable computational tool for the preliminary hazard assessment of nanomaterials. It also could be used for the identification of nanomaterials that may pose potential negative impact to human health and the environment. Such approaches are especially necessary when there is paucity of relevant and reliable data points to develop and validate nano-QSAR models. (paper)

  8. Structural system identification: Structural dynamics model validation

    Energy Technology Data Exchange (ETDEWEB)

    Red-Horse, J.R.

    1997-04-01

    Structural system identification is concerned with the development of systematic procedures and tools for developing predictive analytical models based on a physical structure`s dynamic response characteristics. It is a multidisciplinary process that involves the ability (1) to define high fidelity physics-based analysis models, (2) to acquire accurate test-derived information for physical specimens using diagnostic experiments, (3) to validate the numerical simulation model by reconciling differences that inevitably exist between the analysis model and the experimental data, and (4) to quantify uncertainties in the final system models and subsequent numerical simulations. The goal of this project was to develop structural system identification techniques and software suitable for both research and production applications in code and model validation.

  9. Validating the passenger traffic model for Copenhagen

    DEFF Research Database (Denmark)

    Overgård, Christian Hansen; VUK, Goran

    2006-01-01

    The paper presents a comprehensive validation procedure for the passenger traffic model for Copenhagen based on external data from the Danish national travel survey and traffic counts. The model was validated for the years 2000 to 2004, with 2004 being of particular interest because the Copenhagen...... matched the observed traffic better than those of the transit assignment model. With respect to the metro forecasts, the model over-predicts metro passenger flows by 10% to 50%. The wide range of findings from the project resulted in two actions. First, a project was started in January 2005 to upgrade...

  10. Bayesian risk-based decision method for model validation under uncertainty

    International Nuclear Information System (INIS)

    Jiang Xiaomo; Mahadevan, Sankaran

    2007-01-01

    This paper develops a decision-making methodology for computational model validation, considering the risk of using the current model, data support for the current model, and cost of acquiring new information to improve the model. A Bayesian decision theory-based method is developed for this purpose, using a likelihood ratio as the validation metric for model assessment. An expected risk or cost function is defined as a function of the decision costs, and the likelihood and prior of each hypothesis. The risk is minimized through correctly assigning experimental data to two decision regions based on the comparison of the likelihood ratio with a decision threshold. A Bayesian validation metric is derived based on the risk minimization criterion. Two types of validation tests are considered: pass/fail tests and system response value measurement tests. The methodology is illustrated for the validation of reliability prediction models in a tension bar and an engine blade subjected to high cycle fatigue. The proposed method can effectively integrate optimal experimental design into model validation to simultaneously reduce the cost and improve the accuracy of reliability model assessment

  11. Comparison of the Predictive Performance and Interpretability of Random Forest and Linear Models on Benchmark Data Sets.

    Science.gov (United States)

    Marchese Robinson, Richard L; Palczewska, Anna; Palczewski, Jan; Kidley, Nathan

    2017-08-28

    The ability to interpret the predictions made by quantitative structure-activity relationships (QSARs) offers a number of advantages. While QSARs built using nonlinear modeling approaches, such as the popular Random Forest algorithm, might sometimes be more predictive than those built using linear modeling approaches, their predictions have been perceived as difficult to interpret. However, a growing number of approaches have been proposed for interpreting nonlinear QSAR models in general and Random Forest in particular. In the current work, we compare the performance of Random Forest to those of two widely used linear modeling approaches: linear Support Vector Machines (SVMs) (or Support Vector Regression (SVR)) and partial least-squares (PLS). We compare their performance in terms of their predictivity as well as the chemical interpretability of the predictions using novel scoring schemes for assessing heat map images of substructural contributions. We critically assess different approaches for interpreting Random Forest models as well as for obtaining predictions from the forest. We assess the models on a large number of widely employed public-domain benchmark data sets corresponding to regression and binary classification problems of relevance to hit identification and toxicology. We conclude that Random Forest typically yields comparable or possibly better predictive performance than the linear modeling approaches and that its predictions may also be interpreted in a chemically and biologically meaningful way. In contrast to earlier work looking at interpretation of nonlinear QSAR models, we directly compare two methodologically distinct approaches for interpreting Random Forest models. The approaches for interpreting Random Forest assessed in our article were implemented using open-source programs that we have made available to the community. These programs are the rfFC package ( https://r-forge.r-project.org/R/?group_id=1725 ) for the R statistical

  12. CoMFA and CoMSIA studies on C-aryl glucoside SGLT2 inhibitors as potential anti-diabetic agents.

    Science.gov (United States)

    Vyas, V K; Bhatt, H G; Patel, P K; Jalu, J; Chintha, C; Gupta, N; Ghate, M

    2013-01-01

    SGLT2 has become a target of therapeutic interest in diabetes research. CoMFA and CoMSIA studies were performed on C-aryl glucoside SGLT2 inhibitors (180 analogues) as potential anti-diabetic agents. Three different alignment strategies were used for the compounds. The best CoMFA and CoMSIA models were obtained by means of Distill rigid body alignment of training and test sets, and found statistically significant with cross-validated coefficients (q²) of 0.602 and 0.618, respectively, and conventional coefficients (r²) of 0.905 and 0.902, respectively. Both models were validated by a test set of 36 compounds giving satisfactory predicted correlation coefficients (r² pred) of 0.622 and 0.584 for CoMFA and CoMSIA models, respectively. A comparison was made with earlier 3D QSAR study on SGLT2 inhibitors, which shows that our 3D QSAR models are better than earlier models to predict good inhibitory activity. CoMFA and CoMSIA models generated in this work can provide useful information to design new compounds and helped in prediction of activity prior to synthesis.

  13. Validation of mentorship model for newly qualified professional ...

    African Journals Online (AJOL)

    Newly qualified professional nurses (NQPNs) allocated to community health care services require the use of validated model to practice independently. Validation was done to adapt and assess if the model is understood and could be implemented by NQPNs and mentors employed in community health care services.

  14. Validation and Adaptation of Router and Switch Models

    NARCIS (Netherlands)

    Boltjes, B.; Fernandez Diaz, I.; Kock, B.A.; Langeveld, R.J.G.M.; Schoenmaker, G.

    2003-01-01

    This paper describes validating OPNET models of key devices for the next generation IP-based tactical network of the Royal Netherlands Army (RNLA). The task of TNO-FEL is to provide insight in scalability and performance of future deployed networks. Because validated models ol key Cisco equipment

  15. Validity of information security policy models

    Directory of Open Access Journals (Sweden)

    Joshua Onome Imoniana

    Full Text Available Validity is concerned with establishing evidence for the use of a method to be used with a particular set of population. Thus, when we address the issue of application of security policy models, we are concerned with the implementation of a certain policy, taking into consideration the standards required, through attribution of scores to every item in the research instrument. En today's globalized economic scenarios, the implementation of information security policy, in an information technology environment, is a condition sine qua non for the strategic management process of any organization. Regarding this topic, various studies present evidences that, the responsibility for maintaining a policy rests primarily with the Chief Security Officer. The Chief Security Officer, in doing so, strives to enhance the updating of technologies, in order to meet all-inclusive business continuity planning policies. Therefore, for such policy to be effective, it has to be entirely embraced by the Chief Executive Officer. This study was developed with the purpose of validating specific theoretical models, whose designs were based on literature review, by sampling 10 of the Automobile Industries located in the ABC region of Metropolitan São Paulo City. This sampling was based on the representativeness of such industries, particularly with regards to each one's implementation of information technology in the region. The current study concludes, presenting evidence of the discriminating validity of four key dimensions of the security policy, being such: the Physical Security, the Logical Access Security, the Administrative Security, and the Legal & Environmental Security. On analyzing the Alpha of Crombach structure of these security items, results not only attest that the capacity of those industries to implement security policies is indisputable, but also, the items involved, homogeneously correlate to each other.

  16. Atmospheric corrosion: statistical validation of models

    International Nuclear Information System (INIS)

    Diaz, V.; Martinez-Luaces, V.; Guineo-Cobs, G.

    2003-01-01

    In this paper we discuss two different methods for validation of regression models, applied to corrosion data. One of them is based on the correlation coefficient and the other one is the statistical test of lack of fit. Both methods are used here to analyse fitting of bi logarithmic model in order to predict corrosion for very low carbon steel substrates in rural and urban-industrial atmospheres in Uruguay. Results for parameters A and n of the bi logarithmic model are reported here. For this purpose, all repeated values were used instead of using average values as usual. Modelling is carried out using experimental data corresponding to steel substrates under the same initial meteorological conditions ( in fact, they are put in the rack at the same time). Results of correlation coefficient are compared with the lack of it tested at two different signification levels (α=0.01 and α=0.05). Unexpected differences between them are explained and finally, it is possible to conclude, at least in the studied atmospheres, that the bi logarithmic model does not fit properly the experimental data. (Author) 18 refs

  17. Discovery of molecular mechanism of a clinical herbal formula upregulating serum HDL-c levels in treatment of metabolic syndrome by in vivo and computational studies.

    Science.gov (United States)

    Chen, Meimei; Yang, Fafu; Kang, Jie; Gan, Huijuan; Lai, Xinmei; Gao, Yuxing

    2018-01-15

    Decreased HDL cholesterol (HDL-c) is considered as an independent risk factor of cardiovascular disease in metabolic syndrome (Mets). Wendan decoction (WDD), a famous clinical traditional Chinese medicine formula in Mets in China, which can obviously up-regulate serum HDL-c levels in Mets. However, till now, the molecular mechanism of up-regulation still remained unclear. In this study, an integrated approach that combined serum ABCA1 in vivo assay, QSAR modeling and molecular docking was developed to explore the molecular mechanism and chemical substance basis of WDD upregulating HDL-c levels. Compared with Mets model group, serum ABCA1 and HDL-c levels intervened by two different doses of WDD for two weeks were significantly up-regulated. Then, kohonen and LDA were applied to develop QSAR models for ABCA1 up-regulators based flavonoids. The derived QSAR model produced the overall accuracy of 100%, a very powerful tool for screening ABCA1 up-regulators. The QSAR model prediction revealed 67 flavonoids in WDD were ABCA1 up-regulators. Finally, they were subjected to the molecular docking to understand their roles in up-regulating ABCA1 expression, which led to discovery of 23 ABCA1 up-regulators targeting LXR beta. Overall, QSAR modeling and docking studies well accounted for the observed in vivo activities of ABCA1 affected by WDD. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. External validation of the Cairns Prediction Model (CPM) to predict conversion from laparoscopic to open cholecystectomy.

    Science.gov (United States)

    Hu, Alan Shiun Yew; Donohue, Peter O'; Gunnarsson, Ronny K; de Costa, Alan

    2018-03-14

    Valid and user-friendly prediction models for conversion to open cholecystectomy allow for proper planning prior to surgery. The Cairns Prediction Model (CPM) has been in use clinically in the original study site for the past three years, but has not been tested at other sites. A retrospective, single-centred study collected ultrasonic measurements and clinical variables alongside with conversion status from consecutive patients who underwent laparoscopic cholecystectomy from 2013 to 2016 in The Townsville Hospital, North Queensland, Australia. An area under the curve (AUC) was calculated to externally validate of the CPM. Conversion was necessary in 43 (4.2%) out of 1035 patients. External validation showed an area under the curve of 0.87 (95% CI 0.82-0.93, p = 1.1 × 10 -14 ). In comparison with most previously published models, which have an AUC of approximately 0.80 or less, the CPM has the highest AUC of all published prediction models both for internal and external validation. Crown Copyright © 2018. Published by Elsevier Inc. All rights reserved.

  19. Quantitative Structure-Use Relationship (QSUR) Model Descriptors

    Data.gov (United States)

    U.S. Environmental Protection Agency — This data set contains ToxPrint finger prints for all chemicals in FUse that had QSAR-ready SMILES strings as well as select physicochemical properties from the...

  20. Development of a Conservative Model Validation Approach for Reliable Analysis

    Science.gov (United States)

    2015-01-01

    CIE 2015 August 2-5, 2015, Boston, Massachusetts, USA [DRAFT] DETC2015-46982 DEVELOPMENT OF A CONSERVATIVE MODEL VALIDATION APPROACH FOR RELIABLE...obtain a conservative simulation model for reliable design even with limited experimental data. Very little research has taken into account the...3, the proposed conservative model validation is briefly compared to the conventional model validation approach. Section 4 describes how to account

  1. Development of Learning Models Based on Problem Solving and Meaningful Learning Standards by Expert Validity for Animal Development Course

    Science.gov (United States)

    Lufri, L.; Fitri, R.; Yogica, R.

    2018-04-01

    The purpose of this study is to produce a learning model based on problem solving and meaningful learning standards by expert assessment or validation for the course of Animal Development. This research is a development research that produce the product in the form of learning model, which consist of sub product, namely: the syntax of learning model and student worksheets. All of these products are standardized through expert validation. The research data is the level of validity of all sub products obtained using questionnaire, filled by validators from various field of expertise (field of study, learning strategy, Bahasa). Data were analysed using descriptive statistics. The result of the research shows that the problem solving and meaningful learning model has been produced. Sub products declared appropriate by expert include the syntax of learning model and student worksheet.

  2. Structure-Activity Relationships Based on 3D-QSAR CoMFA/CoMSIA and Design of Aryloxypropanol-Amine Agonists with Selectivity for the Human β3-Adrenergic Receptor and Anti-Obesity and Anti-Diabetic Profiles

    Directory of Open Access Journals (Sweden)

    Marcos Lorca

    2018-05-01

    Full Text Available The wide tissue distribution of the adrenergic β3 receptor makes it a potential target for the treatment of multiple pathologies such as diabetes, obesity, depression, overactive bladder (OAB, and cancer. Currently, there is only one drug on the market, mirabegron, approved for the treatment of OAB. In the present study, we have carried out an extensive structure-activity relationship analysis of a series of 41 aryloxypropanolamine compounds based on three-dimensional quantitative structure-activity relationship (3D-QSAR techniques. This is the first combined comparative molecular field analysis (CoMFA and comparative molecular similarity index analysis (CoMSIA study in a series of selective aryloxypropanolamines displaying anti-diabetes and anti-obesity pharmacological profiles. The best CoMFA and CoMSIA models presented values of r2ncv = 0.993 and 0.984 and values of r2test = 0.865 and 0.918, respectively. The results obtained were subjected to extensive external validation (q2, r2, r2m, etc. and a final series of compounds was designed and their biological activity was predicted (best pEC50 = 8.561.

  3. Validation of ecological state space models using the Laplace approximation

    DEFF Research Database (Denmark)

    Thygesen, Uffe Høgsbro; Albertsen, Christoffer Moesgaard; Berg, Casper Willestofte

    2017-01-01

    Many statistical models in ecology follow the state space paradigm. For such models, the important step of model validation rarely receives as much attention as estimation or hypothesis testing, perhaps due to lack of available algorithms and software. Model validation is often based on a naive...... for estimation in general mixed effects models. Implementing one-step predictions in the R package Template Model Builder, we demonstrate that it is possible to perform model validation with little effort, even if the ecological model is multivariate, has non-linear dynamics, and whether observations...... useful directions in which the model could be improved....

  4. A stochastic model for the synthesis and degradation of natural organic matter. Part III: Modeling Cu(II) complexation

    Energy Technology Data Exchange (ETDEWEB)

    Cabaniss, Stephen E. [Department of Chemistry, University of New Mexico, Albuquerque, NM 87131 (United States)], E-mail: cabaniss@unm.edu; Maurice, Patricia A. [Department of Geology and Civil Engineering, University of Notre Dame (United States); Madey, Greg [Department of Computer Science, University of Notre Dame (United States)

    2007-08-15

    An agent-based biogeochemical model has been developed which begins with biochemical precursor molecules and simulates the transformation and degradation of natural organic matter (NOM). This manuscript presents an empirical quantitative structure activity relationship (QSAR) which uses the numbers of ligand groups, charge density and heteroatom density of a molecule to estimate Cu-binding affinity (K{sub Cu}{sup '}) at pH 7.0 and ionic strength 0.10 for the molecules in this model. Calibration of this QSAR on a set of 41 model compounds gives a root mean square error of 0.88 log units and r{sup 2} 0.93. Two simulated NOM assemblages, one beginning with small molecules (tannins, terpenoids, flavonoids) and one with biopolymers (protein, lignin), give markedly different distributions of logK{sub Cu}{sup '}. However, calculations based on these logK{sub Cu}{sup '} distributions agree qualitatively with published experimental Cu(II) titration data from river and lake NOM samples.

  5. Discovery of novel urokinase plasminogen activator (uPA) inhibitors using ligand-based modeling and virtual screening followed by in vitro analysis.

    Science.gov (United States)

    Al-Sha'er, Mahmoud A; Khanfar, Mohammad A; Taha, Mutasem O

    2014-01-01

    Urokinase plasminogen activator (uPA)-a serine protease-is thought to play a central role in tumor metastasis and angiogenesis and, therefore, inhibition of this enzyme could be beneficial in treating cancer. Toward this end, we explored the pharmacophoric space of 202 uPA inhibitors using seven diverse sets of inhibitors to identify high-quality pharmacophores. Subsequently, we employed genetic algorithm-based quantitative structure-activity relationship (QSAR) analysis as a competition arena to select the best possible combination of pharmacophoric models and physicochemical descriptors that can explain bioactivity variation within the training inhibitors (r (2) 162 = 0.74, F-statistic = 64.30, r (2) LOO = 0.71, r (2) PRESS against 40 test inhibitors = 0.79). Three orthogonal pharmacophores emerged in the QSAR equation suggesting the existence of at least three binding modes accessible to ligands within the uPA binding pocket. This conclusion was supported by receiver operating characteristic (ROC) curve analyses of the QSAR-selected pharmacophores. Moreover, the three pharmacophores were comparable with binding interactions seen in crystallographic structures of bound ligands within the uPA binding pocket. We employed the resulting pharmacophoric models and associated QSAR equation to screen the national cancer institute (NCI) list of compounds. The captured hits were tested in vitro. Overall, our modeling workflow identified new low micromolar anti-uPA hits.

  6. Some Phthalocyanine and Naphthalocyanine Derivatives as Corrosion Inhibitors for Aluminium in Acidic Medium: Experimental, Quantum Chemical Calculations, QSAR Studies and Synergistic Effect of Iodide Ions

    Directory of Open Access Journals (Sweden)

    Masego Dibetsoe

    2015-08-01

    Full Text Available The effects of seven macrocyclic compounds comprising four phthalocyanines (Pcs namely 1,4,8,11,15,18,22,25-octabutoxy-29H,31H-phthalocyanine (Pc1, 2,3,9,10,16,17,23,24-octakis(octyloxy-29H,31H-phthalocyanine (Pc2, 2,9,16,23-tetra-tert-butyl-29H,31H-phthalocyanine (Pc3 and 29H,31H-phthalocyanine (Pc4, and three naphthalocyanines namely 5,9,14,18,23,27,32,36-octabutoxy-2,3-naphthalocyanine (nPc1, 2,11,20,29-tetra-tert-butyl-2,3-naphthalocyanine (nPc2 and 2,3-naphthalocyanine (nP3 were investigated on the corrosion of aluminium (Al in 1 M HCl using a gravimetric method, potentiodynamic polarization technique, quantum chemical calculations and quantitative structure activity relationship (QSAR. Synergistic effects of KI on the corrosion inhibition properties of the compounds were also investigated. All the studied compounds showed appreciable inhibition efficiencies, which decrease with increasing temperature from 30 °C to 70 °C. At each concentration of the inhibitor, addition of 0.1% KI increased the inhibition efficiency compared to the absence of KI indicating the occurrence of synergistic interactions between the studied molecules and I− ions. From the potentiodynamic polarization studies, the studied Pcs and nPcs are mixed type corrosion inhibitors both without and with addition of KI. The adsorption of the studied molecules on Al surface obeys the Langmuir adsorption isotherm, while the thermodynamic and kinetic parameters revealed that the adsorption of the studied compounds on Al surface is spontaneous and involves competitive physisorption and chemisorption mechanisms. The experimental results revealed the aggregated interactions between the inhibitor molecules and the results further indicated that the peripheral groups on the compounds affect these interactions. The calculated quantum chemical parameters and the QSAR results revealed the possibility of strong interactions between the studied inhibitors and metal surface. QSAR

  7. Model-based Systems Engineering: Creation and Implementation of Model Validation Rules for MOS 2.0

    Science.gov (United States)

    Schmidt, Conrad K.

    2013-01-01

    Model-based Systems Engineering (MBSE) is an emerging modeling application that is used to enhance the system development process. MBSE allows for the centralization of project and system information that would otherwise be stored in extraneous locations, yielding better communication, expedited document generation and increased knowledge capture. Based on MBSE concepts and the employment of the Systems Modeling Language (SysML), extremely large and complex systems can be modeled from conceptual design through all system lifecycles. The Operations Revitalization Initiative (OpsRev) seeks to leverage MBSE to modernize the aging Advanced Multi-Mission Operations Systems (AMMOS) into the Mission Operations System 2.0 (MOS 2.0). The MOS 2.0 will be delivered in a series of conceptual and design models and documents built using the modeling tool MagicDraw. To ensure model completeness and cohesiveness, it is imperative that the MOS 2.0 models adhere to the specifications, patterns and profiles of the Mission Service Architecture Framework, thus leading to the use of validation rules. This paper outlines the process by which validation rules are identified, designed, implemented and tested. Ultimately, these rules provide the ability to maintain model correctness and synchronization in a simple, quick and effective manner, thus allowing the continuation of project and system progress.

  8. Refining and validating a conceptual model of Clinical Nurse Leader integrated care delivery.

    Science.gov (United States)

    Bender, Miriam; Williams, Marjory; Su, Wei; Hites, Lisle

    2017-02-01

    To empirically validate a conceptual model of Clinical Nurse Leader integrated care delivery. There is limited evidence of frontline care delivery models that consistently achieve quality patient outcomes. Clinical Nurse Leader integrated care delivery is a promising nursing model with a growing record of success. However, theoretical clarity is necessary to generate causal evidence of effectiveness. Sequential mixed methods. A preliminary Clinical Nurse Leader practice model was refined and survey items developed to correspond with model domains, using focus groups and a Delphi process with a multi-professional expert panel. The survey was administered in 2015 to clinicians and administrators involved in Clinical Nurse Leader initiatives. Confirmatory factor analysis and structural equation modelling were used to validate the measurement and model structure. Final sample n = 518. The model incorporates 13 components organized into five conceptual domains: 'Readiness for Clinical Nurse Leader integrated care delivery'; 'Structuring Clinical Nurse Leader integrated care delivery'; 'Clinical Nurse Leader Practice: Continuous Clinical Leadership'; 'Outcomes of Clinical Nurse Leader integrated care delivery'; and 'Value'. Sample data had good fit with specified model and two-level measurement structure. All hypothesized pathways were significant, with strong coefficients suggesting good fit between theorized and observed path relationships. The validated model articulates an explanatory pathway of Clinical Nurse Leader integrated care delivery, including Clinical Nurse Leader practices that result in improved care dynamics and patient outcomes. The validated model provides a basis for testing in practice to generate evidence that can be deployed across the healthcare spectrum. © 2016 John Wiley & Sons Ltd.

  9. Numerical Validation of Chemical Compositional Model for Wettability Alteration Processes

    Science.gov (United States)

    Bekbauov, Bakhbergen; Berdyshev, Abdumauvlen; Baishemirov, Zharasbek; Bau, Domenico

    2017-12-01

    Chemical compositional simulation of enhanced oil recovery and surfactant enhanced aquifer remediation processes is a complex task that involves solving dozens of equations for all grid blocks representing a reservoir. In the present work, we perform a numerical validation of the newly developed mathematical formulation which satisfies the conservation laws of mass and energy and allows applying a sequential solution approach to solve the governing equations separately and implicitly. Through its application to the numerical experiment using a wettability alteration model and comparisons with existing chemical compositional model's numerical results, the new model has proven to be practical, reliable and stable.

  10. Prospective validation of pathologic complete response models in rectal cancer: Transferability and reproducibility.

    Science.gov (United States)

    van Soest, Johan; Meldolesi, Elisa; van Stiphout, Ruud; Gatta, Roberto; Damiani, Andrea; Valentini, Vincenzo; Lambin, Philippe; Dekker, Andre

    2017-09-01

    Multiple models have been developed to predict pathologic complete response (pCR) in locally advanced rectal cancer patients. Unfortunately, validation of these models normally omit the implications of cohort differences on prediction model performance. In this work, we will perform a prospective validation of three pCR models, including information whether this validation will target transferability or reproducibility (cohort differences) of the given models. We applied a novel methodology, the cohort differences model, to predict whether a patient belongs to the training or to the validation cohort. If the cohort differences model performs well, it would suggest a large difference in cohort characteristics meaning we would validate the transferability of the model rather than reproducibility. We tested our method in a prospective validation of three existing models for pCR prediction in 154 patients. Our results showed a large difference between training and validation cohort for one of the three tested models [Area under the Receiver Operating Curve (AUC) cohort differences model: 0.85], signaling the validation leans towards transferability. Two out of three models had a lower AUC for validation (0.66 and 0.58), one model showed a higher AUC in the validation cohort (0.70). We have successfully applied a new methodology in the validation of three prediction models, which allows us to indicate if a validation targeted transferability (large differences between training/validation cohort) or reproducibility (small cohort differences). © 2017 American Association of Physicists in Medicine.

  11. QSAR Studies of 6-Amino Uracil Base Analogues: A Thymidine Phosphorylase Inhibitor in Cancer Therapy

    Directory of Open Access Journals (Sweden)

    Surya Prakash B. N. Gupta

    2008-01-01

    Full Text Available A novel series of 6-amino uracil base analogue were synthesized. QSAR study was used to relate the selective nonsubstrate inhibitory activity of 6-amino uracil base analogue with various physicochemical descriptors. Stepwise multiple regression analysis was performed to find out the correlation between various physicochemical descriptors and biological activity of the compounds by using Openstat 2 version 6.5.1 and valstat statistical software. Out of the several equations developed, the best equation having the highest significance was selected for further study. The equation is able to explain 60% of total variance and are more than 95% significant as revealed by the F value.

  12. Using quantitative structure-activity relationships (QSAR) to predict toxic endpoints for polycyclic aromatic hydrocarbons (PAH).

    Science.gov (United States)

    Bruce, Erica D; Autenrieth, Robin L; Burghardt, Robert C; Donnelly, K C; McDonald, Thomas J

    2008-01-01

    Quantitative structure-activity relationships (QSAR) offer a reliable, cost-effective alternative to the time, money, and animal lives necessary to determine chemical toxicity by traditional methods. Additionally, humans are exposed to tens of thousands of chemicals in their lifetimes, necessitating the determination of chemical toxicity and screening for those posing the greatest risk to human health. This study developed models to predict toxic endpoints for three bioassays specific to several stages of carcinogenesis. The ethoxyresorufin O-deethylase assay (EROD), the Salmonella/microsome assay, and a gap junction intercellular communication (GJIC) assay were chosen for their ability to measure toxic endpoints specific to activation-, induction-, and promotion-related effects of polycyclic aromatic hydrocarbons (PAH). Shape-electronic, spatial, information content, and topological descriptors proved to be important descriptors in predicting the toxicity of PAH in these bioassays. Bioassay-based toxic equivalency factors (TEF(B)) were developed for several PAH using the quantitative structure-toxicity relationships (QSTR) developed. Predicting toxicity for a specific PAH compound, such as a bioassay-based potential potency (PP(B)) or a TEF(B), is possible by combining the predicted behavior from the QSTR models. These toxicity estimates may then be incorporated into a risk assessment for compounds that lack toxicity data. Accurate toxicity predictions are made by examining each type of endpoint important to the process of carcinogenicity, and a clearer understanding between composition and toxicity can be obtained.

  13. Image decomposition as a tool for validating stress analysis models

    Directory of Open Access Journals (Sweden)

    Mottershead J.

    2010-06-01

    Full Text Available It is good practice to validate analytical and numerical models used in stress analysis for engineering design by comparison with measurements obtained from real components either in-service or in the laboratory. In reality, this critical step is often neglected or reduced to placing a single strain gage at the predicted hot-spot of stress. Modern techniques of optical analysis allow full-field maps of displacement, strain and, or stress to be obtained from real components with relative ease and at modest cost. However, validations continued to be performed only at predicted and, or observed hot-spots and most of the wealth of data is ignored. It is proposed that image decomposition methods, commonly employed in techniques such as fingerprinting and iris recognition, can be employed to validate stress analysis models by comparing all of the key features in the data from the experiment and the model. Image decomposition techniques such as Zernike moments and Fourier transforms have been used to decompose full-field distributions for strain generated from optical techniques such as digital image correlation and thermoelastic stress analysis as well as from analytical and numerical models by treating the strain distributions as images. The result of the decomposition is 101 to 102 image descriptors instead of the 105 or 106 pixels in the original data. As a consequence, it is relatively easy to make a statistical comparison of the image descriptors from the experiment and from the analytical/numerical model and to provide a quantitative assessment of the stress analysis.

  14. A methodology for PSA model validation

    International Nuclear Information System (INIS)

    Unwin, S.D.

    1995-09-01

    This document reports Phase 2 of work undertaken by Science Applications International Corporation (SAIC) in support of the Atomic Energy Control Board's Probabilistic Safety Assessment (PSA) review. A methodology is presented for the systematic review and evaluation of a PSA model. These methods are intended to support consideration of the following question: To within the scope and depth of modeling resolution of a PSA study, is the resultant model a complete and accurate representation of the subject plant? This question was identified as a key PSA validation issue in SAIC's Phase 1 project. The validation methods are based on a model transformation process devised to enhance the transparency of the modeling assumptions. Through conversion to a 'success-oriented' framework, a closer correspondence to plant design and operational specifications is achieved. This can both enhance the scrutability of the model by plant personnel, and provide an alternative perspective on the model that may assist in the identification of deficiencies. The model transformation process is defined and applied to fault trees documented in the Darlington Probabilistic Safety Evaluation. A tentative real-time process is outlined for implementation and documentation of a PSA review based on the proposed methods. (author). 11 refs., 9 tabs., 30 refs

  15. Validated predictive modelling of the environmental resistome.

    Science.gov (United States)

    Amos, Gregory C A; Gozzard, Emma; Carter, Charlotte E; Mead, Andrew; Bowes, Mike J; Hawkey, Peter M; Zhang, Lihong; Singer, Andrew C; Gaze, William H; Wellington, Elizabeth M H

    2015-06-01

    Multi-drug-resistant bacteria pose a significant threat to public health. The role of the environment in the overall rise in antibiotic-resistant infections and risk to humans is largely unknown. This study aimed to evaluate drivers of antibiotic-resistance levels across the River Thames catchment, model key biotic, spatial and chemical variables and produce predictive models for future risk assessment. Sediment samples from 13 sites across the River Thames basin were taken at four time points across 2011 and 2012. Samples were analysed for class 1 integron prevalence and enumeration of third-generation cephalosporin-resistant bacteria. Class 1 integron prevalence was validated as a molecular marker of antibiotic resistance; levels of resistance showed significant geospatial and temporal variation. The main explanatory variables of resistance levels at each sample site were the number, proximity, size and type of surrounding wastewater-treatment plants. Model 1 revealed treatment plants accounted for 49.5% of the variance in resistance levels. Other contributing factors were extent of different surrounding land cover types (for example, Neutral Grassland), temporal patterns and prior rainfall; when modelling all variables the resulting model (Model 2) could explain 82.9% of variations in resistance levels in the whole catchment. Chemical analyses correlated with key indicators of treatment plant effluent and a model (Model 3) was generated based on water quality parameters (contaminant and macro- and micro-nutrient levels). Model 2 was beta tested on independent sites and explained over 78% of the variation in integron prevalence showing a significant predictive ability. We believe all models in this study are highly useful tools for informing and prioritising mitigation strategies to reduce the environmental resistome.

  16. A New Statistical Method to Determine the Degree of Validity of Health Economic Model Outcomes against Empirical Data.

    Science.gov (United States)

    Corro Ramos, Isaac; van Voorn, George A K; Vemer, Pepijn; Feenstra, Talitha L; Al, Maiwenn J

    2017-09-01

    The validation of health economic (HE) model outcomes against empirical data is of key importance. Although statistical testing seems applicable, guidelines for the validation of HE models lack guidance on statistical validation, and actual validation efforts often present subjective judgment of graphs and point estimates. To discuss the applicability of existing validation techniques and to present a new method for quantifying the degrees of validity statistically, which is useful for decision makers. A new Bayesian method is proposed to determine how well HE model outcomes compare with empirical data. Validity is based on a pre-established accuracy interval in which the model outcomes should fall. The method uses the outcomes of a probabilistic sensitivity analysis and results in a posterior distribution around the probability that HE model outcomes can be regarded as valid. We use a published diabetes model (Modelling Integrated Care for Diabetes based on Observational data) to validate the outcome "number of patients who are on dialysis or with end-stage renal disease." Results indicate that a high probability of a valid outcome is associated with relatively wide accuracy intervals. In particular, 25% deviation from the observed outcome implied approximately 60% expected validity. Current practice in HE model validation can be improved by using an alternative method based on assessing whether the model outcomes fit to empirical data at a predefined level of accuracy. This method has the advantage of assessing both model bias and parameter uncertainty and resulting in a quantitative measure of the degree of validity that penalizes models predicting the mean of an outcome correctly but with overly wide credible intervals. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  17. Validation and comparison of dispersion models of RTARC DSS

    International Nuclear Information System (INIS)

    Duran, J.; Pospisil, M.

    2004-01-01

    RTARC DSS (Real Time Accident Release Consequences - Decision Support System) is a computer code developed at the VUJE Trnava, Inc. (Stubna, M. et al, 1993). The code calculations include atmospheric transport and diffusion, dose assessment, evaluation and displaying of the affected zones, evaluation of the early health effects, concentration and dose rate time dependence in the selected sites etc. The simulation of the protective measures (sheltering, iodine administration) is involved. The aim of this paper is to present the process of validation of the RTARC dispersion models. RTARC includes models for calculations of release for very short (Method Monte Carlo - MEMOC), short (Gaussian Straight-Line Model) and long distances (Puff Trajectory Model - PTM). Validation of the code RTARC was performed using the results of comparisons and experiments summarized in the Table 1.: 1. Experiments and comparisons in the process of validation of the system RTARC - experiments or comparison - distance - model. Wind tunnel experiments (Universitaet der Bundeswehr, Muenchen) - Area of NPP - Method Monte Carlo. INEL (Idaho National Engineering Laboratory) - short/medium - Gaussian model and multi tracer atmospheric experiment - distances - PTM. Model Validation Kit - short distances - Gaussian model. STEP II.b 'Realistic Case Studies' - long distances - PTM. ENSEMBLE comparison - long distances - PTM (orig.)

  18. Alignment independent 3D-QSAR, quantum calculations and molecular docking of Mer specific tyrosine kinase inhibitors as anticancer drugs

    Directory of Open Access Journals (Sweden)

    Fereshteh Shiri

    2016-03-01

    Full Text Available Mer receptor tyrosine kinase is a promising novel cancer therapeutic target in many human cancers, because abnormal activation of Mer has been implicated in survival signaling and chemoresistance. 3D-QSAR analyses based on alignment independent descriptors were performed on a series of 81 Mer specific tyrosine kinase inhibitors. The fractional factorial design (FFD and the enhanced replacement method (ERM were applied and tested as variable selection algorithms for the selection of optimal subsets of molecular descriptors from a much greater pool of such regression variables. The data set was split into 65 molecules as the training set and 16 compounds as the test set. All descriptors were generated by using the GRid INdependent descriptors (GRIND approach. After variable selection, GRIND were correlated with activity values (pIC50 by PLS regression. Of the two applied variable selection methods, ERM had a noticeable improvement on the statistical parameters of PLS model, and yielded a q2 value of 0.77, an rpred2 of 0.94, and a low RMSEP value of 0.25. The GRIND information contents influencing the affinity on Mer specific tyrosine kinase were also confirmed by docking studies. In a quantum calculation study, the energy difference between HOMO and LUMO (gap implied the high interaction of the most active molecule in the active site of the protein. In addition, the molecular electrostatic potential energy at DFT level confirmed results obtained from the molecular docking. The identified key features obtained from the molecular modeling, enabled us to design novel kinase inhibitors.

  19. Alignment independent 3D-QSAR, quantum calculations and molecular docking of Mer specific tyrosine kinase inhibitors as anticancer drugs.

    Science.gov (United States)

    Shiri, Fereshteh; Pirhadi, Somayeh; Ghasemi, Jahan B

    2016-03-01

    Mer receptor tyrosine kinase is a promising novel cancer therapeutic target in many human cancers, because abnormal activation of Mer has been implicated in survival signaling and chemoresistance. 3D-QSAR analyses based on alignment independent descriptors were performed on a series of 81 Mer specific tyrosine kinase inhibitors. The fractional factorial design (FFD) and the enhanced replacement method (ERM) were applied and tested as variable selection algorithms for the selection of optimal subsets of molecular descriptors from a much greater pool of such regression variables. The data set was split into 65 molecules as the training set and 16 compounds as the test set. All descriptors were generated by using the GRid INdependent descriptors (GRIND) approach. After variable selection, GRIND were correlated with activity values (pIC50) by PLS regression. Of the two applied variable selection methods, ERM had a noticeable improvement on the statistical parameters of PLS model, and yielded a q (2) value of 0.77, an [Formula: see text] of 0.94, and a low RMSEP value of 0.25. The GRIND information contents influencing the affinity on Mer specific tyrosine kinase were also confirmed by docking studies. In a quantum calculation study, the energy difference between HOMO and LUMO (gap) implied the high interaction of the most active molecule in the active site of the protein. In addition, the molecular electrostatic potential energy at DFT level confirmed results obtained from the molecular docking. The identified key features obtained from the molecular modeling, enabled us to design novel kinase inhibitors.

  20. Investigation into adamantane-based M2 inhibitors with FB-QSAR.

    Science.gov (United States)

    Wei, Hang; Wang, Cheng-Hua; Du, Qi-Shi; Meng, Jianzong; Chou, Kuo-Chen

    2009-07-01

    Because of their high resistance rate to the existing drugs, influenza A viruses have become a threat to human beings. It is known that the replication of influenza A viruses needs a pH-gated proton channel, the so-called M2 channel. Therefore, to develop effective drugs against influenza A, the most logic strategy is to inhibit the M2 channel. Recently, the atomic structure of the M2 channel was determined by NMR spectroscopy (Schnell, J.R. and Chou, J.J., Nature, 2008, 451, 591-595). The high-resolution NMR structure has provided a solid basis for structure-based drug design approaches. In this study, a benchmark dataset has been constructed that contains 34 newly-developed adamantane-based M2 inhibitors and covers considerable structural diversities and wide range of bioactivities. Based on these compounds, an in-depth analysis was performed with the newly developed fragment-based quantitative structure-activity relationship (FB-QSAR) algorithm. The results thus obtained provide useful insights for dealing with the drug-resistant problem and designing effective adamantane-based antiflu drugs.

  1. Developing a model for validation and prediction of bank customer ...

    African Journals Online (AJOL)

    Credit risk is the most important risk of banks. The main approaches of the bank to reduce credit risk are correct validation using the final status and the validation model parameters. High fuel of bank reserves and lost or outstanding facilities of banks indicate the lack of appropriate validation models in the banking network.

  2. Validation of heat transfer models for gap cooling

    International Nuclear Information System (INIS)

    Okano, Yukimitsu; Nagae, Takashi; Murase, Michio

    2004-01-01

    For severe accident assessment of a light water reactor, models of heat transfer in a narrow annular gap between overheated core debris and a reactor pressure vessel are important for evaluating vessel integrity and accident management. The authors developed and improved the models of heat transfer. However, validation was not sufficient for applicability of the gap heat flux correlation to the debris cooling in the vessel lower head and applicability of the local boiling heat flux correlations to the high-pressure conditions. Therefore, in this paper, we evaluated the validity of the heat transfer models and correlations by analyses for ALPHA and LAVA experiments where molten aluminum oxide (Al 2 O 3 ) at about 2700 K was poured into the high pressure water pool in a small-scale simulated vessel lower head. In the heating process of the vessel wall, the calculated heating rate and peak temperature agreed well with the measured values, and the validity of the heat transfer models and gap heat flux correlation was confirmed. In the cooling process of the vessel wall, the calculated cooling rate was compared with the measured value, and the validity of the nucleate boiling heat flux correlation was confirmed. The peak temperatures of the vessel wall in ALPHA and LAVA experiments were lower than the temperature at the minimum heat flux point between film boiling and transition boiling, so the minimum heat flux correlation could not be validated. (author)

  3. Solar Sail Models and Test Measurements Correspondence for Validation Requirements Definition

    Science.gov (United States)

    Ewing, Anthony; Adams, Charles

    2004-01-01

    Solar sails are being developed as a mission-enabling technology in support of future NASA science missions. Current efforts have advanced solar sail technology sufficient to justify a flight validation program. A primary objective of this activity is to test and validate solar sail models that are currently under development so that they may be used with confidence in future science mission development (e.g., scalable to larger sails). Both system and model validation requirements must be defined early in the program to guide design cycles and to ensure that relevant and sufficient test data will be obtained to conduct model validation to the level required. A process of model identification, model input/output documentation, model sensitivity analyses, and test measurement correspondence is required so that decisions can be made to satisfy validation requirements within program constraints.

  4. Experimental validation of a thermodynamic boiler model under steady state and dynamic conditions

    International Nuclear Information System (INIS)

    Carlon, Elisa; Verma, Vijay Kumar; Schwarz, Markus; Golicza, Laszlo; Prada, Alessandro; Baratieri, Marco; Haslinger, Walter; Schmidl, Christoph

    2015-01-01

    Highlights: • Laboratory tests on two commercially available pellet boilers. • Steady state and a dynamic load cycle tests. • Pellet boiler model calibration based on data registered in stationary operation. • Boiler model validation with reference to both stationary and dynamic operation. • Validated model suitable for coupled simulation of building and heating system. - Abstract: Nowadays dynamic building simulation is an essential tool for the design of heating systems for residential buildings. The simulation of buildings heated by biomass systems, first of all needs detailed boiler models, capable of simulating the boiler both as a stand-alone appliance and as a system component. This paper presents the calibration and validation of a boiler model by means of laboratory tests. The chosen model, i.e. TRNSYS “Type 869”, has been validated for two commercially available pellet boilers of 6 and 12 kW nominal capacities. Two test methods have been applied: the first is a steady state test at nominal load and the second is a load cycle test including stationary operation at different loads as well as transient operation. The load cycle test is representative of the boiler operation in the field and characterises the boiler’s stationary and dynamic behaviour. The model had been calibrated based on laboratory data registered during stationary operation at different loads and afterwards it was validated by simulating both the stationary and the dynamic tests. Selected parameters for the validation were the heat transfer rates to water and the water temperature profiles inside the boiler and at the boiler outlet. Modelling results showed better agreement with experimental data during stationary operation rather than during dynamic operation. Heat transfer rates to water were predicted with a maximum deviation of 10% during the stationary operation, and a maximum deviation of 30% during the dynamic load cycle. However, for both operational regimes the

  5. Requirements Validation: Execution of UML Models with CPN Tools

    DEFF Research Database (Denmark)

    Machado, Ricardo J.; Lassen, Kristian Bisgaard; Oliveira, Sérgio

    2007-01-01

    Requirements validation is a critical task in any engineering project. The confrontation of stakeholders with static requirements models is not enough, since stakeholders with non-computer science education are not able to discover all the inter-dependencies between the elicited requirements. Eve...... requirements, where the system to be built must explicitly support the interaction between people within a pervasive cooperative workflow execution. A case study from a real project is used to illustrate the proposed approach....

  6. An approach to model validation and model-based prediction -- polyurethane foam case study.

    Energy Technology Data Exchange (ETDEWEB)

    Dowding, Kevin J.; Rutherford, Brian Milne

    2003-07-01

    Enhanced software methodology and improved computing hardware have advanced the state of simulation technology to a point where large physics-based codes can be a major contributor in many systems analyses. This shift toward the use of computational methods has brought with it new research challenges in a number of areas including characterization of uncertainty, model validation, and the analysis of computer output. It is these challenges that have motivated the work described in this report. Approaches to and methods for model validation and (model-based) prediction have been developed recently in the engineering, mathematics and statistical literatures. In this report we have provided a fairly detailed account of one approach to model validation and prediction applied to an analysis investigating thermal decomposition of polyurethane foam. A model simulates the evolution of the foam in a high temperature environment as it transforms from a solid to a gas phase. The available modeling and experimental results serve as data for a case study focusing our model validation and prediction developmental efforts on this specific thermal application. We discuss several elements of the ''philosophy'' behind the validation and prediction approach: (1) We view the validation process as an activity applying to the use of a specific computational model for a specific application. We do acknowledge, however, that an important part of the overall development of a computational simulation initiative is the feedback provided to model developers and analysts associated with the application. (2) We utilize information obtained for the calibration of model parameters to estimate the parameters and quantify uncertainty in the estimates. We rely, however, on validation data (or data from similar analyses) to measure the variability that contributes to the uncertainty in predictions for specific systems or units (unit-to-unit variability). (3) We perform statistical

  7. In silico models for predicting ready biodegradability under REACH: a comparative study.

    Science.gov (United States)

    Pizzo, Fabiola; Lombardo, Anna; Manganaro, Alberto; Benfenati, Emilio

    2013-10-01

    REACH (Registration Evaluation Authorization and restriction of Chemicals) legislation is a new European law which aims to raise the human protection level and environmental health. Under REACH all chemicals manufactured or imported for more than one ton per year must be evaluated for their ready biodegradability. Ready biodegradability is also used as a screening test for persistent, bioaccumulative and toxic (PBT) substances. REACH encourages the use of non-testing methods such as QSAR (quantitative structure-activity relationship) models in order to save money and time and to reduce the number of animals used for scientific purposes. Some QSAR models are available for predicting ready biodegradability. We used a dataset of 722 compounds to test four models: VEGA, TOPKAT, BIOWIN 5 and 6 and START and compared their performance on the basis of the following parameters: accuracy, sensitivity, specificity and Matthew's correlation coefficient (MCC). Performance was analyzed from different points of view. The first calculation was done on the whole dataset and VEGA and TOPKAT gave the best accuracy (88% and 87% respectively). Then we considered the compounds inside and outside the training set: BIOWIN 6 and 5 gave the best results for accuracy (81%) outside training set. Another analysis examined the applicability domain (AD). VEGA had the highest value for compounds inside the AD for all the parameters taken into account. Finally, compounds outside the training set and in the AD of the models were considered to assess predictive ability. VEGA gave the best accuracy results (99%) for this group of chemicals. Generally, START model gave poor results. Since BIOWIN, TOPKAT and VEGA models performed well, they may be used to predict ready biodegradability. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Experimental Validation of Flow Force Models for Fast Switching Valves

    DEFF Research Database (Denmark)

    Bender, Niels Christian; Pedersen, Henrik Clemmensen; Nørgård, Christian

    2017-01-01

    This paper comprises a detailed study of the forces acting on a Fast Switching Valve (FSV) plunger. The objective is to investigate to what extend different models are valid to be used for design purposes. These models depend on the geometry of the moving plunger and the properties of the surroun......This paper comprises a detailed study of the forces acting on a Fast Switching Valve (FSV) plunger. The objective is to investigate to what extend different models are valid to be used for design purposes. These models depend on the geometry of the moving plunger and the properties...... to compare and validate different models, where an effort is directed towards capturing the fluid squeeze effect just before material on material contact. The test data is compared with simulation data relying solely on analytic formulations. The general dynamics of the plunger is validated...

  9. Development and validation of a mortality risk model for pediatric sepsis

    Science.gov (United States)

    Chen, Mengshi; Lu, Xiulan; Hu, Li; Liu, Pingping; Zhao, Wenjiao; Yan, Haipeng; Tang, Liang; Zhu, Yimin; Xiao, Zhenghui; Chen, Lizhang; Tan, Hongzhuan

    2017-01-01

    Abstract Pediatric sepsis is a burdensome public health problem. Assessing the mortality risk of pediatric sepsis patients, offering effective treatment guidance, and improving prognosis to reduce mortality rates, are crucial. We extracted data derived from electronic medical records of pediatric sepsis patients that were collected during the first 24 hours after admission to the pediatric intensive care unit (PICU) of the Hunan Children's hospital from January 2012 to June 2014. A total of 788 children were randomly divided into a training (592, 75%) and validation group (196, 25%). The risk factors for mortality among these patients were identified by conducting multivariate logistic regression in the training group. Based on the established logistic regression equation, the logit probabilities for all patients (in both groups) were calculated to verify the model's internal and external validities. According to the training group, 6 variables (brain natriuretic peptide, albumin, total bilirubin, D-dimer, lactate levels, and mechanical ventilation in 24 hours) were included in the final logistic regression model. The areas under the curves of the model were 0.854 (0.826, 0.881) and 0.844 (0.816, 0.873) in the training and validation groups, respectively. The Mortality Risk Model for Pediatric Sepsis we established in this study showed acceptable accuracy to predict the mortality risk in pediatric sepsis patients. PMID:28514310

  10. Statistical validation of normal tissue complication probability models

    NARCIS (Netherlands)

    Xu, Cheng-Jian; van der Schaaf, Arjen; van t Veld, Aart; Langendijk, Johannes A.; Schilstra, Cornelis

    2012-01-01

    PURPOSE: To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. METHODS AND MATERIALS: A penalized regression method, LASSO (least absolute shrinkage

  11. Calibration and validation of earthquake catastrophe models. Case study: Impact Forecasting Earthquake Model for Algeria

    Science.gov (United States)

    Trendafiloski, G.; Gaspa Rebull, O.; Ewing, C.; Podlaha, A.; Magee, B.

    2012-04-01

    Calibration and validation are crucial steps in the production of the catastrophe models for the insurance industry in order to assure the model's reliability and to quantify its uncertainty. Calibration is needed in all components of model development including hazard and vulnerability. Validation is required to ensure that the losses calculated by the model match those observed in past events and which could happen in future. Impact Forecasting, the catastrophe modelling development centre of excellence within Aon Benfield, has recently launched its earthquake model for Algeria as a part of the earthquake model for the Maghreb region. The earthquake model went through a detailed calibration process including: (1) the seismic intensity attenuation model by use of macroseismic observations and maps from past earthquakes in Algeria; (2) calculation of the country-specific vulnerability modifiers by use of past damage observations in the country. The use of Benouar, 1994 ground motion prediction relationship was proven as the most appropriate for our model. Calculation of the regional vulnerability modifiers for the country led to 10% to 40% larger vulnerability indexes for different building types compared to average European indexes. The country specific damage models also included aggregate damage models for residential, commercial and industrial properties considering the description of the buildings stock given by World Housing Encyclopaedia and the local rebuilding cost factors equal to 10% for damage grade 1, 20% for damage grade 2, 35% for damage grade 3, 75% for damage grade 4 and 100% for damage grade 5. The damage grades comply with the European Macroseismic Scale (EMS-1998). The model was validated by use of "as-if" historical scenario simulations of three past earthquake events in Algeria M6.8 2003 Boumerdes, M7.3 1980 El-Asnam and M7.3 1856 Djidjelli earthquake. The calculated return periods of the losses for client market portfolio align with the

  12. Comparative performance of high-fidelity training models for flexible ureteroscopy: Are all models effective?

    Directory of Open Access Journals (Sweden)

    Shashikant Mishra

    2011-01-01

    Full Text Available Objective: We performed a comparative study of high-fidelity training models for flexible ureteroscopy (URS. Our objective was to determine whether high-fidelity non-virtual reality (VR models are as effective as the VR model in teaching flexible URS skills. Materials and Methods: Twenty-one trained urologists without clinical experience of flexible URS underwent dry lab simulation practice. After a warm-up period of 2 h, tasks were performed on a high-fidelity non-VR (Uro-scopic Trainer TM ; Endo-Urologie-Modell TM and a high-fidelity VR model (URO Mentor TM . The participants were divided equally into three batches with rotation on each of the three stations for 30 min. Performance of the trainees was evaluated by an expert ureteroscopist using pass rating and global rating score (GRS. The participants rated a face validity questionnaire at the end of each session. Results: The GRS improved statistically at evaluation performed after second rotation (P<0.001 for batches 1, 2 and 3. Pass ratings also improved significantly for all training models when the third and first rotations were compared (P<0.05. The batch that was trained on the VR-based model had more improvement on pass ratings on second rotation but could not achieve statistical significance. Most of the realistic domains were higher for a VR model as compared with the non-VR model, except the realism of the flexible endoscope. Conclusions: All the models used for training flexible URS were effective in increasing the GRS and pass ratings irrespective of the VR status.

  13. An analytic model for gate-all-around silicon nanowire tunneling field effect transistors

    International Nuclear Information System (INIS)

    Liu Ying; He Jin; Chan Mansun; Ye Yun; Zhao Wei; Wu Wen; Deng Wan-Ling; Wang Wen-Ping; Du Cai-Xia

    2014-01-01

    An analytical model of gate-all-around (GAA) silicon nanowire tunneling field effect transistors (NW-TFETs) is developted based on the surface potential solutions in the channel direction and considering the band to band tunneling (BTBT) efficiency. The three-dimensional Poisson equation is solved to obtain the surface potential distributions in the partition regions along the channel direction for the NW-TFET, and a tunneling current model using Kane's expression is developed. The validity of the developed model is shown by the good agreement between the model predictions and the TCAD simulation results. (condensed matter: electronic structure, electrical, magnetic, and optical properties)

  14. Validation techniques of agent based modelling for geospatial simulations

    Directory of Open Access Journals (Sweden)

    M. Darvishi

    2014-10-01

    Full Text Available One of the most interesting aspects of modelling and simulation study is to describe the real world phenomena that have specific properties; especially those that are in large scales and have dynamic and complex behaviours. Studying these phenomena in the laboratory is costly and in most cases it is impossible. Therefore, Miniaturization of world phenomena in the framework of a model in order to simulate the real phenomena is a reasonable and scientific approach to understand the world. Agent-based modelling and simulation (ABMS is a new modelling method comprising of multiple interacting agent. They have been used in the different areas; for instance, geographic information system (GIS, biology, economics, social science and computer science. The emergence of ABM toolkits in GIS software libraries (e.g. ESRI’s ArcGIS, OpenMap, GeoTools, etc for geospatial modelling is an indication of the growing interest of users to use of special capabilities of ABMS. Since ABMS is inherently similar to human cognition, therefore it could be built easily and applicable to wide range applications than a traditional simulation. But a key challenge about ABMS is difficulty in their validation and verification. Because of frequent emergence patterns, strong dynamics in the system and the complex nature of ABMS, it is hard to validate and verify ABMS by conventional validation methods. Therefore, attempt to find appropriate validation techniques for ABM seems to be necessary. In this paper, after reviewing on Principles and Concepts of ABM for and its applications, the validation techniques and challenges of ABM validation are discussed.

  15. Validation techniques of agent based modelling for geospatial simulations

    Science.gov (United States)

    Darvishi, M.; Ahmadi, G.

    2014-10-01

    One of the most interesting aspects of modelling and simulation study is to describe the real world phenomena that have specific properties; especially those that are in large scales and have dynamic and complex behaviours. Studying these phenomena in the laboratory is costly and in most cases it is impossible. Therefore, Miniaturization of world phenomena in the framework of a model in order to simulate the real phenomena is a reasonable and scientific approach to understand the world. Agent-based modelling and simulation (ABMS) is a new modelling method comprising of multiple interacting agent. They have been used in the different areas; for instance, geographic information system (GIS), biology, economics, social science and computer science. The emergence of ABM toolkits in GIS software libraries (e.g. ESRI's ArcGIS, OpenMap, GeoTools, etc) for geospatial modelling is an indication of the growing interest of users to use of special capabilities of ABMS. Since ABMS is inherently similar to human cognition, therefore it could be built easily and applicable to wide range applications than a traditional simulation. But a key challenge about ABMS is difficulty in their validation and verification. Because of frequent emergence patterns, strong dynamics in the system and the complex nature of ABMS, it is hard to validate and verify ABMS by conventional validation methods. Therefore, attempt to find appropriate validation techniques for ABM seems to be necessary. In this paper, after reviewing on Principles and Concepts of ABM for and its applications, the validation techniques and challenges of ABM validation are discussed.

  16. VALIDATION OF SIMULATION MODELS FOR DIFFERENTLY DESIGNED HEAT-PIPE EVACUATED TUBULAR COLLECTORS

    DEFF Research Database (Denmark)

    Fan, Jianhua; Dragsted, Janne; Furbo, Simon

    2007-01-01

    Differently designed heat-pipe evacuated tubular collectors have been investigated theoretically and experimentally. The theoretical work has included development of two TRNSYS [1] simulation models for heat-pipe evacuated tubular collectors utilizing solar radiation from all directions. One model...... coating on both sides. The input to the models is thus not a simple collector efficiency expression but the actual collector geometry. In this study, the TRNSYS models are validated with measurements for four differently designed heat-pipe evacuated tubular collectors. The collectors are produced...

  17. Modeling pedestrian shopping behavior using principles of bounded rationality: model comparison and validation

    Science.gov (United States)

    Zhu, Wei; Timmermans, Harry

    2011-06-01

    Models of geographical choice behavior have been dominantly based on rational choice models, which assume that decision makers are utility-maximizers. Rational choice models may be less appropriate as behavioral models when modeling decisions in complex environments in which decision makers may simplify the decision problem using heuristics. Pedestrian behavior in shopping streets is an example. We therefore propose a modeling framework for pedestrian shopping behavior incorporating principles of bounded rationality. We extend three classical heuristic rules (conjunctive, disjunctive and lexicographic rule) by introducing threshold heterogeneity. The proposed models are implemented using data on pedestrian behavior in Wang Fujing Street, the city center of Beijing, China. The models are estimated and compared with multinomial logit models and mixed logit models. Results show that the heuristic models are the best for all the decisions that are modeled. Validation tests are carried out through multi-agent simulation by comparing simulated spatio-temporal agent behavior with the observed pedestrian behavior. The predictions of heuristic models are slightly better than those of the multinomial logit models.

  18. Forty years of Fanger's model of thermal comfort: comfort for all?

    Science.gov (United States)

    van Hoof, J

    2008-06-01

    The predicted mean vote (PMV) model of thermal comfort, created by Fanger in the late 1960s, is used worldwide to assess thermal comfort. Fanger based his model on college-aged students for use in invariant environmental conditions in air-conditioned buildings in moderate thermal climate zones. Environmental engineering practice calls for a predictive method that is applicable to all types of people in any kind of building in every climate zone. In this publication, existing support and criticism, as well as modifications to the PMV model are discussed in light of the requirements by environmental engineering practice in the 21st century in order to move from a predicted mean vote to comfort for all. Improved prediction of thermal comfort can be achieved through improving the validity of the PMV model, better specification of the model's input parameters, and accounting for outdoor thermal conditions and special groups. The application range of the PMV model can be enlarged, for instance, by using the model to assess the effects of the thermal environment on productivity and behavior, and interactions with other indoor environmental parameters, and the use of information and communication technologies. Even with such modifications to thermal comfort evaluation, thermal comfort for all can only be achieved when occupants have effective control over their own thermal environment. The paper treats the assessment of thermal comfort using the PMV model of Fanger, and deals with the strengths and limitations of this model. Readers are made familiar to some opportunities for use in the 21st-century information society.

  19. BIOMOVS: an international model validation study

    International Nuclear Information System (INIS)

    Haegg, C.; Johansson, G.

    1988-01-01

    BIOMOVS (BIOspheric MOdel Validation Study) is an international study where models used for describing the distribution of radioactive and nonradioactive trace substances in terrestrial and aquatic environments are compared and tested. The main objectives of the study are to compare and test the accuracy of predictions between such models, explain differences in these predictions, recommend priorities for future research concerning the improvement of the accuracy of model predictions and act as a forum for the exchange of ideas, experience and information. (author)

  20. BIOMOVS: An international model validation study

    International Nuclear Information System (INIS)

    Haegg, C.; Johansson, G.

    1987-01-01

    BIOMOVS (BIOspheric MOdel Validation Study) is an international study where models used for describing the distribution of radioactive and nonradioactive trace substances in terrestrial and aquatic environments are compared and tested. The main objectives of the study are to compare and test the accuracy of predictions between such models, explain differences in these predictions, recommend priorities for future research concerning the improvement of the accuracy of model predictions and act as a forum for the exchange of ideas, experience and information. (orig.)