novo structure prediction: Topics by WorldWideScience.org

Sample records for novo structure prediction

Building a better fragment library for de novo protein structure prediction.

Directory of Open Access Journals (Sweden)

Saulo H P de Oliveira

Full Text Available Fragment-based approaches are the current standard for de novo protein structure prediction. These approaches rely on accurate and reliable fragment libraries to generate good structural models. In this work, we describe a novel method for structure fragment library generation and its application in fragment-based de novo protein structure prediction. The importance of correct testing procedures in assessing the quality of fragment libraries is demonstrated. In particular, the exclusion of homologs to the target from the libraries to correctly simulate a de novo protein structure prediction scenario, something which surprisingly is not always done. We demonstrate that fragments presenting different predominant predicted secondary structures should be treated differently during the fragment library generation step and that exhaustive and random search strategies should both be used. This information was used to develop a novel method, Flib. On a validation set of 41 structurally diverse proteins, Flib libraries presents both a higher precision and coverage than two of the state-of-the-art methods, NNMake and HHFrag. Flib also achieves better precision and coverage on the set of 275 protein domains used in the two previous experiments of the the Critical Assessment of Structure Prediction (CASP9 and CASP10. We compared Flib libraries against NNMake libraries in a structure prediction context. Of the 13 cases in which a correct answer was generated, Flib models were more accurate than NNMake models for 10. "Flib is available for download at: http://www.stats.ox.ac.uk/research/proteins/resources".
Building a Better Fragment Library for De Novo Protein Structure Prediction

Science.gov (United States)

de Oliveira, Saulo H. P.; Shi, Jiye; Deane, Charlotte M.

2015-01-01

Fragment-based approaches are the current standard for de novo protein structure prediction. These approaches rely on accurate and reliable fragment libraries to generate good structural models. In this work, we describe a novel method for structure fragment library generation and its application in fragment-based de novo protein structure prediction. The importance of correct testing procedures in assessing the quality of fragment libraries is demonstrated. In particular, the exclusion of homologs to the target from the libraries to correctly simulate a de novo protein structure prediction scenario, something which surprisingly is not always done. We demonstrate that fragments presenting different predominant predicted secondary structures should be treated differently during the fragment library generation step and that exhaustive and random search strategies should both be used. This information was used to develop a novel method, Flib. On a validation set of 41 structurally diverse proteins, Flib libraries presents both a higher precision and coverage than two of the state-of-the-art methods, NNMake and HHFrag. Flib also achieves better precision and coverage on the set of 275 protein domains used in the two previous experiments of the the Critical Assessment of Structure Prediction (CASP9 and CASP10). We compared Flib libraries against NNMake libraries in a structure prediction context. Of the 13 cases in which a correct answer was generated, Flib models were more accurate than NNMake models for 10. “Flib is available for download at: http://www.stats.ox.ac.uk/research/proteins/resources”. PMID:25901595
Sequential search leads to faster, more efficient fragment-based de novo protein structure prediction.

Science.gov (United States)

de Oliveira, Saulo H P; Law, Eleanor C; Shi, Jiye; Deane, Charlotte M

2018-04-01

Most current de novo structure prediction methods randomly sample protein conformations and thus require large amounts of computational resource. Here, we consider a sequential sampling strategy, building on ideas from recent experimental work which shows that many proteins fold cotranslationally. We have investigated whether a pseudo-greedy search approach, which begins sequentially from one of the termini, can improve the performance and accuracy of de novo protein structure prediction. We observed that our sequential approach converges when fewer than 20 000 decoys have been produced, fewer than commonly expected. Using our software, SAINT2, we also compared the run time and quality of models produced in a sequential fashion against a standard, non-sequential approach. Sequential prediction produces an individual decoy 1.5-2.5 times faster than non-sequential prediction. When considering the quality of the best model, sequential prediction led to a better model being produced for 31 out of 41 soluble protein validation cases and for 18 out of 24 transmembrane protein cases. Correct models (TM-Score > 0.5) were produced for 29 of these cases by the sequential mode and for only 22 by the non-sequential mode. Our comparison reveals that a sequential search strategy can be used to drastically reduce computational time of de novo protein structure prediction and improve accuracy. Data are available for download from: http://opig.stats.ox.ac.uk/resources. SAINT2 is available for download from: https://github.com/sauloho/SAINT2. saulo.deoliveira@dtc.ox.ac.uk. Supplementary data are available at Bioinformatics online.
De novo prediction of human chromosome structures: Epigenetic marking patterns encode genome architecture.

Science.gov (United States)

Di Pierro, Michele; Cheng, Ryan R; Lieberman Aiden, Erez; Wolynes, Peter G; Onuchic, José N

2017-11-14

Inside the cell nucleus, genomes fold into organized structures that are characteristic of cell type. Here, we show that this chromatin architecture can be predicted de novo using epigenetic data derived from chromatin immunoprecipitation-sequencing (ChIP-Seq). We exploit the idea that chromosomes encode a 1D sequence of chromatin structural types. Interactions between these chromatin types determine the 3D structural ensemble of chromosomes through a process similar to phase separation. First, a neural network is used to infer the relation between the epigenetic marks present at a locus, as assayed by ChIP-Seq, and the genomic compartment in which those loci reside, as measured by DNA-DNA proximity ligation (Hi-C). Next, types inferred from this neural network are used as an input to an energy landscape model for chromatin organization [Minimal Chromatin Model (MiChroM)] to generate an ensemble of 3D chromosome conformations at a resolution of 50 kilobases (kb). After training the model, dubbed Maximum Entropy Genomic Annotation from Biomarkers Associated to Structural Ensembles (MEGABASE), on odd-numbered chromosomes, we predict the sequences of chromatin types and the subsequent 3D conformational ensembles for the even chromosomes. We validate these structural ensembles by using ChIP-Seq tracks alone to predict Hi-C maps, as well as distances measured using 3D fluorescence in situ hybridization (FISH) experiments. Both sets of experiments support the hypothesis of phase separation being the driving process behind compartmentalization. These findings strongly suggest that epigenetic marking patterns encode sufficient information to determine the global architecture of chromosomes and that de novo structure prediction for whole genomes may be increasingly possible. Copyright © 2017 the Author(s). Published by PNAS.
Pushing the size limit of de novo structure ensemble prediction guided by sparse SDSL-EPR restraints to 200 residues: The monomeric and homodimeric forms of BAX

Science.gov (United States)

Fischer, Axel W.; Bordignon, Enrica; Bleicken, Stephanie; García-Sáez, Ana J.; Jeschke, Gunnar; Meiler, Jens

2016-01-01

Structure determination remains a challenge for many biologically important proteins. In particular, proteins that adopt multiple conformations often evade crystallization in all biologically relevant states. Although computational de novo protein folding approaches often sample biologically relevant conformations, the selection of the most accurate model for different functional states remains a formidable challenge, in particular, for proteins with more than about 150 residues. Electron paramagnetic resonance (EPR) spectroscopy can obtain limited structural information for proteins in well-defined biological states and thereby assist in selecting biologically relevant conformations. The present study demonstrates that de novo folding methods are able to accurately sample the folds of 192-residue long soluble monomeric Bcl-2-associated X protein (BAX). The tertiary structures of the monomeric and homodimeric forms of BAX were predicted using the primary structure as well as 25 and 11 EPR distance restraints, respectively. The predicted models were subsequently compared to respective NMR/X-ray structures of BAX. EPR restraints improve the protein-size normalized root-mean-square-deviation (RMSD100) of the most accurate models with respect to the NMR/crystal structure from 5.9 Å to 3.9 Å and from 5.7 Å to 3.3 Å, respectively. Additionally, the model discrimination is improved, which is demonstrated by an improvement of the enrichment from 5% to 15% and from 13% to 21%, respectively. PMID:27129417
Recurrence risk in de novo structural chromosomal rearrangements.

Science.gov (United States)

Röthlisberger, Benno; Kotzot, Dieter

2007-08-01

According to the textbook of Gardner and Sutherland [2004], the standard on genetic counseling for chromosome abnormalities, the recurrence risk of de novo structural or combined structural and numeric chromosome rearrangements is less than 0.5-2% and takes into account recurrence by chance, gonadal mosaicism, and somatic-gonadal mosaicism. However, these figures are roughly estimated and neither any systematic study nor exact or evidence-based risk calculations are available. To address this question, an extensive literature search was performed and surprisingly only 29 case reports of recurrence of de novo structural or combined structural and numeric chromosomal rearrangements were found. Thirteen of them were with a trisomy 21 due to an i(21q) replacing one normal chromosome 21. In eight of them low-level mosaicism in one of the parents was found either in fibroblasts or in blood or in both. As a consequence of the low number of cases and theoretical considerations (clinical consequences, mechanisms of formation, etc.), the recurrence risk should be reduced to less than 1% for a de novo i(21q) and to even less than 0.3% for all other de novo structural or combined structural and numeric chromosomal rearrangements. As the latter is lower than the commonly accepted risk of approximately 0.3% for indicating an invasive prenatal diagnosis and as the risk of abortion of a healthy fetus after chorionic villous sampling or amniocentesis is higher than approximately 0.5%, invasive prenatal investigation in most cases is not indicated and should only be performed if explicitly asked by the parents subsequent to appropriate genetic counseling. (c) 2007 Wiley-Liss, Inc.
Predicting survival of de novo metastatic breast cancer in Asian women: systematic review and validation study.

Science.gov (United States)

Miao, Hui; Hartman, Mikael; Bhoo-Pathy, Nirmala; Lee, Soo-Chin; Taib, Nur Aishah; Tan, Ern-Yu; Chan, Patrick; Moons, Karel G M; Wong, Hoong-Seam; Goh, Jeremy; Rahim, Siti Mastura; Yip, Cheng-Har; Verkooijen, Helena M

2014-01-01

In Asia, up to 25% of breast cancer patients present with distant metastases at diagnosis. Given the heterogeneous survival probabilities of de novo metastatic breast cancer, individual outcome prediction is challenging. The aim of the study is to identify existing prognostic models for patients with de novo metastatic breast cancer and validate them in Asia. We performed a systematic review to identify prediction models for metastatic breast cancer. Models were validated in 642 women with de novo metastatic breast cancer registered between 2000 and 2010 in the Singapore Malaysia Hospital Based Breast Cancer Registry. Survival curves for low, intermediate and high-risk groups according to each prognostic score were compared by log-rank test and discrimination of the models was assessed by concordance statistic (C-statistic). We identified 16 prediction models, seven of which were for patients with brain metastases only. Performance status, estrogen receptor status, metastatic site(s) and disease-free interval were the most common predictors. We were able to validate nine prediction models. The capacity of the models to discriminate between poor and good survivors varied from poor to fair with C-statistics ranging from 0.50 (95% CI, 0.48-0.53) to 0.63 (95% CI, 0.60-0.66). The discriminatory performance of existing prediction models for de novo metastatic breast cancer in Asia is modest. Development of an Asian-specific prediction model is needed to improve prognostication and guide decision making.
Predicting survival of de novo metastatic breast cancer in Asian women: systematic review and validation study.

Directory of Open Access Journals (Sweden)

Hui Miao

Full Text Available BACKGROUND: In Asia, up to 25% of breast cancer patients present with distant metastases at diagnosis. Given the heterogeneous survival probabilities of de novo metastatic breast cancer, individual outcome prediction is challenging. The aim of the study is to identify existing prognostic models for patients with de novo metastatic breast cancer and validate them in Asia. MATERIALS AND METHODS: We performed a systematic review to identify prediction models for metastatic breast cancer. Models were validated in 642 women with de novo metastatic breast cancer registered between 2000 and 2010 in the Singapore Malaysia Hospital Based Breast Cancer Registry. Survival curves for low, intermediate and high-risk groups according to each prognostic score were compared by log-rank test and discrimination of the models was assessed by concordance statistic (C-statistic. RESULTS: We identified 16 prediction models, seven of which were for patients with brain metastases only. Performance status, estrogen receptor status, metastatic site(s and disease-free interval were the most common predictors. We were able to validate nine prediction models. The capacity of the models to discriminate between poor and good survivors varied from poor to fair with C-statistics ranging from 0.50 (95% CI, 0.48-0.53 to 0.63 (95% CI, 0.60-0.66. CONCLUSION: The discriminatory performance of existing prediction models for de novo metastatic breast cancer in Asia is modest. Development of an Asian-specific prediction model is needed to improve prognostication and guide decision making.
De Novo Discovery of Structured ncRNA Motifs in Genomic Sequences

DEFF Research Database (Denmark)

Ruzzo, Walter L; Gorodkin, Jan

2014-01-01

De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphas...... on an approach based on the CMfinder CMfinder program as a case study. Applications to genomic screens for novel de novo structured ncRNA ncRNA s, including structured RNA elements in untranslated portions of protein-coding genes, are presented.......De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphasis...
De novo structural modeling and computational sequence analysis ...

African Journals Online (AJOL)

Different bioinformatics tools and machine learning techniques were used for protein structural classification. De novo protein modeling was performed by using I-TASSER server. The final model obtained was accessed by PROCHECK and DFIRE2, which confirmed that the final model is reliable. Until complete biochemical ...
Protein structure prediction using bee colony optimization metaheuristic

DEFF Research Database (Denmark)

Fonseca, Rasmus; Paluszewski, Martin; Winter, Pawel

2010-01-01

of the proteins structure, an energy potential and some optimization algorithm that ¿nds the structure with minimal energy. Bee Colony Optimization (BCO) is a relatively new approach to solving opti- mization problems based on the foraging behaviour of bees. Several variants of BCO have been suggested......Predicting the native structure of proteins is one of the most challenging problems in molecular biology. The goal is to determine the three-dimensional struc- ture from the one-dimensional amino acid sequence. De novo prediction algorithms seek to do this by developing a representation...... our BCO method to generate good solutions to the protein structure prediction problem. The results show that BCO generally ¿nds better solutions than simulated annealing which so far has been the metaheuristic of choice for this problem....
Automated de novo phasing and model building of coiled-coil proteins.

Science.gov (United States)

Rämisch, Sebastian; Lizatović, Robert; André, Ingemar

2015-03-01

Models generated by de novo structure prediction can be very useful starting points for molecular replacement for systems where suitable structural homologues cannot be readily identified. Protein-protein complexes and de novo-designed proteins are examples of systems that can be challenging to phase. In this study, the potential of de novo models of protein complexes for use as starting points for molecular replacement is investigated. The approach is demonstrated using homomeric coiled-coil proteins, which are excellent model systems for oligomeric systems. Despite the stereotypical fold of coiled coils, initial phase estimation can be difficult and many structures have to be solved with experimental phasing. A method was developed for automatic structure determination of homomeric coiled coils from X-ray diffraction data. In a benchmark set of 24 coiled coils, ranging from dimers to pentamers with resolutions down to 2.5 Å, 22 systems were automatically solved, 11 of which had previously been solved by experimental phasing. The generated models contained 71-103% of the residues present in the deposited structures, had the correct sequence and had free R values that deviated on average by 0.01 from those of the respective reference structures. The electron-density maps were of sufficient quality that only minor manual editing was necessary to produce final structures. The method, named CCsolve, combines methods for de novo structure prediction, initial phase estimation and automated model building into one pipeline. CCsolve is robust against errors in the initial models and can readily be modified to make use of alternative crystallographic software. The results demonstrate the feasibility of de novo phasing of protein-protein complexes, an approach that could also be employed for other small systems beyond coiled coils.
De novo protein structure prediction by dynamic fragment assembly and conformational space annealing.

Science.gov (United States)

Lee, Juyong; Lee, Jinhyuk; Sasaki, Takeshi N; Sasai, Masaki; Seok, Chaok; Lee, Jooyoung

2011-08-01

Ab initio protein structure prediction is a challenging problem that requires both an accurate energetic representation of a protein structure and an efficient conformational sampling method for successful protein modeling. In this article, we present an ab initio structure prediction method which combines a recently suggested novel way of fragment assembly, dynamic fragment assembly (DFA) and conformational space annealing (CSA) algorithm. In DFA, model structures are scored by continuous functions constructed based on short- and long-range structural restraint information from a fragment library. Here, DFA is represented by the full-atom model by CHARMM with the addition of the empirical potential of DFIRE. The relative contributions between various energy terms are optimized using linear programming. The conformational sampling was carried out with CSA algorithm, which can find low energy conformations more efficiently than simulated annealing used in the existing DFA study. The newly introduced DFA energy function and CSA sampling algorithm are implemented into CHARMM. Test results on 30 small single-domain proteins and 13 template-free modeling targets of the 8th Critical Assessment of protein Structure Prediction show that the current method provides comparable and complementary prediction results to existing top methods. Copyright © 2011 Wiley-Liss, Inc.
Algorithm for selection of optimized EPR distance restraints for de novo protein structure determination

Science.gov (United States)

Kazmier, Kelli; Alexander, Nathan S.; Meiler, Jens; Mchaourab, Hassane S.

2010-01-01

A hybrid protein structure determination approach combining sparse Electron Paramagnetic Resonance (EPR) distance restraints and Rosetta de novo protein folding has been previously demonstrated to yield high quality models (Alexander et al., 2008). However, widespread application of this methodology to proteins of unknown structures is hindered by the lack of a general strategy to place spin label pairs in the primary sequence. In this work, we report the development of an algorithm that optimally selects spin labeling positions for the purpose of distance measurements by EPR. For the α-helical subdomain of T4 lysozyme (T4L), simulated restraints that maximize sequence separation between the two spin labels while simultaneously ensuring pairwise connectivity of secondary structure elements yielded vastly improved models by Rosetta folding. 50% of all these models have the correct fold compared to only 21% and 8% correctly folded models when randomly placed restraints or no restraints are used, respectively. Moreover, the improvements in model quality require a limited number of optimized restraints, the number of which is determined by the pairwise connectivities of T4L α-helices. The predicted improvement in Rosetta model quality was verified by experimental determination of distances between spin labels pairs selected by the algorithm. Overall, our results reinforce the rationale for the combined use of sparse EPR distance restraints and de novo folding. By alleviating the experimental bottleneck associated with restraint selection, this algorithm sets the stage for extending computational structure determination to larger, traditionally elusive protein topologies of critical structural and biochemical importance. PMID:21074624
Protein Loop Structure Prediction Using Conformational Space Annealing.

Science.gov (United States)

Heo, Seungryong; Lee, Juyong; Joo, Keehyoung; Shin, Hang-Cheol; Lee, Jooyoung

2017-05-22

We have developed a protein loop structure prediction method by combining a new energy function, which we call E PLM (energy for protein loop modeling), with the conformational space annealing (CSA) global optimization algorithm. The energy function includes stereochemistry, dynamic fragment assembly, distance-scaled finite ideal gas reference (DFIRE), and generalized orientation- and distance-dependent terms. For the conformational search of loop structures, we used the CSA algorithm, which has been quite successful in dealing with various hard global optimization problems. We assessed the performance of E PLM with two widely used loop-decoy sets, Jacobson and RAPPER, and compared the results against the DFIRE potential. The accuracy of model selection from a pool of loop decoys as well as de novo loop modeling starting from randomly generated structures was examined separately. For the selection of a nativelike structure from a decoy set, E PLM was more accurate than DFIRE in the case of the Jacobson set and had similar accuracy in the case of the RAPPER set. In terms of sampling more nativelike loop structures, E PLM outperformed E DFIRE for both decoy sets. This new approach equipped with E PLM and CSA can serve as the state-of-the-art de novo loop modeling method.
From structure prediction to genomic screens for novel non-coding RNAs

DEFF Research Database (Denmark)

Gorodkin, Jan; Hofacker, Ivo L.

2011-01-01

Abstract: Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction....... This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early...... upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other....
Use of transient elastography to predict de novo recurrence after radiofrequency ablation for hepatocellular carcinoma.

Science.gov (United States)

Lee, Sang Hoon; Kim, Seung Up; Jang, Jeong Won; Bae, Si Hyun; Lee, Sanghun; Kim, Beom Kyung; Park, Jun Yong; Kim, Do Young; Ahn, Sang Hoon; Han, Kwang-Hyub

2015-01-01

Liver stiffness (LS) measurement using transient elastography can accurately assess the degree of liver fibrosis, which is associated with the risk of the development of hepatocellular carcinoma (HCC). We investigated whether LS values could predict HCC de novo recurrence after radiofrequency ablation (RFA). This retrospective, multicenter study analyzed 111 patients with HCC who underwent RFA and LS measurement using transient elastography between May 2005 and April 2011. All patients were followed until March 2013 to monitor for HCC recurrence. This study included 76 men and 35 women with a mean age of 62.4 years, and the mean LS value was 21.2 kPa. During the follow-up period (median 22.4 months), 47 (42.3%) patients experienced HCC de novo recurrence, and 18 (16.2%) died. Patients with recurrence had significantly more frequent liver cirrhosis, more frequent history of previous treatment for HCC, higher total bilirubin, larger spleen size, larger total tumor size, higher tumor number, higher LS values, and lower platelet counts than those without recurrence (all P13.0 kPa were at significantly greater risk for recurrence after RFA, with a hazard ratio (HR) of 3.115 (95% confidence interval [CI], 1.238-7.842, Pmeasurement is a useful predictor of HCC de novo recurrence and overall survival after RFA.
Spaced Seed Data Structures for De Novo Assembly

Directory of Open Access Journals (Sweden)

Inanç Birol

2015-01-01

Full Text Available De novo assembly of the genome of a species is essential in the absence of a reference genome sequence. Many scalable assembly algorithms use the de Bruijn graph (DBG paradigm to reconstruct genomes, where a table of subsequences of a certain length is derived from the reads, and their overlaps are analyzed to assemble sequences. Despite longer subsequences unlocking longer genomic features for assembly, associated increase in compute resources limits the practicability of DBG over other assembly archetypes already designed for longer reads. Here, we revisit the DBG paradigm to adapt it to the changing sequencing technology landscape and introduce three data structure designs for spaced seeds in the form of paired subsequences. These data structures address memory and run time constraints imposed by longer reads. We observe that when a fixed distance separates seed pairs, it provides increased sequence specificity with increased gap length. Further, we note that Bloom filters would be suitable to implicitly store spaced seeds and be tolerant to sequencing errors. Building on this concept, we describe a data structure for tracking the frequencies of observed spaced seeds. These data structure designs will have applications in genome, transcriptome and metagenome assemblies, and read error correction.
The dual role of fragments in fragment-assembly methods for de novo protein structure prediction

Science.gov (United States)

Handl, Julia; Knowles, Joshua; Vernon, Robert; Baker, David; Lovell, Simon C.

2013-01-01

In fragment-assembly techniques for protein structure prediction, models of protein structure are assembled from fragments of known protein structures. This process is typically guided by a knowledge-based energy function and uses a heuristic optimization method. The fragments play two important roles in this process: they define the set of structural parameters available, and they also assume the role of the main variation operators that are used by the optimiser. Previous analysis has typically focused on the first of these roles. In particular, the relationship between local amino acid sequence and local protein structure has been studied by a range of authors. The correlation between the two has been shown to vary with the window length considered, and the results of these analyses have informed directly the choice of fragment length in state-of-the-art prediction techniques. Here, we focus on the second role of fragments and aim to determine the effect of fragment length from an optimization perspective. We use theoretical analyses to reveal how the size and structure of the search space changes as a function of insertion length. Furthermore, empirical analyses are used to explore additional ways in which the size of the fragment insertion influences the search both in a simulation model and for the fragment-assembly technique, Rosetta. PMID:22095594
De novo protein structure generation from incomplete chemical shift assignments

Energy Technology Data Exchange (ETDEWEB)

Shen Yang [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States); Vernon, Robert; Baker, David [University of Washington, Department of Biochemistry and Howard Hughes Medical Institute (United States); Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)], E-mail: bax@nih.gov

2009-02-15

NMR chemical shifts provide important local structural information for proteins. Consistent structure generation from NMR chemical shift data has recently become feasible for proteins with sizes of up to 130 residues, and such structures are of a quality comparable to those obtained with the standard NMR protocol. This study investigates the influence of the completeness of chemical shift assignments on structures generated from chemical shifts. The Chemical-Shift-Rosetta (CS-Rosetta) protocol was used for de novo protein structure generation with various degrees of completeness of the chemical shift assignment, simulated by omission of entries in the experimental chemical shift data previously used for the initial demonstration of the CS-Rosetta approach. In addition, a new CS-Rosetta protocol is described that improves robustness of the method for proteins with missing or erroneous NMR chemical shift input data. This strategy, which uses traditional Rosetta for pre-filtering of the fragment selection process, is demonstrated for two paramagnetic proteins and also for two proteins with solid-state NMR chemical shift assignments.

Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale

DEFF Research Database (Denmark)

Liu, Siyang; Huang, Shujia; Rao, Junhua

2015-01-01

present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome......) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We...... assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction...
UniNovo: a universal tool for de novo peptide sequencing.

Science.gov (United States)

Jeong, Kyowon; Kim, Sangtae; Pevzner, Pavel A

2013-08-15

Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependences between different ion types, where such dependencies are learned automatically using a modified offset frequency function. The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable with others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides). UniNovo is implemented in JAVA and tested on Windows, Ubuntu and OS X machines. UniNovo is available at http://proteomics.ucsd.edu/Software/UniNovo.html along with the manual.
Application of Generative Autoencoder in De Novo Molecular Design.

Science.gov (United States)

Blaschke, Thomas; Olivecrona, Marcus; Engkvist, Ola; Bajorath, Jürgen; Chen, Hongming

2018-01-01

A major challenge in computational chemistry is the generation of novel molecular structures with desirable pharmacological and physiochemical properties. In this work, we investigate the potential use of autoencoder, a deep learning methodology, for de novo molecular design. Various generative autoencoders were used to map molecule structures into a continuous latent space and vice versa and their performance as structure generator was assessed. Our results show that the latent space preserves chemical similarity principle and thus can be used for the generation of analogue structures. Furthermore, the latent space created by autoencoders were searched systematically to generate novel compounds with predicted activity against dopamine receptor type 2 and compounds similar to known active compounds not included in the trainings set were identified. © 2018 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Foldability of a Natural De Novo Evolved Protein.

Science.gov (United States)

Bungard, Dixie; Copple, Jacob S; Yan, Jing; Chhun, Jimmy J; Kumirov, Vlad K; Foy, Scott G; Masel, Joanna; Wysocki, Vicki H; Cordes, Matthew H J

2017-11-07

The de novo evolution of protein-coding genes from noncoding DNA is emerging as a source of molecular innovation in biology. Studies of random sequence libraries, however, suggest that young de novo proteins will not fold into compact, specific structures typical of native globular proteins. Here we show that Bsc4, a functional, natural de novo protein encoded by a gene that evolved recently from noncoding DNA in the yeast S. cerevisiae, folds to a partially specific three-dimensional structure. Bsc4 forms soluble, compact oligomers with high β sheet content and a hydrophobic core, and undergoes cooperative, reversible denaturation. Bsc4 lacks a specific quaternary state, however, existing instead as a continuous distribution of oligomer sizes, and binds dyes indicative of amyloid oligomers or molten globules. The combination of native-like and non-native-like properties suggests a rudimentary fold that could potentially act as a functional intermediate in the emergence of new folded proteins de novo. Copyright © 2017 Elsevier Ltd. All rights reserved.
From structure prediction to genomic screens for novel non-coding RNAs.

Science.gov (United States)

Gorodkin, Jan; Hofacker, Ivo L

2011-08-01

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is comprised of regular Watson-Crick and GU wobble base pairs. This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other.
Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly

DEFF Research Database (Denmark)

Li, Yingrui; Zheng, Hancheng; Luo, Ruibang

2011-01-01

Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural variation (SV) in an Asian genome and an African genome. Our approach identifies small- and intermediate-size homozygous variants (1-50 kb) including insertions, deletions, inversions and their precise...
Use of transient elastography to predict de novo recurrence after radiofrequency ablation for hepatocellular carcinoma

Directory of Open Access Journals (Sweden)

Lee SH

2015-02-01

Full Text Available Sang Hoon Lee,1 Seung Up Kim,1–3 Jeong Won Jang,4 Si Hyun Bae,4 Sanghun Lee,1,3 Beom Kyung Kim,1–3 Jun Yong Park,1–3 Do Young Kim,1–3 Sang Hoon Ahn,1–3 Kwang–Hyub Han1–31Department of Internal Medicine, 2Institute of Gastroenterology, Yonsei University College of Medicine, 3Liver Cirrhosis Clinical Research Center, 4Department of Internal Medicine, College of Medicine, Catholic University of Korea, Seoul, KoreaBackground/purpose: Liver stiffness (LS measurement using transient elastography can accurately assess the degree of liver fibrosis, which is associated with the risk of the development of hepatocellular carcinoma (HCC. We investigated whether LS values could predict HCC de novo recurrence after radiofrequency ablation (RFA.Methods: This retrospective, multicenter study analyzed 111 patients with HCC who underwent RFA and LS measurement using transient elastography between May 2005 and April 2011. All patients were followed until March 2013 to monitor for HCC recurrence.Results: This study included 76 men and 35 women with a mean age of 62.4 years, and the mean LS value was 21.2 kPa. During the follow-up period (median 22.4 months, 47 (42.3% patients experienced HCC de novo recurrence, and 18 (16.2% died. Patients with recurrence had significantly more frequent liver cirrhosis, more frequent history of previous treatment for HCC, higher total bilirubin, larger spleen size, larger total tumor size, higher tumor number, higher LS values, and lower platelet counts than those without recurrence (all P<0.05. On multivariate analysis, together with previous anti-HCC treatment history, patients with LS values >13.0 kPa were at significantly greater risk for recurrence after RFA, with a hazard ratio (HR of 3.115 (95% confidence interval [CI], 1.238–7.842, P<0.05. Moreover, LS values independently predicted the mortality after RFA, with a HR of 9.834 (95% CI, 1.148–84.211, P<0.05, together with total bilirubin.Conclusions: Our
SV2: accurate structural variation genotyping and de novo mutation detection from whole genomes.

Science.gov (United States)

Antaki, Danny; Brandler, William M; Sebat, Jonathan

2018-05-15

Structural variation (SV) detection from short-read whole genome sequencing is error prone, presenting significant challenges for population or family-based studies of disease. Here, we describe SV2, a machine-learning algorithm for genotyping deletions and duplications from paired-end sequencing data. SV2 can rapidly integrate variant calls from multiple structural variant discovery algorithms into a unified call set with high genotyping accuracy and capability to detect de novo mutations. SV2 is freely available on GitHub (https://github.com/dantaki/SV2). jsebat@ucsd.edu. Supplementary data are available at Bioinformatics online.
Generative Recurrent Networks for De Novo Drug Design.

Science.gov (United States)

Gupta, Anvita; Müller, Alex T; Huisman, Berend J H; Fuchs, Jens A; Schneider, Petra; Schneider, Gisbert

2018-01-01

Generative artificial intelligence models present a fresh approach to chemogenomics and de novo drug design, as they provide researchers with the ability to narrow down their search of the chemical space and focus on regions of interest. We present a method for molecular de novo design that utilizes generative recurrent neural networks (RNN) containing long short-term memory (LSTM) cells. This computational model captured the syntax of molecular representation in terms of SMILES strings with close to perfect accuracy. The learned pattern probabilities can be used for de novo SMILES generation. This molecular design concept eliminates the need for virtual compound library enumeration. By employing transfer learning, we fine-tuned the RNN's predictions for specific molecular targets. This approach enables virtual compound design without requiring secondary or external activity prediction, which could introduce error or unwanted bias. The results obtained advocate this generative RNN-LSTM system for high-impact use cases, such as low-data drug discovery, fragment based molecular design, and hit-to-lead optimization for diverse drug targets. © 2017 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

Directory of Open Access Journals (Sweden)

Adeel Malik

2010-01-01

Full Text Available Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins as well as noncovalently (protein-carbohydrate complexes is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB, (ii Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.
From structure prediction to genomic screens for novel non-coding RNAs.

Directory of Open Access Journals (Sweden)

Jan Gorodkin

2011-08-01

Full Text Available Non-coding RNAs (ncRNAs are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs. A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is comprised of regular Watson-Crick and GU wobble base pairs. This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other.
Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement

Science.gov (United States)

Spindel, J E; Begum, H; Akdemir, D; Collard, B; Redoña, E; Jannink, J-L; McCouch, S

2016-01-01

To address the multiple challenges to food security posed by global climate change, population growth and rising incomes, plant breeders are developing new crop varieties that can enhance both agricultural productivity and environmental sustainability. Current breeding practices, however, are unable to keep pace with demand. Genomic selection (GS) is a new technique that helps accelerate the rate of genetic gain in breeding by using whole-genome data to predict the breeding value of offspring. Here, we describe a new GS model that combines RR-BLUP with markers fit as fixed effects selected from the results of a genome-wide-association study (GWAS) on the RR-BLUP training data. We term this model GS + de novo GWAS. In a breeding population of tropical rice, GS + de novo GWAS outperformed six other models for a variety of traits and in multiple environments. On the basis of these results, we propose an extended, two-part breeding design that can be used to efficiently integrate novel variation into elite breeding populations, thus expanding genetic diversity and enhancing the potential for sustainable productivity gains. PMID:26860200
MRUniNovo: an efficient tool for de novo peptide sequencing utilizing the hadoop distributed computing framework.

Science.gov (United States)

Li, Chuang; Chen, Tao; He, Qiang; Zhu, Yunping; Li, Kenli

2017-03-15

Tandem mass spectrometry-based de novo peptide sequencing is a complex and time-consuming process. The current algorithms for de novo peptide sequencing cannot rapidly and thoroughly process large mass spectrometry datasets. In this paper, we propose MRUniNovo, a novel tool for parallel de novo peptide sequencing. MRUniNovo parallelizes UniNovo based on the Hadoop compute platform. Our experimental results demonstrate that MRUniNovo significantly reduces the computation time of de novo peptide sequencing without sacrificing the correctness and accuracy of the results, and thus can process very large datasets that UniNovo cannot. MRUniNovo is an open source software tool implemented in java. The source code and the parameter settings are available at http://bioinfo.hupo.org.cn/MRUniNovo/index.php. s131020002@hnu.edu.cn ; taochen1019@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
DNApi: A De Novo Adapter Prediction Algorithm for Small RNA Sequencing Data.

Science.gov (United States)

Tsuji, Junko; Weng, Zhiping

2016-01-01

With the rapid accumulation of publicly available small RNA sequencing datasets, third-party meta-analysis across many datasets is becoming increasingly powerful. Although removing the 3´ adapter is an essential step for small RNA sequencing analysis, the adapter sequence information is not always available in the metadata. The information can be also erroneous even when it is available. In this study, we developed DNApi, a lightweight Python software package that predicts the 3´ adapter sequence de novo and provides the user with cleansed small RNA sequences ready for down stream analysis. Tested on 539 publicly available small RNA libraries accompanied with 3´ adapter sequences in their metadata, DNApi shows near-perfect accuracy (98.5%) with fast runtime (~2.85 seconds per library) and efficient memory usage (~43 MB on average). In addition to 3´ adapter prediction, it is also important to classify whether the input small RNA libraries were already processed, i.e. the 3´ adapters were removed. DNApi perfectly judged that given another batch of datasets, 192 publicly available processed libraries were "ready-to-map" small RNA sequence. DNApi is compatible with Python 2 and 3, and is available at https://github.com/jnktsj/DNApi. The 731 small RNA libraries used for DNApi evaluation were from human tissues and were carefully and manually collected. This study also provides readers with the curated datasets that can be integrated into their studies.
Identification of a novel Plasmopara halstedii elicitor protein combining de novo peptide sequencing algorithms and RACE-PCR

Directory of Open Access Journals (Sweden)

Madlung Johannes

2010-05-01

Full Text Available Abstract Background Often high-quality MS/MS spectra of tryptic peptides do not match to any database entry because of only partially sequenced genomes and therefore, protein identification requires de novo peptide sequencing. To achieve protein identification of the economically important but still unsequenced plant pathogenic oomycete Plasmopara halstedii, we first evaluated the performance of three different de novo peptide sequencing algorithms applied to a protein digests of standard proteins using a quadrupole TOF (QStar Pulsar i. Results The performance order of the algorithms was PEAKS online > PepNovo > CompNovo. In summary, PEAKS online correctly predicted 45% of measured peptides for a protein test data set. All three de novo peptide sequencing algorithms were used to identify MS/MS spectra of tryptic peptides of an unknown 57 kDa protein of P. halstedii. We found ten de novo sequenced peptides that showed homology to a Phytophthora infestans protein, a closely related organism of P. halstedii. Employing a second complementary approach, verification of peptide prediction and protein identification was performed by creation of degenerate primers for RACE-PCR and led to an ORF of 1,589 bp for a hypothetical phosphoenolpyruvate carboxykinase. Conclusions Our study demonstrated that identification of proteins within minute amounts of sample material improved significantly by combining sensitive LC-MS methods with different de novo peptide sequencing algorithms. In addition, this is the first study that verified protein prediction from MS data by also employing a second complementary approach, in which RACE-PCR led to identification of a novel elicitor protein in P. halstedii.
Apoprotein Structure and Metal Binding Characterization of a de Novo Designed Peptide, α3DIV, that Sequesters Toxic Heavy Metals.

Science.gov (United States)

Plegaria, Jefferson S; Dzul, Stephen P; Zuiderweg, Erik R P; Stemmler, Timothy L; Pecoraro, Vincent L

2015-05-12

De novo protein design is a biologically relevant approach that provides a novel process in elucidating protein folding and modeling the metal centers of metalloproteins in a completely unrelated or simplified fold. An integral step in de novo protein design is the establishment of a well-folded scaffold with one conformation, which is a fundamental characteristic of many native proteins. Here, we report the NMR solution structure of apo α3DIV at pH 7.0, a de novo designed three-helix bundle peptide containing a triscysteine motif (Cys18, Cys28, and Cys67) that binds toxic heavy metals. The structure comprises 1067 NOE restraints derived from multinuclear multidimensional NOESY, as well as 138 dihedral angles (ψ, φ, and χ1). The backbone and heavy atoms of the 20 lowest energy structures have a root mean square deviation from the mean structure of 0.79 (0.16) Å and 1.31 (0.15) Å, respectively. When compared to the parent structure α3D, the substitution of Leu residues to Cys enhanced the α-helical content of α3DIV while maintaining the same overall topology and fold. In addition, solution studies on the metalated species illustrated metal-induced stability. An increase in the melting temperatures was observed for Hg(II), Pb(II), or Cd(II) bound α3DIV by 18-24 °C compared to its apo counterpart. Further, the extended X-ray absorption fine structure analysis on Hg(II)-α3DIV produced an average Hg(II)-S bond length at 2.36 Å, indicating a trigonal T-shaped coordination environment. Overall, the structure of apo α3DIV reveals an asymmetric distorted triscysteine metal binding site, which offers a model for native metalloregulatory proteins with thiol-rich ligands that function in regulating toxic heavy metals, such as ArsR, CadC, MerR, and PbrR.
De Novo generation of molecular structures using optimization to select graphs on a given lattice

DEFF Research Database (Denmark)

Bywater, R.P.; Poulsen, Thomas Agersten; Røgen, Peter

2004-01-01

A recurrent problem in organic chemistry is the generation of new molecular structures that conform to some predetermined set of structural constraints that are imposed in an endeavor to build certain required properties into the newly generated structure. An example of this is the pharmacophore...... model, used in medicinal chemistry to guide de novo design or selection of suitable structures from compound databases. We propose here a method that efficiently links up a selected number of required atom positions while at the same time directing the emergent molecular skeleton to avoid forbidden...... positions. The linkage process takes place on a lattice whose unit step length and overall geometry is designed to match typical architectures of organic molecules. We use an optimization method to select from the many different graphs possible. The approach is demonstrated in an example where crystal...
Extreme-Scale De Novo Genome Assembly

Energy Technology Data Exchange (ETDEWEB)

Georganas, Evangelos [Intel Corporation, Santa Clara, CA (United States); Hofmeyr, Steven [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Egan, Rob [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Buluc, Aydin [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Oliker, Leonid [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Rokhsar, Daniel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Yelick, Katherine [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.

2017-09-26

De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and the large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.
De novo prediction of structured RNAs from genomic sequences

DEFF Research Database (Denmark)

Gorodkin, Jan; Hofacker, Ivo L.; Þórarinsson, Elfar

2010-01-01

currently available, because evolutionary conservation highlights functionally important regions. Conserved secondary structure, rather than primary sequence, is the hallmark of many functionally important RNAs, because compensatory substitutions in base-paired regions preserve structure. Unfortunately...
Structural prediction in aphasia

Directory of Open Access Journals (Sweden)

Tessa Warren

2015-05-01

Full Text Available There is considerable evidence that young healthy comprehenders predict the structure of upcoming material, and that their processing is facilitated when they encounter material matching those predictions (e.g., Staub & Clifton, 2006; Yoshida, Dickey & Sturt, 2013. However, less is known about structural prediction in aphasia. There is evidence that lexical prediction may be spared in aphasia (Dickey et al., 2014; Love & Webb, 1977; cf. Mack et al, 2013. However, predictive mechanisms supporting facilitated lexical access may not necessarily support structural facilitation. Given that many people with aphasia (PWA exhibit syntactic deficits (e.g. Goodglass, 1993, PWA with such impairments may not engage in structural prediction. However, recent evidence suggests that some PWA may indeed predict upcoming structure (Hanne, Burchert, De Bleser, & Vashishth, 2015. Hanne et al. tracked the eyes of PWA (n=8 with sentence-comprehension deficits while they listened to reversible subject-verb-object (SVO and object-verb-subject (OVS sentences in German, in a sentence-picture matching task. Hanne et al. manipulated case and number marking to disambiguate the sentences’ structure. Gazes to an OVS or SVO picture during the unfolding of a sentence were assumed to indicate prediction of the structure congruent with that picture. According to this measure, the PWA’s structural prediction was impaired compared to controls, but they did successfully predict upcoming structure when morphosyntactic cues were strong and unambiguous. Hanne et al.’s visual-world evidence is suggestive, but their forced-choice sentence-picture matching task places tight constraints on possible structural predictions. Clearer evidence of structural prediction would come from paradigms where the content of upcoming material is not as constrained. The current study used self-paced reading study to examine structural prediction among PWA in less constrained contexts. PWA (n=17 who

DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.

Science.gov (United States)

Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald

2014-02-07

De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .
A Pareto Algorithm for Efficient De Novo Design of Multi-functional Molecules.

Science.gov (United States)

Daeyaert, Frits; Deem, Micheal W

2017-01-01

We have introduced a Pareto sorting algorithm into Synopsis, a de novo design program that generates synthesizable molecules with desirable properties. We give a detailed description of the algorithm and illustrate its working in 2 different de novo design settings: the design of putative dual and selective FGFR and VEGFR inhibitors, and the successful design of organic structure determining agents (OSDAs) for the synthesis of zeolites. We show that the introduction of Pareto sorting not only enables the simultaneous optimization of multiple properties but also greatly improves the performance of the algorithm to generate molecules with hard-to-meet constraints. This in turn allows us to suggest approaches to address the problem of false positive hits in de novo structure based drug design by introducing structural and physicochemical constraints in the designed molecules, and by forcing essential interactions between these molecules and their target receptor. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons.

Science.gov (United States)

Steinbiss, Sascha; Kastens, Sascha; Kurtz, Stefan

2012-11-07

Long terminal repeat (LTR) retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-)families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets), making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining the output of software for predicting LTR
de novo computational enzyme design.

Science.gov (United States)

Zanghellini, Alexandre

2014-10-01

Recent advances in systems and synthetic biology as well as metabolic engineering are poised to transform industrial biotechnology by allowing us to design cell factories for the sustainable production of valuable fuels and chemicals. To deliver on their promises, such cell factories, as much as their brick-and-mortar counterparts, will require appropriate catalysts, especially for classes of reactions that are not known to be catalyzed by enzymes in natural organisms. A recently developed methodology, de novo computational enzyme design can be used to create enzymes catalyzing novel reactions. Here we review the different classes of chemical reactions for which active protein catalysts have been designed as well as the results of detailed biochemical and structural characterization studies. We also discuss how combining de novo computational enzyme design with more traditional protein engineering techniques can alleviate the shortcomings of state-of-the-art computational design techniques and create novel enzymes with catalytic proficiencies on par with natural enzymes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Get phases from arsenic anomalous scattering: de novo SAD phasing of two protein structures crystallized in cacodylate buffer.

Directory of Open Access Journals (Sweden)

Xiang Liu

Full Text Available The crystal structures of two proteins, a putative pyrazinamidase/nicotinamidase from the dental pathogen Streptococcus mutans (SmPncA and the human caspase-6 (Casp6, were solved by de novo arsenic single-wavelength anomalous diffraction (As-SAD phasing method. Arsenic (As, an uncommonly used element in SAD phasing, was covalently introduced into proteins by cacodylic acid, the buffering agent in the crystallization reservoirs. In SmPncA, the only cysteine was bound to dimethylarsinoyl, which is a pentavalent arsenic group (As (V. This arsenic atom and a protein-bound zinc atom both generated anomalous signals. The predominant contribution, however, was from the As anomalous signals, which were sufficient to phase the SmPncA structure alone. In Casp6, four cysteines were found to bind cacodyl, a trivalent arsenic group (As (III, in the presence of the reducing agent, dithiothreitol (DTT, and arsenic atoms were the only anomalous scatterers for SAD phasing. Analyses and discussion of these two As-SAD phasing examples and comparison of As with other traditional heavy atoms that generate anomalous signals, together with a few arsenic-based de novo phasing cases reported previously strongly suggest that As is an ideal anomalous scatterer for SAD phasing in protein crystallography.
Structural Insight into the Core of CAD, the Multifunctional Protein Leading De Novo Pyrimidine Biosynthesis.

Science.gov (United States)

Moreno-Morcillo, María; Grande-García, Araceli; Ruiz-Ramos, Alba; Del Caño-Ochoa, Francisco; Boskovic, Jasminka; Ramón-Maiques, Santiago

2017-06-06

CAD, the multifunctional protein initiating and controlling de novo biosynthesis of pyrimidines in animals, self-assembles into ∼1.5 MDa hexamers. The structures of the dihydroorotase (DHO) and aspartate transcarbamoylase (ATC) domains of human CAD have been previously determined, but we lack information on how these domains associate and interact with the rest of CAD forming a multienzymatic unit. Here, we prove that a construct covering human DHO and ATC oligomerizes as a dimer of trimers and that this arrangement is conserved in CAD-like from fungi, which holds an inactive DHO-like domain. The crystal structures of the ATC trimer and DHO-like dimer from the fungus Chaetomium thermophilum confirm the similarity with the human CAD homologs. These results demonstrate that, despite being inactive, the fungal DHO-like domain has a conserved structural function. We propose a model that sets the DHO and ATC complex as the central element in the architecture of CAD. Copyright © 2017 Elsevier Ltd. All rights reserved.
Linguistic Structure Prediction

CERN Document Server

Smith, Noah A

2011-01-01

A major part of natural language processing now depends on the use of text data to build linguistic analyzers. We consider statistical, computational approaches to modeling linguistic structure. We seek to unify across many approaches and many kinds of linguistic structures. Assuming a basic understanding of natural language processing and/or machine learning, we seek to bridge the gap between the two fields. Approaches to decoding (i.e., carrying out linguistic structure prediction) and supervised and unsupervised learning of models that predict discrete structures as outputs are the focus. W
Pesquisa de novos elementos Pesquisa de novos elementos

Directory of Open Access Journals (Sweden)

Gil Mário de Macedo Grassi

1978-11-01

Full Text Available The present study deals with the discovery of new elements synthesized by man. The introduction discusses in general the theories about nuclear transmutation, which is the method employed in these syntheses. The study shows the importance of the Periodical Table since it is through this table that one can reach a prevision of new elements and its, properties. The discoveries of the transuranic elements, together wich the data of their first preparations are also tabulated The stability of these elements is also discussed, and future speculations are showedNeste trabalho estuda-se, teoricamente, a descoberta de novos elementos sintetizados pelo homem Na introdução apresentamos um apanhado geral sobre as teorias a respeito da transmutação nuclear, que é o método utilizado nestas sínteses. Em seguida, mostramos a importância da Tabela Periódica, pois é através dela que se chega à previsão dos novos elementos e de suas propriedades. As descobertas dos transurânicos, Já realizadas com êxito, juntamente com os dados de suas primeiras preparações são tabelados. A estabilidade destes novos elementos também é discutida, e apresentadas futuras especulações.
De novo pathway-based biomarker identification

DEFF Research Database (Denmark)

Alcaraz, Nicolas; List, Markus; Batra, Richa

2017-01-01

in a large cohort of breast cancer samples from The Cancer Genome Atlas (TCGA) revealed that MGs are considerably more stable than SG models, while also providing valuable insight into the cancer hallmarks that drive them. In addition, when tested on an independent benchmark non-TCGA dataset, MG features......Gene expression profiles have been extensively discussed as an aid to guide the therapy by predicting disease outcome for the patients suffering from complex diseases, such as cancer. However, prediction models built upon single-gene (SG) features show poor stability and performance on independent...... on their molecular subtypes can provide a detailed view of the disease and lead to more personalized therapies. We propose and discuss a novel MG approach based on de novo pathways, which for the first time have been used as features in a multi-class setting to predict cancer subtypes. Comprehensive evaluation...
De novo protein structure determination using sparse NMR data

International Nuclear Information System (INIS)

Bowers, Peter M.; Strauss, Charlie E.M.; Baker, David

2000-01-01

We describe a method for generating moderate to high-resolution protein structures using limited NMR data combined with the ab initio protein structure prediction method Rosetta. Peptide fragments are selected from proteins of known structure based on sequence similarity and consistency with chemical shift and NOE data. Models are built from these fragments by minimizing an energy function that favors hydrophobic burial, strand pairing, and satisfaction of NOE constraints. Models generated using this procedure with ∼1 NOE constraint per residue are in some cases closer to the corresponding X-ray structures than the published NMR solution structures. The method requires only the sparse constraints available during initial stages of NMR structure determination, and thus holds promise for increasing the speed with which protein solution structures can be determined
De novo ORFs in Drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences.

Directory of Open Access Journals (Sweden)

Josephine A Reinhardt

Full Text Available How non-coding DNA gives rise to new protein-coding genes (de novo genes is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs, while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important.
Neural Networks for protein Structure Prediction

DEFF Research Database (Denmark)

Bohr, Henrik

1998-01-01

This is a review about neural network applications in bioinformatics. Especially the applications to protein structure prediction, e.g. prediction of secondary structures, prediction of surface structure, fold class recognition and prediction of the 3-dimensional structure of protein backbones...
LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons

Directory of Open Access Journals (Sweden)

Steinbiss Sascha

2012-11-01

Full Text Available Abstract Background Long terminal repeat (LTR retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets, making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. Results We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. Conclusions LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining
Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

DEFF Research Database (Denmark)

Maretty, Lasse; Jensen, Jacob Malte; Petersen, Bent

2017-01-01

or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high......-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set...
De novo mutations in the genome organizer CTCF cause intellectual disability

DEFF Research Database (Denmark)

Gregor, Anne; Oti, Martin; Kouwenhoven, Evelyn N

2013-01-01

An increasing number of genes involved in chromatin structure and epigenetic regulation has been implicated in a variety of developmental disorders, often including intellectual disability. By trio exome sequencing and subsequent mutational screening we now identified two de novo frameshift...... mutations and one de novo missense mutation in CTCF in individuals with intellectual disability, microcephaly, and growth retardation. Furthermore, an individual with a larger deletion including CTCF was identified. CTCF (CCCTC-binding factor) is one of the most important chromatin organizers in vertebrates...... and is involved in various chromatin regulation processes such as higher order of chromatin organization, enhancer function, and maintenance of three-dimensional chromatin structure. Transcriptome analyses in all three individuals with point mutations revealed deregulation of genes involved in signal transduction...
Prediction of molecular crystal structures

International Nuclear Information System (INIS)

Beyer, Theresa

2001-01-01

The ab initio prediction of molecular crystal structures is a scientific challenge. Reliability of first-principle prediction calculations would show a fundamental understanding of crystallisation. Crystal structure prediction is also of considerable practical importance as different crystalline arrangements of the same molecule in the solid state (polymorphs)are likely to have different physical properties. A method of crystal structure prediction based on lattice energy minimisation has been developed in this work. The choice of the intermolecular potential and of the molecular model is crucial for the results of such studies and both of these criteria have been investigated. An empirical atom-atom repulsion-dispersion potential for carboxylic acids has been derived and applied in a crystal structure prediction study of formic, benzoic and the polymorphic system of tetrolic acid. As many experimental crystal structure determinations at different temperatures are available for the polymorphic system of paracetamol (acetaminophen), the influence of the variations of the molecular model on the crystal structure lattice energy minima, has also been studied. The general problem of prediction methods based on the assumption that the experimental thermodynamically stable polymorph corresponds to the global lattice energy minimum, is that more hypothetical low lattice energy structures are found within a few kJ mol -1 of the global minimum than are likely to be experimentally observed polymorphs. This is illustrated by the results for molecule I, 3-oxabicyclo(3.2.0)hepta-1,4-diene, studied for the first international blindtest for small organic crystal structures organised by the Cambridge Crystallographic Data Centre (CCDC) in May 1999. To reduce the number of predicted polymorphs, additional factors to thermodynamic criteria have to be considered. Therefore the elastic constants and vapour growth morphologies have been calculated for the lowest lattice energy
Prediction of molecular crystal structures

Energy Technology Data Exchange (ETDEWEB)

Beyer, Theresa

2001-07-01

The ab initio prediction of molecular crystal structures is a scientific challenge. Reliability of first-principle prediction calculations would show a fundamental understanding of crystallisation. Crystal structure prediction is also of considerable practical importance as different crystalline arrangements of the same molecule in the solid state (polymorphs)are likely to have different physical properties. A method of crystal structure prediction based on lattice energy minimisation has been developed in this work. The choice of the intermolecular potential and of the molecular model is crucial for the results of such studies and both of these criteria have been investigated. An empirical atom-atom repulsion-dispersion potential for carboxylic acids has been derived and applied in a crystal structure prediction study of formic, benzoic and the polymorphic system of tetrolic acid. As many experimental crystal structure determinations at different temperatures are available for the polymorphic system of paracetamol (acetaminophen), the influence of the variations of the molecular model on the crystal structure lattice energy minima, has also been studied. The general problem of prediction methods based on the assumption that the experimental thermodynamically stable polymorph corresponds to the global lattice energy minimum, is that more hypothetical low lattice energy structures are found within a few kJ mol{sup -1} of the global minimum than are likely to be experimentally observed polymorphs. This is illustrated by the results for molecule I, 3-oxabicyclo(3.2.0)hepta-1,4-diene, studied for the first international blindtest for small organic crystal structures organised by the Cambridge Crystallographic Data Centre (CCDC) in May 1999. To reduce the number of predicted polymorphs, additional factors to thermodynamic criteria have to be considered. Therefore the elastic constants and vapour growth morphologies have been calculated for the lowest lattice energy
de novo'' aneurysms following endovascular procedures

International Nuclear Information System (INIS)

Briganti, F.; Cirillo, S.; Caranci, F.; Esposito, F.; Maiuri, F.

2002-01-01

Two personal cases of ''de novo'' aneurysms of the anterior communicating artery (ACoA) occurring 9 and 4 years, respectively, after endovascular carotid occlusion are described. A review of the 30 reported cases (including our own two) of ''de novo'' aneurysms after occlusion of the major cerebral vessels has shown some features, including a rather long time interval after the endovascular procedure of up to 20-25 years (average 9.6 years), a preferential ACoA (36.3%) and internal carotid artery-posterior communicating artery (ICA-PCoA) (33.3%) location of the ''de novo'' aneurysms, and a 10% rate of multiple aneurysms. These data are compared with those of the group of reported spontaneous ''de novo'' aneurysms after SAH or previous aneurysm clipping. We agree that the frequency of ''de novo'' aneurysms after major-vessel occlusion (two among ten procedures in our series, or 20%) is higher than commonly reported (0 to 11%). For this reason, we suggest that patients who have been submitted to endovascular major-vessel occlusion be followed up for up to 20-25 years after the procedure, using non-invasive imaging studies such as MR angiography and high-resolution CT angiography. On the other hand, periodic digital angiography has a questionable risk-benefit ratio; it may be used when a ''de novo'' aneurysm is detected or suspected on non-invasive studies. The progressive enlargement of the ACoA after carotid occlusion, as described in our case 1, must be considered a radiological finding of risk for ''de novo'' aneurysm formation. (orig.)
An 11bp region with stem formation potential is essential for de novo DNA methylation of the RPS element.

Directory of Open Access Journals (Sweden)

Matthew Gentry

Full Text Available The initiation of DNA methylation in Arabidopsis is controlled by the RNA-directed DNA methylation (RdDM pathway that uses 24nt siRNAs to recruit de novo methyltransferase DRM2 to the target site. We previously described the REPETITIVE PETUNIA SEQUENCE (RPS fragment that acts as a hot spot for de novo methylation, for which it requires the cooperative activity of all three methyltransferases MET1, CMT3 and DRM2, but not the RdDM pathway. RPS contains two identical 11nt elements in inverted orientation, interrupted by a 18nt spacer, which resembles the features of a stemloop structure. The analysis of deletion/substitution derivatives of this region showed that deletion of one 11nt element RPS is sufficient to eliminate de novo methylation of RPS. In addition, deletion of a 10nt region directly adjacent to one of the 11nt elements, significantly reduced de novo methylation. When both 11nt regions were replaced by two 11nt elements with altered DNA sequence but unchanged inverted repeat homology, DNA methylation was not affected, indicating that de novo methylation was not targeted to a specific DNA sequence element. These data suggest that de novo DNA methylation is attracted by a secondary structure to which the two 11nt elements contribute, and that the adjacent 10nt region influences the stability of this structure. This resembles the recognition of structural features by DNA methyltransferases in animals and suggests that similar mechanisms exist in plants.
The limits of de novo DNA motif discovery.

Directory of Open Access Journals (Sweden)

David Simcha

Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery-searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of

Improve accuracy and sensibility in glycan structure prediction by matching glycan isotope abundance

International Nuclear Information System (INIS)

Xu Guang; Liu Xin; Liu Qingyan; Zhou Yanhong; Li Jianjun

2012-01-01

Highlights: ► A glycan isotope pattern recognition strategy for glycomics. ► A new data preprocessing procedure to detect ion peaks in a giving MS spectrum. ► A linear soft margin SVM classification for isotope pattern recognition. - Abstract: Mass Spectrometry (MS) is a powerful technique for the determination of glycan structures and is capable of providing qualitative and quantitative information. Recent development in computational method offers an opportunity to use glycan structure databases and de novo algorithms for extracting valuable information from MS or MS/MS data. However, detecting low-intensity peaks that are buried in noisy data sets is still a challenge and an algorithm for accurate prediction and annotation of glycan structures from MS data is highly desirable. The present study describes a novel algorithm for glycan structure prediction by matching glycan isotope abundance (mGIA), which takes isotope masses, abundances, and spacing into account. We constructed a comprehensive database containing 808 glycan compositions and their corresponding isotope abundance. Unlike most previously reported methods, not only did we take into count the m/z values of the peaks but also their corresponding logarithmic Euclidean distance of the calculated and detected isotope vectors. Evaluation against a linear classifier, obtained by training mGIA algorithm with datasets of three different human tissue samples from Consortium for Functional Glycomics (CFG) in association with Support Vector Machine (SVM), was proposed to improve the accuracy of automatic glycan structure annotation. In addition, an effective data preprocessing procedure, including baseline subtraction, smoothing, peak centroiding and composition matching for extracting correct isotope profiles from MS data was incorporated. The algorithm was validated by analyzing the mouse kidney MS data from CFG, resulting in the identification of 6 more glycan compositions than the previous annotation
Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

Directory of Open Access Journals (Sweden)

Chen Xie

2012-09-01

Full Text Available Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis, which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.
De novo-based transcriptome profiling of male-sterile and fertile watermelon lines.

Science.gov (United States)

Rhee, Sun-Ju; Kwon, Taehyung; Seo, Minseok; Jang, Yoon Jeong; Sim, Tae Yong; Cho, Seoae; Han, Sang-Wook; Lee, Gung Pyo

2017-01-01

The whole-genome sequence of watermelon (Citrullus lanatus (Thunb.) Matsum. & Nakai), a valuable horticultural crop worldwide, was released in 2013. Here, we compared a de novo-based approach (DBA) to a reference-based approach (RBA) using RNA-seq data, to aid in efforts to improve the annotation of the watermelon reference genome and to obtain biological insight into male-sterility in watermelon. We applied these techniques to available data from two watermelon lines: the male-sterile line DAH3615-MS and the male-fertile line DAH3615. Using DBA, we newly annotated 855 watermelon transcripts, and found gene functional clusters predicted to be related to stimulus responses, nucleic acid binding, transmembrane transport, homeostasis, and Golgi/vesicles. Among the DBA-annotated transcripts, 138 de novo-exclusive differentially-expressed genes (DEDEGs) related to male sterility were detected. Out of 33 randomly selected newly annotated transcripts and DEDEGs, 32 were validated by RT-qPCR. This study demonstrates the usefulness and reliability of the de novo transcriptome assembly in watermelon, and provides new insights for researchers exploring transcriptional blueprints with regard to the male sterility.
RNA-SSPT: RNA Secondary Structure Prediction Tools.

Science.gov (United States)

Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

2013-01-01

The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.
De novo malignancy after pancreas transplantation in Japan.

Science.gov (United States)

Tomimaru, Y; Ito, T; Marubashi, S; Kawamoto, K; Tomokuni, A; Asaoka, T; Wada, H; Eguchi, H; Mori, M; Doki, Y; Nagano, H

2015-04-01

Long-term immunosuppression is associated with an increased risk of cancer. Especially, the immunosuppression in pancreas transplantation is more intensive than that in other organ transplantation because of its strong immunogenicity. Therefore, it suggests that the risk of post-transplant de novo malignancy might increase in pancreas transplantation. However, there have been few studies of de novo malignancy after pancreas transplantation. The aim of this study was to analyze the incidence of de novo malignancy after pancreas transplantation in Japan. Post-transplant patients with de novo malignancy were surveyed and characterized in Japan. Among 107 cases receiving pancreas transplantation in Japan between 2001 and 2010, de novo malignancy developed in 9 cases (8.4%): post-transplant lymphoproliferative disorders in 6 cases, colon cancer in 1 case, renal cancer in 1 case, and brain tumor in 1 case. We clarified the incidence of de novo malignancy after pancreas transplantation in Japan. Copyright © 2015 Elsevier Inc. All rights reserved.
De novo centriole formation in human cells is error-prone and does not require SAS-6 self-assembly.

Science.gov (United States)

Wang, Won-Jing; Acehan, Devrim; Kao, Chien-Han; Jane, Wann-Neng; Uryu, Kunihiro; Tsou, Meng-Fu Bryan

2015-11-26

Vertebrate centrioles normally propagate through duplication, but in the absence of preexisting centrioles, de novo synthesis can occur. Consistently, centriole formation is thought to strictly rely on self-assembly, involving self-oligomerization of the centriolar protein SAS-6. Here, through reconstitution of de novo synthesis in human cells, we surprisingly found that normal looking centrioles capable of duplication and ciliation can arise in the absence of SAS-6 self-oligomerization. Moreover, whereas canonically duplicated centrioles always form correctly, de novo centrioles are prone to structural errors, even in the presence of SAS-6 self-oligomerization. These results indicate that centriole biogenesis does not strictly depend on SAS-6 self-assembly, and may require preexisting centrioles to ensure structural accuracy, fundamentally deviating from the current paradigm.
Predicting RNA Structure Using Mutual Information

DEFF Research Database (Denmark)

Freyhult, E.; Moulton, V.; Gardner, P. P.

2005-01-01

, to display and predict conserved RNA secondary structure (including pseudoknots) from an alignment. Results: We show that MIfold can be used to predict simple pseudoknots, and that the performance can be adjusted to make it either more sensitive or more selective. We also demonstrate that the overall...... package. Conclusion: MIfold provides a useful supplementary tool to programs such as RNA Structure Logo, RNAalifold and COVE, and should be useful for automatically generating structural predictions for databases such as Rfam. Availability: MIfold is freely available from http......Background: With the ever-increasing number of sequenced RNAs and the establishment of new RNA databases, such as the Comparative RNA Web Site and Rfam, there is a growing need for accurately and automatically predicting RNA structures from multiple alignments. Since RNA secondary structure...
Structural design principles for self-assembled coordination polygons and polyhedra.

Science.gov (United States)

Young, Neil J; Hay, Benjamin P

2013-02-18

Strategies for the design of ligands that combine with metal ions to form high-symmetry coordination assemblies are reviewed. Evaluation of crystal structure evidence reveals that prior design approaches, based on the concept of complementary bonding vector angles, fail to predict the majority of known examples. After explaining the reasons for this failure, it is shown how an alternative approach, de novo structure-based design, provides a practical method that predicts a much wider range of component shapes encoded to direct the formation of such assemblies.
Structured RNAs and synteny regions in the pig genome

DEFF Research Database (Denmark)

Anthon, Christian; Tafer, Hakim; Havgaard, Jakob Hull

2014-01-01

annotation. To further enhance the reliability, 571 of the 3,556 structured RNAs were manually curated by methods depending on the RNA class while 1,581 were declared as pseudogenes. We further created a multiple alignment of pig against 20 representative vertebrates, from which RNAz predicted 83,859 de novo...
The Key Drivers behind Novo Nordisk’s Growth in the Diabetes Market in China

Directory of Open Access Journals (Sweden)

Hind Louiza CHITOUR

2013-12-01

Full Text Available To enter the Chinese Pharmaceutical market, “Big Pharma” has adopted different strategies to tackle the challenges specific to the country in terms of size, demographics, specific sales channels and logistics adjustments. While the majority of Global Pharmaceutical players have opted for an aggressive M&A approach to penetrate the Chinese market and gain local insight; the Danish Novo Nordisk has instead chosen a strategy focusing on innovation and developing its R&D structure to capitalize on the local talent pool. To illustrate Novo Nordisk’s growth strategy in the Mainland, we analyzed its competitiveness in the diabetes market by demonstrating the key drivers behind this success. We applied a various set of tools for this research: Novo Nordisk, Dong Bao Pharmaceutical executives’ interviews and personal observations accounting for the primary data, we also reviewed secondary data to perform a PEST analysis in addition to Porter’s competitive advantage model in order to extract the reasons behind Novo Nordisk’s marching success in the Mainland.
NovoPen Echo® insulin delivery device

Directory of Open Access Journals (Sweden)

Hyllested-Winge J

2016-01-01

Full Text Available Jacob Hyllested-Winge,1 Thomas Sparre,2 Line Kynemund Pedersen2 1Novo Nordisk Pharma Ltd, Tokyo, Japan; 2Novo Nordisk A/S, Søborg, Denmark Abstract: The introduction of insulin pen devices has provided easier, well-tolerated, and more convenient treatment regimens for patients with diabetes mellitus. When compared with vial and syringe regimens, insulin pens offer a greater clinical efficacy, improved quality of life, and increased dosing accuracy, particularly at low doses. The portable and discreet nature of pen devices reduces the burden on the patient, facilitates adherence, and subsequently contributes to the improvement in glycemic control. NovoPen Echo® is one of the latest members of the NovoPen® family that has been specifically designed for the pediatric population and is the first to combine half-unit increment (=0.5 U of insulin dosing with a simple memory function. The half-unit increment dosing amendments and accurate injection of 0.5 U of insulin are particularly beneficial for children (and insulin-sensitive adults/elders, who often require small insulin doses. The memory function can be used to record the time and amount of the last dose, reducing the fear of double dosing or missing a dose. The memory function also provides parents with extra confidence and security that their child is taking insulin at the correct doses and times. NovoPen Echo is a lightweight, durable insulin delivery pen; it is available in two different colors, which may help to distinguish between different types of insulin, providing more confidence for both users and caregivers. Studies have demonstrated a high level of patient satisfaction, with 80% of users preferring NovoPen Echo to other pediatric insulin pens. Keywords: NovoPen Echo®, memory function, half-unit increment dosing, adherence, children, adolescents
Improving the accuracy of protein secondary structure prediction using structural alignment

Directory of Open Access Journals (Sweden)

Gallin Warren J

2006-06-01

Full Text Available Abstract Background The accuracy of protein secondary structure prediction has steadily improved over the past 30 years. Now many secondary structure prediction methods routinely achieve an accuracy (Q3 of about 75%. We believe this accuracy could be further improved by including structure (as opposed to sequence database comparisons as part of the prediction process. Indeed, given the large size of the Protein Data Bank (>35,000 sequences, the probability of a newly identified sequence having a structural homologue is actually quite high. Results We have developed a method that performs structure-based sequence alignments as part of the secondary structure prediction process. By mapping the structure of a known homologue (sequence ID >25% onto the query protein's sequence, it is possible to predict at least a portion of that query protein's secondary structure. By integrating this structural alignment approach with conventional (sequence-based secondary structure methods and then combining it with a "jury-of-experts" system to generate a consensus result, it is possible to attain very high prediction accuracy. Using a sequence-unique test set of 1644 proteins from EVA, this new method achieves an average Q3 score of 81.3%. Extensive testing indicates this is approximately 4–5% better than any other method currently available. Assessments using non sequence-unique test sets (typical of those used in proteome annotation or structural genomics indicate that this new method can achieve a Q3 score approaching 88%. Conclusion By using both sequence and structure databases and by exploiting the latest techniques in machine learning it is possible to routinely predict protein secondary structure with an accuracy well above 80%. A program and web server, called PROTEUS, that performs these secondary structure predictions is accessible at http://wishart.biology.ualberta.ca/proteus. For high throughput or batch sequence analyses, the PROTEUS programs
Ensemble-based prediction of RNA secondary structures.

Science.gov (United States)

Aghaeepour, Nima; Hoos, Holger H

2013-04-24

Accurate structure prediction methods play an important role for the understanding of RNA function. Energy-based, pseudoknot-free secondary structure prediction is one of the most widely used and versatile approaches, and improved methods for this task have received much attention over the past five years. Despite the impressive progress that as been achieved in this area, existing evaluations of the prediction accuracy achieved by various algorithms do not provide a comprehensive, statistically sound assessment. Furthermore, while there is increasing evidence that no prediction algorithm consistently outperforms all others, no work has been done to exploit the complementary strengths of multiple approaches. In this work, we present two contributions to the area of RNA secondary structure prediction. Firstly, we use state-of-the-art, resampling-based statistical methods together with a previously published and increasingly widely used dataset of high-quality RNA structures to conduct a comprehensive evaluation of existing RNA secondary structure prediction procedures. The results from this evaluation clarify the performance relationship between ten well-known existing energy-based pseudoknot-free RNA secondary structure prediction methods and clearly demonstrate the progress that has been achieved in recent years. Secondly, we introduce AveRNA, a generic and powerful method for combining a set of existing secondary structure prediction procedures into an ensemble-based method that achieves significantly higher prediction accuracies than obtained from any of its component procedures. Our new, ensemble-based method, AveRNA, improves the state of the art for energy-based, pseudoknot-free RNA secondary structure prediction by exploiting the complementary strengths of multiple existing prediction procedures, as demonstrated using a state-of-the-art statistical resampling approach. In addition, AveRNA allows an intuitive and effective control of the trade-off between
Applications of contact predictions to structural biology

Directory of Open Access Journals (Sweden)

Felix Simkovic

2017-05-01

Full Text Available Evolutionary pressure on residue interactions, intramolecular or intermolecular, that are important for protein structure or function can lead to covariance between the two positions. Recent methodological advances allow much more accurate contact predictions to be derived from this evolutionary covariance signal. The practical application of contact predictions has largely been confined to structural bioinformatics, yet, as this work seeks to demonstrate, the data can be of enormous value to the structural biologist working in X-ray crystallography, cryo-EM or NMR. Integrative structural bioinformatics packages such as Rosetta can already exploit contact predictions in a variety of ways. The contribution of contact predictions begins at construct design, where structural domains may need to be expressed separately and contact predictions can help to predict domain limits. Structure solution by molecular replacement (MR benefits from contact predictions in diverse ways: in difficult cases, more accurate search models can be constructed using ab initio modelling when predictions are available, while intermolecular contact predictions can allow the construction of larger, oligomeric search models. Furthermore, MR using supersecondary motifs or large-scale screens against the PDB can exploit information, such as the parallel or antiparallel nature of any β-strand pairing in the target, that can be inferred from contact predictions. Contact information will be particularly valuable in the determination of lower resolution structures by helping to assign sequence register. In large complexes, contact information may allow the identity of a protein responsible for a certain region of density to be determined and then assist in the orientation of an available model within that density. In NMR, predicted contacts can provide long-range information to extend the upper size limit of the technique in a manner analogous but complementary to experimental
Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

DEFF Research Database (Denmark)

Maretty, Lasse; Jensen, Jacob Malte; Petersen, Bent

2017-01-01

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome......-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set...... or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high...
Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

DEFF Research Database (Denmark)

Maretty, Lasse; Jensen, Jacob Malte; Petersen, Bent

2017-01-01

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome...... or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high......-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set...
De novo triiodothyronine formation from thyrocytes activated by thyroid-stimulating hormone.

Science.gov (United States)

Citterio, Cintia E; Veluswamy, Balaji; Morgan, Sarah J; Galton, Valerie A; Banga, J Paul; Atkins, Stephen; Morishita, Yoshiaki; Neumann, Susanne; Latif, Rauf; Gershengorn, Marvin C; Smith, Terry J; Arvan, Peter

2017-09-15

The thyroid gland secretes primarily tetraiodothyronine (T 4 ), and some triiodothyronine (T 3 ). Under normal physiological circumstances, only one-fifth of circulating T 3 is directly released by the thyroid, but in states of hyperactivation of thyroid-stimulating hormone receptors (TSHRs), patients develop a syndrome of relative T 3 toxicosis. Thyroidal T 4 production results from iodination of thyroglobulin (TG) at residues Tyr 5 and Tyr 130 , whereas thyroidal T 3 production may originate in several different ways. In this study, the data demonstrate that within the carboxyl-terminal portion of mouse TG, T 3 is formed de novo independently of deiodination from T 4 We found that upon iodination in vitro , de novo T 3 formation in TG was decreased in mice lacking TSHRs. Conversely, de novo T 3 that can be formed upon iodination of TG secreted from PCCL3 (rat thyrocyte) cells was augmented from cells previously exposed to increased TSH, a TSHR agonist, a cAMP analog, or a TSHR-stimulating antibody. We present data suggesting that TSH-stimulated TG phosphorylation contributes to enhanced de novo T 3 formation. These effects were reversed within a few days after removal of the hyperstimulating conditions. Indeed, direct exposure of PCCL3 cells to human serum from two patients with Graves' disease, but not control sera, led to secretion of TG with an increased intrinsic ability to form T 3 upon in vitro iodination. Furthermore, TG secreted from human thyrocyte cultures hyperstimulated with TSH also showed an increased intrinsic ability to form T 3 Our data support the hypothesis that TG processing in the secretory pathway of TSHR-hyperstimulated thyrocytes alters the structure of the iodination substrate in a way that enhances de novo T 3 formation, contributing to the relative T 3 toxicosis of Graves' disease.
A prognostic scoring model for survival after locoregional therapy in de novo stage IV breast cancer.

Science.gov (United States)

Kommalapati, Anuhya; Tella, Sri Harsha; Goyal, Gaurav; Ganti, Apar Kishor; Krishnamurthy, Jairam; Tandra, Pavan Kumar

2018-05-02

The role of locoregional treatment (LRT) remains controversial in de novo stage IV breast cancer (BC). We sought to analyze the role of LRT and prognostic factors of overall survival (OS) in de novo stage IV BC patients treated with LRT utilizing the National Cancer Data Base (NCDB). The objective of the current study is to create and internally validate a prognostic scoring model to predict the long-term OS for de novo stage IV BC patients treated with LRT. We included de novo stage IV BC patients reported to NCDB between 2004 and 2015. Patients were divided into LRT and no-LRT subsets. We randomized LRT subset to training and validation cohorts. In the training cohort, a seventeen-point prognostic scoring system was developed based on the hazard ratios calculated using Cox-proportional method. We stratified both training and validation cohorts into two "groups" [group 1 (0-7 points) and group 2 (7-17 points)]. Kaplan-Meier method and log-rank test were used to compare OS between the two groups. Our prognostic score was validated internally by comparing the OS between the respective groups in both the training and validation cohorts. Among 67,978 patients, LRT subset (21,200) had better median OS as compared to that of no-LRT (45 vs. 24 months; p < 0.0001). The group 1 and group 2 in the training cohort showed a significant difference in the 3-year OS (p < 0.0001) (68 vs. 26%). On internal validation, comparable OS was seen between the respective groups in each cohort (p = 0.77). Our prognostic scoring system will help oncologists to predict the prognosis in de novo stage IV BC patients treated with LRT. Although firm treatment-related conclusions cannot be made due to the retrospective nature of the study, LRT appears to be associated with a better OS in specific subgroups.
Modeling fructose-load-induced hepatic de-novo lipogenesis by model simplification

Directory of Open Access Journals (Sweden)

Richard J Allen

2017-03-01

Full Text Available Hepatic de-novo lipogenesis is a metabolic process implemented in the pathogenesis of type 2 diabetes. Clinically, the rate of this process can be ascertained by use of labeled acetate and stimulation by fructose administration. A systems pharmacology model of this process is desirable because it facilitates the description, analysis, and prediction of this experiment. Due to the multiple enzymes involved in de-novo lipogenesis, and the limited data, it is desirable to use single functional expressions to encapsulate the flux between multiple enzymes. To accomplish this we developed a novel simplification technique which uses the available information about the properties of the individual enzymes to bound the parameters of a single governing ‘transfer function’. This method should be applicable to any model with linear chains of enzymes that are well stimulated. We validated this approach with computational simulations and analytical justification in a limiting case. Using this technique we generated a simple model of hepatic de-novo lipogenesis in these experimental conditions that matched prior data. This model can be used to assess pharmacological intervention at specific points on this pathway. We have demonstrated this with prospective simulation of acetyl-CoA carboxylase inhibition. This simplification technique suggests how the constituent properties of an enzymatic chain of reactions gives rise to the sensitivity (to substrate of the pathway as a whole.
Modeling fructose-load-induced hepatic de-novo lipogenesis by model simplification.

Science.gov (United States)

Allen, Richard J; Musante, Cynthia J

2017-01-01

Hepatic de-novo lipogenesis is a metabolic process implemented in the pathogenesis of type 2 diabetes. Clinically, the rate of this process can be ascertained by use of labeled acetate and stimulation by fructose administration. A systems pharmacology model of this process is desirable because it facilitates the description, analysis, and prediction of this experiment. Due to the multiple enzymes involved in de-novo lipogenesis, and the limited data, it is desirable to use single functional expressions to encapsulate the flux between multiple enzymes. To accomplish this we developed a novel simplification technique which uses the available information about the properties of the individual enzymes to bound the parameters of a single governing 'transfer function'. This method should be applicable to any model with linear chains of enzymes that are well stimulated. We validated this approach with computational simulations and analytical justification in a limiting case. Using this technique we generated a simple model of hepatic de-novo lipogenesis in these experimental conditions that matched prior data. This model can be used to assess pharmacological intervention at specific points on this pathway. We have demonstrated this with prospective simulation of acetyl-CoA carboxylase inhibition. This simplification technique suggests how the constituent properties of an enzymatic chain of reactions gives rise to the sensitivity (to substrate) of the pathway as a whole.

De Novo Construction of Redox Active Proteins.

Science.gov (United States)

Moser, C C; Sheehan, M M; Ennist, N M; Kodali, G; Bialas, C; Englander, M T; Discher, B M; Dutton, P L

2016-01-01

Relatively simple principles can be used to plan and construct de novo proteins that bind redox cofactors and participate in a range of electron-transfer reactions analogous to those seen in natural oxidoreductase proteins. These designed redox proteins are called maquettes. Hydrophobic/hydrophilic binary patterning of heptad repeats of amino acids linked together in a single-chain self-assemble into 4-alpha-helix bundles. These bundles form a robust and adaptable frame for uncovering the default properties of protein embedded cofactors independent of the complexities introduced by generations of natural selection and allow us to better understand what factors can be exploited by man or nature to manipulate the physical chemical properties of these cofactors. Anchoring of redox cofactors such as hemes, light active tetrapyrroles, FeS clusters, and flavins by His and Cys residues allow cofactors to be placed at positions in which electron-tunneling rates between cofactors within or between proteins can be predicted in advance. The modularity of heptad repeat designs facilitates the construction of electron-transfer chains and novel combinations of redox cofactors and new redox cofactor assisted functions. Developing de novo designs that can support cofactor incorporation upon expression in a cell is needed to support a synthetic biology advance that integrates with natural bioenergetic pathways. © 2016 Elsevier Inc. All rights reserved.
Characteristics and Prediction of RNA Structure

Directory of Open Access Journals (Sweden)

Hengwu Li

2014-01-01

Full Text Available RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is NP-hard. Most RNAs fold during transcription from DNA into RNA through a hierarchical pathway wherein secondary structures form prior to tertiary structures. Real RNA secondary structures often have local instead of global optimization because of kinetic reasons. The performance of RNA structure prediction may be improved by considering dynamic and hierarchical folding mechanisms. This study is a novel report on RNA folding that accords with the golden mean characteristic based on the statistical analysis of the real RNA secondary structures of all 480 sequences from RNA STRAND, which are validated by NMR or X-ray. The length ratios of domains in these sequences are approximately 0.382L, 0.5L, 0.618L, and L, where L is the sequence length. These points are just the important golden sections of sequence. With this characteristic, an algorithm is designed to predict RNA hierarchical structures and simulate RNA folding by dynamically folding RNA structures according to the above golden section points. The sensitivity and number of predicted pseudoknots of our algorithm are better than those of the Mfold, HotKnots, McQfold, ProbKnot, and Lhw-Zhu algorithms. Experimental results reflect the folding rules of RNA from a new angle that is close to natural folding.
De novo origin of human protein-coding genes.

Directory of Open Access Journals (Sweden)

Dong-Dong Wu

2011-11-01

Full Text Available The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA-seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes.
De Novo Origin of Human Protein-Coding Genes

Science.gov (United States)

Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

2011-01-01

The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831
Combined "de novo" and "ex novo" lipid fermentation in a mix-medium of corncob acid hydrolysate and soybean oil by Trichosporon dermatis.

Science.gov (United States)

Huang, Chao; Luo, Mu-Tan; Chen, Xue-Fang; Qi, Gao-Xiang; Xiong, Lian; Lin, Xiao-Qing; Wang, Can; Li, Hai-Long; Chen, Xin-De

2017-01-01

Microbial oil is one important bio-product for its important function in energy, chemical, and food industry. Finding suitable substrates is one key issue for its industrial application. Both hydrophilic and hydrophobic substrates can be utilized by oleaginous microorganisms with two different bio-pathways (" de novo " lipid fermentation and " ex novo " lipid fermentation). To date, most of the research on lipid fermentation has focused mainly on only one fermentation pathway and little work was carried out on both " de novo " and " ex novo " lipid fermentation simultaneously; thus, the advantages of both lipid fermentation cannot be fulfilled comprehensively. In this study, corncob acid hydrolysate with soybean oil was used as a mix-medium for combined " de novo " and " ex novo " lipid fermentation by oleaginous yeast Trichosporon dermatis . Both hydrophilic and hydrophobic substrates (sugars and soybean oil) in the medium can be utilized simultaneously and efficiently by T. dermatis . Different fermentation modes were compared and the batch mode was the most suitable for the combined fermentation. The influence of soybean oil concentration, inoculum size, and initial pH on the lipid fermentation was evaluated and 20 g/L soybean oil, 5% inoculum size, and initial pH 6.0 were suitable for this bioprocess. By this technology, the lipid composition of extracellular hydrophobic substrate (soybean oil) can be modified. Although adding emulsifier showed little beneficial effect on lipid production, it can modify the intracellular lipid composition of T. dermatis . The present study proves the potential and possibility of combined " de novo " and " ex novo " lipid fermentation. This technology can use hydrophilic and hydrophobic sustainable bio-resources to generate lipid feedstock for the production of biodiesel or other lipid-based chemical compounds and to treat some special wastes such as oil-containing wastewater.
Genes from scratch--the evolutionary fate of de novo genes.

Science.gov (United States)

Schlötterer, Christian

2015-04-01

Although considered an extremely unlikely event, many genes emerge from previously noncoding genomic regions. This review covers the entire life cycle of such de novo genes. Two competing hypotheses about the process of de novo gene birth are discussed as well as the high death rate of de novo genes. Despite the high death rate, some de novo genes are retained and remain functional, even in distantly related species, through their integration into gene networks. Further studies combining gene expression with ribosome profiling in multiple populations across different species will be instrumental for an improved understanding of the evolutionary processes operating on de novo genes. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
De novo transcriptome assembly of Setatria italica variety Taejin

Directory of Open Access Journals (Sweden)

Yeonhwa Jo

2016-06-01

Full Text Available Foxtail millet (Setaria italica belonging to the family Poaceae is an important millet that is widely cultivated in East Asia. Of the cultivated millets, the foxtail millet has the longest history and is one of the main food crops in South India and China. Moreover, foxtail millet is a model plant system for biofuel generation utilizing the C4 photosynthetic pathway. In this study, we carried out de novo transcriptome assembly for the foxtail millet variety Taejin collected from Korea using next-generation sequencing. We obtained a total of 8.676 GB raw data by paired-end sequencing. The raw data in this study can be available in NCBI SRA database with accession number of SRR3406552. The Trinity program was used to de novo assemble 145,332 transcripts. Using the TransDecoder program, we predicted 82,925 putative proteins. BLASTP was performed against the Swiss-Prot protein sequence database to annotate the functions of identified proteins, resulting in 20,555 potentially novel proteins. Taken together, this study provides transcriptome data for the foxtail millet variety Taejin by RNA-Seq.
De-novo discovery of differentially abundant transcription factor binding sites including their positional preference.

Science.gov (United States)

Keilwagen, Jens; Grau, Jan; Paponov, Ivan A; Posch, Stefan; Strickert, Marc; Grosse, Ivo

2011-02-10

Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from micro-array, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. Hence, we make the tool freely available as a component of the open
Algorithms for Protein Structure Prediction

DEFF Research Database (Denmark)

Paluszewski, Martin

-trace. Here we present three different approaches for reconstruction of C-traces from predictable measures. In our first approach [63, 62], the C-trace is positioned on a lattice and a tabu-search algorithm is applied to find minimum energy structures. The energy function is based on half-sphere-exposure (HSE......) is more robust than standard Monte Carlo search. In the second approach for reconstruction of C-traces, an exact branch and bound algorithm has been developed [67, 65]. The model is discrete and makes use of secondary structure predictions, HSE, CN and radius of gyration. We show how to compute good lower...... bounds for partial structures very fast. Using these lower bounds, we are able to find global minimum structures in a huge conformational space in reasonable time. We show that many of these global minimum structures are of good quality compared to the native structure. Our branch and bound algorithm...
Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts.

Science.gov (United States)

Adhikari, Badri; Cheng, Jianlin

2017-08-29

Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because contact-guided protein modeling approach is valuable for predicting the folds of proteins that do not have structural templates, it is necessary for reconstruction studies to focus on hard-to-predict protein structures. Using a data set consisting of 496 structural domains released in recent CASP experiments and a dataset of 150 representative protein structures, in this work, we discuss three techniques to improve the reconstruction accuracy using true contacts - adding secondary structures, increasing contact distance thresholds, and adding non-contacts. We find that reconstruction using secondary structures and contacts can deliver accuracy higher than using full contact maps. Similarly, we demonstrate that non-contacts can improve reconstruction accuracy not only when the used non-contacts are true but also when they are predicted. On the dataset consisting of 150 proteins, we find that by simply using low ranked predicted contacts as non-contacts and adding them as additional restraints, can increase the reconstruction accuracy by 5% when the reconstructed models are evaluated using TM-score. Our findings suggest that secondary structures are invaluable companions of contacts for accurate reconstruction. Confirming some earlier findings, we also find that larger distance thresholds are useful for folding many protein structures which cannot be folded using the standard definition of contacts. Our findings also suggest that for more accurate reconstruction using predicted contacts it is useful to predict contacts at higher distance thresholds (beyond 8 Å) and predict non-contacts.
Critical Features of Fragment Libraries for Protein Structure Prediction.

Science.gov (United States)

Trevizani, Raphael; Custódio, Fábio Lima; Dos Santos, Karina Baptista; Dardenne, Laurent Emmanuel

2017-01-01

The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction.
Protein Structure Prediction by Protein Threading

Science.gov (United States)

Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong

The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.
Associations between Familial Rates of Psychiatric Disorders and De Novo Genetic Mutations in Autism

Directory of Open Access Journals (Sweden)

Kyleen Luhrs

2017-01-01

Full Text Available The purpose of this study was to examine the confluence of genetic and familial risk factors in children with Autism Spectrum Disorder (ASD with distinct de novo genetic events. We hypothesized that gene-disrupting mutations would be associated with reduced rates of familial psychiatric disorders relative to structural mutations. Participants included families of children with ASD in four groups: de novo duplication copy number variations (DUP, n=62, de novo deletion copy number variations (DEL, n=74, de novo likely gene-disrupting mutations (LGDM, n=267, and children without a known genetic etiology (NON, n=2111. Familial rates of psychiatric disorders were calculated from semistructured interviews. Results indicated overall increased rates of psychiatric disorders in DUP families compared to DEL and LGDM families, specific to paternal psychiatric histories, and particularly evident for depressive disorders. Higher rates of depressive disorders in maternal psychiatric histories were observed overall compared to paternal histories and higher rates of anxiety disorders were observed in paternal histories for LGDM families compared to DUP families. These findings support the notion of an additive contribution of genetic etiology and familial factors are associated with ASD risk and highlight critical need for continued work targeting these relationships.
Facilitating RNA structure prediction with microarrays.

Science.gov (United States)

Kierzek, Elzbieta; Kierzek, Ryszard; Turner, Douglas H; Catrina, Irina E

2006-01-17

Determining RNA secondary structure is important for understanding structure-function relationships and identifying potential drug targets. This paper reports the use of microarrays with heptamer 2'-O-methyl oligoribonucleotides to probe the secondary structure of an RNA and thereby improve the prediction of that secondary structure. When experimental constraints from hybridization results are added to a free-energy minimization algorithm, the prediction of the secondary structure of Escherichia coli 5S rRNA improves from 27 to 92% of the known canonical base pairs. Optimization of buffer conditions for hybridization and application of 2'-O-methyl-2-thiouridine to enhance binding and improve discrimination between AU and GU pairs are also described. The results suggest that probing RNA with oligonucleotide microarrays can facilitate determination of secondary structure.
Antibody structural modeling with prediction of immunoglobulin structure (PIGS)

DEFF Research Database (Denmark)

Marcatili, Paolo; Olimpieri, Pier Paolo; Chailyan, Anna

2014-01-01

Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful...... applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (∼10 min...... on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together....
PSPP: a protein structure prediction pipeline for computing clusters.

Directory of Open Access Journals (Sweden)

Michael S Lee

2009-07-01

Full Text Available Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster.The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML formats. So far, the pipeline has been used to study viral and bacterial proteomes.The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform
Crius: A Novel Fragment-Based Algorithm of De Novo Substrate Prediction for Enzymes.

Science.gov (United States)

Yao, Zhiqiang; Jiang, Shuiqin; Zhang, Lujia; Gao, Bei; He, Xiao; Zhang, John Z H; Wei, Dongzhi

2018-05-03

The study of enzyme substrate specificity is vital for developing potential applications of enzymes. However, the routine experimental procedures require lot of resources in the discovery of novel substrates. This article reports an in silico structure-based algorithm called Crius, which predicts substrates for enzyme. The results of this fragment-based algorithm show good agreements between the simulated and experimental substrate specificities, using a lipase from Candida antarctica (CALB), a nitrilase from Cyanobacterium syechocystis sp. PCC6803 (Nit6803), and an aldo-keto reductase from Gluconobacter oxydans (Gox0644). This opens new prospects of developing computer algorithms that can effectively predict substrates for an enzyme. This article is protected by copyright. All rights reserved. © 2018 The Protein Society.
RNAstructure: software for RNA secondary structure prediction and analysis.

Science.gov (United States)

Reuter, Jessica S; Mathews, David H

2010-03-15

To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.
Antibody structural modeling with prediction of immunoglobulin structure (PIGS)

KAUST Repository

Marcatili, Paolo

2014-11-06

© 2014 Nature America, Inc. All rights reserved. Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (~10 min on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together.
Genome-wide patterns and properties of de novo mutations in humans

NARCIS (Netherlands)

Francioli, Laurent C.; Polak, Paz P.; Koren, Amnon; Menelaou, Androniki; Chun, Sung; Renkens, Ivo; van Duijn, Cornelia M.; Swertz, Morris; Wijmenga, Cisca; van Ommen, Gertjan; Slagboom, P. Eline; Boomsma, Dorret I.; Ye, Kai; Guryev, Victor; Arndt, Peter F.; Kloosterman, Wigard P.; de Bakker, Paul I. W.; Sunyaev, Shamil R.

Mutations create variation in the population, fuel evolution and cause genetic diseases. Current knowledge about de novo mutations is incomplete and mostly indirect(1-10). Here we analyze 11,020 de novo mutations from the whole genomes of 250 families. We show that de novo mutations in the offspring

Genome-wide patterns and properties of de novo mutations in humans

NARCIS (Netherlands)

Francioli, L.C.; Polak, P.P.; Koren, A.; Menelaou, A.; Chun, S.; Renkens, I.; van Duijn, C.M.; Swertz, M.A.; Wijmenga, C.; van Ommen, G.J.; Slagboom, P.E.; Boomsma, D.I.; Ye, K.; Guryev, V.; Arndt, P.F.; Kloosterman, W.P.; Bakker, P.I.W.; Sunyaev, S.R.; Dijk, F.; Neerincx, P.B.T.; Pulit, S.L.; Deelen, P.; Elbers, C.C.; Palamara, P.F.; Pe'er, I.; Abdellaoui, A.; van Oven, M.; Vermaat, M.; Li, M.; Laros, J.F.J.; Stoneking, M.; de Knijff, P.; Kayser, M.; Veldink, J.H.; Van den Berg, L.H.; Byelas, H.; den Dunnen, J.T.; Dijkstra, M.; Amin, N.; van der Velde, K.J.; Hottenga, J.J.; van Setten, J.; van Leeuwen, E.M.; Kanterakis, A.; Kattenberg, V.M.; Karssen, L.C.; van Schaik, B.D.C.; Bot, J.; Nijman, I.J.; van Enckevort, D.; Mei, H.; Koval, V.; Estrada, K.; Medina-Gomez, C.; Lameijer, E.W.; Moed, M.H.; Hehir-Kwa, J.Y.; Handsaker, R.E.; McCarroll, S.A.; Vuzman, D.; Sohail, M.; Hormozdiari, F.; Marschall, T.; Schönhuth, A.; Beekman, M.; de Craen, A.J.; Suchiman, H.E.D.; Hofman, A.; Oostra, B.; Isaacs, A.; Rivadeneira, F.; Uitterlinden, A.G.; Willemsen, G.; Platteel, M.; Pitts, S.J.; Potluri, S.; Sundar, P.; Cox, D.R.; Li, Q.; Li, Y.; Du, Y.; Chen, R.; Cao, H.; Li, N.; Cao, S.; Wang, J.; Bovenberg, J.A.; Brandsma, M.

2015-01-01

Mutations create variation in the population, fuel evolution and cause genetic diseases. Current knowledge about de novo mutations is incomplete and mostly indirect. Here we analyze 11,020 de novo mutations from the whole genomes of 250 families. We show that de novo mutations in the offspring of
Alternative normalization methods demonstrate widespread cortical hypometabolism in untreated de novo Parkinson's disease

DEFF Research Database (Denmark)

Berti, Valentina; Polito, C; Borghammer, Per

2012-01-01

, recent studies suggested that conventional data normalization procedures may not always be valid, and demonstrated that alternative normalization strategies better allow detection of low magnitude changes. We hypothesized that these alternative normalization procedures would disclose more widespread...... metabolic alterations in de novo PD. METHODS: [18F]FDG PET scans of 26 untreated de novo PD patients (Hoehn & Yahr stage I-II) and 21 age-matched controls were compared using voxel-based analysis. Normalization was performed using gray matter (GM), white matter (WM) reference regions and Yakushev...... normalization. RESULTS: Compared to GM normalization, WM and Yakushev normalization procedures disclosed much larger cortical regions of relative hypometabolism in the PD group with extensive involvement of frontal and parieto-temporal-occipital cortices, and several subcortical structures. Furthermore...
Clinicopathologic factors associated with de novo metastatic breast cancer.

Science.gov (United States)

Shen, Tiansheng; Siegal, Gene P; Wei, Shi

2016-12-01

While breast cancers with distant metastasis at presentation (de novo metastasis) harbor significantly inferior clinical outcomes, there have been limited studies analyzing the clinicopathologic characteristics in this subset of patients. In this study, we analyzed 6126 breast cancers diagnosed between 1998 and 2013 to identify factors associated with de novo metastatic breast cancer. When compared to patients without metastasis at presentation, race, histologic grade, estrogen/progesterone receptor (ER/PR) and HER2 statuses were significantly associated with de novo metastasis in the entire cohort, whereas age, histologic grade, PR and HER2 status were the significant parameters in the subset of patients with locally advanced breast cancer (Stage IIB/III). The patients with de novo metastatic breast cancer had a significant older mean age and a lower proportion of HER2-positive tumors when compared to those with metastatic recurrence. Further, the HER2-rich subtype demonstrated a drastically higher incidence of de novo metastasis when compared to the luminal and triple-negative breast cancers in the entire cohort [odds ratio (OR)=5.68 and 2.27, respectively] and in the patients with locally advanced disease (OR=4.02 and 2.12, respectively), whereas no significant difference was seen between de novo metastatic cancers and those with metastatic recurrence. Moreover, the luminal and HER2-rich subtypes showed bone-seeking (OR=1.92) and liver-homing (OR=2.99) characteristics, respectively, for the sites of de novo metastasis, while the latter was not observed in those with metastatic recurrence. Our data suggest that an algorithm incorporating clinicopathologic factors, especially histologic grade and receptor profile, remains of significant benefit during decision making in newly diagnosed breast cancer in the pursuit of precision medicine. Copyright Â© 2016 Elsevier GmbH. All rights reserved.
Particulated articular cartilage: CAIS and DeNovo NT.

Science.gov (United States)

Farr, Jack; Cole, Brian J; Sherman, Seth; Karas, Vasili

2012-03-01

Cartilage Autograft Implantation System (CAIS; DePuy/Mitek, Raynham, MA) and DeNovo Natural Tissue (NT; ISTO, St. Louis, MO) are novel treatment options for focal articular cartilage defects in the knee. These methods involve the implantation of particulated articular cartilage from either autograft or juvenile allograft donor, respectively. In the laboratory and in animal models, both CAIS and DeNovo NT have demonstrated the ability of the transplanted cartilage cells to "escape" from the extracellular matrix, migrate, multiply, and form a new hyaline-like cartilage tissue matrix that integrates with the surrounding host tissue. In clinical practice, the technique for both CAIS and DeNovo NT is straightforward, requiring only a single surgery to affect cartilage repair. Clinical experience is limited, with short-term studies demonstrating both procedures to be safe, feasible, and effective, with improvements in subjective patient scores, and with magnetic resonance imaging evidence of good defect fill. While these treatment options appear promising, prospective randomized controlled studies are necessary to refine the indications and contraindications for both CAIS and DeNovo NT.
A comprehensive comparison of comparative RNA structure prediction approaches

DEFF Research Database (Denmark)

Gardner, P. P.; Giegerich, R.

2004-01-01

-finding and multiple-sequence-alignment algorithms. Results Here we evaluate a number of RNA folding algorithms using reliable RNA data-sets and compare their relative performance. Conclusions We conclude that comparative data can enhance structure prediction but structure-prediction-algorithms vary widely in terms......Background An increasing number of researchers have released novel RNA structure analysis and prediction algorithms for comparative approaches to structure prediction. Yet, independent benchmarking of these algorithms is rarely performed as is now common practice for protein-folding, gene...
Nucleic acid secondary structure prediction and display.

OpenAIRE

Stüber, K

1986-01-01

A set of programs has been developed for the prediction and display of nucleic acid secondary structures. Information from experimental data can be used to restrict or enforce secondary structural elements. The predictions can be displayed either on normal line printers or on graphic devices like plotters or graphic terminals.
Viral IRES prediction system - a web server for prediction of the IRES secondary structure in silico.

Directory of Open Access Journals (Sweden)

Jun-Jie Hong

Full Text Available The internal ribosomal entry site (IRES functions as cap-independent translation initiation sites in eukaryotic cells. IRES elements have been applied as useful tools for bi-cistronic expression vectors. Current RNA structure prediction programs are unable to predict precisely the potential IRES element. We have designed a viral IRES prediction system (VIPS to perform the IRES secondary structure prediction. In order to obtain better results for the IRES prediction, the VIPS can evaluate and predict for all four different groups of IRESs with a higher accuracy. RNA secondary structure prediction, comparison, and pseudoknot prediction programs were implemented to form the three-stage procedure for the VIPS. The backbone of VIPS includes: the RNAL fold program, aimed to predict local RNA secondary structures by minimum free energy method; the RNA Align program, intended to compare predicted structures; and pknotsRG program, used to calculate the pseudoknot structure. VIPS was evaluated by using UTR database, IRES database and Virus database, and the accuracy rate of VIPS was assessed as 98.53%, 90.80%, 82.36% and 80.41% for IRES groups 1, 2, 3, and 4, respectively. This advance useful search approach for IRES structures will facilitate IRES related studies. The VIPS on-line website service is available at http://140.135.61.250/vips/.
Computational predictions of zinc oxide hollow structures

Science.gov (United States)

Tuoc, Vu Ngoc; Huan, Tran Doan; Thao, Nguyen Thi

2018-03-01

Nanoporous materials are emerging as potential candidates for a wide range of technological applications in environment, electronic, and optoelectronics, to name just a few. Within this active research area, experimental works are predominant while theoretical/computational prediction and study of these materials face some intrinsic challenges, one of them is how to predict porous structures. We propose a computationally and technically feasible approach for predicting zinc oxide structures with hollows at the nano scale. The designed zinc oxide hollow structures are studied with computations using the density functional tight binding and conventional density functional theory methods, revealing a variety of promising mechanical and electronic properties, which can potentially find future realistic applications.
Effects prediction guidelines for structures subjected to ground motion

International Nuclear Information System (INIS)

1975-07-01

Part of the planning for an underground nuclear explosion (UNE) is determining the effects of expected ground motion on exposed structures. Because of the many types of structures and the wide variation in ground motion intensity typically encountered, no single prediction method is both adequate and feasible for a complete evaluation. Furthermore, the nature and variability of ground motion and structure damage prescribe effects predictions that are made probabilistically. Initially, prediction for a UNE involves a preliminary assessment of damage to establish overall project feasibility. Subsequent efforts require more detailed damage evaluations, based on structure inventories and analyses of specific structures, so that safety problems can be identified and safety and remedial measures can be recommended. To cover this broad range of effects prediction needs for a typical UNE project, three distinct but interrelated methods have been developed and are described. First, the fundamental practical and theoretical aspects of predicting the effects of dynamic ground motion on structures are summarized. Next, experimentally derived and theoretically determined observations of the behavior of typical structures subjected to ground motion are presented. Then, based on these fundamental considerations and on the observed behavior of structures, the formulation of the three effects prediction procedures is described, along with guidelines regarding their applicability. Example damage predictions for hypothetical UNEs demonstrate these procedures. To aid in identifying the vibration properties of complex structures, one chapter discusses alternatives in vibration testing, instrumentation, and data analysis. Finally, operational guidelines regarding data acquisition procedures, safety criteria, and remedial measures involved in conducting structure effects evaluations are discussed. (U.S.)
Cavitation during the protein misfolding cyclic amplification (PMCA) method – The trigger for de novo prion generation?

International Nuclear Information System (INIS)

Haigh, Cathryn L.; Drew, Simon C.

2015-01-01

The protein misfolding cyclic amplification (PMCA) technique has become a widely-adopted method for amplifying minute amounts of the infectious conformer of the prion protein (PrP). PMCA involves repeated cycles of 20 kHz sonication and incubation, during which the infectious conformer seeds the conversion of normally folded protein by a templating interaction. Recently, it has proved possible to create an infectious PrP conformer without the need for an infectious seed, by including RNA and the phospholipid POPG as essential cofactors during PMCA. The mechanism underpinning this de novo prion formation remains unknown. In this study, we first establish by spin trapping methods that cavitation bubbles formed during PMCA provide a radical-rich environment. Using a substrate preparation comparable to that employed in studies of de novo prion formation, we demonstrate by immuno-spin trapping that PrP- and RNA-centered radicals are generated during sonication, in addition to PrP-RNA cross-links. We further show that serial PMCA produces protease-resistant PrP that is oxidatively modified. We suggest a unique confluence of structural (membrane-mimetic hydrophobic/hydrophilic bubble interface) and chemical (ROS) effects underlie the phenomenon of de novo prion formation by PMCA, and that these effects have meaningful biological counterparts of possible relevance to spontaneous prion formation in vivo. - Highlights: • Sonication during PMCA generates free radicals at the surface of cavitation bubbles. • PrP-centered and RNA-centered radicals are formed in addition to PrP-RNA adducts. • De novo prions may result from ROS and structural constraints during cavitation
Cavitation during the protein misfolding cyclic amplification (PMCA) method – The trigger for de novo prion generation?

Energy Technology Data Exchange (ETDEWEB)

Haigh, Cathryn L., E-mail: chaigh@unimelb.edu.au [Department of Pathology, The University of Melbourne, Victoria 3010 (Australia); Drew, Simon C., E-mail: sdrew@unimelb.edu.au [Florey Department of Neuroscience and Mental Health, The University of Melbourne, Victoria 3010 (Australia)

2015-06-05

The protein misfolding cyclic amplification (PMCA) technique has become a widely-adopted method for amplifying minute amounts of the infectious conformer of the prion protein (PrP). PMCA involves repeated cycles of 20 kHz sonication and incubation, during which the infectious conformer seeds the conversion of normally folded protein by a templating interaction. Recently, it has proved possible to create an infectious PrP conformer without the need for an infectious seed, by including RNA and the phospholipid POPG as essential cofactors during PMCA. The mechanism underpinning this de novo prion formation remains unknown. In this study, we first establish by spin trapping methods that cavitation bubbles formed during PMCA provide a radical-rich environment. Using a substrate preparation comparable to that employed in studies of de novo prion formation, we demonstrate by immuno-spin trapping that PrP- and RNA-centered radicals are generated during sonication, in addition to PrP-RNA cross-links. We further show that serial PMCA produces protease-resistant PrP that is oxidatively modified. We suggest a unique confluence of structural (membrane-mimetic hydrophobic/hydrophilic bubble interface) and chemical (ROS) effects underlie the phenomenon of de novo prion formation by PMCA, and that these effects have meaningful biological counterparts of possible relevance to spontaneous prion formation in vivo. - Highlights: • Sonication during PMCA generates free radicals at the surface of cavitation bubbles. • PrP-centered and RNA-centered radicals are formed in addition to PrP-RNA adducts. • De novo prions may result from ROS and structural constraints during cavitation.
Stochastic Extreme Load Predictions for Marine Structures

DEFF Research Database (Denmark)

Jensen, Jørgen Juncher

1999-01-01

Development of rational design criteria for marine structures requires reliable estimates for the maximum wave-induced loads the structure may encounter during its operational lifetime. The paper discusses various methods for extreme value predictions taking into account the non-linearity of the ......Development of rational design criteria for marine structures requires reliable estimates for the maximum wave-induced loads the structure may encounter during its operational lifetime. The paper discusses various methods for extreme value predictions taking into account the non......-linearity of the waves and the response. As example the wave-induced bending moment in the ship hull girder is considered....
Perinatal Autopsy Findings in a Case of De Novo Hypohidrotic Ectodermal Dysplasia.

Science.gov (United States)

Chikkannaiah, Panduranga; Nagaraju, Smitha; Kangle, Rajit; Gosavi, Mansi

2015-01-01

Ectodermal dysplasia are group of inherited disorders involving the developmental defects of ectodermal structures like hair, teeth, nails, sweat glands, and others. X-linked recessive inheritance is most common. Here we describe perinatal autopsy findings in a case of de novo ectodermal dysplasia in a female fetus. To the best of our knowledge, this is the first fetal autopsy description in a case of ectodermal dysplasia.
''de novo'' aneurysms following endovascular procedures

Energy Technology Data Exchange (ETDEWEB)

Briganti, F.; Cirillo, S.; Caranci, F. [Department of Neurological Sciences, Services of Neuroradiology, ' ' Federico II' ' University, Naples (Italy); Esposito, F.; Maiuri, F. [Department of Neurological Sciences, Services of Neurosurgery, ' ' Federico II' ' University, Naples (Italy)

2002-07-01

Two personal cases of ''de novo'' aneurysms of the anterior communicating artery (ACoA) occurring 9 and 4 years, respectively, after endovascular carotid occlusion are described. A review of the 30 reported cases (including our own two) of ''de novo'' aneurysms after occlusion of the major cerebral vessels has shown some features, including a rather long time interval after the endovascular procedure of up to 20-25 years (average 9.6 years), a preferential ACoA (36.3%) and internal carotid artery-posterior communicating artery (ICA-PCoA) (33.3%) location of the ''de novo'' aneurysms, and a 10% rate of multiple aneurysms. These data are compared with those of the group of reported spontaneous ''de novo'' aneurysms after SAH or previous aneurysm clipping. We agree that the frequency of ''de novo'' aneurysms after major-vessel occlusion (two among ten procedures in our series, or 20%) is higher than commonly reported (0 to 11%). For this reason, we suggest that patients who have been submitted to endovascular major-vessel occlusion be followed up for up to 20-25 years after the procedure, using non-invasive imaging studies such as MR angiography and high-resolution CT angiography. On the other hand, periodic digital angiography has a questionable risk-benefit ratio; it may be used when a ''de novo'' aneurysm is detected or suspected on non-invasive studies. The progressive enlargement of the ACoA after carotid occlusion, as described in our case 1, must be considered a radiological finding of risk for ''de novo'' aneurysm formation. (orig.)
Language and national identity in Novo Cinema Galego

Directory of Open Access Journals (Sweden)

Brais ROMERO SUÁREZ

2015-12-01

Full Text Available The talk of town since its inception in 2010, the Cinema Novo Galego has been successful in all competitions and festivals that has been present. From the FIPRESCI prize in Cannes to the Best Emerging Director at Locarno, this new wave of cinema places Galicia in the world film stage. But does Novo Cinema Galego an accurate representation of Galicia? What's the role of Galicia in this movement?
Evolving stochastic context-free grammars for RNA secondary structure prediction

DEFF Research Database (Denmark)

Anderson, James WJ; Tataru, Paula Cristina; Stains, Joe

2012-01-01

Background Stochastic Context-Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few...... to structure prediction as has been previously suggested. Results These search techniques were applied to predict RNA secondary structure on a maximal data set and revealed new and interesting grammars, though none are dramatically better than classic grammars. In general, results showed that many grammars...... with quite different structure could have very similar predictive ability. Many ambiguous grammars were found which were at least as effective as the best current unambiguous grammars. Conclusions Overall the method of evolving SCFGs for RNA secondary structure prediction proved effective in finding many...
A composite method based on formal grammar and DNA structural features in detecting human polymerase II promoter region.

Directory of Open Access Journals (Sweden)

Sutapa Datta

Full Text Available An important step in understanding gene regulation is to identify the promoter regions where the transcription factor binding takes place. Predicting a promoter region de novo has been a theoretical goal for many researchers for a long time. There exists a number of in silico methods to predict the promoter region de novo but most of these methods are still suffering from various shortcomings, a major one being the selection of appropriate features of promoter region distinguishing them from non-promoters. In this communication, we have proposed a new composite method that predicts promoter sequences based on the interrelationship between structural profiles of DNA and primary sequence elements of the promoter regions. We have shown that a Context Free Grammar (CFG can formalize the relationships between different primary sequence features and by utilizing the CFG, we demonstrate that an efficient parser can be constructed for extracting these relationships from DNA sequences to distinguish the true promoter sequences from non-promoter sequences. Along with CFG, we have extracted the structural features of the promoter region to improve upon the efficiency of our prediction system. Extensive experiments performed on different datasets reveals that our method is effective in predicting promoter sequences on a genome-wide scale and performs satisfactorily as compared to other promoter prediction techniques.
A Composite Method Based on Formal Grammar and DNA Structural Features in Detecting Human Polymerase II Promoter Region

Science.gov (United States)

Datta, Sutapa; Mukhopadhyay, Subhasis

2013-01-01

An important step in understanding gene regulation is to identify the promoter regions where the transcription factor binding takes place. Predicting a promoter region de novo has been a theoretical goal for many researchers for a long time. There exists a number of in silico methods to predict the promoter region de novo but most of these methods are still suffering from various shortcomings, a major one being the selection of appropriate features of promoter region distinguishing them from non-promoters. In this communication, we have proposed a new composite method that predicts promoter sequences based on the interrelationship between structural profiles of DNA and primary sequence elements of the promoter regions. We have shown that a Context Free Grammar (CFG) can formalize the relationships between different primary sequence features and by utilizing the CFG, we demonstrate that an efficient parser can be constructed for extracting these relationships from DNA sequences to distinguish the true promoter sequences from non-promoter sequences. Along with CFG, we have extracted the structural features of the promoter region to improve upon the efficiency of our prediction system. Extensive experiments performed on different datasets reveals that our method is effective in predicting promoter sequences on a genome-wide scale and performs satisfactorily as compared to other promoter prediction techniques. PMID:23437045
RNA secondary structure prediction using soft computing.

Science.gov (United States)

Ray, Shubhra Sankar; Pal, Sankar K

2013-01-01

Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.
Predicting DNA-binding proteins and binding residues by complex structure prediction and application to human proteome.

Directory of Open Access Journals (Sweden)

Huiying Zhao

Full Text Available As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather than low-resolution two-state prediction of DNA-binding as most existing techniques do. The method first predicts protein-DNA complex structure by utilizing the template-based structure prediction technique HHblits, followed by binding affinity prediction based on a knowledge-based energy function (Distance-scaled finite ideal-gas reference state for protein-DNA interactions. A leave-one-out cross validation of the method based on 179 DNA-binding and 3797 non-binding protein domains achieves a Matthews correlation coefficient (MCC of 0.77 with high precision (94% and high sensitivity (65%. We further found 51% sensitivity for 82 newly determined structures of DNA-binding proteins and 56% sensitivity for the human proteome. In addition, the method provides a reasonably accurate prediction of DNA-binding residues in proteins based on predicted DNA-binding complex structures. Its application to human proteome leads to more than 300 novel DNA-binding proteins; some of these predicted structures were validated by known structures of homologous proteins in APO forms. The method [SPOT-Seq (DNA] is available as an on-line server at http://sparks-lab.org.

A de novo missense mutation of FGFR2 causes facial dysplasia syndrome in Holstein cattle

DEFF Research Database (Denmark)

Agerholm, Jørgen Steen; McEvoy, Fintan; Heegaard, Steffen

2017-01-01

was suspected as all recorded cases were progeny of the same sire. Detailed investigations were performed to characterize the syndrome and to reveal its cause. Results Seven malformed calves were submitted examination. All cases shared a common morphology with the most striking lesions being severe facial...... chromosome 26 where whole genome sequencing of a case-parent trio revealed two de novo variants perfectly associated with the disease: an intronic SNP in the DMBT1 gene and a single non-synonymous variant in the FGFR2 gene. This FGFR2 missense variant (c.927G>T) affects a gene encoding a member...... of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and across species. It is predicted to change an evolutionary conserved tryptophan into a cysteine residue (p.Trp309Cys). Both variant alleles were proven to result from de novo mutation events...
On the Origin of De Novo Genes in Arabidopsis thaliana Populations.

Science.gov (United States)

Li, Zi-Wen; Chen, Xi; Wu, Qiong; Hagmann, Jörg; Han, Ting-Shen; Zou, Yu-Pan; Ge, Song; Guo, Ya-Long

2016-08-03

De novo genes, which originate from ancestral nongenic sequences, are one of the most important sources of protein-coding genes. This origination process is crucial for the adaptation of organisms. However, how de novo genes arise and become fixed in a population or species remains largely unknown. Here, we identified 782 de novo genes from the model plant Arabidopsis thaliana and divided them into three types based on the availability of translational evidence, transcriptional evidence, and neither transcriptional nor translational evidence for their origin. Importantly, by integrating multiple types of omics data, including data from genomes, epigenomes, transcriptomes, and translatomes, we found that epigenetic modifications (DNA methylation and histone modification) play an important role in the origination process of de novo genes. Intriguingly, using the transcriptomes and methylomes from the same population of 84 accessions, we found that de novo genes that are transcribed in approximately half of the total accessions within the population are highly methylated, with lower levels of transcription than those transcribed at other frequencies within the population. We hypothesized that, during the origin of de novo gene alleles, those neutralized to low expression states via DNA methylation have relatively high probabilities of spreading and becoming fixed in a population. Our results highlight the process underlying the origin of de novo genes at the population level, as well as the importance of DNA methylation in this process. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Glucagon infusion increases rate of purine synthesis de novo in rat liver

International Nuclear Information System (INIS)

Itakura, Mitsuo; Maeda, Noriaki; Tsuchiya, Masami; Yamashita, Kamejiro

1987-01-01

Based on the parallel increases of glucagon, the second peak of hepatic cAMP, and the rate of purine synthesis de novo in the prereplicative period in regenerating rate liver after a 70% hepatectomy, it was hypothesized that glucagon is responsible for the increased rate of purine synthesis de novo. To test this hypothesis, the effect of glucagon or dibutyryl cAMP infusion on the rate of purine synthesis de novo in rat liver was studied. Glucagon infusion but not insulin or glucose infusion increased the rate of purine synthesis de novo, which was assayed by [ 14 C]glycine or [ 14 C]formate incorporation, by 2.7- to 4.3-fold. Glucagon infusion increased cAMP concentrations by 4.9-fold and 5-phosphoribosyl-1-pyrophosphate concentrations by 1.5-fold in liver but did not change the specific activity of amidophosphoribosyltransferase or purine ribonucleotide concentrations. Dibutyryl cAMP infusion also increased the rate of purine synthesis de novo by 2.2- to 4.0-fold. Because glucagon infusion increased the rate of purine synthesis de novo in the presence of unchanged purine ribonucleotide concentrations, it is concluded that glucagon after infusion or in animals after a 70% hepatectomy is playing an anabolic role to increase the rate of purine synthesis de novo by increasing cAMP and 5-phosphoribosyl-1-pyrophosphate concentrations
Prediction of the Secondary Structure of HIV-1 gp120

DEFF Research Database (Denmark)

Hansen, Jan; Lund, Ole; Nielsen, Jens O.

1996-01-01

Fourier transform infrared spectroscopy. The predicted secondary structure of gp120 compared well with data from NMR analysis of synthetic peptides from the V3 loop and the C4 region. As a first step towards modeling the tertiary structure of gp120, the predicted secondary structure may guide the design......The secondary structure of HIV-1 gp120 was predicted using multiple alignment and a combination of two independent methods based on neural network and nearest-neighbor algorithms. The methods agreed on the secondary structure for 80% of the residues in BH10 gp120. Six helices were predicted in HIV...
EVA: continuous automatic evaluation of protein structure prediction servers.

Science.gov (United States)

Eyrich, V A; Martí-Renom, M A; Przybylski, D; Madhusudhan, M S; Fiser, A; Pazos, F; Valencia, A; Sali, A; Rost, B

2001-12-01

Evaluation of protein structure prediction methods is difficult and time-consuming. Here, we describe EVA, a web server for assessing protein structure prediction methods, in an automated, continuous and large-scale fashion. Currently, EVA evaluates the performance of a variety of prediction methods available through the internet. Every week, the sequences of the latest experimentally determined protein structures are sent to prediction servers, results are collected, performance is evaluated, and a summary is published on the web. EVA has so far collected data for more than 3000 protein chains. These results may provide valuable insight to both developers and users of prediction methods. http://cubic.bioc.columbia.edu/eva. eva@cubic.bioc.columbia.edu
High frequencies of de novo CNVs in bipolar disorder and schizophrenia.

LENUS (Irish Health Repository)

Malhotra, Dheeraj

2011-12-22

While it is known that rare copy-number variants (CNVs) contribute to risk for some neuropsychiatric disorders, the role of CNVs in bipolar disorder is unclear. Here, we reasoned that a contribution of CNVs to mood disorders might be most evident for de novo mutations. We performed a genome-wide analysis of de novo CNVs in a cohort of 788 trios. Diagnoses of offspring included bipolar disorder (n = 185), schizophrenia (n = 177), and healthy controls (n = 426). Frequencies of de novo CNVs were significantly higher in bipolar disorder as compared with controls (OR = 4.8 [1.4,16.0], p = 0.009). De novo CNVs were particularly enriched among cases with an age at onset younger than 18 (OR = 6.3 [1.7,22.6], p = 0.006). We also confirmed a significant enrichment of de novo CNVs in schizophrenia (OR = 5.0 [1.5,16.8], p = 0.007). Our results suggest that rare spontaneous mutations are an important contributor to risk for bipolar disorder and other major neuropsychiatric diseases.
A Kernel for Protein Secondary Structure Prediction

OpenAIRE

Guermeur , Yann; Lifchitz , Alain; Vert , Régis

2004-01-01

http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&tid=10338&mode=toc; International audience; Multi-class support vector machines have already proved efficient in protein secondary structure prediction as ensemble methods, to combine the outputs of sets of classifiers based on different principles. In this chapter, their implementation as basic prediction methods, processing the primary structure or the profile of multiple alignments, is investigated. A kernel devoted to the task is in...
Complete fold annotation of the human proteome using a novel structural feature space.

Science.gov (United States)

Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong

2017-04-13

Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.
Emergence, Retention and Selection: A Trilogy of Origination for Functional De Novo Proteins from Ancestral LncRNAs in Primates.

Directory of Open Access Journals (Sweden)

Jia-Yu Chen

2015-07-01

Full Text Available While some human-specific protein-coding genes have been proposed to originate from ancestral lncRNAs, the transition process remains poorly understood. Here we identified 64 hominoid-specific de novo genes and report a mechanism for the origination of functional de novo proteins from ancestral lncRNAs with precise splicing structures and specific tissue expression profiles. Whole-genome sequencing of dozens of rhesus macaque animals revealed that these lncRNAs are generally not more selectively constrained than other lncRNA loci. The existence of these newly-originated de novo proteins is also not beyond anticipation under neutral expectation, as they generally have longer theoretical lifespan than their current age, due to their GC-rich sequence property enabling stable ORFs with lower chance of non-sense mutations. Interestingly, although the emergence and retention of these de novo genes are likely driven by neutral forces, population genetics study in 67 human individuals and 82 macaque animals revealed signatures of purifying selection on these genes specifically in human population, indicating a proportion of these newly-originated proteins are already functional in human. We thus propose a mechanism for creation of functional de novo proteins from ancestral lncRNAs during the primate evolution, which may contribute to human-specific genetic novelties by taking advantage of existed genomic contexts.
Blind Test of Physics-Based Prediction of Protein Structures

Science.gov (United States)

Shell, M. Scott; Ozkan, S. Banu; Voelz, Vincent; Wu, Guohong Albert; Dill, Ken A.

2009-01-01

We report here a multiprotein blind test of a computer method to predict native protein structures based solely on an all-atom physics-based force field. We use the AMBER 96 potential function with an implicit (GB/SA) model of solvation, combined with replica-exchange molecular-dynamics simulations. Coarse conformational sampling is performed using the zipping and assembly method (ZAM), an approach that is designed to mimic the putative physical routes of protein folding. ZAM was applied to the folding of six proteins, from 76 to 112 monomers in length, in CASP7, a community-wide blind test of protein structure prediction. Because these predictions have about the same level of accuracy as typical bioinformatics methods, and do not utilize information from databases of known native structures, this work opens up the possibility of predicting the structures of membrane proteins, synthetic peptides, or other foldable polymers, for which there is little prior knowledge of native structures. This approach may also be useful for predicting physical protein folding routes, non-native conformations, and other physical properties from amino acid sequences. PMID:19186130
De Novo Human Cardiac Myocytes for Medical Research: Promises and Challenges

Directory of Open Access Journals (Sweden)

Veronique Hamel

2017-01-01

Full Text Available The advent of cellular reprogramming technology has revolutionized biomedical research. De novo human cardiac myocytes can now be obtained from direct reprogramming of somatic cells (such as fibroblasts, from induced pluripotent stem cells (iPSCs, which are reprogrammed from somatic cells, and from human embryonic stem cells (hESCs. Such de novo human cardiac myocytes hold great promise for in vitro disease modeling and drug screening and in vivo cell therapy of heart disease. Here, we review the technique advancements for generating de novo human cardiac myocytes. We also discuss several challenges for the use of such cells in research and regenerative medicine, such as the immature phenotype and heterogeneity of de novo cardiac myocytes obtained with existing protocols. We focus on the recent advancements in addressing such challenges.
De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.

Directory of Open Access Journals (Sweden)

Jacqueline D Farrell

Full Text Available Perennial ryegrass is a highly heterozygous outbreeding grass species used for turf and forage production. Heterozygosity can affect de-Bruijn graph assembly making de novo transcriptome assembly of species such as perennial ryegrass challenging. Creating a reference transcriptome from a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals of this study were to perform RNA-sequencing on multiple tissues from a highly inbred genotype to develop a reference transcriptome. This was complemented with RNA-sequencing of a highly heterozygous genotype for SNP calling.De novo transcriptome assembly of the inbred genotype created 185,833 transcripts with an average length of 830 base pairs. Within the inbred reference transcriptome 78,560 predicted open reading frames were found of which 24,434 were predicted as complete. Functional annotation found 50,890 transcripts with a BLASTp hit from the Swiss-Prot non-redundant database, 58,941 transcripts with a Pfam protein domain and 1,151 transcripts encoding putative secreted peptides. To evaluate the reference transcriptome we targeted the high-affinity K+ transporter gene family and found multiple orthologs. Using the longest unique open reading frames as the reference sequence, 64,242 single nucleotide polymorphisms were found. One thousand sixty one open reading frames from the inbred genotype contained heterozygous sites, confirming the high degree of homozygosity.Our study has developed an annotated, comprehensive transcriptome reference for perennial ryegrass that can aid in determining genetic variation, expression analysis, genome annotation, and gene mapping.
Melhoramento do cafeeiro: IV - Café Mundo Novo

Directory of Open Access Journals (Sweden)

A. Carvalho

1952-06-01

Full Text Available Em um conjunto de cafeeiros existentes em Mundo Novo, hoje Urupês, na região Araraquarense do Estado de São Paulo, foram feitas seleções de vários cafeeiros baseando-se no seu aspecto vegetativo, na produção existente na época da seleção e na provável produção do ano seguinte. Estudou-se a origem da plantação inicial desse café, tanto em Urupês como em Jaú, chegando-se à conclusão de que é provavelmente originário desta última localidade. Progênies do café "Mundo Novo", anteriormente conhecido por "Sumatra" e derivado de plantas selecionadas em Urupês e Jaú, acham-se em estudo em seis localidades do Estado : Campinas, Ribeirão Prêto, Pindorama, Mococa, Jaú e Monte Alegre do Sul. No presente trabalho são apenas aproveitados dados referentes à variabilidade morfológica e característicos da produção das progênies dos primeiros cafeeiros selecionados em Urupês e estudados em Campinas, Jaú, Pindorama e Mococa. Em tôdas as localidades, observou-se variação nos caracteres morfológicos das progênies, verificando-se a ocorrência de plantas quase improdutivas. A maioria das progênies, no entanto, se caracteriza por acentuado vigor vegetativo. Foram estudadas as produções totais das progénies e das plantas, no período 1946-1951, notando-se que algumas progénies se salientaram pela elevada produção em tôdas as localidades. Os tipos de sementes "moca", "concha" e "chato" foram determinados em amostras de tôdas as plantas, por um período de três anos, notando-se que a variação ocorrida é da mesma ordem que a encontrada em outros cafeeiros em seleção. Procurou-se eliminar, pela seleção, cafeeiros com elevada produção de frutos sem sementes em uma ou duas lojas, característico êsse que parece ser hereditário. Os resultados obtidos de cruzamento entre os melhores cafeeiros "Mundo Novo" de Campinas e plantas da variedade murta, indicaram que esses cafeeiros são do tipo bourbon. Provavelmente
Ensemble Architecture for Prediction of Enzyme-ligand Binding Residues Using Evolutionary Information.

Science.gov (United States)

Pai, Priyadarshini P; Dattatreya, Rohit Kadam; Mondal, Sukanta

2017-11-01

Enzyme interactions with ligands are crucial for various biochemical reactions governing life. Over many years attempts to identify these residues for biotechnological manipulations have been made using experimental and computational techniques. The computational approaches have gathered impetus with the accruing availability of sequence and structure information, broadly classified into template-based and de novo methods. One of the predominant de novo methods using sequence information involves application of biological properties for supervised machine learning. Here, we propose a support vector machines-based ensemble for prediction of protein-ligand interacting residues using one of the most important discriminative contributing properties in the interacting residue neighbourhood, i. e., evolutionary information in the form of position-specific- scoring matrix (PSSM). The study has been performed on a non-redundant dataset comprising of 9269 interacting and 91773 non-interacting residues for prediction model generation and further evaluation. Of the various PSSM-based models explored, the proposed method named ROBBY (pRediction Of Biologically relevant small molecule Binding residues on enzYmes) shows an accuracy of 84.0 %, Matthews Correlation Coefficient of 0.343 and F-measure of 39.0 % on 78 test enzymes. Further, scope of adding domain knowledge such as pocket information has also been investigated; results showed significant enhancement in method precision. Findings are hoped to boost the reliability of small-molecule ligand interaction prediction for enzyme applications and drug design. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
De novo transcriptome assembly of the mycoheterotrophic plant Monotropa hypopitys

Directory of Open Access Journals (Sweden)

Alexey V. Beletsky

2017-03-01

Full Text Available Monotropa hypopitys (pinesap is a non-photosynthetic obligately mycoheterotrophic plant of the family Ericaceae. It obtains the carbon and other nutrients from the roots of surrounding autotrophic trees through the associated mycorrhizal fungi. In order to understand the evolutionary changes in the plant genome associated with transition to a heterotrophic lifestyle, we performed de novo transcriptomic analysis of M. hypopitys using next-generation sequencing. We obtained the RNA-Seq data from flowers, flower bracts and roots with haustoria using Illumina HiSeq2500 platform. The raw data obtained in this study can be available in NCBI SRA database with accession number of SRP069226. A total of 10.3 GB raw sequence data were obtained, corresponding to 103,357,809 raw reads. A total of 103,025,683 reads were filtered after removing low-quality reads and trimming the adapter sequences. The Trinity program was used to de novo assemble 98,349 unigens with an N50 of 1342 bp. Using the TransDecoder program, we predicted 43,505 putative proteins. 38,416 unigenes were annotated in the Swiss-Prot protein sequence database using BLASTX. The obtained transcriptomic data will be useful for further studies of the evolution of plant genomes upon transition to a non-photosynthetic lifestyle and the loss of photosynthesis-related functions.
Failure of Noninvasive Ventilation for De Novo Acute Hypoxemic Respiratory Failure: Role of Tidal Volume.

Science.gov (United States)

Carteaux, Guillaume; Millán-Guilarte, Teresa; De Prost, Nicolas; Razazi, Keyvan; Abid, Shariq; Thille, Arnaud W; Schortgen, Frédérique; Brochard, Laurent; Brun-Buisson, Christian; Mekontso Dessap, Armand

2016-02-01

A low or moderate expired tidal volume can be difficult to achieve during noninvasive ventilation for de novo acute hypoxemic respiratory failure (i.e., not due to exacerbation of chronic lung disease or cardiac failure). We assessed expired tidal volume and its association with noninvasive ventilation outcome. Prospective observational study. Twenty-four bed university medical ICU. Consecutive patients receiving noninvasive ventilation for acute hypoxemic respiratory failure between August 2010 and February 2013. Noninvasive ventilation was uniformly delivered using a simple algorithm targeting the expired tidal volume between 6 and 8 mL/kg of predicted body weight. Expired tidal volume was averaged and respiratory and hemodynamic variables were systematically recorded at each noninvasive ventilation session. Sixty-two patients were enrolled, including 47 meeting criteria for acute respiratory distress syndrome, and 32 failed noninvasive ventilation (51%). Pneumonia (n = 51, 82%) was the main etiology of acute hypoxemic respiratory failure. The median (interquartile range) expired tidal volume averaged over all noninvasive ventilation sessions (mean expired tidal volume) was 9.8 mL/kg predicted body weight (8.1-11.1 mL/kg predicted body weight). The mean expired tidal volume was significantly higher in patients who failed noninvasive ventilation as compared with those who succeeded (10.6 mL/kg predicted body weight [9.6-12.0] vs 8.5 mL/kg predicted body weight [7.6-10.2]; p = 0.001), and expired tidal volume was independently associated with noninvasive ventilation failure in multivariate analysis. This effect was mainly driven by patients with PaO2/FIO2 up to 200 mm Hg. In these patients, the expired tidal volume above 9.5 mL/kg predicted body weight predicted noninvasive ventilation failure with a sensitivity of 82% and a specificity of 87%. A low expired tidal volume is almost impossible to achieve in the majority of patients receiving noninvasive ventilation
De Novo Collapsing Glomerulopathy in a Renal Allograft Recipient

Directory of Open Access Journals (Sweden)

Kanodia K

2008-01-01

Full Text Available Collapsing glomerulopathy (CG, characterized histologically by segmental/global glomerular capillary collapse, podocyte hypertrophy and hypercellularity and tubulo-interstitial injury; is characterized clinically by massive proteinuria and rapid progressive renal failure. CG is known to recur in renal allograft and rarely de novo. We report de novo CG 3 years post-transplant in a patient who received renal allograft from haplo-identical type donor.
Purine biosynthesis de novo by lymphocytes in gout

International Nuclear Information System (INIS)

Kamoun, P.; Chanard, J.; Brami, M.; Funck-Brentano, J.L.

1978-01-01

A method of measurement in vitro of purine biosynthesis de novo in human circulating blood lymphocytes is proposed. The rate of early reactions of purine biosynthesis de novo was determined by the incorporation of [ 14 C]formate into N-formyl glycinamide ribonucleotide when the subsequent reactions of the metabolic pathway were completely inhibited by the antibiotic azaserine. Synthesis of 14 C-labelled N-formyl glycinamide ribonucleotide by lymphocytes was measured in healthy control subjects and patients with primary gout or hyperuricaemia secondary to renal failure, with or without allopurinol therapy. The average synthesis was higher in gouty patients without therapy than in control subjects, but the values contained overlap the normal range. In secondary hyperuricaemia the synthesis was at same value as in control subjects. These results are in agreement with the inconstant acceleration of purine biosynthesis de novo in gouty patients as seen by others with measurement of [ 14 C]glycine incorporation into urinary uric acid. (author)
De novo giant A2 aneurysm following anterior communicating artery occlusion.

Science.gov (United States)

Ibrahim, Tarik F; Hafez, Ahmad; Andrade-Barazarte, Hugo; Raj, Rahul; Niemela, Mika; Lehto, Hanna; Numminen, Jussi; Jarvelainen, Juha; Hernesniemi, Juha

2015-01-01

De novo intracranial aneurysms are reported to occur with varying incidence after intracranial aneurysm treatment. They are purported to be observed, however, with increased incidence after Hunterian ligation; particularly in cases of carotid artery occlusion for giant or complex aneurysms deemed unclippable. We report a case of right-sided de novo giant A2 aneurysm 6 years after an anterior communicating artery (ACoA) aneurysm clipping. We believe this de novo aneurysm developed in part due to patient-specific risk factors but also a significant change in cerebral hemodynamics. The ACoA became occluded after surgery that likely altered the cerebral hemodynamics and contributed to the de novo aneurysm. We believe this to be the first reported case of a giant de novo aneurysm in this location. Following parent vessel occlusion (mostly of the carotid artery), there are no reports of any de novo aneurysms in the pericallosal arteries let alone a giant one. The patient had a dominant right A1 and the sudden increase in A2 blood flow likely resulted in increased wall shear stress, particularly in the medial wall of the A2 where the aneurysm occurred 2 mm distal to the A1-2 junction. ACoA preservation is a key element of aneurysm surgery in this location. Suspected occlusion of this vessel may warrant closer radiographic follow-up in patients with other risk factors for aneurysm development.
RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

Science.gov (United States)

Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

2018-01-01

RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.

Direct Visualization of De novo Lipogenesis in Single Living Cells

Science.gov (United States)

Li, Junjie; Cheng, Ji-Xin

2014-10-01

Increased de novo lipogenesis is being increasingly recognized as a hallmark of cancer. Despite recent advances in fluorescence microscopy, autoradiography and mass spectrometry, direct observation of de novo lipogenesis in living systems remains to be challenging. Here, by coupling stimulated Raman scattering (SRS) microscopy with isotope labeled glucose, we were able to trace the dynamic metabolism of glucose in single living cells with high spatial-temporal resolution. As the first direct visualization, we observed that glucose was largely utilized for lipid synthesis in pancreatic cancer cells, which occurs at a much lower rate in immortalized normal pancreatic epithelial cells. By inhibition of glycolysis and fatty acid synthase (FAS), the key enzyme for fatty acid synthesis, we confirmed the deuterium labeled lipids in cancer cells were from de novo lipid synthesis. Interestingly, we also found that prostate cancer cells exhibit relatively lower level of de novo lipogenesis, but higher fatty acid uptake compared to pancreatic cancer cells. Together, our results demonstrate a valuable tool to study dynamic lipid metabolism in cancer and other disorders.
Protein secondary structure: category assignment and predictability

DEFF Research Database (Denmark)

Andersen, Claus A.; Bohr, Henrik; Brunak, Søren

2001-01-01

In the last decade, the prediction of protein secondary structure has been optimized using essentially one and the same assignment scheme known as DSSP. We present here a different scheme, which is more predictable. This scheme predicts directly the hydrogen bonds, which stabilize the secondary......-forward neural network with one hidden layer on a data set identical to the one used in earlier work....
Response monitoring in de novo patients with Parkinson's disease.

Directory of Open Access Journals (Sweden)

Rita Willemssen

Full Text Available BACKGROUND: Parkinson's disease (PD is accompanied by dysfunctions in a variety of cognitive processes. One of these is error processing, which depends upon phasic decreases of medial prefrontal dopaminergic activity. Until now, there is no study evaluating these processes in newly diagnosed, untreated patients with PD ("de novo PD". METHODOLOGY/PRINCIPAL FINDINGS: Here we report large changes in performance monitoring processes using event-related potentials (ERPs in de novo PD-patients. The results suggest that increases in medial frontal dopaminergic activity after an error (Ne are decreased, relative to age-matched controls. In contrast, neurophysiological processes reflecting general motor response monitoring (Nc are enhanced in de novo patients. CONCLUSIONS/SIGNIFICANCE: It may be hypothesized that the Nc-increase is at costs of dopaminergic activity after an error; on a functional level errors may not always be detected and correct responses sometimes be misinterpreted as errors. This pattern differs from studies examining patients with a longer history of PD and may reflect compensatory processes, frequently occurring in pre-manifest stages of PD. From a clinical point of view the clearly attenuated Ne in the de novo PD patients may prove a useful additional tool for the early diagnosis of basal ganglia dysfunction in PD.
Precise detection of de novo single nucleotide variants in human genomes.

Science.gov (United States)

Gómez-Romero, Laura; Palacios-Flores, Kim; Reyes, José; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Schatz, Michael C; Palacios, Rafael

2018-05-07

The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use. Copyright © 2018 the Author(s). Published by PNAS.
De novo mutations in synaptic transmission genes including DNM1 cause epileptic encephalopathies

DEFF Research Database (Denmark)

2014-01-01

in five individuals and de novo mutations in GABBR2, FASN, and RYR3 in two individuals each. Unlike previous studies, this cohort is sufficiently large to show a significant excess of de novo mutations in epileptic encephalopathy probands compared to the general population using a likelihood analysis (p...... = 8.2 × 10(-4)), supporting a prominent role for de novo mutations in epileptic encephalopathies. We bring statistical evidence that mutations in DNM1 cause epileptic encephalopathy, find suggestive evidence for a role of three additional genes, and show that at least 12% of analyzed individuals have...... analyzed exome-sequencing data of 356 trios with the "classical" epileptic encephalopathies, infantile spasms and Lennox Gastaut syndrome, including 264 trios previously analyzed by the Epi4K/EPGP consortium. In this expanded cohort, we find 429 de novo mutations, including de novo mutations in DNM1...
A semi-supervised learning approach for RNA secondary structure prediction.

Science.gov (United States)

Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

2015-08-01

RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.
De novo transcriptome assembly of shrimp Palaemon serratus

Directory of Open Access Journals (Sweden)

Alejandra Perina

2017-03-01

Full Text Available The shrimp Palaemon serratus is a coastal decapod crustacean with a high commercial value. It is harvested for human consumption. In this study, we used Illumina sequencing technology (HiSeq 2000 to sequence, assemble and annotate the transcriptome of P. serratus. RNA was isolated from muscle of adults individuals and, from a pool of larvae. A total number of 4 cDNA libraries were constructed, using the TruSeq RNA Sample Preparation Kit v2. The raw data in this study was deposited in NCBI SRA database with study accession number of SRP090769. The obtained data were subjected to de novo transcriptome assembly using Trinity software, and coding regions were predicted by TransDecoder. We used Blastp and Sma3s to annotate the identified proteins. The transcriptome data could provide some insight into the understanding of genes involved in the larval development and metamorphosis.
QCD dipole predictions for DIS and diffractive structure functions

International Nuclear Information System (INIS)

Royon, C.

1997-01-01

The proton structure function F 2 , the gluon density F G , and the longitudinal structure function F L are derived in the QCD dipole picture of BFKL dynamics. We use a three parameter fit to describe the 1994 H1 proton structure function F 2 data in the low x, moderate Q 2 range. Without any additional parameter, the gluon density and the longitudinal structure functions are predicted. The diffractive dissociation processes are also discussed within the same framework, and a new prediction for the proton diffractive structure function is obtained
Evolutionary rate variation and RNA secondary structure prediction

DEFF Research Database (Denmark)

Knudsen, B.; Andersen, E.S.; Damgaard, C.

2004-01-01

Predicting RNA secondary structure using evolutionary history can be carried out by using an alignment of related RNA sequences with conserved structure. Accurately determining evolutionary substitution rates for base pairs and single stranded nucleotides is a concern for methods based on this type...... by applying rates derived from tRNA and rRNA to the prediction of the much more rapidly evolving 5'-region of HIV-1. We find that the HIV-1 prediction is in agreement with experimental data, even though the relative evolutionary rate between A and G is significantly increased, both in stem and loop regions...
Distance matrix-based approach to protein structure prediction.

Science.gov (United States)

Kloczkowski, Andrzej; Jernigan, Robert L; Wu, Zhijun; Song, Guang; Yang, Lei; Kolinski, Andrzej; Pokarowski, Piotr

2009-03-01

Much structural information is encoded in the internal distances; a distance matrix-based approach can be used to predict protein structure and dynamics, and for structural refinement. Our approach is based on the square distance matrix D = [r(ij)(2)] containing all square distances between residues in proteins. This distance matrix contains more information than the contact matrix C, that has elements of either 0 or 1 depending on whether the distance r (ij) is greater or less than a cutoff value r (cutoff). We have performed spectral decomposition of the distance matrices D = sigma lambda(k)V(k)V(kT), in terms of eigenvalues lambda kappa and the corresponding eigenvectors v kappa and found that it contains at most five nonzero terms. A dominant eigenvector is proportional to r (2)--the square distance of points from the center of mass, with the next three being the principal components of the system of points. By predicting r (2) from the sequence we can approximate a distance matrix of a protein with an expected RMSD value of about 7.3 A, and by combining it with the prediction of the first principal component we can improve this approximation to 4.0 A. We can also explain the role of hydrophobic interactions for the protein structure, because r is highly correlated with the hydrophobic profile of the sequence. Moreover, r is highly correlated with several sequence profiles which are useful in protein structure prediction, such as contact number, the residue-wise contact order (RWCO) or mean square fluctuations (i.e. crystallographic temperature factors). We have also shown that the next three components are related to spatial directionality of the secondary structure elements, and they may be also predicted from the sequence, improving overall structure prediction. We have also shown that the large number of available HIV-1 protease structures provides a remarkable sampling of conformations, which can be viewed as direct structural information about the
De Novo Assembly and Characterization of the Transcriptome of Grasshopper Shirakiacris shirakii

Directory of Open Access Journals (Sweden)

Zhongying Qiu

2016-07-01

Full Text Available Background: The grasshopper Shirakiacris shirakii is an important agricultural pest and feeds mainly on gramineous plants, thereby causing economic damage to a wide range of crops. However, genomic information on this species is extremely limited thus far, and transcriptome data relevant to insecticide resistance and pest control are also not available. Methods: The transcriptome of S. shirakii was sequenced using the Illumina HiSeq platform, and we de novo assembled the transcriptome. Results: Its sequencing produced a total of 105,408,878 clean reads, and the de novo assembly revealed 74,657 unigenes with an average length of 680 bp and N50 of 1057 bp. A total of 28,173 unigenes were annotated for the NCBI non-redundant protein sequences (Nr, NCBI non-redundant nucleotide sequences (Nt, a manually-annotated and reviewed protein sequence database (Swiss-Prot, Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG databases. Based on the Nr annotation results, we manually identified 79 unigenes encoding cytochrome P450 monooxygenases (P450s, 36 unigenes encoding carboxylesterases (CarEs and 36 unigenes encoding glutathione S-transferases (GSTs in S. shirakii. Core RNAi components relevant to miroRNA, siRNA and piRNA pathways, including Pasha, Loquacious, Argonaute-1, Argonaute-2, Argonaute-3, Zucchini, Aubergine, enhanced RNAi-1 and Piwi, were expressed in S. shirakii. We also identified five unigenes that were homologous to the Sid-1 gene. In addition, the analysis of differential gene expressions revealed that a total of 19,764 unigenes were up-regulated and 4185 unigenes were down-regulated in larvae. In total, we predicted 7504 simple sequence repeats (SSRs from 74,657 unigenes. Conclusions: The comprehensive de novo transcriptomic data of S. shirakii will offer a series of valuable molecular resources for better studying insecticide resistance, RNAi and molecular marker discovery in the transcriptome.
Defining the maize transcriptome de novo using deep RNA-Seq

Energy Technology Data Exchange (ETDEWEB)

Martin, Jeffrey; Gross, Stephen; Choi, Cindy; Zhang, Tao; Lindquist, Erika; Wei, Chia-Lin; Wang, Zhong

2011-06-01

De novo assembly of the transcriptome is crucial for functional genomics studies in bioenergy research, since many of the organisms lack high quality reference genomes. In a previous study we successfully de novo assembled simple eukaryote transcriptomes exclusively from short Illumina RNA-Seq reads [1]. However, extensive alternative splicing, present in most of the higher eukaryotes, poses a significant challenge for current short read assembly processes. Furthermore, the size of next-generation datasets, often large for plant genomes, presents an informatics challenge. To tackle these challenges we present a combined experimental and informatics strategy for de novo assembly in higher eukaryotes. Using maize as a test case, preliminary results suggest our approach can resolve transcript variants and improve gene annotations.
Defining the maize transcriptome de novo using deep RNA-Seq

Energy Technology Data Exchange (ETDEWEB)

Martin, Jeffrey; Gross, Stephen; Choi, Cindy; Zhang, Tao; Lindquist, Erika; Wei, Chia-Lin; Wang, Zhong

2011-06-02

De novo assembly of the transcriptome is crucial for functional genomics studies in bioenergy research, since many of the organisms lack high quality reference genomes. In a previous study we successfully de novo assembled simple eukaryote transcriptomes exclusively from short Illumina RNA-Seq reads [1]. However, extensive alternative splicing, present in most of the higher eukaryotes, poses a significant challenge for current short read assembly processes. Furthermore, the size of next-generation datasets, often large for plant genomes, presents an informatics challenge. To tackle these challenges we present a combined experimental and informatics strategy for de novo assembly in higher eukaryotes. Using maize as a test case, preliminary results suggest our approach can resolve transcript variants and improve gene annotations.
QCD dipole prediction for dis and diffractive structure functions

International Nuclear Information System (INIS)

Royon, CH.

1996-01-01

The F 2 , F G , R = F L /F T proton structure functions are derived in the QCD dipole picture of BFKL dynamics. We get a three parameter fit describing the 1994 H1 proton structure function F 2 data in the low x, moderate Q 2 range. Without any additional parameter, the gluon density and the longitudinal structure functions are predicted. The diffractive dissociation processes are also discussed, and a new prediction for the proton diffractive structure function is obtained. (author)
I-TASSER server for protein 3D structure prediction

Directory of Open Access Journals (Sweden)

Zhang Yang

2008-01-01

Full Text Available Abstract Background Prediction of 3-dimensional protein structures from amino acid sequences represents one of the most important problems in computational structural biology. The community-wide Critical Assessment of Structure Prediction (CASP experiments have been designed to obtain an objective assessment of the state-of-the-art of the field, where I-TASSER was ranked as the best method in the server section of the recent 7th CASP experiment. Our laboratory has since then received numerous requests about the public availability of the I-TASSER algorithm and the usage of the I-TASSER predictions. Results An on-line version of I-TASSER is developed at the KU Center for Bioinformatics which has generated protein structure predictions for thousands of modeling requests from more than 35 countries. A scoring function (C-score based on the relative clustering structural density and the consensus significance score of multiple threading templates is introduced to estimate the accuracy of the I-TASSER predictions. A large-scale benchmark test demonstrates a strong correlation between the C-score and the TM-score (a structural similarity measurement with values in [0, 1] of the first models with a correlation coefficient of 0.91. Using a C-score cutoff > -1.5 for the models of correct topology, both false positive and false negative rates are below 0.1. Combining C-score and protein length, the accuracy of the I-TASSER models can be predicted with an average error of 0.08 for TM-score and 2 Å for RMSD. Conclusion The I-TASSER server has been developed to generate automated full-length 3D protein structural predictions where the benchmarked scoring system helps users to obtain quantitative assessments of the I-TASSER models. The output of the I-TASSER server for each query includes up to five full-length models, the confidence score, the estimated TM-score and RMSD, and the standard deviation of the estimations. The I-TASSER server is freely available
Correlating Structural Order with Structural Rearrangement in Dusty Plasma Liquids: Can Structural Rearrangement be Predicted by Static Structural Information?

Science.gov (United States)

Su, Yen-Shuo; Liu, Yu-Hsuan; I, Lin

2012-11-01

Whether the static microstructural order information is strongly correlated with the subsequent structural rearrangement (SR) and their predicting power for SR are investigated experimentally in the quenched dusty plasma liquid with microheterogeneities. The poor local structural order is found to be a good alarm to identify the soft spot and predict the short term SR. For the site with good structural order, the persistent time for sustaining the structural memory until SR has a large mean value but a broad distribution. The deviation of the local structural order from that averaged over nearest neighbors serves as a good second alarm to further sort out the short time SR sites. It has the similar sorting power to that using the temporal fluctuation of the local structural order over a small time interval.
Novos paradigmas literários

Directory of Open Access Journals (Sweden)

Denise Azevedo Duarte Guimarães

2005-12-01

Full Text Available O artigo estuda a emergência de novos paradigmas literários, procurando refletir acerca das textualidades contemporâneas. Focaliza os hipertextos informatizados e a poesia multimídia, com o intuito de desvendar como estão sendo criados novos procedimentos expressivos e em que medida eles podem ser identificados com reflexões teóricas anteriores acerca do texto literário impresso. Remete a questões ligadas à leitura dos diferentes tipos de signos e aos modos como eles se integram para a constituição dessas novíssimas linguagens híbridas em novos suportes.El artículo estudia la emergencia de nuevos paradigmas literarios, procurando reflejar acerca de las textualidades contemporáneas. Enfoca los hipertextos informatizados y la poesía multimedia, intentando desvendar cómo están siendo creados nuevos procedimientos expresivos y en qué medida ellos pueden ser identificados a reflexiones teóricas anteriores acerca del texto literario impreso. Remite a cuestiones ligadas a la lectura de los diferentes tipos de signos y a los modos cómo ellos se interaccionan para la constitución de los novísimos lenguajes híbridos en nuevos supuestos.This article investigates the emergence of new literary paradigms as it tries to understand new contemporary textualities. It analyses some hypertexts and multimedia poetry trying to trace how new expressive procedures are being created. How can these new languages be identified and what are their relations to previous theories which dealt with the literary printed text? This study approaches questions linked to the reading of different types of signs and the modes they function towards the fabrication of these new hybrid languages.
De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

Directory of Open Access Journals (Sweden)

Yeonhwa Jo

2016-06-01

Full Text Available The adzuki bean (Vigna angularis, a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas. The adzuki bean is known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries. Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red were paired-end sequenced using the Illumina HiSeq 2000 system. The raw data in this study can be available in NCBI SRA database with accession numbers of SRR3406660 and SRR3406553. After de novo transcriptome assembly using Trinity, we obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. We predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively, by the TransDecoder program. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of identified proteins. Taken together, we provide transcriptomes of two adzuki bean varieties by RNA-Seq, which might be usefully applied to generate molecular markers.
A Public Trial De Novo

DEFF Research Database (Denmark)

Vedel, Jane Bjørn; Gad, Christopher

2011-01-01

This article addresses the concept of “industrial interests” and examines its role in a topical controversy about a large research grant from a private foundation, the Novo Nordisk Foundation, to the University of Copenhagen. The authors suggest that the debate took the form of a “public trial” w.......” The article ends with a discussion of some implications of the analysis, including that policy making, academic research, and public debates might benefit from more detailed accounts of interests and stakes.......This article addresses the concept of “industrial interests” and examines its role in a topical controversy about a large research grant from a private foundation, the Novo Nordisk Foundation, to the University of Copenhagen. The authors suggest that the debate took the form of a “public trial......” where the grant and close(r) intermingling between industry and public research was prosecuted and defended. First, the authors address how the grant was framed in the media. Second, they redescribe the case by introducing new “evidence” that, because of this framing, did not reach “the court...
Prediction of degradation and fracture of structural materials

International Nuclear Information System (INIS)

Tomkins, B.

1992-01-01

Prediction of materials performance in an engineering integrity context requires the underpinning of predictive modelling tuned by inputs from design, fabrication, operating experience, and laboratory testing. In this regard, in addition to fracture resistance four important areas of time dependent degradation are considered - mechanical, environmental, irradiation and thermal. The status of prediction of materials performance is discussed in relation to a number of important components such as LWR reactor pressure vessels and steam generators, and Fast Reactor high temperature structures. In each case the role of materials modelling is examined and the balance of factors which contribute to the overall prediction of component integrity/reliability noted. Structural integrity arguments must follow a clear strategy if the required level of confidence is to be established. Various strategies and their evolution are discussed. (author)

New tips for structure prediction by comparative modeling

Science.gov (United States)

Rayan, Anwar

2009-01-01

Comparative modelling is utilized to predict the 3-dimensional conformation of a given protein (target) based on its sequence alignment to experimentally determined protein structure (template). The use of such technique is already rewarding and increasingly widespread in biological research and drug development. The accuracy of the predictions as commonly accepted depends on the score of sequence identity of the target protein to the template. To assess the relationship between sequence identity and model quality, we carried out an analysis of a set of 4753 sequence and structure alignments. Throughout this research, the model accuracy was measured by root mean square deviations of Cα atoms of the target-template structures. Surprisingly, the results show that sequence identity of the target protein to the template is not a good descriptor to predict the accuracy of the 3-D structure model. However, in a large number of cases, comparative modelling with lower sequence identity of target to template proteins led to more accurate 3-D structure model. As a consequence of this study, we suggest new tips for improving the quality of omparative models, particularly for models whose target-template sequence identity is below 50%. PMID:19255646
Cloud prediction of protein structure and function with PredictProtein for Debian.

Science.gov (United States)

Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Staniewski, Cedric; Rost, Burkhard

2013-01-01

We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome.
The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads.

Science.gov (United States)

Wang, Zhiwen; Hobson, Neil; Galindo, Leonardo; Zhu, Shilin; Shi, Daihu; McDill, Joshua; Yang, Linfeng; Hawkins, Simon; Neutelings, Godfrey; Datla, Raju; Lambert, Georgina; Galbraith, David W; Grassa, Christopher J; Geraldes, Armando; Cronk, Quentin C; Cullis, Christopher; Dash, Prasanta K; Kumar, Polumetla A; Cloutier, Sylvie; Sharpe, Andrew G; Wong, Gane K-S; Wang, Jun; Deyholos, Michael K

2012-11-01

Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina genome analyzer. A de novo assembly, comprised exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N(50) =694 kb, including contigs with N(50)=20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs, and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (K(s) ) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Vfold: a web server for RNA structure and folding thermodynamics prediction.

Science.gov (United States)

Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

2014-01-01

The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".
Web Access to Digitised Content of the Exhibition Novo Mesto 1848-1918 at the Dolenjska Museum, Novo Mesto

Directory of Open Access Journals (Sweden)

Majda Pungerčar

2013-09-01

Full Text Available EXTENDED ABSTRACTFor the first time, the Dolenjska museum Novo mesto provided access to digitised museum resources when they took the decision to enrich the exhibition Novo mesto 1848-1918 by adding digital content. The following goals were identified: the digital content was created at the time of exhibition planning and design, it met the needs of different age groups of visitors, and during the exhibition the content was accessible via touch screen. As such, it also served for educational purposes (content-oriented lectures or problem solving team work. In the course of exhibition digital content was accessible on the museum website http://www.novomesto1848-1918.si. The digital content was divided into the following sections: the web photo gallery, the quiz and the game. The photo gallery was designed in the same way as the exhibition and the print catalogue and extended by the photos of contemporary Novo mesto and accompanied by the music from the orchestron machine. The following themes were outlined: the Austrian Empire, the Krka and Novo mesto, the town and its symbols, images of the town and people, administration and economy, social life and Novo mesto today followed by digitised archive materials and sources from that period such as the Commemorative book of the Uniformed Town Guard, the National Reading Room Guest Book, the Kazina guest book, the album of postcards and the Diploma of Honoured Citizen Josip Gerdešič. The Web application was also a tool for a simple and on line selection of digitised material and the creation of new digital content which proved to be much more convenient for lecturing than Power Point presentations. The quiz consisted of 40 questions relating to the exhibition theme and the catalogue. Each question offered a set of three answers only one of them being correct and illustrated by photography. The application auto selected ten questions and valued the answers immediately. The quiz could be accessed
Structural features that predict real-value fluctuations of globular proteins.

Science.gov (United States)

Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke

2012-05-01

It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. Copyright © 2012 Wiley Periodicals, Inc.
NovoTTF™-100A System (Tumor Treating Fields) transducer array layout planning for glioblastoma: a NovoTAL™ system user study.

Science.gov (United States)

Chaudhry, Aafia; Benson, Laura; Varshaver, Michael; Farber, Ori; Weinberg, Uri; Kirson, Eilon; Palti, Yoram

2015-11-11

Optune™, previously known as the NovoTTF-100A System™, generates Tumor Treating Fields (TTFields), an effective anti-mitotic therapy for glioblastoma. The system delivers intermediate frequency, alternating electric fields to the supratentorial brain. Patient therapy is personalized by configuring transducer array layout placement on the scalp to the tumor site using MRI measurements and the NovoTAL System. Transducer array layout mapping optimizes therapy by maximizing electric field intensity to the tumor site. This study evaluated physician performance in conducting transducer array layout mapping using the NovoTAL System compared with mapping performed by the Novocure in-house clinical team. Fourteen physicians (7 neuro-oncologists, 4 medical oncologists, and 3 neurosurgeons) evaluated five blinded cases of recurrent glioblastoma and performed head size and tumor location measurements using a standard Digital Imaging and Communications in Medicine reader. Concordance with Novocure measurement and intra- and inter-rater reliability were assessed using relevant correlation coefficients. The study criterion for success was a concordance correlation coefficient (CCC) >0.80. CCC for each physician versus Novocure on 20 MRI measurements was 0.96 (standard deviation, SD ± 0.03, range 0.90-1.00), indicating very high agreement between the two groups. Intra- and inter-rater reliability correlation coefficients were similarly high: 0.83 (SD ±0.15, range 0.54-1.00) and 0.80 (SD ±0.18, range 0.48-1.00), respectively. This user study demonstrated an excellent level of concordance between prescribing physicians and Novocure in-house clinical teams in performing transducer array layout planning. Intra-rater reliability was very high, indicating reproducible performance. Physicians prescribing TTFields, when trained on the NovoTAL System, can independently perform transducer array layout mapping required for the initiation and maintenance of patients on TTFields
Predicting beta-turns and their types using predicted backbone dihedral angles and secondary structures.

Science.gov (United States)

Kountouris, Petros; Hirst, Jonathan D

2010-07-31

Beta-turns are secondary structure elements usually classified as coil. Their prediction is important, because of their role in protein folding and their frequent occurrence in protein chains. We have developed a novel method that predicts beta-turns and their types using information from multiple sequence alignments, predicted secondary structures and, for the first time, predicted dihedral angles. Our method uses support vector machines, a supervised classification technique, and is trained and tested on three established datasets of 426, 547 and 823 protein chains. We achieve a Matthews correlation coefficient of up to 0.49, when predicting the location of beta-turns, the highest reported value to date. Moreover, the additional dihedral information improves the prediction of beta-turn types I, II, IV, VIII and "non-specific", achieving correlation coefficients up to 0.39, 0.33, 0.27, 0.14 and 0.38, respectively. Our results are more accurate than other methods. We have created an accurate predictor of beta-turns and their types. Our method, called DEBT, is available online at http://comp.chem.nottingham.ac.uk/debt/.
Predictive modelling-based design and experiments for synthesis and spinning of bioinspired silk fibres

Science.gov (United States)

Gronau, Greta; Jacobsen, Matthew M.; Huang, Wenwen; Rizzo, Daniel J.; Li, David; Staii, Cristian; Pugno, Nicola M.; Wong, Joyce Y.; Kaplan, David L.; Buehler, Markus J.

2016-01-01

Scalable computational modelling tools are required to guide the rational design of complex hierarchical materials with predictable functions. Here, we utilize mesoscopic modelling, integrated with genetic block copolymer synthesis and bioinspired spinning process, to demonstrate de novo materials design that incorporates chemistry, processing and material characterization. We find that intermediate hydrophobic/hydrophilic block ratios observed in natural spider silks and longer chain lengths lead to outstanding silk fibre formation. This design by nature is based on the optimal combination of protein solubility, self-assembled aggregate size and polymer network topology. The original homogeneous network structure becomes heterogeneous after spinning, enhancing the anisotropic network connectivity along the shear flow direction. Extending beyond the classical polymer theory, with insights from the percolation network model, we illustrate the direct proportionality between network conductance and fibre Young's modulus. This integrated approach provides a general path towards de novo functional network materials with enhanced mechanical properties and beyond (optical, electrical or thermal) as we have experimentally verified. PMID:26017575
Predictive modelling-based design and experiments for synthesis and spinning of bioinspired silk fibres.

Science.gov (United States)

Lin, Shangchao; Ryu, Seunghwa; Tokareva, Olena; Gronau, Greta; Jacobsen, Matthew M; Huang, Wenwen; Rizzo, Daniel J; Li, David; Staii, Cristian; Pugno, Nicola M; Wong, Joyce Y; Kaplan, David L; Buehler, Markus J

2015-05-28

Scalable computational modelling tools are required to guide the rational design of complex hierarchical materials with predictable functions. Here, we utilize mesoscopic modelling, integrated with genetic block copolymer synthesis and bioinspired spinning process, to demonstrate de novo materials design that incorporates chemistry, processing and material characterization. We find that intermediate hydrophobic/hydrophilic block ratios observed in natural spider silks and longer chain lengths lead to outstanding silk fibre formation. This design by nature is based on the optimal combination of protein solubility, self-assembled aggregate size and polymer network topology. The original homogeneous network structure becomes heterogeneous after spinning, enhancing the anisotropic network connectivity along the shear flow direction. Extending beyond the classical polymer theory, with insights from the percolation network model, we illustrate the direct proportionality between network conductance and fibre Young's modulus. This integrated approach provides a general path towards de novo functional network materials with enhanced mechanical properties and beyond (optical, electrical or thermal) as we have experimentally verified.
A Terra em Transe: o cosmopolitismo às avessas do cinema novo

Directory of Open Access Journals (Sweden)

Angela Prysthon

2008-11-01

Full Text Available Usando como referencial teórico os estudos culturais, este artigo analisa o cinema novo brasileiro como parte de uma estratégia terceiro mundista de conceber a cultura. A partir da emergência do conceito de terceiro mundo e das lutas de descolonização nos anos 1950 e 1960, a ideologia cosmopolita foi sendo vista pelos intelectuais de esquerda como a versão cultural da aliança com as forças hegemônicas da Europa e dos Estados Unidos. O projeto do cinema novo chama a atenção por suas afinidades ideológicas com o terceiro mundismo, mas, paradoxalmente, trazendo à tona uma política cosmopolita da periferia. Palavras-chave cinema novo, identidade, cultura brasileira, terceiro mundismo, estudos culturais. Abstract Using the cultural studies theoretical framework, this paper analyzes the cinema novo movement in Brazil as a part of the Third World conception of culture. Following the creation of the term "Third World" and the international politics of colonial independence of the 1950s and 1960s, a cosmopolitan attitude was seen by the intellectuals of the left as a cultural version of the alliance with the hegemonic forces of Europe and North America. Even though the cinema novo project can be associated with the ideology of an united Third World ,it brings about, paradoxically, a very cosmopolitan politics of the periphery. Key words cinema novo, identity, Brazilian culture, third world, cultural studies.
Eculizumab for drug-induced de novo posttransplantation thrombotic microangiopathy: A case report.

Science.gov (United States)

Safa, Kassem; Logan, Merranda S; Batal, Ibrahim; Gabardi, Steven; Rennke, Helmut G; Abdi, Reza

2015-02-01

De novo thrombotic microangiopathy (TMA) following renal transplantation is a severe complication associated with high rates of allograft failure. Several immunosuppressive agents are associated with TMA. Conventional approaches to managing this entity, such as withdrawal of the offending agent and/or plasmapheresis, often offer limited help, with high rates of treatment failure and graft loss. We herein report a case of drug induced de novo TMA successfully treated using the C5a inhibitor eculizumab in a renal transplant patient. This report highlights a potentially important role for eculizumab in settings where drug-induced de novo TMA is refractory to conventional therapies.
De novo autoimmune hepatitis after liver transplantation.

Science.gov (United States)

Lohse, Ansgar W; Weiler-Norman, Christina; Burdelski, Martin

2007-10-01

The Kings College group was the first to describe a clinical syndrome similar to autoimmune hepatitis in children and young adults transplanted for non-immune mediated liver diseases. They coined the term "de novo autoimmune hepatitis". Several other liver transplant centres confirmed this observation. Even though the condition is uncommon, patients with de novo AIH are now seen in most of the major transplant centres. The disease is usually characterized by features of acute hepatitis in otherwise stable transplant recipients. The most characteristic laboratory hallmark is a marked hypergammaglobulinaemia. Autoantibodies are common, mostly ANA. We described also a case of LKM1-positivity in a patients transplanted for Wilson's disease, however this patients did not develop clinical or histological features of AIH. Development of SLA/LP-autoantibodies is also not described. Therefore, serologically de novo AIH appears to correspond to type 1 AIH. Like classical AIH patients respond promptly to treatment with increased doses of prednisolone and azathioprine, while the calcineurin inhibitors cyclosporine or tacrolimus areof very limited value - which is not surprising, as almost all patients develop de novo AIH while receiving these drugs. Despite the good response to treatment, most patients remain a clinical challenge as complete stable remissions are uncommon and flares, relapses and chronic disease activity can often occur. Pathogenetically this syndrome is intriguing. It is not clear, if the immune response is directed against allo-antigens, neo-antigens in the liver, or self-antigens, possibly shared by donor and host cells. It is very likely that the inflammatory milieu due to alloreactive cells in the transplanted organ contribute to the disease process. Either leading to aberrant antigen presentation, or providing co-stimulatory signals leading to the breaking of self-tolerance. The development of this disease in the presence of treatment with calcineurin
A randomized, double-blind, cross-over, phase IV trial of oros-methylphenidate (CONCERTA(®)) and generic novo-methylphenidate ER-C (NOVO-generic).

Science.gov (United States)

Fallu, Angelo; Dabouz, Farida; Furtado, Melissa; Anand, Leena; Katzman, Martin A

2016-08-01

Attention-deficit/hyperactivity disorder (ADHD) is a common neurobehavioral disorder with onset during childhood. Multiple aspects of a child's development are hindered, in both home and school settings, with negative impacts on social, emotional, and cognitive functioning. If left untreated, ADHD is commonly associated with poor academic achievement and low occupational status, as well as increased risk of substance abuse and delinquency. The objective of this study was to evaluate adult ADHD subject reported outcomes when switched from a stable dose of CONCERTA(®) to the same dose of generic Novo-methylphenidate ER-C(®). Randomized, double-blind, cross-over, phase IV trial consisted of two phases in which participants with a primary diagnosis of ADHD were randomized in a 1:1 ratio to 3 weeks of treatment with CONCERTA or generic Novo-Methylphenidate ER-C. Following 3 weeks of treatment, participants were crossed-over to receive the other treatment for an additional 3 weeks. Primary efficacy was assessed through the use of the Treatment Satisfaction Questionnaire for Medication, Version II (TSQM-II). Participants with ADHD treated with CONCERTA were more satisfied in terms of efficacy and side effects compared to those receiving an equivalent dose of generic Novo-Methylphenidate ER-C. All participants chose to continue with CONCERTA treatment at the conclusion of the study. Although CONCERTA and generic Novo-Methylphenidate ER-C have been deemed bioequivalent, however the present findings demonstrate clinically and statistically significant differences between generic and branded CONCERTA. Further investigation of these differences is warranted.
Arginine de novo and nitric oxide production in disease states

OpenAIRE

Luiking, Yvette C.; Ten Have, Gabriella A. M.; Wolfe, Robert R.; Deutz, Nicolaas E. P.

2012-01-01

Arginine is derived from dietary protein intake, body protein breakdown, or endogenous de novo arginine production. The latter may be linked to the availability of citrulline, which is the immediate precursor of arginine and limiting factor for de novo arginine production. Arginine metabolism is highly compartmentalized due to the expression of the enzymes involved in arginine metabolism in various organs. A small fraction of arginine enters the NO synthase (NOS) pathway. Tetrahydrobiopterin ...
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.

Science.gov (United States)

Zheng, Ce; Kurgan, Lukasz

2008-10-10

beta-turn is a secondary protein structure type that plays significant role in protein folding, stability, and molecular recognition. To date, several methods for prediction of beta-turns from protein sequences were developed, but they are characterized by relatively poor prediction quality. The novelty of the proposed sequence-based beta-turn predictor stems from the usage of a window based information extracted from four predicted three-state secondary structures, which together with a selected set of position specific scoring matrix (PSSM) values serve as an input to the support vector machine (SVM) predictor. We show that (1) all four predicted secondary structures are useful; (2) the most useful information extracted from the predicted secondary structure includes the structure of the predicted residue, secondary structure content in a window around the predicted residue, and features that indicate whether the predicted residue is inside a secondary structure segment; (3) the PSSM values of Asn, Asp, Gly, Ile, Leu, Met, Pro, and Val were among the top ranked features, which corroborates with recent studies. The Asn, Asp, Gly, and Pro indicate potential beta-turns, while the remaining four amino acids are useful to predict non-beta-turns. Empirical evaluation using three nonredundant datasets shows favorable Q total, Q predicted and MCC values when compared with over a dozen of modern competing methods. Our method is the first to break the 80% Q total barrier and achieves Q total = 80.9%, MCC = 0.47, and Q predicted higher by over 6% when compared with the second best method. We use feature selection to reduce the dimensionality of the feature vector used as the input for the proposed prediction method. The applied feature set is smaller by 86, 62 and 37% when compared with the second and two third-best (with respect to MCC) competing methods, respectively. Experiments show that the proposed method constitutes an improvement over the competing prediction
Structure Prediction and Analysis of Neuraminidase Sequence Variants

Science.gov (United States)

Thayer, Kelly M.

2016-01-01

Analyzing protein structure has become an integral aspect of understanding systems of biochemical import. The laboratory experiment endeavors to introduce protein folding to ascertain structures of proteins for which the structure is unavailable, as well as to critically evaluate the quality of the prediction obtained. The model system used is the…
De novo mutation in the dopamine transporter gene associates dopamine dysfunction with autism spectrum disorder.

Science.gov (United States)

Hamilton, P J; Campbell, N G; Sharma, S; Erreger, K; Herborg Hansen, F; Saunders, C; Belovich, A N; Sahai, M A; Cook, E H; Gether, U; McHaourab, H S; Matthies, H J G; Sutcliffe, J S; Galli, A

2013-12-01

De novo genetic variation is an important class of risk factors for autism spectrum disorder (ASD). Recently, whole-exome sequencing of ASD families has identified a novel de novo missense mutation in the human dopamine (DA) transporter (hDAT) gene, which results in a Thr to Met substitution at site 356 (hDAT T356M). The dopamine transporter (DAT) is a presynaptic membrane protein that regulates dopaminergic tone in the central nervous system by mediating the high-affinity reuptake of synaptically released DA, making it a crucial regulator of DA homeostasis. Here, we report the first functional, structural and behavioral characterization of an ASD-associated de novo mutation in the hDAT. We demonstrate that the hDAT T356M displays anomalous function, characterized as a persistent reverse transport of DA (substrate efflux). Importantly, in the bacterial homolog leucine transporter, substitution of A289 (the homologous site to T356) with a Met promotes an outward-facing conformation upon substrate binding. In the substrate-bound state, an outward-facing transporter conformation is required for substrate efflux. In Drosophila melanogaster, the expression of hDAT T356M in DA neurons-lacking Drosophila DAT leads to hyperlocomotion, a trait associated with DA dysfunction and ASD. Taken together, our findings demonstrate that alterations in DA homeostasis, mediated by aberrant DAT function, may confer risk for ASD and related neuropsychiatric conditions.
PCI-SS: MISO dynamic nonlinear protein secondary structure prediction

Directory of Open Access Journals (Sweden)

Aboul-Magd Mohammed O

2009-07-01

Full Text Available Abstract Background Since the function of a protein is largely dictated by its three dimensional configuration, determining a protein's structure is of fundamental importance to biology. Here we report on a novel approach to determining the one dimensional secondary structure of proteins (distinguishing α-helices, β-strands, and non-regular structures from primary sequence data which makes use of Parallel Cascade Identification (PCI, a powerful technique from the field of nonlinear system identification. Results Using PSI-BLAST divergent evolutionary profiles as input data, dynamic nonlinear systems are built through a black-box approach to model the process of protein folding. Genetic algorithms (GAs are applied in order to optimize the architectural parameters of the PCI models. The three-state prediction problem is broken down into a combination of three binary sub-problems and protein structure classifiers are built using 2 layers of PCI classifiers. Careful construction of the optimization, training, and test datasets ensures that no homology exists between any training and testing data. A detailed comparison between PCI and 9 contemporary methods is provided over a set of 125 new protein chains guaranteed to be dissimilar to all training data. Unlike other secondary structure prediction methods, here a web service is developed to provide both human- and machine-readable interfaces to PCI-based protein secondary structure prediction. This server, called PCI-SS, is available at http://bioinf.sce.carleton.ca/PCISS. In addition to a dynamic PHP-generated web interface for humans, a Simple Object Access Protocol (SOAP interface is added to permit invocation of the PCI-SS service remotely. This machine-readable interface facilitates incorporation of PCI-SS into multi-faceted systems biology analysis pipelines requiring protein secondary structure information, and greatly simplifies high-throughput analyses. XML is used to represent the input
De novo assembly and phasing of a Korean human genome.

Science.gov (United States)

Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon

2016-10-13

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of

Prediction of RNA secondary structure using generalized centroid estimators.

Science.gov (United States)

Hamada, Michiaki; Kiryu, Hisanori; Sato, Kengo; Mituyama, Toutai; Asai, Kiyoshi

2009-02-15

Recent studies have shown that the methods for predicting secondary structures of RNAs on the basis of posterior decoding of the base-pairing probabilities has an advantage with respect to prediction accuracy over the conventionally utilized minimum free energy methods. However, there is room for improvement in the objective functions presented in previous studies, which are maximized in the posterior decoding with respect to the accuracy measures for secondary structures. We propose novel estimators which improve the accuracy of secondary structure prediction of RNAs. The proposed estimators maximize an objective function which is the weighted sum of the expected number of the true positives and that of the true negatives of the base pairs. The proposed estimators are also improved versions of the ones used in previous works, namely CONTRAfold for secondary structure prediction from a single RNA sequence and McCaskill-MEA for common secondary structure prediction from multiple alignments of RNA sequences. We clarify the relations between the proposed estimators and the estimators presented in previous works, and theoretically show that the previous estimators include additional unnecessary terms in the evaluation measures with respect to the accuracy. Furthermore, computational experiments confirm the theoretical analysis by indicating improvement in the empirical accuracy. The proposed estimators represent extensions of the centroid estimators proposed in Ding et al. and Carvalho and Lawrence, and are applicable to a wide variety of problems in bioinformatics. Supporting information and the CentroidFold software are available online at: http://www.ncrna.org/software/centroidfold/.
De novo FBXO11 mutations are associated with intellectual disability and behavioural anomalies.

Science.gov (United States)

Fritzen, Daniel; Kuechler, Alma; Grimmel, Mona; Becker, Jessica; Peters, Sophia; Sturm, Marc; Hundertmark, Hela; Schmidt, Axel; Kreiß, Martina; Strom, Tim M; Wieczorek, Dagmar; Haack, Tobias B; Beck-Wödl, Stefanie; Cremer, Kirsten; Engels, Hartmut

2018-05-01

Intellectual disability (ID) has an estimated prevalence of 1.5-2%. In most affected individuals, its genetic basis remains unclear. Whole exome sequencing (WES) studies have identified a multitude of novel causative gene defects and have shown that a large proportion of sporadic ID cases results from de novo mutations. Here, we present two unrelated individuals with similar clinical features and deleterious de novo variants in FBXO11 detected by WES. Individual 1, a 14-year-old boy, has mild ID as well as mild microcephaly, corrected cleft lip and alveolus, hyperkinetic disorder, mild brain atrophy and minor facial dysmorphism. WES detected a heterozygous de novo 1 bp insertion in the splice donor site of exon 3. Individual 2, a 3-year-old boy, showed ID and pre- and postnatal growth retardation, postnatal mild microcephaly, hyperkinetic and restless behaviour, as well as mild dysmorphism. WES detected a heterozygous de novo frameshift mutation. While ten individuals with ID and de novo variants in FBXO11 have been reported as part of larger studies, only one of the reports has some additional clinical data. Interestingly, the latter individual carries the identical mutation as our individual 2 and also displays ID, intrauterine growth retardation, microcephaly, behavioural anomalies, and dysmorphisms. Thus, we confirm deleterious de novo mutations in FBXO11 as a cause of ID and start the delineation of the associated clinical picture which may also comprise postnatal microcephaly or borderline small head size and behavioural anomalies.
Update on protein structure prediction: results of the 1995 IRBM workshop

DEFF Research Database (Denmark)

Hubbard, Tim; Tramontano, Anna; Hansen, Jan

1996-01-01

Computational tools for protein structure prediction are of great interest to molecular, structural and theoretical biologists due to a rapidly increasing number of protein sequences with no known structure. In October 1995, a workshop was held at IRBM to predict as much as possible about a numbe...
Update on protein structure prediction: results of the 1995 IRBM workshop

DEFF Research Database (Denmark)

Hubbard, Tim; Tramontano, Anna; Hansen, Jan

1996-01-01

Computational tools for protein structure prediction are of great interest to molecular, structural and theoretical biologists due to a rapidly increasing number of protein sequences with no known structure. In October 1995, a workshop was held at IRBM to predict as much as possible about a number...
Knowledge base and neural network approach for protein secondary structure prediction.

Science.gov (United States)

Patel, Maulika S; Mazumdar, Himanshu S

2014-11-21

Protein structure prediction is of great relevance given the abundant genomic and proteomic data generated by the genome sequencing projects. Protein secondary structure prediction is addressed as a sub task in determining the protein tertiary structure and function. In this paper, a novel algorithm, KB-PROSSP-NN, which is a combination of knowledge base and modeling of the exceptions in the knowledge base using neural networks for protein secondary structure prediction (PSSP), is proposed. The knowledge base is derived from a proteomic sequence-structure database and consists of the statistics of association between the 5-residue words and corresponding secondary structure. The predicted results obtained using knowledge base are refined with a Backpropogation neural network algorithm. Neural net models the exceptions of the knowledge base. The Q3 accuracy of 90% and 82% is achieved on the RS126 and CB396 test sets respectively which suggest improvement over existing state of art methods. Copyright © 2014 Elsevier Ltd. All rights reserved.
Predicting nucleic acid binding interfaces from structural models of proteins.

Science.gov (United States)

Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

2012-02-01

The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments

Directory of Open Access Journals (Sweden)

Kurgan Lukasz

2008-10-01

Full Text Available Abstract Background β-turn is a secondary protein structure type that plays significant role in protein folding, stability, and molecular recognition. To date, several methods for prediction of β-turns from protein sequences were developed, but they are characterized by relatively poor prediction quality. The novelty of the proposed sequence-based β-turn predictor stems from the usage of a window based information extracted from four predicted three-state secondary structures, which together with a selected set of position specific scoring matrix (PSSM values serve as an input to the support vector machine (SVM predictor. Results We show that (1 all four predicted secondary structures are useful; (2 the most useful information extracted from the predicted secondary structure includes the structure of the predicted residue, secondary structure content in a window around the predicted residue, and features that indicate whether the predicted residue is inside a secondary structure segment; (3 the PSSM values of Asn, Asp, Gly, Ile, Leu, Met, Pro, and Val were among the top ranked features, which corroborates with recent studies. The Asn, Asp, Gly, and Pro indicate potential β-turns, while the remaining four amino acids are useful to predict non-β-turns. Empirical evaluation using three nonredundant datasets shows favorable Qtotal, Qpredicted and MCC values when compared with over a dozen of modern competing methods. Our method is the first to break the 80% Qtotal barrier and achieves Qtotal = 80.9%, MCC = 0.47, and Qpredicted higher by over 6% when compared with the second best method. We use feature selection to reduce the dimensionality of the feature vector used as the input for the proposed prediction method. The applied feature set is smaller by 86, 62 and 37% when compared with the second and two third-best (with respect to MCC competing methods, respectively. Conclusion Experiments show that the proposed method constitutes an
Airline Maintenance Manpower Optimization from the De Novo Perspective

Science.gov (United States)

Liou, James J. H.; Tzeng, Gwo-Hshiung

Human resource management (HRM) is an important issue for today’s competitive airline marketing. In this paper, we discuss a multi-objective model designed from the De Novo perspective to help airlines optimize their maintenance manpower portfolio. The effectiveness of the model and solution algorithm is demonstrated in an empirical study of the optimization of the human resources needed for airline line maintenance. Both De Novo and traditional multiple objective programming (MOP) methods are analyzed. A comparison of the results with those of traditional MOP indicates that the proposed model and solution algorithm does provide better performance and an improved human resource portfolio.
QCD predictions for weak neutral current structure functions

International Nuclear Information System (INIS)

Wu Jimin

1987-01-01

Employing the analytic expression (to the next leading order) for non-singlet component of structure function which the author got from QCD theory and putting recent experiment result of neutral current structure function at Q 2 = 11 (GeV/C) 2 as input, the QCD prediction for neutral current structure function of their scaling violation behaviours was given
Predicting protein structures with a multiplayer online game.

Science.gov (United States)

Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran; Players, Foldit

2010-08-05

People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.
Structure and Sequence Search on Aptamer-Protein Docking

Science.gov (United States)

Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

2015-03-01

Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
(PS)2: protein structure prediction server version 3.0.

Science.gov (United States)

Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh

2015-07-01

Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Compound Structure-Independent Activity Prediction in High-Dimensional Target Space.

Science.gov (United States)

Balfer, Jenny; Hu, Ye; Bajorath, Jürgen

2014-08-01

Profiling of compound libraries against arrays of targets has become an important approach in pharmaceutical research. The prediction of multi-target compound activities also represents an attractive task for machine learning with potential for drug discovery applications. Herein, we have explored activity prediction in high-dimensional target space. Different types of models were derived to predict multi-target activities. The models included naïve Bayesian (NB) and support vector machine (SVM) classifiers based upon compound structure information and NB models derived on the basis of activity profiles, without considering compound structure. Because the latter approach can be applied to incomplete training data and principally depends on the feature independence assumption, SVM modeling was not applicable in this case. Furthermore, iterative hybrid NB models making use of both activity profiles and compound structure information were built. In high-dimensional target space, NB models utilizing activity profile data were found to yield more accurate activity predictions than structure-based NB and SVM models or hybrid models. An in-depth analysis of activity profile-based models revealed the presence of correlation effects across different targets and rationalized prediction accuracy. Taken together, the results indicate that activity profile information can be effectively used to predict the activity of test compounds against novel targets. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
RNA folding: structure prediction, folding kinetics and ion electrostatics.

Science.gov (United States)

Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

2015-01-01

Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.
Constraint Logic Programming approach to protein structure prediction

Directory of Open Access Journals (Sweden)

Fogolari Federico

2004-11-01

Full Text Available Abstract Background The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Results Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. Conclusions The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.
Constraint Logic Programming approach to protein structure prediction.

Science.gov (United States)

Dal Palù, Alessandro; Dovier, Agostino; Fogolari, Federico

2004-11-30

The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known) secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.
Mimicking the action of folding chaperones by Hamiltonian replica-exchange molecular dynamics simulations : Application in the refinement of de novo models

NARCIS (Netherlands)

Fan, Hao; Periole, Xavier; Mark, Alan E.

The efficiency of using a variant of Hamiltonian replica-exchange molecular dynamics (Chaperone H-replica-exchange molecular dynamics [CH-REMD]) for the refinement of protein structural models generated de novo is investigated. In CH-REMD, the interaction between the protein and its environment,
A multi-method approach toward de novo glycan characterization: a Man-5 case study.

Science.gov (United States)

Prien, Justin M; Prater, Bradley D; Cockrill, Steven L

2010-05-01

Regulatory agencies' expectations for biotherapeutic approval are becoming more stringent with regard to product characterization, where minor species as low as 0.1% of a given profile are typically identified. The mission of this manuscript is to demonstrate a multi-method approach toward de novo glycan characterization and quantitation, including minor species at or approaching the 0.1% benchmark. Recently, unexpected isomers of the Man(5)GlcNAc(2) (M(5)) were reported (Prien JM, Ashline DJ, Lapadula AJ, Zhang H, Reinhold VN. 2009. The high mannose glycans from bovine ribonuclease B isomer characterization by ion trap mass spectrometry (MS). J Am Soc Mass Spectrom. 20:539-556). In the current study, quantitative analysis of these isomers found in commercial M(5) standard demonstrated that they are in low abundance (2-aminobenzoic acid to detect and chromatographically resolve multiple M(5) isomers in bovine ribonuclease B. With this multi-method approach, we have the capabilities to comprehensively characterize a biotherapeutic's glycan array in a de novo manner, including structural isomers at >/=0.1% of the total chromatographic peak area.
Modular Engineering Concept at Novo Nordisk Engineering

DEFF Research Database (Denmark)

Moelgaard, Gert; Miller, Thomas Dedenroth

1997-01-01

This report describes the concept of a new engineering method at Novo Nordisk Engineering: Modular Engineering (ME). Three tools are designed to support project phases with different levels of detailing and abstraction. ME supports a standard, cross-functional breakdown of projects that facilitates...
Free energy minimization to predict RNA secondary structures and computational RNA design.

Science.gov (United States)

Churkin, Alexander; Weinbrand, Lina; Barash, Danny

2015-01-01

Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

Science.gov (United States)

Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

2016-01-11

Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.
De novo synthesis of adenine nucleotides in different skeletal muscle fiber types

International Nuclear Information System (INIS)

Tullson, P.C.; John-Alder, H.B.; Hood, D.A.; Terjung, R.L.

1988-01-01

Management of adenine nucleotide catabolism differs among skeletal muscle fiber types. This study evaluated whether there are corresponding differences in the rates of de novo synthesis of adenine nucleotide among fiber type sections of skeletal muscle using an isolated perfused rat hindquarter preparation. Label incorporation into adenine nucleotides from the [1-14C]glycine precursor was determined and used to calculate synthesis rates based on the intracellular glycine specific radioactivity. Results show that intracellular glycine is closely related to the direct precursor pool. Rates of de novo synthesis were highest in fast-twitch red muscle (57.0 +/- 4.0, 58.2 +/- 4.4 nmol.h-1.g-1; deep red gastrocnemius and vastus lateralis), relatively high in slow-twitch red muscle (47.0 +/- 3.1; soleus), and low in fast-twitch white muscle (26.1 +/- 2.0 and 21.6 +/- 2.3; superficial white gastrocnemius and vastus lateralis). Rates for four mixed muscles were intermediate, ranging between 32.3 and 37.3. Specific de novo synthesis rates exhibited a strong correlation (r = 0.986) with muscle section citrate synthase activity. Turnover rates (de novo synthesis rate/adenine nucleotide pool size) were highest in high oxidative muscle (0.82-1.06%/h), lowest in low oxidative muscle (0.30-0.35%/h), and intermediate in mixed muscle (0.44-0.55%/h). Our results demonstrate that differences in adenine nucleotide management among fiber types extends to the process of de novo adenine nucleotide synthesis
De novo transcriptome assembly of Sorghum bicolor variety Taejin

Directory of Open Access Journals (Sweden)

Yeonhwa Jo

2016-06-01

Full Text Available Sorghum (Sorghum bicolor, also known as great millet, is one of the most popular cultivated grass species in the world. Sorghum is frequently consumed as food for humans and animals as well as used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748 GB of raw data. The raw data in this study can be available in NCBI SRA database with accession number of SRX1715644. Using the Trinity program, we identified 222,161 transcripts from sorghum variety Taejin. We further predicted coding regions within the assembled transcripts by the TransDecoder program, resulting in a total of 148,531 proteins. We carried out BLASTP against the Swiss-Prot protein sequence database to annotate the functions of the identified proteins. To our knowledge, this is the first transcriptome data for a sorghum variety derived from Korea, and it can be usefully applied to the generation of genetic markers.
De novo assembly of a haplotype-resolved human genome.

Science.gov (United States)

Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

2015-06-01

The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
Wegener's granulomatosis occurring de novo during pregnancy.

Science.gov (United States)

Alfhaily, F; Watts, R; Leather, A

2009-01-01

Wegener's granulomatosis (WG) is rarely diagnosed during the reproductive years and uncommonly manifests for the first time during pregnancy. We report a case of de novo WG presenting at 30 weeks gestation with classical symptoms of WG (ENT, pulmonary). The diagnosis was confirmed by radiological, laboratory, and histological investigations. With a multidisciplinary approach, she had a successful vaginal delivery of a healthy baby. She was treated successfully by a combination of steroids, azathioprine and intravenous immunoglobulin in the active phase of disease for induction of remission and by azathioprine and steroids for maintenance of remission. The significant improvement in her symptoms allowed us to continue her pregnancy to 37 weeks when delivery was electively induced. Transplacental transmission of PR3-ANCA occurred but the neonate remained well. This case of de novo WG during pregnancy highlights the seriousness of this disease and the challenge in management of such patients.
Evolutionary Structure Prediction of Stoichiometric Compounds

Science.gov (United States)

Zhu, Qiang; Oganov, Artem

2014-03-01

In general, for a given ionic compound AmBn\\ at ambient pressure condition, its stoichiometry reflects the valence state ratio between per chemical specie (i.e., the charges for each anion and cation). However, compounds under high pressure exhibit significantly behavior, compared to those analogs at ambient condition. Here we developed a method to solve the crystal structure prediction problem based on the evolutionary algorithms, which can predict both the stable compounds and their crystal structures at arbitrary P,T-conditions, given just the set of chemical elements. By applying this method to a wide range of binary ionic systems (Na-Cl, Mg-O, Xe-O, Cs-F, etc), we discovered a lot of compounds with brand new stoichimetries which can become thermodynamically stable. Further electronic structure analysis on these novel compounds indicates that several factors can contribute to this extraordinary phenomenon: (1) polyatomic anions; (2) free electron localization; (3) emergence of new valence states; (4) metallization. In particular, part of the results have been confirmed by experiment, which warrants that this approach can play a crucial role in new materials design under extreme pressure conditions. This work is funded by DARPA (Grants No. W31P4Q1210008 and W31P4Q1310005), NSF (EAR-1114313 and DMR-1231586).
Predicted crystal structures of molybdenum under high pressure

Energy Technology Data Exchange (ETDEWEB)

Wang, Bing; Zhang, Guang Biao [Institute for Computational Materials Science, School of Physics and Electronics, Henan University, Kaifeng 475004 (China); Wang, Yuan Xu, E-mail: wangyx@henu.edu.cn [Institute for Computational Materials Science, School of Physics and Electronics, Henan University, Kaifeng 475004 (China); Guizhou Provincial Key Laboratory of Computational Nano-Material Science, Institute of Applied Physics, Guizhou Normal College, Guiyang 550018 (China)

2013-04-15

Highlights: ► A double-hexagonal close-packed (dhcp) structure of molybdenum is predicted. ► Calculated acoustic velocity confirms the bcc–dhcp phase transition at 660 GPa. ► The valence electrons of dhcp Mo are mostly localized in the interstitial sites. -- Abstract: The high-pressure structures of molybdenum (Mo) at zero temperature have been extensively explored through the newly developed particle swarm optimization (PSO) algorithm on crystal structural prediction. All the experimental and earlier theoretical structures were successfully reproduced in certain pressure ranges, validating our methodology in application to Mo. A double-hexagonal close-packed (dhcp) structure found by Mikhaylushkin et al. (2008) [12] is confirmed by the present PSO calculations. The lattice parameters and physical properties of the dhcp phase were investigated based on first principles calculations. The phase transition occurs only from bcc phase to dhcp phase at 660 GPa and at zero temperature. The calculated acoustic velocities also indicate a transition from the bcc to dhcp phases for Mo. More intriguingly, the calculated density of states (DOS) shows that the dhcp structure remains metallic. The calculated electron density difference (EDD) reveals that its valence electrons are localized in the interstitial regions.
Detection of de novo single nucleotide variants in offspring of atomic-bomb survivors close to the hypocenter by whole-genome sequencing.

Science.gov (United States)

Horai, Makiko; Mishima, Hiroyuki; Hayashida, Chisa; Kinoshita, Akira; Nakane, Yoshibumi; Matsuo, Tatsuki; Tsuruda, Kazuto; Yanagihara, Katsunori; Sato, Shinya; Imanishi, Daisuke; Imaizumi, Yoshitaka; Hata, Tomoko; Miyazaki, Yasushi; Yoshiura, Koh-Ichiro

2018-03-01

Ionizing radiation released by the atomic bombs at Hiroshima and Nagasaki, Japan, in 1945 caused many long-term illnesses, including increased risks of malignancies such as leukemia and solid tumours. Radiation has demonstrated genetic effects in animal models, leading to concerns over the potential hereditary effects of atomic bomb-related radiation. However, no direct analyses of whole DNA have yet been reported. We therefore investigated de novo variants in offspring of atomic-bomb survivors by whole-genome sequencing (WGS). We collected peripheral blood from three trios, each comprising a father (atomic-bomb survivor with acute radiation symptoms), a non-exposed mother, and their child, none of whom had any past history of haematological disorders. One trio of non-exposed individuals was included as a control. DNA was extracted and the numbers of de novo single nucleotide variants in the children were counted by WGS with sequencing confirmation. Gross structural variants were also analysed. Written informed consent was obtained from all participants prior to the study. There were 62, 81, and 42 de novo single nucleotide variants in the children of atomic-bomb survivors, compared with 48 in the control trio. There were no gross structural variants in any trio. These findings are in accord with previously published results that also showed no significant genetic effects of atomic-bomb radiation on second-generation survivors.
Prediction of backbone dihedral angles and protein secondary structure using support vector machines

Directory of Open Access Journals (Sweden)

Hirst Jonathan D

2009-12-01

Full Text Available Abstract Background The prediction of the secondary structure of a protein is a critical step in the prediction of its tertiary structure and, potentially, its function. Moreover, the backbone dihedral angles, highly correlated with secondary structures, provide crucial information about the local three-dimensional structure. Results We predict independently both the secondary structure and the backbone dihedral angles and combine the results in a loop to enhance each prediction reciprocally. Support vector machines, a state-of-the-art supervised classification technique, achieve secondary structure predictive accuracy of 80% on a non-redundant set of 513 proteins, significantly higher than other methods on the same dataset. The dihedral angle space is divided into a number of regions using two unsupervised clustering techniques in order to predict the region in which a new residue belongs. The performance of our method is comparable to, and in some cases more accurate than, other multi-class dihedral prediction methods. Conclusions We have created an accurate predictor of backbone dihedral angles and secondary structure. Our method, called DISSPred, is available online at http://comp.chem.nottingham.ac.uk/disspred/.
Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing

DEFF Research Database (Denmark)

Wu, Jia Qian; Shteynberg, David; Arumugam, Manimozhiyan

2004-01-01

an alternative approach: reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on dual-genome de novo predictions from TWINSCAN. We tested 444 TWINSCAN-predicted rat genes that showed significant homology to known human genes implicated in disease but that were partially...... in the single-intron experiment. Spliced sequences were amplified in 46 cases (34%). We conclude that this procedure for elucidating gene structures with native cDNA sequences is cost-effective and will become even more so as it is further optimized.......The publication of a draft sequence of a third mammalian genome--that of the rat--suggests a need to rethink genome annotation. New mammalian sequences will not receive the kind of labor-intensive annotation efforts that are currently being devoted to human. In this paper, we demonstrate...
Contingency Table Browser - prediction of early stage protein structure.

Science.gov (United States)

Kalinowska, Barbara; Krzykalski, Artur; Roterman, Irena

2015-01-01

The Early Stage (ES) intermediate represents the starting structure in protein folding simulations based on the Fuzzy Oil Drop (FOD) model. The accuracy of FOD predictions is greatly dependent on the accuracy of the chosen intermediate. A suitable intermediate can be constructed using the sequence-structure relationship information contained in the so-called contingency table - this table expresses the likelihood of encountering various structural motifs for each tetrapeptide fragment in the amino acid sequence. The limited accuracy with which such structures could previously be predicted provided the motivation for a more indepth study of the contingency table itself. The Contingency Table Browser is a tool which can visualize, search and analyze the table. Our work presents possible applications of Contingency Table Browser, among them - analysis of specific protein sequences from the point of view of their structural ambiguity.
G-LoSA for Prediction of Protein-Ligand Binding Sites and Structures.

Science.gov (United States)

Lee, Hui Sun; Im, Wonpil

2017-01-01

Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.
Persistent hyperthyroidism and de novo Graves' ophthalmopathy after total thyroidectomy.

Science.gov (United States)

Tay, Wei Lin; Loh, Wann Jia; Lee, Lianne Ai Ling; Chng, Chiaw Ling

2017-01-01

We report a patient with Graves' disease who remained persistently hyperthyroid after a total thyroidectomy and also developed de novo Graves' ophthalmopathy 5 months after surgery. She was subsequently found to have a mature cystic teratoma containing struma ovarii after undergoing a total hysterectomy and salpingo-oophorectomy for an incidental ovarian lesion. It is important to investigate for other causes of primary hyperthyroidism when thyrotoxicosis persists after total thyroidectomy.TSH receptor antibody may persist after total thyroidectomy and may potentially contribute to the development of de novo Graves' ophthalmopathy.
Selecting Superior De Novo Transcriptome Assemblies: Lessons Learned by Leveraging the Best Plant Genome.

Directory of Open Access Journals (Sweden)

Loren A Honaas

Full Text Available Whereas de novo assemblies of RNA-Seq data are being published for a growing number of species across the tree of life, there are currently no broadly accepted methods for evaluating such assemblies. Here we present a detailed comparison of 99 transcriptome assemblies, generated with 6 de novo assemblers including CLC, Trinity, SOAP, Oases, ABySS and NextGENe. Controlled analyses of de novo assemblies for Arabidopsis thaliana and Oryza sativa transcriptomes provide new insights into the strengths and limitations of transcriptome assembly strategies. We find that the leading assemblers generate reassuringly accurate assemblies for the majority of transcripts. At the same time, we find a propensity for assemblers to fail to fully assemble highly expressed genes. Surprisingly, the instance of true chimeric assemblies is very low for all assemblers. Normalized libraries are reduced in highly abundant transcripts, but they also lack 1000s of low abundance transcripts. We conclude that the quality of de novo transcriptome assemblies is best assessed through consideration of a combination of metrics: 1 proportion of reads mapping to an assembly 2 recovery of conserved, widely expressed genes, 3 N50 length statistics, and 4 the total number of unigenes. We provide benchmark Illumina transcriptome data and introduce SCERNA, a broadly applicable modular protocol for de novo assembly improvement. Finally, our de novo assembly of the Arabidopsis leaf transcriptome revealed ~20 putative Arabidopsis genes lacking in the current annotation.
Role of de novo biosynthesis in ecosystem scale monoterpene emissions from a boreal Scots pine forest

Directory of Open Access Journals (Sweden)

R. Taipale

2011-08-01

Full Text Available Monoterpene emissions from Scots pine have traditionally been assumed to originate as evaporation from specialized storage pools. More recently, the significance of de novo emissions, originating directly from monoterpene biosynthesis, has been recognized. To study the role of biosynthesis at the ecosystem scale, we measured monoterpene emissions from a Scots pine dominated forest in southern Finland using the disjunct eddy covariance method combined with proton transfer reaction mass spectrometry. The interpretation of the measurements was based on a correlation analysis and a hybrid emission algorithm describing both de novo and pool emissions. During the measurement period May–August 2007, the monthly medians of daytime emissions were 200, 290, 180, and 200 μg m⁻² h⁻¹. The emissions were partly light dependent, probably due to de novo biosynthesis. The emission potential for both de novo and pool emissions exhibited a decreasing summertime trend. The ratio of the de novo emission potential to the total emission potential varied between 30 % and 46 %. Although the monthly changes were not significant, the ratio always differed statistically from zero, suggesting that the role of de novo biosynthesis was observable. Given the uncertainties in this study, we conclude that more accurate estimates of the contribution of de novo emissions are required for improving monoterpene emission algorithms for Scots pine dominated forests.
LocARNA-P: Accurate boundary prediction and improved detection of structural RNAs

DEFF Research Database (Denmark)

Will, Sebastian; Joshi, Tejal; Hofacker, Ivo L.

2012-01-01

Current genomic screens for noncoding RNAs (ncRNAs) predict a large number of genomic regions containing potential structural ncRNAs. The analysis of these data requires highly accurate prediction of ncRNA boundaries and discrimination of promising candidate ncRNAs from weak predictions. Existing...... methods struggle with these goals because they rely on sequence-based multiple sequence alignments, which regularly misalign RNA structure and therefore do not support identification of structural similarities. To overcome this limitation, we compute columnwise and global reliabilities of alignments based...... on sequence and structure similarity; we refer to these structure-based alignment reliabilities as STARs. The columnwise STARs of alignments, or STAR profiles, provide a versatile tool for the manual and automatic analysis of ncRNAs. In particular, we improve the boundary prediction of the widely used nc...
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.

Science.gov (United States)

Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A

2014-01-01

As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies

Science.gov (United States)

Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthre K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.

2014-01-01

As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (~3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Parallel protein secondary structure prediction based on neural networks.

Science.gov (United States)

Zhong, Wei; Altun, Gulsah; Tian, Xinmin; Harrison, Robert; Tai, Phang C; Pan, Yi

2004-01-01

Protein secondary structure prediction has a fundamental influence on today's bioinformatics research. In this work, binary and tertiary classifiers of protein secondary structure prediction are implemented on Denoeux belief neural network (DBNN) architecture. Hydrophobicity matrix, orthogonal matrix, BLOSUM62 and PSSM (position specific scoring matrix) are experimented separately as the encoding schemes for DBNN. The experimental results contribute to the design of new encoding schemes. New binary classifier for Helix versus not Helix ( approximately H) for DBNN produces prediction accuracy of 87% when PSSM is used for the input profile. The performance of DBNN binary classifier is comparable to other best prediction methods. The good test results for binary classifiers open a new approach for protein structure prediction with neural networks. Due to the time consuming task of training the neural networks, Pthread and OpenMP are employed to parallelize DBNN in the hyperthreading enabled Intel architecture. Speedup for 16 Pthreads is 4.9 and speedup for 16 OpenMP threads is 4 in the 4 processors shared memory architecture. Both speedup performance of OpenMP and Pthread is superior to that of other research. With the new parallel training algorithm, thousands of amino acids can be processed in reasonable amount of time. Our research also shows that hyperthreading technology for Intel architecture is efficient for parallel biological algorithms.
De novo nonsense mutations in ASXL1 cause Bohring-Opitz syndrome

DEFF Research Database (Denmark)

Hoischen, Alexander; van Bon, Bregje W M; Rodríguez-Santiago, Benjamín

2011-01-01

Bohring-Opitz syndrome is characterized by severe intellectual disability, distinctive facial features and multiple congenital malformations. We sequenced the exomes of three individuals with Bohring-Opitz syndrome and in each identified heterozygous de novo nonsense mutations in ASXL1, which...... is required for maintenance of both activation and silencing of Hox genes. In total, 7 out of 13 subjects with a Bohring-Opitz phenotype had de novo ASXL1 mutations, suggesting that the syndrome is genetically heterogeneous....

Structural protein descriptors in 1-dimension and their sequence-based predictions.

Science.gov (United States)

Kurgan, Lukasz; Disfani, Fatemeh Miri

2011-09-01

The last few decades observed an increasing interest in development and application of 1-dimensional (1D) descriptors of protein structure. These descriptors project 3D structural features onto 1D strings of residue-wise structural assignments. They cover a wide-range of structural aspects including conformation of the backbone, burying depth/solvent exposure and flexibility of residues, and inter-chain residue-residue contacts. We perform first-of-its-kind comprehensive comparative review of the existing 1D structural descriptors. We define, review and categorize ten structural descriptors and we also describe, summarize and contrast over eighty computational models that are used to predict these descriptors from the protein sequences. We show that the majority of the recent sequence-based predictors utilize machine learning models, with the most popular being neural networks, support vector machines, hidden Markov models, and support vector and linear regressions. These methods provide high-throughput predictions and most of them are accessible to a non-expert user via web servers and/or stand-alone software packages. We empirically evaluate several recent sequence-based predictors of secondary structure, disorder, and solvent accessibility descriptors using a benchmark set based on CASP8 targets. Our analysis shows that the secondary structure can be predicted with over 80% accuracy and segment overlap (SOV), disorder with over 0.9 AUC, 0.6 Matthews Correlation Coefficient (MCC), and 75% SOV, and relative solvent accessibility with PCC of 0.7 and MCC of 0.6 (0.86 when homology is used). We demonstrate that the secondary structure predicted from sequence without the use of homology modeling is as good as the structure extracted from the 3D folds predicted by top-performing template-based methods.
Advances in Rosetta structure prediction for difficult molecular-replacement problems

International Nuclear Information System (INIS)

DiMaio, Frank

2013-01-01

Modeling advances using Rosetta structure prediction to aid in solving difficult molecular-replacement problems are discussed. Recent work has shown the effectiveness of structure-prediction methods in solving difficult molecular-replacement problems. The Rosetta protein structure modeling suite can aid in the solution of difficult molecular-replacement problems using templates from 15 to 25% sequence identity; Rosetta refinement guided by noisy density has consistently led to solved structures where other methods fail. In this paper, an overview of the use of Rosetta for these difficult molecular-replacement problems is provided and new modeling developments that further improve model quality are described. Several variations to the method are introduced that significantly reduce the time needed to generate a model and the sampling required to improve the starting template. The improvements are benchmarked on a set of nine difficult cases and it is shown that this improved method obtains consistently better models in less running time. Finally, strategies for best using Rosetta to solve difficult molecular-replacement problems are presented and future directions for the role of structure-prediction methods in crystallography are discussed
The impact of employee satisfaction on productivity in Tiskarna Novo mesto, Ltd.

Directory of Open Access Journals (Sweden)

Simona Cimperman

2016-06-01

Full Text Available Research Question: Does employee satisfaction, impact on productivity? How are these two variables associated? What is the job satisfaction in Tiskarna Novo mesto, Ltd. What needs to be done to make employees more satisfied at work and, consequently, more productive? Purpose: The purpose of the study is to determine what are the factors that influence employee satisfaction Tiskarna Novo mesto, Ltd. and check the connection between work satisfaction and employee productivity. The aim of the research is to examine what is the level of job satisfaction of employees in Tiskarna Novo mesto, Ltd. And find our reasons and factors that prevent employees were satisfied in the workplace. Method: In this study we used a descriptive method and the method of combining the study of domestic and foreign literature. Pending the results we have come to interview employees in the Tiskarna Novo mesto, Ltd. Results: We conducted a survey among employees in Tiskarna Novo mesto, Ltd and we came to the conclusion that the employees are medium satisfied – the average grade point job satisfaction of employees was 3.1 (evaluated on a 5-point Likert scale. The worst assessed was factor in job satisfaction opportunity for advancement and educational opportunities. We have found out that factors like receiving praise and awards as well as good interpersonal relations are those that affect good on job satisfaction, on the other hand conflict is the one that reduces job satisfaction. The existence of links between work satisfaction and productivity were not found (r = -0.061. Organization: The organization and managers, it is important to know which are the factors by which employees are satisfied or dissatisfied. Results of the research will give managers a clear picture of the factors of satisfaction / dissatisfaction and opinion on productivity. Society: The employees it means a lot to have your job satisfaction and consequently they are more productive. Originality: The
Interplay between De Novo Biosynthesis and Sequestration of Cyanogenic Glucosides in Arthropods

DEFF Research Database (Denmark)

Fürstenberg-Hägg, Joel

(Zygaenidae, Lepidoptera) both sequester (take up and accumulate) the CNglcs linamarin and lotaustralin from their food plants (Fabacea) and biosynthesize them de novo from valine and isoleucine. The presented research demonstrates that de novo biosynthesis of CNglcs in Z. filipendulae is dependent...
Validation of Molecular Dynamics Simulations for Prediction of Three-Dimensional Structures of Small Proteins.

Science.gov (United States)

Kato, Koichi; Nakayoshi, Tomoki; Fukuyoshi, Shuichi; Kurimoto, Eiji; Oda, Akifumi

2017-10-12

Although various higher-order protein structure prediction methods have been developed, almost all of them were developed based on the three-dimensional (3D) structure information of known proteins. Here we predicted the short protein structures by molecular dynamics (MD) simulations in which only Newton's equations of motion were used and 3D structural information of known proteins was not required. To evaluate the ability of MD simulationto predict protein structures, we calculated seven short test protein (10-46 residues) in the denatured state and compared their predicted and experimental structures. The predicted structure for Trp-cage (20 residues) was close to the experimental structure by 200-ns MD simulation. For proteins shorter or longer than Trp-cage, root-mean square deviation values were larger than those for Trp-cage. However, secondary structures could be reproduced by MD simulations for proteins with 10-34 residues. Simulations by replica exchange MD were performed, but the results were similar to those from normal MD simulations. These results suggest that normal MD simulations can roughly predict short protein structures and 200-ns simulations are frequently sufficient for estimating the secondary structures of protein (approximately 20 residues). Structural prediction method using only fundamental physical laws are useful for investigating non-natural proteins, such as primitive proteins and artificial proteins for peptide-based drug delivery systems.
Predicting Protein Secondary Structure with Markov Models

DEFF Research Database (Denmark)

Fischer, Paul; Larsen, Simon; Thomsen, Claus

2004-01-01

we are considering here, is to predict the secondary structure from the primary one. To this end we train a Markov model on training data and then use it to classify parts of unknown protein sequences as sheets, helices or coils. We show how to exploit the directional information contained...... in the Markov model for this task. Classifications that are purely based on statistical models might not always be biologically meaningful. We present combinatorial methods to incorporate biological background knowledge to enhance the prediction performance....
Novo Jornalismo: fronteiras litero-factuais em A sangue Frio e em Radical Chique

Directory of Open Access Journals (Sweden)

Francisco Aquinei Timóteo Queirós

2012-12-01

Full Text Available A pesquisa busca analisar de que forma fato e ficção se entrecruzam no “movimento” do Novo Jornalismo, a partir das obras A sangue Frio e Radical Chique e o Novo Jornalismo, de Truman Capote e Tom Wolfe, respectivamente. Pretende-se, a partir da investigação do corpus em estudo, revelar os aspectos que aproximam o fato jornalístico, a notícia e a reportagem às técnicas literárias do romance, do conto e da crônica. O estudo investiga o Novo Jornalismo sob o viés de textos centrais das áreas de teoria literária e estudos jornalísticos utilizando autores como Mikhail Bakhtin, Hayden White, Paul Ricoeur, Muniz Sodré; além de referenciar outros escritores que, como Tom Wolfe e Truman Capote, fizeram parte de um grande movimento renovador do jornalismo literário nos anos 1950, 1960 e 1970 chamado, genericamente, de Novo Jornalismo.
Demanda dos principais metais e novos materiais : analise de tendencias

OpenAIRE

Wilson Trigueiro de Sousa

1990-01-01

Resumo: Neste trabalho são analisadas algumas tendências na área de novos materiais na tentativa de obter um melhor entendimento das repercussões das atuais inovações tecnológicas para o setor mineral. Inicialmente são revisados os principais estudos sobre as mudanças ocorridas por volta de 1972/74 no comportamento da demanda dos metais mais importantes. Entre as possíveis causas, está o progresso técnico, que tornou possível o surgimento de novos materiais e o aperfeiçoamento de outros em us...
The immobilization of heavy metals in soil by bioaugmentation of a UV-mutant Bacillus subtilis 38 assisted by NovoGro biostimulation and changes of soil microbial community

Energy Technology Data Exchange (ETDEWEB)

Wang, Ting [MOE Key Laboratory of Pollution Processes and Environmental Criteria, College of Environmental Science and Engineering, Nankai University, Tianjin 300071 (China); Urban Transport Emission Control Research Centre, College of Environmental Science and Engineering, Nankai University, Tianjin 300071 (China); Sun, Hongwen, E-mail: sunhongwen@nankai.edu.cn [MOE Key Laboratory of Pollution Processes and Environmental Criteria, College of Environmental Science and Engineering, Nankai University, Tianjin 300071 (China); Mao, Hongjun [Urban Transport Emission Control Research Centre, College of Environmental Science and Engineering, Nankai University, Tianjin 300071 (China); Zhang, Yanfeng; Wang, Cuiping; Zhang, Zhiyuan; Wang, Baolin; Sun, Lei [MOE Key Laboratory of Pollution Processes and Environmental Criteria, College of Environmental Science and Engineering, Nankai University, Tianjin 300071 (China)

2014-08-15

Highlights: • A UV-mutated species, Bacillus subtilis 38, is a good sorbent for multi-metals (Cd, Cr, Hg and Pb). • B38 mixed with NovoGro exhibited a synergetic effect on the immobilization of heavy metals in soil. • DTPA, M3 and BCR were suitable for predicting metal bioavailability for specific classes of plant. • The NovoGro could enhance the proliferation of both exotic B38 and native microbes. • It's a practical strategy for the remediation of actual farmland polluted by multi-heavy metals. - Abstract: Bacillus subtilis 38 (B38) is a mutant species of Bacillus subtilis acquired by UV irradiation with high cadmium tolerance. This study revealed that B38 was a good biosorbent for the adsorption of multiple heavy metals (cadmium, chromium, mercury, and lead). Simultaneous application of B38 and NovoGro (SNB) exhibited a synergetic effect on the immobilization of heavy metals in soil. The heavy metal concentrations in the edible part of the tested plants (lettuce, radish, and soybean) under SNB treatment decreased by 55.4–97.9% compared to the control. Three single extraction methods, diethylenetriaminepentaacetic acid (DTPA), Mehlich 3 (M3), and the first step of the Community Bureau of Reference method (BCR1), showed good predictive capacities for metal bioavailability to leafy, rhizome, and leguminous plant, respectively. The polymerase chain reaction–denaturing gradient gel electrophoresis (PCR–DGGE) profiles revealed that NovoGro could enhance the proliferation of both exotic B38 and native microbes. Finally, the technology was checked in the field, the reduction in heavy metal concentrations in the edible part of radish was in the range between 30.8% and 96.0% after bioremediation by SNB treatment. This study provides a practical strategy for the remediation of farmland contaminated by multiple heavy metals.
The immobilization of heavy metals in soil by bioaugmentation of a UV-mutant Bacillus subtilis 38 assisted by NovoGro biostimulation and changes of soil microbial community

International Nuclear Information System (INIS)

Wang, Ting; Sun, Hongwen; Mao, Hongjun; Zhang, Yanfeng; Wang, Cuiping; Zhang, Zhiyuan; Wang, Baolin; Sun, Lei

2014-01-01

Highlights: • A UV-mutated species, Bacillus subtilis 38, is a good sorbent for multi-metals (Cd, Cr, Hg and Pb). • B38 mixed with NovoGro exhibited a synergetic effect on the immobilization of heavy metals in soil. • DTPA, M3 and BCR were suitable for predicting metal bioavailability for specific classes of plant. • The NovoGro could enhance the proliferation of both exotic B38 and native microbes. • It's a practical strategy for the remediation of actual farmland polluted by multi-heavy metals. - Abstract: Bacillus subtilis 38 (B38) is a mutant species of Bacillus subtilis acquired by UV irradiation with high cadmium tolerance. This study revealed that B38 was a good biosorbent for the adsorption of multiple heavy metals (cadmium, chromium, mercury, and lead). Simultaneous application of B38 and NovoGro (SNB) exhibited a synergetic effect on the immobilization of heavy metals in soil. The heavy metal concentrations in the edible part of the tested plants (lettuce, radish, and soybean) under SNB treatment decreased by 55.4–97.9% compared to the control. Three single extraction methods, diethylenetriaminepentaacetic acid (DTPA), Mehlich 3 (M3), and the first step of the Community Bureau of Reference method (BCR1), showed good predictive capacities for metal bioavailability to leafy, rhizome, and leguminous plant, respectively. The polymerase chain reaction–denaturing gradient gel electrophoresis (PCR–DGGE) profiles revealed that NovoGro could enhance the proliferation of both exotic B38 and native microbes. Finally, the technology was checked in the field, the reduction in heavy metal concentrations in the edible part of radish was in the range between 30.8% and 96.0% after bioremediation by SNB treatment. This study provides a practical strategy for the remediation of farmland contaminated by multiple heavy metals
Cortical thickness in de novo patients with Parkinson disease and mild cognitive impairment with consideration of clinical phenotype and motor laterality.

Science.gov (United States)

Danti, S; Toschi, N; Diciotti, S; Tessa, C; Poletti, M; Del Dotto, P; Lucetti, C

2015-12-01

Parkinson's disease (PD) is a progressive neurodegenerative disorder with motor and non-motor symptoms, including cognitive deficits. Several magnetic resonance imaging approaches have been applied to investigate brain atrophy in PD. The aim of this study was to detect early structural cortical and subcortical changes in de novo PD whilst distinguishing cognitive status, clinical phenotype and motor laterality. Eighteen de novo PD with mild cognitive impairment (PD-MCI), 18 de novo PD without MCI (PD-NC) and 18 healthy control subjects were evaluated. In the PD-MCI group, nine were tremor dominant and nine were postural instability gait disorder (PIGD) phenotype; 11 had right-sided symptom dominance and seven had left-sided symptom dominance. FreeSurfer was used to measure cortical thickness/folding, subcortical structures and to study group differences as well as the association with clinical and neuropsychological data. Parkinson's disease with MCI showed regional thinning in the right frontal, right middle temporal areas and left insula compared to PD-NC. A reduction of the volume of the left and right thalamus and left hippocampus was found in PD-MCI compared to PD-NC. PD-MCI PIGD showed regional thinning in the right inferior parietal area compared to healthy controls. A decreased volume of the left thalamus was reported in PD-MCI with right-sided symptom dominance compared to PD-NC and PD-MCI with left-sided symptom dominance. When MCI was present, PD patients showed a fronto-temporo-parietal pattern of cortical thinning. This cortical pattern does not appear to be influenced by motor laterality, although one-sided symptom dominance may contribute to volumetric reduction of specific subcortical structures. © 2015 EAN.
The equivalent thermal conductivity of lattice core sandwich structure: A predictive model

International Nuclear Information System (INIS)

Cheng, Xiangmeng; Wei, Kai; He, Rujie; Pei, Yongmao; Fang, Daining

2016-01-01

Highlights: • A predictive model of the equivalent thermal conductivity was established. • Both the heat conduction and radiation were considered. • The predictive results were in good agreement with experiment and FEM. • Some methods for improving the thermal protection performance were proposed. - Abstract: The equivalent thermal conductivity of lattice core sandwich structure was predicted using a novel model. The predictive results were in good agreement with experimental and Finite Element Method results. The thermal conductivity of the lattice core sandwich structure was attributed to both core conduction and radiation. The core conduction caused thermal conductivity only relied on the relative density of the structure. And the radiation caused thermal conductivity increased linearly with the thickness of the core. It was found that the equivalent thermal conductivity of the lattice core sandwich structure showed a highly dependent relationship on temperature. At low temperatures, the structure exhibited a nearly thermal insulated behavior. With the temperature increasing, the thermal conductivity of the structure increased owing to radiation. Therefore, some attempts, such as reducing the emissivity of the core or designing multilayered structure, are believe to be of benefit for improving the thermal protection performance of the structure at high temperatures.
Indirect coupling of phosphate release to de novo tension generation during muscle contraction.

Science.gov (United States)

Davis, J S; Rodgers, M E

1995-01-01

A key question in muscle contraction is how tension generation is coupled to the chemistry of the actomyosin ATPase. Biochemical and mechanochemical experiments link tension generation to a change in structure associated with phosphate release. Length-jump and temperature-jump experiments, on the other hand, implicate phase 2slow, a significantly faster, markedly strain-sensitive kinetic process in tension generation. We use a laser temperature jump to probe the kinetics and mechanism of tension generation in skinned rabbit psoas fibers--an appropriate method since both phosphate release and phase 2slow are readily perturbed by temperature. Kinetics characteristic of the structural change associated with phosphate release are observed only when phosphate is added to fibers. When present, it causes a reduction in fiber tension; otherwise, no force is generated when it is perturbed. We therefore exclude this step from tension generation. The kinetics of de novo tension generation by the temperature-jump equivalent of phase 2slow appear unaffected by phosphate binding. We therefore propose that phosphate release is indirectly coupled to de novo tension generation via a steady-state flux through an irreversible step. We conclude that tension generation occurs in the absence of chemical change as the result of an entropy-driven transition between strongly bound crossbridges in the actomyosin-ADP state. The mechanism resembles the operation of a clock, with phosphate release providing the energy to tension the spring, and the irreversible step functions as the escapement mechanism, which is followed in turn by tension generation as the movement of the hands. Images Fig. 6 PMID:7479824
Combining neural networks for protein secondary structure prediction

DEFF Research Database (Denmark)

Riis, Søren Kamaric

1995-01-01

In this paper structured neural networks are applied to the problem of predicting the secondary structure of proteins. A hierarchical approach is used where specialized neural networks are designed for each structural class and then combined using another neural network. The submodels are designed...... by using a priori knowledge of the mapping between protein building blocks and the secondary structure and by using weight sharing. Since none of the individual networks have more than 600 adjustable weights over-fitting is avoided. When ensembles of specialized experts are combined the performance...
Reduced Fragment Diversity for Alpha and Alpha-Beta Protein Structure Prediction using Rosetta.

Science.gov (United States)

Abbass, Jad; Nebel, Jean-Christophe

2017-01-01

Protein structure prediction is considered a main challenge in computational biology. The biannual international competition, Critical Assessment of protein Structure Prediction (CASP), has shown in its eleventh experiment that free modelling target predictions are still beyond reliable accuracy, therefore, much effort should be made to improve ab initio methods. Arguably, Rosetta is considered as the most competitive method when it comes to targets with no homologues. Relying on fragments of length 9 and 3 from known structures, Rosetta creates putative structures by assembling candidate fragments. Generally, the structure with the lowest energy score, also known as first model, is chosen to be the "predicted one". A thorough study has been conducted on the role and diversity of 3-mers involved in Rosetta's model "refinement" phase. Usage of the standard number of 3-mers - i.e. 200 - has been shown to degrade alpha and alpha-beta protein conformations initially achieved by assembling 9-mers. Therefore, a new prediction pipeline is proposed for Rosetta where the "refinement" phase is customised according to a target's structural class prediction. Over 8% improvement in terms of first model structure accuracy is reported for alpha and alpha-beta classes when decreasing the number of 3- mers. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
On the performance of de novo pathway enrichment

DEFF Research Database (Denmark)

Batra, Richa; Alcaraz, Nicolas; Gitzhofer, Kevin

2017-01-01

De novo pathway enrichment is a powerful approach to discover previously uncharacterized molecular mechanisms in addition to already known pathways. To achieve this, condition-specific functional modules are extracted from large interaction networks. Here, we give an overview of the state...
O Estado Novo, o rádio e seus órgãos reguladores

Directory of Open Access Journals (Sweden)

Othon Jambeiro

2003-01-01

Full Text Available This paper analyses the Brazilian broadcasting regulation in the 1935-1945 period, including the Estado Novo coup d'etat preparation, its implementation, consolidation and fall. It is taken into account the historical context and the national and international changes which occured in that period. It also analyses the rules for broadcasting and mass communication media, particularly on control issues. Functions, structure and acting of the Departmento de Imprensa e Propaganda - DIP area studied, in the perspective of understanding the role of the most important intelligence agency of the Vargas government.
A de novo designed monomeric, compact three helix bundle protein on a carbohydrate template

DEFF Research Database (Denmark)

Malik, Leila; Nygård, Jesper; Christensen, Niels Johan

2015-01-01

De novo design and chemical synthesis of proteins and of other artificial structures, which mimic them, is a central strategy for understanding protein folding and for accessing proteins with novel functions. We have previously described carbohydrates as templates for the assembly of artificial...... the template could facilitate protein folding. Here we report the design and synthesis of 3-helix bundle carboproteins on deoxy-hexopyranosides. The carboproteins were analyzed by CD, AUC, SAXS, and NMR, which revealed the formation of the first compact, and folded monomeric carboprotein distinctly different...
Transferência do fator caturra para o cultivar Mundo Novo de Coffea arabica Transfer of the CT gene to Mundo Novo cultivar

Directory of Open Access Journals (Sweden)

A. Carvalho

1972-01-01

Full Text Available No presente trabalho são relatados os estudos realizados visando à introdução do gene Ct (caturra que contribui para reduzir a altura da planta, no cultivar Mundo" Novo de Coffea arabica.Estudaram-se, em ensaios de produtividade, as populações Fv F.,, F3 e F4. Nessas populações e principalmente entre os descendentes dos "caféeiros H 2077-2-5 e H 2077-2-12, foram selecionadas plantas homozigotas para os alelos Ct e também para os alelos responsáveis pela cor do fruto xc ou Xc. Essas combinações foram denominadas 'Catuaí Amarelo' e 'Catuaí Vermelho', respectivamente, e suas características são apresentadas. Os novos cultivares vêm-se mostrando de interesse econômico para as regiões cafeeiras não somente pelo porte pequeno, mas também pela produtividade, pelo vigor vegetativo e pela precocidade.The successful transfer of the Ct gene for short internode to the tall cultivar of Coffea arábica'Mundo Novo' is reported. Individual selections were carried out in the F1, F2, F3 and F4 generations. It was found that early selection in the F2 generation was quite effective. A remarkably good correlation was found between productitivity of F2 plants and the yield of the F3 and F4 generations. Plants of the F4 generation have shown reasonable uniformity and high yield in several trials. The new selections showed to be early producers. Two new cultivars were released namely 'Catuaí Amarelo' and 'Catuaí Vermelho'. The former has yellow fruits whereas the latter has red fruits. The plants are much shorter that the ones of Mundo Novo. The new cultivars have a very strong secondary and tertiary branching. Because of these characteristics Catuaí Amarelo and Catuaí Vermelho are being planted in large scale replacing the tall cultivars.
Exploiting the Past and the Future in Protein Secondary Structure Prediction

DEFF Research Database (Denmark)

Baldi, Pierre; Brunak, Søren; Frasconi, P

1999-01-01

predictions based on variable ranges of dependencies. These architectures extend recurrent neural networks, introducing non-causal bidirectional dynamics to capture both upstream and downstream information. The prediction algorithm is completed by the use of mixtures of estimators that leverage evolutionary......Motivation: Predicting the secondary structure of a protein (alpha-helix, beta-sheet, coil) is an important step towards elucidating its three-dimensional structure, as well as its function. Presently, the best predictors are based on machine learning approaches, in particular neural network...

De Novo Glutamine Synthesis

Science.gov (United States)

He, Qiao; Shi, Xinchong; Zhang, Linqi; Yi, Chang; Zhang, Xuezhen

2016-01-01

Purpose: The aim of this study was to investigate the role of de novo glutamine (Gln) synthesis in the proliferation of C6 glioma cells and its detection with 13N-ammonia. Methods: Chronic Gln-deprived C6 glioma (0.06C6) cells were established. The proliferation rates of C6 and 0.06C6 cells were measured under the conditions of Gln deprivation along with or without the addition of ammonia or glutamine synthetase (GS) inhibitor. 13N-ammonia uptake was assessed in C6 cells by gamma counting and in rats with C6 and 0.06C6 xenografts by micro–positron emission tomography (PET) scanning. The expression of GS in C6 cells and xenografts was assessed by Western blotting and immunohistochemistry, respectively. Results: The Gln-deprived C6 cells showed decreased proliferation ability but had a significant increase in GS expression. Furthermore, we found that low concentration of ammonia was sufficient to maintain the proliferation of Gln-deprived C6 cells, and 13N-ammonia uptake in C6 cells showed Gln-dependent decrease, whereas inhibition of GS markedly reduced the proliferation of C6 cells as well as the uptake of 13N-ammoina. Additionally, microPET/computed tomography exhibited that subcutaneous 0.06C6 xenografts had higher 13N-ammonia uptake and GS expression in contrast to C6 xenografts. Conclusion: De novo Gln synthesis through ammonia–glutamate reaction plays an important role in the proliferation of C6 cells. 13N-ammonia can be a potential metabolic PET tracer for Gln-dependent tumors. PMID:27118759
Prediction of Seismic Damage-Based Degradation in RC Structures

DEFF Research Database (Denmark)

Kirkegaard, Poul Henning; Gupta, Vinay K.; Nielsen, Søren R.K.

Estimation of structural damage from known increase in the fundamental period of a structure after an earthquake or prediction of degradation of stiffness and strength for known damage requires reliable correlations between these response functionals. This study proposes a modified Clough-Johnsto...
De novo structural modeling and computational sequence analysis ...

African Journals Online (AJOL)

Jane

2011-07-25

Jul 25, 2011 ... fold recognition and ab initio protein structures, classification of structural motifs and ... stringent cross validation method to evaluate the method's performance ..... Hauser H, Jagels K, Moule S, Mungall K, Norbertczak H,.
RaptorX-Property: a web server for protein structure property prediction.

Science.gov (United States)

Wang, Sheng; Li, Wei; Liu, Shiwang; Xu, Jinbo

2016-07-08

RaptorX Property (http://raptorx2.uchicago.edu/StructurePropertyPred/predict/) is a web server predicting structure property of a protein sequence without using any templates. It outperforms other servers, especially for proteins without close homologs in PDB or with very sparse sequence profile (i.e. carries little evolutionary information). This server employs a powerful in-house deep learning model DeepCNF (Deep Convolutional Neural Fields) to predict secondary structure (SS), solvent accessibility (ACC) and disorder regions (DISO). DeepCNF not only models complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent property labels. Our experimental results show that, tested on CASP10, CASP11 and the other benchmarks, this server can obtain ∼84% Q3 accuracy for 3-state SS, ∼72% Q8 accuracy for 8-state SS, ∼66% Q3 accuracy for 3-state solvent accessibility, and ∼0.89 area under the ROC curve (AUC) for disorder prediction. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Integrated Structural Biology for α-Helical Membrane Protein Structure Determination.

Science.gov (United States)

Xia, Yan; Fischer, Axel W; Teixeira, Pedro; Weiner, Brian; Meiler, Jens

2018-04-03

While great progress has been made, only 10% of the nearly 1,000 integral, α-helical, multi-span membrane protein families are represented by at least one experimentally determined structure in the PDB. Previously, we developed the algorithm BCL::MP-Fold, which samples the large conformational space of membrane proteins de novo by assembling predicted secondary structure elements guided by knowledge-based potentials. Here, we present a case study of rhodopsin fold determination by integrating sparse and/or low-resolution restraints from multiple experimental techniques including electron microscopy, electron paramagnetic resonance spectroscopy, and nuclear magnetic resonance spectroscopy. Simultaneous incorporation of orthogonal experimental restraints not only significantly improved the sampling accuracy but also allowed identification of the correct fold, which is demonstrated by a protein size-normalized transmembrane root-mean-square deviation as low as 1.2 Å. The protocol developed in this case study can be used for the determination of unknown membrane protein folds when limited experimental restraints are available. Copyright © 2018 Elsevier Ltd. All rights reserved.
Manual for the prediction of blast and fragment loadings on structures

Energy Technology Data Exchange (ETDEWEB)

1980-11-01

The purpose of this manual is to provide Architect-Engineer (AE) firms guidance for the prediction of air blast, ground shock and fragment loadings on structures as a result of accidental explosions in or near these structures. Information in this manual is the result of an extensive literature survey and data gathering effort, supplemented by some original analytical studies on various aspects of blast phenomena. Many prediction equations and graphs are presented, accompanied by numerous example problems illustrating their use. The manual is complementary to existing structural design manuals and is intended to reflect the current state-of-the-art in prediction of blast and fragment loads for accidental explosions of high explosives at the Pantex Plant. In some instances, particularly for explosions within blast-resistant structures of complex geometry, rational estimation of these loads is beyond the current state-of-the-art.
Novo Jornalismo: fronteiras litero-factuais em A sangue Frio e em Radical Chique

Directory of Open Access Journals (Sweden)

Francisco Aquinei Timóteo Queirós

2012-03-01

Full Text Available http://dx.doi.org/10.5007/1984-784X.2012v12n18p130 A pesquisa busca analisar de que forma fato e ficção se entrecruzam no “movimento” do Novo Jornalismo, a partir das obras A sangue Frio e Radical Chique e o Novo Jornalismo, de Truman Capote e Tom Wolfe, respectivamente. Pretende-se, a partir da investigação do corpus em estudo, revelar os aspectos que aproximam o fato jornalístico, a notícia e a reportagem às técnicas literárias do romance, do conto e da crônica. O estudo investiga o Novo Jornalismo sob o viés de textos centrais das áreas de teoria literária e estudos jornalísticos utilizando autores como Mikhail Bakhtin, Hayden White, Paul Ricoeur, Muniz Sodré; além de referenciar outros escritores que, como Tom Wolfe e Truman Capote, fizeram parte de um grande movimento renovador do jornalismo literário nos anos 1950, 1960 e 1970 chamado, genericamente, de Novo Jornalismo.
New tips for structure prediction by comparative modeling

OpenAIRE

Rayan, Anwar

2009-01-01

Comparative modelling is utilized to predict the 3-dimensional conformation of a given protein (target) based on its sequence alignment to experimentally determined protein structure (template). The use of such technique is already rewarding and increasingly widespread in biological research and drug development. The accuracy of the predictions as commonly accepted depends on the score of sequence identity of the target protein to the template. To assess the relationship between sequence iden...
De novo design of an RNA tile that self-assembles into a homo-octameric nanoprism

Science.gov (United States)

Yu, Jinwen; Liu, Zhiyu; Jiang, Wen; Wang, Guansong; Mao, Chengde

2015-01-01

Rational, de novo design of RNA nanostructures can potentially integrate a wide array of structural and functional diversities. Such nanostructures have great promises in biomedical applications. Despite impressive progress in this field, all RNA building blocks (or tiles) reported so far are not geometrically well defined. They are generally flexible and can only assemble into a mixture of complexes with different sizes. To achieve defined structures, multiple tiles with different sequences are needed. In this study, we design an RNA tile that can homo-oligomerize into a uniform RNA nanostructure. The designed RNA nanostructure is characterized by gel electrophoresis, atomic force microscopy and cryogenic electron microscopy imaging. We believe that development along this line would help RNA nanotechnology to reach the structural control that is currently associated with DNA nanotechnology.
Automatic prediction of facial trait judgments: appearance vs. structural models.

Directory of Open Access Journals (Sweden)

Mario Rojas

Full Text Available Evaluating other individuals with respect to personality characteristics plays a crucial role in human relations and it is the focus of attention for research in diverse fields such as psychology and interactive computer systems. In psychology, face perception has been recognized as a key component of this evaluation system. Multiple studies suggest that observers use face information to infer personality characteristics. Interactive computer systems are trying to take advantage of these findings and apply them to increase the natural aspect of interaction and to improve the performance of interactive computer systems. Here, we experimentally test whether the automatic prediction of facial trait judgments (e.g. dominance can be made by using the full appearance information of the face and whether a reduced representation of its structure is sufficient. We evaluate two separate approaches: a holistic representation model using the facial appearance information and a structural model constructed from the relations among facial salient points. State of the art machine learning methods are applied to a derive a facial trait judgment model from training data and b predict a facial trait value for any face. Furthermore, we address the issue of whether there are specific structural relations among facial points that predict perception of facial traits. Experimental results over a set of labeled data (9 different trait evaluations and classification rules (4 rules suggest that a prediction of perception of facial traits is learnable by both holistic and structural approaches; b the most reliable prediction of facial trait judgments is obtained by certain type of holistic descriptions of the face appearance; and c for some traits such as attractiveness and extroversion, there are relationships between specific structural features and social perceptions.
Facebook - Um novo espaço autobiográfico?

Directory of Open Access Journals (Sweden)

Maria Tereza Lima

2015-07-01

Full Text Available O arigo "Facebook - Um novo espaço autobiográfico?" tem como objetivo central investigar como a perspectiva autobiográfica e biográfica se configura em uma rede social. Levando em consideração esse novo espaço de exteriorização da memória, analisamos as escolhas de uma pessoa ao postar os mais diversos gêneros textuais no Facebook e verificamos até que ponto tais fragmentos textuais narram a história de um indivíduo. Quais textos são postados? O que foi escolhido e o que foi excluído desse perfil? O autor trava um pacto de leitura com o leitor? Se levarmos em consideração que os textos postados nessa rede social são textos produzidos pelo próprio autor do perfil e de autores diversos, como configuraremos esses espaços virtuais? Autobiográficos e biográficos? Quem escreve a página virtual é o próprio autor do perfil ou múltiplos autores? Com as redes sociais, surge um novo modelo de autobiografia e de biógrafo? Esses e tantos outros questionamentos nortearam nossas investigações e permitiram-nos conhecer um pouco mais sobre as estratégias autobiográficas dos autores virtuais contemporâneos.
Hill-Climbing search and diversification within an evolutionary approach to protein structure prediction.

Science.gov (United States)

Chira, Camelia; Horvath, Dragos; Dumitrescu, D

2011-07-30

Proteins are complex structures made of amino acids having a fundamental role in the correct functioning of living cells. The structure of a protein is the result of the protein folding process. However, the general principles that govern the folding of natural proteins into a native structure are unknown. The problem of predicting a protein structure with minimum-energy starting from the unfolded amino acid sequence is a highly complex and important task in molecular and computational biology. Protein structure prediction has important applications in fields such as drug design and disease prediction. The protein structure prediction problem is NP-hard even in simplified lattice protein models. An evolutionary model based on hill-climbing genetic operators is proposed for protein structure prediction in the hydrophobic - polar (HP) model. Problem-specific search operators are implemented and applied using a steepest-ascent hill-climbing approach. Furthermore, the proposed model enforces an explicit diversification stage during the evolution in order to avoid local optimum. The main features of the resulting evolutionary algorithm - hill-climbing mechanism and diversification strategy - are evaluated in a set of numerical experiments for the protein structure prediction problem to assess their impact to the efficiency of the search process. Furthermore, the emerging consolidated model is compared to relevant algorithms from the literature for a set of difficult bidimensional instances from lattice protein models. The results obtained by the proposed algorithm are promising and competitive with those of related methods.
Hill-Climbing search and diversification within an evolutionary approach to protein structure prediction

Directory of Open Access Journals (Sweden)

Chira Camelia

2011-07-01

Full Text Available Abstract Proteins are complex structures made of amino acids having a fundamental role in the correct functioning of living cells. The structure of a protein is the result of the protein folding process. However, the general principles that govern the folding of natural proteins into a native structure are unknown. The problem of predicting a protein structure with minimum-energy starting from the unfolded amino acid sequence is a highly complex and important task in molecular and computational biology. Protein structure prediction has important applications in fields such as drug design and disease prediction. The protein structure prediction problem is NP-hard even in simplified lattice protein models. An evolutionary model based on hill-climbing genetic operators is proposed for protein structure prediction in the hydrophobic - polar (HP model. Problem-specific search operators are implemented and applied using a steepest-ascent hill-climbing approach. Furthermore, the proposed model enforces an explicit diversification stage during the evolution in order to avoid local optimum. The main features of the resulting evolutionary algorithm - hill-climbing mechanism and diversification strategy - are evaluated in a set of numerical experiments for the protein structure prediction problem to assess their impact to the efficiency of the search process. Furthermore, the emerging consolidated model is compared to relevant algorithms from the literature for a set of difficult bidimensional instances from lattice protein models. The results obtained by the proposed algorithm are promising and competitive with those of related methods.
De Novo Heart Failure After Kidney Transplantation: Trends in Incidence and Outcomes.

Science.gov (United States)

Lenihan, Colin R; Liu, Sai; Deswal, Anita; Montez-Rath, Maria E; Winkelmayer, Wolfgang C

2018-03-29

Heart failure is an important cause of morbidity and mortality following kidney transplantation. Some studies in the general population have shown that the incidence of heart failure has decreased during the past 20 years. However, it is not currently known whether such a trend exists in the kidney transplantation population. Retrospective observational cohort study. Adult patients included in the US Renal Data System who underwent their first kidney transplantation in the United States between 1998 and 2010 with at least 6 months of continuous Medicare parts A and B coverage before transplantation and no prior evidence for a diagnosis of heart failure before kidney transplantation. Calendar year of transplantation and calendar year of posttransplantation heart failure diagnosis. De novo posttransplantation heart failure defined using International Classification of Diseases, Ninth Revision diagnosis codes and mortality following de novo posttransplantation heart failure diagnosis. Secular trends in de novo post-kidney transplantation heart failure were examined using Cox proportional hazards analysis. Within a study cohort of 48,771 patients, 7,269 developed de novo heart failure within 3 years of kidney transplantation, with a median time to heart failure of 0.76 years. The adjusted HR for heart failure with death as competing risk comparing patients who underwent transplantation in 2010 with those who underwent transplantation in 1998 was 0.69 (95% CI, 0.60-0.79). No temporal trend in mortality following a diagnosis of post-kidney transplantation heart failure was observed. Potential residual confounding from either incorrectly ascertained or unavailable confounders. The cohort was limited to Medicare beneficiaries. Adjusted for demographic and clinical characteristics, the risk for developing de novo post-kidney transplantation heart failure has declined significantly between 1998 and 2010, with no apparent change in subsequent mortality. Copyright © 2018
Clinical Prediction Models for Cardiovascular Disease: Tufts Predictive Analytics and Comparative Effectiveness Clinical Prediction Model Database.

Science.gov (United States)

Wessler, Benjamin S; Lai Yh, Lana; Kramer, Whitney; Cangelosi, Michael; Raman, Gowri; Lutz, Jennifer S; Kent, David M

2015-07-01

Clinical prediction models (CPMs) estimate the probability of clinical outcomes and hold the potential to improve decision making and individualize care. For patients with cardiovascular disease, there are numerous CPMs available although the extent of this literature is not well described. We conducted a systematic review for articles containing CPMs for cardiovascular disease published between January 1990 and May 2012. Cardiovascular disease includes coronary heart disease, heart failure, arrhythmias, stroke, venous thromboembolism, and peripheral vascular disease. We created a novel database and characterized CPMs based on the stage of development, population under study, performance, covariates, and predicted outcomes. There are 796 models included in this database. The number of CPMs published each year is increasing steadily over time. Seven hundred seventeen (90%) are de novo CPMs, 21 (3%) are CPM recalibrations, and 58 (7%) are CPM adaptations. This database contains CPMs for 31 index conditions, including 215 CPMs for patients with coronary artery disease, 168 CPMs for population samples, and 79 models for patients with heart failure. There are 77 distinct index/outcome pairings. Of the de novo models in this database, 450 (63%) report a c-statistic and 259 (36%) report some information on calibration. There is an abundance of CPMs available for a wide assortment of cardiovascular disease conditions, with substantial redundancy in the literature. The comparative performance of these models, the consistency of effects and risk estimates across models and the actual and potential clinical impact of this body of literature is poorly understood. © 2015 American Heart Association, Inc.
Exploring high-pressure FeB{sub 2}: Structural and electronic properties predictions

Energy Technology Data Exchange (ETDEWEB)

Harran, Ismail [School of Physical Science and Technology, Key Laboratory of Advanced Technologies of Materials, Ministry of Education of China, Southwest Jiaotong University, Chengdu, 610031 (China); Al Fashir University (Sudan); Wang, Hongyan [School of Physical Science and Technology, Key Laboratory of Advanced Technologies of Materials, Ministry of Education of China, Southwest Jiaotong University, Chengdu, 610031 (China); Chen, Yuanzheng, E-mail: cyz@calypso.org.cn [School of Physical Science and Technology, Key Laboratory of Advanced Technologies of Materials, Ministry of Education of China, Southwest Jiaotong University, Chengdu, 610031 (China); Jia, Mingzhen [School of Physical Science and Technology, Key Laboratory of Advanced Technologies of Materials, Ministry of Education of China, Southwest Jiaotong University, Chengdu, 610031 (China); Wu, Nannan [School of Mathematics, Physics and Biological Engineering, Inner Mongolia University of Science & Technology, Baotou, 014010 (China)

2016-09-05

The high pressure (HP) structural phase of FeB{sub 2} compound is investigated by using first-principles crystal structure prediction based on the CALYPSO technique. A thermodynamically stable phase of FeB{sub 2} with space group Imma is predicted at pressure above 225 GPa, which is characterized by a layered orthorhombic structure containing puckered graphite-like boron layers. Its electronic and mechanical properties are identified and analyzed. The feature of band structures favors the occurrence of superconductivity, whereas, the calculated Pugh's ratio reveals that the HP Imma structure exhibits ductile mechanical property. - Highlights: • The high pressure structural phase of FeB{sub 2} compound is firstly investigated by the CALYPSO technique. • A thermodynamically stable Imma phase of FeB{sub 2} is predicted at pressure above 225 GPa. • The Imma structure is characterized by a 2D boron network containing puckered graphite-like boron layers. • The band feature of Imma structure favors the occurrence of superconductivity. • The calculated Pugh's ratio suggests that the Imma structure exhibits ductile mechanical property.
Development of laboratory acceleration test method for service life prediction of concrete structures

International Nuclear Information System (INIS)

Cho, M. S.; Song, Y. C.; Bang, K. S.; Lee, J. S.; Kim, D. K.

1999-01-01

Service life prediction of nuclear power plants depends on the application of history of structures, field inspection and test, the development of laboratory acceleration tests, their analysis method and predictive model. In this study, laboratory acceleration test method for service life prediction of concrete structures and application of experimental test results are introduced. This study is concerned with environmental condition of concrete structures and is to develop the acceleration test method for durability factors of concrete structures e.g. carbonation, sulfate attack, freeze-thaw cycles and shrinkage-expansion etc
Photoreactivation of conversion and de novo suppressor mutation in Escherichia coli

Energy Technology Data Exchange (ETDEWEB)

Bockrath, R C; Plamer, J E [Indiana Univ., Indianapolis (USA). Dept. of Microbiology

1977-04-01

Studies of mutagenesis and photoreactivation in various E.coli strains have shown that conversion mutation of a mutant containing an amber suppressor to one containing an ochre suppressor is sensitive to photoreactivation. Direct photoreactivation by photoreactivating light (PRL) after uv mutagenesis reduced mutation frequencies by a factor of about 2 for each minute of exposure during the first 5 to 8 min of exposure for cells with normal repair capacity. Conversion and potential de novo suppressor mutations were about equally sensitive. For conversion, the sensitivities to PRL were identical in the repair-normal and excisions-repair-deficient strains. For de novo suppressor mutation, the rate of mutation frequency reduction by PRL in the repair-deficient strain was about one-half that in the other strains. The results suggest that ultraviolet radiation produces both de novo suppressor mutation and conversion at the sup(E,B) locus by photoreversible pyrimidine dimers in the DNA. The causative dimers could be Thy()Cyt dimers in the transcribed strand or the non-transcribed strand, respectively.
MCTBI: a web server for predicting metal ion effects in RNA structures.

Science.gov (United States)

Sun, Li-Zhen; Zhang, Jing-Xiang; Chen, Shi-Jie

2017-08-01

Metal ions play critical roles in RNA structure and function. However, web servers and software packages for predicting ion effects in RNA structures are notably scarce. Furthermore, the existing web servers and software packages mainly neglect ion correlation and fluctuation effects, which are potentially important for RNAs. We here report a new web server, the MCTBI server (http://rna.physics.missouri.edu/MCTBI), for the prediction of ion effects for RNA structures. This server is based on the recently developed MCTBI, a model that can account for ion correlation and fluctuation effects for nucleic acid structures and can provide improved predictions for the effects of metal ions, especially for multivalent ions such as Mg 2+ effects, as shown by extensive theory-experiment test results. The MCTBI web server predicts metal ion binding fractions, the most probable bound ion distribution, the electrostatic free energy of the system, and the free energy components. The results provide mechanistic insights into the role of metal ions in RNA structure formation and folding stability, which is important for understanding RNA functions and the rational design of RNA structures. © 2017 Sun et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
De-novo design of antimicrobial peptides for plant protection.

Directory of Open Access Journals (Sweden)

Benjamin Zeitler

Full Text Available This work describes the de-novo design of peptides that inhibit a broad range of plant pathogens. Four structurally different groups of peptides were developed that differ in size and position of their charged and hydrophobic clusters and were assayed for their ability to inhibit bacterial growth and fungal spore germination. Several peptides are highly active at concentrations between 0,1 and 1 µg/ml against plant pathogenic bacteria, such as Pseudomonas syringae, Pectobacterium carotovorum, and Xanthomonas vesicatoria. Importantly, no hemolytic activity could be detected for these peptides at concentrations up to 200 µg/ml. Moreover, the peptides are also active after spraying on the plant surface demonstrating a possible way of application. In sum, our designed peptides represent new antimicrobial agents and with the increasing demand for antimicrobial compounds for production of "healthy" food, these peptides might serve as templates for novel antibacterial and antifungal agents.

Identifying wrong assemblies in de novo short read primary ...

Indian Academy of Sciences (India)

2016-08-05

Aug 5, 2016 ... Most of these assemblies are done using some de novo short read assemblers and other related approaches. .... benchmarking projects like Assemblathon 1, Assemblathon ... from a large insert library (at least 1000 bases).
KrillDB: A de novo transcriptome database for the Antarctic krill (Euphausia superba.

Directory of Open Access Journals (Sweden)

Gabriele Sales

Full Text Available Antarctic krill (Euphausia superba is a key species in the Southern Ocean with an estimated biomass between 100 and 500 million tonnes. Changes in krill population viability would have catastrophic effect on the Antarctic ecosystem. One looming threat due to elevated levels of anthropogenic atmospheric carbon dioxide (CO2 is ocean acidification (lowering of sea water pH by CO2 dissolving into the oceans. The genetics of Antarctic krill has long been of scientific interest for both for the analysis of population structure and analysis of functional genetics. However, the genetic resources available for the species are relatively modest. We have developed the most advanced genetic database on Euphausia superba, KrillDB, which includes comprehensive data sets of former and present transcriptome projects. In particular, we have built a de novo transcriptome assembly using more than 360 million Illumina sequence reads generated from larval krill including individuals subjected to different CO2 levels. The database gives access to: 1 the full list of assembled genes and transcripts; 2 their level of similarity to transcripts and proteins from other species; 3 the predicted protein domains contained within each transcript; 4 their predicted GO terms; 5 the level of expression of each transcript in the different larval stages and CO2 treatments. All references to external entities (sequences, domains, GO terms are equipped with a link to the appropriate source database. Moreover, the software implements a full-text search engine that makes it possible to submit free-form queries. KrillDB represents the first large-scale attempt at classifying and annotating the full krill transcriptome. For this reason, we believe it will constitute a cornerstone of future approaches devoted to physiological and molecular study of this key species in the Southern Ocean food web.
A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures

Science.gov (United States)

2014-01-01

Background Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. Results We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. Conclusions Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. Iterative HFold and all data used in
Analysis of 60 706 Exomes Questions the Role of De Novo Variants Previously Implicated in Cardiac Disease

DEFF Research Database (Denmark)

Paludan-Müller, Christian; Ahlberg, Gustav; Ghouse, Jonas

2017-01-01

BACKGROUND: De novo variants in the exome occur at a rate of 1 per individual per generation, and because of the low reproductive fitness for de novo variants causing severe disease, the likelihood of finding these as standing variations in the general population is low. Therefore, this study...... sought to evaluate the pathogenicity of de novo variants previously associated with cardiac disease based on a large population-representative exome database. METHODS AND RESULTS: We performed a literature search for previous publications on de novo variants associated with severe arrhythmias...... trio studies (>1000 subjects). Of the monogenic variants, 11% (23/211) were present in ExAC, whereas 26% (802/3050) variants believed to increase susceptibility of disease were identified in ExAC. Monogenic de novo variants in ExAC had a total allele count of 109 and with ≈844 expected cases in Ex...
Similar prognosis of transformed and de novo diffuse large B-cell lymphomas in patients treated with immunochemotherapy.

Science.gov (United States)

Sorigue, Marc; Garcia, Olga; Baptista, Maria Joao; Sancho, Juan-Manuel; Tapia, Gustavo; Mate, José Luis; Feliu, Evarist; Navarro, José-Tomás; Ribera, Josep-Maria

2017-03-22

The prognosis of diffuse large B-cell lymphomas (DLBCL) transformed from indolent lymphoma (TL) has been considered poorer than that of de novo DLBCL. However, it seems to have improved since the introduction of rituximab. We compared the characteristics (including the cell-of-origin), and the prognosis of 29 patients with TL and 101 with de novo DLBCL treated with immunochemotherapy. Patients with TL and de novo DLBCL had similar characteristics. All TL cases evolving from follicular lymphoma were germinal-center B-cell-like, while those TL from marginal zone lymphoma or chronic lymphocytic leukemia were non-germinal-center B-cell-like. The complete response rate was similar in TL and de novo DLBCL (62 vs. 66%, P=.825). The 5-year overall and progression-free survival probabilities (95% CI) were 59% (40-78) and 41% (22-60) for TL and 63% (53-73) and 60% (50-70) for de novo DLBCL, respectively (P=.732 for overall survival and P=.169 for progression-free survival). In this study, the prognosis of TL and de novo DLBCL treated with immunochemotherapy was similar. The role of intensification with stem cell transplantation in the management of TL may be questionable in the rituximab era. Copyright © 2016 Elsevier España, S.L.U. All rights reserved.
Bi-objective integer programming for RNA secondary structure prediction with pseudoknots.

Science.gov (United States)

Legendre, Audrey; Angel, Eric; Tahi, Fariza

2018-01-15

RNA structure prediction is an important field in bioinformatics, and numerous methods and tools have been proposed. Pseudoknots are specific motifs of RNA secondary structures that are difficult to predict. Almost all existing methods are based on a single model and return one solution, often missing the real structure. An alternative approach would be to combine different models and return a (small) set of solutions, maximizing its quality and diversity in order to increase the probability that it contains the real structure. We propose here an original method for predicting RNA secondary structures with pseudoknots, based on integer programming. We developed a generic bi-objective integer programming algorithm allowing to return optimal and sub-optimal solutions optimizing simultaneously two models. This algorithm was then applied to the combination of two known models of RNA secondary structure prediction, namely MEA and MFE. The resulting tool, called BiokoP, is compared with the other methods in the literature. The results show that the best solution (structure with the highest F 1 -score) is, in most cases, given by BiokoP. Moreover, the results of BiokoP are homogeneous, regardless of the pseudoknot type or the presence or not of pseudoknots. Indeed, the F 1 -scores are always higher than 70% for any number of solutions returned. The results obtained by BiokoP show that combining the MEA and the MFE models, as well as returning several optimal and several sub-optimal solutions, allow to improve the prediction of secondary structures. One perspective of our work is to combine better mono-criterion models, in particular to combine a model based on the comparative approach with the MEA and the MFE models. This leads to develop in the future a new multi-objective algorithm to combine more than two models. BiokoP is available on the EvryRNA platform: https://EvryRNA.ibisc.univ-evry.fr .
Cascaded bidirectional recurrent neural networks for protein secondary structure prediction.

Science.gov (United States)

Chen, Jinmiao; Chaudhari, Narendra

2007-01-01

Protein secondary structure (PSS) prediction is an important topic in bioinformatics. Our study on a large set of non-homologous proteins shows that long-range interactions commonly exist and negatively affect PSS prediction. Besides, we also reveal strong correlations between secondary structure (SS) elements. In order to take into account the long-range interactions and SS-SS correlations, we propose a novel prediction system based on cascaded bidirectional recurrent neural network (BRNN). We compare the cascaded BRNN against another two BRNN architectures, namely the original BRNN architecture used for speech recognition as well as Pollastri's BRNN that was proposed for PSS prediction. Our cascaded BRNN achieves an overall three state accuracy Q3 of 74.38\\%, and reaches a high Segment OVerlap (SOV) of 66.0455. It outperforms the original BRNN and Pollastri's BRNN in both Q3 and SOV. Specifically, it improves the SOV score by 4-6%.
Theoretical prediction of low-density hexagonal ZnO hollow structures

Energy Technology Data Exchange (ETDEWEB)

Tuoc, Vu Ngoc, E-mail: tuoc.vungoc@hust.edu.vn [Institute of Engineering Physics, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi (Viet Nam); Huan, Tran Doan [Institute of Materials Science, University of Connecticut, Storrs, Connecticut 06269-3136 (United States); Thao, Nguyen Thi [Institute of Engineering Physics, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi (Viet Nam); Hong Duc University, 307 Le Lai, Thanh Hoa City (Viet Nam); Tuan, Le Manh [Hong Duc University, 307 Le Lai, Thanh Hoa City (Viet Nam)

2016-10-14

Along with wurtzite and zinc blende, zinc oxide (ZnO) has been found in a large number of polymorphs with substantially different properties and, hence, applications. Therefore, predicting and synthesizing new classes of ZnO polymorphs are of great significance and have been gaining considerable interest. Herein, we perform a density functional theory based tight-binding study, predicting several new series of ZnO hollow structures using the bottom-up approach. The geometry of the building blocks allows for obtaining a variety of hexagonal, low-density nanoporous, and flexible ZnO hollow structures. Their stability is discussed by means of the free energy computed within the lattice-dynamics approach. Our calculations also indicate that all the reported hollow structures are wide band gap semiconductors in the same fashion with bulk ZnO. The electronic band structures of the ZnO hollow structures are finally examined in detail.
Sparse RNA folding revisited: space-efficient minimum free energy structure prediction.

Science.gov (United States)

Will, Sebastian; Jabbari, Hosna

2016-01-01

RNA secondary structure prediction by energy minimization is the central computational tool for the analysis of structural non-coding RNAs and their interactions. Sparsification has been successfully applied to improve the time efficiency of various structure prediction algorithms while guaranteeing the same result; however, for many such folding problems, space efficiency is of even greater concern, particularly for long RNA sequences. So far, space-efficient sparsified RNA folding with fold reconstruction was solved only for simple base-pair-based pseudo-energy models. Here, we revisit the problem of space-efficient free energy minimization. Whereas the space-efficient minimization of the free energy has been sketched before, the reconstruction of the optimum structure has not even been discussed. We show that this reconstruction is not possible in trivial extension of the method for simple energy models. Then, we present the time- and space-efficient sparsified free energy minimization algorithm SparseMFEFold that guarantees MFE structure prediction. In particular, this novel algorithm provides efficient fold reconstruction based on dynamically garbage-collected trace arrows. The complexity of our algorithm depends on two parameters, the number of candidates Z and the number of trace arrows T; both are bounded by [Formula: see text], but are typically much smaller. The time complexity of RNA folding is reduced from [Formula: see text] to [Formula: see text]; the space complexity, from [Formula: see text] to [Formula: see text]. Our empirical results show more than 80 % space savings over RNAfold [Vienna RNA package] on the long RNAs from the RNA STRAND database (≥2500 bases). The presented technique is intentionally generalizable to complex prediction algorithms; due to their high space demands, algorithms like pseudoknot prediction and RNA-RNA-interaction prediction are expected to profit even stronger than "standard" MFE folding. SparseMFEFold is free
Rapid centriole assembly in Naegleria reveals conserved roles for both de novo and mentored assembly.

Science.gov (United States)

Fritz-Laylin, Lillian K; Levy, Yaron Y; Levitan, Edward; Chen, Sean; Cande, W Zacheus; Lai, Elaine Y; Fulton, Chandler

2016-03-01

Centrioles are eukaryotic organelles whose number and position are critical for cilia formation and mitosis. Many cell types assemble new centrioles next to existing ones ("templated" or mentored assembly). Under certain conditions, centrioles also form without pre-existing centrioles (de novo). The synchronous differentiation of Naegleria amoebae to flagellates represents a unique opportunity to study centriole assembly, as nearly 100% of the population transitions from having no centrioles to having two within minutes. Here, we find that Naegleria forms its first centriole de novo, immediately followed by mentored assembly of the second. We also find both de novo and mentored assembly distributed among all major eukaryote lineages. We therefore propose that both modes are ancestral and have been conserved because they serve complementary roles, with de novo assembly as the default when no pre-existing centriole is available, and mentored assembly allowing precise regulation of number, timing, and location of centriole assembly. © 2016 Wiley Periodicals, Inc.
Bridge Structure Deformation Prediction Based on GNSS Data Using Kalman-ARIMA-GARCH Model.

Science.gov (United States)

Xin, Jingzhou; Zhou, Jianting; Yang, Simon X; Li, Xiaoqing; Wang, Yu

2018-01-19

Bridges are an essential part of the ground transportation system. Health monitoring is fundamentally important for the safety and service life of bridges. A large amount of structural information is obtained from various sensors using sensing technology, and the data processing has become a challenging issue. To improve the prediction accuracy of bridge structure deformation based on data mining and to accurately evaluate the time-varying characteristics of bridge structure performance evolution, this paper proposes a new method for bridge structure deformation prediction, which integrates the Kalman filter, autoregressive integrated moving average model (ARIMA), and generalized autoregressive conditional heteroskedasticity (GARCH). Firstly, the raw deformation data is directly pre-processed using the Kalman filter to reduce the noise. After that, the linear recursive ARIMA model is established to analyze and predict the structure deformation. Finally, the nonlinear recursive GARCH model is introduced to further improve the accuracy of the prediction. Simulation results based on measured sensor data from the Global Navigation Satellite System (GNSS) deformation monitoring system demonstrated that: (1) the Kalman filter is capable of denoising the bridge deformation monitoring data; (2) the prediction accuracy of the proposed Kalman-ARIMA-GARCH model is satisfactory, where the mean absolute error increases only from 3.402 mm to 5.847 mm with the increment of the prediction step; and (3) in comparision to the Kalman-ARIMA model, the Kalman-ARIMA-GARCH model results in superior prediction accuracy as it includes partial nonlinear characteristics (heteroscedasticity); the mean absolute error of five-step prediction using the proposed model is improved by 10.12%. This paper provides a new way for structural behavior prediction based on data processing, which can lay a foundation for the early warning of bridge health monitoring system based on sensor data using sensing
Bridge Structure Deformation Prediction Based on GNSS Data Using Kalman-ARIMA-GARCH Model

Directory of Open Access Journals (Sweden)

Jingzhou Xin

2018-01-01

Full Text Available Bridges are an essential part of the ground transportation system. Health monitoring is fundamentally important for the safety and service life of bridges. A large amount of structural information is obtained from various sensors using sensing technology, and the data processing has become a challenging issue. To improve the prediction accuracy of bridge structure deformation based on data mining and to accurately evaluate the time-varying characteristics of bridge structure performance evolution, this paper proposes a new method for bridge structure deformation prediction, which integrates the Kalman filter, autoregressive integrated moving average model (ARIMA, and generalized autoregressive conditional heteroskedasticity (GARCH. Firstly, the raw deformation data is directly pre-processed using the Kalman filter to reduce the noise. After that, the linear recursive ARIMA model is established to analyze and predict the structure deformation. Finally, the nonlinear recursive GARCH model is introduced to further improve the accuracy of the prediction. Simulation results based on measured sensor data from the Global Navigation Satellite System (GNSS deformation monitoring system demonstrated that: (1 the Kalman filter is capable of denoising the bridge deformation monitoring data; (2 the prediction accuracy of the proposed Kalman-ARIMA-GARCH model is satisfactory, where the mean absolute error increases only from 3.402 mm to 5.847 mm with the increment of the prediction step; and (3 in comparision to the Kalman-ARIMA model, the Kalman-ARIMA-GARCH model results in superior prediction accuracy as it includes partial nonlinear characteristics (heteroscedasticity; the mean absolute error of five-step prediction using the proposed model is improved by 10.12%. This paper provides a new way for structural behavior prediction based on data processing, which can lay a foundation for the early warning of bridge health monitoring system based on sensor data
Chemical structure-based predictive model for methanogenic anaerobic biodegradation potential.

Science.gov (United States)

Meylan, William; Boethling, Robert; Aronson, Dallas; Howard, Philip; Tunkel, Jay

2007-09-01

Many screening-level models exist for predicting aerobic biodegradation potential from chemical structure, but anaerobic biodegradation generally has been ignored by modelers. We used a fragment contribution approach to develop a model for predicting biodegradation potential under methanogenic anaerobic conditions. The new model has 37 fragments (substructures) and classifies a substance as either fast or slow, relative to the potential to be biodegraded in the "serum bottle" anaerobic biodegradation screening test (Organization for Economic Cooperation and Development Guideline 311). The model correctly classified 90, 77, and 91% of the chemicals in the training set (n = 169) and two independent validation sets (n = 35 and 23), respectively. Accuracy of predictions of fast and slow degradation was equal for training-set chemicals, but fast-degradation predictions were less accurate than slow-degradation predictions for the validation sets. Analysis of the signs of the fragment coefficients for this and the other (aerobic) Biowin models suggests that in the context of simple group contribution models, the majority of positive and negative structural influences on ultimate degradation are the same for aerobic and methanogenic anaerobic biodegradation.
Protein 8-class secondary structure prediction using conditional neural fields.

Science.gov (United States)

Wang, Zhiyong; Zhao, Feng; Peng, Jian; Xu, Jinbo

2011-10-01

Compared with the protein 3-class secondary structure (SS) prediction, the 8-class prediction gains less attention and is also much more challenging, especially for proteins with few sequence homologs. This paper presents a new probabilistic method for 8-class SS prediction using conditional neural fields (CNFs), a recently invented probabilistic graphical model. This CNF method not only models the complex relationship between sequence features and SS, but also exploits the interdependency among SS types of adjacent residues. In addition to sequence profiles, our method also makes use of non-evolutionary information for SS prediction. Tested on the CB513 and RS126 data sets, our method achieves Q8 accuracy of 64.9 and 64.7%, respectively, which are much better than the SSpro8 web server (51.0 and 48.0%, respectively). Our method can also be used to predict other structure properties (e.g. solvent accessibility) of a protein or the SS of RNA. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
De novo assembly of the perennial ryegrass transcriptome using an RNA-seq strategy

DEFF Research Database (Denmark)

Farrell, Jacqueline Danielle; Byrne, Stephen; Paina, Cristiana

2014-01-01

a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals of this study were to perform RNA-sequencing on multiple tissues from a highly inbred genotype to develop a reference transcriptome. This was complemented with RNA-sequencing of a highly heterozygous genotype...... for SNP calling. Result De novo transcriptome assembly of the inbred genotype created 185,833 transcripts with an average length of 830 base pairs. Within the inbred reference transcriptome 78,560 predicted open reading frames were found of which 24,434 were predicted as complete. Functional annotation...... multiple orthologs. Using the longest unique open reading frames as the reference sequence, 64,242 single nucleotide polymorphisms were found. One thousand sixty one open reading frames from the inbred genotype contained heterozygous sites, confirming the high degree of homozygosity. Conclusion Our study...
Structure prediction of AlnOm clusters

International Nuclear Information System (INIS)

Smok, P

2011-01-01

Genetic algorithm simulations, using Buckingham potential to represent the anion-anion and cation-anion short-range interactions, were performed in order to predict the equilibrium positions of the Al and O ions in Al n O m clusters. In order to find the equilibrium structures of compounds a self-organizing genetic algorithm were constructed. The calculation were carried out for several clusters Al n O m , with different numbers of aluminium and oxygen atoms.
Protein Secondary Structure Prediction Using AutoEncoder Network and Bayes Classifier

Science.gov (United States)

Wang, Leilei; Cheng, Jinyong

2018-03-01

Protein secondary structure prediction is belong to bioinformatics,and it's important in research area. In this paper, we propose a new prediction way of protein using bayes classifier and autoEncoder network. Our experiments show some algorithms including the construction of the model, the classification of parameters and so on. The data set is a typical CB513 data set for protein. In terms of accuracy, the method is the cross validation based on the 3-fold. Then we can get the Q3 accuracy. Paper results illustrate that the autoencoder network improved the prediction accuracy of protein secondary structure.
De novo mutation in the dopamine transporter gene associates dopamine dysfunction with autism spectrum disorder

DEFF Research Database (Denmark)

Hamilton, P J; Campbell, N G; Sharma, S

2013-01-01

De novo genetic variation is an important class of risk factors for autism spectrum disorder (ASD). Recently, whole-exome sequencing of ASD families has identified a novel de novo missense mutation in the human dopamine (DA) transporter (hDAT) gene, which results in a Thr to Met substitution...
Novel de novo BRCA2 mutation in a patient with a family history of breast cancer

DEFF Research Database (Denmark)

Hansen, Thomas V O; Bisgaard, Marie Luise; Jønson, Lars

2008-01-01

whole blood. The paternity was determined by single nucleotide polymorphism (SNP) microarray analysis. Parental origin of the de novo mutation was determined by establishing mutation-SNP haplotypes by variant specific PCR, while de novo and mosaic status was investigated by sequencing of DNA from......BACKGROUND: BRCA2 germ-line mutations predispose to breast and ovarian cancer. Mutations are widespread and unclassified splice variants are frequently encountered. We describe the parental origin and functional characterization of a novel de novo BRCA2 splice site mutation found in a patient...... and synthesis of a truncated BRCA2 protein. The aberrant splicing was verified by RT-PCR analysis on RNA isolated from whole blood of the affected patient. The mutation was not found in any of the patient's parents or in the mother's carcinoma, showing it is a de novo mutation. Variant specific PCR indicates...
Structural health monitoring for fatigue life prediction of orthotropic brdige decks

NARCIS (Netherlands)

Pijpers, R.J.M.; Pahlavan, P.L.; Paulissen, J.H.; Hakkesteegt, H.C.; Jansen, T.H.

2013-01-01

Infrastructure asset owners are more and more confronted with structures reaching the end of their structural life. Structural Health Monitoring (SHM) systems should provide up-to-date information about the actual condition, as well predict the structural life and required maintenance of the assets

De novo assembly of highly diverse viral populations

Directory of Open Access Journals (Sweden)

Yang Xiao

2012-09-01

Full Text Available Abstract Background Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo population consensus assembly is valuable both as a single linear representation of the population and as a backbone on which intra-host variants can be accurately mapped. The availability of consensus assemblies and robustly mapped variants are crucial to the genetic study of viral disease progression, transmission dynamics, and viral evolution. Existing de novo assembly techniques fail to robustly assemble ultra-deep sequence data from genetically heterogeneous populations such as viruses into full-length genomes due to the presence of extensive genetic variability, contaminants, and variable sequence coverage. Results We present VICUNA, a de novo assembly algorithm suitable for generating consensus assemblies from genetically heterogeneous populations. We demonstrate its effectiveness on Dengue, Human Immunodeficiency and West Nile viral populations, representing a range of intra-host diversity. Compared to state-of-the-art assemblers designed for haploid or diploid systems, VICUNA recovers full-length consensus and captures insertion/deletion polymorphisms in diverse samples. Final assemblies maintain a high base calling accuracy. VICUNA program is publicly available at: http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/ viral-genomics-analysis-software. Conclusions We developed VICUNA, a publicly available software tool, that enables consensus assembly of ultra-deep sequence derived from diverse viral populations. While VICUNA was developed for the analysis of viral populations, its application to other heterogeneous sequence data sets such as metagenomic or tumor cell population samples may prove beneficial in these fields of research.
Structural genomic variation in childhood epilepsies with complex phenotypes

DEFF Research Database (Denmark)

Helbig, Ingo; Swinkels, Marielle E M; Aten, Emmelien

2014-01-01

of CNVs in patients with unclassified epilepsies and complex phenotypes. A total of 222 patients from three European countries, including patients with structural lesions on magnetic resonance imaging (MRI), dysmorphic features, and multiple congenital anomalies, were clinically evaluated and screened.......9%). Segregation of all identified variants could be assessed in 42 patients, 11 of which were de novo. The frequency of all structural variants and de novo variants was not statistically different between patients with or without MRI abnormalities or MRI subcategories. Patients with dysmorphic features were more...
A method for probing the mutational landscape of amyloid structure.

Science.gov (United States)

O'Donnell, Charles W; Waldispühl, Jérôme; Lis, Mieszko; Halfmann, Randal; Devadas, Srinivas; Lindquist, Susan; Berger, Bonnie

2011-07-01

Proteins of all kinds can self-assemble into highly ordered β-sheet aggregates known as amyloid fibrils, important both biologically and clinically. However, the specific molecular structure of a fibril can vary dramatically depending on sequence and environmental conditions, and mutations can drastically alter amyloid function and pathogenicity. Experimental structure determination has proven extremely difficult with only a handful of NMR-based models proposed, suggesting a need for computational methods. We present AmyloidMutants, a statistical mechanics approach for de novo prediction and analysis of wild-type and mutant amyloid structures. Based on the premise of protein mutational landscapes, AmyloidMutants energetically quantifies the effects of sequence mutation on fibril conformation and stability. Tested on non-mutant, full-length amyloid structures with known chemical shift data, AmyloidMutants offers roughly 2-fold improvement in prediction accuracy over existing tools. Moreover, AmyloidMutants is the only method to predict complete super-secondary structures, enabling accurate discrimination of topologically dissimilar amyloid conformations that correspond to the same sequence locations. Applied to mutant prediction, AmyloidMutants identifies a global conformational switch between Aβ and its highly-toxic 'Iowa' mutant in agreement with a recent experimental model based on partial chemical shift data. Predictions on mutant, yeast-toxic strains of HET-s suggest similar alternate folds. When applied to HET-s and a HET-s mutant with core asparagines replaced by glutamines (both highly amyloidogenic chemically similar residues abundant in many amyloids), AmyloidMutants surprisingly predicts a greatly reduced capacity of the glutamine mutant to form amyloid. We confirm this finding by conducting mutagenesis experiments. Our tool is publically available on the web at http://amyloid.csail.mit.edu/. lindquist_admin@wi.mit.edu; bab@csail.mit.edu.
Application of Functional Use Predictions to Aid in Structure ...

Science.gov (United States)

Humans are potentially exposed to thousands of anthropogenic chemicals in commerce. Recent work has shown that the bulk of this exposure may occur in near-field indoor environments (e.g., home, school, work, etc.). Advances in suspect screening analyses (SSA) now allow an improved understanding of the chemicals present in these environments. However, due to the nature of suspect screening techniques, investigators are often left with chemical formula predictions, with the possibility of many chemical structures matching to each formula. Here, newly developed quantitative structure-use relationship (QSUR) models are used to identify potential exposure sources for candidate structures. Previously, a suspect screening workflow was introduced and applied to house dust samples collected from the U.S. Department of Housing and Urban Development’s American Healthy Homes Survey (AHHS) [Rager, et al., Env. Int. 88 (2016)]. This workflow utilized the US EPA’s Distributed Structure-Searchable Toxicity (DSSTox) Database to link identified molecular features to molecular formulas, and ultimately chemical structures. Multiple QSUR models were applied to support the evaluation of candidate structures. These QSURs predict the likelihood of a chemical having a functional use commonly associated with consumer products having near-field use. For 3,228 structures identified as possible chemicals in AHHS house dust samples, we were able to obtain the required descriptors to appl
Three-dimensional protein structure prediction: Methods and computational strategies.

Science.gov (United States)

Dorn, Márcio; E Silva, Mariel Barbachan; Buriol, Luciana S; Lamb, Luis C

2014-10-12

A long standing problem in structural bioinformatics is to determine the three-dimensional (3-D) structure of a protein when only a sequence of amino acid residues is given. Many computational methodologies and algorithms have been proposed as a solution to the 3-D Protein Structure Prediction (3-D-PSP) problem. These methods can be divided in four main classes: (a) first principle methods without database information; (b) first principle methods with database information; (c) fold recognition and threading methods; and (d) comparative modeling methods and sequence alignment strategies. Deterministic computational techniques, optimization techniques, data mining and machine learning approaches are typically used in the construction of computational solutions for the PSP problem. Our main goal with this work is to review the methods and computational strategies that are currently used in 3-D protein prediction. Copyright © 2014 Elsevier Ltd. All rights reserved.
Protein structure predictions with Monte Carlo simulated annealing: Case for the β-sheet

Science.gov (United States)

Okamoto, Y.; Fukugita, M.; Kawai, H.; Nakazawa, T.

Work is continued for a prediction of three-dimensional structure of peptides and proteins with Monte Carlo simulated annealing using only a generic energy function and amino acid sequence as input. We report that β-sheet like structure is successfully predicted for a fragment of bovine pancreatic trypsin inhibitor which is known to have the β-sheet structure in nature. Together with the results for α-helix structure reported earlier, this means that a successful prediction can be made, at least at a qualitative level, for two dominant building blocks of proteins, α-helix and β-sheet, from the information of amino acid sequence alone.
A glance at quality score: implication for de novo transcriptome reconstruction of Illumina reads

Directory of Open Access Journals (Sweden)

Stanley Kimbung Mbandi

2014-02-01

Full Text Available Downstream analyses of short-reads from next-generation sequencing platforms are often preceded by a pre-processing step that removes uncalled and wrongly called bases. Standard approaches rely on their associated base quality scores to retain the read or a portion of it when the score is above a predefined threshold. It is difficult to differentiate sequencing error from biological variation without a reference using quality scores. The effects of quality score based trimming have not been systematically studied in de novo transcriptome assembly. Using RNA-Seq data produced from Illumina, we teased out the effects of quality score base filtering or trimming on de novo transcriptome reconstruction. We showed that assemblies produced from reads subjected to different quality score thresholds contain truncated and missing transfrags when compared to those from untrimmed reads. Our data supports the fact that de novo assembling of untrimmed data is challenging for de Bruijn graph assemblers. However, our results indicates that comparing the assemblies from untrimmed and trimmed read subsets can suggest appropriate filtering parameters and enable selection of the optimum de novo transcriptome assembly in non-model organisms.
MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.

Science.gov (United States)

Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka

2018-05-08

Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on
Efficient assembly of de novo human artificial chromosomes from large genomic loci

Directory of Open Access Journals (Sweden)

Stromberg Gregory

2005-07-01

Full Text Available Abstract Background Human Artificial Chromosomes (HACs are potentially useful vectors for gene transfer studies and for functional annotation of the genome because of their suitability for cloning, manipulating and transferring large segments of the genome. However, development of HACs for the transfer of large genomic loci into mammalian cells has been limited by difficulties in manipulating high-molecular weight DNA, as well as by the low overall frequencies of de novo HAC formation. Indeed, to date, only a small number of large (>100 kb genomic loci have been reported to be successfully packaged into de novo HACs. Results We have developed novel methodologies to enable efficient assembly of HAC vectors containing any genomic locus of interest. We report here the creation of a novel, bimolecular system based on bacterial artificial chromosomes (BACs for the construction of HACs incorporating any defined genomic region. We have utilized this vector system to rapidly design, construct and validate multiple de novo HACs containing large (100–200 kb genomic loci including therapeutically significant genes for human growth hormone (HGH, polycystic kidney disease (PKD1 and ß-globin. We report significant differences in the ability of different genomic loci to support de novo HAC formation, suggesting possible effects of cis-acting genomic elements. Finally, as a proof of principle, we have observed sustained ß-globin gene expression from HACs incorporating the entire 200 kb ß-globin genomic locus for over 90 days in the absence of selection. Conclusion Taken together, these results are significant for the development of HAC vector technology, as they enable high-throughput assembly and functional validation of HACs containing any large genomic locus. We have evaluated the impact of different genomic loci on the frequency of HAC formation and identified segments of genomic DNA that appear to facilitate de novo HAC formation. These genomic loci
De novo synthesis of milk triglycerides in humans

Science.gov (United States)

Mammary gland (MG) de novo lipogenesis contributes significantly to milk fat in animals but little is known in humans. Objective: To test the hypothesis that the incorporation of 13C carbons from [U-13C]glucose into fatty acids (FA) and glycerol in triglycerides (TG) will be greater: 1) in milk tha...
Virality Prediction and Community Structure in Social Networks

Science.gov (United States)

Weng, Lilian; Menczer, Filippo; Ahn, Yong-Yeol

2013-08-01

How does network structure affect diffusion? Recent studies suggest that the answer depends on the type of contagion. Complex contagions, unlike infectious diseases (simple contagions), are affected by social reinforcement and homophily. Hence, the spread within highly clustered communities is enhanced, while diffusion across communities is hampered. A common hypothesis is that memes and behaviors are complex contagions. We show that, while most memes indeed spread like complex contagions, a few viral memes spread across many communities, like diseases. We demonstrate that the future popularity of a meme can be predicted by quantifying its early spreading pattern in terms of community concentration. The more communities a meme permeates, the more viral it is. We present a practical method to translate data about community structure into predictive knowledge about what information will spread widely. This connection contributes to our understanding in computational social science, social media analytics, and marketing applications.
76 FR 68767 - Draft Guidance for Industry and Food and Drug Administration Staff; De Novo Classification...

Science.gov (United States)

2011-11-07

... DEPARTMENT OF HEALTH AND HUMAN SERVICES Food and Drug Administration [Docket No. FDA-2011-D-0689] Draft Guidance for Industry and Food and Drug Administration Staff; De Novo Classification Process... for Industry and Food and Drug Administration Staff; De Novo Classification Process (Evaluation of...
De novo Biosynthesis of "Non-Natural" Thaxtomin Phytotoxins.

Science.gov (United States)

Winn, Michael; Francis, Daniel; Micklefield, Jason

2018-03-30

Thaxtomins are diketopiperazine phytotoxins produced by Streptomyces scabies and other actinobacterial plant pathogens that inhibit cellulose biosynthesis in plants. Due to their potent bioactivity and novel mode of action there has been considerable interest in developing thaxtomins as herbicides for crop protection. To address the need for more stable derivatives, we have developed a new approach for structural diversification of thaxtomins. Genes encoding the thaxtomin NRPS from S. scabies, along with genes encoding a promiscuous tryptophan synthase (TrpS) from Salmonella typhimurium, were assembled in a heterologous host Streptomyces albus. Upon feeding indole derivatives to the engineered S. albus strain, tryptophan intermediates with alternative substituents are biosynthesized and incorporated by the NRPS to deliver a series of thaxtomins with different functionalities in place of the nitro group. The approach described herein, demonstrates how genes from different pathways and different bacterial origins can be combined in a heterologous host to create a de novo biosynthetic pathway to "non-natural" product target compounds. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Polymer physics predicts the effects of structural variants on chromatin architecture.

Science.gov (United States)

Bianco, Simona; Lupiáñez, Darío G; Chiariello, Andrea M; Annunziatella, Carlo; Kraft, Katerina; Schöpflin, Robert; Wittler, Lars; Andrey, Guillaume; Vingron, Martin; Pombo, Ana; Mundlos, Stefan; Nicodemi, Mario

2018-05-01

Structural variants (SVs) can result in changes in gene expression due to abnormal chromatin folding and cause disease. However, the prediction of such effects remains a challenge. Here we present a polymer-physics-based approach (PRISMR) to model 3D chromatin folding and to predict enhancer-promoter contacts. PRISMR predicts higher-order chromatin structure from genome-wide chromosome conformation capture (Hi-C) data. Using the EPHA4 locus as a model, the effects of pathogenic SVs are predicted in silico and compared to Hi-C data generated from mouse limb buds and patient-derived fibroblasts. PRISMR deconvolves the folding complexity of the EPHA4 locus and identifies SV-induced ectopic contacts and alterations of 3D genome organization in homozygous or heterozygous states. We show that SVs can reconfigure topologically associating domains, thereby producing extensive rewiring of regulatory interactions and causing disease by gene misexpression. PRISMR can be used to predict interactions in silico, thereby providing a tool for analyzing the disease-causing potential of SVs.
De novo insertions and deletions of predominantly paternal origin are associated with autism spectrum disorder

Science.gov (United States)

Dong, Shan; Walker, Michael F.; Carriero, Nicholas J.; DiCola, Michael; Willsey, A. Jeremy; Ye, Adam Y.; Waqar, Zainulabedin; Gonzalez, Luis E.; Overton, John D.; Frahm, Stephanie; Keaney, John F.; Teran, Nicole A.; Dea, Jeanselle; Mandell, Jeffrey D.; Bal, Vanessa Hus; Sullivan, Catherine A.; DiLullo, Nicholas M.; Khalil, Rehab O.; Gockley, Jake; Yuksel, Zafer; Sertel, Sinem M.; Ercan-Sencicek, A. Gulhan; Gupta, Abha R.; Mane, Shrikant M.; Sheldon, Michael; Brooks, Andrew I.; Roeder, Kathryn; Devlin, Bernie; State, Matthew W.; Wei, Liping; Sanders, Stephan J.

2014-01-01

SUMMARY Whole-exome sequencing (WES) studies have demonstrated the contribution of de novo loss-of-function single nucleotide variants to autism spectrum disorders (ASD). However, challenges in the reliable detection of de novo insertions and deletions (indels) have limited inclusion of these variants in prior analyses. Through the application of a robust indel detection method to WES data from 787 ASD families (2,963 individuals), we demonstrate that de novo frameshift indels contribute to ASD risk (OR=1.6; 95%CI=1.0-2.7; p=0.03), are more common in female probands (p=0.02), are enriched among genes encoding FMRP targets (p=6×10−9), and arise predominantly on the paternal chromosome (p<0.001). Based on mutation rates in probands versus unaffected siblings, de novo frameshift indels contribute to risk in approximately 3.0% of individuals with ASD. Finally, through observing clustering of mutations in unrelated probands, we report two novel ASD-associated genes: KMT2E (MLL5), a chromatin regulator, and RIMS1, a regulator of synaptic vesicle release. PMID:25284784
Prediction of coronal structure of the solar eclipse of October 23, 1976

International Nuclear Information System (INIS)

Schatten, K.H.

1976-01-01

Earlier work on the prediction of solar eclipse coronal structures is briefly summarised. A computer drawn plot made on October 18 1976 showed the field time structure predicted for the time of the solar eclipse on October 23. A very dipolar coronal field was indicated, and a very large equatorial streamer was predicted for both the east and west limbs of the Sun, due to the lack of very strong active regions near either limb. Nested coronal arches were seen within this equatorial streamer, and many small arches were also seen on both limbs. The main feature, however, is the prediction of the two large bright streamers marking the solar equator, with polar plumes in a characteristic dipole fashion. At the time of the eclipse it is hoped that a high resolution photograph will allow much of the structure to be discovered. (U.K.)
RANDOM FUNCTIONS AND INTERVAL METHOD FOR PREDICTING THE RESIDUAL RESOURCE OF BUILDING STRUCTURES

Directory of Open Access Journals (Sweden)

Shmelev Gennadiy Dmitrievich

2017-11-01

Full Text Available Subject: possibility of using random functions and interval prediction method for estimating the residual life of building structures in the currently used buildings. Research objectives: coordination of ranges of values to develop predictions and random functions that characterize the processes being predicted. Materials and methods: when performing this research, the method of random functions and the method of interval prediction were used. Results: in the course of this work, the basic properties of random functions, including the properties of families of random functions, are studied. The coordination of time-varying impacts and loads on building structures is considered from the viewpoint of their influence on structures and representation of the structures’ behavior in the form of random functions. Several models of random functions are proposed for predicting individual parameters of structures. For each of the proposed models, its scope of application is defined. The article notes that the considered approach of forecasting has been used many times at various sites. In addition, the available results allowed the authors to develop a methodology for assessing the technical condition and residual life of building structures for the currently used facilities. Conclusions: we studied the possibility of using random functions and processes for the purposes of forecasting the residual service lives of structures in buildings and engineering constructions. We considered the possibility of using an interval forecasting approach to estimate changes in defining parameters of building structures and their technical condition. A comprehensive technique for forecasting the residual life of building structures using the interval approach is proposed.
MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing

DEFF Research Database (Denmark)

Lindgreen, Stinus; Gardner, Paul P; Krogh, Anders

2007-01-01

function that considers sequence conservation, covariation and basepairing probabilities. The results show that the method is very competitive to similar programs available today, both in terms of accuracy and computational efficiency. AVAILABILITY: Source code available from http://mastr.binf.ku.dk/......MOTIVATION: As more non-coding RNAs are discovered, the importance of methods for RNA analysis increases. Since the structure of ncRNA is intimately tied to the function of the molecule, programs for RNA structure prediction are necessary tools in this growing field of research. Furthermore......, it is known that RNA structure is often evolutionarily more conserved than sequence. However, few existing methods are capable of simultaneously considering multiple sequence alignment and structure prediction. RESULT: We present a novel solution to the problem of simultaneous structure prediction...
Alpha complexes in protein structure prediction

DEFF Research Database (Denmark)

Winter, Pawel; Fonseca, Rasmus

2015-01-01

Reducing the computational effort and increasing the accuracy of potential energy functions is of utmost importance in modeling biological systems, for instance in protein structure prediction, docking or design. Evaluating interactions between nonbonded atoms is the bottleneck of such computations......-complexes from scratch for every configuration encountered during the search for the native structure would make this approach hopelessly slow. However, it is argued that kinetic a-complexes can be used to reduce the computational effort of determining the potential energy when "moving" from one configuration...... to a neighboring one. As a consequence, relatively expensive (initial) construction of an a-complex is expected to be compensated by subsequent fast kinetic updates during the search process. Computational results presented in this paper are limited. However, they suggest that the applicability of a...
TMDIM: an improved algorithm for the structure prediction of transmembrane domains of bitopic dimers

Science.gov (United States)

Cao, Han; Ng, Marcus C. K.; Jusoh, Siti Azma; Tai, Hio Kuan; Siu, Shirley W. I.

2017-09-01

α-Helical transmembrane proteins are the most important drug targets in rational drug development. However, solving the experimental structures of these proteins remains difficult, therefore computational methods to accurately and efficiently predict the structures are in great demand. We present an improved structure prediction method TMDIM based on Park et al. (Proteins 57:577-585, 2004) for predicting bitopic transmembrane protein dimers. Three major algorithmic improvements are introduction of the packing type classification, the multiple-condition decoy filtering, and the cluster-based candidate selection. In a test of predicting nine known bitopic dimers, approximately 78% of our predictions achieved a successful fit (RMSD PHP, MySQL and Apache, with all major browsers supported.

TRANSPORT OF PATIENTS FOR PRIMARY PTCA FROM GENERAL HOSPITAL NOVO MESTO TO LJUBLJANA IN 2002

Directory of Open Access Journals (Sweden)

Renata Okrajšek

2004-12-01

Full Text Available Background. The treatment of acute coronary syndrome (ACS with ST-segment elevation with primary percutaneous transluminal coronary angioplasty (PTCA is the best way to treat these patients. Primary PTCA is also practicable with patients who are admitted into institution without catheter laboratory. The transport of patients into the tertiary institution is safe, but it is important to keep the time of ischemia as short as possible and to reach the time interval of door-balloon as recommended by the guidelines. The ACS patients with ST-segment elevation that were directed into General Hospital Novo mesto after examination at the internistic emergency department have been redirected to KC Ljubljana for realization of PTCA since October 2001.Methods. A prospective analysis of patients with ACS with STsegment elevation, who had been transferred from General Hospital Novo mesto to KC Ljubljana in the period from January 1, 2002 to December 31, 2002 to have a primary PTCA, was performed. The analysis comprised the following: the time interval of handling the patients at Internistic department of General Hospital Novo mesto, the time of transport of patients to Ljubljana and total time interval from the arrival of patients to General Hospital Novo mesto to the first inflation of balloon in Ljubljana. We monitored the complications that occurred during the treatment of the patients.Results. In the above mentioned period 29 patients (24 males and 5 females were transported from the General Hospital Novo mesto to the KC Ljubljana to have a primary PTCA performed. The total time interval measured between the patients’ arrival to General Hospital Novo mesto to the first inflation of balloon in Ljubljana in the year 2002 was 145 minutes, which is 17 minutes better than in the previous period. The time interval recommended by the guidelines was achieved with four patients.Conclusions. By recognizing the problems that had encountered with directing the
Illumina-based de novo transcriptome sequencing and analysis

Indian Academy of Sciences (India)

In the present study, we used Illumina HiSeq technology to perform de novo assembly of heart and musk gland transcriptomes from the Chinese forest musk deer. A total of 239,383 transcripts and 176,450 unigenes were obtained, of which 37,329 unigenes were matched to known sequences in the NCBI nonredundant ...
Developing de novo human artificial chromosomes in embryonic stem cells using HSV-1 amplicon technology.

Science.gov (United States)

Moralli, Daniela; Monaco, Zoia L

2015-02-01

De novo artificial chromosomes expressing genes have been generated in human embryonic stem cells (hESc) and are maintained following differentiation into other cell types. Human artificial chromosomes (HAC) are small, functional, extrachromosomal elements, which behave as normal chromosomes in human cells. De novo HAC are generated following delivery of alpha satellite DNA into target cells. HAC are characterized by high levels of mitotic stability and are used as models to study centromere formation and chromosome organisation. They are successful and effective as gene expression vectors since they remain autonomous and can accommodate larger genes and regulatory regions for long-term expression studies in cells unlike other viral gene delivery vectors currently used. Transferring the essential DNA sequences for HAC formation intact across the cell membrane has been challenging for a number of years. A highly efficient delivery system based on HSV-1 amplicons has been used to target DNA directly to the ES cell nucleus and HAC stably generated in human embryonic stem cells (hESc) at high frequency. HAC were detected using an improved protocol for hESc chromosome harvesting, which consistently produced high-quality metaphase spreads that could routinely detect HAC in hESc. In tumour cells, the input DNA often integrated in the host chromosomes, but in the host ES genome, it remained intact. The hESc containing the HAC formed embryoid bodies, generated teratoma in mice, and differentiated into neuronal cells where the HAC were maintained. The HAC structure and chromatin composition was similar to the endogenous hESc chromosomes. This review will discuss the technological advances in HAC vector delivery using HSV-1 amplicons and the improvements in the identification of de novo HAC in hESc.
De novo biosynthesis of anthocyanins in Saccharomyces cerevisiae.

Science.gov (United States)

Eichenberger, Michael; Hansson, Anders; Fischer, David; Dürr, Lara; Naesby, Michael

2018-06-01

Anthocyanins (ACNs) are plant secondary metabolites responsible for most of the red, purple and blue colors of flowers, fruits and vegetables. They are increasingly used in the food and beverage industry as natural alternative to artificial colorants. Production of these compounds by fermentation of microorganisms would provide an attractive alternative. In this study, Saccharomyces cerevisiae was engineered for de novo production of the three basic anthocyanins, as well as the three main trans-flavan-3-ols. Enzymes from different plant sources were screened and efficient variants found for most steps of the biosynthetic pathway. However, the anthocyanidin synthase was identified as a major obstacle to efficient production. In yeast, this enzyme converts the majority of its natural substrates leucoanthocyanidins into the off-pathway flavonols. Nonetheless, de novo biosynthesis of ACNs was shown for the first time in yeast and for the first time in a single microorganism. It provides a framework for optimizing the activity of anthocyanidin synthase and represents an important step towards sustainable industrial production of these highly relevant molecules in yeast.
Structure life prediction at high temperature: present and future capabilities

International Nuclear Information System (INIS)

Chaboche, J.L.

1987-01-01

The life prediction techniques for high temperature conditions include several aspects which are considered successively in this article. Crack initiation criteria themselves, defined for the isolated volume element (the tension-compression specimen for example), including parametric relationships and continuous damage approaches and calculation of local stress and strain fields in the structure and their evolution under cyclic plasticity, which poses several difficult problems to obtain stabilized cyclic solutions are examined. The use of crack initiation criteria or damage rules from the result of the cyclic inelastic analysis and the prediction of crack growth in the structure are considered. Different levels are considered for the predictive tools: the classical approach, future methods presently under development and intermediate rules, which are already in use. Several examples are given on materials and components used either in the nuclear industry or in gas turbine engines. (author)
Predicting adverse drug reaction profiles by integrating protein interaction networks with drug structures.

Science.gov (United States)

Huang, Liang-Chin; Wu, Xiaogang; Chen, Jake Y

2013-01-01

The prediction of adverse drug reactions (ADRs) has become increasingly important, due to the rising concern on serious ADRs that can cause drugs to fail to reach or stay in the market. We proposed a framework for predicting ADR profiles by integrating protein-protein interaction (PPI) networks with drug structures. We compared ADR prediction performances over 18 ADR categories through four feature groups-only drug targets, drug targets with PPI networks, drug structures, and drug targets with PPI networks plus drug structures. The results showed that the integration of PPI networks and drug structures can significantly improve the ADR prediction performance. The median AUC values for the four groups were 0.59, 0.61, 0.65, and 0.70. We used the protein features in the best two models, "Cardiac disorders" (median-AUC: 0.82) and "Psychiatric disorders" (median-AUC: 0.76), to build ADR-specific PPI networks with literature supports. For validation, we examined 30 drugs withdrawn from the U.S. market to see if our approach can predict their ADR profiles and explain why they were withdrawn. Except for three drugs having ADRs in the categories we did not predict, 25 out of 27 withdrawn drugs (92.6%) having severe ADRs were successfully predicted by our approach. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
LoopIng: a template-based tool for predicting the structure of protein loops.

KAUST Repository

Messih, Mario Abdel

2015-08-06

Predicting the structure of protein loops is very challenging, mainly because they are not necessarily subject to strong evolutionary pressure. This implies that, unlike the rest of the protein, standard homology modeling techniques are not very effective in modeling their structure. However, loops are often involved in protein function, hence inferring their structure is important for predicting protein structure as well as function.We describe a method, LoopIng, based on the Random Forest automated learning technique, which, given a target loop, selects a structural template for it from a database of loop candidates. Compared to the most recently available methods, LoopIng is able to achieve similar accuracy for short loops (4-10 residues) and significant enhancements for long loops (11-20 residues). The quality of the predictions is robust to errors that unavoidably affect the stem regions when these are modeled. The method returns a confidence score for the predicted template loops and has the advantage of being very fast (on average: 1 min/loop).www.biocomputing.it/loopinganna.tramontano@uniroma1.itSupplementary data are available at Bioinformatics online.
Electronic structure prediction via data-mining the empirical pseudopotential method

Energy Technology Data Exchange (ETDEWEB)

Zenasni, H; Aourag, H [LEPM, URMER, Departement of Physics, University Abou Bakr Belkaid, Tlemcen 13000 (Algeria); Broderick, S R; Rajan, K [Department of Materials Science and Engineering, Iowa State University, Ames, Iowa 50011-2230 (United States)

2010-01-15

We introduce a new approach for accelerating the calculation of the electronic structure of new materials by utilizing the empirical pseudopotential method combined with data mining tools. Combining data mining with the empirical pseudopotential method allows us to convert an empirical approach to a predictive approach. Here we consider tetrahedrally bounded III-V Bi semiconductors, and through the prediction of form factors based on basic elemental properties we can model the band structure and charge density for these semi-conductors, for which limited results exist. This work represents a unique approach to modeling the electronic structure of a material which may be used to identify new promising semi-conductors and is one of the few efforts utilizing data mining at an electronic level. (Abstract Copyright [2010], Wiley Periodicals, Inc.)
Ab-initio conformational epitope structure prediction using genetic algorithm and SVM for vaccine design.

Science.gov (United States)

Moghram, Basem Ameen; Nabil, Emad; Badr, Amr

2018-01-01

T-cell epitope structure identification is a significant challenging immunoinformatic problem within epitope-based vaccine design. Epitopes or antigenic peptides are a set of amino acids that bind with the Major Histocompatibility Complex (MHC) molecules. The aim of this process is presented by Antigen Presenting Cells to be inspected by T-cells. MHC-molecule-binding epitopes are responsible for triggering the immune response to antigens. The epitope's three-dimensional (3D) molecular structure (i.e., tertiary structure) reflects its proper function. Therefore, the identification of MHC class-II epitopes structure is a significant step towards epitope-based vaccine design and understanding of the immune system. In this paper, we propose a new technique using a Genetic Algorithm for Predicting the Epitope Structure (GAPES), to predict the structure of MHC class-II epitopes based on their sequence. The proposed Elitist-based genetic algorithm for predicting the epitope's tertiary structure is based on Ab-Initio Empirical Conformational Energy Program for Peptides (ECEPP) Force Field Model. The developed secondary structure prediction technique relies on Ramachandran Plot. We used two alignment algorithms: the ROSS alignment and TM-Score alignment. We applied four different alignment approaches to calculate the similarity scores of the dataset under test. We utilized the support vector machine (SVM) classifier as an evaluation of the prediction performance. The prediction accuracy and the Area Under Receiver Operating Characteristic (ROC) Curve (AUC) were calculated as measures of performance. The calculations are performed on twelve similarity-reduced datasets of the Immune Epitope Data Base (IEDB) and a large dataset of peptide-binding affinities to HLA-DRB1*0101. The results showed that GAPES was reliable and very accurate. We achieved an average prediction accuracy of 93.50% and an average AUC of 0.974 in the IEDB dataset. Also, we achieved an accuracy of 95
Uridine monophosphate synthetase enables eukaryotic de novo NAD+ biosynthesis from quinolinic acid.

Science.gov (United States)

McReynolds, Melanie R; Wang, Wenqing; Holleran, Lauren M; Hanna-Rose, Wendy

2017-07-07

NAD + biosynthesis is an attractive and promising therapeutic target for influencing health span and obesity-related phenotypes as well as tumor growth. Full and effective use of this target for therapeutic benefit requires a complete understanding of NAD + biosynthetic pathways. Here, we report a previously unrecognized role for a conserved phosphoribosyltransferase in NAD + biosynthesis. Because a required quinolinic acid phosphoribosyltransferase (QPRTase) is not encoded in its genome, Caenorhabditis elegans are reported to lack a de novo NAD + biosynthetic pathway. However, all the genes of the kynurenine pathway required for quinolinic acid (QA) production from tryptophan are present. Thus, we investigated the presence of de novo NAD + biosynthesis in this organism. By combining isotope-tracing and genetic experiments, we have demonstrated the presence of an intact de novo biosynthesis pathway for NAD + from tryptophan via QA, highlighting the functional conservation of this important biosynthetic activity. Supplementation with kynurenine pathway intermediates also boosted NAD + levels and partially reversed NAD + -dependent phenotypes caused by mutation of pnc-1 , which encodes a nicotinamidase required for NAD + salvage biosynthesis, demonstrating contribution of de novo synthesis to NAD + homeostasis. By investigating candidate phosphoribosyltransferase genes in the genome, we determined that the conserved uridine monophosphate phosphoribosyltransferase (UMPS), which acts in pyrimidine biosynthesis, is required for NAD + biosynthesis in place of the missing QPRTase. We suggest that similar underground metabolic activity of UMPS may function in other organisms. This mechanism for NAD + biosynthesis creates novel possibilities for manipulating NAD + biosynthetic pathways, which is key for the future of therapeutics. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Predictive modeling of neuroanatomic structures for brain atrophy detection

Science.gov (United States)

Hu, Xintao; Guo, Lei; Nie, Jingxin; Li, Kaiming; Liu, Tianming

2010-03-01

In this paper, we present an approach of predictive modeling of neuroanatomic structures for the detection of brain atrophy based on cross-sectional MRI image. The underlying premise of applying predictive modeling for atrophy detection is that brain atrophy is defined as significant deviation of part of the anatomy from what the remaining normal anatomy predicts for that part. The steps of predictive modeling are as follows. The central cortical surface under consideration is reconstructed from brain tissue map and Regions of Interests (ROI) on it are predicted from other reliable anatomies. The vertex pair-wise distance between the predicted vertex and the true one within the abnormal region is expected to be larger than that of the vertex in normal brain region. Change of white matter/gray matter ratio within a spherical region is used to identify the direction of vertex displacement. In this way, the severity of brain atrophy can be defined quantitatively by the displacements of those vertices. The proposed predictive modeling method has been evaluated by using both simulated atrophies and MRI images of Alzheimer's disease.
Critical assessment of methods of protein structure prediction (CASP) - round x

KAUST Repository

Moult, John; Fidelis, Krzysztof; Kryshtafovych, Andriy; Schwede, Torsten; Tramontano, Anna

2013-01-01

This article is an introduction to the special issue of the journal PROTEINS, dedicated to the tenth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. The 10 CASP experiments span almost 20 years of progress in the field of protein structure modeling, and there have been enormous advances in methods and model accuracy in that period. Notable in this round is the first sustained improvement of models with refinement methods, using molecular dynamics. For the first time, we tested the ability of modeling methods to make use of sparse experimental three-dimensional contact information, such as may be obtained from new experimental techniques, with encouraging results. On the other hand, new contact prediction methods, though holding considerable promise, have yet to make an impact in CASP testing. The nature of CASP targets has been changing in recent CASPs, reflecting shifts in experimental structural biology, with more irregular structures, more multi-domain and multi-subunit structures, and less standard versions of known folds. When allowance is made for these factors, we continue to see steady progress in the overall accuracy of models, particularly resulting from improvement of non-template regions.
Critical assessment of methods of protein structure prediction (CASP) - round x

KAUST Repository

Moult, John

2013-12-17

This article is an introduction to the special issue of the journal PROTEINS, dedicated to the tenth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. The 10 CASP experiments span almost 20 years of progress in the field of protein structure modeling, and there have been enormous advances in methods and model accuracy in that period. Notable in this round is the first sustained improvement of models with refinement methods, using molecular dynamics. For the first time, we tested the ability of modeling methods to make use of sparse experimental three-dimensional contact information, such as may be obtained from new experimental techniques, with encouraging results. On the other hand, new contact prediction methods, though holding considerable promise, have yet to make an impact in CASP testing. The nature of CASP targets has been changing in recent CASPs, reflecting shifts in experimental structural biology, with more irregular structures, more multi-domain and multi-subunit structures, and less standard versions of known folds. When allowance is made for these factors, we continue to see steady progress in the overall accuracy of models, particularly resulting from improvement of non-template regions.
CentroidFold: a web server for RNA secondary structure prediction

OpenAIRE

Sato, Kengo; Hamada, Michiaki; Asai, Kiyoshi; Mituyama, Toutai

2009-01-01

The CentroidFold web server (http://www.ncrna.org/centroidfold/) is a web application for RNA secondary structure prediction powered by one of the most accurate prediction engine. The server accepts two kinds of sequence data: a single RNA sequence and a multiple alignment of RNA sequences. It responses with a prediction result shown as a popular base-pair notation and a graph representation. PDF version of the graph representation is also available. For a multiple alignment sequence, the ser...
De novo synthesis of purine nucleotides in different fiber types of rat skeletal muscle

International Nuclear Information System (INIS)

Tullson, P.C.; John-Alder, H.; Hood, D.A.; Terjung, R.L.

1986-01-01

The contribution of de novo purine nucleotide synthesis to nucleotide metabolism in skeletal muscles is not known. The authors have determined rates of de novo synthesis in soleus (slow-twitch red), red gastrocnemius (fast-twitch red), and white gastrocnemius (fast-twitch white) using the perfused rat hindquarter. 14 C glycine incorporation into ATP was linear after 1 and 2 hours of perfusion with 0.2 mM added glycine. The intracellular (I) and extracellular (E) specific activity of 14 C glycine was determined by HPLC of phenylisothiocyanate derivatives of neutralized PCA extracts. The rates of de novo synthesis when expressed relative to muscle ATP content show slow and fast-twitch red muscles to be similar and about twice as great as fast-twitch white muscles. This could represent a greater turnover of the adenine nucleotide pool in more oxidative red muscle types
Predicting Consensus Structures for RNA Alignments Via Pseudo-Energy Minimization

Directory of Open Access Journals (Sweden)

Junilda Spirollari

2009-01-01

Full Text Available Thermodynamic processes with free energy parameters are often used in algorithms that solve the free energy minimization problem to predict secondary structures of single RNA sequences. While results from these algorithms are promising, an observation is that single sequence-based methods have moderate accuracy and more information is needed to improve on RNA secondary structure prediction, such as covariance scores obtained from multiple sequence alignments. We present in this paper a new approach to predicting the consensus secondary structure of a set of aligned RNA sequences via pseudo-energy minimization. Our tool, called RSpredict, takes into account sequence covariation and employs effective heuristics for accuracy improvement. RSpredict accepts, as input data, a multiple sequence alignment in FASTA or ClustalW format and outputs the consensus secondary structure of the input sequences in both the Vienna style Dot Bracket format and the Connectivity Table format. Our method was compared with some widely used tools including KNetFold, Pfold and RNAalifold. A comprehensive test on different datasets including Rfam sequence alignments and a multiple sequence alignment obtained from our study on the Drosophila X chromosome reveals that RSpredict is competitive with the existing tools on the tested datasets. RSpredict is freely available online as a web server and also as a jar file for download at http:// datalab.njit.edu/biology/RSpredict.
De novo point mutations in patients diagnosed with ataxic cerebral palsy.

Science.gov (United States)

Parolin Schnekenberg, Ricardo; Perkins, Emma M; Miller, Jack W; Davies, Wayne I L; D'Adamo, Maria Cristina; Pessia, Mauro; Fawcett, Katherine A; Sims, David; Gillard, Elodie; Hudspith, Karl; Skehel, Paul; Williams, Jonathan; O'Regan, Mary; Jayawant, Sandeep; Jefferson, Rosalind; Hughes, Sarah; Lustenberger, Andrea; Ragoussis, Jiannis; Jackson, Mandy; Tucker, Stephen J; Németh, Andrea H

2015-07-01

Cerebral palsy is a sporadic disorder with multiple likely aetiologies, but frequently considered to be caused by birth asphyxia. Genetic investigations are rarely performed in patients with cerebral palsy and there is little proven evidence of genetic causes. As part of a large project investigating children with ataxia, we identified four patients in our cohort with a diagnosis of ataxic cerebral palsy. They were investigated using either targeted next generation sequencing or trio-based exome sequencing and were found to have mutations in three different genes, KCNC3, ITPR1 and SPTBN2. All the mutations were de novo and associated with increased paternal age. The mutations were shown to be pathogenic using a combination of bioinformatics analysis and in vitro model systems. This work is the first to report that the ataxic subtype of cerebral palsy can be caused by de novo dominant point mutations, which explains the sporadic nature of these cases. We conclude that at least some subtypes of cerebral palsy may be caused by de novo genetic mutations and patients with a clinical diagnosis of cerebral palsy should be genetically investigated before causation is ascribed to perinatal asphyxia or other aetiologies. © The Author (2015). Published by Oxford University Press on behalf of the Guarantors of Brain.
3.3 Ga SHRIMP U-Pb zircon age of a felsic metavolcanic rock from the Mundo Novo greenstone belt in the São Francisco craton, Bahia (NE Brazil)

Science.gov (United States)

Peucat, J. J.; Mascarenhas, J. F.; Barbosa, J. S. F.; de Souza, S. L.; Marinho, M. M.; Fanning, C. M.; Leite, C. M. M.

2002-07-01

Felsic metavolcanics associated with supracrustal rocks provide U-Pb zircon and Sm-Nd TDM ages of approximately 3.3 Ga, which establish an Archean age of the Mundo Novo greenstone belt. A granodioritic gneiss from the Mairi complex, located on the eastern boundary of the Mundo Novo greenstone belt, exhibits a zircon evaporation minimum age of 3.04 Ga and a Nd model age of 3.2 Ga. These results constrain the occurrence of at least three major geological units in this area: the Archean Mundo Novo greenstone belt, the Archean Mairi gneisses, and the adjoining Paleoproterozoic (<2.1 Ga) Jacobina sedimentary basin. The Jacobina basin follows the same trend as the Archean structure, extending southward to the Contendas-Mirante belt, in which a similar Archean-Paleoproterozoic association appears. We postulate that during the Paleoproterozoic in the eastern margin of the Gavião block, these Archean greenstone belts constituted a zone of weakness along which a late-stage orogenic sedimentary basin developed.
Model-Based GUI Testing Using Uppaal at Novo Nordisk

DEFF Research Database (Denmark)

H. Hjort, Ulrik; Rasmussen, Jacob Illum; Larsen, Kim Guldstrand

2009-01-01

This paper details a collaboration between Aalborg University and Novo Nordiskin developing an automatic model-based test generation tool for system testing of the graphical user interface of a medical device on an embedded platform. The tool takes as input an UML Statemachine model and generates...
Towards accurate de novo assembly for genomes with repeats

NARCIS (Netherlands)

Bucur, Doina

2017-01-01

De novo genome assemblers designed for short k-mer length or using short raw reads are unlikely to recover complex features of the underlying genome, such as repeats hundreds of bases long. We implement a stochastic machine-learning method which obtains accurate assemblies with repeats and

Engineering and introduction of de novo disulphide bridges in ...

Indian Academy of Sciences (India)

The engineeringof de novo disulphide bridges has been explored as a means to increase the thermal stability of enzymes in the rationalmethod of protein engineering. In this study, Disulphide by Design software, homology modelling and moleculardynamics simulations were used to select appropriate amino acid pairs for ...
On the relevance of sophisticated structural annotations for disulfide connectivity pattern prediction.

Directory of Open Access Journals (Sweden)

Julien Becker

Full Text Available Disulfide bridges strongly constrain the native structure of many proteins and predicting their formation is therefore a key sub-problem of protein structure and function inference. Most recently proposed approaches for this prediction problem adopt the following pipeline: first they enrich the primary sequence with structural annotations, second they apply a binary classifier to each candidate pair of cysteines to predict disulfide bonding probabilities and finally, they use a maximum weight graph matching algorithm to derive the predicted disulfide connectivity pattern of a protein. In this paper, we adopt this three step pipeline and propose an extensive study of the relevance of various structural annotations and feature encodings. In particular, we consider five kinds of structural annotations, among which three are novel in the context of disulfide bridge prediction. So as to be usable by machine learning algorithms, these annotations must be encoded into features. For this purpose, we propose four different feature encodings based on local windows and on different kinds of histograms. The combination of structural annotations with these possible encodings leads to a large number of possible feature functions. In order to identify a minimal subset of relevant feature functions among those, we propose an efficient and interpretable feature function selection scheme, designed so as to avoid any form of overfitting. We apply this scheme on top of three supervised learning algorithms: k-nearest neighbors, support vector machines and extremely randomized trees. Our results indicate that the use of only the PSSM (position-specific scoring matrix together with the CSP (cysteine separation profile are sufficient to construct a high performance disulfide pattern predictor and that extremely randomized trees reach a disulfide pattern prediction accuracy of [Formula: see text] on the benchmark dataset SPX[Formula: see text], which corresponds to
Adaptive Neuro-Fuzzy Inference System Models for Force Prediction of a Mechatronic Flexible Structure

DEFF Research Database (Denmark)

Achiche, S.; Shlechtingen, M.; Raison, M.

2016-01-01

This paper presents the results obtained from a research work investigating the performance of different Adaptive Neuro-Fuzzy Inference System (ANFIS) models developed to predict excitation forces on a dynamically loaded flexible structure. For this purpose, a flexible structure is equipped...... obtained from applying a random excitation force on the flexible structure. The performance of the developed models is evaluated by analyzing the prediction capabilities based on a normalized prediction error. The frequency domain is considered to analyze the similarity of the frequencies in the predicted...... of the sampling frequency and sensor location on the model performance is investigated. The results obtained in this paper show that ANFIS models can be used to set up reliable force predictors for dynamical loaded flexible structures, when a certain degree of inaccuracy is accepted. Furthermore, the comparison...
Structural Dynamic Analyses And Test Predictions For Spacecraft Structures With Non-Linearities

Science.gov (United States)

Vergniaud, Jean-Baptiste; Soula, Laurent; Newerla, Alfred

2012-07-01

The overall objective of the mechanical development and verification process is to ensure that the spacecraft structure is able to sustain the mechanical environments encountered during launch. In general the spacecraft structures are a-priori assumed to behave linear, i.e. the responses to a static load or dynamic excitation, respectively, will increase or decrease proportionally to the amplitude of the load or excitation induced. However, past experiences have shown that various non-linearities might exist in spacecraft structures and the consequences of their dynamic effects can significantly affect the development and verification process. Current processes are mainly adapted to linear spacecraft structure behaviour. No clear rules exist for dealing with major structure non-linearities. They are handled outside the process by individual analysis and margin policy, and analyses after tests to justify the CLA coverage. Non-linearities can primarily affect the current spacecraft development and verification process on two aspects. Prediction of flights loads by launcher/satellite coupled loads analyses (CLA): only linear satellite models are delivered for performing CLA and no well-established rules exist how to properly linearize a model when non- linearities are present. The potential impact of the linearization on the results of the CLA has not yet been properly analyzed. There are thus difficulties to assess that CLA results will cover actual flight levels. Management of satellite verification tests: the CLA results generated with a linear satellite FEM are assumed flight representative. If the internal non- linearities are present in the tested satellite then there might be difficulties to determine which input level must be passed to cover satellite internal loads. The non-linear behaviour can also disturb the shaker control, putting the satellite at risk by potentially imposing too high levels. This paper presents the results of a test campaign performed in
Evolution and structural organization of the C proteins of paramyxovirinae.

Directory of Open Access Journals (Sweden)

Michael K Lo

Full Text Available The phosphoprotein (P gene of most Paramyxovirinae encodes several proteins in overlapping frames: P and V, which share a common N-terminus (PNT, and C, which overlaps PNT. Overlapping genes are of particular interest because they encode proteins originated de novo, some of which have unknown structural folds, challenging the notion that nature utilizes only a limited, well-mapped area of fold space. The C proteins cluster in three groups, comprising measles, Nipah, and Sendai virus. We predicted that all C proteins have a similar organization: a variable, disordered N-terminus and a conserved, α-helical C-terminus. We confirmed this predicted organization by biophysically characterizing recombinant C proteins from Tupaia paramyxovirus (measles group and human parainfluenza virus 1 (Sendai group. We also found that the C of the measles and Nipah groups have statistically significant sequence similarity, indicating a common origin. Although the C of the Sendai group lack sequence similarity with them, we speculate that they also have a common origin, given their similar genomic location and structural organization. Since C is dispensable for viral replication, unlike PNT, we hypothesize that C may have originated de novo by overprinting PNT in the ancestor of Paramyxovirinae. Intriguingly, in measles virus and Nipah virus, PNT encodes STAT1-binding sites that overlap different regions of the C-terminus of C, indicating they have probably originated independently. This arrangement, in which the same genetic region encodes simultaneously a crucial functional motif (a STAT1-binding site and a highly constrained region (the C-terminus of C, seems paradoxical, since it should severely reduce the ability of the virus to adapt. The fact that it originated twice suggests that it must be balanced by an evolutionary advantage, perhaps from reducing the size of the genetic region vulnerable to mutations.
Aromatic claw: A new fold with high aromatic content that evades structural prediction: Aromatic Claw

Energy Technology Data Exchange (ETDEWEB)

Sachleben, Joseph R. [Biomolecular NMR Core Facility, University of Chicago, Chicago Illinois; Adhikari, Aashish N. [Department of Chemistry, University of Chicago, Chicago Illinois; Gawlak, Grzegorz [Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois; Hoey, Robert J. [Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois; Liu, Gaohua [Northeast Structural Genomics Consortium (NESG), Department of Molecular Biology and Biochemistry, School of Arts and Sciences, and Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, and Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway New Jersey; Joachimiak, Andrzej [Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois; Biological Sciences Division, Argonne National Laboratory, Argonne Illinois; Montelione, Gaetano T. [Northeast Structural Genomics Consortium (NESG), Department of Molecular Biology and Biochemistry, School of Arts and Sciences, and Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, and Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway New Jersey; Sosnick, Tobin R. [Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois; Koide, Shohei [Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois; Department of Biochemistry and Molecular Pharmacology and the Perlmutter Cancer Center, New York University School of Medicine, New York New York

2016-11-10

We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet with a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.
Infant Mortality in Novo Hamburgo: Associated Factors and Cardiovascular Causes

Directory of Open Access Journals (Sweden)

Camila de Andrade Brum

2015-04-01

Full Text Available Background: Infant mortality has decreased in Brazil, but remains high as compared to that of other developing countries. In 2010, the Rio Grande do Sul state had the lowest infant mortality rate in Brazil. However, the municipality of Novo Hamburgo had the highest infant mortality rate in the Porto Alegre metropolitan region. Objective: To describe the causes of infant mortality in the municipality of Novo Hamburgo from 2007 to 2010, identifying which causes were related to heart diseases and if they were diagnosed in the prenatal period, and to assess the access to healthcare services. Methods: This study assessed infants of the municipality of Novo Hamburgo, who died, and whose data were collected from the infant death investigation records. Results: Of the 157 deaths in that period, 35.3% were reducible through diagnosis and early treatment, 25% were reducible through partnership with other sectors, 19.2% were non-preventable, 11.5% were reducible by means of appropriate pregnancy monitoring, 5.1% were reducible through appropriate delivery care, and 3.8% were ill defined. The major cause of death related to heart disease (13.4%, which was significantly associated with the variables ‘age at death’, ‘gestational age’ and ‘birth weight’. Regarding access to healthcare services, 60.9% of the pregnant women had a maximum of six prenatal visits. Conclusion: It is mandatory to enhance prenatal care and newborn care at hospitals and basic healthcare units to prevent infant mortality.
Infant Mortality in Novo Hamburgo: Associated Factors and Cardiovascular Causes

Energy Technology Data Exchange (ETDEWEB)

Brum, Camila de Andrade [Instituto de Cardiologia/Fundação Universitária de Cardiologia (IC/FUC), Porto Alegre, RS (Brazil); Stein, Airton Tetelbom [Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS (Brazil); Grupo Hospitalar Conceição (GHC), Porto Alegre, RS (Brazil); Universidade Luterana do Brasil (ULBRA), Porto Alegre, RS (Brazil); Pellanda, Lucia Campos, E-mail: luciapell.pesquisa@cardiologia.org.br [Instituto de Cardiologia/Fundação Universitária de Cardiologia (IC/FUC), Porto Alegre, RS (Brazil); Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS (Brazil)

2015-04-15

Infant mortality has decreased in Brazil, but remains high as compared to that of other developing countries. In 2010, the Rio Grande do Sul state had the lowest infant mortality rate in Brazil. However, the municipality of Novo Hamburgo had the highest infant mortality rate in the Porto Alegre metropolitan region. To describe the causes of infant mortality in the municipality of Novo Hamburgo from 2007 to 2010, identifying which causes were related to heart diseases and if they were diagnosed in the prenatal period, and to assess the access to healthcare services. This study assessed infants of the municipality of Novo Hamburgo, who died, and whose data were collected from the infant death investigation records. Of the 157 deaths in that period, 35.3% were reducible through diagnosis and early treatment, 25% were reducible through partnership with other sectors, 19.2% were non-preventable, 11.5% were reducible by means of appropriate pregnancy monitoring, 5.1% were reducible through appropriate delivery care, and 3.8% were ill defined. The major cause of death related to heart disease (13.4%), which was significantly associated with the variables ‘age at death’, ‘gestational age’ and ‘birth weight’. Regarding access to healthcare services, 60.9% of the pregnant women had a maximum of six prenatal visits. It is mandatory to enhance prenatal care and newborn care at hospitals and basic healthcare units to prevent infant mortality.
Infant Mortality in Novo Hamburgo: Associated Factors and Cardiovascular Causes

International Nuclear Information System (INIS)

Brum, Camila de Andrade; Stein, Airton Tetelbom; Pellanda, Lucia Campos

2015-01-01

Infant mortality has decreased in Brazil, but remains high as compared to that of other developing countries. In 2010, the Rio Grande do Sul state had the lowest infant mortality rate in Brazil. However, the municipality of Novo Hamburgo had the highest infant mortality rate in the Porto Alegre metropolitan region. To describe the causes of infant mortality in the municipality of Novo Hamburgo from 2007 to 2010, identifying which causes were related to heart diseases and if they were diagnosed in the prenatal period, and to assess the access to healthcare services. This study assessed infants of the municipality of Novo Hamburgo, who died, and whose data were collected from the infant death investigation records. Of the 157 deaths in that period, 35.3% were reducible through diagnosis and early treatment, 25% were reducible through partnership with other sectors, 19.2% were non-preventable, 11.5% were reducible by means of appropriate pregnancy monitoring, 5.1% were reducible through appropriate delivery care, and 3.8% were ill defined. The major cause of death related to heart disease (13.4%), which was significantly associated with the variables ‘age at death’, ‘gestational age’ and ‘birth weight’. Regarding access to healthcare services, 60.9% of the pregnant women had a maximum of six prenatal visits. It is mandatory to enhance prenatal care and newborn care at hospitals and basic healthcare units to prevent infant mortality
Concept of combinatorial de novo design of drug-like molecules by particle swarm optimization.

Science.gov (United States)

Hartenfeller, Markus; Proschak, Ewgenij; Schüller, Andreas; Schneider, Gisbert

2008-07-01

We present a fast stochastic optimization algorithm for fragment-based molecular de novo design (COLIBREE, Combinatorial Library Breeding). The search strategy is based on a discrete version of particle swarm optimization. Molecules are represented by a scaffold, which remains constant during optimization, and variable linkers and side chains. Different linkers represent virtual chemical reactions. Side-chain building blocks were obtained from pseudo-retrosynthetic dissection of large compound databases. Here, ligand-based design was performed using chemically advanced template search (CATS) topological pharmacophore similarity to reference ligands as fitness function. A weighting scheme was included for particle swarm optimization-based molecular design, which permits the use of many reference ligands and allows for positive and negative design to be performed simultaneously. In a case study, the approach was applied to the de novo design of potential peroxisome proliferator-activated receptor subtype-selective agonists. The results demonstrate the ability of the technique to cope with large combinatorial chemistry spaces and its applicability to focused library design. The technique was able to perform exploitation of a known scheme and at the same time explorative search for novel ligands within the framework of a given molecular core structure. It thereby represents a practical solution for compound screening in the early hit and lead finding phase of a drug discovery project.
Heterologous aggregates promote de novo prion appearance via more than one mechanism.

Directory of Open Access Journals (Sweden)

Fatih Arslan

2015-01-01

Full Text Available Prions are self-perpetuating conformational variants of particular proteins. In yeast, prions cause heritable phenotypic traits. Most known yeast prions contain a glutamine (Q/asparagine (N-rich region in their prion domains. [PSI+], the prion form of Sup35, appears de novo at dramatically enhanced rates following transient overproduction of Sup35 in the presence of [PIN+], the prion form of Rnq1. Here, we establish the temporal de novo appearance of Sup35 aggregates during such overexpression in relation to other cellular proteins. Fluorescently-labeled Sup35 initially forms one or a few dots when overexpressed in [PIN+] cells. One of the dots is perivacuolar, colocalizes with the aggregated Rnq1 dot and grows into peripheral rings/lines, some of which also colocalize with Rnq1. Sup35 dots that are not near the vacuole do not always colocalize with Rnq1 and disappear by the time rings start to grow. Bimolecular fluorescence complementation failed to detect any interaction between Sup35-VN and Rnq1-VC in [PSI+][PIN+] cells. In contrast, all Sup35 aggregates, whether newly induced or in established [PSI+], completely colocalize with the molecular chaperones Hsp104, Sis1, Ssa1 and eukaryotic release factor Sup45. In the absence of [PIN+], overexpressed aggregating proteins such as the Q/N-rich Pin4C or the non-Q/N-rich Mod5 can also promote the de novo appearance of [PSI+]. Similar to Rnq1, overexpressed Pin4C transiently colocalizes with newly appearing Sup35 aggregates. However, no interaction was detected between Mod5 and Sup35 during [PSI+] induction in the absence of [PIN+]. While the colocalization of Sup35 and aggregates of Rnq1 or Pin4C are consistent with the model that the heterologous aggregates cross-seed the de novo appearance of [PSI+], the lack of interaction between Mod5 and Sup35 leaves open the possibility of other mechanisms. We also show that Hsp104 is required in the de novo appearance of [PSI+] aggregates in a [PIN
Whole-Genome de novo Sequencing Of Quail And Grey Partridge

DEFF Research Database (Denmark)

Holm, Lars-Erik; Panitz, Frank; Burt, Dave

2011-01-01

The development in sequencing methods has made it possible to perform whole genome de novo sequencing of species without large commercial interests. Within the EU-financed QUANTOMICS project (KBBE-2A-222664), we have performed de novo sequencing of quail (Coturnix coturnix) and grey partridge...... (Perdix perdix) on a Genome Analyzer GAII (Illumina) using paired-end sequencing. The amount of generated sequences amounts to 8 to 9 Gb for each species. The analysis and assembly of the generated sequences is ongoing. Access to the whole genome sequence from these two species will enable enhanced...... comparative studies towards the chicken genome and will aid in identifying evolutionarily conserved sequences within the Galliformes. The obtained sequences from quail and partridge represent a beginning of generating the whole genome sequence for these species. The continuation of establishing the genome...
De novo assembly of plant body plan: a step ahead of Deadpool.

Science.gov (United States)

Kareem, Abdul; Radhakrishnan, Dhanya; Sondhi, Yash; Aiyaz, Mohammed; Roy, Merin V; Sugimoto, Kaoru; Prasad, Kalika

2016-08-01

While in the movie Deadpool it is possible for a human to recreate an arm from scratch, in reality plants can even surpass that. Not only can they regenerate lost parts, but also the whole plant body can be reborn from a few existing cells. Despite the decades old realization that plant cells possess the ability to regenerate a complete shoot and root system, it is only now that the underlying mechanisms are being unraveled. De novo plant regeneration involves the initiation of regenerative mass, acquisition of the pluripotent state, reconstitution of stem cells and assembly of regulatory interactions. Recent studies have furthered our understanding on the making of a complete plant system in the absence of embryonic positional cues. We review the recent studies probing the molecular mechanisms of de novo plant regeneration in response to external inductive cues and our current knowledge of direct reprogramming of root to shoot and vice versa. We further discuss how de novo regeneration can be exploited to meet the demands of green culture industries and to serve as a general model to address the fundamental questions of regeneration across the plant kingdom.
Critical assessment of methods of protein structure prediction (CASP)-round IX

KAUST Repository

Moult, John; Fidelis, Krzysztof; Kryshtafovych, Andriy; Tramontano, Anna

2011-01-01

This article is an introduction to the special issue of the journal PROTEINS, dedicated to the ninth Critical Assessment of Structure Prediction (CASP) experiment to assess the state of the art in protein structure modeling. The article describes the conduct of the experiment, the categories of prediction included, and outlines the evaluation and assessment procedures. Methods for modeling protein structure continue to advance, although at a more modest pace than in the early CASP experiments. CASP developments of note are indications of improvement in model accuracy for some classes of target, an improved ability to choose the most accurate of a set of generated models, and evidence of improvement in accuracy for short "new fold" models. In addition, a new analysis of regions of models not derivable from the most obvious template structure has revealed better performance than expected.
Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks.

Science.gov (United States)

Babaei, Sepideh; Geranmayeh, Amir; Seyyedsalehi, Seyyed Ali

2010-12-01

The supervised learning of recurrent neural networks well-suited for prediction of protein secondary structures from the underlying amino acids sequence is studied. Modular reciprocal recurrent neural networks (MRR-NN) are proposed to model the strong correlations between adjacent secondary structure elements. Besides, a multilayer bidirectional recurrent neural network (MBR-NN) is introduced to capture the long-range intramolecular interactions between amino acids in formation of the secondary structure. The final modular prediction system is devised based on the interactive integration of the MRR-NN and the MBR-NN structures to arbitrarily engage the neighboring effects of the secondary structure types concurrent with memorizing the sequential dependencies of amino acids along the protein chain. The advanced combined network augments the percentage accuracy (Q₃) to 79.36% and boosts the segment overlap (SOV) up to 70.09% when tested on the PSIPRED dataset in three-fold cross-validation. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
A case of de novo duplication of 15q24-q26.3

Directory of Open Access Journals (Sweden)

Hye Ran Kim

2011-06-01

Full Text Available Distal duplication, or trisomy 15q, is an extremely rare chromosomal disorder characterized by prenatal and postnatal overgrowth, mental retardation, and craniofacial malformations. Additional abnormalities typically include an unusually short neck, malformations of the fingers and toes, scoliosis and skeletal malformations, genital abnormalities, particularly in affected males, and, in some cases, cardiac defects. The range and severity of symptoms and physical findings may vary from case to case, depending upon the length and location of the duplicated portion of chromosome 15q. Most reported cases of duplication of the long arm of chromosome 15 frequently have more than one segmental imbalance resulting from unbalanced translocations involving chromosome 15 and deletions in another chromosome, as well as other structural chromosomal abnormalities. We report a female newborn with a de novo duplication, 15q24- q26.3, showing intrauterine overgrowth, a narrow asymmetric face with down-slanting palpebral fissures, a large, prominent nose, and micrognathia, arachnodactyly, camptodactyly, congenital heart disease, hydronephrosis, and hydroureter. Chromosomal analysis showed a 46,XX,inv(9(p12q13,dup(15(q24q26.3. Array comparative genomic hybridization analysis revealed a gain of 42 clones on 15q24-q26.3. This case represents the only reported patient with a de novo 15q24-q26.3 duplication that did not result from an unbalanced translocation and did not have a concomitant monosomic component in Korea.
The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations

Directory of Open Access Journals (Sweden)

Moye Wang

2016-04-01

Full Text Available As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5–10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design.
Optimal neural networks for protein-structure prediction

International Nuclear Information System (INIS)

Head-Gordon, T.; Stillinger, F.H.

1993-01-01

The successful application of neural-network algorithms for prediction of protein structure is stymied by three problem areas: the sparsity of the database of known protein structures, poorly devised network architectures which make the input-output mapping opaque, and a global optimization problem in the multiple-minima space of the network variables. We present a simplified polypeptide model residing in two dimensions with only two amino-acid types, A and B, which allows the determination of the global energy structure for all possible sequences of pentamer, hexamer, and heptamer lengths. This model simplicity allows us to compile a complete structural database and to devise neural networks that reproduce the tertiary structure of all sequences with absolute accuracy and with the smallest number of network variables. These optimal networks reveal that the three problem areas are convoluted, but that thoughtful network designs can actually deconvolute these detrimental traits to provide network algorithms that genuinely impact on the ability of the network to generalize or learn the desired mappings. Furthermore, the two-dimensional polypeptide model shows sufficient chemical complexity so that transfer of neural-network technology to more realistic three-dimensional proteins is evident
Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

Directory of Open Access Journals (Sweden)

Borodovsky Mark

2006-03-01

Full Text Available Abstract Background The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: Single-sequence prediction algorithms imply that information about other (homologous proteins is not available, while algorithms of the second type imply that information about homologous proteins is available, and use it intensively. The single-sequence algorithms could make an important contribution to studies of proteins with no detected homologs, however the accuracy of protein secondary structure prediction from a single-sequence is not as high as when the additional evolutionary information is present. Results In this paper, we further refine and extend the hidden semi-Markov model (HSMM initially considered in the BSPSS algorithm. We introduce an improved residue dependency model by considering the patterns of statistically significant amino acid correlation at structural segment borders. We also derive models that specialize on different sections of the dependency structure and incorporate them into HSMM. In addition, we implement an iterative training method to refine estimates of HSMM parameters. The three-state-per-residue accuracy and other accuracy measures of the new method, IPSSP, are shown to be comparable or better than ones for BSPSS as well as for PSIPRED, tested under the single-sequence condition. Conclusions We have shown that new dependency models and training methods bring further improvements to single-sequence protein secondary structure prediction. The results are obtained under cross-validation conditions using a dataset with no pair of sequences having significant sequence similarity. As new sequences are added to the database it is possible to augment the dependency structure and obtain even higher accuracy. Current and future advances should contribute to the improvement of function prediction for orphan proteins inscrutable
Structural prediction and analysis of VIH-related peptides from selected crustacean species.

Science.gov (United States)

Nagaraju, Ganji Purna Chandra; Kumari, Nunna Siva; Prasad, Ganji Lakshmi Vara; Rajitha, Balney; Meenu, Madan; Rao, Manam Sreenivasa; Naik, Bannoth Reddya

2009-08-17

The tentative elucidation of the 3D-structure of vitellogenesis inhibiting hormone (VIH) peptides is conversely underprivileged by difficulties in gaining enough peptide or protein, diffracting crystals, and numerous extra technical aspects. As a result, no structural information is available for VIH peptide sequences registered in the Genbank. In this situation, it is not surprising that predictive methods have achieved great interest. Here, in this study the molt-inhibiting hormone (MIH) of the kuruma prawn (Marsupenaeus japonicus) is used, to predict the structure of four VIHrelated peptides in the crustacean species. The high similarity of the 3D-structures and the calculated physiochemical characteristics of these peptides suggest a common fold for the entire family.

Towards a unified fatigue life prediction method for marine structures

CERN Document Server

Cui, Weicheng; Wang, Fang

2014-01-01

In order to apply the damage tolerance design philosophy to design marine structures, accurate prediction of fatigue crack growth under service conditions is required. Now, more and more people have realized that only a fatigue life prediction method based on fatigue crack propagation (FCP) theory has the potential to explain various fatigue phenomena observed. In this book, the issues leading towards the development of a unified fatigue life prediction (UFLP) method based on FCP theory are addressed. Based on the philosophy of the UFLP method, the current inconsistency between fatigue design and inspection of marine structures could be resolved. This book presents the state-of-the-art and recent advances, including those by the authors, in fatigue studies. It is designed to lead the future directions and to provide a useful tool in many practical applications. It is intended to address to engineers, naval architects, research staff, professionals and graduates engaged in fatigue prevention design and survey ...
LiveBench-1: continuous benchmarking of protein structure prediction servers.

Science.gov (United States)

Bujnicki, J M; Elofsson, A; Fischer, D; Rychlewski, L

2001-02-01

We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targets--where standard methods such as PSI-BLAST fail--the most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join.
Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments.

Directory of Open Access Journals (Sweden)

Hahnbeom Park

Full Text Available Protein loop modeling is a tool for predicting protein local structures of particular interest, providing opportunities for applications involving protein structure prediction and de novo protein design. Until recently, the majority of loop modeling methods have been developed and tested by reconstructing loops in frameworks of experimentally resolved structures. In many practical applications, however, the protein loops to be modeled are located in inaccurate structural environments. These include loops in model structures, low-resolution experimental structures, or experimental structures of different functional forms. Accordingly, discrepancies in the accuracy of the structural environment assumed in development of the method and that in practical applications present additional challenges to modern loop modeling methods. This study demonstrates a new strategy for employing a hybrid energy function combining physics-based and knowledge-based components to help tackle this challenge. The hybrid energy function is designed to combine the strengths of each energy component, simultaneously maintaining accurate loop structure prediction in a high-resolution framework structure and tolerating minor environmental errors in low-resolution structures. A loop modeling method based on global optimization of this new energy function is tested on loop targets situated in different levels of environmental errors, ranging from experimental structures to structures perturbed in backbone as well as side chains and template-based model structures. The new method performs comparably to force field-based approaches in loop reconstruction in crystal structures and better in loop prediction in inaccurate framework structures. This result suggests that higher-accuracy predictions would be possible for a broader range of applications. The web server for this method is available at http://galaxy.seoklab.org/loop with the PS2 option for the scoring function.
MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.

Science.gov (United States)

Fang, Chao; Shang, Yi; Xu, Dong

2018-05-01

Protein secondary structure prediction can provide important information for protein 3D structure prediction and protein functions. Deep learning offers a new opportunity to significantly improve prediction accuracy. In this article, a new deep neural network architecture, named the Deep inception-inside-inception (Deep3I) network, is proposed for protein secondary structure prediction and implemented as a software tool MUFOLD-SS. The input to MUFOLD-SS is a carefully designed feature matrix corresponding to the primary amino acid sequence of a protein, which consists of a rich set of information derived from individual amino acid, as well as the context of the protein sequence. Specifically, the feature matrix is a composition of physio-chemical properties of amino acids, PSI-BLAST profile, and HHBlits profile. MUFOLD-SS is composed of a sequence of nested inception modules and maps the input matrix to either eight states or three states of secondary structures. The architecture of MUFOLD-SS enables effective processing of local and global interactions between amino acids in making accurate prediction. In extensive experiments on multiple datasets, MUFOLD-SS outperformed the best existing methods and other deep neural networks significantly. MUFold-SS can be downloaded from http://dslsrv8.cs.missouri.edu/~cf797/MUFoldSS/download.html. © 2018 Wiley Periodicals, Inc.
A probabilistic fragment-based protein structure prediction algorithm.

Directory of Open Access Journals (Sweden)

David Simoncini

Full Text Available Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representation of proteins. It is during this coarse-grained phase that fragment-based methods sample intensely the conformational space. If the native-like region is sampled more, the accuracy of the final all-atom predictions may be improved accordingly. In this work we present EdaFold, a new method for fragment-based protein structure prediction based on an Estimation of Distribution Algorithm. Fragment-based approaches build protein models by assembling short fragments from known protein structures. Whereas the probability mass functions over the fragment libraries are uniform in the usual case, we propose an algorithm that learns from previously generated decoys and steers the search toward native-like regions. A comparison with Rosetta AbInitio protocol shows that EdaFold is able to generate models with lower energies and to enhance the percentage of near-native coarse-grained decoys on a benchmark of [Formula: see text] proteins. The best coarse-grained models produced by both methods were refined into all-atom models and used in molecular replacement. All atom decoys produced out of EdaFold's decoy set reach high enough accuracy to solve the crystallographic phase problem by molecular replacement for some test proteins. EdaFold showed a higher success rate in molecular replacement when compared to Rosetta. Our study suggests that improving low resolution coarse-grained decoys allows computational methods to avoid subsequent sampling issues during all-atom refinement and to produce better all-atom models. EdaFold can be downloaded from http://www.riken.jp/zhangiru/software.html [corrected].
Bitter or not? BitterPredict, a tool for predicting taste from chemical structure.

Science.gov (United States)

Dagan-Wiener, Ayana; Nissim, Ido; Ben Abu, Natalie; Borgonovo, Gigliola; Bassoli, Angela; Niv, Masha Y

2017-09-21

Bitter taste is an innately aversive taste modality that is considered to protect animals from consuming toxic compounds. Yet, bitterness is not always noxious and some bitter compounds have beneficial effects on health. Hundreds of bitter compounds were reported (and are accessible via the BitterDB http://bitterdb.agri.huji.ac.il/dbbitter.php ), but numerous additional bitter molecules are still unknown. The dramatic chemical diversity of bitterants makes bitterness prediction a difficult task. Here we present a machine learning classifier, BitterPredict, which predicts whether a compound is bitter or not, based on its chemical structure. BitterDB was used as the positive set, and non-bitter molecules were gathered from literature to create the negative set. Adaptive Boosting (AdaBoost), based on decision trees machine-learning algorithm was applied to molecules that were represented using physicochemical and ADME/Tox descriptors. BitterPredict correctly classifies over 80% of the compounds in the hold-out test set, and 70-90% of the compounds in three independent external sets and in sensory test validation, providing a quick and reliable tool for classifying large sets of compounds into bitter and non-bitter groups. BitterPredict suggests that about 40% of random molecules, and a large portion (66%) of clinical and experimental drugs, and of natural products (77%) are bitter.
De novo status epilepticus is associated with adverse outcome: An 11-year retrospective study in Hong Kong.

Science.gov (United States)

Lui, Hoi Ki Kate; Hui, Kwok Fai; Fong, Wing Chi; Ip, Chun Tak; Lui, Hiu Tung Colin

2016-08-01

To identify predictors of poor clinical outcome in patients presenting to the intensive care units with status epilepticus (SE), in particular for patients presenting with de novo status epileptics. A retrospective review was performed on patients admitted to the intensive care units with status epilepticus in two hospitals in Hong Kong over an 11-year period from 2003 to 2013. A total of 87 SE cases were analyzed. The mean age of patients was 49.3 years (SD 14.9 years). Eighteen subjects (20.7%) had breakthrough seizure, which was the most common etiology for the status epilepticus episodes. Seventy-eight subjects (89.7%) had convulsive status epilepticus (CSE) and 9 subjects (10.3%) had non-convulsive status epilepticus (NCSE) on presentation. The 30-day mortality rate of all subjects was 18.4%. Non-convulsive status epilepticus was more common in patients with de novo status epilepticus when compared to those with existing history of epilepsy (15.5% Vs. 0%, p=0.03). Patients with de novo status epilepticus were older (52 Vs 43, p=0.009). De novo status epilepticus was associated with longer status duration (median 2.5 days, IQR 5 days), longer ICU stay (median 7.5 days, IQR 9 days) and poorer outcome (OR 4.15, 95% CI 1.53-11.2). For patients presenting to intensive care units with status epilepticus, those with de novo status epileptics were older and were more likely to develop non-convulsive status epilepticus. De novo status epilepticus was associated with poorer outcome. Continuous EEG monitoring would help identifying NCSE and potentially help improving clinical outcomes. Copyright © 2016 British Epilepsy Association. Published by Elsevier Ltd. All rights reserved.
Study on effect of mean stress on fatigue life prediction of thin film structure

Energy Technology Data Exchange (ETDEWEB)

Shin, Myung Soo [Ahtti Co., Seongnam (Korea, Republic of); Park, Jun Hyu [Tongmyong University, Busan (Korea, Republic of); Kim, Jung Yup [Korea Institute of Machinery and Materials, Daejeon (Korea, Republic of)

2016-04-15

This paper describes the effect of mean stress on fatigue life prediction of structure made with thin film. It is well known that the mean stress influences fatigue life prediction of mechanical structure. We investigated a reasonable method for considering mean stress when fatigue strength assessment of micro structure of thin film should be performed. Fatigue tests of smooth specimen of beryllium-copper (BeCu) thin film were performed in ambient air at R = 0.1 with 5 Hz. A micro probe was designed and made with BeCu thin film by the precision press process. Fatigue tests of micro structure were performed with 5 Hz frequency, in ambient air to verify the fatigue life predicted by computer simulation through FE analysis. The fatigue life predicted by the Sa -N curve modified by Goodman method with principal stress through FE analysis shows a more reasonable result than other methods.
Study on effect of mean stress on fatigue life prediction of thin film structure

International Nuclear Information System (INIS)

Shin, Myung Soo; Park, Jun Hyu; Kim, Jung Yup

2016-01-01

This paper describes the effect of mean stress on fatigue life prediction of structure made with thin film. It is well known that the mean stress influences fatigue life prediction of mechanical structure. We investigated a reasonable method for considering mean stress when fatigue strength assessment of micro structure of thin film should be performed. Fatigue tests of smooth specimen of beryllium-copper (BeCu) thin film were performed in ambient air at R = 0.1 with 5 Hz. A micro probe was designed and made with BeCu thin film by the precision press process. Fatigue tests of micro structure were performed with 5 Hz frequency, in ambient air to verify the fatigue life predicted by computer simulation through FE analysis. The fatigue life predicted by the Sa -N curve modified by Goodman method with principal stress through FE analysis shows a more reasonable result than other methods
Melhoramento do cafeeiro: XXXVIII. Observações sobre progênies do cultivar Mundo-Novo de Coffea arabica na estação experimental de Mococa Coffee breeding: XXXVIII-observation on progenies of the Mundo-Novo cultivars of Coffea arabica in the Mococa experimental station

Directory of Open Access Journals (Sweden)

Túlio R. Rocha

1980-01-01

Full Text Available Os dados analisados no experimento localizado em Mococa sobre a produtividade de 112 progênies dos cultivares Mundo-Novo S1 e S2, Bourbon-Amarelo, BourbonVermelho e Caturra-Vermelho de Coffea arabica no período de 1955 a 1971, indicaram que as de Mundo-Novo S1, de prefixos MP 474, MP 502, MP 469, MP 492 e MP 475, revelaram-se como as mais produtivas, assemelhando-se a algumas progênies 'Mundo--Novo' S2. Dentre estas, destacou-se a de prefixo MP 388-6, que atingiu o nível mais elevado de produção do experimento. As progênies de 'Mundo-Novo', em conjunto, produziram 44% a mais do que as de Bourbon-Amarelo e, estas, 60% a mais do que as de Bourbon-Vermelho e Caturra-Vermelho. A altura e o diâmetro da copa atingiram valores médios mais elevados para as progênies de 'Mundo-Novo'. Verificaram-se correlações positivas e altamente significativas entre altura média da planta e diâmetro médio da copa com a produção das progênies. As progênies mais produtivas revelaram rendimento (relação entre peso de café maduro e beneficiado de aproximadamente 6,0 e porcentagem de sementes normais, do tipo chato, acima de 80. Quanto ao tamanho das sementes do tipo chato, duas progênies 'Mundo-Novo' S1, MP 474 e MP 452, apresentaram peneira média maior, permi-tindo seleção de plantas com essa característica e com elevada produção.Coffee progenies of the Mundo-Novo cultivars of Coffea arabica were studied in an experiment located at the Mococa Experimental Station of the Instituto Agronômico in comparison with Bourbon-Amarelo, Bourbon-Vermelho and Caturra-Vermelho cultivars of the same species. During a period of 17 consecutive cropping years (1955-1971, Mundo-Novo yielded approximately 44% more than Bourbon-Amarelo and this cultivars yielded 60% more than Bourbon-Vermelho and Caturra-Vermelho. Among the 89 S1 'Mundo-Novo' progenies, MP 474, MP 502, MP 469, MP 492 and MP 475 yielded as much as the two best 'Mundo-Novo' S2 progenies. Greater
Structural MRI-Based Predictions in Patients with Treatment-Refractory Depression (TRD.

Directory of Open Access Journals (Sweden)

Blair A Johnston

Full Text Available The application of machine learning techniques to psychiatric neuroimaging offers the possibility to identify robust, reliable and objective disease biomarkers both within and between contemporary syndromal diagnoses that could guide routine clinical practice. The use of quantitative methods to identify psychiatric biomarkers is consequently important, particularly with a view to making predictions relevant to individual patients, rather than at a group-level. Here, we describe predictions of treatment-refractory depression (TRD diagnosis using structural T1-weighted brain scans obtained from twenty adult participants with TRD and 21 never depressed controls. We report 85% accuracy of individual subject diagnostic prediction. Using an automated feature selection method, the major brain regions supporting this significant classification were in the caudate, insula, habenula and periventricular grey matter. It was not, however, possible to predict the degree of 'treatment resistance' in individual patients, at least as quantified by the Massachusetts General Hospital (MGH-S clinical staging method; but the insula was again identified as a region of interest. Structural brain imaging data alone can be used to predict diagnostic status, but not MGH-S staging, with a high degree of accuracy in patients with TRD.
Nucleic acid helix structure determination from NMR proton chemical shifts

Energy Technology Data Exchange (ETDEWEB)

Werf, Ramon M. van der; Tessari, Marco; Wijmenga, Sybren S., E-mail: S.Wijmenga@science.ru.nl [Radboud University Nijmegen, Department of Biophysical Chemistry, Institute of Molecules and Materials (Netherlands)

2013-06-15

We present a method for de novo derivation of the three-dimensional helix structure of nucleic acids using non-exchangeable proton chemical shifts as sole source of experimental restraints. The method is called chemical shift de novo structure derivation protocol employing singular value decomposition (CHEOPS) and uses iterative singular value decomposition to optimize the structure in helix parameter space. The correct performance of CHEOPS and its range of application are established via an extensive set of structure derivations using either simulated or experimental chemical shifts as input. The simulated input data are used to assess in a defined manner the effect of errors or limitations in the input data on the derived structures. We find that the RNA helix parameters can be determined with high accuracy. We finally demonstrate via three deposited RNA structures that experimental proton chemical shifts suffice to derive RNA helix structures with high precision and accuracy. CHEOPS provides, subject to further development, new directions for high-resolution NMR structure determination of nucleic acids.
Progression of MDS-UPDRS Scores Over Five Years in De Novo Parkinson Disease from the Parkinson's Progression Markers Initiative Cohort.

Science.gov (United States)

Holden, Samantha K; Finseth, Taylor; Sillau, Stefan H; Berman, Brian D

2018-01-01

The Movement Disorder Society Unified Parkinson Disease Rating Scale (MDS-UDPRS) is a commonly used tool to measure Parkinson disease (PD) progression. Longitudinal changes in MDS-UPDRS scores in de novo PD have not been established. Determine progression rates of MDS-UPDRS scores in de novo PD. 362 participants from the Parkinson's Progression Markers Initiative, a multicenter longitudinal cohort study of de novo PD, were included. Longitudinal progression of MDS-UPDRS total and subscale scores were modeled using mixed model regression. MDS-UPDRS scores increased in a linear fashion over five years in de novo PD. MDS-UPDRS total score increased an estimated 4.0 points/year, Part I 0.25 points/year, Part II 1.0 points/year, and Part III 2.4 points/year. The expected average progression of MDS-UPDRS scores in de novo PD from this study can assist in clinical monitoring and provide comparative data for detection of disease modification in treatment trials.
Enhanced root growth in phosphate-starved Arabidopsis by stimulating de novo phospholipid biosynthesis through the overexpression of LYSOPHOSPHATIDIC ACID ACYLTRANSFERASE 2 (LPAT2).

Science.gov (United States)

Angkawijaya, Artik Elisa; Nguyen, Van Cam; Nakamura, Yuki

2017-09-01

Upon phosphate starvation, plants retard shoot growth but promote root development presumably to enhance phosphate assimilation from the ground. Membrane lipid remodelling is a metabolic adaptation that replaces membrane phospholipids by non-phosphorous galactolipids, thereby allowing plants to obtain scarce phosphate yet maintain the membrane structure. However, stoichiometry of this phospholipid-to-galactolipid conversion may not account for the massive demand of membrane lipids that enables active growth of roots under phosphate starvation, thereby suggesting the involvement of de novo phospholipid biosynthesis, which is not represented in the current model. We overexpressed an endoplasmic reticulum-localized lysophosphatidic acid acyltransferase, LPAT2, a key enzyme that catalyses the last step of de novo phospholipid biosynthesis. Two independent LPAT2 overexpression lines showed no visible phenotype under normal conditions but showed increased root length under phosphate starvation, with no effect on phosphate starvation response including marker gene expression, root hair development and anthocyanin accumulation. Accompanying membrane glycerolipid profiling of LPAT2-overexpressing plants revealed an increased content of major phospholipid classes and distinct responses to phosphate starvation between shoot and root. The findings propose a revised model of membrane lipid remodelling, in which de novo phospholipid biosynthesis mediated by LPAT2 contributes significantly to root development under phosphate starvation. © 2017 John Wiley & Sons Ltd.
De Novo Coding Variants Are Strongly Associated with Tourette Disorder

DEFF Research Database (Denmark)

Willsey, A Jeremy; Fernandez, Thomas V; Yu, Dongmei

2017-01-01

Whole-exome sequencing (WES) and de novo variant detection have proven a powerful approach to gene discovery in complex neurodevelopmental disorders. We have completed WES of 325 Tourette disorder trios from the Tourette International Collaborative Genetics cohort and a replication sample of 186 ...
De Novo Insertions and Deletions of Predominantly Paternal Origin Are Associated with Autism Spectrum Disorder

Directory of Open Access Journals (Sweden)

Shan Dong

2014-10-01

Full Text Available Summary: Whole-exome sequencing (WES studies have demonstrated the contribution of de novo loss-of-function single-nucleotide variants (SNVs to autism spectrum disorder (ASD. However, challenges in the reliable detection of de novo insertions and deletions (indels have limited inclusion of these variants in prior analyses. By applying a robust indel detection method to WES data from 787 ASD families (2,963 individuals, we demonstrate that de novo frameshift indels contribute to ASD risk (OR = 1.6; 95% CI = 1.0–2.7; p = 0.03, are more common in female probands (p = 0.02, are enriched among genes encoding FMRP targets (p = 6 × 10−9, and arise predominantly on the paternal chromosome (p < 0.001. On the basis of mutation rates in probands versus unaffected siblings, we conclude that de novo frameshift indels contribute to risk in approximately 3% of individuals with ASD. Finally, by observing clustering of mutations in unrelated probands, we uncover two ASD-associated genes: KMT2E (MLL5, a chromatin regulator, and RIMS1, a regulator of synaptic vesicle release. : Insertions and deletions (indels have proven especially difficult to detect in exome sequencing data. Dong et al. now identify indels in exome data for 787 autism spectrum disorder (ASD families. They demonstrate association between de novo indels that alter the reading frame and ASD. Furthermore, by observing clustering of indels in unrelated probands, they uncover two additional ASD-associated genes: KMT2E (MLL5, a chromatin regulator, and RIMS1, a regulator of synaptic vesicle release.
Novo-desenvolvimento, capital social e desigualdade social

Directory of Open Access Journals (Sweden)

Ana Cristina de Oliveira Oliveira

2012-03-01

Full Text Available Este artigo aborda a tendência de enfrentamento da desigualdade social a partir, no campo econômico, da versão do novo-desenvolvimentismo e, no campo político e ideológico, a partir da noção de capital social, na tentativa de realizar um "capitalismo com face mais humana". Discutiremos duas ordens de questões, considerando a especificidade da formação social brasileira de capitalismo dependente: 1 a “construção de Estados fortes” para
assegurar as condições de acumulação do capital, ampliando as margens do mercado de consumo, aliviando a pobreza e controlando possíveis tensões políticas e 2 a difusão da necessidade de construir uma sociedade em harmonia, que se traduz na incorporação da ética empreendedora dos empresários em todas as esferas sociais. Entendemos que este escopo político-econômico revela uma nova pedagogia da hegemonia, sustentada numa suposta alternativa
de gerenciamento das novas expressões da “questão social”, voltada para educar o conformismo e ocultar o conflito de classes.
Palavras-chave: questão social; novo-desenvolvimentismo; capital social; inclusão forçada
Crystal structure prediction of flexible molecules using parallel genetic algorithms with a standard force field.

Science.gov (United States)

Kim, Seonah; Orendt, Anita M; Ferraro, Marta B; Facelli, Julio C

2009-10-01

This article describes the application of our distributed computing framework for crystal structure prediction (CSP) the modified genetic algorithms for crystal and cluster prediction (MGAC), to predict the crystal structure of flexible molecules using the general Amber force field (GAFF) and the CHARMM program. The MGAC distributed computing framework includes a series of tightly integrated computer programs for generating the molecule's force field, sampling crystal structures using a distributed parallel genetic algorithm and local energy minimization of the structures followed by the classifying, sorting, and archiving of the most relevant structures. Our results indicate that the method can consistently find the experimentally known crystal structures of flexible molecules, but the number of missing structures and poor ranking observed in some crystals show the need for further improvement of the potential. Copyright 2009 Wiley Periodicals, Inc.
Antibody modeling using the prediction of immunoglobulin structure (PIGS) web server [corrected].

Science.gov (United States)

Marcatili, Paolo; Olimpieri, Pier Paolo; Chailyan, Anna; Tramontano, Anna

2014-12-01

Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (∼10 min on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together.
De novo mutations in ATP1A3 cause alternating hemiplegia of childhood

DEFF Research Database (Denmark)

Heinzen, Erin L; Swoboda, Kathryn J; Hitomi, Yuki

2012-01-01

and their unaffected parents to identify de novo nonsynonymous mutations in ATP1A3 in all seven individuals. In a subsequent sequence analysis of ATP1A3 in 98 other patients with AHC, we found that ATP1A3 mutations were likely to be responsible for at least 74% of the cases; we also identified one inherited mutation...... affecting the level of protein expression. This work identifies de novo ATP1A3 mutations as the primary cause of AHC and offers insight into disease pathophysiology by expanding the spectrum of phenotypes associated with mutations in ATP1A3....

Structural syntactic prediction measured with ELAN: evidence from ERPs.

Science.gov (United States)

Fonteneau, Elisabeth

2013-02-08

The current study used event-related potentials (ERPs) to investigate how and when argument structure information is used during the processing of sentences with a filler-gap dependency. We hypothesize that one specific property - animacy (living vs. non-living) - is used by the parser during the building of the syntactic structure. Participants heard sentences that were rated off-line as having an expected noun (Who did the Lion King chase the caravan with?) or an unexpected noun (Who did Lion King chase the animal with?). This prediction is based on the animacy properties relation between the wh-word and the noun in the object position. ERPs from the noun in the unexpected condition (animal) elicited a typical Early Left Anterior Negativity (ELAN)/P600 complex compared to the noun in the expected condition (caravan). Firstly, these results demonstrate that the ELAN reflects not only grammatical category violation but also animacy property expectations in filler-gap dependency. Secondly, our data suggests that the language comprehension system is able to make detailed predictions about aspects of the upcoming words to build up the syntactic structure. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

Science.gov (United States)

Ellington, Roni; Wachira, James; Nkwanta, Asamoah

2010-01-01

The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses…
Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

KAUST Repository

Guturu, H.

2013-11-11

Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and \\'through-DNA\\' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.
Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

KAUST Repository

Guturu, H.; Doxey, A. C.; Wenger, A. M.; Bejerano, G.

2013-01-01

Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.
Prediction of the Fundamental Period of Infilled RC Frame Structures Using Artificial Neural Networks

Directory of Open Access Journals (Sweden)

Panagiotis G. Asteris

2016-01-01

Full Text Available The fundamental period is one of the most critical parameters for the seismic design of structures. There are several literature approaches for its estimation which often conflict with each other, making their use questionable. Furthermore, the majority of these approaches do not take into account the presence of infill walls into the structure despite the fact that infill walls increase the stiffness and mass of structure leading to significant changes in the fundamental period. In the present paper, artificial neural networks (ANNs are used to predict the fundamental period of infilled reinforced concrete (RC structures. For the training and the validation of the ANN, a large data set is used based on a detailed investigation of the parameters that affect the fundamental period of RC structures. The comparison of the predicted values with analytical ones indicates the potential of using ANNs for the prediction of the fundamental period of infilled RC frame structures taking into account the crucial parameters that influence its value.
HBV core protein allosteric modulators differentially alter cccDNA biosynthesis from de novo infection and intracellular amplification pathways.

Science.gov (United States)

Guo, Fang; Zhao, Qiong; Sheraz, Muhammad; Cheng, Junjun; Qi, Yonghe; Su, Qing; Cuconati, Andrea; Wei, Lai; Du, Yanming; Li, Wenhui; Chang, Jinhong; Guo, Ju-Tao

2017-09-01

Hepatitis B virus (HBV) core protein assembles viral pre-genomic (pg) RNA and DNA polymerase into nucleocapsids for reverse transcriptional DNA replication to take place. Several chemotypes of small molecules, including heteroaryldihydropyrimidines (HAPs) and sulfamoylbenzamides (SBAs), have been discovered to allosterically modulate core protein structure and consequentially alter the kinetics and pathway of core protein assembly, resulting in formation of irregularly-shaped core protein aggregates or "empty" capsids devoid of pre-genomic RNA and viral DNA polymerase. Interestingly, in addition to inhibiting nucleocapsid assembly and subsequent viral genome replication, we have now demonstrated that HAPs and SBAs differentially modulate the biosynthesis of covalently closed circular (ccc) DNA from de novo infection and intracellular amplification pathways by inducing disassembly of nucleocapsids derived from virions as well as double-stranded DNA-containing progeny nucleocapsids in the cytoplasm. Specifically, the mistimed cuing of nucleocapsid uncoating prevents cccDNA formation during de novo infection of hepatocytes, while transiently accelerating cccDNA synthesis from cytoplasmic progeny nucleocapsids. Our studies indicate that elongation of positive-stranded DNA induces structural changes of nucleocapsids, which confers ability of mature nucleocapsids to bind CpAMs and triggers its disassembly. Understanding the molecular mechanism underlying the dual effects of the core protein allosteric modulators on nucleocapsid assembly and disassembly will facilitate the discovery of novel core protein-targeting antiviral agents that can more efficiently suppress cccDNA synthesis and cure chronic hepatitis B.
HBV core protein allosteric modulators differentially alter cccDNA biosynthesis from de novo infection and intracellular amplification pathways.

Directory of Open Access Journals (Sweden)

Fang Guo

2017-09-01

Full Text Available Hepatitis B virus (HBV core protein assembles viral pre-genomic (pg RNA and DNA polymerase into nucleocapsids for reverse transcriptional DNA replication to take place. Several chemotypes of small molecules, including heteroaryldihydropyrimidines (HAPs and sulfamoylbenzamides (SBAs, have been discovered to allosterically modulate core protein structure and consequentially alter the kinetics and pathway of core protein assembly, resulting in formation of irregularly-shaped core protein aggregates or "empty" capsids devoid of pre-genomic RNA and viral DNA polymerase. Interestingly, in addition to inhibiting nucleocapsid assembly and subsequent viral genome replication, we have now demonstrated that HAPs and SBAs differentially modulate the biosynthesis of covalently closed circular (ccc DNA from de novo infection and intracellular amplification pathways by inducing disassembly of nucleocapsids derived from virions as well as double-stranded DNA-containing progeny nucleocapsids in the cytoplasm. Specifically, the mistimed cuing of nucleocapsid uncoating prevents cccDNA formation during de novo infection of hepatocytes, while transiently accelerating cccDNA synthesis from cytoplasmic progeny nucleocapsids. Our studies indicate that elongation of positive-stranded DNA induces structural changes of nucleocapsids, which confers ability of mature nucleocapsids to bind CpAMs and triggers its disassembly. Understanding the molecular mechanism underlying the dual effects of the core protein allosteric modulators on nucleocapsid assembly and disassembly will facilitate the discovery of novel core protein-targeting antiviral agents that can more efficiently suppress cccDNA synthesis and cure chronic hepatitis B.
HBV core protein allosteric modulators differentially alter cccDNA biosynthesis from de novo infection and intracellular amplification pathways

Science.gov (United States)

Guo, Fang; Zhao, Qiong; Cheng, Junjun; Qi, Yonghe; Su, Qing; Wei, Lai; Li, Wenhui; Chang, Jinhong

2017-01-01

Hepatitis B virus (HBV) core protein assembles viral pre-genomic (pg) RNA and DNA polymerase into nucleocapsids for reverse transcriptional DNA replication to take place. Several chemotypes of small molecules, including heteroaryldihydropyrimidines (HAPs) and sulfamoylbenzamides (SBAs), have been discovered to allosterically modulate core protein structure and consequentially alter the kinetics and pathway of core protein assembly, resulting in formation of irregularly-shaped core protein aggregates or “empty” capsids devoid of pre-genomic RNA and viral DNA polymerase. Interestingly, in addition to inhibiting nucleocapsid assembly and subsequent viral genome replication, we have now demonstrated that HAPs and SBAs differentially modulate the biosynthesis of covalently closed circular (ccc) DNA from de novo infection and intracellular amplification pathways by inducing disassembly of nucleocapsids derived from virions as well as double-stranded DNA-containing progeny nucleocapsids in the cytoplasm. Specifically, the mistimed cuing of nucleocapsid uncoating prevents cccDNA formation during de novo infection of hepatocytes, while transiently accelerating cccDNA synthesis from cytoplasmic progeny nucleocapsids. Our studies indicate that elongation of positive-stranded DNA induces structural changes of nucleocapsids, which confers ability of mature nucleocapsids to bind CpAMs and triggers its disassembly. Understanding the molecular mechanism underlying the dual effects of the core protein allosteric modulators on nucleocapsid assembly and disassembly will facilitate the discovery of novel core protein-targeting antiviral agents that can more efficiently suppress cccDNA synthesis and cure chronic hepatitis B. PMID:28945802
Decision tree analysis to stratify risk of de novo non-melanoma skin cancer following liver transplantation.

Science.gov (United States)

Tanaka, Tomohiro; Voigt, Michael D

2018-03-01

Non-melanoma skin cancer (NMSC) is the most common de novo malignancy in liver transplant (LT) recipients; it behaves more aggressively and it increases mortality. We used decision tree analysis to develop a tool to stratify and quantify risk of NMSC in LT recipients. We performed Cox regression analysis to identify which predictive variables to enter into the decision tree analysis. Data were from the Organ Procurement Transplant Network (OPTN) STAR files of September 2016 (n = 102984). NMSC developed in 4556 of the 105984 recipients, a mean of 5.6 years after transplant. The 5/10/20-year rates of NMSC were 2.9/6.3/13.5%, respectively. Cox regression identified male gender, Caucasian race, age, body mass index (BMI) at LT, and sirolimus use as key predictive or protective factors for NMSC. These factors were entered into a decision tree analysis. The final tree stratified non-Caucasians as low risk (0.8%), and Caucasian males > 47 years, BMI decision tree model accurately stratifies the risk of developing NMSC in the long-term after LT.
Analysis of energy-based algorithms for RNA secondary structure prediction

Directory of Open Access Journals (Sweden)

Hajiaghayi Monir

2012-02-01

Full Text Available Abstract Background RNA molecules play critical roles in the cells of organisms, including roles in gene regulation, catalysis, and synthesis of proteins. Since RNA function depends in large part on its folded structures, much effort has been invested in developing accurate methods for prediction of RNA secondary structure from the base sequence. Minimum free energy (MFE predictions are widely used, based on nearest neighbor thermodynamic parameters of Mathews, Turner et al. or those of Andronescu et al. Some recently proposed alternatives that leverage partition function calculations find the structure with maximum expected accuracy (MEA or pseudo-expected accuracy (pseudo-MEA methods. Advances in prediction methods are typically benchmarked using sensitivity, positive predictive value and their harmonic mean, namely F-measure, on datasets of known reference structures. Since such benchmarks document progress in improving accuracy of computational prediction methods, it is important to understand how measures of accuracy vary as a function of the reference datasets and whether advances in algorithms or thermodynamic parameters yield statistically significant improvements. Our work advances such understanding for the MFE and (pseudo-MEA-based methods, with respect to the latest datasets and energy parameters. Results We present three main findings. First, using the bootstrap percentile method, we show that the average F-measure accuracy of the MFE and (pseudo-MEA-based algorithms, as measured on our largest datasets with over 2000 RNAs from diverse families, is a reliable estimate (within a 2% range with high confidence of the accuracy of a population of RNA molecules represented by this set. However, average accuracy on smaller classes of RNAs such as a class of 89 Group I introns used previously in benchmarking algorithm accuracy is not reliable enough to draw meaningful conclusions about the relative merits of the MFE and MEA-based algorithms
Dysplastic vs. Common Naevus-associated vs. De novo Melanomas: An Observational Retrospective Study of 1,021 Patients

Directory of Open Access Journals (Sweden)

Alejandro Martin-Gorgojo

2018-03-01

Full Text Available The aim of this case-case study was to determine the differences between dysplastic and common naevus-associated melanomas (NAM and de novo melanomas. A total of 1,021 prospectively collected patients with invasive cutaneous melanoma from an oncology referral centre were included in the study. Of these, 75.51% had de novo melanomas, 12.93% dysplastic NAM, and 11.56% common NAM. Dysplastic NAM, compared with de novo melanomas, were associated with intermittently photo-exposed sites, atypical melanocytic naevi, decreased tumour thickness, and presence of MC1R non-synonymous variants. Common NAM were more frequent on the trunk and of superficial spreading type. Comparison of dysplastic with common NAM showed significant difference only with regard to mitoses. Both subtypes of NAM shared less aggressive traits than de novo melanomas, albeit with no significant differences in survival after multivariate adjustment. In conclusion, NAM present with less aggressive traits, mostly due to a greater awareness among patients of changing moles than due to their intrinsic biological characteristics.
Novel structures of oxygen adsorbed on a Zr(0001) surface predicted from first principles

Energy Technology Data Exchange (ETDEWEB)

Gao, Bo [State Key Laboratory of Superhard Materials, Jilin University, Changchun, 130012 (China); Beijing computational science research center, Beijing,100084 (China); Wang, Jianyun [State Key Laboratory of Superhard Materials, Jilin University, Changchun, 130012 (China); Lv, Jian [State Key Laboratory of Superhard Materials, Jilin University, Changchun, 130012 (China); College of Materials Science and Engineering, Jilin University, Changchun, 130012 (China); Gao, Xingyu [Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, 100088 (China); CAEP Software Center for High Performance Numerical Simulation, Beijing, 100088 (China); Zhao, Yafan [CAEP Software Center for High Performance Numerical Simulation, Beijing, 100088 (China); Wang, Yanchao, E-mail: wyc@calypso.cn [State Key Laboratory of Superhard Materials, Jilin University, Changchun, 130012 (China); Beijing computational science research center, Beijing,100084 (China); College of Materials Science and Engineering, Jilin University, Changchun, 130012 (China); Song, Haifeng, E-mail: song_haifeng@iapcm.ac.cn [Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, 100088 (China); CAEP Software Center for High Performance Numerical Simulation, Beijing, 100088 (China); Ma, Yanming [State Key Laboratory of Superhard Materials, Jilin University, Changchun, 130012 (China); Beijing computational science research center, Beijing,100084 (China)

2017-01-30

Highlights: • Two stable structures of O adsorbed on a Zr(0001) surface are predicted with SLAM. • A stable structure of O adsorbed on a Zr(0001) surface is proposed with MLAM. • The calculated work function change is agreement with experimental value. - Abstract: The structures of O atoms adsorbed on a metal surface influence the metal properties significantly. Thus, studying O chemisorption on a Zr surface is of great interest. We investigated O adsorption on a Zr(0001) surface using our newly developed structure-searching method combined with first-principles calculations. A novel structural prototype with a unique combination of surface face-centered cubic (SFCC) and surface hexagonal close-packed (SHCP) O adsorption sites was predicted using a single-layer adsorption model (SLAM) for a 0.5 and 1.0 monolayer (ML) O coverage. First-principles calculations based on the SLAM revealed that the new predicted structures are energetically favorable compared with the well-known SFCC structures for a low O coverage (0.5 and 1.0 ML). Furthermore, on basis of our predicted SFCC + SHCP structures, a new structure within multi-layer adsorption model (MLAM) was proposed to be more stable at the O coverage of 1.0 ML, in which adsorbed O atoms occupy the SFCC + SHCP sites and the substitutional octahedral sites. The calculated work functions indicate that the SFCC + SHCP configuration has the lowest work function of all known structures at an O coverage of 0.5 ML within the SLAM, which agrees with the experimental trend of work function with variation in O coverage.
Probabilistic approaches to life prediction of nuclear plant structural components

International Nuclear Information System (INIS)

Villain, B.; Pitner, P.; Procaccia, H.

1996-01-01

In the last decade there has been an increasing interest at EDF in developing and applying probabilistic methods for a variety of purposes. In the field of structural integrity and reliability they are used to evaluate the effect of deterioration due to aging mechanisms, mainly on major passive structural components such as steam generators, pressure vessels and piping in nuclear plants. Because there can be numerous uncertainties involved in a assessment of the performance of these structural components, probabilistic methods. The benefits of a probabilistic approach are the clear treatment of uncertainly and the possibility to perform sensitivity studies from which it is possible to identify and quantify the effect of key factors and mitigative actions. They thus provide information to support effective decisions to optimize In-Service Inspection planning and maintenance strategies and for realistic lifetime prediction or reassessment. The purpose of the paper is to discuss and illustrate the methods available at EDF for probabilistic component life prediction. This includes a presentation of software tools in classical, Bayesian and structural reliability, and an application on two case studies (steam generator tube bundle, reactor pressure vessel). (authors)
Probabilistic approaches to life prediction of nuclear plant structural components

International Nuclear Information System (INIS)

Villain, B.; Pitner, P.; Procaccia, H.

1996-01-01

In the last decade there has been an increasing interest at EDF in developing and applying probabilistic methods for a variety of purposes. In the field of structural integrity and reliability they are used to evaluate the effect of deterioration due to aging mechanisms, mainly on major passive structural components such as steam generators, pressure vessels and piping in nuclear plants. Because there can be numerous uncertainties involved in an assessment of the performance of these structural components, probabilistic methods provide an attractive alternative or supplement to more conventional deterministic methods. The benefits of a probabilistic approach are the clear treatment of uncertainty and the possibility to perform sensitivity studies from which it is possible to identify and quantify the effect of key factors and mitigative actions. They thus provide information to support effective decisions to optimize In-Service Inspection planning and maintenance strategies and for realistic lifetime prediction or reassessment. The purpose of the paper is to discuss and illustrate the methods available at EDF for probabilistic component life prediction. This includes a presentation of software tools in classical, Bayesian and structural reliability, and an application on two case studies (steam generator tube bundle, reactor pressure vessel)
De novo identification of replication-timing domains in the human genome by deep learning.

Science.gov (United States)

Liu, Feng; Ren, Chao; Li, Hao; Zhou, Pingkun; Bo, Xiaochen; Shu, Wenjie

2016-03-01

The de novo identification of the initiation and termination zones-regions that replicate earlier or later than their upstream and downstream neighbours, respectively-remains a key challenge in DNA replication. Building on advances in deep learning, we developed a novel hybrid architecture combining a pre-trained, deep neural network and a hidden Markov model (DNN-HMM) for the de novo identification of replication domains using replication timing profiles. Our results demonstrate that DNN-HMM can significantly outperform strong, discriminatively trained Gaussian mixture model-HMM (GMM-HMM) systems and other six reported methods that can be applied to this challenge. We applied our trained DNN-HMM to identify distinct replication domain types, namely the early replication domain (ERD), the down transition zone (DTZ), the late replication domain (LRD) and the up transition zone (UTZ), using newly replicated DNA sequencing (Repli-Seq) data across 15 human cells. A subsequent integrative analysis revealed that these replication domains harbour unique genomic and epigenetic patterns, transcriptional activity and higher-order chromosomal structure. Our findings support the 'replication-domain' model, which states (1) that ERDs and LRDs, connected by UTZs and DTZs, are spatially compartmentalized structural and functional units of higher-order chromosomal structure, (2) that the adjacent DTZ-UTZ pairs form chromatin loops and (3) that intra-interactions within ERDs and LRDs tend to be short-range and long-range, respectively. Our model reveals an important chromatin organizational principle of the human genome and represents a critical step towards understanding the mechanisms regulating replication timing. Our DNN-HMM method and three additional algorithms can be freely accessed at https://github.com/wenjiegroup/DNN-HMM The replication domain regions identified in this study are available in GEO under the accession ID GSE53984. shuwj@bmi.ac.cn or boxc
A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

Science.gov (United States)

Spencer, Matt; Eickholt, Jesse; Jianlin Cheng

2015-01-01

Ab initio protein secondary structure (SS) predictions are utilized to generate tertiary structure predictions, which are increasingly demanded due to the rapid discovery of proteins. Although recent developments have slightly exceeded previous methods of SS prediction, accuracy has stagnated around 80 percent and many wonder if prediction cannot be advanced beyond this ceiling. Disciplines that have traditionally employed neural networks are experimenting with novel deep learning techniques in attempts to stimulate progress. Since neural networks have historically played an important role in SS prediction, we wanted to determine whether deep learning could contribute to the advancement of this field as well. We developed an SS predictor that makes use of the position-specific scoring matrix generated by PSI-BLAST and deep learning network architectures, which we call DNSS. Graphical processing units and CUDA software optimize the deep network architecture and efficiently train the deep networks. Optimal parameters for the training process were determined, and a workflow comprising three separately trained deep networks was constructed in order to make refined predictions. This deep learning network approach was used to predict SS for a fully independent test dataset of 198 proteins, achieving a Q3 accuracy of 80.7 percent and a Sov accuracy of 74.2 percent.
Improving protein fold recognition and structural class prediction accuracies using physicochemical properties of amino acids.

Science.gov (United States)

Raicar, Gaurav; Saini, Harsh; Dehzangi, Abdollah; Lal, Sunil; Sharma, Alok

2016-08-07

Predicting the three-dimensional (3-D) structure of a protein is an important task in the field of bioinformatics and biological sciences. However, directly predicting the 3-D structure from the primary structure is hard to achieve. Therefore, predicting the fold or structural class of a protein sequence is generally used as an intermediate step in determining the protein's 3-D structure. For protein fold recognition (PFR) and structural class prediction (SCP), two steps are required - feature extraction step and classification step. Feature extraction techniques generally utilize syntactical-based information, evolutionary-based information and physicochemical-based information to extract features. In this study, we explore the importance of utilizing the physicochemical properties of amino acids for improving PFR and SCP accuracies. For this, we propose a Forward Consecutive Search (FCS) scheme which aims to strategically select physicochemical attributes that will supplement the existing feature extraction techniques for PFR and SCP. An exhaustive search is conducted on all the existing 544 physicochemical attributes using the proposed FCS scheme and a subset of physicochemical attributes is identified. Features extracted from these selected attributes are then combined with existing syntactical-based and evolutionary-based features, to show an improvement in the recognition and prediction performance on benchmark datasets. Copyright © 2016 Elsevier Ltd. All rights reserved.
The NOVO Network: the original scientific basis for its establishment and our R&D vision

OpenAIRE

Winkel, Jørgen; Edwards, Kasper; Dellve, L.; Schiller, B.; Westgaard, Rolf H.

2017-01-01

The NOVO network is a Nordic non-governmental professional association whose aims are to foster the scientific progress, knowledge and development of the working environment within Healthcare as an integrated part of production system development. The vision is a “Nordic Model for Sustainable Systems” in the healthcare sector. It was founded in 2006 in Copenhagen and was financially supported by the Nordic Council of Ministers from 2007 to 2015. The motivation to establish the NOVO Network ar...
Predicting Reactive Transport Dynamics in Carbonates using Initial Pore Structure

Science.gov (United States)

Menke, H. P.; Nunes, J. P. P.; Blunt, M. J.

2017-12-01

Understanding rock-fluid interaction at the pore-scale is imperative for accurate predictive modelling of carbon storage permanence. However, coupled reactive transport models are computationally expensive, requiring either a sacrifice of resolution or high performance computing to solve relatively simple geometries. Many recent studies indicate that initial pore structure many be the dominant mechanism in determining the dissolution regime. Here we investigate how well the initial pore structure is predictive of distribution and amount of dissolution during reactive flow using particle tracking on the initial image. Two samples of carbonate rock with varying initial pore space heterogeneity were reacted with reservoir condition CO2-saturated brine and scanned dynamically during reactive flow at a 4-μm resolution between 4 and 40 times using 4D X-ray micro-tomography over the course of 1.5 hours using μ-CT. Flow was modelled on the initial binarized image using a Navier-Stokes solver. Particle tracking was then run on the velocity fields, the streamlines were traced, and the streamline density was calculated both on a voxel-by-voxel and a channel-by-channel basis. The density of streamlines was then compared to the amount of dissolution in subsequent time steps during reaction. It was found that for the flow and transport regimes studied, the streamline density distribution in the initial image accurately predicted the dominant pathways of dissolution and gave good indicators of the type of dissolution regime that would later develop. This work suggests that the eventual reaction-induced changes in pore structure are deterministic rather than stochastic and can be predicted with high resolution imaging of unreacted rock.
Molecular phylogeny and predicted 3D structure of plant beta-D-N-acetylhexosaminidase.

Science.gov (United States)

Hossain, Md Anowar; Roslan, Hairul Azman

2014-01-01

beta-D-N-Acetylhexosaminidase, a family 20 glycosyl hydrolase, catalyzes the removal of β-1,4-linked N-acetylhexosamine residues from oligosaccharides and their conjugates. We constructed phylogenetic tree of β-hexosaminidases to analyze the evolutionary history and predicted functions of plant hexosaminidases. Phylogenetic analysis reveals the complex history of evolution of plant β-hexosaminidase that can be described by gene duplication events. The 3D structure of tomato β-hexosaminidase (β-Hex-Sl) was predicted by homology modeling using 1now as a template. Structural conformity studies of the best fit model showed that more than 98% of the residues lie inside the favoured and allowed regions where only 0.9% lie in the unfavourable region. Predicted 3D structure contains 531 amino acids residues with glycosyl hydrolase20b domain-I and glycosyl hydrolase20 superfamily domain-II including the (β/α)8 barrel in the central part. The α and β contents of the modeled structure were found to be 33.3% and 12.2%, respectively. Eleven amino acids were found to be involved in ligand-binding site; Asp(330) and Glu(331) could play important roles in enzyme-catalyzed reactions. The predicted model provides a structural framework that can act as a guide to develop a hypothesis for β-Hex-Sl mutagenesis experiments for exploring the functions of this class of enzymes in plant kingdom.

Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

Science.gov (United States)

Kisand, Veljo; Lettieri, Teresa

2013-04-01

De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize
Predicting effects of noncoding variants with deep learning-based sequence model.

Science.gov (United States)

Zhou, Jian; Troyanskaya, Olga G

2015-10-01

Identifying functional effects of noncoding variants is a major challenge in human genetics. To predict the noncoding-variant effects de novo from sequence, we developed a deep learning-based algorithmic framework, DeepSEA (http://deepsea.princeton.edu/), that directly learns a regulatory sequence code from large-scale chromatin-profiling data, enabling prediction of chromatin effects of sequence alterations with single-nucleotide sensitivity. We further used this capability to improve prioritization of functional variants including expression quantitative trait loci (eQTLs) and disease-associated variants.
Protein Function Prediction Based on Sequence and Structure Information

KAUST Repository

Smaili, Fatima Z.

2016-05-25

The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.
Characterization and analysis of a de novo transcriptome from the pygmy grasshopper Tetrix japonica.

Science.gov (United States)

Qiu, Zhongying; Liu, Fei; Lu, Huimeng; Huang, Yuan

2017-05-01

The pygmy grasshopper Tetrix japonica is a common insect distributed throughout the world, and it has the potential for use in studies of body colour polymorphism, genomics and the biology of Tetrigoidea (Insecta: Orthoptera). However, limited biological information is available for this insect. Here, we conducted a de novo transcriptome study of adult and larval T. japonica to provide a better understanding of its gene expression and develop genomic resources for future work. We sequenced and explored the characteristics of the de novo transcriptome of T. japonica using Illumina HiSeq 2000 platform. A total of 107 608 206 paired-end clean reads were assembled into 61 141 unigenes using the trinity software; the mean unigene size was 771 bp, and the N50 length was 1238 bp. A total of 29 225 unigenes were functionally annotated to the NCBI nonredundant protein sequences (Nr), NCBI nonredundant nucleotide sequences (Nt), a manually annotated and reviewed protein sequence database (Swiss-Prot), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of putative genes that are potentially involved in pigment pathways, juvenile hormone (JH) metabolism and signalling pathways were identified in the T. japonica transcriptome. Additionally, 165 769 and 156 796 putative single nucleotide polymorphisms occurred in the adult and larvae transcriptomes, respectively, and a total of 3162 simple sequence repeats were detected in this assembly. This comprehensive transcriptomic data for T. japonica will provide a usable resource for gene predictions, signalling pathway investigations and molecular marker development for this species and other pygmy grasshoppers. © 2016 John Wiley & Sons Ltd.
The sequential structure of brain activation predicts skill.

Science.gov (United States)

Anderson, John R; Bothell, Daniel; Fincham, Jon M; Moon, Jungaa

2016-01-29

In an fMRI study, participants were trained to play a complex video game. They were scanned early and then again after substantial practice. While better players showed greater activation in one region (right dorsal striatum) their relative skill was better diagnosed by considering the sequential structure of whole brain activation. Using a cognitive model that played this game, we extracted a characterization of the mental states that are involved in playing a game and the statistical structure of the transitions among these states. There was a strong correspondence between this measure of sequential structure and the skill of different players. Using multi-voxel pattern analysis, it was possible to recognize, with relatively high accuracy, the cognitive states participants were in during particular scans. We used the sequential structure of these activation-recognized states to predict the skill of individual players. These findings indicate that important features about information-processing strategies can be identified from a model-based analysis of the sequential structure of brain activation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Fast computational methods for predicting protein structure from primary amino acid sequence

Science.gov (United States)

Agarwal, Pratul Kumar [Knoxville, TN

2011-07-19

The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
A de novo designed 11 kDa polypeptide: model for amyloidogenic intrinsically disordered proteins.

Science.gov (United States)

Topilina, Natalya I; Ermolenkov, Vladimir V; Sikirzhytski, Vitali; Higashiya, Seiichiro; Lednev, Igor K; Welch, John T

2010-07-01

A de novo polypeptide GH(6)[(GA)(3)GY(GA)(3)GE](8)GAH(6) (YE8) has a significant number of identical weakly interacting beta-strands with the turns and termini functionalized by charged amino acids to control polypeptide folding and aggregation. YE8 exists in a soluble, disordered form at neutral pH but is responsive to changes in pH and ionic strength. The evolution of YE8 secondary structure has been successfully quantified during all stages of polypeptide fibrillation by deep UV resonance Raman (DUVRR) spectroscopy combined with other morphological, structural, spectral, and tinctorial characterization. The YE8 folding kinetics at pH 3.5 are strongly dependent on polypeptide concentration with a lag phase that can be eliminated by seeding with a solution of folded fibrillar YE8. The lag phase of polypeptide folding is concentration dependent leading to the conclusion that beta-sheet folding of the 11-kDa amyloidogenic polypeptide is completely aggregation driven.
Prognostic factors in de novo myelodysplastic syndrome in young and middle-aged people

Directory of Open Access Journals (Sweden)

Наталья Николаевна Климкович

2015-01-01

Full Text Available We spent multivariate analysis of clinical and laboratory parameters for the prediction of de-novo myelodysplastic syndromes (MDS patients aged 18-60 years. The results of clinical application of prognostic systems in MDS show that there is a large variability within individual risk groups, especially at low-risk MDS. So now hematologists conduct research aimed at identifying additional adverse risk MDS. This is done so that patients with low-risk MDS embodiments and unfavorable prognosis could benefit from early therapeutic intervention, and not only be clinician monitored until disease progression. We found that additional adverse risk factors for the development of MDS are the expression of CD95 in bone marrow ≤40 % and FLT3≥60 %. The expression level of CD95 in bone marrow cells≤40 % and FLT3≥60 % can be considered as a prognostic marker progression of MDS and time start specific therapy
Bayesian Inference using Neural Net Likelihood Models for Protein Secondary Structure Prediction

Directory of Open Access Journals (Sweden)

Seong-Gon Kim

2011-06-01

Full Text Available Several techniques such as Neural Networks, Genetic Algorithms, Decision Trees and other statistical or heuristic methods have been used to approach the complex non-linear task of predicting Alpha-helicies, Beta-sheets and Turns of a proteins secondary structure in the past. This project introduces a new machine learning method by using an offline trained Multilayered Perceptrons (MLP as the likelihood models within a Bayesian Inference framework to predict secondary structures proteins. Varying window sizes are used to extract neighboring amino acid information and passed back and forth between the Neural Net models and the Bayesian Inference process until there is a convergence of the posterior secondary structure probability.
Spontaneous de novo vaginal adenosis resembling Bartholinâ€™s ...

African Journals Online (AJOL)

Adebayo Alade Adewole

Spontaneous de novo vaginal adenosis resembling Bartholin's cyst: A case report ... 6 by 5 cm. The cervix, uterus, adnexa and Pouch of Douglas (POD) were normal. .... of vaginal cancer.2–4 Although, DES exposed daughters have an.
The prediction and discovery of Rayleigh line fine structure

International Nuclear Information System (INIS)

Fabelinskii, Immanuil L

2000-01-01

The history of the theoretical prediction and experimental discovery of the Rayleigh line fine structure (which belongs to one of the most important phenomena in optics and physics of condensed matter) is discussed along with the history of first publications concerning this topic. (from the history of physics)
Model Predictive Vibration Control Efficient Constrained MPC Vibration Control for Lightly Damped Mechanical Structures

CERN Document Server

Takács, Gergely

2012-01-01

Real-time model predictive controller (MPC) implementation in active vibration control (AVC) is often rendered difficult by fast sampling speeds and extensive actuator-deformation asymmetry. If the control of lightly damped mechanical structures is assumed, the region of attraction containing the set of allowable initial conditions requires a large prediction horizon, making the already computationally demanding on-line process even more complex. Model Predictive Vibration Control provides insight into the predictive control of lightly damped vibrating structures by exploring computationally efficient algorithms which are capable of low frequency vibration control with guaranteed stability and constraint feasibility. In addition to a theoretical primer on active vibration damping and model predictive control, Model Predictive Vibration Control provides a guide through the necessary steps in understanding the founding ideas of predictive control applied in AVC such as: · the implementation of ...
First Principles Prediction of Structure, Structure Selectivity, and Thermodynamic Stability under Realistic Conditions

Energy Technology Data Exchange (ETDEWEB)

Ceder, Gerbrand [Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States). Dept. of Materials and Engineering

2018-01-28

Novel materials are often the enabler for new energy technologies. In ab-initio computational materials science, method are developed to predict the behavior of materials starting from the laws of physics, so that properties can be predicted before compounds have to be synthesized and tested. As such, a virtual materials laboratory can be constructed, saving time and money. The objectives of this program were to develop first-principles theory to predict the structure and thermodynamic stability of materials. Since its inception the program focused on the development of the cluster expansion to deal with the increased complexity of complex oxides. This research led to the incorporation of vibrational degrees of freedom in ab-initio thermodynamics, developed methods for multi-component cluster expansions, included the explicit configurational degrees of freedom of localized electrons, developed the formalism for stability in aqueous environments, and culminated in the first ever approach to produce exact ground state predictions of the cluster expansion. Many of these methods have been disseminated to the larger theory community through the Materials Project, pymatgen software, or individual codes. We summarize three of the main accomplishments.
Predicting and validating protein interactions using network structure.

Directory of Open Access Journals (Sweden)

Pao-Yang Chen

2008-07-01

Full Text Available Protein interactions play a vital part in the function of a cell. As experimental techniques for detection and validation of protein interactions are time consuming, there is a need for computational methods for this task. Protein interactions appear to form a network with a relatively high degree of local clustering. In this paper we exploit this clustering by suggesting a score based on triplets of observed protein interactions. The score utilises both protein characteristics and network properties. Our score based on triplets is shown to complement existing techniques for predicting protein interactions, outperforming them on data sets which display a high degree of clustering. The predicted interactions score highly against test measures for accuracy. Compared to a similar score derived from pairwise interactions only, the triplet score displays higher sensitivity and specificity. By looking at specific examples, we show how an experimental set of interactions can be enriched and validated. As part of this work we also examine the effect of different prior databases upon the accuracy of prediction and find that the interactions from the same kingdom give better results than from across kingdoms, suggesting that there may be fundamental differences between the networks. These results all emphasize that network structure is important and helps in the accurate prediction of protein interactions. The protein interaction data set and the program used in our analysis, and a list of predictions and validations, are available at http://www.stats.ox.ac.uk/bioinfo/resources/PredictingInteractions.
Artificial Intelligence in Prediction of Secondary Protein Structure Using CB513 Database

Science.gov (United States)

Avdagic, Zikrija; Purisevic, Elvir; Omanovic, Samir; Coralic, Zlatan

2009-01-01

In this paper we describe CB513 a non-redundant dataset, suitable for development of algorithms for prediction of secondary protein structure. A program was made in Borland Delphi for transforming data from our dataset to make it suitable for learning of neural network for prediction of secondary protein structure implemented in MATLAB Neural-Network Toolbox. Learning (training and testing) of neural network is researched with different sizes of windows, different number of neurons in the hidden layer and different number of training epochs, while using dataset CB513. PMID:21347158
Prediction of protein–protein interactions: unifying evolution and structure at protein interfaces

International Nuclear Information System (INIS)

Tuncbag, Nurcan; Gursoy, Attila; Keskin, Ozlem

2011-01-01

The vast majority of the chores in the living cell involve protein–protein interactions. Providing details of protein interactions at the residue level and incorporating them into protein interaction networks are crucial toward the elucidation of a dynamic picture of cells. Despite the rapid increase in the number of structurally known protein complexes, we are still far away from a complete network. Given experimental limitations, computational modeling of protein interactions is a prerequisite to proceed on the way to complete structural networks. In this work, we focus on the question 'how do proteins interact?' rather than 'which proteins interact?' and we review structure-based protein–protein interaction prediction approaches. As a sample approach for modeling protein interactions, PRISM is detailed which combines structural similarity and evolutionary conservation in protein interfaces to infer structures of complexes in the protein interaction network. This will ultimately help us to understand the role of protein interfaces in predicting bound conformations
The Prediction of Botulinum Toxin Structure Based on in Silico and in Vitro Analysis

Science.gov (United States)

Suzuki, Tomonori; Miyazaki, Satoru

2011-01-01

Many of biological system mediated through protein-protein interactions. Knowledge of protein-protein complex structure is required for understanding the function. The determination of huge size and flexible protein-protein complex structure by experimental studies remains difficult, costly and five-consuming, therefore computational prediction of protein structures by homolog modeling and docking studies is valuable method. In addition, MD simulation is also one of the most powerful methods allowing to see the real dynamics of proteins. Here, we predict protein-protein complex structure of botulinum toxin to analyze its property. These bioinformatics methods are useful to report the relation between the flexibility of backbone structure and the activity.
Immobilization of cadmium in soils by UV-mutated Bacillus subtilis 38 bioaugmentation and NovoGro amendment

International Nuclear Information System (INIS)

Jiang Chunxiao; Sun Hongwen; Sun Tieheng; Zhang Qingmin; Zhang Yanfeng

2009-01-01

Immobilization of cadmium (10 mg Cd per kilogram soil) in soil by bioaugmentation of a UV-mutated microorganism, Bacillus subtilis 38 accompanied with amendment of a bio-fertilizer, NovoGro was investigated using extractable cadmium (E-Cd) by DTPA. B. subtilis 38, the mutant with the strongest resistance against Cd, could bioaccumulate Cd four times greater than the original wild type. Single bioaugmentation of B. subtilis 38 (SB treatment) to soil however did not reduce E-Cd significantly, while the amendment of NovoGro (SN treatment) reduced E-Cd remarkably. Simultaneous application of B. subtilis 38 and NovoGro (SNB treatment) exhibited a synergetic effect compared to the single SB and SN treatment. The immobilization effect was significantly affected by temperature, soil moisture, and pH. It seems that the immobilization on Cd reached the maximum when environmental conditions favored the activity of microorganisms. Under the optimum conditions, after 90 days incubation, E-Cd was 3.34, 3.39, 2.25 and 0.87 mg kg -1 in the control soil, SB, SN and SNB soils, respectively. NovoGro not only showed a great capacity for Cd adsorption, but also promoted the growth of B. subtilis 38. This study provides a potential cost-effective technique for in situ remediation of Cd contaminated soils with bioaugmentation.
Less-structured time in children's daily lives predicts self-directed executive functioning.

Science.gov (United States)

Barker, Jane E; Semenov, Andrei D; Michaelson, Laura; Provan, Lindsay S; Snyder, Hannah R; Munakata, Yuko

2014-01-01

Executive functions (EFs) in childhood predict important life outcomes. Thus, there is great interest in attempts to improve EFs early in life. Many interventions are led by trained adults, including structured training activities in the lab, and less-structured activities implemented in schools. Such programs have yielded gains in children's externally-driven executive functioning, where they are instructed on what goal-directed actions to carry out and when. However, it is less clear how children's experiences relate to their development of self-directed executive functioning, where they must determine on their own what goal-directed actions to carry out and when. We hypothesized that time spent in less-structured activities would give children opportunities to practice self-directed executive functioning, and lead to benefits. To investigate this possibility, we collected information from parents about their 6-7 year-old children's daily, annual, and typical schedules. We categorized children's activities as "structured" or "less-structured" based on categorization schemes from prior studies on child leisure time use. We assessed children's self-directed executive functioning using a well-established verbal fluency task, in which children generate members of a category and can decide on their own when to switch from one subcategory to another. The more time that children spent in less-structured activities, the better their self-directed executive functioning. The opposite was true of structured activities, which predicted poorer self-directed executive functioning. These relationships were robust (holding across increasingly strict classifications of structured and less-structured time) and specific (time use did not predict externally-driven executive functioning). We discuss implications, caveats, and ways in which potential interpretations can be distinguished in future work, to advance an understanding of this fundamental aspect of growing up.
De novo nonsense mutations in ASXL1 cause Bohring-Opitz syndrome

NARCIS (Netherlands)

Hoischen, Alexander; van Bon, Bregje W. M.; Rodríguez-Santiago, Benjamín; Gilissen, Christian; Vissers, Lisenka E. L. M.; de Vries, Petra; Janssen, Irene; van Lier, Bart; Hastings, Rob; Smithson, Sarah F.; Newbury-Ecob, Ruth; Kjaergaard, Susanne; Goodship, Judith; McGowan, Ruth; Bartholdi, Deborah; Rauch, Anita; Peippo, Maarit; Cobben, Jan M.; Wieczorek, Dagmar; Gillessen-Kaesbach, Gabriele; Veltman, Joris A.; Brunner, Han G.; de Vries, Bert B. B. A.

2011-01-01

Bohring-Opitz syndrome is characterized by severe intellectual disability, distinctive facial features and multiple congenital malformations. We sequenced the exomes of three individuals with Bohring-Opitz syndrome and in each identified heterozygous de novo nonsense mutations in ASXL1, which is

Human native lipoprotein-induced de novo DNA methylation is associated with repression of inflammatory genes in THP-1 macrophages.

Science.gov (United States)

Rangel-Salazar, Rubén; Wickström-Lindholm, Marie; Aguilar-Salinas, Carlos A; Alvarado-Caudillo, Yolanda; Døssing, Kristina B V; Esteller, Manel; Labourier, Emmanuel; Lund, Gertrud; Nielsen, Finn C; Rodríguez-Ríos, Dalia; Solís-Martínez, Martha O; Wrobel, Katarzyna; Wrobel, Kazimierz; Zaina, Silvio

2011-11-25

We previously showed that a VLDL- and LDL-rich mix of human native lipoproteins induces a set of repressive epigenetic marks, i.e. de novo DNA methylation, histone 4 hypoacetylation and histone 4 lysine 20 (H4K20) hypermethylation in THP-1 macrophages. Here, we: 1) ask what gene expression changes accompany these epigenetic responses; 2) test the involvement of candidate factors mediating the latter. We exploited genome expression arrays to identify target genes for lipoprotein-induced silencing, in addition to RNAi and expression studies to test the involvement of candidate mediating factors. The study was conducted in human THP-1 macrophages. Native lipoprotein-induced de novo DNA methylation was associated with a general repression of various critical genes for macrophage function, including pro-inflammatory genes. Lipoproteins showed differential effects on epigenetic marks, as de novo DNA methylation was induced by VLDL and to a lesser extent by LDL, but not by HDL, and VLDL induced H4K20 hypermethylation, while HDL caused H4 deacetylation. The analysis of candidate factors mediating VLDL-induced DNA hypermethylation revealed that this response was: 1) surprisingly, mediated exclusively by the canonical maintenance DNA methyltransferase DNMT1, and 2) independent of the Dicer/micro-RNA pathway. Our work provides novel insights into epigenetic gene regulation by native lipoproteins. Furthermore, we provide an example of DNMT1 acting as a de novo DNA methyltransferase independently of canonical de novo enzymes, and show proof of principle that de novo DNA methylation can occur independently of a functional Dicer/micro-RNA pathway in mammals.
Mesoscopic structure prediction of nanoparticle assembly and coassembly: Theoretical foundation

KAUST Repository

Hur, Kahyun; Hennig, Richard G.; Escobedo, Fernando A.; Wiesner, Ulrich

2010-01-01

structures and interactions. We validate our approach by comparing its predictions with previous simulation results for model systems. We illustrate the flexibility of our approach by applying it to hybrid systems composed of block copolymers and ligand
De novo complex intra chromosomal rearrangement after ICSI: characterisation by BACs micro array-CGH

Directory of Open Access Journals (Sweden)

Quimsiyeh Mazin

2008-12-01

Full Text Available Abstract Background In routine Assisted Reproductive Technology (ART men with severe oligozoospermia or azoospermia should be informed about the risk of de novo congenital or chromosomal abnormalities in ICSI program. Also the benefits of preimplantation or prenatal genetic diagnosis practice need to be explained to the couple. Methods From a routine ICSI attempt, using ejaculated sperm from male with severe oligozoospermia and having normal karyotype, a 30 years old pregnant woman was referred to prenatal diagnosis in the 17th week for bichorionic biamniotic twin gestation. Amniocentesis was performed because of the detection of an increased foetal nuchal translucency for one of the fetus by the sonographic examination during the 12th week of gestation (WG. Chromosome and DNA studies of the fetus were realized on cultured amniocytes Results Conventional, molecular cytogenetic and microarray CGH experiments allowed us to conclude that the fetus had a de novo pericentromeric inversion associated with a duplication of the 9p22.1-p24 chromosomal region, 46,XY,invdup(9(p22.1p24 [arrCGH 9p22.1p24 (RP11-130C19 → RP11-87O1x3]. As containing the critical 9p22 region, our case is in coincidence with the general phenotype features of the partial trisomy 9p syndrome with major growth retardation, microcephaly and microretrognathia. Conclusion This de novo complex chromosome rearrangement illustrates the possible risk of chromosome or gene defects in ICSI program and the contribution of array-CGH for mapping rapidly de novo chromosomal imbalance.
De novo and salvage pathway precursor incorporation during DNA replication at the nuclear matrix

International Nuclear Information System (INIS)

Panzeter, P.L.

1988-01-01

Total nuclear DNA can be empirically subdivided into low salt-soluble (LS) DNA (75-80%), high salt-soluble (HS) DNA (18-23%), and nuclear matrix-associated (NM) DNA which remains tightly bound to the nuclear matrix (∼2%). The most-newly replicated DNA is that associated with the nuclear matrix in regenerating rat liver. Analyses of the DNA fractions after various pulse times revealed that the salvage and de novo pathway DNA precursors investigated were incorporated preferentially into NM-DNA at early pulse times, after which the radioactivity became progressively incorporated into HS- and LS-DNA, respectively. These results support two models of nuclear matrix-associated DNA replication, proposed previously, and a third model presented in this dissertation. In addition, the incorporation of de novo pathway precursors lagged significantly (> 10 minutes) behind the incorporation of precursors entering through the salvage pathway. Channeling of salvage pathway precursors to DNA replication sites would explain the more rapid uptake of salvage precursors into NM-DNA than de novo precursors. To investigate the possibility of this heretofore in vitro phenomenon, the incorporation of the salvage precursor, ( 3 H)deoxythymidine, and the de novo precursor, ( 14 C)orotic acid, into NM-DNA and dTTP was examined in regenerating rat liver. There was no significant difference between the incorporation pattern of ( 14 C)orotic acid into NM-DNA thymine and that of ( 14 C)orotic acid into soluble dTTP. Contrastingly, the salvage pathway precursor, ( 3 H)deoxythymidine, labeled NM-DNA before labeling the dTTP pool
Molecular Phylogeny and Predicted 3D Structure of Plant beta-D-N-Acetylhexosaminidase

Directory of Open Access Journals (Sweden)

Md. Anowar Hossain

2014-01-01

Full Text Available beta-D-N-Acetylhexosaminidase, a family 20 glycosyl hydrolase, catalyzes the removal of β-1,4-linked N-acetylhexosamine residues from oligosaccharides and their conjugates. We constructed phylogenetic tree of β-hexosaminidases to analyze the evolutionary history and predicted functions of plant hexosaminidases. Phylogenetic analysis reveals the complex history of evolution of plant β-hexosaminidase that can be described by gene duplication events. The 3D structure of tomato β-hexosaminidase (β-Hex-Sl was predicted by homology modeling using 1now as a template. Structural conformity studies of the best fit model showed that more than 98% of the residues lie inside the favoured and allowed regions where only 0.9% lie in the unfavourable region. Predicted 3D structure contains 531 amino acids residues with glycosyl hydrolase20b domain-I and glycosyl hydrolase20 superfamily domain-II including the (β/α8 barrel in the central part. The α and β contents of the modeled structure were found to be 33.3% and 12.2%, respectively. Eleven amino acids were found to be involved in ligand-binding site; Asp(330 and Glu(331 could play important roles in enzyme-catalyzed reactions. The predicted model provides a structural framework that can act as a guide to develop a hypothesis for β-Hex-Sl mutagenesis experiments for exploring the functions of this class of enzymes in plant kingdom.
Improved hybrid optimization algorithm for 3D protein structure prediction.

Science.gov (United States)

Zhou, Changjun; Hou, Caixia; Wei, Xiaopeng; Zhang, Qiang

2014-07-01

A new improved hybrid optimization algorithm - PGATS algorithm, which is based on toy off-lattice model, is presented for dealing with three-dimensional protein structure prediction problems. The algorithm combines the particle swarm optimization (PSO), genetic algorithm (GA), and tabu search (TS) algorithms. Otherwise, we also take some different improved strategies. The factor of stochastic disturbance is joined in the particle swarm optimization to improve the search ability; the operations of crossover and mutation that are in the genetic algorithm are changed to a kind of random liner method; at last tabu search algorithm is improved by appending a mutation operator. Through the combination of a variety of strategies and algorithms, the protein structure prediction (PSP) in a 3D off-lattice model is achieved. The PSP problem is an NP-hard problem, but the problem can be attributed to a global optimization problem of multi-extremum and multi-parameters. This is the theoretical principle of the hybrid optimization algorithm that is proposed in this paper. The algorithm combines local search and global search, which overcomes the shortcoming of a single algorithm, giving full play to the advantage of each algorithm. In the current universal standard sequences, Fibonacci sequences and real protein sequences are certified. Experiments show that the proposed new method outperforms single algorithms on the accuracy of calculating the protein sequence energy value, which is proved to be an effective way to predict the structure of proteins.
Melhoramento do cafeeiro: XLII. Produtividade de progênies derivadas de hibridação dos cultivares Laurina e Mundo Novo Coffee breeding: XLII. Yield of progenies from crosses of Laurina and Mundo Novo cultivars of Coffea arabica L.

Directory of Open Access Journals (Sweden)

Alcides Carvalho

1988-01-01

Full Text Available O cultivar Laurina de Coffea arabica L. caracteriza-se pelo pequeno porte, folhas de dimensões reduzidas, frutos afilados na base, sementes pequenas e afiladas, pequeno rendimento e reduzida produção. Apresenta, no entanto, bebida de boa qualidade e baixo teor de cafeína nas sementes. Suas principais características são controladas pela ação de um par de alelos recessivos lrlr, de acentuado efeito pleiotrópico. Devido ao atual interesse do comércio por produto de baixo teor de cafeína, iniciaram-se pesquisas tendo em vista principalmente aumentar a produtividade do 'Laurina'. Para esse fim, realizaram-se numerosas hibridações de cafeeiros do 'Laurina' com os do 'Mundo Novo' (Coffea arabica e, posteriormente, retrocruzamentos com o 'Mundo Novo'. Estudaram-se as progênies F2 e retrocruzamentos com o 'Mundo Novo' (RC em Campinas, em um experimento, anotando-se as produções por oito anos consecutivos. Separaram-se algumas progênies F2 em dois grupos, antes do plantio: normais (LrLr,Lrlr e laurina (Irlr. Como testemunhas, usaram-se progênies do 'Mundo Novo' e 'Catuaí Amarelo' de C. arabica. O conjunto de plantas F2 do grupo laurina e os retrocruzamentos tiveram produção média maior do que as plantas F2 normais, porém menor do que as testemunhas. Alguns retrocruzamentos e progênies F2 apresentaram plantas com razoável produtividade, indicando que, através de retrocruzamentos com o 'Mundo Novo', podem-se obter novos tipos comerciais com as características morfológicas do 'Laurina'. Fizeram-se considerações sobre a melhor capacidade de combinação do 'Laurina' com algumas seleções do 'Mundo Novo'.The Laurina cultivars of Coffea arabica L. has a reduced plant size, small leaves, small and pointed seeds and low yield capacity. However the seeds have a good cup quality and the desirable characteristic of low caffeine content The Laurina phenotype is supposed to be controlled by a pair of recessive alleles lrlr, with
CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction

Science.gov (United States)

Puton, Tomasz; Kozlowski, Lukasz P.; Rother, Kristian M.; Bujnicki, Janusz M.

2013-01-01

We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks. PMID:23435231
CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction.

Science.gov (United States)

Puton, Tomasz; Kozlowski, Lukasz P; Rother, Kristian M; Bujnicki, Janusz M

2013-04-01

We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks.
CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction

KAUST Repository

Cui, Xuefeng

2016-06-15

Motivation: Protein homology detection, a fundamental problem in computational biology, is an indispensable step toward predicting protein structures and understanding protein functions. Despite the advances in recent decades on sequence alignment, threading and alignment-free methods, protein homology detection remains a challenging open problem. Recently, network methods that try to find transitive paths in the protein structure space demonstrate the importance of incorporating network information of the structure space. Yet, current methods merge the sequence space and the structure space into a single space, and thus introduce inconsistency in combining different sources of information. Method: We present a novel network-based protein homology detection method, CMsearch, based on cross-modal learning. Instead of exploring a single network built from the mixture of sequence and structure space information, CMsearch builds two separate networks to represent the sequence space and the structure space. It then learns sequence–structure correlation by simultaneously taking sequence information, structure information, sequence space information and structure space information into consideration. Results: We tested CMsearch on two challenging tasks, protein homology detection and protein structure prediction, by querying all 8332 PDB40 proteins. Our results demonstrate that CMsearch is insensitive to the similarity metrics used to define the sequence and the structure spaces. By using HMM–HMM alignment as the sequence similarity metric, CMsearch clearly outperforms state-of-the-art homology detection methods and the CASP-winning template-based protein structure prediction methods.
Child Support Payment: A Structural Model of Predictive Variables.

Science.gov (United States)

Wright, David W.; Price, Sharon J.

A major area of concern in divorced families is compliance with child support payments. Aspects of the former spouse relationship that are predictive of compliance with court-ordered payment of child support were investigated in a sample of 58 divorced persons all of whom either paid or received child support. Structured interviews and…
Demonstration of de novo synthesis of enzymes by density labelling with stable isotopes

International Nuclear Information System (INIS)

Huebner, G.; Hirschberg, K.

1977-01-01

The technique of in vivo density labelling of proteins with H 2 18 O and 2 H 2 O has been used to investigate hormonal regulation and developmental expression of enzymes in plant cells. Buoyant density data obtained from isopycnic equilibrium centrifugation demonstrated that the cytokinine-induced nitrate reductase activity and the gibberellic acid-induced phosphatase activity in isolated embryos of Agrostemma githago are activities of enzymes synthesized de novo. The increase in alanine-specific aminopeptidase in germinating A. githago seeds is not due to de novo synthesis but to the release of preformed enzyme. On the basis of this result it is possible to apply the enzyme aminopeptidase as an internal density standard in equilibrium centrifugation. Density labelling experiments on proteins in pea cotyledons have been used to study the change in the activity of acid phosphatase, alanine-specific aminopeptidase, and peroxidase during germination. The activities of these enzymes increase in cotyledons of Pisum sativum. Density labelling by 18 O and 2 H demonstrates de novo synthesis of these three enzymes. The differential time course of enzyme induction shows the advantage of using H 2 18 O as labelling substance in cases when the enzyme was synthesized immediately at the beginning of germination. At this stage of development the amino-acid pool available for synthesis is formed principally by means of hydrolysis of storage proteins. The incorporation of 2 H into the new proteins takes place in a measurable amount at a stage of growth in which the amino acids are also synthesized de novo. The enzyme acid phosphatase of pea cotyledons was chosen to demonstrate the possibility of using the density labelling technique to detect protein turnover. (author)
Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data

Directory of Open Access Journals (Sweden)

Duan Jialei

2012-08-01

Full Text Available Abstract Background Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq. The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. Results In this study, we used 212.6 million pair-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. Conclusions It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.
Hydrogen-bond coordination in organic crystal structures: statistics, predictions and applications.

Science.gov (United States)

Galek, Peter T A; Chisholm, James A; Pidcock, Elna; Wood, Peter A

2014-02-01

Statistical models to predict the number of hydrogen bonds that might be formed by any donor or acceptor atom in a crystal structure have been derived using organic structures in the Cambridge Structural Database. This hydrogen-bond coordination behaviour has been uniquely defined for more than 70 unique atom types, and has led to the development of a methodology to construct hypothetical hydrogen-bond arrangements. Comparing the constructed hydrogen-bond arrangements with known crystal structures shows promise in the assessment of structural stability, and some initial examples of industrially relevant polymorphs, co-crystals and hydrates are described.
Predicting deleterious nsSNPs: an analysis of sequence and structural attributes

Directory of Open Access Journals (Sweden)

Saqi Mansoor AS

2006-04-01

Full Text Available Abstract Background There has been an explosion in the number of single nucleotide polymorphisms (SNPs within public databases. In this study we focused on non-synonymous protein coding single nucleotide polymorphisms (nsSNPs, some associated with disease and others which are thought to be neutral. We describe the distribution of both types of nsSNPs using structural and sequence based features and assess the relative value of these attributes as predictors of function using machine learning methods. We also address the common problem of balance within machine learning methods and show the effect of imbalance on nsSNP function prediction. We show that nsSNP function prediction can be significantly improved by 100% undersampling of the majority class. The learnt rules were then applied to make predictions of function on all nsSNPs within Ensembl. Results The measure of prediction success is greatly affected by the level of imbalance in the training dataset. We found the balanced dataset that included all attributes produced the best prediction. The performance as measured by the Matthews correlation coefficient (MCC varied between 0.49 and 0.25 depending on the imbalance. As previously observed, the degree of sequence conservation at the nsSNP position is the single most useful attribute. In addition to conservation, structural predictions made using a balanced dataset can be of value. Conclusion The predictions for all nsSNPs within Ensembl, based on a balanced dataset using all attributes, are available as a DAS annotation. Instructions for adding the track to Ensembl are at http://www.brightstudy.ac.uk/das_help.html
Offspring social network structure predicts fitness in families.

Science.gov (United States)

Royle, Nick J; Pike, Thomas W; Heeb, Philipp; Richner, Heinz; Kölliker, Mathias

2012-12-22

Social structures such as families emerge as outcomes of behavioural interactions among individuals, and can evolve over time if families with particular types of social structures tend to leave more individuals in subsequent generations. The social behaviour of interacting individuals is typically analysed as a series of multiple dyadic (pair-wise) interactions, rather than a network of interactions among multiple individuals. However, in species where parents feed dependant young, interactions within families nearly always involve more than two individuals simultaneously. Such social networks of interactions at least partly reflect conflicts of interest over the provision of costly parental investment. Consequently, variation in family network structure reflects variation in how conflicts of interest are resolved among family members. Despite its importance in understanding the evolution of emergent properties of social organization such as family life and cooperation, nothing is currently known about how selection acts on the structure of social networks. Here, we show that the social network structure of broods of begging nestling great tits Parus major predicts fitness in families. Although selection at the level of the individual favours large nestlings, selection at the level of the kin-group primarily favours families that resolve conflicts most effectively.
NxRepair: error correction in de novo sequence assembly using Nextera mate pairs

Directory of Open Access Journals (Sweden)

Rebecca R. Murphy

2015-06-01

Full Text Available Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identify and correct large-scale errors. We show that NxRepair can identify and correct large scaffolding errors, without use of a reference sequence, resulting in quantitative improvements in the assembly quality. NxRepair can be downloaded from GitHub or PyPI, the Python Package Index; a tutorial and user documentation are also available.
RNA 3D modules in genome-wide predictions of RNA 2D structure

DEFF Research Database (Denmark)

Theis, Corinna; Zirbel, Craig L; Zu Siederdissen, Christian Höner

2015-01-01

. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D......Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational...... approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution...
Novos encontros de anofelíneos em recipientes artificiais

Directory of Open Access Journals (Sweden)

Oswaldo Paulo Forattini

1998-12-01

Full Text Available Assinalam-se novos encontros de anofelíneos em recipientes artificiais. Um deles diz respeito a formas imaturas de Anopheles bellator em criadouros experimentais e outro é concernente ao achado de An. albitarsis l.s., em recipiente abandonado. Tecem-se considerações sobre a pressão seletiva representada pela produção, cada vez maior, de objetos descartáveis.
Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.

Directory of Open Access Journals (Sweden)

Mile Sikić

2009-01-01

Full Text Available Identifying interaction sites in proteins provides important clues to the function of a protein and is becoming increasingly relevant in topics such as systems biology and drug discovery. Although there are numerous papers on the prediction of interaction sites using information derived from structure, there are only a few case reports on the prediction of interaction residues based solely on protein sequence. Here, a sliding window approach is combined with the Random Forests method to predict protein interaction sites using (i a combination of sequence- and structure-derived parameters and (ii sequence information alone. For sequence-based prediction we achieved a precision of 84% with a 26% recall and an F-measure of 40%. When combined with structural information, the prediction performance increases to a precision of 76% and a recall of 38% with an F-measure of 51%. We also present an attempt to rationalize the sliding window size and demonstrate that a nine-residue window is the most suitable for predictor construction. Finally, we demonstrate the applicability of our prediction methods by modeling the Ras-Raf complex using predicted interaction sites as target binding interfaces. Our results suggest that it is possible to predict protein interaction sites with quite a high accuracy using only sequence information.

Prediction of material damage in orthotropic metals for virtual structural testing

OpenAIRE

Ravindran, S.

2010-01-01

Models based on the Continuum Damage Mechanics principle are increasingly used for predicting the initiation and growth of damage in materials. The growing reliance on 3-D finite element (FE) virtual structural testing demands implementation and validation of robust material models that can predict the material behaviour accurately. The use of these models within numerical analyses requires suitable material data. EU aerospace companies along with Cranfield University and other similar resear...
Human native lipoprotein-induced de novo DNA methylation is associated with repression of inflammatory genes in THP-1 macrophages

Directory of Open Access Journals (Sweden)

Rangel-Salazar Rubén

2011-11-01

Full Text Available Abstract Background We previously showed that a VLDL- and LDL-rich mix of human native lipoproteins induces a set of repressive epigenetic marks, i.e. de novo DNA methylation, histone 4 hypoacetylation and histone 4 lysine 20 (H4K20 hypermethylation in THP-1 macrophages. Here, we: 1 ask what gene expression changes accompany these epigenetic responses; 2 test the involvement of candidate factors mediating the latter. We exploited genome expression arrays to identify target genes for lipoprotein-induced silencing, in addition to RNAi and expression studies to test the involvement of candidate mediating factors. The study was conducted in human THP-1 macrophages. Results Native lipoprotein-induced de novo DNA methylation was associated with a general repression of various critical genes for macrophage function, including pro-inflammatory genes. Lipoproteins showed differential effects on epigenetic marks, as de novo DNA methylation was induced by VLDL and to a lesser extent by LDL, but not by HDL, and VLDL induced H4K20 hypermethylation, while HDL caused H4 deacetylation. The analysis of candidate factors mediating VLDL-induced DNA hypermethylation revealed that this response was: 1 surprisingly, mediated exclusively by the canonical maintenance DNA methyltransferase DNMT1, and 2 independent of the Dicer/micro-RNA pathway. Conclusions Our work provides novel insights into epigenetic gene regulation by native lipoproteins. Furthermore, we provide an example of DNMT1 acting as a de novo DNA methyltransferase independently of canonical de novo enzymes, and show proof of principle that de novo DNA methylation can occur independently of a functional Dicer/micro-RNA pathway in mammals.
De novo post-pollen mitosis II tobacco pollen tube transcriptome

Czech Academy of Sciences Publication Activity Database

Hafidh, Said; Breznenová, Katarína; Honys, David

2012-01-01

Roč. 7, č. 8 (2012), s. 918-921 ISSN 1559-2316 R&D Projects: GA ČR GPP501/11/P321; GA ČR GA522/09/0858 Institutional research plan: CEZ:AV0Z50380511 Keywords : de novo pollen tube transcriptome * male gametophyte development * pollen tube growth Subject RIV: ED - Physiology
Predicting protein structures with a multiplayer online game

OpenAIRE

Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran

2010-01-01

People exert significant amounts of problem solving effort playing computer games. Simple image- and text-recognition tasks have been successfully crowd-sourced through gamesi, ii, iii, but it is not clear if more complex scientific problems can be similarly solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search sp...
Why Is There a Glass Ceiling for Threading Based Protein Structure Prediction Methods?

Science.gov (United States)

Skolnick, Jeffrey; Zhou, Hongyi

2017-04-20

Despite their different implementations, comparison of the best threading approaches to the prediction of evolutionary distant protein structures reveals that they tend to succeed or fail on the same protein targets. This is true despite the fact that the structural template library has good templates for all cases. Thus, a key question is why are certain protein structures threadable while others are not. Comparison with threading results on a set of artificial sequences selected for stability further argues that the failure of threading is due to the nature of the protein structures themselves. Using a new contact map based alignment algorithm, we demonstrate that certain folds are highly degenerate in that they can have very similar coarse grained fractions of native contacts aligned and yet differ significantly from the native structure. For threadable proteins, this is not the case. Thus, contemporary threading approaches appear to have reached a plateau, and new approaches to structure prediction are required.
AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model.

Science.gov (United States)

Ma, Jianzhu; Wang, Sheng

2015-01-01

The solvent accessibility of protein residues is one of the driving forces of protein folding, while the contact number of protein residues limits the possibilities of protein conformations. The de novo prediction of these properties from protein sequence is important for the study of protein structure and function. Although these two properties are certainly related with each other, it is challenging to exploit this dependency for the prediction. We present a method AcconPred for predicting solvent accessibility and contact number simultaneously, which is based on a shared weight multitask learning framework under the CNF (conditional neural fields) model. The multitask learning framework on a collection of related tasks provides more accurate prediction than the framework trained only on a single task. The CNF method not only models the complex relationship between the input features and the predicted labels, but also exploits the interdependency among adjacent labels. Trained on 5729 monomeric soluble globular protein datasets, AcconPred could reach 0.68 three-state accuracy for solvent accessibility and 0.75 correlation for contact number. Tested on the 105 CASP11 domain datasets for solvent accessibility, AcconPred could reach 0.64 accuracy, which outperforms existing methods.
Advancing viral RNA structure prediction: measuring the thermodynamics of pyrimidine-rich internal loops.

Science.gov (United States)

Phan, Andy; Mailey, Katherine; Saeki, Jessica; Gu, Xiaobo; Schroeder, Susan J

2017-05-01

Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites. © 2017 Phan et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Computational tools for experimental determination and theoretical prediction of protein structure

Energy Technology Data Exchange (ETDEWEB)

O`Donoghue, S.; Rost, B.

1995-12-31

This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. The authors intend to review the state of the art in the experimental determination of protein 3D structure (focus on nuclear magnetic resonance), and in the theoretical prediction of protein function and of protein structure in 1D, 2D and 3D from sequence. All the atomic resolution structures determined so far have been derived from either X-ray crystallography (the majority so far) or Nuclear Magnetic Resonance (NMR) Spectroscopy (becoming increasingly more important). The authors briefly describe the physical methods behind both of these techniques; the major computational methods involved will be covered in some detail. They highlight parallels and differences between the methods, and also the current limitations. Special emphasis will be given to techniques which have application to ab initio structure prediction. Large scale sequencing techniques increase the gap between the number of known proteins sequences and that of known protein structures. They describe the scope and principles of methods that contribute successfully to closing that gap. Emphasis will be given on the specification of adequate testing procedures to validate such methods.
A novel de novo mutation in ATP1A3 and childhood-onset schizophrenia

Science.gov (United States)

Smedemark-Margulies, Niklas; Brownstein, Catherine A.; Vargas, Sigella; Tembulkar, Sahil K.; Towne, Meghan C.; Shi, Jiahai; Gonzalez-Cuevas, Elisa; Liu, Kevin X.; Bilguvar, Kaya; Kleiman, Robin J.; Han, Min-Joon; Torres, Alcy; Berry, Gerard T.; Yu, Timothy W.; Beggs, Alan H.; Agrawal, Pankaj B.; Gonzalez-Heydrich, Joseph

2016-01-01

We describe a child with onset of command auditory hallucinations and behavioral regression at 6 yr of age in the context of longer standing selective mutism, aggression, and mild motor delays. His genetic evaluation included chromosomal microarray analysis and whole-exome sequencing. Sequencing revealed a previously unreported heterozygous de novo mutation c.385G>A in ATP1A3, predicted to result in a p.V129M amino acid change. This gene codes for a neuron-specific isoform of the catalytic α-subunit of the ATP-dependent transmembrane sodium–potassium pump. Heterozygous mutations in this gene have been reported as causing both sporadic and inherited forms of alternating hemiplegia of childhood and rapid-onset dystonia parkinsonism. We discuss the literature on phenotypes associated with known variants in ATP1A3, examine past functional studies of the role of ATP1A3 in neuronal function, and describe a novel clinical presentation associated with mutation of this gene. PMID:27626066
Axonal regeneration and development of de novo axons from distal dendrites of adult feline commissural interneurons after a proximal axotomy

DEFF Research Database (Denmark)

Fenrich, Keith K; Skelton, Nicole; MacDermid, Victoria E

2007-01-01

Following proximal axotomy, several types of neurons sprout de novo axons from distal dendrites. These processes may represent a means of forming new circuits following spinal cord injury. However, it is not know whether mammalian spinal interneurons, axotomized as a result of a spinal cord injury......, develop de novo axons. Our goal was to determine whether spinal commissural interneurons (CINs), axotomized by 3-4-mm midsagittal transection at C3, form de novo axons from distal dendrites. All experiments were performed on adult cats. CINs in C3 were stained with extracellular injections of Neurobiotin...... at 4-5 weeks post injury. The somata of axotomized CINs were identified by the presence of immunoreactivity for the axonal growth-associated protein-43 (GAP-43). Nearly half of the CINs had de novo axons that emerged from distal dendrites. These axons lacked immunoreactivity for the dendritic protein...
Integrating chemical footprinting data into RNA secondary structure prediction.

Directory of Open Access Journals (Sweden)

Kourosh Zarringhalam

Full Text Available Chemical and enzymatic footprinting experiments, such as shape (selective 2'-hydroxyl acylation analyzed by primer extension, yield important information about RNA secondary structure. Indeed, since the [Formula: see text]-hydroxyl is reactive at flexible (loop regions, but unreactive at base-paired regions, shape yields quantitative data about which RNA nucleotides are base-paired. Recently, low error rates in secondary structure prediction have been reported for three RNAs of moderate size, by including base stacking pseudo-energy terms derived from shape data into the computation of minimum free energy secondary structure. Here, we describe a novel method, RNAsc (RNA soft constraints, which includes pseudo-energy terms for each nucleotide position, rather than only for base stacking positions. We prove that RNAsc is self-consistent, in the sense that the nucleotide-specific probabilities of being unpaired in the low energy Boltzmann ensemble always become more closely correlated with the input shape data after application of RNAsc. From this mathematical perspective, the secondary structure predicted by RNAsc should be 'correct', in as much as the shape data is 'correct'. We benchmark RNAsc against the previously mentioned method for eight RNAs, for which both shape data and native structures are known, to find the same accuracy in 7 out of 8 cases, and an improvement of 25% in one case. Furthermore, we present what appears to be the first direct comparison of shape data and in-line probing data, by comparing yeast asp-tRNA shape data from the literature with data from in-line probing experiments we have recently performed. With respect to several criteria, we find that shape data appear to be more robust than in-line probing data, at least in the case of asp-tRNA.
De novo and inherited private variants in MAP1B in periventricular nodular heterotopia.

Science.gov (United States)

Heinzen, Erin L; O'Neill, Adam C; Zhu, Xiaolin; Allen, Andrew S; Bahlo, Melanie; Chelly, Jamel; Dobyns, William B; Freytag, Saskia; Guerrini, Renzo; Leventer, Richard J; Poduri, Annapurna; Robertson, Stephen P; Walsh, Christopher A; Zhang, Mengqi

2018-05-08

Periventricular nodular heterotopia (PVNH) is a malformation of cortical development commonly associated with epilepsy. We exome sequenced 202 individuals with sporadic PVNH to identify novel genetic risk loci. We first performed a trio-based analysis and identified 219 de novo variants. Although no novel genes were implicated in this initial analysis, PVNH cases were found overall to have a significant excess of nonsynonymous de novo variants in intolerant genes (p = 3.27x10-7), suggesting a role for rare new alleles in genes yet to be associated with the condition. Using a gene-level collapsing analysis comparing cases and controls, we identified a genome-wide significant signal driven by four ultra-rare loss-of-function heterozygous variants in MAP1B, including one de novo variant. In at least one instance, the MAP1B variant was inherited from a parent with previously undiagnosed PVNH. The PVNH was frontally predominant and associated with perisylvian polymicrogyria. These results implicate MAP1B in PVNH. More broadly, our findings suggest that detrimental mutations likely arising in immediately preceding generations with incomplete penetrance may also be responsible for some apparently sporadic diseases.
Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data.

Science.gov (United States)

Al-Nakeeb, Kosai; Petersen, Thomas Nordahl; Sicheritz-Pontén, Thomas

2017-11-21

Whole-genome sequencing (WGS) projects provide short read nucleotide sequences from nuclear and possibly organelle DNA depending on the source of origin. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference or partial seed sequences for assembling. Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences in the range from 98.5 to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers. Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at: https://bitbucket.org/kosaidtu/norgal .
Critical importance of the de novo pyrimidine biosynthesis pathway for Trypanosoma cruzi growth in the mammalian host cell cytoplasm

International Nuclear Information System (INIS)

Hashimoto, Muneaki; Morales, Jorge; Fukai, Yoshihisa; Suzuki, Shigeo; Takamiya, Shinzaburo; Tsubouchi, Akiko; Inoue, Syou; Inoue, Masayuki; Kita, Kiyoshi; Harada, Shigeharu; Tanaka, Akiko; Aoki, Takashi; Nara, Takeshi

2012-01-01

Highlights: ► We established Trypanosoma cruzi lacking the gene for carbamoyl phosphate synthetase II. ► Disruption of the cpsII gene significantly reduced the growth of epimastigotes. ► In particular, the CPSII-null mutant severely retarded intracellular growth. ► The de novo pyrimidine pathway is critical for the parasite growth in the host cell. -- Abstract: The intracellular parasitic protist Trypanosoma cruzi is the causative agent of Chagas disease in Latin America. In general, pyrimidine nucleotides are supplied by both de novo biosynthesis and salvage pathways. While epimastigotes—an insect form—possess both activities, amastigotes—an intracellular replicating form of T. cruzi—are unable to mediate the uptake of pyrimidine. However, the requirement of de novo pyrimidine biosynthesis for parasite growth and survival has not yet been elucidated. Carbamoyl-phosphate synthetase II (CPSII) is the first and rate-limiting enzyme of the de novo biosynthetic pathway, and increased CPSII activity is associated with the rapid proliferation of tumor cells. In the present study, we showed that disruption of the T. cruzicpsII gene significantly reduced parasite growth. In particular, the growth of amastigotes lacking the cpsII gene was severely suppressed. Thus, the de novo pyrimidine pathway is important for proliferation of T. cruzi in the host cell cytoplasm and represents a promising target for chemotherapy against Chagas disease.
Novos liberalismos e a Grande Recessão: princípios para uma política externa crítica

Directory of Open Access Journals (Sweden)

Igor Abdalla

2014-06-01

Full Text Available O artigo analisa a emergência, nas últimas décadas, de novo liberalismo internacionalista de cunho tecnocrático, que se divorcia do liberalismo clássico criado pelo filósofo crítico Immanuel Kant. O novo liberalismo, que coincide com o processo de globalização das finanças, inverte o elemento emancipatório do liberalismo kantiano para apresentar-se como instância de ratificação do poder. Como resultado, os novos liberais são incapazes de analisar criticamente eventos como a Grande Recessão. Em contraposição ao novo liberalismo tecnocrático propõem-se princípios para uma política externa crítica para o Brasil. Em termos empíricos, escrutina-se a evolução do processo de globalização das finanças do ponto de vista do poder, com enfoque sobre as crises financeiras no mundo em desenvolvimento e a Grande Recessão de 2008. Propugnam-se os seguintes argumentos: (i o novo liberalismo contradiz o liberalismo clássico; (ii o novo liberalismo legitima interesses de atores hegemônicos voltados para a liberalização e a desregulamentação financeiras sem limites, que se encontram na raiz da Grande Recessão; (iii a política externa brasileira deve resgatar elementos do liberalismo clássico no contexto de crise gerado pela Grande Recessão.
GalaxyHomomer: a web server for protein homo-oligomer structure prediction from a monomer sequence or structure.

Science.gov (United States)

Baek, Minkyung; Park, Taeyong; Heo, Lim; Park, Chiwook; Seok, Chaok

2017-07-03

Homo-oligomerization of proteins is abundant in nature, and is often intimately related with the physiological functions of proteins, such as in metabolism, signal transduction or immunity. Information on the homo-oligomer structure is therefore important to obtain a molecular-level understanding of protein functions and their regulation. Currently available web servers predict protein homo-oligomer structures either by template-based modeling using homo-oligomer templates selected from the protein structure database or by ab initio docking of monomer structures resolved by experiment or predicted by computation. The GalaxyHomomer server, freely accessible at http://galaxy.seoklab.org/homomer, carries out template-based modeling, ab initio docking or both depending on the availability of proper oligomer templates. It also incorporates recently developed model refinement methods that can consistently improve model quality. Moreover, the server provides additional options that can be chosen by the user depending on the availability of information on the monomer structure, oligomeric state and locations of unreliable/flexible loops or termini. The performance of the server was better than or comparable to that of other available methods when tested on benchmark sets and in a recent CASP performed in a blind fashion. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evaluation of multiple protein docking structures using correctly predicted pairwise subunits

Directory of Open Access Journals (Sweden)

Esquivel-Rodríguez Juan

2012-03-01

Full Text Available Abstract Background Many functionally important proteins in a cell form complexes with multiple chains. Therefore, computational prediction of multiple protein complexes is an important task in bioinformatics. In the development of multiple protein docking methods, it is important to establish a metric for evaluating prediction results in a reasonable and practical fashion. However, since there are only few works done in developing methods for multiple protein docking, there is no study that investigates how accurate structural models of multiple protein complexes should be to allow scientists to gain biological insights. Methods We generated a series of predicted models (decoys of various accuracies by our multiple protein docking pipeline, Multi-LZerD, for three multi-chain complexes with 3, 4, and 6 chains. We analyzed the decoys in terms of the number of correctly predicted pair conformations in the decoys. Results and conclusion We found that pairs of chains with the correct mutual orientation exist even in the decoys with a large overall root mean square deviation (RMSD to the native. Therefore, in addition to a global structure similarity measure, such as the global RMSD, the quality of models for multiple chain complexes can be better evaluated by using the local measurement, the number of chain pairs with correct mutual orientation. We termed the fraction of correctly predicted pairs (RMSD at the interface of less than 4.0Å as fpair and propose to use it for evaluation of the accuracy of multiple protein docking.
Long-read sequencing and de novo assembly of a Chinese genome

Science.gov (United States)

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...
Cinema utópico: a construção de um novo homem e um novo mundo

OpenAIRE

Erika Savernini Lopes

2011-01-01

O cinema, desde seus primórdios, prefigurou o espaço cibernético como um novo espaço imaterial construído coletivamente. A concepção desse outro lugar não físico para o qual o homem poderia migrar estabelece para o cinema e para o ciberespaço uma relação direta com as utopias. Na acepção do romance filosófico de Thomas More, a Utopia define-se como um outro espaço não-existente, irrealizável e ideal que diagnostica o atual. O cinema carregaria caracteres fundamentais da Utopia tanto no que se...
Electrostatics, structure prediction, and the energy landscapes for protein folding and binding.

Science.gov (United States)

Tsai, Min-Yeh; Zheng, Weihua; Balamurugan, D; Schafer, Nicholas P; Kim, Bobby L; Cheung, Margaret S; Wolynes, Peter G

2016-01-01

While being long in range and therefore weakly specific, electrostatic interactions are able to modulate the stability and folding landscapes of some proteins. The relevance of electrostatic forces for steering the docking of proteins to each other is widely acknowledged, however, the role of electrostatics in establishing specifically funneled landscapes and their relevance for protein structure prediction are still not clear. By introducing Debye-Hückel potentials that mimic long-range electrostatic forces into the Associative memory, Water mediated, Structure, and Energy Model (AWSEM), a transferable protein model capable of predicting tertiary structures, we assess the effects of electrostatics on the landscapes of thirteen monomeric proteins and four dimers. For the monomers, we find that adding electrostatic interactions does not improve structure prediction. Simulations of ribosomal protein S6 show, however, that folding stability depends monotonically on electrostatic strength. The trend in predicted melting temperatures of the S6 variants agrees with experimental observations. Electrostatic effects can play a range of roles in binding. The binding of the protein complex KIX-pKID is largely assisted by electrostatic interactions, which provide direct charge-charge stabilization of the native state and contribute to the funneling of the binding landscape. In contrast, for several other proteins, including the DNA-binding protein FIS, electrostatics causes frustration in the DNA-binding region, which favors its binding with DNA but not with its protein partner. This study highlights the importance of long-range electrostatics in functional responses to problems where proteins interact with their charged partners, such as DNA, RNA, as well as membranes. © 2015 The Protein Society.

De novo mutations in HCN1 cause early infantile epileptic encephalopathy.

Science.gov (United States)

Nava, Caroline; Dalle, Carine; Rastetter, Agnès; Striano, Pasquale; de Kovel, Carolien G F; Nabbout, Rima; Cancès, Claude; Ville, Dorothée; Brilstra, Eva H; Gobbi, Giuseppe; Raffo, Emmanuel; Bouteiller, Delphine; Marie, Yannick; Trouillard, Oriane; Robbiano, Angela; Keren, Boris; Agher, Dahbia; Roze, Emmanuel; Lesage, Suzanne; Nicolas, Aude; Brice, Alexis; Baulac, Michel; Vogt, Cornelia; El Hajj, Nady; Schneider, Eberhard; Suls, Arvid; Weckhuysen, Sarah; Gormley, Padhraig; Lehesjoki, Anna-Elina; De Jonghe, Peter; Helbig, Ingo; Baulac, Stéphanie; Zara, Federico; Koeleman, Bobby P C; Haaf, Thomas; LeGuern, Eric; Depienne, Christel

2014-06-01

Hyperpolarization-activated, cyclic nucleotide-gated (HCN) channels contribute to cationic Ih current in neurons and regulate the excitability of neuronal networks. Studies in rat models have shown that the Hcn1 gene has a key role in epilepsy, but clinical evidence implicating HCN1 mutations in human epilepsy is lacking. We carried out exome sequencing for parent-offspring trios with fever-sensitive, intractable epileptic encephalopathy, leading to the discovery of two de novo missense HCN1 mutations. Screening of follow-up cohorts comprising 157 cases in total identified 4 additional amino acid substitutions. Patch-clamp recordings of Ih currents in cells expressing wild-type or mutant human HCN1 channels showed that the mutations had striking but divergent effects on homomeric channels. Individuals with mutations had clinical features resembling those of Dravet syndrome with progression toward atypical absences, intellectual disability and autistic traits. These findings provide clear evidence that de novo HCN1 point mutations cause a recognizable early-onset epileptic encephalopathy in humans.
De novo FGF12 mutation in 2 patients with neonatal-onset epilepsy

Science.gov (United States)

Guella, Ilaria; Huh, Linda; McKenzie, Marna B.; Toyota, Eric B.; Bebin, E. Martina; Thompson, Michelle L.; Cooper, Gregory M.; Evans, Daniel M.; Buerki, Sarah E.; Adam, Shelin; Van Allen, Margot I.; Nelson, Tanya N.; Connolly, Mary B.; Farrer, Matthew J.

2016-01-01

Objective: We describe 2 additional patients with early-onset epilepsy with a de novo FGF12 mutation. Methods: Whole-exome sequencing was performed in 2 unrelated patients with early-onset epilepsy and their unaffected parents. Genetic variants were assessed by comparative trio analysis. Clinical evolution, EEG, and neuroimaging are described. The phenotype and response to treatment was reviewed and compared to affected siblings in the original report. Results: We identified the same FGF12 de novo mutation reported previously (c.G155A, p.R52H) in 2 additional patients with early-onset epilepsy. Similar to the original brothers described, both presented with tonic seizures in the first month of life. In the first patient, seizures responded to sodium channel blockers and her development was normal at 11 months. Patient 2 is a 15-year-old girl with treatment-resistant focal epilepsy, moderate intellectual disability, and autism. Carbamazepine (sodium channel blocker) was tried later in her course but not continued due to an allergic reaction. Conclusions: The identification of a recurrent de novo mutation in 2 additional unrelated probands with early-onset epilepsy supports the role of FGF12 p.R52H in disease pathogenesis. Affected carriers presented with similar early clinical phenotypes; however, this report expands the phenotype associated with this mutation which contrasts with the progressive course and early mortality of the siblings in the original report. PMID:27872899
Identification of de novo mutations of Duchénnè/Becker muscular dystrophies in southern Spain.

Science.gov (United States)

Garcia, Susana; de Haro, Tomás; Zafra-Ceres, Mercedes; Poyatos, Antonio; Gomez-Capilla, Jose A; Gomez-Llorente, Carolina

2014-01-01

Duchénnè/Becker muscular dystrophies (DMD/BMD) are X-linked diseases, which are caused by a de novo gene mutation in one-third of affected males. The study objectives were to determine the incidence of DMD/BMD in Andalusia (Spain) and to establish the percentage of affected males in whom a de novo gene mutation was responsible. Multiplex ligation-dependent probe amplification (MLPA) technology was applied to determine the incidence of DMD/BMD in 84 males with suspicion of the disease and 106 female relatives. Dystrophin gene exon deletion (89.5%) or duplication (10.5%) was detected in 38 of the 84 males by MLPA technology; de novo mutations account for 4 (16.7%) of the 24 mother-son pairs studied. MLPA technology is adequate for the molecular diagnosis of DMD/BMD and establishes whether the mother carries the molecular alteration responsible for the disease, a highly relevant issue for genetic counseling.
Structural properties of MHC class II ligands, implications for the prediction of MHC class II epitopes.

Directory of Open Access Journals (Sweden)

Kasper Winther Jørgensen

2010-12-01

Full Text Available Major Histocompatibility class II (MHC-II molecules sample peptides from the extracellular space allowing the immune system to detect the presence of foreign microbes from this compartment. Prediction of MHC class II ligands is complicated by the open binding cleft of the MHC class II molecule, allowing binding of peptides extending out of the binding groove. Furthermore, only a few HLA-DR alleles have been characterized with a sufficient number of peptides (100-200 peptides per allele to derive accurate description of their binding motif. Little work has been performed characterizing structural properties of MHC class II ligands. Here, we perform one such large-scale analysis. A large set of SYFPEITHI MHC class II ligands covering more than 20 different HLA-DR molecules was analyzed in terms of their secondary structure and surface exposure characteristics in the context of the native structure of the corresponding source protein. We demonstrated that MHC class II ligands are significantly more exposed and have significantly more coil content than other peptides in the same protein with similar predicted binding affinity. We next exploited this observation to derive an improved prediction method for MHC class II ligands by integrating prediction of MHC- peptide binding with prediction of surface exposure and protein secondary structure. This combined prediction method was shown to significantly outperform the state-of-the-art MHC class II peptide binding prediction method when used to identify MHC class II ligands. We also tried to integrate N- and O-glycosylation in our prediction methods but this additional information was found not to improve prediction performance. In summary, these findings strongly suggest that local structural properties influence antigen processing and/or the accessibility of peptides to the MHC class II molecule.
Management, nutrition, and lactation performance are related to bulk tank milk de novo fatty acid concentration on northeastern US dairy farms.

Science.gov (United States)

Woolpert, M E; Dann, H M; Cotanch, K W; Melilli, C; Chase, L E; Grant, R J; Barbano, D M

2016-10-01

This study investigated the relationship of management practices, dietary characteristics, milk composition, and lactation performance with de novo fatty acid (FA) concentration in bulk tank milk from commercial dairy farms with Holstein, Jersey, and mixed-breed cows. It was hypothesized that farms with higher de novo milk FA concentrations would more commonly use management and nutrition practices known to optimize ruminal conditions that enhance de novo synthesis of milk FA. Farms (n=44) located in Vermont and northeastern New York were selected based on a history of high de novo (HDN; 26.18±0.94g/100g of FA; mean ± standard deviation) or low de novo (LDN; 24.19±1.22g/100g of FA) FA in bulk tank milk. Management practices were assessed during one visit to each farm in March or April, 2014. Total mixed ration samples were collected and analyzed for chemical composition using near infrared spectroscopy. We found no differences in days in milk at the farm level. Yield of milk fat, true protein, and de novo FA per cow per day were higher for HDN versus LDN farms. The HDN farms had lower freestall stocking density (cows/stall) than LDN farms. Additionally, tiestall feeding frequency was higher for HDN than LDN farms. No differences between HDN and LDN farms were detected for dietary dry matter, crude protein, neutral detergent fiber, starch, or percentage of forage in the diet. However, dietary ether extract was lower for HDN than LDN farms. This research indicates that overcrowded freestalls, reduced feeding frequency, and greater dietary ether extract content are associated with lower de novo FA synthesis and reduced milk fat and true protein yields on commercial dairy farms. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil. by NGS and de novo transcriptome assembly.

Directory of Open Access Journals (Sweden)

Humberto J Debat

Full Text Available Yerba mate (Ilex paraguariensis A. St.-Hil. is an important subtropical tree crop cultivated on 326,000 ha in Argentina, Brazil and Paraguay, with a total yield production of more than 1,000,000 t. Yerba mate presents a strong limitation regarding sequence information. The NCBI GenBank lacks an EST database of yerba mate and depicts only 80 DNA sequences, mostly uncharacterized. In this scenario, in order to elucidate the yerba mate gene landscape by means of NGS, we explored and discovered a vast collection of I. paraguariensis transcripts. Total RNA from I. paraguariensis was sequenced by Illumina HiSeq-2000 obtaining 72,031,388 pair-end 100 bp sequences. High quality reads were de novo assembled into 44,907 transcripts encompassing 40 million bases with an estimated coverage of 180X. Multiple sequence analysis allowed us to predict that yerba mate contains ∼ 32,355 genes and 12,551 gene variants or isoforms. We identified and categorized members of more than 100 metabolic pathways. Overall, we have identified ∼ 1,000 putative transcription factors, genes involved in heat and oxidative stress, pathogen response, as well as disease resistance and hormone response. We have also identified, based in sequence homology searches, novel transcripts related to osmotic, drought, salinity and cold stress, senescence and early flowering. We have also pinpointed several members of the gene silencing pathway, and characterized the silencing effector Argonaute1. We predicted a diverse supply of putative microRNA precursors involved in developmental processes. We present here the first draft of the transcribed genomes of the yerba mate chloroplast and mitochondrion. The putative sequence and predicted structure of the caffeine synthase of yerba mate is presented. Moreover, we provide a collection of over 10,800 SSR accessible to the scientific community interested in yerba mate genetic improvement. This contribution broadly expands the limited knowledge
Exploring the Genes of Yerba Mate (Ilex paraguariensis A. St.-Hil.) by NGS and De Novo Transcriptome Assembly

Science.gov (United States)

Aguilera, Patricia M.; Bubillo, Rosana E.; Otegui, Mónica B.; Ducasse, Daniel A.; Zapata, Pedro D.; Marti, Dardo A.

2014-01-01

Yerba mate (Ilex paraguariensis A. St.-Hil.) is an important subtropical tree crop cultivated on 326,000 ha in Argentina, Brazil and Paraguay, with a total yield production of more than 1,000,000 t. Yerba mate presents a strong limitation regarding sequence information. The NCBI GenBank lacks an EST database of yerba mate and depicts only 80 DNA sequences, mostly uncharacterized. In this scenario, in order to elucidate the yerba mate gene landscape by means of NGS, we explored and discovered a vast collection of I. paraguariensis transcripts. Total RNA from I. paraguariensis was sequenced by Illumina HiSeq-2000 obtaining 72,031,388 pair-end 100 bp sequences. High quality reads were de novo assembled into 44,907 transcripts encompassing 40 million bases with an estimated coverage of 180X. Multiple sequence analysis allowed us to predict that yerba mate contains ∼32,355 genes and 12,551 gene variants or isoforms. We identified and categorized members of more than 100 metabolic pathways. Overall, we have identified ∼1,000 putative transcription factors, genes involved in heat and oxidative stress, pathogen response, as well as disease resistance and hormone response. We have also identified, based in sequence homology searches, novel transcripts related to osmotic, drought, salinity and cold stress, senescence and early flowering. We have also pinpointed several members of the gene silencing pathway, and characterized the silencing effector Argonaute1. We predicted a diverse supply of putative microRNA precursors involved in developmental processes. We present here the first draft of the transcribed genomes of the yerba mate chloroplast and mitochondrion. The putative sequence and predicted structure of the caffeine synthase of yerba mate is presented. Moreover, we provide a collection of over 10,800 SSR accessible to the scientific community interested in yerba mate genetic improvement. This contribution broadly expands the limited knowledge of yerba mate genes
Prediction of elastic-plastic response of structural elements subjected to cyclic loading

International Nuclear Information System (INIS)

El Haddad, M.H.; Samaan, S.

1985-01-01

A simplified elastic-plastic analysis is developed to predict stress strain and force deformation response of structural metallic elements subjected to irregular cyclic loadings. In this analysis a simple elastic-plastic method for predicting the skeleton force deformation curve is developed. In this method, elastic and fully plastic solutions are first obtained for unknown quantities, such as deflection or local strains. Elastic and fully plastic contributions are then combined to obtain an elastic-plastic solution. The skeleton curve is doubled to establish the shape of the hysteresis loop. The complete force deformation response can therefore be simulated through reversal by reversal in accordance with hysteresis looping and material memory. Several examples of structural elements with various cross sections made from various materials and subjected to irregular cyclic loadings, are analysed. A close agreement is obtained between experimental results found in the literature and present predictions. (orig.)
Inelastic spectra to predict period elongation of structures under earthquake loading

DEFF Research Database (Denmark)

Katsanos, Evangelos; Sextos, A.G.

2015-01-01

Period lengthening, exhibited by structures when subjected to strong ground motions, constitutes an implicit proxy of structural inelasticity and associated damage. However, the reliable prediction of the inelastic period is tedious and a multi-parametric task, which is related to both epistemic ...... for period lengthening as a function of Ry and Tel. These equations may be used in the framework of the earthquake record selection and scaling....
Mapping monomeric threading to protein-protein structure prediction.

Science.gov (United States)

Guerler, Aysam; Govindarajoo, Brandon; Zhang, Yang

2013-03-25

The key step of template-based protein-protein structure prediction is the recognition of complexes from experimental structure libraries that have similar quaternary fold. Maintaining two monomer and dimer structure libraries is however laborious, and inappropriate library construction can degrade template recognition coverage. We propose a novel strategy SPRING to identify complexes by mapping monomeric threading alignments to protein-protein interactions based on the original oligomer entries in the PDB, which does not rely on library construction and increases the efficiency and quality of complex template recognitions. SPRING is tested on 1838 nonhomologous protein complexes which can recognize correct quaternary template structures with a TM score >0.5 in 1115 cases after excluding homologous proteins. The average TM score of the first model is 60% and 17% higher than that by HHsearch and COTH, respectively, while the number of targets with an interface RMSD benchmark proteins. Although the relative performance of SPRING and ZDOCK depends on the level of homology filters, a combination of the two methods can result in a significantly higher model quality than ZDOCK at all homology thresholds. These data demonstrate a new efficient approach to quaternary structure recognition that is ready to use for genome-scale modeling of protein-protein interactions due to the high speed and accuracy.
Drug-Eluting Balloons in the Treatment of Coronary De Novo Lesions

DEFF Research Database (Denmark)

Richelsen, Rasmus Kapalu Broge; Overvad, Thure Filskov; Jensen, Svend Eggert

2016-01-01

Drug-eluting balloons (DEBs) have emerged as a new application in percutaneous coronary intervention. DEBs have proven successful in the treatment of in-stent restenosis, but their role in de novo lesions is less clear. This paper provides a review of the current studies where DEBs have been used...
Structure-based function prediction of the expanding mollusk tyrosinase family

Science.gov (United States)

Huang, Ronglian; Li, Li; Zhang, Guofan

2017-11-01

Tyrosinase (Ty) is a common enzyme found in many different animal groups. In our previous study, genome sequencing revealed that the Ty family is expanded in the Pacific oyster ( Crassostrea gigas). Here, we examine the larger number of Ty family members in the Pacific oyster by high-level structure prediction to obtain more information about their function and evolution, especially the unknown role in biomineralization. We verified 12 Ty gene sequences from Crassostrea gigas genome and Pinctada fucata martensii transcriptome. By using phylogenetic analysis of these Tys with functionally known Tys from other molluscan species, eight subgroups were identified (CgTy_s1, CgTy_s2, MolTy_s1, MolTy-s2, MolTy-s3, PinTy-s1, PinTy-s2 and PviTy). Structural data and surface pockets of the dinuclear copper center in the eight subgroups of molluscan Ty were obtained using the latest versions of prediction online servers. Structural comparison with other Ty proteins from the protein databank revealed functionally important residues (HA1, HA2, HA3, HB1, HB2, HB3, Z1-Z9) and their location within these protein structures. The structural and chemical features of these pockets which may related to the substrate binding showed considerable variability among mollusks, which undoubtedly defines Ty substrate binding. Finally, we discuss the potential driving forces of Ty family evolution in mollusks. Based on these observations, we conclude that the Ty family has rapidly evolved as a consequence of substrate adaptation in mollusks.
CNNH_PSS: protein 8-class secondary structure prediction by convolutional neural network with highway.

Science.gov (United States)

Zhou, Jiyun; Wang, Hongpeng; Zhao, Zhishan; Xu, Ruifeng; Lu, Qin

2018-05-08

Protein secondary structure is the three dimensional form of local segments of proteins and its prediction is an important problem in protein tertiary structure prediction. Developing computational approaches for protein secondary structure prediction is becoming increasingly urgent. We present a novel deep learning based model, referred to as CNNH_PSS, by using multi-scale CNN with highway. In CNNH_PSS, any two neighbor convolutional layers have a highway to deliver information from current layer to the output of the next one to keep local contexts. As lower layers extract local context while higher layers extract long-range interdependencies, the highways between neighbor layers allow CNNH_PSS to have ability to extract both local contexts and long-range interdependencies. We evaluate CNNH_PSS on two commonly used datasets: CB6133 and CB513. CNNH_PSS outperforms the multi-scale CNN without highway by at least 0.010 Q8 accuracy and also performs better than CNF, DeepCNF and SSpro8, which cannot extract long-range interdependencies, by at least 0.020 Q8 accuracy, demonstrating that both local contexts and long-range interdependencies are indeed useful for prediction. Furthermore, CNNH_PSS also performs better than GSM and DCRNN which need extra complex model to extract long-range interdependencies. It demonstrates that CNNH_PSS not only cost less computer resource, but also achieves better predicting performance. CNNH_PSS have ability to extracts both local contexts and long-range interdependencies by combing multi-scale CNN and highway network. The evaluations on common datasets and comparisons with state-of-the-art methods indicate that CNNH_PSS is an useful and efficient tool for protein secondary structure prediction.
QuaBingo: A Prediction System for Protein Quaternary Structure Attributes Using Block Composition

Directory of Open Access Journals (Sweden)

Chi-Hua Tung

2016-01-01

Full Text Available Background. Quaternary structures of proteins are closely relevant to gene regulation, signal transduction, and many other biological functions of proteins. In the current study, a new method based on protein-conserved motif composition in block format for feature extraction is proposed, which is termed block composition. Results. The protein quaternary assembly states prediction system which combines blocks with functional domain composition, called QuaBingo, is constructed by three layers of classifiers that can categorize quaternary structural attributes of monomer, homooligomer, and heterooligomer. The building of the first layer classifier uses support vector machines (SVM based on blocks and functional domains of proteins, and the second layer SVM was utilized to process the outputs of the first layer. Finally, the result is determined by the Random Forest of the third layer. We compared the effectiveness of the combination of block composition, functional domain composition, and pseudoamino acid composition of the model. In the 11 kinds of functional protein families, QuaBingo is 23% of Matthews Correlation Coefficient (MCC higher than the existing prediction system. The results also revealed the biological characterization of the top five block compositions. Conclusions. QuaBingo provides better predictive ability for predicting the quaternary structural attributes of proteins.
De novo peptide design and experimental validation of histone methyltransferase inhibitors.

Directory of Open Access Journals (Sweden)

James Smadbeck

Full Text Available Histones are small proteins critical to the efficient packaging of DNA in the nucleus. DNA–protein complexes, known as nucleosomes, are formed when the DNA winds itself around the surface of the histones. The methylation of histone residues by enhancer of zeste homolog 2 (EZH2 maintains gene repression over successive cell generations. Overexpression of EZH2 can silence important tumor suppressor genes leading to increased invasiveness of many types of cancers. This makes the inhibition of EZH2 an important target in the development of cancer therapeutics. We employed a three-stage computational de novo peptide design method to design inhibitory peptides of EZH2. The method consists of a sequence selection stage and two validation stages for fold specificity and approximate binding affinity. The sequence selection stage consists of an integer linear optimization model that was solved to produce a rank-ordered list of amino acid sequences with increased stability in the bound peptide-EZH2 structure. These sequences were validated through the calculation of the fold specificity and approximate binding affinity of the designed peptides. Here we report the discovery of novel EZH2 inhibitory peptides using the de novo peptide design method. The computationally discovered peptides were experimentally validated in vitro using dose titrations and mechanism of action enzymatic assays. The peptide with the highest in vitro response, SQ037, was validated in nucleo using quantitative mass spectrometry-based proteomics. This peptide had an IC50 of 13.5 mM, demonstrated greater potency as an inhibitor when compared to the native and K27A mutant control peptides, and demonstrated competitive inhibition versus the peptide substrate. Additionally, this peptide demonstrated high specificity to the EZH2 target in comparison to other histone methyltransferases. The validated peptides are the first computationally designed peptides that directly inhibit EZH2
De novo peptide design and experimental validation of histone methyltransferase inhibitors.

Directory of Open Access Journals (Sweden)

James Smadbeck

Full Text Available Histones are small proteins critical to the efficient packaging of DNA in the nucleus. DNA-protein complexes, known as nucleosomes, are formed when the DNA winds itself around the surface of the histones. The methylation of histone residues by enhancer of zeste homolog 2 (EZH2 maintains gene repression over successive cell generations. Overexpression of EZH2 can silence important tumor suppressor genes leading to increased invasiveness of many types of cancers. This makes the inhibition of EZH2 an important target in the development of cancer therapeutics. We employed a three-stage computational de novo peptide design method to design inhibitory peptides of EZH2. The method consists of a sequence selection stage and two validation stages for fold specificity and approximate binding affinity. The sequence selection stage consists of an integer linear optimization model that was solved to produce a rank-ordered list of amino acid sequences with increased stability in the bound peptide-EZH2 structure. These sequences were validated through the calculation of the fold specificity and approximate binding affinity of the designed peptides. Here we report the discovery of novel EZH2 inhibitory peptides using the de novo peptide design method. The computationally discovered peptides were experimentally validated in vitro using dose titrations and mechanism of action enzymatic assays. The peptide with the highest in vitro response, SQ037, was validated in nucleo using quantitative mass spectrometry-based proteomics. This peptide had an IC50 of 13.5 [Formula: see text]M, demonstrated greater potency as an inhibitor when compared to the native and K27A mutant control peptides, and demonstrated competitive inhibition versus the peptide substrate. Additionally, this peptide demonstrated high specificity to the EZH2 target in comparison to other histone methyltransferases. The validated peptides are the first computationally designed peptides that directly
Synergistic interactions between Drosophila orthologues of genes spanned by de novo human CNVs support multiple-hit models of autism.

Science.gov (United States)

Grice, Stuart J; Liu, Ji-Long; Webber, Caleb

2015-03-01

Autism spectrum disorders (ASDs) are highly heritable and characterised by deficits in social interaction and communication, as well as restricted and repetitive behaviours. Although a number of highly penetrant ASD gene variants have been identified, there is growing evidence to support a causal role for combinatorial effects arising from the contributions of multiple loci. By examining synaptic and circadian neurological phenotypes resulting from the dosage variants of unique human:fly orthologues in Drosophila, we observe numerous synergistic interactions between pairs of informatically-identified candidate genes whose orthologues are jointly affected by large de novo copy number variants (CNVs). These CNVs were found in the genomes of individuals with autism, including a patient carrying a 22q11.2 deletion. We first demonstrate that dosage alterations of the unique Drosophila orthologues of candidate genes from de novo CNVs that harbour only a single candidate gene display neurological defects similar to those previously reported in Drosophila models of ASD-associated variants. We then considered pairwise dosage changes within the set of orthologues of candidate genes that were affected by the same single human de novo CNV. For three of four CNVs with complete orthologous relationships, we observed significant synergistic effects following the simultaneous dosage change of gene pairs drawn from a single CNV. The phenotypic variation observed at the Drosophila synapse that results from these interacting genetic variants supports a concordant phenotypic outcome across all interacting gene pairs following the direction of human gene copy number change. We observe both specificity and transitivity between interactors, both within and between CNV candidate gene sets, supporting shared and distinct genetic aetiologies. We then show that different interactions affect divergent synaptic processes, demonstrating distinct molecular aetiologies. Our study illustrates
Mesoscopic structure prediction of nanoparticle assembly and coassembly: Theoretical foundation

KAUST Repository

Hur, Kahyun

2010-01-01

In this work, we present a theoretical framework that unifies polymer field theory and density functional theory in order to efficiently predict ordered nanostructure formation of systems having considerable complexity in terms of molecular structures and interactions. We validate our approach by comparing its predictions with previous simulation results for model systems. We illustrate the flexibility of our approach by applying it to hybrid systems composed of block copolymers and ligand coated nanoparticles. We expect that our approach will enable the treatment of multicomponent self-assembly with a level of molecular complexity that approaches experimental systems. © 2010 American Institute of Physics.
Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

Directory of Open Access Journals (Sweden)

Yunyun Liang

2015-01-01

Full Text Available Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM. Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS, segmented PsePSSM, and segmented autocovariance transformation (ACT based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640 are adopted in this paper. Then a 700-dimensional (700D feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA. To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.
De novo transcriptome assembly of the calanoid copepod Neocalanus flemingeri: A new resource for emergence from diapause.

Science.gov (United States)

Roncalli, Vittoria; Cieslak, Matthew C; Sommer, Stephanie A; Hopcroft, Russell R; Lenz, Petra H

2018-02-01

Copepods, small planktonic crustaceans, are key links between primary producers and upper trophic levels, including many economically important fishes. In the subarctic North Pacific, the life cycle of copepods like Neocalanus flemingeri includes an ontogenetic migration to depth followed by a period of diapause (a type of dormancy) characterized by arrested development and low metabolic activity. The end of diapause is marked by the production of the first brood of eggs. Recent temperature anomalies in the North Pacific have raised concerns about potential negative effects on N. flemingeri. Since diapause is a developmental program, its progress can be tracked using through global gene expression. Thus, a reference transcriptome was developed as a first step towards physiological profiling of diapausing females using high-throughput Illumina sequencing. The de novo transcriptome, the first for this species was designed to investigate the diapause period. RNA-Seq reads were obtained for dormant to reproductive N. flemingeri females. A high quality de novo transcriptome was obtained by first assembling reads from each individual using Trinity software followed by clustering with CAP3 Assembly Program. This assembly consisted of 140,841transcripts (contigs). Bench-marking universal single-copy orthologs analysis identified 85% of core eukaryotic genes, with 79% predicted to be complete. Comparison with other calanoid transcriptomes confirmed its quality and degree of completeness. Trinity assembly of reads originating from multiple individuals led to fragmentation. Thus, the workflow applied here differed from the one recommended by Trinity, but was required to obtain a good assembly. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

Statistical properties of thermodynamically predicted RNA secondary structures in viral genomes

Science.gov (United States)

Spanò, M.; Lillo, F.; Miccichè, S.; Mantegna, R. N.

2008-10-01

By performing a comprehensive study on 1832 segments of 1212 complete genomes of viruses, we show that in viral genomes the hairpin structures of thermodynamically predicted RNA secondary structures are more abundant than expected under a simple random null hypothesis. The detected hairpin structures of RNA secondary structures are present both in coding and in noncoding regions for the four groups of viruses categorized as dsDNA, dsRNA, ssDNA and ssRNA. For all groups, hairpin structures of RNA secondary structures are detected more frequently than expected for a random null hypothesis in noncoding rather than in coding regions. However, potential RNA secondary structures are also present in coding regions of dsDNA group. In fact, we detect evolutionary conserved RNA secondary structures in conserved coding and noncoding regions of a large set of complete genomes of dsDNA herpesviruses.
MemBrain: An Easy-to-Use Online Webserver for Transmembrane Protein Structure Prediction

Science.gov (United States)

Yin, Xi; Yang, Jing; Xiao, Feng; Yang, Yang; Shen, Hong-Bin

2018-03-01

Membrane proteins are an important kind of proteins embedded in the membranes of cells and play crucial roles in living organisms, such as ion channels, transporters, receptors. Because it is difficult to determinate the membrane protein's structure by wet-lab experiments, accurate and fast amino acid sequence-based computational methods are highly desired. In this paper, we report an online prediction tool called MemBrain, whose input is the amino acid sequence. MemBrain consists of specialized modules for predicting transmembrane helices, residue-residue contacts and relative accessible surface area of α-helical membrane proteins. MemBrain achieves a prediction accuracy of 97.9% of A TMH, 87.1% of A P, 3.2 ± 3.0 of N-score, 3.1 ± 2.8 of C-score. MemBrain-Contact obtains 62%/64.1% prediction accuracy on training and independent dataset on top L/5 contact prediction, respectively. And MemBrain-Rasa achieves Pearson correlation coefficient of 0.733 and its mean absolute error of 13.593. These prediction results provide valuable hints for revealing the structure and function of membrane proteins. MemBrain web server is free for academic use and available at www.csbio.sjtu.edu.cn/bioinf/MemBrain/. [Figure not available: see fulltext.
Structural maturation and brain activity predict future working memory capacity during childhood development.

Science.gov (United States)

Ullman, Henrik; Almeida, Rita; Klingberg, Torkel

2014-01-29

Human working memory capacity develops during childhood and is a strong predictor of future academic performance, in particular, achievements in mathematics and reading. Predicting working memory development is important for the early identification of children at risk for poor cognitive and academic development. Here we show that structural and functional magnetic resonance imaging data explain variance in children's working memory capacity 2 years later, which was unique variance in addition to that predicted using cognitive tests. While current working memory capacity correlated with frontoparietal cortical activity, the future capacity could be inferred from structure and activity in basal ganglia and thalamus. This gives a novel insight into the neural mechanisms of childhood development and supports the idea that neuroimaging can have a unique role in predicting children's cognitive development.
Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

Science.gov (United States)

Zhang, Wenxuan; Yang, Jianyi; He, Baoji; Walker, Sara Elizabeth; Zhang, Hongjiu; Govindarajoo, Brandon; Virtanen, Jouko; Xue, Zhidong; Shen, Hong-Bin; Zhang, Yang

2016-09-01

We tested two pipelines developed for template-free protein structure prediction in the CASP11 experiment. First, the QUARK pipeline constructs structure models by reassembling fragments of continuously distributed lengths excised from unrelated proteins. Five free-modeling (FM) targets have the model successfully constructed by QUARK with a TM-score above 0.4, including the first model of T0837-D1, which has a TM-score = 0.736 and RMSD = 2.9 Å to the native. Detailed analysis showed that the success is partly attributed to the high-resolution contact map prediction derived from fragment-based distance-profiles, which are mainly located between regular secondary structure elements and loops/turns and help guide the orientation of secondary structure assembly. In the Zhang-Server pipeline, weakly scoring threading templates are re-ordered by the structural similarity to the ab initio folding models, which are then reassembled by I-TASSER based structure assembly simulations; 60% more domains with length up to 204 residues, compared to the QUARK pipeline, were successfully modeled by the I-TASSER pipeline with a TM-score above 0.4. The robustness of the I-TASSER pipeline can stem from the composite fragment-assembly simulations that combine structures from both ab initio folding and threading template refinements. Despite the promising cases, challenges still exist in long-range beta-strand folding, domain parsing, and the uncertainty of secondary structure prediction; the latter of which was found to affect nearly all aspects of FM structure predictions, from fragment identification, target classification, structure assembly, to final model selection. Significant efforts are needed to solve these problems before real progress on FM could be made. Proteins 2016; 84(Suppl 1):76-86. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules

Directory of Open Access Journals (Sweden)

Michaeli Shulamit

2007-10-01

Full Text Available Abstract Background In recent years, RNA molecules that are not translated into proteins (ncRNAs have drawn a great deal of attention, as they were shown to be involved in many cellular functions. One of the most important computational problems regarding ncRNA is to predict the secondary structure of a molecule from its sequence. In particular, we attempted to predict the secondary structure for a set of unaligned ncRNA molecules that are taken from the same family, and thus presumably have a similar structure. Results We developed the RNAspa program, which comparatively predicts the secondary structure for a set of ncRNA molecules in linear time in the number of molecules. We observed that in a list of several hundred suboptimal minimal free energy (MFE predictions, as provided by the RNAsubopt program of the Vienna package, it is likely that at least one suggested structure would be similar to the true, correct one. The suboptimal solutions of each molecule are represented as a layer of vertices in a graph. The shortest path in this graph is the basis for structural predictions for the molecule. We also show that RNA secondary structures can be compared very rapidly by a simple string Edit-Distance algorithm with a minimal loss of accuracy. We show that this approach allows us to more deeply explore the suboptimal structure space. Conclusion The algorithm was tested on three datasets which include several ncRNA families taken from the Rfam database. These datasets allowed for comparison of the algorithm with other methods. In these tests, RNAspa performed better than four other programs.
Protein structure based prediction of catalytic residues.

Science.gov (United States)

Fajardo, J Eduardo; Fiser, Andras

2013-02-22

Worldwide structural genomics projects continue to release new protein structures at an unprecedented pace, so far nearly 6000, but only about 60% of these proteins have any sort of functional annotation. We explored a range of features that can be used for the prediction of functional residues given a known three-dimensional structure. These features include various centrality measures of nodes in graphs of interacting residues: closeness, betweenness and page-rank centrality. We also analyzed the distance of functional amino acids to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), and the use of relative entropy as a measure of sequence conservation. From the selected features, neural networks were trained to identify catalytic residues. We found that using distance to the GCM together with amino acid type provide a good discriminant function, when combined independently with sequence conservation. Using an independent test set of 29 annotated protein structures, the method returned 411 of the initial 9262 residues as the most likely to be involved in function. The output 411 residues contain 70 of the annotated 111 catalytic residues. This represents an approximately 14-fold enrichment of catalytic residues on the entire input set (corresponding to a sensitivity of 63% and a precision of 17%), a performance competitive with that of other state-of-the-art methods. We found that several of the graph based measures utilize the same underlying feature of protein structures, which can be simply and more effectively captured with the distance to GCM definition. This also has the added the advantage of simplicity and easy implementation. Meanwhile sequence conservation remains by far the most influential feature in identifying functional residues. We also found that due the rapid changes in size and composition of sequence databases, conservation calculations must be recalibrated for specific reference databases.
A recurrent de novo mutation in KCNC1 causes progressive myoclonus epilepsy

DEFF Research Database (Denmark)

Muona, M.; Berkovic, S. F.; Dibbens, L. M.

2015-01-01

Progressive myoclonus epilepsies (PMEs) are a group of rare, inherited disorders manifesting with action myoclonus, tonicclonic seizures and ataxia. We sequenced the exomes of 84 unrelated individuals with PME of unknown cause and molecularly solved 26 cases (31%). Remarkably, a recurrent de novo...
A SA8000 e a responsabilidade social das empresas: a emergência de um novo paradigma?

OpenAIRE

Lopes, Ana Catarina Marques Figueiredo Caetano

2004-01-01

Mestrado em Desenvolvimento e Cooperação Internacional A Responsabilidade Social das Empresas, não é um tema novo. Durante muito tempo foi discutido e interpretado no âmbito do debate sobre as responsabilidades que uma empresa deve assumir para além daquelas que tem perante os seus accionistas, e das impostas por lei. Hoje, observamos a emergência de um novo paradigma assente no reconhecimento que o sector privado não só pode, mas deve, fazer mais para combater a pobreza, preservar o meio ...
Anisotropic Elastoplastic Damage Mechanics Method to Predict Fatigue Life of the Structure

Directory of Open Access Journals (Sweden)

Hualiang Wan

2016-01-01

Full Text Available New damage mechanics method is proposed to predict the low-cycle fatigue life of metallic structures under multiaxial loading. The microstructure mechanical model is proposed to simulate anisotropic elastoplastic damage evolution. As the micromodel depends on few material parameters, the present method is very concise and suitable for engineering application. The material parameters in damage evolution equation are determined by fatigue experimental data of standard specimens. By employing further development on the ANSYS platform, the anisotropic elastoplastic damage mechanics-finite element method is developed. The fatigue crack propagation life of satellite structure is predicted using the present method and the computational results comply with the experimental data very well.
Prediction and constancy of cognitive-motivational structures in mothers and their adolescents.

Science.gov (United States)

Malerstein, A J; Ahern, M M; Pulos, S; Arasteh, J D

1995-01-01

Three clinically-derived, cognitive-motivational structures were predicted in 68 adolescents from their caregiving situations as revealed in their mothers' interviews, elicited six years earlier. Basic to each structure is a motivational concern and its related social cognitive style, a style which corresponds to a Piagetian cognitive stage: concrete operational, intuitive or symbolic. Because these structure types parse a non-clinical population, current views of health and accordingly goals of treatment may need modification.
Combining sequence-based prediction methods and circular dichroism and infrared spectroscopic data to improve protein secondary structure determinations

Directory of Open Access Journals (Sweden)

Lees Jonathan G

2008-01-01

Full Text Available Abstract Background A number of sequence-based methods exist for protein secondary structure prediction. Protein secondary structures can also be determined experimentally from circular dichroism, and infrared spectroscopic data using empirical analysis methods. It has been proposed that comparable accuracy can be obtained from sequence-based predictions as from these biophysical measurements. Here we have examined the secondary structure determination accuracies of sequence prediction methods with the empirically determined values from the spectroscopic data on datasets of proteins for which both crystal structures and spectroscopic data are available. Results In this study we show that the sequence prediction methods have accuracies nearly comparable to those of spectroscopic methods. However, we also demonstrate that combining the spectroscopic and sequences techniques produces significant overall improvements in secondary structure determinations. In addition, combining the extra information content available from synchrotron radiation circular dichroism data with sequence methods also shows improvements. Conclusion Combining sequence prediction with experimentally determined spectroscopic methods for protein secondary structure content significantly enhances the accuracy of the overall results obtained.
Study on Strategic Planning of Road and Bridge Infrastructure Development in City Planning: Taking Porto-novo City of Benin Republic as Example

Directory of Open Access Journals (Sweden)

Boko-haya Dossa Didier

2018-01-01

Full Text Available Concern about the townlet infrastructure construction in developing country is one of the crucial part of county town planning and development. By taking the overall planning and design in a case study of Porto-novo city at Republic of Benin, this paper analyzes the characteristics and opportunities of Porto-novo city and puts forward corresponding infrastructure construction strategy. In the end, the paper comes up with specific plan of planning and design under the background of Porto-novo's planning of development strategy.
Managing uncertainty in metabolic network structure and improving predictions using EnsembleFBA.

Directory of Open Access Journals (Sweden)

Matthew B Biggs

2017-03-01

Full Text Available Genome-scale metabolic network reconstructions (GENREs are repositories of knowledge about the metabolic processes that occur in an organism. GENREs have been used to discover and interpret metabolic functions, and to engineer novel network structures. A major barrier preventing more widespread use of GENREs, particularly to study non-model organisms, is the extensive time required to produce a high-quality GENRE. Many automated approaches have been developed which reduce this time requirement, but automatically-reconstructed draft GENREs still require curation before useful predictions can be made. We present a novel approach to the analysis of GENREs which improves the predictive capabilities of draft GENREs by representing many alternative network structures, all equally consistent with available data, and generating predictions from this ensemble. This ensemble approach is compatible with many reconstruction methods. We refer to this new approach as Ensemble Flux Balance Analysis (EnsembleFBA. We validate EnsembleFBA by predicting growth and gene essentiality in the model organism Pseudomonas aeruginosa UCBPP-PA14. We demonstrate how EnsembleFBA can be included in a systems biology workflow by predicting essential genes in six Streptococcus species and mapping the essential genes to small molecule ligands from DrugBank. We found that some metabolic subsystems contributed disproportionately to the set of predicted essential reactions in a way that was unique to each Streptococcus species, leading to species-specific outcomes from small molecule interactions. Through our analyses of P. aeruginosa and six Streptococci, we show that ensembles increase the quality of predictions without drastically increasing reconstruction time, thus making GENRE approaches more practical for applications which require predictions for many non-model organisms. All of our functions and accompanying example code are available in an open online repository.
Antimicrobial peptide capsids of de novo design.

Science.gov (United States)

De Santis, Emiliana; Alkassem, Hasan; Lamarre, Baptiste; Faruqui, Nilofar; Bella, Angelo; Noble, James E; Micale, Nicola; Ray, Santanu; Burns, Jonathan R; Yon, Alexander R; Hoogenboom, Bart W; Ryadnov, Maxim G

2017-12-22

The spread of bacterial resistance to antibiotics poses the need for antimicrobial discovery. With traditional search paradigms being exhausted, approaches that are altogether different from antibiotics may offer promising and creative solutions. Here, we introduce a de novo peptide topology that-by emulating the virus architecture-assembles into discrete antimicrobial capsids. Using the combination of high-resolution and real-time imaging, we demonstrate that these artificial capsids assemble as 20-nm hollow shells that attack bacterial membranes and upon landing on phospholipid bilayers instantaneously (seconds) convert into rapidly expanding pores causing membrane lysis (minutes). The designed capsids show broad antimicrobial activities, thus executing one primary function-they destroy bacteria on contact.
The Novo Okno copper deposit of olistostrome origin (Bor, Eastern Serbia

Directory of Open Access Journals (Sweden)

Antonijević Ivan

2011-01-01

Full Text Available The copper deposit Novo Okno, uncovered at present, with non-ore and ore clasts of massive sulphides (from 0.5 to 50 m3 in size, has many distinctive features that indicate its olistostrome origin. The deposit is chaotic in structure, unstratified, with the lower surface unconformable over the underlying parent rock of the basin. It is a lens-like body, with the longer axis directed east and west, variable in thickness from 15 to 28 metres, about 335 metres long and less than 140 metres wide. These and other characteristics of the body indicate a unified, reworked, olistostrome copper deposit formed from primary ore bodies of the Bor mineral deposit and vulcanite, destroyed by volcanic explosion into blocks and rocks of Turonian age and extrusion and concurrent deposition on the land surface. Gravitational massive sliding of the consolidated rocks down the slopes of the volcanic relief and chaotic accumulation of ore and non-ore clasts (olistoliths in a marine basin evolved in the Upper Turonian and the Lower Senonian.
Globalização: novo paradigma das ciências sociais

Directory of Open Access Journals (Sweden)

Octavio Ianni

1994-08-01

Full Text Available As ciências sociais estão sendo desafiadas a pensar a globalização do mundo. No fim do século XX, quando se anuncia o XXI, elas se defrontam com os dilemas que se abrem com a globalização das coisas, gentes e idéias. Há processos e estruturas sociais, econômicos, políticos, culturais e outros que apenas começam a ser estudados. Além do que é local, nacional e regional, colocam-se problemas novos e fundamentais com a emergência da sociedade global. As fronteiras geográficas e históricas, culturais e civilizatórias parecem modificar-se em direções e formas surpreendentes. Indivíduo, grupo, classe, coletividade e povo são colocados diante de outros horizontes. O próprio pensamento científico é desafiado a elaborar conceitos e interpretações para dar conta de realidades pouco conhecidas. As teorias da globalização, que começam a ser esboçadas, revelam o empenho das ciências sociais em explicar o que há de novo no que vai pelo mundo.Social sciences are now being challenged to think on the world's globalization. At the end of the twentieth century and dawn of the twenty first, they are faced with the dilemas that open up with the globalization of things, people and ideas: There are social, economical, political, cultural and other processes and structures that are just begining to be studied. Besides what is local, national and regional, new and fundamental problems appear with the rising global society. The geographic, historical, cultural and civilizatorian limits seem to change in surprising ways and directions. The individual, group, class, colectivity and people are put before other horizons. The scientific thinking itself is called upon to elaborate concepts and interpretations to account for little known realities. The globalization theories that are just being sketched show the efforts of Social Sciences to explain what is new going on in the world.
Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome.

Science.gov (United States)

Weisberg, Alexandra J; Kim, Gunjune; Westwood, James H; Jelesko, John G

2017-11-10

Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is "leaves of three, let it be", which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species.
Molecular interaction of the first 3 enzymes of the de novo pyrimidine biosynthetic pathway of Trypanosoma cruzi

International Nuclear Information System (INIS)

Nara, Takeshi; Hashimoto, Muneaki; Hirawake, Hiroko; Liao, Chien-Wei; Fukai, Yoshihisa; Suzuki, Shigeo; Tsubouchi, Akiko; Morales, Jorge; Takamiya, Shinzaburo; Fujimura, Tsutomu; Taka, Hikari; Mineki, Reiko; Fan, Chia-Kwung; Inaoka, Daniel Ken; Inoue, Masayuki; Tanaka, Akiko; Harada, Shigeharu; Kita, Kiyoshi

2012-01-01

Highlights: ► An Escherichia coli strain co-expressing CPSII, ATC, and DHO of Trypanosoma cruzi was constructed. ► Molecular interactions between CPSII, ATC, and DHO of T. cruzi were demonstrated. ► CPSII bound with both ATC and DHO. ► ATC bound with both CPSII and DHO. ► A functional tri-enzyme complex might precede the establishment of the fused enzyme. -- Abstract: The first 3 reaction steps of the de novo pyrimidine biosynthetic pathway are catalyzed by carbamoyl-phosphate synthetase II (CPSII), aspartate transcarbamoylase (ATC), and dihydroorotase (DHO), respectively. In eukaryotes, these enzymes are structurally classified into 2 types: (1) a CPSII-DHO-ATC fusion enzyme (CAD) found in animals, fungi, and amoebozoa, and (2) stand-alone enzymes found in plants and the protist groups. In the present study, we demonstrate direct intermolecular interactions between CPSII, ATC, and DHO of the parasitic protist Trypanosoma cruzi, which is the causative agent of Chagas disease. The 3 enzymes were expressed in a bacterial expression system and their interactions were examined. Immunoprecipitation using an antibody specific for each enzyme coupled with Western blotting-based detection using antibodies for the counterpart enzymes showed co-precipitation of all 3 enzymes. From an evolutionary viewpoint, the formation of a functional tri-enzyme complex may have preceded—and led to—gene fusion to produce the CAD protein. This is the first report to demonstrate the structural basis of these 3 enzymes as a model of CAD. Moreover, in conjunction with the essentiality of de novo pyrimidine biosynthesis in the parasite, our findings provide a rationale for new strategies for developing drugs for Chagas disease, which target the intermolecular interactions of these 3 enzymes.
SGC method for predicting the standard enthalpy of formation of pure compounds from their molecular structures

International Nuclear Information System (INIS)

Albahri, Tareq A.; Aljasmi, Abdulla F.

2013-01-01

Highlights: • ΔH° f is predicted from the molecular structure of the compounds alone. • ANN-SGC model predicts ΔH° f with a correlation coefficient of 0.99. • ANN-MNLR model predicts ΔH° f with a correlation coefficient of 0.90. • Better definition of the atom-type molecular groups is presented. • The method is better than others in terms of combined simplicity, accuracy and generality. - Abstract: A theoretical method for predicting the standard enthalpy of formation of pure compounds from various chemical families is presented. Back propagation artificial neural networks were used to investigate several structural group contribution (SGC) methods available in literature. The networks were used to probe the structural groups that have significant contribution to the overall enthalpy of formation property of pure compounds and arrive at the set of groups that can best represent the enthalpy of formation for about 584 substances. The 51 atom-type structural groups listed provide better definitions of group contributions than others in the literature. The proposed method can predict the standard enthalpy of formation of pure compounds with an AAD of 11.38 kJ/mol and a correlation coefficient of 0.9934 from only their molecular structure. The results are further compared with those of the traditional SGC method based on MNLR as well as other methods in the literature
Methodology for predicting ultimate pressure capacity of the ACR-1000 containment structure

International Nuclear Information System (INIS)

Saudy, A.M.; Awad, A.; Elgohary, M.

2006-01-01

The Advanced CANDU Reactor or the ACR-1000 is developed by Atomic Energy of Canada Limited (AECL) to be the next step in the evolution of the CANDU product line. It is based on the proven CANDU technology and incorporates advanced design technologies. The ACR containment structure is an essential element of the overall defense in depth approach to reactor safety, and is a physical barrier against the release of radioactive material to the environment. Therefore, it is important to provide a robust design with an adequate margin of safety. One of the key design requirements of the ACR containment structure is to have an ultimate pressure capacity that is at least twice the design pressure Using standard design codes, the containment structure is expected to behave elastically at least up to 1.5 times the design pressure. Beyond this pressure level, the concrete containment structure with reinforcements and post-tension tendons behaves in a highly non-linear manner and exhibits a complex response when cracks initiate and propagate. To predict the structural non-linear responses, at least two critical features are involved. These are: the structural idealization by the geometry and material property models, and the adopted solution algorithm. Therefore, detailed idealization of the concrete structure is needed in order to accurately predict its ultimate pressure capacity. This paper summarizes the analysis methodology to be carried out to establish the ultimate pressure capacity of the ACR containment structure and to confirm that the structure meets the specified design requirements. (author)

FreeContact: fast and free software for protein contact prediction from residue co-evolution.

Science.gov (United States)

Kaján, László; Hopf, Thomas A; Kalaš, Matúš; Marks, Debora S; Rost, Burkhard

2014-03-26

20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct Coupling Analysis (EVfold-mfDCA); the method PSICOV applied a related concept by estimating a sparse inverse covariance matrix. Both methods (EVfold-mfDCA and PSICOV) are publicly available, but both require too much CPU time for interactive applications. On top, EVfold-mfDCA depends on proprietary software. Here, we present FreeContact, a fast, open source implementation of EVfold-mfDCA and PSICOV. On a test set of 140 proteins, FreeContact was almost eight times faster than PSICOV without decreasing prediction performance. The EVfold-mfDCA implementation of FreeContact was over 220 times faster than PSICOV with negligible performance decrease. EVfold-mfDCA was unavailable for testing due to its dependency on proprietary software. FreeContact is implemented as the free C++ library "libfreecontact", complete with command line tool "freecontact", as well as Perl and Python modules. All components are available as Debian packages. FreeContact supports the BioXSD format for interoperability. FreeContact provides the opportunity to compute reliable contact predictions in any environment (desktop or cloud).
BayesMotif: de novo protein sorting motif discovery from impure datasets.

Science.gov (United States)

Hu, Jianjun; Zhang, Fan

2010-01-18

Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of
Resveratrol induces growth inhibition and apoptosis in metastatic breast cancer cells via de novo ceramide signaling.

Science.gov (United States)

Scarlatti, Francesca; Sala, Giusy; Somenzi, Giulia; Signorelli, Paola; Sacchi, Nicoletta; Ghidoni, Riccardo

2003-12-01

Resveratrol (3,4',5-trans-trihydroxystilbene), a phytoalexin present in grapes and red wine, is emerging as a natural compound with potential anticancer properties. Here we show that resveratrol can induce growth inhibition and apoptosis in MDA-MB-231, a highly invasive and metastatic breast cancer cell line, in concomitance with a dramatic endogenous increase of growth inhibitory/proapoptotic ceramide. We found that accumulation of ceramide derives from both de novo ceramide synthesis and sphingomyelin hydrolysis. More specifically we demonstrated that ceramide accumulation induced by resveratrol can be traced to the activation of serine palmitoyltransferase (SPT), the key enzyme of de novo ceramide biosynthetic pathway, and neutral sphingomyelinase (nSMase), a main enzyme involved in the sphingomyelin/ceramide pathway. However, by using specific inhibitors of SPT, myriocin and L-cycloserine, and nSMase, gluthatione and manumycin, we found that only the SPT inhibitors could counteract the biological effects induced by resveratrol. Thus, resveratrol seems to exert its growth inhibitory/apoptotic effect on the metastatic breast cancer cell line MDA-MB-231 by activating the de novo ceramide synthesis pathway.
A Swedish family with de novo alpha-synuclein A53T mutation: evidence for early cortical dysfunction

DEFF Research Database (Denmark)

Puschmann, Andreas; Ross, Owen A; Vilariño-Güell, Carles

2009-01-01

A de novo alpha-synuclein A53T (p.Ala53 Th; c.209G > A) mutation has been identified in a Swedish family with autosomal dominant Parkinson's disease (PD). Two affected individuals had early-onset (before 31 and 40 years), severe levodopa-responsive PD with prominent dysphasia, dysarthria, and cog......A de novo alpha-synuclein A53T (p.Ala53 Th; c.209G > A) mutation has been identified in a Swedish family with autosomal dominant Parkinson's disease (PD). Two affected individuals had early-onset (before 31 and 40 years), severe levodopa-responsive PD with prominent dysphasia, dysarthria......) and the Greek-American Family H kindreds. One unaffected family member carried the mutation haplotype without the c.209A mutation, strongly suggesting its de novo occurrence within this family. Furthermore, a novel mutation c.488G > A (p.Arg163His; R163H) in the presenilin-2 (PSEN2) gene was detected...
External validation of structure-biodegradation relationship (SBR) models for predicting the biodegradability of xenobiotics.

Science.gov (United States)

Devillers, J; Pandard, P; Richard, B

2013-01-01

Biodegradation is an important mechanism for eliminating xenobiotics by biotransforming them into simple organic and inorganic products. Faced with the ever growing number of chemicals available on the market, structure-biodegradation relationship (SBR) and quantitative structure-biodegradation relationship (QSBR) models are increasingly used as surrogates of the biodegradation tests. Such models have great potential for a quick and cheap estimation of the biodegradation potential of chemicals. The Estimation Programs Interface (EPI) Suite™ includes different models for predicting the potential aerobic biodegradability of organic substances. They are based on different endpoints, methodologies and/or statistical approaches. Among them, Biowin 5 and 6 appeared the most robust, being derived from the largest biodegradation database with results obtained only from the Ministry of International Trade and Industry (MITI) test. The aim of this study was to assess the predictive performances of these two models from a set of 356 chemicals extracted from notification dossiers including compatible biodegradation data. Another set of molecules with no more than four carbon atoms and substituted by various heteroatoms and/or functional groups was also embodied in the validation exercise. Comparisons were made with the predictions obtained with START (Structural Alerts for Reactivity in Toxtree). Biowin 5 and Biowin 6 gave satisfactorily prediction results except for the prediction of readily degradable chemicals. A consensus model built with Biowin 1 allowed the diminution of this tendency.
Evaluation of effect of Ophiostoma novo-ulmi on four major wood ...

African Journals Online (AJOL)

Evaluation of effect of Ophiostoma novo-ulmi on four major wood species of the elm family in Rasht (North West of Iran) ... the diameter size of vessels and the number of xylary rays in four species: Ulmus carpinifolia, Ulmus glabra, Zelkova carpinifolia and Celtis australis as important factors in host resistance to elm disease.
A multicontroller structure for teaching and designing predictive control strategies

International Nuclear Information System (INIS)

Hodouin, D.; Desbiens, A.

1999-01-01

The paper deals with the unification of the existing linear control algorithms in order to facilitate their transfer to the engineering students and to industry's engineers. The resulting control algorithm is the Global Predictive Control (GlobPC), which is now taught at the graduate and continuing education levels. GlobPC is based on an internal model framework where three independent control criteria are minimized: one for tracking, one for regulation and one for feedforward. This structure allows to obtain desired tracking, regulation and feedforward behaviors in an optimal way while keeping them perfectly separated. It also cleanly separates the deterministic and stochastic predictions of the process model output. (author)
A Method to Predict the Structure and Stability of RNA/RNA Complexes.

Science.gov (United States)

Xu, Xiaojun; Chen, Shi-Jie

2016-01-01

RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.
Model structural uncertainty quantification and hydrologic parameter and prediction error analysis using airborne electromagnetic data

DEFF Research Database (Denmark)

Minsley, B. J.; Christensen, Nikolaj Kruse; Christensen, Steen

Model structure, or the spatial arrangement of subsurface lithological units, is fundamental to the hydrological behavior of Earth systems. Knowledge of geological model structure is critically important in order to make informed hydrological predictions and management decisions. Model structure...... is never perfectly known, however, and incorrect assumptions can be a significant source of error when making model predictions. We describe a systematic approach for quantifying model structural uncertainty that is based on the integration of sparse borehole observations and large-scale airborne...... electromagnetic (AEM) data. Our estimates of model structural uncertainty follow a Bayesian framework that accounts for both the uncertainties in geophysical parameter estimates given AEM data, and the uncertainties in the relationship between lithology and geophysical parameters. Using geostatistical sequential...
De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences

Directory of Open Access Journals (Sweden)

Shairul Izan

2017-08-01

Full Text Available Whole Genome Shotgun (WGS sequences of plant species often contain an abundance of reads that are derived from the chloroplast genome. Up to now these reads have generally been identified and assembled into chloroplast genomes based on homology to chloroplasts from related species. This re-sequencing approach may select against structural differences between the genomes especially in non-model species for which no close relatives have been sequenced before. The alternative approach is to de novo assemble the chloroplast genome from total genomic DNA sequences. In this study, we used k-mer frequency tables to identify and extract the chloroplast reads from the WGS reads and assemble these using a highly integrated and automated custom pipeline. Our strategy includes steps aimed at optimizing assemblies and filling gaps which are left due to coverage variation in the WGS dataset. We have successfully de novo assembled three complete chloroplast genomes from plant species with a range of nuclear genome sizes to demonstrate the universality of our approach: Solanum lycopersicum (0.9 Gb, Aegilops tauschii (4 Gb and Paphiopedilum henryanum (25 Gb. We also highlight the need to optimize the choice of k and the amount of data used. This new and cost-effective method for de novo short read assembly will facilitate the study of complete chloroplast genomes with more accurate analyses and inferences, especially in non-model plant genomes.
High Incidence of De Novo and Subclinical Atrial Fibrillation in Patients With Hypertrophic Cardiomyopathy and Cardiac Rhythm Management Device.

Science.gov (United States)

Wilke, Iris; Witzel, Katrin; Münch, Julia; Pecha, Simon; Blankenberg, Stephan; Reichenspurner, Hermann; Willems, Stephan; Patten, Monica; Aydin, Ali

2016-07-01

Atrial fibrillation (AF) is an important prognostic parameter in patients with hypertrophic cardiomyopathy (HCM). Though cardiac rhythm management (CRM) devices (e.g., ICD, pacemaker or implantable loop recorder) can detect subclinical AF, data describing the incidence of AF are rare. We therefore investigated the incidence and clinical impact of de novo and subclinical AF detected by CRM devices in patients with HCM. In our retrospective single-center study, we included patients with HCM and need for CRM devices. The primary endpoint of the study was the incidence of clinical and subclinical de novo AF. During follow-up, patients were screened for adverse events like stroke, ventricular arrhythmia, heart failure, or death. From 192 HCM patients, 44 patients received a CRM device (38 ICDs, 5 pacemakers, 1 implantable loop recorder). In 14 of these patients (32%), AF had been documented before device implantation. Thirty (68%) patients were free from AF at the time of implantation. During a median follow-up of 595 days (interquartile range, 367-890 days), de novo AF was recorded in 16 of these 30 patients (53%). Fourteen (88%) of the 16 patients with de novo AF were free from any clinical symptoms, so these patients were classified to have subclinical AF. In logistic regression analysis, age was the only significant predictor for an increased risk of AF. AF is common in patients with HCM who need a CRM device. More than 50% of these patients develop de novo AF that was predominantly subclinical in our cohort. © 2016 Wiley Periodicals, Inc.
Whole Exome Sequencing for a Patient with Rubinstein-Taybi Syndrome Reveals de Novo Variants besides an Overt CREBBP Mutation

Directory of Open Access Journals (Sweden)

Hee Jeong Yoo

2015-03-01

Full Text Available Rubinstein-Taybi syndrome (RSTS is a rare condition with a prevalence of 1 in 125,000–720,000 births and characterized by clinical features that include facial, dental, and limb dysmorphology and growth retardation. Most cases of RSTS occur sporadically and are caused by de novo mutations. Cytogenetic or molecular abnormalities are detected in only 55% of RSTS cases. Previous genetic studies have yielded inconsistent results due to the variety of methods used for genetic analysis. The purpose of this study was to use whole exome sequencing (WES to evaluate the genetic causes of RSTS in a young girl presenting with an Autism phenotype. We used the Autism diagnostic observation schedule (ADOS and Autism diagnostic interview revised (ADI-R to confirm her diagnosis of Autism. In addition, various questionnaires were used to evaluate other psychiatric features. We used WES to analyze the DNA sequences of the patient and her parents and to search for de novo variants. The patient showed all the typical features of Autism, WES revealed a de novo frameshift mutation in CREBBP and de novo sequence variants in TNC and IGFALS genes. Mutations in the CREBBP gene have been extensively reported in RSTS patients, while potential missense mutations in TNC and IGFALS genes have not previously been associated with RSTS. The TNC and IGFALS genes are involved in central nervous system development and growth. It is possible for patients with RSTS to have additional de novo variants that could account for previously unexplained phenotypes.
De novo determination of internuclear vector orientations from residual dipolar couplings measured in three independent alignment media

International Nuclear Information System (INIS)

Ruan Ke; Briggman, Kathryn B.; Tolman, Joel R.

2008-01-01

The straightforward interpretation of solution state residual dipolar couplings (RDCs) in terms of internuclear vector orientations generally requires prior knowledge of the alignment tensor, which in turn is normally estimated using a structural model. We have developed a protocol which allows the requirement for prior structural knowledge to be dispensed with as long as RDC measurements can be made in three independent alignment media. This approach, called Rigid Structure from Dipolar Couplings (RSDC), allows vector orientations and alignment tensors to be determined de novo from just three independent sets of RDCs. It is shown that complications arising from the existence of multiple solutions can be overcome by careful consideration of alignment tensor magnitudes in addition to the agreement between measured and calculated RDCs. Extensive simulations as well applications to the proteins ubiquitin and Staphylococcal protein GB1 demonstrate that this method can provide robust determinations of alignment tensors and amide N-H bond orientations often with better than 10 o accuracy, even in the presence of modest levels of internal dynamics
Analysis of pyrimidine synthesis "de novo" intermediates in urine and dried urine filter- paper strips with HPLC-electrospray tandem mass spectrometry

NARCIS (Netherlands)

van Kuilenburg, André B. P.; van Lenthe, Henk; Löffler, Monika; van Gennip, Albert H.

2004-01-01

BACKGROUND: The concentrations of the pyrimidine "de novo" metabolites and their degradation products in urine are useful indicators for the diagnosis of an inborn error of the pyrimidine de novo pathway or a urea-cycle defect. Until now, no procedure was available that allowed the analysis of all
Prediction of RNA secondary structures: from theory to models and real molecules

International Nuclear Information System (INIS)

Schuster, Peter

2006-01-01

RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from two letter alphabets form perfect structures when the nucleotides can form a base pair, as is the case with {GC} or {AU}, but the relation between the sequences and structures differs strongly from the four letter alphabet. A comprehensive theory of RNA structure is presented, which is based on the concepts of sequence space and shape space, being a space of structures. It sets the stage for modelling processes in ensembles of RNA molecules like evolutionary optimization or kinetic folding as dynamical phenomena guided by mappings between the two spaces. The number of minimum free energy (mfe) structures is always smaller than the number of sequences, even for two letter alphabets. Folding of RNA molecules into mfe energy structures constitutes a non-invertible mapping from sequence space onto shape space. The preimage of a structure in sequence space is defined as its neutral network. Similarly the set of suboptimal structures is the preimage of a sequence in shape space. This set represents the conformation space of a given sequence. The evolutionary optimization of structures in populations is a process taking place in sequence space, whereas kinetic folding occurs in molecular ensembles that optimize free energy in conformation space. Efficient folding algorithms based on dynamic programming are available for the prediction of secondary structures for given sequences. The inverse problem, the computation of sequences for predefined structures, is an important tool for the design of RNA molecules with tailored properties. Simultaneous folding or cofolding of two or more RNA
De novo formation of centrosomes in vertebrate cells arrested during S phase

NARCIS (Netherlands)

Khodjakov, A; Rieder, CL; Sluder, G; Cassels, G; Sibon, O; Wang, CL

2002-01-01

The centrosome usually replicates in a semiconservative fashion, i.e., new centrioles form in association with preexisting "maternal" centrioles. De novo formation of centrioles has been reported for a few highly specialized cell types but it has not been seen in vertebrate somatic cells. We find
De novo mutations of GCK, HNF1A and HNF4A may be more frequent in MODY than previously assumed.

Science.gov (United States)

Stanik, Juraj; Dusatkova, Petra; Cinek, Ondrej; Valentinova, Lucia; Huckova, Miroslava; Skopkova, Martina; Dusatkova, Lenka; Stanikova, Daniela; Pura, Mikulas; Klimes, Iwar; Lebl, Jan; Gasperikova, Daniela; Pruhova, Stepanka

2014-03-01

MODY is mainly characterised by an early onset of diabetes and a positive family history of diabetes with an autosomal dominant mode of inheritance. However, de novo mutations have been reported anecdotally. The aim of this study was to systematically revisit a large collection of MODY patients to determine the minimum prevalence of de novo mutations in the most prevalent MODY genes (i.e. GCK, HNF1A, HNF4A). Analysis of 922 patients from two national MODY centres (Slovakia and the Czech Republic) identified 150 probands (16%) who came from pedigrees that did not fulfil the criterion of two generations with diabetes but did fulfil the remaining criteria. The GCK, HNF1A and HNF4A genes were analysed by direct sequencing. Mutations in GCK, HNF1A or HNF4A genes were detected in 58 of 150 individuals. Parents of 28 probands were unavailable for further analysis, and in 19 probands the mutation was inherited from an asymptomatic parent. In 11 probands the mutations arose de novo. In our cohort of MODY patients from two national centres the de novo mutations in GCK, HNF1A and HNF4A were present in 7.3% of the 150 families without a history of diabetes and 1.2% of all of the referrals for MODY testing. This is the largest collection of de novo MODY mutations to date, and our findings indicate a much higher frequency of de novo mutations than previously assumed. Therefore, genetic testing of MODY could be considered for carefully selected individuals without a family history of diabetes.
A rice gene of de novo origin negatively regulates pathogen-induced defense response.

Directory of Open Access Journals (Sweden)

Wenfei Xiao

Full Text Available How defense genes originated with the evolution of their specific pathogen-responsive traits remains an important problem. It is generally known that a form of duplication can generate new genes, suggesting that a new gene usually evolves from an ancestral gene. However, we show that a new defense gene in plants may evolve by de novo origination, resulting in sophisticated disease-resistant functions in rice. Analyses of gene evolution showed that this new gene, OsDR10, had homologs only in the closest relative, Leersia genus, but not other subfamilies of the grass family; therefore, it is a rice tribe-specific gene that may have originated de novo in the tribe. We further show that this gene may evolve a highly conservative rice-specific function that contributes to the regulation difference between rice and other plant species in response to pathogen infections. Biologic analyses including gene silencing, pathologic analysis, and mutant characterization by transformation showed that the OsDR10-suppressed plants enhanced resistance to a broad spectrum of Xanthomonas oryzae pv. oryzae strains, which cause bacterial blight disease. This enhanced disease resistance was accompanied by increased accumulation of endogenous salicylic acid (SA and suppressed accumulation of endogenous jasmonic acid (JA as well as modified expression of a subset of defense-responsive genes functioning both upstream and downstream of SA and JA. These data and analyses provide fresh insights into the new biologic and evolutionary processes of a de novo gene recruited rapidly.
A rare case of de novo gigantic ovarian abscess within an endometrioma.

Science.gov (United States)

Hameed, Aisha; Mehta, Vaishali; Sinha, Prabha

2010-06-01

We are reporting a rare case of de novo ovarian abscess in an endometrioma. Ovarian abscess within an endometrioma is a rare gynecological problem, but de novo abscess in the endometrioma is even rarer. Most of the ovarian abscesses develop in the endometriomas following interventions, e.g., aspiration, pelvic surgery, and oocyte retrieval. We are presenting a case of a spontaneous giant abscess in a large ovarian cyst in a nulliparous woman who presented with acute abdomen. Patient was treated in a district general hospital with multidisciplinary approach. Thirteen liters of the pus were drained. She has had a sub total (supra cervical) hysterectomy and bilateral salpingo-oophorectomy (BSO) performed. Histology of the abscess wall confirmed endometriotic nature of the cyst. Patient made an uneventful recovery and was discharged home on the 14th postoperative day. This case highlights that endometrioma and its complication can present as a surgical emergency and should be dealt as one.
A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure.

Science.gov (United States)

Fan, Ming; Zheng, Bin; Li, Lihua

2015-10-01

Knowledge of the structural class of a given protein is important for understanding its folding patterns. Although a lot of efforts have been made, it still remains a challenging problem for prediction of protein structural class solely from protein sequences. The feature extraction and classification of proteins are the main problems in prediction. In this research, we extended our earlier work regarding these two aspects. In protein feature extraction, we proposed a scheme by calculating the word frequency and word position from sequences of amino acid, reduced amino acid, and secondary structure. For an accurate classification of the structural class of protein, we developed a novel Multi-Agent Ada-Boost (MA-Ada) method by integrating the features of Multi-Agent system into Ada-Boost algorithm. Extensive experiments were taken to test and compare the proposed method using four benchmark datasets in low homology. The results showed classification accuracies of 88.5%, 96.0%, 88.4%, and 85.5%, respectively, which are much better compared with the existing methods. The source code and dataset are available on request.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.