WorldWideScience

Sample records for acid sequence features

  1. Protein location prediction using atomic composition and global features of the amino acid sequence

    Energy Technology Data Exchange (ETDEWEB)

    Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India)]; Nair, Achuthsankar S. [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India)]

    2010-01-22

    The subcellular location of a protein is useful information for determining its function, screening for drug candidates, designing vaccines, annotating gene products, and selecting relevant proteins for further study. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full-length protein sequence to incorporate more biological information. A new biological feature, the distribution of atomic composition, is used effectively together with multiple physicochemical properties, amino acid composition, three-part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and the prediction is made by a weighted voting system. Our system makes predictions with accuracies of 100, 82.47, and 88.81 for the self-consistency test, jackknife test, and independent data test, respectively. Our results provide evidence that prediction based on biological features derived from the full-length amino acid sequence gives better accuracy than prediction based on features derived from the N-terminal alone. Considering the features as a distribution within the entire sequence brings out the underlying property distribution in greater detail and enhances the prediction accuracy.
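
As an illustration of two of the features named in record 1, the following Python sketch computes a plain amino acid composition and a three-part composition. It is not the authors' implementation: the function names are hypothetical, and cutting the sequence into three roughly equal segments is an assumption, since the abstract does not say how the parts are defined.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def composition(seq):
    """Return the fraction of each standard amino acid in seq (20 values)."""
    counts = Counter(seq.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        # Empty segment or no standard residues: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def three_part_composition(seq):
    """Concatenate the compositions of three consecutive segments (60 values).

    Assumption: the segments are roughly equal thirds of the sequence.
    """
    n = len(seq)
    cuts = [0, n // 3, 2 * n // 3, n]
    features = []
    for start, end in zip(cuts, cuts[1:]):
        features.extend(composition(seq[start:end]))
    return features


if __name__ == "__main__":
    example = "MKVLAAGIVGLLLAQPSFA"  # made-up sequence, for illustration only
    print(len(composition(example)), len(three_part_composition(example)))
```

Vectors of this kind could be fed to any classifier; the per-segment version keeps some positional information that the whole-sequence composition discards.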

  2. Extraction of Sequence Conservation Features for the Prioritization of Candidate Single Amino Acid Polymorphisms

    Directory of Open Access Journals (Sweden)

    Jiaxin Wu

    2011-03-01

    Although remarkable success has been achieved by genome-wide association (GWA) studies over the past few years, genetic variants discovered in GWA studies can typically account for only a small fraction of the heritability of most common diseases. As such, the identification of multiple rare variants that are associated with complex diseases has been receiving more and more attention. However, most of the recently developed statistical approaches for detecting association of rare variants with diseases require the selection of functional variants before the subsequent analysis, making an effective bioinformatics method for filtering out non-relevant rare variants indispensable. In this paper, we focus on a specific type of genetic variant called single amino acid polymorphisms (SAAPs). We propose to prioritize candidate SAAPs for a specific disease according to their association scores, which are calculated using a guilt-by-association model with a set of features derived from protein sequences. We validate the proposed approach in a systematic way and demonstrate that the proposed model is powerful in distinguishing disease-associated SAAPs for the specific disease of interest.

  3. Classifying Genomic Sequences by Sequence Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hua Liu; Dian Jiao; Xiao Sun

    2005-01-01

    Traditional sequence analysis depends on sequence alignment. In this study, we analyzed various functional regions of the human genome based on sequence features, including word frequency, dinucleotide relative abundance, and base-base correlation. We analyzed human chromosome 22 and classified the upstream, exon, intron, downstream, and intergenic regions by principal component analysis and discriminant analysis of these features. The results show that the functional regions of the genome can be classified based on sequence features and discriminant analysis.
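
One of the features named in record 3, dinucleotide relative abundance, is commonly defined as the ratio of an observed dinucleotide frequency to the product of its two mononucleotide frequencies, rho_xy = f_xy / (f_x * f_y). The Python sketch below assumes that single-strand definition; the abstract does not state which variant the authors used, and the function name is hypothetical.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"


def dinucleotide_relative_abundance(seq):
    """Return {dinucleotide: rho_xy} with rho_xy = f_xy / (f_x * f_y).

    Frequencies are taken from the given strand only (an assumption);
    positions containing characters other than A, C, G, T are skipped.
    """
    seq = seq.upper()
    mono = Counter(b for b in seq if b in BASES)
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    rho = {}
    for x, y in product(BASES, repeat=2):
        f_x = mono[x] / n_mono if n_mono else 0.0
        f_y = mono[y] / n_mono if n_mono else 0.0
        f_xy = di[x + y] / n_di if n_di else 0.0
        # Report 0.0 when a base is absent, to avoid dividing by zero.
        rho[x + y] = f_xy / (f_x * f_y) if f_x and f_y else 0.0
    return rho


if __name__ == "__main__":
    profile = dinucleotide_relative_abundance("ACGTTGCAACGT")  # toy sequence
    for pair in sorted(profile):
        print(pair, round(profile[pair], 3))
```

The resulting 16 values form a fixed-length vector per region, which is the kind of input that principal component analysis or discriminant analysis expects.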

  4. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    Directory of Open Access Journals (Sweden)

    Selvaraj S

    2011-10-01

    Background: Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods: We have developed an energy based approach for identifying the binding site residues in protein-protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching. Results: We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix. Conclusions: The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.

  5. Protein sequence classification using feature hashing.

    Science.gov (United States)

    Caragea, Cornelia; Silvescu, Adrian; Mitra, Prasenjit

    2012-06-21

    Recent advances in next-generation sequencing technologies have resulted in an exponential increase in the rate at which protein sequence data are being acquired. The k-gram feature representation, commonly used for protein sequence classification, usually results in prohibitively high dimensional input spaces, for large values of k. Applying data mining algorithms to these input spaces may be intractable due to the large number of dimensions. Hence, using dimensionality reduction techniques can be crucial for the performance and the complexity of the learning algorithms. In this paper, we study the applicability of feature hashing to protein sequence classification, where the original high-dimensional space is "reduced" by hashing the features into a low-dimensional space, using a hash function, i.e., by mapping features into hash keys, where multiple features can be mapped (at random) to the same hash key, and "aggregating" their counts. We compare feature hashing with the "bag of k-grams" approach. Our results show that feature hashing is an effective approach to reducing dimensionality on protein sequence classification tasks.
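
The hashing step described in record 5 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: the function names, the choice of MD5 as the hash function, and the default bucket count are all hypothetical. Each k-gram is mapped to one of `n_buckets` indices, and the counts of k-grams that collide on the same index are summed.

```python
import hashlib


def kgrams(seq, k):
    """Return all overlapping k-grams of seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def hash_features(seq, k=3, n_buckets=16):
    """Map the k-gram counts of seq into a fixed-length vector by hashing.

    MD5 is used only because it is deterministic across runs (Python's
    built-in hash() is salted per process); it is an illustrative choice.
    """
    vec = [0] * n_buckets
    for gram in kgrams(seq, k):
        digest = hashlib.md5(gram.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % n_buckets
        vec[index] += 1  # colliding k-grams have their counts aggregated
    return vec


if __name__ == "__main__":
    print(hash_features("MKVLAAGIVGLLLAQPSFA", k=3, n_buckets=8))  # toy input
```

With 20 amino acids there are 20^k possible k-grams, so the "bag of k-grams" vector grows exponentially in k, whereas the hashed vector stays at `n_buckets` entries whatever the value of k.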

  6. The peculiar structural features of kiwi fruit pectin methylesterase: amino acid sequence, oligosaccharides structure, and modeling of the interaction with its natural proteinaceous inhibitor.

    Science.gov (United States)

    Ciardiello, M Antonietta; D'Avino, Rossana; Amoresano, Angela; Tuppo, Lisa; Carpentieri, Andrea; Carratore, Vito; Tamburrini, Maurizio; Giovane, Alfonso; Pucci, Piero; Camardella, Laura

    2008-04-01

    Pectin methylesterase (PME) from kiwi fruit (Actinidia deliciosa) is a glycoprotein, showing an apparent molecular mass of 50 kDa upon size exclusion chromatography and SDS-PAGE. The primary structure, elucidated by direct sequencing of the protein, comprises 321 amino acid residues providing a molecular mass of 35 kDa. The protein has an acetylated Thr residue at the amino terminus and five N-glycosylation consensus sequences, four of which are actually glycosylated. A careful investigation of the oligosaccharide structures demonstrated that PME glycans belong to complex type oligosaccharides essentially consisting of xylosylated polyfucosylated biantennary structures. Alignment with known mature plant PME sequences indicates that the postulated active site residues are conserved. Kiwi PME activity is inhibited following the interaction with the proteinaceous inhibitor PMEI, isolated from the same source. Gel-filtration experiments show that the kiwi PME/PMEI complex is stable in a large pH range and dissociates only at pH 10.0. Modeling of the interaction with the inhibitor was performed by using the crystal structure of the complex between kiwi PMEI and tomato PME as a template. The model shows that the binding site is the same as that reported for tomato PME. However, additional salt link interactions are found to connect the external loops of kiwi PME to PMEI. This finding may explain the higher pH stability of the complex formed by the two kiwi proteins with respect to that formed by PMEI and tomato PME.

  7. Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition.

    Science.gov (United States)

    Zhao, Xiaowei; Ning, Qiao; Ai, Meiyue; Chai, Haiting; Yang, Guifu

    2016-06-07

    As a selective and reversible protein post-translational modification, S-glutathionylation generates mixed disulfides between glutathione (GSH) and cysteine residues, and plays an important role in regulating protein activity, stability, and redox regulation. To fully understand S-glutathionylation mechanisms, identification of substrates and specific S-glutathionylated sites is crucial. Experimental identification of S-glutathionylated sites is labor-intensive and time-consuming, so an effective computational method is highly desirable for its convenience and speed. Therefore, in this study, a new bioinformatics tool named SSGlu (Species-Specific identification of Protein S-glutathionylation Sites) was developed to identify species-specific protein S-glutathionylated sites, utilizing support vector machines that combine multiple sequence-derived features with a two-step feature selection. By 5-fold cross validation, the performance of SSGlu was measured with an AUC of 0.8105 and 0.8041 for Homo sapiens and Mus musculus, respectively. Additionally, SSGlu was compared with the existing methods, and the higher MCC and AUC of SSGlu demonstrated that SSGlu is very promising for predicting S-glutathionylated sites. Furthermore, a site-specific analysis showed that S-glutathionylation is closely correlated with the features derived from its surrounding sites. The conclusions derived from this study might help to understand more of the S-glutathionylation mechanism and guide the related experimental validation. For public access, SSGlu is freely accessible at http://59.73.198.144:8080/SSGlu/.

  8. Replacement of amino acid sequence features of a- and c-subunits of ATP synthases of Alkaliphilic Bacillus with the Bacillus consensus sequence results in defective oxidative phosphorylation and non-fermentative growth at pH 10.5.

    Science.gov (United States)

    Wang, ZhenXiong; Hicks, David B; Guffanti, Arthur A; Baldwin, Katisha; Krulwich, Terry Ann

    2004-06-18

    Mitchell's (Mitchell, P. (1961) Nature 191, 144-148) chemiosmotic model of energy coupling posits a bulk electrochemical proton gradient (Δp) as the sole driving force for proton-coupled ATP synthesis via oxidative phosphorylation (OXPHOS) and for other bioenergetic work. Two properties of proton-coupled OXPHOS by alkaliphilic Bacillus species pose a challenge to this tenet: robust ATP synthesis at pH 10.5 that does not correlate with the magnitude of the Δp and the failure of artificially imposed potentials to substitute for respiration-generated potentials in energizing ATP synthesis at high pH (Krulwich, T. (1995) Mol. Microbiol. 15, 403-410). Here we show that these properties, in alkaliphilic Bacillus pseudofirmus OF4, depend upon alkaliphile-specific features in the proton pathway through the a- and c-subunits of ATP synthase. Site-directed changes were made in six such features to the corresponding sequence in Bacillus megaterium, which reflects the consensus sequence for non-alkaliphilic Bacillus. Five of the six single mutants assembled an active ATPase/ATP synthase, and four of these mutants exhibited a specific defect in non-fermentative growth at high pH. Most of these mutants lost the ability to generate the high phosphorylation potentials at low bulk Δp that are characteristic of alkaliphiles. The aLys(180) and aGly(212) residues that are predicted to be in the proton uptake pathway of the a-subunit were specifically implicated in pH-dependent restriction of proton flux through the ATP synthase to and from the bulk phase. The evidence included greatly enhanced ATP synthesis in response to an artificially imposed potential at high pH. The findings demonstrate that the ATP synthase of extreme alkaliphiles has special features that are required for non-fermentative growth and OXPHOS at high pH.

  9. Feature-based Image Sequence Compression Coding

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    A novel compressing method for video teleconference applications is presented. Semantic-based coding based on human image feature is realized, where human features are adopted as parameters. Model-based coding and the concept of vector coding are combined with the work on image feature extraction to obtain the result.

  10. Chip-based sequencing nucleic acids

    Science.gov (United States)

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  11. Stable 2D Feature Tracking for Long Video Sequences

    Directory of Open Access Journals (Sweden)

    Jong-Seung Park

    2008-12-01

    In this paper, we propose a 2D feature tracking method that is stable for long video sequences. To improve the stability of long tracking, we use trajectory information about 2D features. We predict the expected feature states and compute a rough estimate of the feature location on the current image frame using the history of previous feature states up to the current frame. A search window is positioned at the estimated location and similarity measures are computed within the search window. Once the feature position is determined from the similarity measures, the current feature states are appended to the history buffer. An outlier rejection stage is also introduced to reduce false matches. Experimental results from real video sequences showed that the proposed method stably tracks point features for long frame sequences.

  12. Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix

    Science.gov (United States)

    Yu, Lulu; Zhang, Yusen; Gutman, Ivan; Shi, Yongtang; Dehmer, Matthias

    2017-01-01

    We develop a novel position-feature-based model for protein sequences by employing physicochemical properties of the 20 amino acids and the measure of graph energy. The method puts the emphasis on sequence order information and describes local dynamic distributions of sequences, from which one can get a characteristic B-vector. Afterwards, we apply the relative entropy to the B-vectors representing the sequences to measure their similarity/dissimilarity. The numerical results obtained in this study show that the proposed method leads to meaningful results compared with competitors such as Clustal W. PMID:28393857
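
The comparison step in record 12 relies on relative entropy (Kullback-Leibler divergence) between feature vectors. The Python sketch below shows the standard definition applied to two non-negative vectors after normalization. How the B-vectors are built is not described in the abstract, so the inputs here are placeholders; the symmetrized variant and the `eps` guard are assumptions added for illustration.

```python
import math


def relative_entropy(p, q, eps=1e-12):
    """Kullback-Leibler divergence D(p || q) of two non-negative vectors.

    Both vectors are first normalized to sum to 1. The eps floor on q avoids
    division by zero and is an illustrative choice, not part of the definition.
    """
    if len(p) != len(q):
        raise ValueError("vectors must have the same length")
    sum_p, sum_q = sum(p), sum(q)
    if sum_p <= 0 or sum_q <= 0:
        raise ValueError("vectors must have a positive sum")
    total = 0.0
    for a, b in zip(p, q):
        pa, qb = a / sum_p, b / sum_q
        if pa > 0:
            total += pa * math.log(pa / max(qb, eps))
    return total


def symmetric_relative_entropy(p, q):
    """Symmetrized form D(p || q) + D(q || p), usable as a dissimilarity score."""
    return relative_entropy(p, q) + relative_entropy(q, p)


if __name__ == "__main__":
    u = [0.2, 0.5, 0.3]  # placeholder vectors standing in for B-vectors
    v = [0.3, 0.4, 0.3]
    print(symmetric_relative_entropy(u, v))
```

Relative entropy is not symmetric, so a pairwise distance matrix built from it usually uses a symmetrized form like the one above.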

  13. Improving protein structural class prediction using novel combined sequence information and predicted secondary structural features.

    Science.gov (United States)

    Dai, Qi; Wu, Li; Li, Lihua

    2011-12-01

    Protein structural class prediction solely from protein sequences is a challenging problem in bioinformatics. Numerous efficient methods have been proposed for protein structural class prediction, but challenges remain. Using novel combined sequence information coupled with predicted secondary structural features (PSSF), we proposed a novel scheme to improve prediction of protein structural classes. Given an amino acid sequence, we first transformed it into a reduced amino acid sequence and calculated its word frequencies and word position features to build the combined sequence information. Then we added the PSSF to the combined sequence information to predict protein structural classes. The proposed method was tested on four low-homology benchmark datasets and achieved overall prediction accuracies of 83.1%, 87.0%, 94.5%, and 85.2%, respectively. The comparison with existing methods demonstrates that the overall improvements range from 2.3% to 27.5%, which indicates that the proposed method is more efficient, especially for low-homology amino acid sequences.

  14. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative) approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k) predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small) set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed) and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated). We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  15. Identifying features in biological sequences: Sixth workshop report

    Energy Technology Data Exchange (ETDEWEB)

    Burks, C. [Los Alamos National Lab., NM (United States)]; Myers, E. [Univ. of Arizona (United States)]; Pearson, W.R. [Univ. of Virginia (United States)]

    1995-12-31

    This report covers the sixth of an annual series of workshops held at the Aspen Center for Physics concentrating particularly on the identification of features in DNA sequence, and more broadly on related topics in computational molecular biology. The workshop series originally focused primarily on discussion of current needs and future strategies for identifying and predicting the presence of complex functional units on sequenced, but otherwise uncharacterized, genomic DNA. We addressed the need for computationally-based, automatic tools for synthesizing available data about individual consensus sequences and local compositional patterns into the composite objects (e.g., genes) that are -- as composite entities -- the true object of interest when scanning DNA sequences. The workshop was structured to promote sustained informal contact and exchange of expertise between molecular biologists, computer scientists, and mathematicians.

  16. Incorporating secondary structural features into sequence information for predicting protein structural class.

    Science.gov (United States)

    Liao, Bo; Peng, Ting; Chen, Haowen; Lin, Yaping

    2013-10-01

    Knowledge of structural classes is applied in numerous important predictive tasks that address structural and functional features of proteins, although the prediction accuracy of the protein structural classes is not high. In this study, 45 different features were rationally designed to model the differences between protein structural classes, 30 of which reflect the combined protein sequence information. In terms of correlation function, the protein sequence can be converted to a digital signal sequence, from which we can generate 20 discrete Fourier spectrum numbers. According to the segments of amino acids with different characteristics occurring in protein sequences, the frequencies of the 10 kinds of amino acid segments (motifs) in the protein are calculated. Other features include the secondary structural information: 10 features were proposed to model the strong adjacent correlations in the secondary structural elements and capture the long-range spatial interactions between secondary structures, and another 5 features were designed to differentiate α/β from α+β classes, which is a major problem of the existing algorithm. The methods were proposed based on a large set of low-identity sequences for which secondary structure is predicted from their sequence (based on PSI-PRED). By means of this method, the overall prediction accuracies on four benchmark datasets were all improved, especially for the datasets FC699, 25PDB and D1189, where they are 1.26%, 1% and 0.85% higher than the best previous method, respectively.

  17. Methods for analyzing nucleic acid sequences

    Science.gov (United States)

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support. A fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  18. Sequence features contributing to chromosomal rearrangements in Neisseria gonorrhoeae.

    Directory of Open Access Journals (Sweden)

    Russell Spencer-Smith

    Through whole genome sequence alignments, breakpoints in chromosomal synteny can be identified and the sequence features associated with these determined. Alignments of the genome sequences of Neisseria gonorrhoeae strain FA1090, N. gonorrhoeae strain NCCP11945, and N. gonorrhoeae strain TCDC-NG08107 reveal chromosomal rearrangements that have occurred. Based on these alignments and dot plot pair-wise comparisons, the overall chromosomal arrangements of strains NCCP11945 and TCDC-NG08107 are very similar, with no large inversions or translocations. The insertion of the Gonococcal Genetic Island in strain NCCP11945 is the most prominent feature differentiating these strains. When strain NCCP11945 is compared to strain FA1090, however, 14 breakpoints in chromosomal synteny are identified between these gonococcal strains. The majority of these, 11 of 14, are associated with a prophage, IS elements, or IS-like repeat enclosed elements which appear to have played a role in the rearrangements observed. Additional rearrangements of small regions of the genome are associated with pilin genes. Evidence presented here suggests that the rearrangements of blocks of sequence are mediated by activation of prophage and associated IS elements and reintegration elsewhere in the genome or by homologous recombination between IS-like elements that have generated inversions.

  19. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Science.gov (United States)

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  20. Sequence features responsible for intron retention in human

    Directory of Open Access Journals (Sweden)

    Sakabe Noboru

    2007-02-01

    Background: One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results: To understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion: Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.

  1. Analysis of the Repertoire Features of TCR Beta Chain CDR3 in Human by High-Throughput Sequencing

    Directory of Open Access Journals (Sweden)

    Xianliang Hou

    2016-07-01

    Background/Aims: To ward off a wide variety of pathogens, the human adaptive immune system harbors a vast array of T-cell receptors, collectively referred to as the TCR repertoire. Assessment of the repertoire features of TCR is vital for a deeper understanding of immune behaviour and immune response. Methods: In this study, we used a combination of multiplex-PCR, Illumina sequencing and IMGT (ImMunoGeneTics)/HighV-QUEST for a standardized analysis of the repertoire features of the TCR beta chain in the blood of healthy individuals, including the repertoire features of public TCR complementarity-determining region 3 (CDR3) sequences, highly expanded clones, and long TCR CDR3 sequences. Results: We found that public CDR3 sequences and high-frequency sequences had the same characteristics: both had fewer nucleotide additions and shorter CDR3 length, and were closer to the germline sequence. Moreover, our studies provided evidence that public amino acid sequences are produced by multiple nucleotide sequences. Notably, there was skewed VDJ segment usage in long CDR3 sequences: the expression levels of 10 TRβV segments, 7 TRβJ segments and 2 TRβD segments were significantly different in the long CDR3 sequences compared to the short CDR3 sequences. Moreover, we found that extensive N additions and increased D gene usage contribute to TCR CDR3 length, and observed a distinct usage frequency of amino acids in long CDR3 sequences compared to the short CDR3 sequences. Conclusions: Some repertoire features could be observed in the public sequences, highly abundant clones, and long TCR CDR3 sequences, which might be helpful for further study of immune behavior and immune response.

  2. FeatureMap3D - a tool to map protein features and sequence conservation onto homologous structures in the PDB

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Rapacki, Krzysztof; Stærfeldt, Hans Henrik;

    2006-01-01

    FeatureMap3D is a web-based tool that maps protein features onto 3D structures. The user provides sequences annotated with any feature of interest, such as post-translational modifications, protease cleavage sites or exonic structure and FeatureMap3D will then search the Protein Data Bank (PDB) f...

  3. Automatic discovery of cross-family sequence features associated with protein function

    Directory of Open Access Journals (Sweden)

    Krings Andrea

    2006-01-01

    Background: Methods for predicting protein function directly from amino acid sequences are useful tools in the study of uncharacterised protein families and in comparative genomics. Until now, this problem has been approached using machine learning techniques that attempt to predict membership, or otherwise, to predefined functional categories or subcellular locations. A potential drawback of this approach is that the human-designated functional classes may not accurately reflect the underlying biology, and consequently important sequence-to-function relationships may be missed. Results: We show that a self-supervised data mining approach is able to find relationships between sequence features and functional annotations. No preconceived ideas about functional categories are required, and the training data is simply a set of protein sequences and their UniProt/Swiss-Prot annotations. The main technical aspect of the approach is the co-evolution of amino acid-based regular expressions and keyword-based logical expressions with genetic programming. Our experiments on a strictly non-redundant set of eukaryotic proteins reveal that the strongest and most easily detected sequence-to-function relationships are concerned with targeting to various cellular compartments, which is an area already well studied both experimentally and computationally. Of more interest are a number of broad functional roles which can also be correlated with sequence features. These include inhibition, biosynthesis, transcription and defence against bacteria. Despite substantial overlaps between these functions and their corresponding cellular compartments, we find clear differences in the sequence motifs used to predict some of these functions. For example, the presence of polyglutamine repeats appears to be linked more strongly to the "transcription" function than to the general "nuclear" function/location. Conclusion: We have developed a novel and useful approach for
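
    To make the idea of an amino acid-based regular expression concrete, the following minimal sketch flags polyglutamine runs in a one-letter protein sequence. The pattern, the minimum run length of 5 and the function name find_polyq are assumptions chosen for illustration; they are not the expressions evolved in the study.

```python
import re


def find_polyq(sequence, min_len=5):
    """Return (start, end, run) for each glutamine run of at least min_len residues.

    Hypothetical helper: the threshold min_len is an illustrative assumption.
    """
    # Q is the one-letter code for glutamine; {n,} means "n or more repeats".
    pattern = re.compile(r"Q{%d,}" % min_len)
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(sequence.upper())]


# Toy sequence with one run of seven glutamines and one run of two.
toy = "MST" + "Q" * 7 + "APLKQQRT"
hits = find_polyq(toy)
```

    With the default threshold only the longer run is reported; lowering min_len to 2 would also report the short one.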

  4. A machine-learning approach for predicting palmitoylation sites from integrated sequence-based features.

    Science.gov (United States)

    Li, Liqi; Luo, Qifa; Xiao, Weidong; Li, Jinhui; Zhou, Shiwen; Li, Yongsheng; Zheng, Xiaoqi; Yang, Hua

    2017-02-01

    Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to protein transportation, organelle localization, and function, and therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein-protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is urgently needed. In this study, a support vector machine (SVM)-based method was proposed through integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and a Matthews Correlation Coefficient of 0.9773 for a benchmark dataset. The result indicates the efficiency and accuracy of our method in prediction of palmitoylation sites based on protein sequences.
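
    One of the feature types named above, the k-mer amino acid composition, can be sketched as follows. The value of k is not recoverable from this record, so k=2 is an assumption, as are the function name kmer_composition and the choice to ignore k-mers containing non-standard letters.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def kmer_composition(sequence, k=2):
    """Return the relative frequency of every possible k-mer (20**k entries)."""
    seq = sequence.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    # Only k-mers made of standard residues contribute to the total.
    total = sum(counts[km] for km in kmers)
    return {km: (counts[km] / total if total else 0.0) for km in kmers}


# Toy peptide: the overlapping 2-mers are MK, KK, KL, LA, AC.
features = kmer_composition("MKKLAC", k=2)
```

    The resulting dictionary has a fixed length for any input sequence, which is what allows it to be concatenated with other descriptors into a single feature vector.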

  5. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    Science.gov (United States)

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  6. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    Science.gov (United States)

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  7. Intumescent features of nucleic acids and proteins

    Energy Technology Data Exchange (ETDEWEB)

    Alongi, Jenny, E-mail: jenny.alongi@polito.it; Cuttica, Fabio; Blasio, Alessandro Di; Carosio, Federico; Malucelli, Giulio

    2014-09-10

    Highlights: • The combustion resistance of DNA and caseins to different heat fluxes was studied. • Upon heating, DNA and caseins exhibited an intumescent behaviour. • The char derived from DNA was more stable and coherent than that from caseins. - Abstract: Are nucleic acids and proteins intumescent molecules? In order to get an answer, in the present manuscript, powders of deoxyribose nucleic acids (DNA) and caseins have been exposed to different heat fluxes under a cone calorimeter source and to the direct application of a propane flame. Under these conditions, DNA and caseins exhibited a typical intumescent behaviour, generating a coherent expanded cellular carbonaceous residue (char), extremely resistant to heat exposure. The resulting volumetric expansion as well as the resistance of the formed char turned out to be dependent on (i) the chemical structure of the chosen biomacromolecule, (ii) the evolution of ammonia and (iii) the adopted heat flux in cone calorimetry tests (namely, 25, 35, 50 and 75 kW/m²). The presence of ribose units within the DNA backbone determined the formation of highly expanded and coherent residues as compared to those obtained from caseins. Indeed, under a heat flux of 35 kW/m², when a carbon source (i.e. common cane sugar) was added to caseins, the resulting char was similar to that formed by DNA. Furthermore, the char expansion was ascribed to the evolution of ammonia released by these biomacromolecules upon heating, as detected by thermogravimetry coupled to infrared spectroscopy, and confirmed by scanning electron microscopy experiments performed on the bubbles present in the residues of flammability tests.

  8. Inference of Global HIV-1 Sequence Patterns and Preliminary Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Yan Wang; Reda Rawi; Daniel Hoffmann; Binlian Sun; Rongge Yang

    2013-01-01

    The epidemiology of HIV-1 varies in different areas of the world, and it is possible that this complexity may leave unique footprints in the viral genome. Thus, we attempted to find significant patterns in global HIV-1 genome sequences. By applying the rule inference algorithm RIPPER (Repeated Incremental Pruning to Produce Error Reduction) to multiple sequence alignments of Env sequences from four classes of compiled datasets, we generated four sets of signature patterns. We found that these patterns were able to distinguish southeastern Asian from non-southeastern Asian sequences with 97.5% accuracy, Chinese from non-Chinese sequences with 98.3% accuracy, African from non-African sequences with 88.4% accuracy, and southern African from non-southern African sequences with 91.2% accuracy. These patterns showed different associations with subtypes and with amino acid positions. In addition, some signature patterns were characteristic of the geographic area from which the sample was taken. Amino acid features corresponding to the phylogenetic clustering of HIV-1 sequences were consistent with some of the deduced patterns. Using a combination of patterns inferred from subtypes B, C, and all subtypes chimeric with CRF01_AE worldwide, we found that signature patterns of subtype C were extremely common in some sampled countries (for example, Zambia in southern Africa), which may hint at the origin of this HIV-1 subtype and the need to pay special attention to this area of Africa. Signature patterns of subtype B sequences were associated with different countries. Even more, there are distinct patterns at single position 21 with glycine, leucine and isoleucine corresponding to subtype C, B and all possible recombination forms chimeric with CRF01_AE, which also indicate distinct geographic features. Our method widens the scope of inference of signature from geographic, genetic, and genomic viewpoints. These findings may provide a valuable reference for epidemiological research or vaccine design.

  9. Proteome sequence features carry signatures of the environmental niche of prokaryotes

    Directory of Open Access Journals (Sweden)

    Supek Fran

    2011-01-01

    Background: Prokaryotic environmental adaptations occur at different levels within cells to ensure the preservation of genome integrity, proper protein folding and function as well as membrane fluidity. Although specific composition and structure of cellular components suitable for the variety of extreme conditions has already been postulated, a systematic study describing such adaptations has not yet been performed. We therefore explored whether the environmental niche of a prokaryote could be deduced from the sequence of its proteome. Finally, we aimed at finding the precise differences between proteome sequences of prokaryotes from different environments. Results: We analyzed the proteomes of 192 prokaryotes from different habitats. We collected detailed information about the optimal growth conditions of each microorganism. Furthermore, we selected 42 physico-chemical properties of amino acids and computed their values for each proteome. Further, on the same set of features we applied two fundamentally different machine learning methods, Support Vector Machines and Random Forests, to successfully classify between bacteria and archaea, halophiles and non-halophiles, as well as mesophiles, thermophiles and mesothermophiles. Finally, we performed feature selection by using Random Forests. Conclusions: To our knowledge, this is the first time that three different classification cases (domain of life, halophilicity and thermophilicity) of proteome adaptation are successfully performed with the same set of 42 features. The characteristic features of a specific adaptation constitute a signature that may help understanding the mechanisms of adaptation to extreme environments.
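
    Computing a physico-chemical property "for each proteome" can be read as averaging a per-residue scale over all residues of all proteins. The sketch below does this for a single property. The abstract does not list the 42 properties, so the Kyte-Doolittle hydropathy scale is used here purely as a stand-in, and the function name proteome_mean_property is hypothetical.

```python
# Kyte-Doolittle hydropathy index per residue (stand-in for one of the 42 properties).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def proteome_mean_property(proteins, scale=KYTE_DOOLITTLE):
    """Average a per-residue property over every residue of every protein."""
    total, n = 0.0, 0
    for seq in proteins:
        for aa in seq.upper():
            if aa in scale:  # skip ambiguous or non-standard letters
                total += scale[aa]
                n += 1
    return total / n if n else float("nan")


# Toy "proteome" of two tripeptides.
value = proteome_mean_property(["MKV", "AIL"])
```

    Repeating this with one scale per property would give one fixed-length feature vector per organism, which is the kind of input a Support Vector Machine or Random Forest classifier expects.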

  10. Application of intelligent techniques for classification of bacteria using protein sequence-derived features.

    Science.gov (United States)

    Banerjee, Amit Kumar; Ravi, Vadlamani; Murty, U S N; Sengupta, Neelava; Karuna, Batepatti

    2013-07-01

    Standard molecular experimental methodologies and mathematical procedures often fail to answer many phylogeny and classification related issues. Modern artificial intelligence-based techniques, such as radial basis function, genetic algorithm, artificial neural network, and support vector machines, have ample potential in this regard. Reliance on a large number of essential parameters will aid in enhanced robustness, reliability, and better accuracy as opposed to a single molecular parameter. This study was conducted with a dataset of computed protein physicochemical properties belonging to 20 different bacterial genera. A total of 57 sequential and structural parameters derived from protein sequences were considered for the initial classification. Feature selection based techniques were employed to find out the most important features influencing the dataset. Various amino acids, hydrophobicity, relative sulfur percentage, and codon number were selected as important parameters during the study. Comparative analyses were performed applying the RapidMiner data mining platform. Support vector machine proved to be the best method with maximum accuracy of more than 91%.

  11. Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites.

    Science.gov (United States)

    Lee, Tzong-Yi; Chen, Shu-An; Hung, Hsin-Yi; Ou, Yu-Yen

    2011-03-09

    Ubiquitin (Ub) is a small protein that consists of 76 amino acids (about 8.5 kDa). In ubiquitin conjugation, the ubiquitin is mainly conjugated on the lysine residue of protein by Ub-ligating (E3) enzymes. Three major enzymes participate in ubiquitin conjugation. They are E1, E2 and E3 which are responsible for activating, conjugating and ligating ubiquitin, respectively. Ubiquitin conjugation in eukaryotes is an important mechanism of the proteasome-mediated degradation of a protein and regulating the activity of transcription factors. Motivated by the importance of ubiquitin conjugation in biological processes, this investigation develops a method, UbSite, which utilizes an efficient radial basis function (RBF) network to identify protein ubiquitin conjugation (ubiquitylation) sites. This work not only investigates the amino acid composition but also the structural characteristics, physicochemical properties, and evolutionary information of amino acids around ubiquitylation (Ub) sites. With reference to the pathway of ubiquitin conjugation, the substrate sites for E3 recognition, which are distant from ubiquitylation sites, are investigated. The measurement of F-score in a large window size (-20∼+20) revealed a statistically significant amino acid composition and position-specific scoring matrix (evolutionary information), which are mainly located distant from Ub sites. The distant information can be used effectively to differentiate Ub sites from non-Ub sites. As determined by five-fold cross-validation, the model that was trained using the combination of amino acid composition and evolutionary information performs best in identifying ubiquitin conjugation sites. The prediction sensitivity, specificity, and accuracy are 65.5%, 74.8%, and 74.5%, respectively. Although the amino acid sequences around the ubiquitin conjugation sites do not contain conserved motifs, the cross-validation result indicates that the integration of distant sequence features of Ub
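
    The window around each candidate lysine (-20 to +20 in the abstract) can be extracted as sketched below. The padding character "X" for positions beyond the sequence ends and the function name lysine_windows are assumptions; the abstract does not say how UbSite handles sequence termini.

```python
def lysine_windows(sequence, flank=20, pad="X"):
    """Map each lysine position (0-based) to its window of 2*flank+1 residues.

    Positions that fall outside the sequence are filled with the pad
    character so that every window has the same length.
    """
    seq = sequence.upper()
    padded = pad * flank + seq + pad * flank
    # Position i in seq sits at i + flank in padded, so the window starts at i.
    return {i: padded[i:i + 2 * flank + 1] for i, aa in enumerate(seq) if aa == "K"}


# Toy peptide with a small flank so the windows are easy to read.
windows = lysine_windows("MAKTK", flank=2)
```

    Each window is centred on its lysine, so composition or profile features computed per window position remain comparable across sites.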

  12. Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites.

    Directory of Open Access Journals (Sweden)

    Tzong-Yi Lee

    Ubiquitin (Ub) is a small protein that consists of 76 amino acids (about 8.5 kDa). In ubiquitin conjugation, the ubiquitin is mainly conjugated on the lysine residue of protein by Ub-ligating (E3) enzymes. Three major enzymes participate in ubiquitin conjugation. They are E1, E2 and E3 which are responsible for activating, conjugating and ligating ubiquitin, respectively. Ubiquitin conjugation in eukaryotes is an important mechanism of the proteasome-mediated degradation of a protein and regulating the activity of transcription factors. Motivated by the importance of ubiquitin conjugation in biological processes, this investigation develops a method, UbSite, which utilizes an efficient radial basis function (RBF) network to identify protein ubiquitin conjugation (ubiquitylation) sites. This work not only investigates the amino acid composition but also the structural characteristics, physicochemical properties, and evolutionary information of amino acids around ubiquitylation (Ub) sites. With reference to the pathway of ubiquitin conjugation, the substrate sites for E3 recognition, which are distant from ubiquitylation sites, are investigated. The measurement of F-score in a large window size (-20∼+20) revealed a statistically significant amino acid composition and position-specific scoring matrix (evolutionary information), which are mainly located distant from Ub sites. The distant information can be used effectively to differentiate Ub sites from non-Ub sites. As determined by five-fold cross-validation, the model that was trained using the combination of amino acid composition and evolutionary information performs best in identifying ubiquitin conjugation sites. The prediction sensitivity, specificity, and accuracy are 65.5%, 74.8%, and 74.5%, respectively. Although the amino acid sequences around the ubiquitin conjugation sites do not contain conserved motifs, the cross-validation result indicates that the integration of distant sequence

  13. New complete genome sequences of human rhinoviruses shed light on their phylogeny and genomic features

    Directory of Open Access Journals (Sweden)

    Zdobnov Evgeny M

    2007-07-01

    Background: Human rhinoviruses (HRV), the most frequent cause of respiratory infections, include 99 different serotypes segregating into two species, A and B. Rhinoviruses share extensive genomic sequence similarity with enteroviruses and both are part of the picornavirus family. Nevertheless they differ significantly at the phenotypic level. The lack of HRV full-length genome sequences and the absence of analysis comparing picornaviruses at the whole genome level limit our knowledge of the genomic features supporting these differences. Results: Here we report complete genome sequences of 12 HRV-A and HRV-B serotypes, more than doubling the current number of available HRV sequences. The whole-genome maximum-likelihood phylogenetic analysis suggests that HRV-B and human enteroviruses (HEV) diverged from the last common ancestor after their separation from HRV-A. On the other hand, compared to HEV, HRV-B are more related to HRV-A in the capsid and 3B-C regions. We also identified the presence of a 2C cis-acting replication element (cre) in HRV-B that is not present in HRV-A, and that had been previously characterized only in HEV. In contrast to HEV viruses, HRV-A and HRV-B also share markedly lower GC content along the whole genome length. Conclusion: Our findings provide a basis to speculate about both the biological similarities and the differences (e.g. tissue tropism, temperature adaptation or acid lability) of these three groups of viruses.

  14. An improved classification of G-protein-coupled receptors using sequence-derived features

    Directory of Open Access Journals (Sweden)

    Peng Zhen-Ling

    2010-08-01

    Background: G-protein-coupled receptors (GPCRs) play a key role in diverse physiological processes and are the targets of almost two-thirds of the marketed drugs. The 3D structures of GPCRs are largely unavailable; however, a large number of GPCR primary sequences are known. To facilitate the identification and characterization of novel receptors, it is therefore very valuable to develop a computational method to accurately predict GPCRs from the protein primary sequences. Results: We propose a new method called PCA-GPCR, to predict GPCRs using a comprehensive set of 1497 sequence-derived features. The principal component analysis is first employed to reduce the dimension of the feature space to 32. Then, the resulting 32-dimensional feature vectors are fed into a simple yet powerful classification algorithm, called intimate sorting, to predict GPCRs at five levels. The prediction at the first level determines whether a protein is a GPCR or a non-GPCR. If it is predicted to be a GPCR, then it will be further predicted into certain family, subfamily, sub-subfamily and subtype by the classifiers at the second, third, fourth, and fifth levels, respectively. To train the classifiers applied at five levels, a non-redundant dataset is carefully constructed, which contains 3178, 1589, 4772, 4924, and 2741 protein sequences at the respective levels. Jackknife tests on this training dataset show that the overall accuracies of PCA-GPCR at five levels (from the first to the fifth) can achieve up to 99.5%, 88.8%, 80.47%, 80.3%, and 92.34%, respectively. We further perform predictions on a dataset of 1238 GPCRs at the second level, and on another two datasets of 167 and 566 GPCRs respectively at the fourth level. The overall prediction accuracies of our method are consistently higher than those of the existing methods to be compared. Conclusions: The comprehensive set of 1497 features is believed to be capable of capturing information about amino acid

  15. The origin of biased sequence depth in sequence-independent nucleic acid amplification and optimization for efficient massive parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Toon Rosseel

    Sequence Independent Single Primer Amplification is one of the most widely used random amplification approaches in virology for sequencing template preparation. This technique relies on oligonucleotides consisting of a 3' random part used to prime complementary DNA synthesis and a 5' defined tag sequence for subsequent amplification. Recently, this amplification method was combined with next generation sequencing to obtain viral sequences. However, these studies showed a biased distribution of the resulting sequence reads over the analyzed genomes. The aim of this study was to elucidate the mechanisms that lead to biased sequence depth when using random amplification. Avian paramyxovirus type 8 was used as a model RNA virus to investigate these mechanisms. We showed, based on in silico analysis of the sequence depth in relation to GC-content, predicted RNA secondary structure and sequence complementarity to the 3' part of the tag sequence, that the tag sequence has the main contribution to the observed bias in sequence depth. We confirmed this finding experimentally using both fragmented and non-fragmented viral RNAs as well as primers differing in random oligomer length (6 or 12 nucleotides) and in the sequence of the amplification tag. The observed oligonucleotide annealing bias can be reduced by extending the random oligomer sequence and by in silico combining sequence data from SISPA experiments using different 5' defined tag sequences. These findings contribute to the optimization of random nucleic acid amplification protocols that are currently required for downstream applications such as viral metagenomics and microarray analysis.
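
    Relating sequence depth to GC-content requires a per-region GC value along the genome. A minimal sketch of a sliding-window GC calculation follows. The window and step sizes and the function names are assumptions, since the abstract does not state the parameters used in the in silico analysis.

```python
def gc_content(seq):
    """Fraction of G and C among the unambiguous bases (A, C, G, T, U) in seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    known = gc + seq.count("A") + seq.count("T") + seq.count("U")
    return gc / known if known else 0.0


def windowed_gc(seq, window=100, step=50):
    """Return (start, gc_fraction) pairs for windows sliding along seq."""
    last_start = max(len(seq) - window, 0)
    return [(start, gc_content(seq[start:start + window]))
            for start in range(0, last_start + 1, step)]


# Toy genome fragment: a GC-rich half followed by an AT-rich half.
profile = windowed_gc("GGGGAAAA", window=4, step=4)
```

    The per-window values can then be paired with the mean read depth over the same coordinates to look for a correlation.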

  16. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    Science.gov (United States)

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  17. Prediction of antimicrobial peptides based on sequence alignment and feature selection methods.

    Directory of Open Access Journals (Sweden)

    Ping Wang

    Antimicrobial peptides (AMPs) represent a class of natural peptides that form a part of the innate immune system, and this kind of 'nature's antibiotics' is quite promising for solving the problem of increasing antibiotic resistance. In view of this, it is highly desired to develop an effective computational method for accurately predicting novel AMPs because it can provide us with more candidates and useful insights for drug design. In this study, a new method for predicting AMPs was implemented by integrating the sequence alignment method and the feature selection method. It was observed that the overall jackknife success rate by the new predictor on a newly constructed benchmark dataset was over 80.23%, and the Matthews correlation coefficient is 0.73, indicating a good prediction. Moreover, it is indicated by an in-depth feature analysis that the results are quite consistent with the previously known knowledge that some amino acids are preferential in AMPs and that these amino acids do play an important role for the antimicrobial activity. For the convenience of most experimental scientists who want to use the prediction method without the interest to follow the mathematical details, a user-friendly web-server is provided at http://amp.biosino.org/.
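
    The Matthews correlation coefficient reported above is computed from the four cells of a binary confusion matrix. The sketch below shows the standard formula. The counts in the usage line are invented for illustration and are not taken from the study.

```python
import math


def matthews_cc(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary confusion matrix.

    Returns 0.0 when any marginal total is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical counts: a predictor that gets every example right.
perfect = matthews_cc(tp=50, tn=50, fp=0, fn=0)
```

    Unlike plain accuracy, the coefficient ranges from -1 to +1 and stays near 0 for a predictor that ignores its input, even on imbalanced datasets.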

  18. Peptide Nucleic Acids Having Enhanced Binding Affinity and Sequence Specificity

    DEFF Research Database (Denmark)

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and binding affinity. Methods of increasing binding affinity and sequence specificity of peptide nucleic aci...

  19. CANADA: designing nucleic acid sequences for nanobiotechnology applications.

    Science.gov (United States)

    Feldkamp, Udo

    2010-02-01

    The design of nucleic acid sequences for a highly specific and efficient hybridization is a crucial step in DNA computing and DNA-based nanotechnology applications. The CANADA package contains software tools for designing DNA sequences that meet these and other requirements, as well as for analyzing and handling sequences. CANADA is freely available, including a detailed manual and example input files, at http://ls11-www.cs.uni-dortmund.de/molcomp/downloads.

  20. Representation of protein-sequence information by amino acid subalphabets

    DEFF Research Database (Denmark)

    Andersen, C.A.F.; Brunak, Søren

    2004-01-01

    -sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...

  1. Complete amino acid sequence of the Aspergillus cytotoxin mitogillin

    Energy Technology Data Exchange (ETDEWEB)

    Fernandez-Luna, J.L.; Lopez-Otin, C.; Soriano, F.; Mendez, E.

    1985-02-12

    The complete amino acid sequence of the cytotoxin mitogillin has been determined by sequencing the intact chain and peptide fragments produced by cleavage at methionyl, arginyl, lysyl, and tryptophanyl residues and at one aspartic acid-proline bond. The protein consists of 149 amino acid residues with alanine at the NH2 terminus and histidine at the COOH terminus. The calculated Mr of the native mitogillin was 16,867. The native molecule presents two disulfide bridges, one between cysteine residues at positions 5 and 147 and another one between cysteine residues at positions 75 and 131. The amino acid sequence of mitogillin shows 86% homology with another cytotoxic protein called alpha-sarcin.

  2. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    Science.gov (United States)

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.
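
    A simple way to look for under-represented triplets is to compare each observed triplet count with the count expected if residues were independent. The sketch below does that. The independence model, the function name triplet_enrichment and the toy input are assumptions; the abstract does not state how expected frequencies were derived.

```python
from collections import Counter
from itertools import product


def triplet_enrichment(proteins):
    """Return {triplet: observed / expected} over the residues seen in the input.

    Expected counts assume independent residues: the number of triplet
    positions times the product of the three single-residue frequencies.
    """
    residues, triplets = Counter(), Counter()
    for seq in proteins:
        seq = seq.upper()
        residues.update(seq)
        triplets.update(seq[i:i + 3] for i in range(len(seq) - 2))
    n_res = sum(residues.values())
    n_tri = sum(triplets.values())
    ratios = {}
    for combo in product(sorted(residues), repeat=3):
        expected = float(n_tri)
        for aa in combo:
            expected *= residues[aa] / n_res
        tri = "".join(combo)
        # A ratio of 0.0 marks a triplet that never occurs although it is expected.
        ratios[tri] = triplets[tri] / expected if expected > 0 else float("nan")
    return ratios


# Toy input: only AGA and GAG occur, so the six other A/G triplets score 0.0.
ratios = triplet_enrichment(["AGAG"])
```

    On a real proteome, triplets with ratios far below 1 would be candidates for the kind of under-represented sequences described above, subject to a proper significance test.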

  3. Amino acid sequences of proteins from Leptospira serovar pomona

    Directory of Open Access Journals (Sweden)

    Alves Selmo F

    2000-01-01

    This report describes partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the 32 and 45 kDa proteins. In situ digestion of the 40 kDa protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen amino acid residues was obtained from the 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a sequence of twenty amino acids, was obtained from the 40 kDa protein.

  4. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences.

    Science.gov (United States)

    Derr, Julien; Manapat, Michael L; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A; Chen, Irene A

    2012-05-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.

  5. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Directory of Open Access Journals (Sweden)

    Xiaoxia Yang

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  6. Effective automated feature construction and selection for classification of biological sequences.

    Directory of Open Access Journals (Sweden)

    Uday Kamath

    Full Text Available Many open problems in bioinformatics involve elucidating underlying functional signals in biological sequences. DNA sequences, in particular, are characterized by rich architectures in which functional signals are increasingly found to combine local and distal interactions at the nucleotide level. Problems of interest include detection of regulatory regions, splice sites, exons, hypersensitive sites, and more. These problems naturally lend themselves to formulation as classification problems in machine learning. When classification is based on features extracted from the sequences under investigation, success is critically dependent on the chosen set of features. We present an algorithmic framework (EFFECT) for automated detection of functional signals in biological sequences. We focus here on classification problems involving DNA sequences which state-of-the-art work in machine learning shows to be challenging and to involve complex combinations of local and distal features. EFFECT uses a two-stage process to first construct a set of candidate sequence-based features and then select a most effective subset for the classification task at hand. Both stages make heavy use of evolutionary algorithms to efficiently guide the search towards informative features capable of discriminating between sequences that contain a particular functional signal and those that do not. To demonstrate its generality, EFFECT is applied to three separate problems of importance in DNA research: the recognition of hypersensitive sites, splice sites, and ALU sites. Comparisons with state-of-the-art algorithms show that the framework is both general and powerful. In addition, a detailed analysis of the constructed features shows that they contain valuable biological information about DNA architecture, allowing biologists and other researchers to directly inspect the features and potentially use the insights obtained to assist wet-laboratory studies on retainment or modification

  7. Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis.

    Directory of Open Access Journals (Sweden)

    Lu-Lu Zheng

    Full Text Available Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations.

  8. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection

    Directory of Open Access Journals (Sweden)

    Xin Ma

    2015-01-01

    Full Text Available The prediction of RNA-binding proteins is one of the most challenging problems in computational biology. Although some studies have investigated this problem, the accuracy of prediction is still not sufficient. In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR) method, followed by incremental feature selection (IFS). We incorporated conjoint triad features and three novel features: binding propensity (BP), nonbinding propensity (NBP), and evolutionary information combined with physicochemical properties (EIPP). The results showed that these novel features have important roles in improving the performance of the predictor. Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient). High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.
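
    The abstract above reports both accuracy and the Matthews correlation coefficient (MCC). As a rough illustration of how the two metrics are computed from a binary confusion matrix, here is a minimal sketch; the function names and the example counts are hypothetical and are not taken from the paper.

```python
from math import sqrt


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """MCC from confusion-matrix counts; returns 0.0 when undefined."""
    numerator = tp * tn - fp * fn
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # If any marginal total is zero the coefficient is undefined;
    # returning 0.0 in that case is a common convention, assumed here.
    return numerator / denominator if denominator else 0.0


# Hypothetical counts for illustration only (not the paper's data).
print(accuracy(45, 40, 10, 5))
print(matthews_corrcoef(45, 40, 10, 5))
```

    Unlike accuracy, MCC uses all four cells of the confusion matrix, which is why it is often reported alongside accuracy when the binding and non-binding classes are imbalanced.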

  9. Robust prediction of B-factor profile from sequence using two-stage SVR based on random forest feature selection.

    Science.gov (United States)

    Pan, Xiao-Yong; Shen, Hong-Bin

    2009-01-01

    B-factor is highly correlated with protein internal motion, which is used to measure the uncertainty in the position of an atom within a crystal structure. Although the rapid progress of structural biology in recent years makes more accurate protein structures available than ever, with the avalanche of new protein sequences emerging during the post-genomic era, the gap between the known protein sequences and the known protein structures becomes wider and wider. It is urgent to develop automated methods to predict B-factor profile from the amino acid sequences directly, so as to be able to timely utilize them for basic research. In this article, we propose a novel approach, called PredBF, to predict the real value of B-factor. We first extract both global and local features from the protein sequences as well as their evolution information, then the random forests feature selection is applied to rank their importance and the most important features are inputted to a two-stage support vector regression (SVR) for prediction, where the initial predicted outputs from the first SVR are further inputted to the second-layer SVR for final refinement. Our results have revealed that a systematic analysis of the importance of different features gives deep insight into the different contributions of features and is very necessary for developing effective B-factor prediction tools. The two-layer SVR prediction model designed in this study further enhanced the robustness of predicting the B-factor profile. As a web server, PredBF is freely available at: http://www.csbio.sjtu.edu.cn/bioinf/PredBF for academic use.

  10. SeqVISTA: a graphical tool for sequence feature visualization and comparison

    Directory of Open Access Journals (Sweden)

    Niu Tianhua

    2003-01-01

    Full Text Available Abstract Background Many readers will sympathize with the following story. You are viewing a gene sequence in Entrez, and you want to find whether it contains a particular sequence motif. You reach for the browser's "find in page" button, but those darn spaces every 10 bp get in the way. And what if the motif is on the opposite strand? Subsequently, your favorite sequence analysis software informs you that there is an interesting feature at position 13982–14013. By painstakingly counting the 10 bp blocks, you are able to examine the sequence at this location. But now you want to see what other features have been annotated close by, and this information is buried several screenfuls higher up the web page. Results SeqVISTA presents a holistic, graphical view of features annotated on nucleotide or protein sequences. This interactive tool highlights the residues in the sequence that correspond to features chosen by the user, and allows easy searching for sequence motifs or extraction of particular subsequences. SeqVISTA is able to display results from diverse sequence analysis tools in an integrated fashion, and aims to provide much-needed unity to the bioinformatics resources scattered around the Internet. Our viewer may be launched on a GenBank record by a single click of a button installed in the web browser. Conclusion SeqVISTA allows insights to be gained by viewing the totality of sequence annotations and predictions, which may be more revealing than the sum of their parts. SeqVISTA runs on any operating system with a Java 1.4 virtual machine. It is freely available to academic users at http://zlab.bu.edu/SeqVISTA.

  11. Analysis on n-gram statistics and linguistic features of whole genome protein sequences

    Institute of Scientific and Technical Information of China (English)

    DONG Qi-wen; WANG Xiao-long; LIN Lei

    2008-01-01

    To obtain the statistical sequence analysis on a large number of genomic and proteomic sequences available for different organisms, the n-grams of whole genome protein sequences from 20 organisms were extracted. Their linguistic features were analyzed by two tests: Zipf power law and Shannon entropy, developed for analysis of natural languages and symbolic sequences. The natural genome proteins and the artificial genome proteins were compared with each other and some statistical features of n-grams were discovered. The results show that: the n-grams of whole genome protein sequences approximately follow the Zipf law when n is larger than 4; the Shannon n-gram entropy of natural genome proteins is lower than that of artificial proteins; a simple unigram model can distinguish different organisms; there exist organism-specific usages of "phrases" in protein sequences. It is suggested that further detailed analysis on n-gram of whole genome protein sequences will result in a powerful model for mapping the relationship of protein sequence, structure and function.
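
    The two tests named in the abstract above can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it counts overlapping n-grams in a protein sequence, computes the Shannon entropy of the resulting distribution, and produces rank-frequency pairs of the kind one would plot on log-log axes to check a Zipf-like law. All function names and the toy sequence are assumptions made for the example.

```python
from collections import Counter
from math import log2


def ngram_counts(sequence: str, n: int) -> Counter:
    """Count overlapping n-grams (substrings of length n) in a sequence."""
    return Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))


def shannon_entropy(counts: Counter) -> float:
    """Shannon entropy, in bits, of the empirical n-gram distribution."""
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def rank_frequency(counts: Counter) -> list[tuple[int, int]]:
    """(rank, frequency) pairs, most frequent n-gram first."""
    ranked = counts.most_common()
    return [(rank, freq) for rank, (_, freq) in enumerate(ranked, start=1)]


toy_sequence = "MKVLAAGMKVLA"  # hypothetical toy sequence, not real data
bigrams = ngram_counts(toy_sequence, 2)
print(shannon_entropy(bigrams))
print(rank_frequency(bigrams)[:3])
```

    On real proteomes the counts would be pooled over all proteins of an organism before the entropy or the rank-frequency curve is computed.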

  12. Aligning, analyzing, and visualizing sequences for antibody engineering: Automated recognition of immunoglobulin variable region features.

    Science.gov (United States)

    Jarasch, Alexander; Skerra, Arne

    2017-01-01

    The analysis and comparison of large numbers of immunoglobulin (Ig) sequences that arise during an antibody selection campaign can be time-consuming and tedious. Typically, the identification and annotation of framework as well as complementarity-determining regions (CDRs) is based on multiple sequence alignments using standardized numbering schemes, which allow identification of equivalent residues among different family members but often necessitate expert knowledge and manual intervention. Moreover, due to the enormous length variability of some CDRs the benefit of conventional Ig numbering schemes is limited and the calculation of correct sequence alignments can become challenging. Whereas, in principle, a well established set of rules permits the assignment of CDRs from the amino acid sequence alone, no currently available sequence alignment editor provides an algorithm to annotate new Ig sequences accordingly. Here we present a unique pattern matching method implemented into our recently developed ANTICALIgN editor that automatically identifies all hypervariable and framework regions in experimentally elucidated antibody sequences using so-called "regular expressions." By combination of this widely supported software syntax with the unique capabilities of real-time aligning, editing and analyzing extended sets of amino acid and/or nucleotide sequences simultaneously on a local workstation, ANTICALIgN provides a powerful utility for antibody engineering. Proteins 2016; 85:65-71. © 2016 Wiley Periodicals, Inc.

  13. Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains

    Directory of Open Access Journals (Sweden)

    Branislava Gemovic

    2013-01-01

    Full Text Available There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed a feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs) and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM), a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only the ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.

  14. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Science.gov (United States)

    2010-07-01

    ... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences... sequences are specifically excluded from this definition. Sequences with fewer than four specifically... acids are not intended to be embraced by this definition. Any amino acid sequence that contains...

  15. The amino acid sequence of Escherichia coli cyanase.

    Science.gov (United States)

    Chin, C C; Anderson, P M; Wold, F

    1983-01-10

    The amino acid sequence of the enzyme cyanase (cyanate hydrolase) from Escherichia coli has been determined by automatic Edman degradation of the intact protein and of its component peptides. The primary peptides used in the sequencing were produced by cyanogen bromide cleavage at the methionine residues, yielding 4 peptides plus free homoserine from the NH2-terminal methionine, and by trypsin cleavage at the 7 arginine residues after acetylation of the lysines. Secondary peptides required for overlaps and COOH-terminal sequences were produced by chymotrypsin or clostripain cleavage of some of the larger peptides. The complete sequence of the cyanase subunit consists of 156 amino acid residues (Mr 16,350). Based on the observation that the cysteine-containing peptide is obtained as a disulfide-linked dimer, it is proposed that the covalent structure of cyanase is made up of two subunits linked by a disulfide bond between the single cystine residue in each subunit. The native enzyme (Mr 150,000) then appears to be a complex of four or five such subunit dimers.

  16. NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.

    Science.gov (United States)

    Pruitt, Kim D; Tatusova, Tatiana; Brown, Garth R; Maglott, Donna R

    2012-01-01

    The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of genomic, transcript and protein sequence records. These records are selected and curated from public sequence archives and represent a significant reduction in redundancy compared to the volume of data archived by the International Nucleotide Sequence Database Collaboration. The database includes over 16,000 organisms, 2.4 × 10^6 genomic records, 13 × 10^6 proteins and 2 × 10^6 RNA records spanning prokaryotes, eukaryotes and viruses (RefSeq release 49, September 2011). The RefSeq database is maintained by a combined approach of automated analyses, collaboration and manual curation to generate an up-to-date representation of the sequence, its features, names and cross-links to related sources of information. We report here on recent growth, the status of curating the human RefSeq data set, more extensive feature annotation and current policy for eukaryotic genome annotation via the NCBI annotation pipeline. More information about the resource is available online (see http://www.ncbi.nlm.nih.gov/RefSeq/).

  17. SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.

    Directory of Open Access Journals (Sweden)

    Steven N Hart

    Full Text Available BACKGROUND: Structural variation (SV) represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. RESULTS: We developed and validated SoftSearch using real and synthetic datasets. SoftSearch's key features are (1) not requiring secondary (or exhaustive primary) alignment, (2) portability into established sequencing workflows, and (3) applicability to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.). SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. CONCLUSIONS: We show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance.

  18. Application of next generation sequencing to human gene fusion detection: computational tools, features and perspectives.

    Science.gov (United States)

    Wang, Qingguo; Xia, Junfeng; Jia, Peilin; Pao, William; Zhao, Zhongming

    2013-07-01

    Gene fusions are important genomic events in human cancer because their fusion gene products can drive the development of cancer and thus are potential prognostic tools or therapeutic targets in anti-cancer treatment. Major advancements have been made in computational approaches for fusion gene discovery over the past 3 years due to improvements and widespread applications of high-throughput next generation sequencing (NGS) technologies. To identify fusions from NGS data, existing methods typically leverage the strengths of both sequencing technologies and computational strategies. In this article, we review the NGS and computational features of existing methods for fusion gene detection and suggest directions for future development.

  19. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    Directory of Open Access Journals (Sweden)

    Ruan Jishou

    2007-04-01

    Full Text Available Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are

  20. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    Science.gov (United States)

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  1. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Directory of Open Access Journals (Sweden)

    Meiler Arno

    2012-09-01

    Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  2. Spatial distribution features of sequence types of moderate and strong earthquake in Chinese mainland

    Institute of Scientific and Technical Information of China (English)

    JIANG Hai-kun; LI Yong-li; QU Yan-jun; HUA Ai-jun; ZHENG Jian-chang; DAI Lei; HOU Hai-feng

    2006-01-01

    Based on 294 earthquake sequences with magnitude greater than or equal to 5.0 that have occurred in the Chinese mainland since 1970, the spatial distribution features of sequence types have been studied. In southwestern China, the mainshock-aftershock sequence type (MAT) predominates in the Chuan-Dian rhombic block and the associated Xianshuihe-Anninghe-Xiaojiang seismic belt, as well as in the Jinshajiang-Honghe seismic belt. The multiple mainshock type (MMT) is mainly distributed in western Yunnan and in the Longlin and Lancang areas of the Tengchong-Baoshan block, west of the Nujiang-Lancangjiang fault zone. A few isolated earthquake type (IET) sequences occurred, mainly in northwestern Sichuan, and no IET sequence occurred in the Yunnan region. In northwestern China, MAT predominates in the west segment of South Tianshan in the Xinjiang region. Some MMT sequences also occurred in this area, at the intersection of the Kalpin block and the Puchang fault zone. IET predominates in middle Tianshan in Xinjiang. Along the Qilianshan seismic belt, most sequences are MAT. In the Qinghai region, MAT predominates, but the regional pattern of the spatial distribution of sequence types is not very clear. In North China, MAT predominates in the Yinshan-Yanshan-Bohai seismic belt, the north edge of North China, and the Hebei plain seismic belt, as well as in the sub-plate of the lower reaches of the Yangtze River. At the intersection of the north segment of the Shanxi seismic belt and the NW-trending Yinshan-Yanshan-Bohai seismic belt, several moderate or strong MMT sequences with magnitudes from 5.0 to 6.0 occurred. In the south of North China, around latitude 35°N, IET predominates. The spatial distribution of sequence types is related to the patterns of tectonic movement. MAT is mostly produced by ruptures of locked units or asperities, or of newly formed separating segments inside the fault zones. MMT is generally related to conjugate structures or the intersection of many tectonic settings

  3. What Matters in Implicit Task Sequence Learning: Perceptual Stimulus Features, Task Sets, or Correlated Streams of Information?

    Science.gov (United States)

    Weiermann, Brigitte; Cock, Josephine; Meier, Beat

    2010-01-01

    Implicit task sequence learning may be attributed to learning the order of perceptual stimulus features associated with the task sequence, learning a series of automatic task set activations, or learning an integrated sequence that derives from 2 correlated streams of information. In the present study, our purpose was to distinguish among these 3…

  4. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    KAUST Repository

    Chen, Peng

    2013-07-23

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  5. Prediction of peptide drift time in ion mobility mass spectrometry from sequence-based features

    KAUST Repository

    Wang, Bing

    2013-05-09

    Background: Ion mobility-mass spectrometry (IMMS), an analytical technique which combines the features of ion mobility spectrometry (IMS) and mass spectrometry (MS), can rapidly separate ions on a millisecond time-scale. IMMS has become a powerful tool for analyzing complex mixtures, especially for the analysis of peptides in proteomics. The high-throughput nature of this technique provides a challenge for the identification of peptides in complex biological samples. As an important parameter, peptide drift time can be used for enhancing downstream data analysis in IMMS-based proteomics. Results: In this paper, a model is presented based on the least squares support vector regression (LS-SVR) method to predict peptide ion drift time in IMMS from the sequence-based features of peptides. Four descriptors were extracted from the peptide sequence to represent peptide ions by a 34-component vector. The parameters of LS-SVR were selected by a grid searching strategy, and a 10-fold cross-validation approach was employed for the model training and testing. Our proposed method was tested on three datasets with different charge states. The high prediction performance achieved demonstrates the effectiveness and efficiency of the prediction model. Conclusions: Our proposed LS-SVR model can predict peptide drift time from sequence information with relatively high prediction accuracy, as shown by a test on a dataset of 595 peptides. This work can enhance the confidence of protein identification by combining with current protein searching techniques. 2013 Wang et al.; licensee BioMed Central Ltd.

  6. Sequencing of bovine herpesvirus 4 v.test strain reveals important genome features

    Directory of Open Access Journals (Sweden)

    Gillet Laurent

    2011-08-01

    Full Text Available Abstract Background Bovine herpesvirus 4 (BoHV-4) is a useful model for the human pathogenic gammaherpesviruses Epstein-Barr virus and Kaposi's Sarcoma-associated Herpesvirus. Although genome manipulations of this virus have been greatly facilitated by the cloning of the BoHV-4 V.test strain as a Bacterial Artificial Chromosome (BAC), the lack of a complete genome sequence for this strain limits its experimental use. Methods In this study, we have determined the complete sequence of BoHV-4 V.test strain by a pyrosequencing approach. Results The long unique coding region (LUR) consists of 108,241 bp encoding at least 79 open reading frames and is flanked by several polyrepetitive DNA units (prDNA). As previously suggested, we showed that the prDNA unit located at the left prDNA-LUR junction (prDNA-G) differs from the other prDNA units (prDNA-inner). Namely, the prDNA-G unit lacks the conserved pac-2 cleavage and packaging signal in its right terminal region. Based on the mechanisms of cleavage and packaging of herpesvirus genomes, this feature implies that only genomes bearing left and right end prDNA units are encapsulated into virions. Conclusions In this study, we have determined the complete genome sequence of the BAC-cloned BoHV-4 V.test strain and identified genome organization features that could be important in other herpesviruses.

  7. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    OpenAIRE

    Kasarda, D.D.; Okita, T W; Bernardin, J. E.; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with a...

  8. FASTERp: A Feature Array Search Tool for Estimating Resemblance of Protein Sequences

    Energy Technology Data Exchange (ETDEWEB)

    Macklin, Derek; Egan, Rob; Wang, Zhong

    2014-03-14

    Metagenome sequencing efforts have provided a large pool of billions of genes for identifying enzymes with desirable biochemical traits. However, homology search with billions of genes in a rapidly growing database has become increasingly computationally impractical. Here we present our pilot efforts to develop a novel alignment-free algorithm for homology search. Specifically, we represent individual proteins as feature vectors that denote the presence or absence of short kmers in the protein sequence. Similarity between feature vectors is then computed using the Tanimoto score, a distance metric that can be rapidly computed on bit string representations of feature vectors. Preliminary results indicate good correlation with optimal alignment algorithms (Spearman r of 0.87, ~1,000,000 proteins from Pfam), as well as with heuristic algorithms such as BLAST (Spearman r of 0.86, ~1,000,000 proteins). Furthermore, a prototype of FASTERp implemented in Python runs approximately four times faster than BLAST on a small scale dataset (~1000 proteins). We are optimizing and scaling to improve FASTERp to enable rapid homology searches against billion-protein databases, thereby enabling more comprehensive gene annotation efforts.
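
    The k-mer presence/absence representation and Tanimoto score described in this record can be illustrated with a minimal sketch. It is not the FASTERp implementation: the k-mer length of 2, the restriction to the 20 standard residues, and the function names `kmer_index`, `kmer_bits` and `tanimoto` are all assumptions made for illustration.

```python
from functools import lru_cache
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (assumed alphabet)


@lru_cache(maxsize=None)
def kmer_index(k: int) -> dict:
    """Map every possible k-mer over the 20-letter alphabet to a bit position."""
    return {"".join(p): i for i, p in enumerate(product(AMINO_ACIDS, repeat=k))}


def kmer_bits(sequence: str, k: int = 2) -> int:
    """Presence/absence vector of k-mers, packed into one Python integer."""
    index = kmer_index(k)
    bits = 0
    for i in range(len(sequence) - k + 1):
        pos = index.get(sequence[i:i + k])
        if pos is not None:  # k-mers containing non-standard letters are skipped
            bits |= 1 << pos
    return bits


def tanimoto(a: int, b: int) -> float:
    """Tanimoto score: shared set bits divided by bits set in either vector."""
    union = bin(a | b).count("1")
    return bin(a & b).count("1") / union if union else 0.0


# "ACDE" has k-mers AC, CD, DE; "ACDF" has AC, CD, DF: 2 shared out of 4 distinct.
print(tanimoto(kmer_bits("ACDE"), kmer_bits("ACDF")))  # 0.5
```

    Packing the vector into a single integer lets the intersection and union be computed with bitwise AND and OR, which is the property the abstract points to when it mentions bit string representations.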

  9. One common structural feature of "words" in protein sequences and human texts.

    Science.gov (United States)

    Zemková, M; Trifonov, E N; Zahradník, D

    2014-01-01

    Frequently discussed analogy between genetic and human texts is explored by comparison of alternation of polar and non-polar amino-acid residues in proteins and alternation of consonants and vowels in human texts. In human languages, the usage of possible combinations of consonants and vowels is influenced by pronounceability of the combinations. Similarly, oligopeptide composition of proteins is influenced by requirements of protein folding and stability. One special type of structure often present in proteins is amphipathic α-helices in which polar and non-polar amino acids alternate with the period 3.5 residues, not unlike alternation of consonants and vowels. In this study, we evaluated the contribution made by amphipathic alternations to the protein sequence texts (20-24%). Their proportion is lower than respective values for alternating words in human texts (57-89%). The proteomes (full sets of proteins for selected organisms) were transformed into ranked sequences of n-grams (words of length n), including periodical amphipathic structures. Similarly, human texts were transformed into sequences of alternating consonants and vowels. Analysis of the vocabularies shows that in both types of texts (human languages and proteins) the alternating words are dominant or highly preferred, thus, strengthening the analogy between these two types of texts. The contribution of amphipathic words in the upper parts of the ranked lists for 10 analyzed proteomes varies between 58 and 74%. In human texts respective values range between 90 and 100%.

  10. A scheme for multiple sequence alignment optimization--an improvement based on family representative mechanics features.

    Science.gov (United States)

    Liu, Xin; Zhao, Ya-Pu

    2009-12-21

    As a basic tool of modern biology, sequence alignment can provide useful information on the fold, function, and active site of a protein. In many cases, increased quality of sequence alignment means better performance. The motivation of the present work is to increase the ability of the existing scoring schemes/algorithms by better accounting for residue-residue correlations. Based on a coarse-grained approach, the hydrophobic force between each pair of residues is written out from the protein sequence. This results in the construction of an intramolecular hydrophobic force network that describes all the residue-residue interactions of each protein molecule and characterizes the protein's biological properties in the hydrophobic aspect. A former work has suggested that such a network can characterize the top weighted feature regarding hydrophobicity. Moreover, for each homologous protein of a family, the corresponding network shares some common and representative family characters that eventually govern the conservation of biological properties during protein evolution. In the present work, we score such family representative characters of a protein by the deviation of its intramolecular hydrophobic force network from that of the background. Such a score can assist the existing scoring schemes/algorithms and boost the ability of multiple sequence alignment, e.g. achieving a prominent increase (approximately 50%) in searching for structurally alike residue segments at a low identity level. As the theoretical basis is different, the present scheme can assist most existing algorithms and improve their efficiency remarkably.

  11. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

    Science.gov (United States)

    Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

    2016-01-01

    The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814

  12. Electrochemical microfluidic biosensor for the detection of nucleic acid sequences.

    Science.gov (United States)

    Goral, Vasiliy N; Zaytseva, Natalya V; Baeumner, Antje J

    2006-03-01

    A microfluidic biosensor with electrochemical detection for the quantification of nucleic acid sequences was developed. In contrast to most microbiosensors that are based on fluorescence for signal generation, it takes advantage of the simplicity and high sensitivity provided by an amperometric and coulometric detection system. An interdigitated ultramicroelectrode array (IDUA) was fabricated in a glass chip and integrated directly with microchannels made of poly(dimethylsiloxane) (PDMS). The assembly was packaged into a Plexiglas housing providing fluid and electrical connections. IDUAs were characterized amperometrically and using cyclic voltammetry with respect to static and dynamic responses for the presence of a reversible redox couple, potassium hexacyanoferrate(II)/hexacyanoferrate(III) (ferro/ferricyanide). A combined concentration of 0.5 µM of ferro/ferricyanide was determined as the lower limit of detection with a dynamic range of 5 orders of magnitude. Background signals were negligible and the IDUA responded in a highly reversible manner to the injection of various volumes and various concentrations of the electrochemical marker. For the detection of nucleic acid sequences, liposomes entrapping the electrochemical marker were tagged with a DNA probe, and superparamagnetic beads were coated with a second DNA probe. A single stranded DNA target sequence hybridized with both probes. The sandwich was captured in the microfluidic channel just upstream of the IDUA via a magnet located in the outside housing. Liposomes were lysed using a detergent and the amount of released ferro/ferricyanide was quantified while passing by the IDUA. Optimal location of the magnet with respect to the IDUA was investigated, the effect of dextran sulfate on the hybridization reaction was studied and the amount of magnetic beads used in the assay was optimized. A dose response curve using varying concentrations of target DNA molecules was carried out demonstrating a limit of

  13. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA. Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge. Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance. We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14,740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ~30% fewer error predictions relative to the other

  14. Large-scale oscillation of structure-related DNA sequence features in human chromosome 21

    Science.gov (United States)

    Li, Wentian; Miramontes, Pedro

    2006-08-01

    Human chromosome 21 is the only chromosome in the human genome that exhibits oscillation of the (G+C) content with a cycle length of hundreds of kilobases (kb) (500 kb near the right telomere). We aim at establishing the existence of a similar periodicity in structure-related sequence features in order to relate this (G+C)% oscillation to other biological phenomena. The following quantities are shown to oscillate with the same 500 kb periodicity in human chromosome 21: binding energy calculated by two sets of dinucleotide-based thermodynamic parameters, AA/TT and AAA/TTT bi- and tri-nucleotide density, 5'-TA-3' dinucleotide density, and signal for 10- or 11-base periodicity of AA/TT or AAA/TTT. These intrinsic quantities are related to structural features of the double helix of DNA molecules, such as base-pair binding, untwisting or unwinding, stiffness, and a putative tendency for nucleosome formation.
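
    Two of the quantities listed in this record, (G+C) content and AA/TT dinucleotide density, can be profiled along a sequence with fixed windows. The sketch below is only an illustration of that idea under stated assumptions: the window and step sizes are supplied by the caller, ambiguous bases such as N simply count as non-G/C, and the function names are hypothetical rather than taken from the paper.

```python
def gc_fraction(chunk: str) -> float:
    """Fraction of G and C bases in a window."""
    return (chunk.count("G") + chunk.count("C")) / len(chunk)


def dinucleotide_density(chunk: str, pairs=("AA", "TT")) -> float:
    """Fraction of overlapping dinucleotide positions that match one of `pairs`."""
    steps = len(chunk) - 1  # number of overlapping dinucleotide positions
    hits = sum(1 for i in range(steps) if chunk[i:i + 2] in pairs)
    return hits / steps


def window_profile(sequence: str, window: int, step: int) -> list:
    """Return (start, GC fraction, AA/TT density) for each full window (window >= 2)."""
    seq = sequence.upper()
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        profile.append((start, gc_fraction(chunk), dinucleotide_density(chunk)))
    return profile


# Toy input: first window is all G/C, second window is AATT (AA and TT out of 3 pairs).
for row in window_profile("GGCCAATT", window=4, step=4):
    print(row)
```

    A periodicity such as the one reported would then be looked for in the resulting profile, for example with a Fourier transform; that step is outside this sketch.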

  15. Targeted deep sequencing improves outcome stratification in chronic myelomonocytic leukemia with low risk cytogenetic features

    Science.gov (United States)

    Palomo, Laura; Garcia, Olga; Arnan, Montse; Xicoy, Blanca; Fuster, Francisco; Cabezón, Marta; Coll, Rosa; Ademà, Vera; Grau, Javier; Jiménez, Maria-José; Pomares, Helena; Marcé, Sílvia; Mallo, Mar; Millá, Fuensanta; Alonso, Esther; Sureda, Anna; Gallardo, David; Feliu, Evarist; Ribera, Josep-Maria; Solé, Francesc; Zamora, Lurdes

    2016-01-01

    Clonal cytogenetic abnormalities are found in 20-30% of patients with chronic myelomonocytic leukemia (CMML), while gene mutations are present in >90% of cases. Patients with low risk cytogenetic features account for 80% of CMML cases and often fall into the low risk categories of CMML prognostic scoring systems, but the outcome differs considerably among them. We performed targeted deep sequencing of 83 myeloid-related genes in 56 CMML patients with low risk cytogenetic features or uninformative conventional cytogenetics (CC) at diagnosis, with the aim to identify the genetic characteristics of patients with a more aggressive disease. Targeted sequencing was also performed in a subset of these patients at time of acute myeloid leukemia (AML) transformation. Overall, 98% of patients harbored at least one mutation. Mutations in cell signaling genes were acquired at time of AML progression. Mutations in ASXL1, EZH2 and NRAS correlated with higher risk features and shorter overall survival (OS) and progression free survival (PFS). Patients with SRSF2 mutations associated with poorer OS, while absence of TET2 mutations (TET2wt) was predictive of shorter PFS. A decrease in OS and PFS was observed as the number of adverse risk gene mutations (ASXL1, EZH2, NRAS and SRSF2) increased. On multivariate analyses, CMML-specific scoring system (CPSS) and presence of adverse risk gene mutations remained significant for OS, while CPSS and TET2wt were predictive of PFS. These results confirm that mutation analysis can add prognostic value to patients with CMML and low risk cytogenetic features or uninformative CC. PMID:27486981

  16. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    Science.gov (United States)

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.
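
    The core idea of ranking candidate probes by redundancy can be sketched as follows. This is an illustrative reconstruction, not FINPROBE itself: it assumes the standard genetic code, measures redundancy as the product of the number of codons for each residue in a window, fixes the probe at three nucleotides per residue, and uses the hypothetical names `CODON_COUNT` and `rank_probe_windows`.

```python
from math import prod

# Number of codons encoding each amino acid in the standard genetic code.
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def rank_probe_windows(protein: str, n_residues: int, top: int = 5) -> list:
    """List (redundancy, start, peptide) tuples in increasing order of redundancy.

    Redundancy is the number of distinct oligonucleotides that could encode the
    peptide window; a lower value means a less degenerate probe pool.
    Residues outside the 20 standard letters raise KeyError.
    """
    ranked = []
    for start in range(len(protein) - n_residues + 1):
        peptide = protein[start:start + n_residues]
        redundancy = prod(CODON_COUNT[aa] for aa in peptide)
        ranked.append((redundancy, start, peptide))
    ranked.sort()
    return ranked[:top]


# Met and Trp each have a single codon, so the window "MW" is non-redundant.
print(rank_probe_windows("MWLSR", n_residues=2, top=2))  # [(1, 0, 'MW'), (6, 1, 'WL')]
```

    Windows rich in Met and Trp rank first because each has a single codon, while Leu, Ser and Arg (six codons each) push a window down the list.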

  17. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Science.gov (United States)

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  18. Characterization of soybean genomic features by analysis of its expressed sequence tags

    DEFF Research Database (Denmark)

    Tian, Ai-Guo; Wang, Jun; Cui, Peng

    2004-01-01

    We analyzed 314,254 soybean expressed sequence tags (ESTs), including 29,540 from our laboratory and 284,714 from GenBank. These ESTs were assembled into 56,147 unigenes. About 76.92% of the unigenes were homologous to genes from Arabidopsis thaliana (Arabidopsis). The putative products of these... to be fast-evolving. Soybean unigenes with no match to genes within the Arabidopsis genome were identified as soybean-specific genes. These genes were mainly involved in nodule development and the synthesis of seed storage proteins. In addition, we also identified 61 genes regulated by salicylic acid, 1,322 transcription factor genes and 326 disease resistance-like genes from soybean unigenes. SSR analysis showed that the soybean genome was more complex than the Arabidopsis and the Medicago truncatula genomes. GC content in soybean unigene sequences is similar to that in Arabidopsis and M. truncatula. Furthermore...

  19. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States); Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section; Wain-Hobson, S. [Pasteur Inst., Paris (France)] [ed.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nucleic Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  20. Complete DNA sequence of Kuraishia capsulata illustrates novel genomic features among budding yeasts (Saccharomycotina).

    Science.gov (United States)

    Morales, Lucia; Noel, Benjamin; Porcel, Betina; Marcet-Houben, Marina; Hullo, Marie-Francoise; Sacerdot, Christine; Tekaia, Fredj; Leh-Louis, Véronique; Despons, Laurence; Khanna, Varun; Aury, Jean-Marc; Barbe, Valérie; Couloux, Arnaud; Labadie, Karen; Pelletier, Eric; Souciet, Jean-Luc; Boekhout, Teun; Gabaldon, Toni; Wincker, Patrick; Dujon, Bernard

    2013-01-01

    The numerous yeast genome sequences presently available provide a rich source of information for functional as well as evolutionary genomics but unequally cover the large phylogenetic diversity of extant yeasts. We present here the complete sequence of the nuclear genome of the haploid-type strain of Kuraishia capsulata (CBS1993(T)), a nitrate-assimilating Saccharomycetales of uncertain taxonomy, isolated from tunnels of insect larvae underneath coniferous barks and characterized by its copious production of extracellular polysaccharides. The sequence is composed of seven scaffolds, one per chromosome, totaling 11.4 Mb and containing 6,029 protein-coding genes, ~13.5% of which are interrupted by introns. This GC-rich yeast genome (45.7%) appears phylogenetically related to the few other nitrate-assimilating yeasts sequenced so far, Ogataea polymorpha, O. parapolymorpha, and Dekkera bruxellensis, with which it shares a very reduced number of tRNA genes, a novel tRNA sparing strategy, and a common nitrate assimilation cluster, three features specific to this group of yeasts. Centromeres were recognized in GC-poor troughs of each scaffold. The strain bears MAT alpha genes at a single MAT locus and presents a significant degree of conservation with Saccharomyces cerevisiae genes, suggesting that it can perform sexual cycles in nature, although genes involved in meiosis were not all recognized. The complete absence of conservation of synteny between K. capsulata and any other yeast genome described so far, including the three other nitrate-assimilating species, validates the interest of this species for long-range evolutionary genomic studies among Saccharomycotina yeasts.

  1. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    Science.gov (United States)

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Random mutations and natural selection have driven the evolution of the protein amino acid sequences that we observe at present in nature. The question of which is the dominant force in protein evolution still lacks an unambiguous answer. Random mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable for a well-tuned amino acid sequence to be indistinguishable from a random one. In order to study the possibility of discriminating between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.

  2. A Novel Sequence-Based Feature for the Identification of DNA-Binding Sites in Proteins Using Jensen–Shannon Divergence

    Directory of Open Access Journals (Sweden)

    Truong Khanh Linh Dang

    2016-10-01

    The knowledge of protein-DNA interactions is essential to fully understand the molecular activities of life. Many research groups have developed various tools, using either structure- or sequence-based approaches, to predict the DNA-binding residues in proteins. The structure-based methods usually achieve good results, but require knowledge of the 3D structure of the protein, while sequence-based methods can be applied to proteins in high throughput, but require good features. In this study, we present a new information theoretic feature derived from the Jensen–Shannon Divergence (JSD) between the amino acid distribution of a site and the background distribution of non-binding sites. Our new feature indicates the difference of a certain site from a non-binding site, thus it is informative for detecting binding sites in proteins. We conduct the study with a five-fold cross validation of 263 proteins utilizing the Random Forest classifier. We evaluate the functionality of our new features by combining them with other popular existing features such as position-specific scoring matrix (PSSM), orthogonal binary vector (OBV), and secondary structure (SS). We notice that by adding our features, we can significantly boost the performance of the Random Forest classifier, with a clear increment of sensitivity and Matthews correlation coefficient (MCC).
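
    The Jensen–Shannon divergence between a site's amino acid distribution and a background distribution can be computed as in the sketch below. How the record's authors estimate the two distributions is not stated in the abstract, so the example assumes, purely for illustration, that each distribution is built from a string of observed residues (for instance an alignment column); the function names are hypothetical.

```python
from collections import Counter
from math import log2

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def distribution(residues: str) -> list:
    """Relative frequency of each standard amino acid in a string of residues."""
    counts = Counter(r for r in residues if r in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no standard residues observed")
    return [counts[a] / total for a in AMINO_ACIDS]


def kl_divergence(p: list, q: list) -> float:
    """Kullback-Leibler divergence in bits; terms with p_i = 0 contribute 0."""
    return sum(pi * log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jensen_shannon(p: list, q: list) -> float:
    """JSD(p, q) = 0.5 * KL(p || m) + 0.5 * KL(q || m), with m the midpoint of p and q."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


site = distribution("KKKRKR")          # hypothetical column from a binding site
background = distribution("AGLSVKDE")  # hypothetical non-binding background
print(round(jensen_shannon(site, background), 3))
```

    With base-2 logarithms the value lies between 0 (identical distributions) and 1 (distributions with no residue in common), which makes it convenient as a bounded per-site feature.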

  3. Yeast prions and human prion-like proteins: sequence features and prediction methods.

    Science.gov (United States)

    Cascarina, Sean M; Ross, Eric D

    2014-06-01

    Prions are self-propagating infectious protein isoforms. A growing number of prions have been identified in yeast, each resulting from the conversion of soluble proteins into an insoluble amyloid form. These yeast prions have served as a powerful model system for studying the causes and consequences of prion aggregation. Remarkably, a number of human proteins containing prion-like domains, defined as domains with compositional similarity to yeast prion domains, have recently been linked to various human degenerative diseases, including amyotrophic lateral sclerosis. This suggests that the lessons learned from yeast prions may help in understanding these human diseases. In this review, we examine what has been learned about the amino acid sequence basis for prion aggregation in yeast, and how this information has been used to develop methods to predict aggregation propensity. We then discuss how this information is being applied to understand human disease, and the challenges involved in applying yeast prediction methods to higher organisms.

  4. Accurate single-sequence prediction of solvent accessible surface area using local and global features.

    Science.gov (United States)

    Faraggi, Eshel; Zhou, Yaoqi; Kloczkowski, Andrzej

    2014-11-01

    We present a new approach for predicting the Accessible Surface Area (ASA) using a General Neural Network (GENN). The novelty of the new approach lies in not using residue mutation profiles generated by multiple sequence alignments as descriptive inputs. Instead we use solely sequential window information and global features such as single-residue and two-residue compositions of the chain. The resulting predictor is both far more efficient than sequence alignment-based predictors and of comparable accuracy to them. Introduction of the global inputs significantly helps achieve this comparable accuracy. The predictor, termed ASAquick, is tested on predicting the ASA of globular proteins and found to perform similarly well for so-called easy and hard cases, indicating generalizability and possible usability for de-novo protein structure prediction. The source code and Linux executables for GENN and ASAquick are available from Research and Information Systems at http://mamiris.com, from the SPARKS Lab at http://sparks-lab.org, and from the Battelle Center for Mathematical Medicine at http://mathmed.org.
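
    The combination of sequential window information with global composition features can be sketched as a per-residue input vector. This is not the ASAquick encoding: the window half-width, the one-hot encoding with a padding symbol, and the use of single-residue composition only (two-residue composition is omitted for brevity) are assumptions, as are all function names.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD = "-"                      # symbol used beyond the chain ends (assumption)
ALPHABET = AMINO_ACIDS + PAD   # 21 symbols per window position


def composition(seq: str) -> list:
    """Global single-residue composition of a non-empty chain (20 values)."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def one_hot(symbol: str) -> list:
    """One-hot encode a symbol; unknown symbols give an all-zero vector."""
    vec = [0.0] * len(ALPHABET)
    idx = ALPHABET.find(symbol)
    if idx >= 0:
        vec[idx] = 1.0
    return vec


def residue_features(seq: str, half_width: int = 2) -> list:
    """One feature row per residue: encoded local window plus global composition."""
    padded = PAD * half_width + seq + PAD * half_width
    global_part = composition(seq)
    rows = []
    for i in range(len(seq)):
        window = padded[i:i + 2 * half_width + 1]
        local_part = [x for symbol in window for x in one_hot(symbol)]
        rows.append(local_part + global_part)
    return rows


rows = residue_features("ACD", half_width=1)
print(len(rows), len(rows[0]))  # 3 rows, each 3 * 21 + 20 = 83 values
```

    The global part is identical for every residue of a chain; it gives the predictor chain-level context without requiring a multiple sequence alignment.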

  5. Tracking facial features in video sequences using a deformable-model-based approach

    Science.gov (United States)

    Malciu, Marius; Preteux, Francoise J.

    2000-10-01

    This paper addresses the issue of computer vision-based face motion capture as an alternative to physical sensor-based technologies. The proposed method combines a deformable template-based tracking of mouth and eyes in arbitrary video sequences with a single speaking person with a global 3D head pose estimation procedure yielding robust initializations. Mathematical principles underlying deformable template matching together with definition and extraction of salient image features are presented. Specifically, interpolating cubic B-splines between the MPEG-4 Face Animation Parameters (FAPs) associated with the mouth and eyes are used as template parameterization. Modeling the template as a network of springs interconnecting with the mouth and eyes FAPs, the internal energy is expressed as a combination of elastic and symmetry local constraints. The external energy function, which enforces interactions with image data, involves contour, texture and topography properties properly combined within robust potential functions. Template matching is achieved by applying the downhill simplex method for minimizing the global energy cost. Stability and accuracy of the results are discussed on a set of 2000 frames corresponding to 5 video sequences of speaking people.

  6. The complete amino acid sequence of the basic nuclear protein of bull spermatozoa

    NARCIS (Netherlands)

    Coelingh, J.P.; Monfoort, Cornelis H.; Rozijn, Thomas H.; Gevers Leuven, Jan A.; Schiphof, R.; Steyn-Parvé, Elizabeth P.; Braunitzer, Gerhard; Schrank, Barbara; Ruhfus, Annette

    1972-01-01

    The complete amino acid sequence of the basic nuclear protein of bull spermatozoa has been established. The sequence was partially deduced by characterization of peptides isolated from thermolysin and chymotryptic digests of the reduced and S-aminoethylated protein. The complete sequence of the fir

  7. NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.

    Directory of Open Access Journals (Sweden)

    Pu Wang

    Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. 
The less the number of query proteins is, the shorter the time will

  8. NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.

    Science.gov (United States)

    Wang, Pu; Xiao, Xuan; Chou, Kuo-Chen

    2011-01-01

    Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. The less the number of query proteins is, the shorter the time will
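
    Two of the sequence-derived features named in this abstract, amino acid composition and dipeptide composition, can be illustrated with a minimal Python sketch. It is not the NR-2L implementation: the function names, the restriction to the 20 standard residues, and the toy input string are assumptions made for illustration, and the complexity factor, Fourier components and FK-NN classifier are not covered.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (assumed alphabet)


def amino_acid_composition(seq):
    """Return 20 fractions, one per standard amino acid, in AMINO_ACIDS order."""
    counts = Counter(seq.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return [counts[a] / total for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return 400 fractions, one per ordered pair of standard amino acids."""
    seq = seq.upper()
    pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    total = sum(pairs[k] for k in keys)
    if total == 0:
        raise ValueError("sequence contains no standard dipeptides")
    return [pairs[k] / total for k in keys]


def feature_vector(seq):
    """Concatenate both compositions into one 420-dimensional vector."""
    return amino_acid_composition(seq) + dipeptide_composition(seq)


# Arbitrary toy string, not a real nuclear receptor sequence.
vec = feature_vector("MKTAYIAKQR")
print(len(vec))
```

    Each block of the vector sums to 1, so proteins of different lengths become comparable fixed-length inputs for a distance-based classifier.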

  9. Amino acid sequence of Japanese quail (Coturnix japonica) and northern bobwhite (Colinus virginianus) myoglobin.

    Science.gov (United States)

    Goodson, John; Beckstead, Robert B; Payne, Jason; Singh, Rakesh K; Mohan, Anand

    2015-08-15

    Myoglobin has an important physiological role in vertebrates, and as the primary sarcoplasmic pigment in meat, influences quality perception and consumer acceptability. In this study, the amino acid sequences of Japanese quail and northern bobwhite myoglobin were deduced by cDNA cloning of the coding sequence from mRNA. Japanese quail myoglobin was isolated from quail cardiac muscles, purified using ammonium sulphate precipitation and gel-filtration, and subjected to multiple enzymatic digestions. Mass spectrometry corroborated the deduced protein amino acid sequence at the protein level. Sequence analysis revealed both species' myoglobin structures consist of 153 amino acids, differing at only three positions. When compared with chicken myoglobin, Japanese quail showed 98% sequence identity, and northern bobwhite 97% sequence identity. The myoglobin in both quail species contained eight histidine residues instead of the nine present in chicken and turkey.
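
    The identity figures quoted in this abstract rest on simple arithmetic, sketched below in Python. The function name and the toy strings are hypothetical; real comparisons would first require an alignment, which this sketch assumes has already been done. For the two quail myoglobins, 153 residues with three differing positions correspond to 150/153, or about 98.0% identity to each other (the 98% and 97% values in the abstract refer to comparisons with chicken).

```python
def percent_identity(a, b):
    """Percent of aligned positions at which two equal-length sequences agree."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Toy strings with one mismatch in ten positions; not real myoglobin sequences.
print(percent_identity("GLSDGEWQLV", "GLSDQEWQLV"))

# Arithmetic for 153 residues that differ at 3 positions.
print(round(100.0 * (153 - 3) / 153, 1))
```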

  10. The amino acid sequence of elephant (Elephas maximus) myoglobin and the phylogeny of Proboscidea.

    Science.gov (United States)

    Dene, H; Goodman, M; Romero-Herrera, A E

    1980-02-13

    The complete amino acid sequence of skeletal myoglobin from the Asian elephant (Elephas maximus) is reported. The functional significance of variations seen when this sequence is compared with that of sperm whale myoglobin is explored in the light of the crystallographic model available for the latter molecule. The phylogenetic implications of the elephant myoglobin amino acid sequence are evaluated by using the maximum parsimony technique. A similar analysis is also presented which incorporates all of the proteins sequenced from the elephant. These results are discussed with respect to current views on proboscidean phylogeny.

  11. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    Science.gov (United States)

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  12. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    Science.gov (United States)

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tandem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  13. Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data

    DEFF Research Database (Denmark)

    Rask, Thomas Salhøj; Petersen, Bent; Chen, Donald S.

    2016-01-01

    insertions and deletions, are on the other hand likely to disrupt open reading frames. Such an inverse relationship between errors and expectation based on prior knowledge can be used advantageously to guide the process known as basecalling, i.e. the inference of nucleotide sequence from raw sequencing data...... family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon...

  14. Peptide Nucleic Acids Having Enhanced Binding Affinity, Sequence Specificity and Solubility

    DEFF Research Database (Denmark)

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from a group consisting of naturally......-occurring nucleobases and non-naturally-occurring nucleobases attached to a polyamide backbone, and contain C1-C8 alkylamine side chains. Methods of enhancing the solubility, binding affinity and sequence specificity of PNAs are provided....

  15. The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features

    Science.gov (United States)

    Tu, Shiqi; Yuan, Guo-Cheng; Shao, Zhen

    2017-01-01

    Recently, long non-coding RNAs (lncRNAs) have emerged as an important class of molecules involved in many cellular processes. One of their primary functions is to shape epigenetic landscape through interactions with chromatin modifying proteins. However, mechanisms contributing to the specificity of such interactions remain poorly understood. Here we took the human and mouse lncRNAs that were experimentally determined to have physical interactions with Polycomb repressive complex 2 (PRC2), and systematically investigated the sequence features of these lncRNAs by developing a new computational pipeline for sequences composition analysis, in which each sequence is considered as a series of transitions between adjacent nucleotides. Through that, PRC2-binding lncRNAs were found to be associated with a set of distinctive and evolutionarily conserved sequence features, which can be utilized to distinguish them from the others with considerable accuracy. We further identified fragments of PRC2-binding lncRNAs that are enriched with these sequence features, and found they show strong PRC2-binding signals and are more highly conserved across species than the other parts, implying their functional importance.
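
    The idea of treating a sequence as a series of transitions between adjacent nucleotides can be sketched in Python as a first-order transition table. This is only one simple reading of the description; the authors' pipeline is not specified in the abstract, and the function name, the ACGU alphabet and the T-to-U conversion are assumptions.

```python
from collections import Counter

NUCLEOTIDES = "ACGU"  # RNA alphabet; DNA-style input is converted below


def transition_frequencies(seq):
    """Estimate P(next nucleotide | current nucleotide) from adjacent pairs."""
    seq = seq.upper().replace("T", "U")
    counts = Counter(
        (a, b)
        for a, b in zip(seq, seq[1:])
        if a in NUCLEOTIDES and b in NUCLEOTIDES
    )
    matrix = {}
    for a in NUCLEOTIDES:
        row_total = sum(counts[(a, b)] for b in NUCLEOTIDES)
        # Rows for nucleotides that never have a successor are left as zeros.
        matrix[a] = {
            b: (counts[(a, b)] / row_total if row_total else 0.0)
            for b in NUCLEOTIDES
        }
    return matrix


# Toy sequence, chosen only to make the table easy to read.
table = transition_frequencies("AACA")
print(table["A"])
```

    The 16 entries of such a table form a fixed-length feature vector regardless of transcript length, which is what makes a representation of this kind usable for comparing lncRNAs.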

  16. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    Science.gov (United States)

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  17. Genome Sequence of Clostridium tyrobutyricum ATCC 25755, a Butyric Acid-Overproducing Strain.

    Science.gov (United States)

    Jiang, Ling; Zhu, Liying; Xu, Xian; Li, Yanping; Li, Shuang; Huang, He

    2013-05-30

    Clostridium tyrobutyricum ATCC 25755 is an efficient producer of butyric acid. Here we report a 3.01-Mb assembly of its genome sequence and other useful information, including the coding sequences (CDSs) responsible for an alternative pathway leading to acetate synthesis as well as a series of membrane transport systems.

  18. Genome Sequence of Clostridium tyrobutyricum ATCC 25755, a Butyric Acid-Overproducing Strain

    OpenAIRE

    2013-01-01

    Clostridium tyrobutyricum ATCC 25755 is an efficient producer of butyric acid. Here we report a 3.01-Mb assembly of its genome sequence and other useful information, including the coding sequences (CDSs) responsible for an alternative pathway leading to acetate synthesis as well as a series of membrane transport systems.

  19. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    OpenAIRE

    Geissler, Andreas J.; Behr, Jürgen; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability.

  20. Amino Acid Sequence - KOME | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    ...ll_sequence_amino_db.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST...ta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_ine_full_sequence_amino_db.fasta.zip Fi...

  1. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    Science.gov (United States)

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  2. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    Science.gov (United States)

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  3. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Science.gov (United States)

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involved in the biosynthesis of clavulanic acid.

  4. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M;

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250-555 p...

  5. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    Science.gov (United States)

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  6. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    Science.gov (United States)

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  7. Complete DNA sequence of Kuraishia capsulata illustrates novel genomic features among budding yeasts (Saccharomycotina)

    NARCIS (Netherlands)

    Morales, L.; Noel, B.; Porcel, B.; Marcet-Houben, M.; Hullo, M.F.; Sacerdot, C.; Tekaia, F.; Leh-Louis, V.; Despons, L.; Khanna, V.; Aury, J.M.; Barbe, V.; Couloux, A.; Labadie, K.; Pelletier, E.; Souciet, J.L.; Boekhout, T.; Gabaldon, T.; Wincker, P.; Dujon, B.

    2013-01-01

    The numerous yeast genome sequences presently available provide a rich source of information for functional as well as evolutionary genomics, but unequally cover the large phylogenetic diversity of extant yeasts. We present here the complete sequence of the nuclear genome of the haploid type strain

  8. The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

    DEFF Research Database (Denmark)

    Behrendt, N; Rønne, E; Ploug, M;

    1990-01-01

    -PA. The purified protein shows a single 55-60 kDa band after sodium dodecyl sulfate-polyacrylamide gel electrophoresis and silver staining. It is a heavily glycosylated protein, the deglycosylated polypeptide chain comprising only 35 kDa. The glycosylated protein contains N-acetyl-D-glucosamine and sialic acid......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported....... The protein has a high content of cysteine residues. The NH2-terminal sequence is not closely related to any known sequence. The identification of the purified and sequenced protein with the human u-PA receptor is based on the following findings: 1) the ability of the purified protein to bind u-PA and its...

  9. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    Science.gov (United States)

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  10. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    Science.gov (United States)

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.
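
    Locating Asn-X-Ser/Thr sequons and collecting their bordering residues, the kind of analysis this abstract describes, can be sketched in Python with a regular expression. The function name, the flank width of three residues and the toy sequence are assumptions; X is taken as any residue because the abstract states no exclusion, although many analyses additionally exclude proline at that position.

```python
import re

# Asn, then any residue, then Ser or Thr. The lookahead keeps the match
# zero-width after N, so overlapping sequons (e.g. "NNTT") are all reported.
SEQUON = re.compile(r"N(?=[A-Z][ST])")


def find_sequons(seq, flank=3):
    """Return (1-based Asn position, sequon, upstream, downstream) tuples."""
    seq = seq.upper()
    hits = []
    for m in SEQUON.finditer(seq):
        i = m.start()
        hits.append(
            (
                i + 1,                      # 1-based position of the Asn
                seq[i:i + 3],               # the three-residue sequon
                seq[max(0, i - flank):i],   # residues before the sequon
                seq[i + 3:i + 3 + flank],   # residues after the sequon
            )
        )
    return hits


# Toy sequence, not taken from any archaeal glycoprotein.
for hit in find_sequons("AANGSNNTTK"):
    print(hit)
```

    Tabulating the upstream and downstream strings over experimentally confirmed sites, and comparing them with unmodified sequons, is one way such sequence rules could be looked for.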

  11. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons

    Directory of Open Access Journals (Sweden)

    Mehtap Abu-Qarn

    2006-01-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  12. PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.

    Directory of Open Access Journals (Sweden)

    Huilin Wang

    X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed 'PredPPCrys' using the support vector machine (SVM). Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I). Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II), which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. In addition, the predicted crystallization

  13. Contig sequences and their annotation (amino acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Dicty_cDB Contig sequences and their annotation (amino acid sequence and results of homology search), and expre...s of homology search), and expression profile Description of data contents Contig...TogoDB edition only. Annotation Translated Amino Acid sequence Representative ami....u-tokyo.ac.jp/ ), which is the program to predict the localization of proteins in the cell. Expression prof...o acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive ...

  14. Feature and duration of metre-scale sequences in a storm-dominated carbonate ramp setting (Kimmeridgian, northeastern Spain)

    Science.gov (United States)

    Colombié, C.; Bádenas, B.; Aurell, M.; Götz, A. E.; Bertholon, S.; Boussaha, M.

    2014-10-01

    Metre-scale sequences may result from the combined effects of allocyclic and autocyclic processes which are closely inter-related. The carbonate ramp that developed northwest of the Iberian Basin during the late Kimmeridgian was affected by northwestward migrating cyclones. Marl-limestone alternations that settled in mid-ramp environments contain abundant mm to cm thick coarse-grained accumulations that have been related to these events. The aim of this paper is to determine the impact of storm-induced processes on the metre-scale sequence features. Four sections (R3, R4, R6, and R7), which are 5 to 7 m in thickness, were studied bed-by-bed along a 4 km-long outcrop, which shows the transition between the shallow and the relatively deep realms of the middle ramp. Metre-scale sequences were defined and correlated along this outcrop according to the detailed microfacies analysis of host, fine-grained deposits, palynofacies and sequence-stratigraphic analyses, and carbon- and oxygen-isotope measurements. The evolution through time of sedimentary features such as the size of quartz grains and the relative abundance of grains other than quartz (i.e., muscovite, bivalve, ooid, and intraclast) does not correlate from one section to the other, suggesting that the finest as well as the coarsest sediment was reworked in these storm-dominated environments. Small- and medium-scale sequences are defined according to changes in alternation, marly interbed, and limestone bed thickness, and correlated from one section to the other. Because of the effects of storms on sediment distribution and preservation, sequence boundaries coincide with thin alternations and marly interbeds in the most proximal sections (i.e., R3, R4), while they correspond to thin alternations and limestone beds in the most distal sections (i.e., R6, R7). Field observations and palynofacies analyses confirm this sequence-stratigraphic analysis. 
    The excursions in carbon- and oxygen-isotope values are consistent

  15. Prediction of GPCR-G Protein Coupling Specificity Using Features of Sequences and Biological Functions

    Institute of Scientific and Technical Information of China (English)

    Toshihide Ono; Haretsugu Hishigaki

    2006-01-01

    Understanding the coupling specificity between G protein-coupled receptors (GPCRs) and specific classes of G proteins is important for further elucidation of receptor functions within a cell. Increasing information on GPCR sequences and the G protein family would facilitate prediction of the coupling properties of GPCRs. In this study, we describe a novel approach for predicting the coupling specificity between GPCRs and G proteins. This method uses not only GPCR sequences but also the functional knowledge generated by natural language processing, and can achieve 92.2% prediction accuracy by using the C4.5 algorithm. Furthermore, rules related to GPCR-G protein coupling are generated. The combination of sequence analysis and text mining improves the prediction accuracy for GPCR-G protein coupling specificity, and also provides clues for understanding GPCR signaling.

  16. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry.

    Science.gov (United States)

    Carter, Charles W; Wolfenden, Richard

    2016-01-01

    The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.

  17. Some features of the Fourier spectrum for symbolic sequences

    Institute of Scientific and Technical Information of China (English)

    王嘉松; 刘国庆; 赵剑

    2012-01-01

    Any symbolic sequence can be represented by a one-dimensional numerical representation or a multi-dimensional vector representation, and therefore has a discrete Fourier transform (DFT), namely that of the numerical sequence corresponding to the symbolic sequence. We find that the total Fourier spectrum of the symbolic sequence depends on the length of the sequence when the base vector representation is used. However, this property does not hold for the one-dimensional representation of the symbolic sequence. According to the relations between the DFT of the original sequence and that of a subsequence for a complex numerical sequence, a simpler way to compute the DFT of the subsequence is presented, instead of directly computing the DFT coefficients of the original sequence at special frequencies. In fact, computing the DFT of the indicator sequence at special frequencies needs only simple arithmetic operations on non-negative integer sequences. Finally, we present some features of the quadratic form corresponding to the Fourier spectrum of real sequences.
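
    The statement about the total Fourier spectrum under the base vector (indicator) representation can be checked numerically with a small Python sketch. It assumes an unnormalized DFT, a four-letter DNA alphabet and hypothetical function names; under those assumptions Parseval's identity gives a total power of N squared for any sequence of length N, which is one reading of the length dependence described in the abstract.

```python
import cmath


def dft(x):
    """Naive unnormalized DFT: X[k] = sum_t x[t] * exp(-2*pi*i*k*t/n)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def indicator_sequences(seq, alphabet="ACGT"):
    """One 0/1 sequence per symbol, marking where that symbol occurs."""
    return {a: [1 if s == a else 0 for s in seq] for a in alphabet}


def total_spectrum(seq, alphabet="ACGT"):
    """Sum of |DFT|^2 over all frequencies and all indicator sequences."""
    total = 0.0
    for u in indicator_sequences(seq, alphabet).values():
        total += sum(abs(c) ** 2 for c in dft(u))
    return total


# Two toy sequences of the same length but different composition.
print(total_spectrum("ACGTTGCA"))
print(total_spectrum("AAAAAAAA"))
```

    Both calls return the same value up to floating-point error, because every position contributes exactly one 1 across the four indicator sequences. A one-dimensional numerical mapping, for example A=1, C=2, G=3, T=4, has no such invariant, since its total power depends on which symbols occur.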

  18. Face recognition based on matching of local features on 3D dynamic range sequences

    Science.gov (United States)

    Echeagaray-Patrón, B. A.; Kober, Vitaly

    2016-09-01

    3D face recognition has attracted attention in the last decade due to improvement of technology of 3D image acquisition and its wide range of applications such as access control, surveillance, human-computer interaction and biometric identification systems. Most research on 3D face recognition has focused on analysis of 3D still data. In this work, a new method for face recognition using dynamic 3D range sequences is proposed. Experimental results are presented and discussed using 3D sequences in the presence of pose variation. The performance of the proposed method is compared with that of conventional face recognition algorithms based on descriptors.

  19. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    Science.gov (United States)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
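
    The first of the two randomness measures described here, the tendency of a chain to use some amino acids more often than others, can be sketched in Python as a single-residue Shannon redundancy. This is a simplified illustration rather than the paper's full set of parameters: the function names and the 20-letter alphabet size are assumptions, and the pair-frequency measure and Monte Carlo baseline are not reproduced.

```python
import math
from collections import Counter


def shannon_entropy(seq):
    """Entropy in bits per residue of the single-residue frequency distribution."""
    n = len(seq)
    counts = Counter(seq)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def redundancy(seq, alphabet_size=20):
    """1 - H / H_max, where H_max = log2(alphabet_size)."""
    return 1.0 - shannon_entropy(seq) / math.log2(alphabet_size)


# Toy inputs: a uniform use of all 20 residues versus a single repeated residue.
print(redundancy("ACDEFGHIKLMNPQRSTVWY"))
print(redundancy("AAAA"))
```

    A sequence using all 20 residues equally often has redundancy close to 0, while a homopolymer has redundancy 1. Short chains cannot sample all residues evenly, which is consistent with the chain-length constraint mentioned in the abstract.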

  20. Protein submitochondrial localization from integrated sequence representation and SVM-based backward feature extraction.

    Science.gov (United States)

    Li, Liqi; Yu, Sanjiu; Xiao, Weidong; Li, Yongsheng; Hu, Wenjuan; Huang, Lan; Zheng, Xiaoqi; Zhou, Shiwen; Yang, Hua

    2015-01-01

    The mitochondrion, a tiny energy factory, plays an important role in various biological processes of most eukaryotic cells. Mitochondrial defects are associated with a series of human diseases. Knowledge of the submitochondrial locations of proteins can help to reveal the biological functions of novel proteins and to understand the mechanisms underlying various biological processes occurring in the mitochondrion. However, experimental methods to determine protein submitochondrial locations are costly and time-consuming. Thus it is essential to develop a fast and reliable computational method to predict protein submitochondrial locations. Here, we proposed a support vector machine (SVM) based approach for predicting protein submitochondrial locations. Information from the position-specific score matrix (PSSM), gene ontology (GO) and the protein feature (PROFEAT) was integrated into the principal features of this model. Then a recursive feature selection scheme was employed to select the optimal features. Finally, an SVM module was used to predict protein submitochondrial locations based on the optimal features. Through the jackknife cross-validation test, our method achieved an accuracy of 99.37% on benchmark dataset M317, and 100% on the other two datasets, M1105 and T86. These results indicate that our method is economical and effective for accurate prediction of the protein submitochondrial location.

  1. Infants' Sensitivity to the Causal Features of Means-End Support Sequences in Action and Perception

    Science.gov (United States)

    Sommerville, Jessica A.; Woodward, Amanda L.

    2005-01-01

    Current work has yielded differential findings regarding infants' ability to perceptually detect the causal structure of a means-end support sequence. Resolving this debate has important implications for perception-action dissociations in this domain of object knowledge. In Study 1, 12-month-old infants' ability to perceive the causal structure of…

  2. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia.

    Science.gov (United States)

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  3. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66

    Directory of Open Access Journals (Sweden)

    Bin Liu

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.

  4. Sequence Pattern Correlation of Amino Acid in Collision-induced Dissociation Electrospray Ionization Mass Spectrometry

    Institute of Scientific and Technical Information of China (English)

    SONG,Hao-Wei(宋浩威); YUE,Gui-Hua(岳贵花); LU,Yu(陆宇); YANG,Peng-Yuan(杨芃原); WANG,Hong-Hai(王洪海)

    2002-01-01

    A novel sequence pattern correlation approach has been applied to predict the expected amino acid sequence from CID ESI-MS spectra. The proposed approach deduces sequence patterns without help from a known protein database, so it is useful for identifying an unknown peptide or a new protein. The algorithm applies a cross-correlation to match an experimental CID spectrum with predicted sequence patterns generated from fragmentation information. Fragmentation knowledge of both the y-series and other non-y-series is used to generate the predicted sequence patterns. In contrast to the normal de novo approach, the proposed approach is insensitive to mass tolerance, is not susceptible to incomplete spectra, and does not require selection of a starting point.

  5. Prediction of protein structural features from sequence data based on Shannon entropy and Kolmogorov complexity.

    Science.gov (United States)

    Bywater, Robert Paul

    2015-01-01

    While the genome of a given organism stores the information necessary for the organism to function and flourish, it is the proteins encoded by the genome that, perhaps more than anything else, characterize the phenotype of that organism. It is therefore not surprising that one of the many approaches to understanding and predicting protein folding and properties has come from genomics, and more specifically from multiple sequence alignments. In this work I explore ways in which data derived from sequence alignments can be used to investigate, in a predictive way, three different aspects of protein structure: secondary structures, inter-residue contacts and the dynamics of switching between different states of the protein. In particular, the use of Kolmogorov complexity has identified a novel pathway towards achieving these goals.

  6. Effect of sequence features on assembly of spider silk block copolymers.

    Science.gov (United States)

    Tokareva, Olena S; Lin, Shangchao; Jacobsen, Matthew M; Huang, Wenwen; Rizzo, Daniel; Li, David; Simon, Marc; Staii, Cristian; Cebe, Peggy; Wong, Joyce Y; Buehler, Markus J; Kaplan, David L

    2014-06-01

    Bioengineered spider silk block copolymers were studied to understand the effect of protein chain length and sequence chemistry on the formation of secondary structure and materials assembly. Using a combination of in vitro protein design and assembly studies, we demonstrate that silk block copolymers possessing multiple repetitive units self-assemble into lamellar microstructures. Additionally, the study provides insights into the assembly behavior of spider silk block copolymers in concentrated salt solutions.

  7. Clinical utility of a next generation sequencing panel assay for Marfan and Marfan-like syndromes featuring aortopathy.

    Science.gov (United States)

    Wooderchak-Donahue, Whitney; VanSant-Webb, Chad; Tvrdik, Tatiana; Plant, Parker; Lewis, Tracey; Stocks, Jennifer; Raney, Joshua A; Meyers, Lindsay; Berg, Alizabeth; Rope, Alan F; Yetman, Anji T; Bleyl, Steven B; Mesley, Rebecca; Bull, David A; Collins, R Thomas; Ojeda, Mayra Martinez; Roberts, Amy; Lacro, Ronald; Woerner, Audrey; Stoler, Joan; Bayrak-Toydemir, Pinar

    2015-08-01

    Aortopathy can be defined as aortic dilation, aneurysm, dissection, and tortuosity. Familial aortopathy may occur secondary to fibrillin-1 (FBN1) mutations in the setting of Marfan syndrome, or may occur as a result of other genetic defects with different, but occasionally overlapping, phenotypes. Because of the phenotypic overlap and genetic heterogeneity of disorders featuring aortopathy, we developed a next generation sequencing (NGS) assay and comparative genomic hybridization (CGH) array to detect mutations in 10 genes that cause thoracic aortic aneurysms (TAAs). Here, we report on the clinical and molecular findings in 175 individuals submitted for aortopathy panel testing at ARUP laboratories. Ten genes associated with heritable aortopathies were targeted using hybridization capture prior to sequencing. NGS results were analyzed, and variants were confirmed using Sanger sequencing. Array CGH was used to detect copy-number variation. Of 175 individuals, 18 had a pathogenic mutation and 32 had a variant of uncertain significance (VUS). Most pathogenic mutations (72%) were identified in FBN1. A novel large SMAD3 duplication and FBN1 deletion were identified. Over half who had TAAs or other aortic involvement tested negative for a mutation, suggesting that additional aortopathy genes exist. We anticipate that the clinical sensitivity of at least 10.3% will rise with VUS reclassification and as additional genes are identified and included in the panel. The aortopathy NGS panel aids in the timely molecular diagnosis of individuals with disorders featuring aortopathy and guides proper treatment.

  8. Amino acid sequences used for clustering (Multi FASTA format) - Gclust Server | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Gclust Server Amino acid sequences used for clustering (Multi FASTA format) Data detail Data name Amino aci...d sequences used for clustering (Multi FASTA format) Description of data contents Amino acid sequences of p...redicted proteins and their annotation for 95 organism species. FASTA format file...5.fa.zip File size: 161MB Simple search URL - Data acquisition method - Data analysis method - Number of data entries - FAST...
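
    A multi-FASTA file of the kind this record describes holds many sequences, each introduced by a header line starting with ">". The sketch below shows one way such text could be read into memory. The function name and the two example records are made up for illustration and are not taken from the Gclust data.

```python
def parse_multi_fasta(text):
    """Return a dict mapping each header (without '>') to its joined sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence may wrap over several lines
    return {h: "".join(parts) for h, parts in records.items()}

# Hypothetical two-record example, not real Gclust entries.
example = """>prot1 hypothetical protein
MKTAYIAK
QRQISFVK
>prot2 another example
MSEQNNTE
"""
records = parse_multi_fasta(example)
print(records)
```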

  9. ProViz-a web-based visualization tool to investigate the functional and evolutionary features of protein sequences.

    Science.gov (United States)

    Jehl, Peter; Manguy, Jean; Shields, Denis C; Higgins, Desmond G; Davey, Norman E

    2016-07-01

    Low-throughput experiments and high-throughput proteomic and genomic analyses have created enormous quantities of data that can be used to explore protein function and evolution. The ability to consolidate these data into an informative and intuitive format is vital to our capacity to comprehend these distinct but complementary sources of information. However, existing tools to visualize protein-related data are restricted by their presentation, sources of information, functionality or accessibility. We introduce ProViz, a powerful browser-based tool to aid biologists in building hypotheses and designing experiments by simplifying the analysis of functional and evolutionary features of proteins. Feature information is retrieved in an automated manner from resources describing protein modular architecture, post-translational modification, structure, sequence variation and experimental characterization of functional regions. These features are mapped to evolutionary information from precomputed multiple sequence alignments. Data are displayed in an interactive and information-rich yet intuitive visualization, accessible through a simple protein search interface. This allows users with limited bioinformatic skills to rapidly access data pertinent to their research. Visualizations can be further customized with user-defined data either manually or using a REST API. ProViz is available at http://proviz.ucd.ie/.

  10. Multimodal emotional state recognition using sequence-dependent deep hierarchical features.

    Science.gov (United States)

    Barros, Pablo; Jirak, Doreen; Weber, Cornelius; Wermter, Stefan

    2015-12-01

    Emotional state recognition has become an important topic for human-robot interaction in the past years. By determining emotion expressions, robots can identify important variables of human behavior and use these to communicate in a more human-like fashion and thereby extend the interaction possibilities. Human emotions are multimodal and spontaneous, which makes them hard to be recognized by robots. Each modality has its own restrictions and constraints which, together with the non-structured behavior of spontaneous expressions, create several difficulties for the approaches present in the literature, which are based on several explicit feature extraction techniques and manual modality fusion. Our model uses a hierarchical feature representation to deal with spontaneous emotions, and learns how to integrate multiple modalities for non-verbal emotion recognition, making it suitable to be used in an HRI scenario. Our experiments show that a significant improvement of recognition accuracy is achieved when we use hierarchical features and multimodal information, and our model improves the accuracy of state-of-the-art approaches from 82.5% reported in the literature to 91.3% for a benchmark dataset on spontaneous emotion expressions.

  11. A design for computer nucleic-acid-sequence storage, retrieval, and manipulation

    OpenAIRE

    1982-01-01

    We have designed and built a data-base system for the storage of nucleic-acid sequences. The system consists of a data base (“the library”) and software that manages and provides access to that data base (“the Librarian”).

  12. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...
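
    The idea in the title, building a DNA alignment from an existing protein alignment, can be sketched as threading codons onto the aligned protein: every residue is replaced by its codon and every gap by three gap characters. This is only an illustration of the general technique with a hypothetical function name. It is not RevTrans's actual implementation and it does not check that each codon really encodes the aligned residue.

```python
def thread_codons(aligned_protein, coding_dna, gap="-"):
    """Map an ungapped coding sequence onto a gapped protein alignment row."""
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding DNA length must be a multiple of 3")
    codons = [coding_dna[i:i + 3] for i in range(0, len(coding_dna), 3)]
    n_residues = sum(1 for c in aligned_protein if c != gap)
    if len(codons) != n_residues:
        raise ValueError("number of codons does not match number of residues")
    codon_iter = iter(codons)
    out = []
    for c in aligned_protein:
        # A protein gap becomes a three-character gap; a residue takes the next codon.
        out.append(gap * 3 if c == gap else next(codon_iter))
    return "".join(out)

# Two rows of a toy protein alignment and their unaligned coding sequences.
row1 = thread_codons("M-K", "ATGAAA")
row2 = thread_codons("MAK", "ATGGCTAAG")
print(row1)
print(row2)
```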

  13. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923).

    Science.gov (United States)

    Wasels, François; Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-03-03

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain.

  14. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    OpenAIRE

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain.

  15. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    Science.gov (United States)

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  16. Nucleic acid sequence-based amplification with oligochromatography for detection of Trypanosoma brucei in clinical samples

    NARCIS (Netherlands)

    C.M. Mugasa; T. Laurent; G.J. Schoone; P.A. Kager; G.W. Lubega; H.D.F.H. Schallig

    2009-01-01

    Molecular tools, such as real-time nucleic acid sequence-based amplification (NASBA) and PCR, have been developed to detect Trypanosoma brucei parasites in blood for the diagnosis of human African trypanosomiasis (HAT). Despite good sensitivity, these techniques are not implemented in HAT control pr

  17. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    DEFF Research Database (Denmark)

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2Logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects...
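
    The quantities behind a sequence logo can be sketched in a few lines: per-column frequencies with pseudo counts, the Shannon information content of each column, and a log-odds score whose sign separates enrichment from depletion. This sketch assumes a uniform background and a simple additive pseudo count; both are simplifications chosen here for illustration and should not be read as the Seq2Logo method. The function names are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def column_frequencies(column, pseudo_weight=1.0):
    """Frequencies for one gap-free alignment column, mixed with a uniform prior."""
    counts = Counter(column)
    n = len(column)
    background = 1.0 / len(AMINO_ACIDS)
    return {
        aa: (counts.get(aa, 0) + pseudo_weight * background) / (n + pseudo_weight)
        for aa in AMINO_ACIDS
    }

def information_content(freqs):
    """Shannon information in bits: log2(20) minus the column entropy."""
    entropy = -sum(p * math.log2(p) for p in freqs.values() if p > 0)
    return math.log2(len(AMINO_ACIDS)) - entropy

def log_odds(freqs):
    """log2(p / background): positive means enriched, negative means depleted."""
    background = 1.0 / len(AMINO_ACIDS)
    return {aa: math.log2(p / background) for aa, p in freqs.items() if p > 0}

# Toy gap-free alignment of four peptides (made-up data).
alignment = ["ACDK", "ACEK", "ACDR", "GCDK"]
columns = list(zip(*alignment))

for i, col in enumerate(columns):
    freqs = column_frequencies(col)
    print(i, round(information_content(freqs), 3))
```

    With only four sequences, the pseudo count keeps unseen residues at a small non-zero frequency, so they show up as depleted (negative log-odds) instead of being dropped.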

  18. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    Science.gov (United States)

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large-scale preparation made possible determination of the full-length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissues. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  19. Estimation of cardiac motion in cine-MRI sequences by correlation transform optical flow of monogenic features distance

    Science.gov (United States)

    Gao, Bin; Liu, Wanyu; Wang, Liang; Liu, Zhengjun; Croisille, Pierre; Delachartre, Philippe; Clarysse, Patrick

    2016-12-01

    Cine-MRI is widely used for the analysis of cardiac function in clinical routine, because of its high soft tissue contrast and relatively short acquisition time in comparison with other cardiac MRI techniques. The gray level distribution in cardiac cine-MRI is relatively homogeneous within the myocardium, which can make motion quantification difficult. To ensure that the motion estimation problem is well posed, more image features have to be considered. This work is inspired by a method previously developed for color image processing. The monogenic signal provides a framework to estimate the local phase, orientation, and amplitude of an image, three features which locally characterize the 2D intensity profile. The independent monogenic features are combined into a 3D matrix for motion estimation. To improve motion estimation accuracy, we chose the zero-mean normalized cross-correlation as a matching measure, and implemented a bilateral filter for denoising and edge-preservation. The monogenic features distance is used in lieu of the color space distance in the bilateral filter. Results obtained from four realistic simulated sequences outperformed two other state-of-the-art methods even in the presence of noise. The motion estimation errors (end point error) using our proposed method were reduced by about 20% in comparison with those obtained by the other tested methods. The new methodology was evaluated on four clinical sequences from patients presenting with cardiac motion dysfunctions and one healthy volunteer. The derived strain fields were analyzed favorably in their ability to identify myocardial regions with impaired motion.
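
    The matching measure named in this abstract, zero-mean normalized cross-correlation (ZNCC), is easy to state in code. The sketch below computes it for two small patches given as nested lists. It covers only the similarity score, not the monogenic features or the optical flow scheme of the paper; the function name and the convention of returning 0.0 for a flat patch are choices made here.

```python
import math

def zncc(patch_a, patch_b):
    """Zero-mean normalized cross-correlation of two equally sized 2D patches."""
    a = [v for row in patch_a for v in row]
    b = [v for row in patch_b for v in row]
    if len(a) != len(b) or not a:
        raise ValueError("patches must be non-empty and of equal size")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    denom = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0  # a flat patch carries no structure to correlate
    return sum(x * y for x, y in zip(da, db)) / denom

patch = [[1, 2], [3, 4]]
brighter = [[12, 14], [16, 18]]   # same pattern with gain 2 and offset 10
inverted = [[4, 3], [2, 1]]
flat = [[5, 5], [5, 5]]

print(zncc(patch, brighter), zncc(patch, inverted), zncc(patch, flat))
```

    Because the mean is subtracted and the result is normalized, the score is unchanged by a uniform gain or offset in intensity, which is useful when gray levels drift between frames.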

  20. Structural features of conopeptide genes inferred from partial sequences of the Conus tribblei genome.

    Science.gov (United States)

    Barghi, Neda; Concepcion, Gisela P; Olivera, Baldomero M; Lluisma, Arturo O

    2016-02-01

    The evolvability of venom components (in particular, the gene-encoded peptide toxins) in venomous species serves as an adaptive strategy allowing them to target new prey types or respond to changes in the prey field. The structure, organization, and expression of the venom peptide genes may provide insights into the molecular mechanisms that drive the evolution of such genes. Conus is a particularly interesting group given the high chemical diversity of their venom peptides, and the rapid evolution of the conopeptide-encoding genes. Conus genomes, however, are large and characterized by a high proportion of repetitive sequences. As a result, the structure and organization of conopeptide genes have remained poorly known. In this study, a survey of the genome of Conus tribblei was undertaken to address this gap. A partial assembly of the C. tribblei genome was generated; the assembly, though consisting of a large number of fragments, accounted for 2160.5 Mb of sequence. A large number of repetitive genomic elements consisting of 642.6 Mb of retrotransposable elements, simple repeats, and novel interspersed repeats were observed. We characterized the structural organization and distribution of conotoxin genes in the genome. A significant number of conopeptide genes (estimated to be between 148 and 193) belonging to different superfamilies with complete or nearly complete exon regions were observed, ~60% of which were expressed. The unexpressed conopeptide genes represent hidden but significant conotoxin diversity. The conotoxin genes also differed in the frequency and length of the introns. The interruption of exons by long introns in the conopeptide genes and the presence of repeats in the introns may indicate the importance of introns in facilitating recombination, evolution and diversification of conotoxins. These findings advance our understanding of the structural framework that promotes the gene-level molecular evolution of venom peptides.

  1. Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features

    Science.gov (United States)

    Li, Yuan; Wang, Mingjun; Wang, Huilin; Tan, Hao; Zhang, Ziding; Webb, Geoffrey I.; Song, Jiangning

    2014-07-01

    Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates that functional features, applied to lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that were not previously identified. The results, along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation.

  2. Amino acid sequences and structures of chicken and turkey beta 2-microglobulin

    DEFF Research Database (Denmark)

    Welinder, K G; Jespersen, H M; Walther-Rasmussen, J;

    1991-01-01

    The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11,048. The higher apparent Mr obtained for the avian beta 2-microglobulins as compared to human beta 2-microglobulin by SDS-PAGE is not understood. Chicken and turkey beta 2-microglobulin consist of 98 residues and deviate at seven positions: 60, 66, 74-76, 78 and 82. The chicken and turkey sequences are identical...... suggest that the seven chicken to turkey differences are exposed to solvent in the avian MHC class I complex. The key residues of beta 2-microglobulin involved in alpha chain contacts within the MHC class I molecule are highly conserved between chicken and man. This explains that heterologous human beta 2...

  3. Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique.

    Science.gov (United States)

    Tang, Hua; Chen, Wei; Lin, Hao

    2016-04-01

    Immunoglobulins, also called antibodies, are a group of cell surface proteins which are produced by the immune system in response to the presence of a foreign substance (called an antigen). They play key roles in many medical, diagnostic and biotechnological applications. Correct identification of immunoglobulins is crucial to the comprehension of humoral immune function. With the avalanche of protein sequences identified in the postgenomic age, it is highly desirable to develop computational methods to identify immunoglobulins in a timely manner. In view of this, we designed a predictor called "IGPred" by formulating protein sequences with the pseudo amino acid composition into which nine physiochemical properties of amino acids were incorporated. Jackknife cross-validated results showed that 96.3% of immunoglobulins and 97.5% of non-immunoglobulins can be correctly predicted, indicating that IGPred holds very high potential to become a useful tool for antibody analysis. For the convenience of most experimental scientists, a web-server for IGPred was established at http://lin.uestc.edu.cn/server/IGPred. We believe that the web-server will become a powerful tool to study immunoglobulins and to guide related experimental validations.

  4. Deep sequencing of the transcriptome reveals inflammatory features of porcine visceral adipose tissue.

    Science.gov (United States)

    Wang, Tao; Jiang, Anan; Guo, Yanqin; Tan, Ya; Tang, Guoqing; Mai, Miaomiao; Liu, Haifeng; Xiao, Jian; Li, Mingzhou; Li, Xuewei

    2013-01-01

    Functional differences in the different types of adipose tissue and the impact of their dysfunction on metabolism are associated with the regional distribution of adipose depots. Here we show a genome-wide comparison between the transcriptomes of one source of subcutaneous and two sources of visceral adipose tissue in the pig using an RNA-seq approach. We obtained ~32.3 million unique mapped reads which covered ~80.2% of the current annotated transcripts across these three sources of adipose tissue. We identified various genes differentially expressed between subcutaneous and visceral adipose tissue, which are potentially associated with the inflammatory features of visceral adipose tissue. These results are of benefit for understanding the phenotypic, metabolic and functional differences between different types of adipose tissue that are deposited in different body sites.

  5. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2232 from Karijini National Park, Australia.

    Science.gov (United States)

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Reddy, Tatiparthi; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  6. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    Science.gov (United States)

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequences of the conventional myosin II head domain from 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa, ‘FPKATD’ and ‘Y/FTNEKL’, without repetition of the same chemical nature, and one block of maximum length 20 aa with repetition of chemical nature, which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa in both myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  7. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    Science.gov (United States)

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp-85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  8. Unique features of a Japanese 'Candidatus Liberibacter asiaticus' strain revealed by whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    Hiroshi Katoh

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative 'Ca. L. asiaticus' Japanese isolate Ishi-1 was determined by metagenomic analysis of DNA extracted from 'Ca. L. asiaticus'-infected psyllids and leaf midribs. The 1.19-Mb genome has an average 36.32% GC content. Annotation revealed 13 operons encoding rRNA and 44 tRNA genes, but no typical bacterial pathogenesis-related genes were located within the genome, similar to the Floridian psy62 and Chinese gxpsy. In contrast to other 'Ca. L. asiaticus' strains, the genome of the Japanese Ishi-1 strain lacks a prophage-related region.
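
    The GC content quoted for the genome is the share of G and C bases among all bases. A minimal sketch of that calculation follows; the function name and the toy fragment are assumptions for illustration and are not taken from the Ishi-1 assembly.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy fragment (hypothetical, not from the published genome).
fragment = "ATGCATTTAACG"
print(f"GC content: {gc_content(fragment):.2f}%")
```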

  9. Tectonic features of out-of-sequence-thrusts in central Nankai accretionary prism

    Science.gov (United States)

    Saito, S.; Kuramoto, S.; Ashi, J.; Kinoshita, M.; Ujiie, K.; Sagaguchi, A.; Lallemant, S.; Toki, T.; Kubo, Y.; Misawa, N.

    2002-12-01

    During the NT02-02 and YK02-02 cruises, deep-tow camera, multi-beam bathymetry, and diving surveys were conducted in the central Nankai accretionary prism, off Kii Peninsula. A system of out-of-sequence thrusts (OOST) defines high ridges, roughly parallel to the deformation front. The surface manifestation of the OOST is characterized by a right-stepped en-echelon arrangement of ridges, which suggests a dextral slip component along the OOST. A series of deep-tow camera and dive surveys were conducted on the sites of OOST. Observation of outcrops and rock-sampling documented that the ridges are composed dominantly of stratified shale, siltstone and partly of sandstone covered by present talus debris and clayey ooze. Exposures along the southern limb of the ridges indicate that the bedding of the sediments dips generally northwestward at an angle of about 20 to 30 degrees. In contrast to the southern limb of the ridge, outcrops along the northern limb of the ridge show southward dipping bedding. Active cold seepages with Calyptogena colony, bacteria mat, and carbonate chimney were observed at several sites on the slope of OOST ridges. All the active cold seepages observed from the submersible are located on the gentle foot of the slope. High heat flow, low chlorinity of interstitial water chemistry, and high natural gamma radiation at the active colony suggest seepage from the inside of the basement.

  10. Unique Features of Germline Variation in Five Egyptian Familial Breast Cancer Families Revealed by Exome Sequencing

    Science.gov (United States)

    Kim, Yeong C.; Soliman, Amr S.; Cui, Jian; Ramadan, Mohamed; Hablas, Ahmed; Abouelhoda, Mohamed; Hussien, Nehal; Ahmed, Ola; Zekri, Abdel-Rahman Nabawy; Seifeldin, Ibrahim A.

    2017-01-01

    Genetic predisposition increases the risk of familial breast cancer. Recent studies indicate that genetic predisposition for familial breast cancer can be ethnic-specific. However, current knowledge of genetic predisposition for the disease is predominantly derived from Western populations. Using this existing information as the sole reference to judge the predisposition in non-Western populations is not adequate and can potentially lead to misdiagnosis. Efforts are required to collect genetic predisposition data from non-Western populations. The Egyptian population has high genetic variation reflecting its divergent ethnic origins, and the incidence rate of familial breast cancer in Egypt is also higher than the rate in many other populations. Using whole exome sequencing, we investigated genetic predisposition in five Egyptian familial breast cancer families. No pathogenic variants in BRCA1, BRCA2 and other classical breast cancer-predisposition genes were present in these five families. Comparison of the genetic variants with those in Caucasian familial breast cancer showed that variants in the Egyptian families were more variable and heterogeneous than the variants in Caucasian families. Multiple damaging variants in genes of different functional categories were identified either in a single family or shared between families. Our study demonstrates that genetic predisposition in Egyptian breast cancer families may differ from that in other disease populations, and supports a comprehensive screening of local disease families to determine the genetic predisposition in Egyptian familial breast cancer. PMID:28076423

  11. Prediction of viral microRNA precursors based on human microRNA precursor sequence and structural features.

    Science.gov (United States)

    Kumar, Shiva; Ansari, Faraz A; Scaria, Vinod

    2009-08-20

    MicroRNAs (small approximately 22 nucleotide long non-coding endogenous RNAs) have recently attracted immense attention as critical regulators of gene expression in multi-cellular eukaryotes, especially in humans. Recent studies have proved that viruses also express microRNAs, which are thought to contribute to the intricate mechanisms of host-pathogen interactions. Computational predictions have greatly accelerated the discovery of microRNAs. However, most of these widely used tools are dependent on structural features and sequence conservation which limits their use in discovering novel virus expressed microRNAs and non-conserved eukaryotic microRNAs. In this work an efficient prediction method is developed based on the hypothesis that sequence and structure features which discriminate between host microRNA precursor hairpins and pseudo microRNAs are shared by viral microRNA as they depend on host machinery for the processing of microRNA precursors. The proposed method has been found to be more efficient than recently reported ab-initio methods for predicting viral microRNAs and microRNAs expressed by mammals.

  12. Grain-size features of a Miocene loess-soil sequence at Qinan: Implications on its origin

    Institute of Scientific and Technical Information of China (English)

    QIAO Yansong; GUO Zhengtang; HAO Qingzhen; YIN Qiuzhen; YUAN Baoyin; LIU Tungsheng

    2006-01-01

    In this study, the grain-size of 507 bulk samples from the QA-I Miocene loess-soil sequence at Qinan was analyzed, and the grain-size features are compared with those of typical Quaternary loess and soil samples and of representative lacustrine and fluvial samples. The results indicate that the grain-size distribution pattern of the Miocene loess is essentially similar to that of Quaternary loess, but greatly differs from the lacustrine and fluvial sediments. Loess layers are regularly coarser than soil layers, indicating cyclical climate changes. Median grain-size along the 253.1 m sequence varies from 6 to 13 μm and the >63 μm fraction represents only 5.3% at maximum and 0.9% on average. Long-term grain-size variations are consistent with the loess accumulation rate at Qinan and with the eolian mass accumulation rate in the North Pacific. These features firmly indicate an eolian origin of the studied sequence, and also reveal coeval long-term changes in eolian grain-size and continental aridity in the dust source regions.

  13. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    Science.gov (United States)

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  14. Chromatin features of plant telomeric sequences at terminal versus internal positions

    Directory of Open Access Journals (Sweden)

    Eva Majerová

    2014-11-01

    Epigenetic mechanisms are involved in regulation of crucial cellular processes in eukaryotic organisms. Data on the epigenetic features of plant telomeres and their epigenetic regulation were published mostly for Arabidopsis thaliana, in which the presence of interstitial telomeric repeats (ITRs) may interfere with genuine telomeres in most analyses. Here, we studied the epigenetic landscape and transcription of telomeres and ITRs in Nicotiana tabacum with long telomeres and no detectable ITRs, and in Ballantinia antipoda with large blocks of pericentromeric ITRs and relatively short telomeres. Chromatin of genuine telomeres displayed heterochromatic as well as euchromatic marks, while ITRs were just heterochromatic. Methylated cytosines were present at telomeres and ITRs, but showed a bias with more methylation towards distal telomere positions and different blocks of B. antipoda ITRs methylated to different levels. Telomeric transcripts TERRA (G-rich) and ARRET (C-rich) were identified in both plants and their levels varied among tissues with a maximum in blossoms. Plants with substantially different proportions of internally and terminally located telomeric repeats are instrumental in clarifying the chromatin status of telomeric repeats at distinct chromosome locations.

  15. A Systematic Evaluation of Feature Selection and Classification Algorithms Using Simulated and Real miRNA Sequencing Data

    Directory of Open Access Journals (Sweden)

    Sheng Yang

    2015-01-01

    Sequencing is widely used to discover associations between microRNAs (miRNAs) and diseases. However, the negative binomial distribution (NB) and high dimensionality of data obtained using sequencing can lead to low-power results and low reproducibility. Several statistical learning algorithms have been proposed to address sequencing data, and although evaluation of these methods is essential, such studies are relatively rare. The performance of seven feature selection (FS) algorithms, including baySeq, DESeq, edgeR, the rank sum test, lasso, particle swarm optimistic decision tree, and random forest (RF), was compared by simulation under different conditions based on the difference of the mean, the dispersion parameter of the NB, and the signal to noise ratio. Real data were used to evaluate the performance of RF, logistic regression, and support vector machine. Based on the simulation and real data, we discuss the behaviour of the FS and classification algorithms. The Apriori algorithm identified frequent item sets (mir-133a, mir-133b, mir-183, mir-937, and mir-96) from among the deregulated miRNAs of six datasets from The Cancer Genomics Atlas. Taking these findings altogether and considering computational memory requirements, we propose a strategy that combines edgeR and DESeq for large sample sizes.

  16. A Systematic Evaluation of Feature Selection and Classification Algorithms Using Simulated and Real miRNA Sequencing Data.

    Science.gov (United States)

    Yang, Sheng; Guo, Li; Shao, Fang; Zhao, Yang; Chen, Feng

    2015-01-01

    Sequencing is widely used to discover associations between microRNAs (miRNAs) and diseases. However, the negative binomial distribution (NB) and high dimensionality of data obtained using sequencing can lead to low-power results and low reproducibility. Several statistical learning algorithms have been proposed to address sequencing data, and although evaluation of these methods is essential, such studies are relatively rare. The performance of seven feature selection (FS) algorithms, including baySeq, DESeq, edgeR, the rank sum test, lasso, particle swarm optimistic decision tree, and random forest (RF), was compared by simulation under different conditions based on the difference of the mean, the dispersion parameter of the NB, and the signal to noise ratio. Real data were used to evaluate the performance of RF, logistic regression, and support vector machine. Based on the simulation and real data, we discuss the behaviour of the FS and classification algorithms. The Apriori algorithm identified frequent item sets (mir-133a, mir-133b, mir-183, mir-937, and mir-96) from among the deregulated miRNAs of six datasets from The Cancer Genomics Atlas. Taking these findings altogether and considering computational memory requirements, we propose a strategy that combines edgeR and DESeq for large sample sizes.
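
    The notion of a frequent item set mentioned above can be illustrated with plain support counting. The sketch below is a simplified stand-in, not the Apriori algorithm itself (it enumerates all small combinations rather than pruning candidates level by level), and the three toy "datasets", the `mir-21` entry and the threshold are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Count item sets of up to max_size items and keep those that occur
    in at least min_support transactions."""
    counts = Counter()
    for items in transactions:
        unique = sorted(set(items))
        for size in range(1, max_size + 1):
            for combo in combinations(unique, size):
                counts[combo] += 1
    return {combo: n for combo, n in counts.items() if n >= min_support}


# Hypothetical lists of deregulated miRNAs, one set per dataset.
datasets = [
    {"mir-133a", "mir-133b", "mir-96"},
    {"mir-133a", "mir-96", "mir-21"},
    {"mir-133a", "mir-133b", "mir-96"},
]
for itemset, support in frequent_itemsets(datasets, min_support=3).items():
    print(itemset, support)
```

    For a handful of datasets and small item sets this exhaustive count is adequate; Apriori's pruning matters once the number of candidate items grows.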

  17. Alignment of the amino terminal amino acid sequence of human cytochrome c oxidase subunits I and II with the sequence of their putative mRNAs.

    OpenAIRE

    CHOMYN, A.; Hunkapiller, M W; Attardi, G

    1981-01-01

    Thirteen of the first fifteen amino acids from the NH2-terminus of the primary sequence of human cytochrome c oxidase subunit I and eleven of the first twelve amino acids of subunit II have been identified by microsequencing procedures. These sequences have been compared with the recently determined 5'-end proximal sequences of the HeLa cell mitochondrial mRNAs and unambiguously aligned with two of them. This alignment has allowed the identification of the putative mRNA for subunit I, and has...

  18. NetTurnP – Neural Network Prediction of Beta-turns by Use of Evolutionary Information and Predicted Protein Sequence Features

    DEFF Research Database (Denmark)

    Petersen, Bent; Lundegaard, Claus; Petersen, Thomas Nordahl

    2010-01-01

    ...... NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non......
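
    The Matthews correlation coefficient quoted for the two-class prediction is computed from the four cells of a confusion matrix. A minimal sketch of the standard formula follows; the function name and the example counts are illustrative assumptions and are not results from the paper.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal sum is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for a turn / not-turn classifier.
print(round(matthews_corrcoef(tp=60, tn=70, fp=30, fn=40), 3))
```

    Unlike plain accuracy, the coefficient stays near zero for a classifier that simply predicts the majority class, which is why it is often preferred for unbalanced two-class problems.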

  19. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Energy Technology Data Exchange (ETDEWEB)

    Aklujkar, Muktak [University of Massachusetts, Amherst; Krushkal, Julia [University of Texas, Austin; DiBartolo, Genevieve [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Lovley, Derek [University of Massachusetts, Amherst

    2009-01-01

    Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results. The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion. The genomic evidence suggests that metabolism, physiology...

  20. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Directory of Open Access Journals (Sweden)

    Aklujkar Muktak

    2009-05-01

    Background: The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results: The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second putative succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion: The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.

  1. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Energy Technology Data Exchange (ETDEWEB)

    Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve; Lapidus, Alla; Land, Miriam L.; Lovley, Derek R.

    2008-12-01

    Background: The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results: The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion: The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.

  2. The complete amino acid sequence of the major component myoglobin of Amazon river dolphin (Inia geoffrensis).

    Science.gov (United States)

    Dwulet, F E; Bogardt, R A; Jones, B N; Lehman, L D; Gurd, F R

    1975-12-02

    The complete amino acid sequence of the major component myoglobin from Amazon River dolphin, Inia geoffrensis, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Three easily separable peptides were obtained by cleaving the protein with cyanogen bromide at the methionine residues and four peptides were obtained by cleaving the methyl-acetimidated protein with trypsin at the arginine residues. From these peptides over 85% of the sequence was completed. The remainder of the sequence was obtained by fragmentation of the large cyanogen bromide peptide with trypsin. This protein differs from that of the common porpoise, Phocoena phocoena, at seven positions, from that of the common dolphin, Delphinus delphis, at 11 positions, and from that of the sperm whale, Physeter catodon, at 15 positions. By comparison of this sequence with the three-dimensional structure of sperm whale myoglobin it appears that those residues close to the heme group are most conserved followed by those in nonhelical regions and lastly by those in the helical segments. All of the substitutions observed in this sequence fit easily into the three-dimensional structure of the sperm whale myoglobin.

  3. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    Science.gov (United States)

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.
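
    Comparisons such as "differs at seven positions" or "more than 90% homology" reduce, for two sequences already aligned to equal length, to counting mismatched positions. The sketch below shows that bookkeeping; the function names and the short toy peptides are hypothetical and are not real myoglobin sequences, and no alignment step is performed.

```python
def differing_positions(a, b):
    """Return the 1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


def percent_identity(a, b):
    """Percentage of positions carrying the same residue in both sequences."""
    if not a:
        raise ValueError("empty sequence")
    return 100.0 * (len(a) - len(differing_positions(a, b))) / len(a)


# Toy peptides (hypothetical).
seq_a = "GLSDGEW"
seq_b = "GLSEGEW"
print(differing_positions(seq_a, seq_b), f"{percent_identity(seq_a, seq_b):.1f}%")
```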

  4. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    Science.gov (United States)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  5. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    Science.gov (United States)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2016-10-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  6. Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

    Directory of Open Access Journals (Sweden)

    Michael Andrew Meyer

    2013-07-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequence homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid-containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three-dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  7. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    Science.gov (United States)

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequence homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid-containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three-dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.
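
    Locating a short motif such as DSP in a protein sequence is a plain substring search reported in 1-based residue numbering. A minimal sketch follows; the function name and the toy sequence are assumptions for illustration, and the toy sequence is not the VP1 sequence.

```python
def find_motif(sequence, motif):
    """Return the 1-based start positions of every occurrence of motif,
    including overlapping ones."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start + 1)
        start = sequence.find(motif, start + 1)
    return positions


# Toy sequence (hypothetical).
toy = "AADSPKKDSP"
print(find_motif(toy, "DSP"))
print(find_motif(toy, "IDSP"))
```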

  8. Structural and Morphological Features of Acid-Bearing Polymers for PEM Fuel Cells

    DEFF Research Database (Denmark)

    Yang, Yunsong; Siu, Ana; Peckham, Timothy J.;

    2008-01-01

    Chemical structure, polymer microstructure, sequence distribution, and morphology of acid-bearing polymers are important factors in the design of polymer electrolyte membranes (PEMs) for fuel cells. The roles of ion aggregation and phase separation in vinylic- and aromatic-based polymers in proton conductivity and water transport are described. The formation, dimensions, and connectivity of ionic pathways are consistently found to play an important role in determining the physicochemical properties of PEMs. For polymers that possess low water content, phase separation and ionic channel formation significantly enhance the transport of water and protons. For membranes that contain a high content of water, phase separation is less influential. Continuity of ionic aggregates is influential on the diffusion of water and electroosmotic drag within a membrane. A balance of these properties must be considered......

  9. Identification of Amino Acid Sequences with Good Folding Properties in an Off-Lattice Model

    CERN Document Server

    Irbäck, Anders; Potthast, Frank

    2008-01-01

    Folding properties of a two-dimensional toy protein model containing only two amino-acid types, hydrophobic and hydrophilic, respectively, are analyzed. An efficient Monte Carlo procedure is employed to ensure that the ground states are found. The thermodynamic properties are found to be strongly sequence dependent in contrast to the kinetic ones. Hence, criteria for good folders are defined entirely in terms of thermodynamic fluctuations. With these criteria sequence patterns that fold well are isolated. For 300 chains with 20 randomly chosen binary residues approximately 10% meet these criteria. Also, an analysis is performed by means of statistical and artificial neural network methods from which it is concluded that the folding properties can be predicted to a certain degree given the binary numbers characterizing the sequences.

  10. Sequence-selective targeting of duplex DNA by peptide nucleic acids

    DEFF Research Database (Denmark)

    Nielsen, Peter E

    2010-01-01

    Sequence-selective gene targeting constitutes an attractive drug-discovery approach for genetic therapy, with the aim of reducing or enhancing the activity of specific genes at the transcriptional level, or as part of a methodology for targeted gene repair. The pseudopeptide DNA mimic peptide nucleic acid (PNA) can recognize duplex DNA with high sequence specificity and affinity in triplex, duplex and double-duplex invasive modes or non-invasive triplex modes. Novel PNA modification has improved the affinity for DNA recognition via duplex invasion, double-duplex invasion and triplex recognition considerably. Such modifications have also resulted in new approaches to targeted gene repair and sequence-selective double-strand cleavage of genomic DNA.

  11. Structure of the fully modified left-handed cyclohexene nucleic acid sequence GTGTACAC.

    Science.gov (United States)

    Robeyns, Koen; Herdewijn, Piet; Van Meervelt, Luc

    2008-02-13

    CeNA oligonucleotides consist of a phosphorylated backbone where the deoxyribose sugars are replaced by cyclohexene moieties. The X-ray structure determination and analysis of a fully modified octamer sequence GTGTACAC, which is the first crystal structure of a carbocyclic-based nucleic acid, is presented. This particular sequence was built with left-handed building blocks and crystallizes as a left-handed double helix. The helix can be characterized as belonging to the (mirrored) A-type family. Crystallographic data were processed up to 1.53 Å, and the octamer sequence crystallizes in the space group R32. The sugar puckering is found to adopt the 3H2 half-chair conformation which mimics the C3'-endo conformation of the ribose sugar. The double helices stack on top of each other to form continuous helices, and static disorder is observed due to this end-to-end stacking.

  12. NetTurnP--neural network prediction of beta-turns by use of evolutionary information and predicted protein sequence features.

    Directory of Open Access Journals (Sweden)

    Bent Petersen

    Full Text Available β-turns are the most common type of non-repetitive structures, and constitute on average 25% of the amino acids in proteins. The formation of β-turns plays an important role in protein folding, protein stability and molecular recognition processes. In this work we present the neural network method NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC=0.50, Qtotal=82.1%, sensitivity=75.6%, PPV=68.8% and AUC=0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17-0.47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively. CONCLUSION: The NetTurnP method has been implemented as a webserver, which is freely available at http://www.cbs.dtu.dk/services/NetTurnP/. NetTurnP is the only available webserver that allows submission of multiple sequences.
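
    The two-class figures quoted in this record (MCC, Qtotal, sensitivity, PPV) all derive from the four confusion-matrix counts. The Python sketch below shows the standard definitions only; the function name and the counts are hypothetical and are not taken from the BT426 evaluation.

```python
import math


def two_class_metrics(tp, fp, tn, fn):
    """Return MCC, Qtotal, sensitivity and PPV from confusion-matrix counts."""
    total = tp + fp + tn + fn
    # MCC denominator; it is zero when any row or column of the matrix is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
        "q_total": (tp + tn) / total if total else 0.0,
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "ppv": tp / (tp + fp) if (tp + fp) else 0.0,
    }


# Hypothetical counts chosen only to illustrate the arithmetic.
m = two_class_metrics(tp=60, fp=20, tn=100, fn=20)
print(m)
```

    MCC uses all four counts, so it is less sensitive to class imbalance than Qtotal, which may be why it is the headline figure when turn residues are a minority of the data.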

  13. Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

    Directory of Open Access Journals (Sweden)

    Samat Kozhakhmetov

    2014-01-01

    Full Text Available Introduction: Lactobacilli are bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microorganism generally regarded as safe (GRAS). It is also a homofermentative L-(+)-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of phenotypic and genotypic diversity. However, the studies of the genus were conducted mostly with an unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of the Center for Life Sciences. After screening isolates for probiotic properties, a highly active Lactobacillus spp. strain was chosen. Genomic DNA was extracted according to the manufacturer's protocol (Wizard® Genomic DNA Purification Kit). The highly active Lactobacillus strain was identified as Lactobacillus rhamnosus according to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX) platform. The initial draft assembly was prepared from 14 large contigs (20 contigs in total) by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT). Conclusion: A full genome sequencing of selected strains of lactic acid bacteria was made during the study.

  14. Metazoan Remaining Genes for Essential Amino Acid Biosynthesis: Sequence Conservation and Evolutionary Analyses

    Directory of Open Access Journals (Sweden)

    Igor R. Costa

    2014-12-01

    Full Text Available Essential amino acids (EAA) consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS) and betaine-homocysteine S-methyltransferase (BHMT) diverged from the expected Tree of Life (ToL) relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.
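
    The abstract does not describe the authors' Python algorithm, so the sketch below is not a reconstruction of it. It only illustrates one common way to quantify sequence conservation: the fraction of sequences that share the most frequent residue in each column of an alignment. The function name and the toy alignment are hypothetical.

```python
from collections import Counter


def column_conservation(aligned):
    """Per-column fraction of sequences carrying the most common residue.

    `aligned` is a list of equal-length aligned sequences; '-' marks a gap
    and is never counted as a residue.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be non-empty and of equal length")
    scores = []
    for column in zip(*aligned):
        residues = [c for c in column if c != "-"]
        if not residues:
            scores.append(0.0)  # all-gap column
            continue
        top_count = Counter(residues).most_common(1)[0][1]
        scores.append(top_count / len(column))
    return scores


# Toy alignment, not real ALS or BHMT sequence data.
scores = column_conservation(["MKTA", "MKSA", "MR-A"])
print(scores)
```

    Dividing by the full column height rather than by the number of non-gap residues penalizes gapped columns; that is a design choice of this sketch, not something stated in the source.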

  15. CloudAligner: A fast and full-featured MapReduce based tool for sequence mapping

    Directory of Open Access Journals (Sweden)

    Shi Weisong

    2011-06-01

    Full Text Available Abstract Background: Research in genetics has developed rapidly in recent years with the aid of next-generation sequencing (NGS). However, massively parallel NGS produces enormous amounts of data, which leads to storage, compatibility, scalability, and performance issues. The Cloud Computing and MapReduce framework, which utilizes hundreds or thousands of shared computers to map sequencing reads quickly and efficiently to reference genome sequences, appears to be a very promising solution for these issues. Consequently, it has been adopted by many organizations recently, and the initial results are very promising. However, since these are only initial steps toward this trend, the developed software does not provide adequate primary functions like bisulfite, pair-end mapping, etc., in on-site software such as RMAP or BS Seeker. In addition, existing MapReduce-based applications were not designed to process the long reads produced by the most recent second-generation and third-generation NGS instruments and, therefore, are inefficient. Last, it is difficult for a majority of biologists untrained in programming skills to use these tools because most were developed on Linux with a command line interface. Results: To encourage the use of Cloud technologies in genomics and prepare for advances in second- and third-generation DNA sequencing, we have built a Hadoop MapReduce-based application, CloudAligner, which achieves higher performance, covers most primary features, is more accurate, and has a user-friendly interface. It was also designed to be able to deal with long sequences. The performance gain of CloudAligner over Cloud-based counterparts (35 to 80%) mainly comes from the omission of the reduce phase. In comparison to local-based approaches, the performance gain of CloudAligner is from the partition and parallel processing of the huge reference genome as well as the reads. The source code of CloudAligner is available at http

  16. SigWin-detector: a Grid-enabled workflow for discovering enriched windows of genomic features related to DNA sequences

    Directory of Open Access Journals (Sweden)

    Wibisono Adianto

    2008-08-01

    Full Text Available Abstract Background: Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Findings: Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic) features in a (DNA) sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs) in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. Conclusion: As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the
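
    The moving median mentioned in this record can be illustrated with a short Python sketch. This is a textbook sorted-window approach, not the new algorithm the authors developed (the abstract gives no details of it); the function name and the expression values are hypothetical.

```python
from bisect import bisect_left, insort


def moving_median(values, window):
    """Median of every length-`window` slice of `values` (odd window only)."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    if window > len(values):
        return []
    current = sorted(values[:window])  # sorted copy of the first window
    mid = window // 2
    medians = [current[mid]]
    # Slide the window: drop the value that leaves, insert the one that enters.
    for old, new in zip(values, values[window:]):
        del current[bisect_left(current, old)]
        insort(current, new)
        medians.append(current[mid])
    return medians


# Hypothetical expression levels ordered by chromosomal position.
expr = [1, 5, 2, 8, 7, 3, 9]
print(moving_median(expr, 3))  # [2, 5, 7, 7, 7]
```

    Keeping the window sorted avoids re-sorting at every position; each step costs a binary search plus a list shift instead of a full sort.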

  17. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    Directory of Open Access Journals (Sweden)

    Yan Koon-Kiu

    2007-11-01

    Full Text Available Abstract Background: The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results: We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form $\sim p^{-\gamma}$ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion: We separately measure the short-term ("raw") duplication and deletion rates $r^{*}_{\mathrm{dup}}$, $r^{*}_{\mathrm{del}}$...

  18. Purification, amino-acid sequence and partial characterization of two toxins with anti-insect activity from the venom of the South American scorpion Tityus bahiensis (Buthidae).

    Science.gov (United States)

    Pimenta, A M; Martin-Eauclaire, M; Rochat, H; Figueiredo, S G; Kalapothakis, E; Afonso, L C; De Lima, M E

    2001-07-01

    We report here the isolation by a two-step chromatographic procedure of two new toxins from the South American scorpion Tityus bahiensis. Their amino-acid sequences and some of their biological features were established. The two toxins have different biological properties. Toxin TbIT-I had almost no activity or pharmacological effects in vertebrate tissues whereas it was lethal to house flies (LD50 80.0 ng/house fly). In contrast, Tb2-II was active against both mammals (intracerebroventricular injection of 100 ng/mouse was lethal) and insects (LD50 40.0 ng/house fly). The amino-acid sequences of these toxins were established and found to be similar (60-95%) to previously described beta-toxins from the Tityus genus. Based on the available comparative information, this study attempts to identify possible structure-function relationships that may be responsible for the differences in bioactivity displayed by these toxins.

  19. Genome sequence of Ensifer medicae strain WSM1115; an acid-tolerant Medicago-nodulating microsymbiont from Samothraki, Greece.

    Science.gov (United States)

    Reeve, Wayne; Ballard, Ross; Howieson, John; Drew, Elizabeth; Tian, Rui; Bräu, Lambert; Munk, Christine; Davenport, Karen; Chain, Patrick; Goodwin, Lynne; Pagani, Ioanna; Huntemann, Marcel; Mavrommatis, Konstantinos; Pati, Amrita; Markowitz, Victor; Ivanova, Natalia; Woyke, Tanja; Kyrpides, Nikos

    2014-06-15

    Ensifer medicae strain WSM1115 forms effective nitrogen fixing symbioses with a range of annual Medicago species and is used in commercial inoculants in Australia. WSM1115 is an aerobic, motile, Gram-negative, non-spore-forming rod. It was isolated from a nodule recovered from the root of burr medic (Medicago polymorpha) collected on the Greek Island of Samothraki. WSM1115 has a broad host range for nodulation and N2 fixation capacity within the genus Medicago, although this does not extend to all medic species. WSM1115 is considered saprophytically competent in moderately acid soils (pH(CaCl2) 5.0), but it has failed to persist at field sites where soil salinity exceeded 10 ECe (dS/m). Here we describe the features of E. medicae strain WSM1115, together with genome sequence information and its annotation. The 6,861,065 bp high-quality-draft genome is arranged into 7 scaffolds of 28 contigs, contains 6,789 protein-coding genes and 83 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  20. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    Science.gov (United States)

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 × 10^6 propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 μmol mg of protein^-1 h^-1. (ABSTRACT TRUNCATED AT 250 WORDS)

  1. alpha-Amylase gene of Streptomyces limosus: nucleotide sequence, expression motifs, and amino acid sequence homology to mammalian and invertebrate alpha-amylases.

    OpenAIRE

    1987-01-01

    The nucleotide sequence of the coding and regulatory regions of the alpha-amylase gene (aml) of Streptomyces limosus was determined. High-resolution S1 mapping was used to locate the 5' end of the transcript and demonstrated that the gene is transcribed from a unique promoter. The predicted amino acid sequence has considerable identity to mammalian and invertebrate alpha-amylases, but not to those of plant, fungal, or eubacterial origin. Consistent with this is the susceptibility of the enzym...

  2. Development of a SCAR (sequence-characterised amplified region) marker for acid resistance-related gene in Lactobacillus plantarum.

    Science.gov (United States)

    Liu, Shu-Wen; Li, Kai; Yang, Shi-Ling; Tian, Shu-Fen; He, Ling

    2015-03-01

    A sequence characterised amplified region marker was developed to determine an acid resistance-related gene in Lactobacillus plantarum. A random amplified polymorphic DNA marker named S116-680 was reported to be closely related to the acid resistance of the strains. The DNA band corresponding to this marker was cloned and sequenced with the induction of specific designed PCR primers. The results of PCR test helped to amplify a clear specific band of 680 bp in the tested acid-resistant strains. S116-680 marker would be useful to explore the acid-resistant mechanism of L. plantarum and to screen desirable malolactic fermentation strains.

  3. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    Energy Technology Data Exchange (ETDEWEB)

    Croteau, Rodney Bruce (Pullman, WA); Wildung, Mark Raymond (Colfax, WA); Lange, Bernd Markus (Pullman, WA); McCaskill, David G. (Pullman, WA)

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  4. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Gclust Server. Data name: Amino acid sequences of predicted proteins and their annotation for 95 organism species. Description of data contents: Amino acid sequences of predicted proteins and their a...

  5. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    Directory of Open Access Journals (Sweden)

    Xiaoyu Wang

    Full Text Available Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  6. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    Science.gov (United States)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  7. Amino acid sequence alignment of vertebrate CAPN3/calpain-3/p94

    Directory of Open Access Journals (Sweden)

    Yasuko Ono

    2015-12-01

    Full Text Available CAPN3 is a calpain superfamily member that is predominantly expressed in skeletal muscle. So far, clear CAPN3 orthologs have been found only in vertebrates. CAPN3 is a unique protease in that it undergoes extremely rapid and exhaustive autolysis and that autolyzed fragments spontaneously associate with each other to reconstitute the proteolytic activity. These unique properties of CAPN3 are dependent on IS1 and IS2, two CAPN3-characterizing sequences that do not exist in other calpains or any other proteases. To understand how IS1 and IS2 are conserved among vertebrates, this data article provides amino acid sequence alignment of representative vertebrate CAPN3s. For further analysis and discussion, see Ono et al. [1].

  8. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    Directory of Open Access Journals (Sweden)

    A.E. El Hakim

    2015-12-01

    Full Text Available Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII, were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on a calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were 60 and 56 kDa for LAAOI and 60 and 53 kDa for LAAOII. The tryptic peptides of the two isoforms share high sequence homology with other snake venom l-amino acid oxidases. The optimal pH and temperature values of Cc-LAAOI and Cc-LAAOII were 7.8, 50 °C and 7, 60 °C, respectively. The two isoenzymes were thermally stable up to 70 °C. The Km and Vmax values were 0.67 mM, 0.135 μmol/min for LAAOI and 0.82 mM, 0.087 μmol/min for LAAOII. Both isoenzymes displayed high catalytic preference for long-chain, hydrophobic and aromatic amino acids. The Mn2+ ion markedly increased the LAAO activity for both purified isoforms, while Na+, K+, Ca2+, Mg2+ and Ba2+ ions showed a non-significant increase in the enzymatic activity of both isoforms. Furthermore, Zn2+, Ni2+, Co2+, Cu2+ and Al3+ ions markedly inhibited the LAAOI and LAAOII activities. l-Cysteine and reduced glutathione completely inhibited the LAAO activity of both isoenzymes, whereas β-mercaptoethanol, o-phenanthroline and PMSF completely inhibited the enzymatic activity of LAAOII. Furthermore, iodoacetic acid inhibited the enzymatic activity of LAAOII by 46% and had no effect on the LAAOI activity.
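
    The Km and Vmax values reported in this record can be read through the Michaelis-Menten rate law, v = Vmax·[S] / (Km + [S]). The Python sketch below assumes simple Michaelis-Menten kinetics for both isoforms; the function name and the substrate concentrations are hypothetical, and the abstract does not state which substrate the constants refer to.

```python
def mm_rate(s_mM, km_mM, vmax):
    """Michaelis-Menten rate at substrate concentration `s_mM` (in mM).

    The result has the same unit as `vmax` (here, μmol/min).
    """
    if s_mM < 0:
        raise ValueError("substrate concentration cannot be negative")
    return vmax * s_mM / (km_mM + s_mM)


# Constants as reported in the abstract: (Km in mM, Vmax in μmol/min).
isoforms = {"LAAOI": (0.67, 0.135), "LAAOII": (0.82, 0.087)}

# Hypothetical substrate concentrations, in mM.
for name, (km, vmax) in isoforms.items():
    rates = [round(mm_rate(s, km, vmax), 4) for s in (0.1, 1.0, 10.0)]
    print(name, rates)
```

    When [S] equals Km the rate is exactly half of Vmax, so the lower Km of LAAOI means it reaches half-maximal velocity at a lower substrate concentration than LAAOII.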

  9. Geochemical features and effects on deep-seated fluids during the May-June 2012 southern Po Valley seismic sequence

    Directory of Open Access Journals (Sweden)

    Francesco Italiano

    2012-10-01

    Full Text Available A periodic sampling of the groundwaters and dissolved and free gases in selected deep wells located in the area affected by the May-June 2012 southern Po Valley seismic sequence has provided insight into seismogenic-induced changes of the local aquifer systems. The results obtained show progressive changes in the fluid geochemistry, allowing it to be established that deep-seated fluids were mobilized during the seismic sequence and reached surface layers along faults and fractures, which generated significant geochemical anomalies. The May-June 2012 seismic swarm (mainshock on May 29, 2012, M 5.8; 7 shocks M > 5, about 200 events 3 < M < 5) induced several modifications in the circulating fluids. This study reports the preliminary results obtained for the geochemical features of the waters and gases collected over the epicentral area from boreholes drilled at different depths, thus intercepting water and gases with different origins and circulation. The aim of the investigations was to improve our knowledge of the fluids circulating over the seismic area (e.g. origin, provenance, interactions, mixing of different components, temporal changes). This was achieved by collecting samples from both shallow and deep-drilled boreholes, and then, after the selection of the relevant sites, we looked for temporal changes with mid-to-long-term monitoring activity following a constant sampling rate. This allowed us to gain better insight into the relationships between the fluid circulation and the faulting activity. The sampling sites are listed in Table 1, along with the analytical results of the gas phase. […]

  10. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    Science.gov (United States)

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major groove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  11. Comparative amino acid sequence analysis of hemolysins produced by Vibrio hollisae and Vibrio parahaemolyticus.

    OpenAIRE

    Yoh, M; Honda, T.; Miwatani, T; Tsunasawa, S; Sakiyama, F

    1989-01-01

    Vibrio hollisae produces a hemolysin (Vh-rTDH) that is related to the thermostable direct hemolysin of Vibrio parahaemolyticus (Vp-TDH). Although both hemolysins are essentially similar biologically and immunologically, they differ markedly in heat stability; Vp-TDH is heat stable, whereas Vh-rTDH is heat labile. To elucidate the relationships between their characteristics and molecular structures, we analyzed the amino acid sequence of Vh-rTDH and compared it with that of Vp-TDH. Vh-rTDH con...

  12. Cloning and expression of Chromobacterium violaceum phenylalanine hydroxylase in Escherichia coli and comparison of amino acid sequence with mammalian aromatic amino acid hydroxylases.

    Science.gov (United States)

    Onishi, A; Liotta, L J; Benkovic, S J

    1991-10-05

    The complete amino acid sequence (296 amino acids) of Chromobacterium violaceum phenylalanine hydroxylase (PAH) was determined by nucleotide analysis of a DNA clone isolated using both a synthetic oligonucleotide probe based on the NH2-terminal amino acid sequence and an antibody against this enzyme. The ApaL I fragment (approximately 1.9 kilobase pairs) containing the entire PAH gene was subcloned in pBluescript II and induced by isopropyl-beta-D-thiogalactopyranoside. In order to eliminate fusion proteins the XbaI/ClaI fragment which contained the PAH gene from the Bluescript construct was subcloned into pMAC 5-8 containing the TAC promoter. The recombinant protein reacts with antibody raised to authentic C. violaceum PAH and its NH2-terminal 20-amino acid sequence and COOH-terminal amino acid residue were identical with the wild-type protein. Key physical and chemical characteristics of the recombinant protein, i.e. its copper content and Michaelis-Menten parameters, were the same as wild-type. Comparison of amino acid sequences revealed a highly conserved region between C. violaceum PAH and three different mammalian aromatic amino acid hydroxylases. This conserved area may well be a catalytically important domain of these pterin- and metal-requiring aromatic amino acid hydroxylases. The over-expression of C. violaceum PAH in Escherichia coli will facilitate the analysis of the enzyme mechanism by various spectroscopic methods.

  13. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    Science.gov (United States)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have an important role in environmental issues not only on the Earth but also on other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. Various iron-bearing alteration products occur around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite, leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrences of alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of the core and the crust. The reddish core is characterized by occurrence of crystalline hematite, as shown by XRD. Another hand specimen has a light gray core, which represents reduced conditions, and a white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, a typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material show that iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. 
Variations of alteration

  14. Hemodynamic variables and clinical features correlated with serum uric acid in patients with pulmonary arterial hypertension

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Background Serum uric acid (UA), the final product of purine degradation, has been proposed to be a marker for the severity and a possible predictor of mortality in patients with pulmonary arterial hypertension (PAH). The objectives of this study were to elucidate whether serum UA level correlates with the clinical features and the hemodynamic variables in Chinese patients with PAH and to compare the difference of the correlates in patients associated with different etiologies. Methods Serum UA was assessed in 228 patients with three types of PAH (idiopathic PAH (IPAH), congenital heart disease related PAH (CHD-PAH) and connective tissue disease related PAH (CTD-PAH)) together with other clinical features. After the individualized treatment for at least 6 months, the UA levels and clinical features were re-evaluated in 88 patients. Results Serum UA was significantly elevated in patients with PAH compared with age-matched control subjects ((350.40±108.73) μmol/L vs (266.91±81.38) μmol/L, P<0.001). Serum UA negatively correlated with cardiac output and mixed venous saturation (SvO2) in all three types of PAH (all P<0.05), positively correlated with the size of right ventricle in IPAH (P=0.002) and CTD-PAH (P=0.013) patients and with pulmonary vascular resistance just in CTD-PAH patients (P=0.001). Serum UA significantly decreased from (365.80±120.46) μmol/L to (333.67±117.56) μmol/L in 88 patients (P=0.006) with vasodilator therapy for at least 6 months, accompanied with a reduction in pulmonary vascular resistance from (15.13±6.96) Woods unit to (12.00±5.04) Woods unit (P=0.001) and an increase in cardiac output from (2.63±0.98) L/min to (3.08±1.04) L/min (P=0.005). Conclusions Serum UA increases in proportion to the clinical severity in all three types of PAH, with stronger correlations in CTD-PAH than in IPAH and CHD-PAH. The serum UA levels could also partly reflect the response to treatment in patients with PAH.

  15. New monoclonal antibodies to the Ebola virus glycoprotein: Identification and analysis of the amino acid sequence of the variable domains.

    Science.gov (United States)

    Panina, A A; Aliev, T K; Shemchukova, O B; Dement'yeva, I G; Varlamov, N E; Pozdnyakova, L P; Bokov, M N; Dolgikh, D A; Sveshnikov, P G; Kirpichnikov, M P

    2016-03-01

    We determined the nucleotide and amino acid sequences of variable domains of three new monoclonal antibodies to the glycoprotein of Ebola virus capsid. The framework and hypervariable regions of immunoglobulin heavy and light chains were identified. The primary structures were confirmed using mass spectrometry analysis. Immunoglobulin database search showed the uniqueness of the sequences obtained.

  16. Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

    Directory of Open Access Journals (Sweden)

    D. Satyanarayana Rao

    2007-02-01

    We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and is sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure.

  17. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Directory of Open Access Journals (Sweden)

    James B Howard

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf, and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification

  18. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Science.gov (United States)

    Howard, James B; Kechris, Katerina J; Rees, Douglas C; Glazer, Alexander N

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases for
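
    The conservation analysis described in this abstract can be illustrated with a minimal sketch that counts the distinct residues in each column of a set of co-aligned sequences. The toy alignment and the function names (`distinct_residues_per_column`, `invariant_columns`) are assumptions made for illustration only; they are not the authors' data, grouping criteria, or software, and treating gapped columns as non-conserved is likewise an assumed convention.

```python
from typing import List


def distinct_residues_per_column(alignment: List[str], gap: str = "-") -> List[int]:
    """Return, for each alignment column, the number of distinct residues.

    Columns that contain a gap are reported as 0 so that they are never
    mistaken for conserved positions (an assumed convention).
    """
    if not alignment:
        return []
    width = len(alignment[0])
    if any(len(row) != width for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    counts = []
    for column in zip(*alignment):  # iterate column by column
        counts.append(0 if gap in column else len(set(column)))
    return counts


def invariant_columns(alignment: List[str]) -> List[int]:
    """Return 1-based indices of columns in which every sequence agrees."""
    counts = distinct_residues_per_column(alignment)
    return [i + 1 for i, n in enumerate(counts) if n == 1]


# Hypothetical toy alignment (not nitrogenase data).
toy_alignment = [
    "MCHAK-L",
    "MCHSKTL",
    "MCHAKTL",
    "MCYAKTL",
]

print(distinct_residues_per_column(toy_alignment))
print(invariant_columns(toy_alignment))
```

    In a real analysis the alignment would come from an alignment program, and the per-column counts would only be a starting point for the group-wise comparisons the abstract describes.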

  19. Pathophysiology, clinical features and radiological findings of differentiation syndrome/all-trans-retinoic acid syndrome

    Institute of Scientific and Technical Information of China (English)

    Luciano Cardinale; Francesco Asteggiano; Federica Moretti; Federico Torre; Stefano Ulisciani; Carmen Fava; Giovanna Rege-Cambrin

    2014-01-01

    In acute promyelocytic leukemia, differentiation therapy based on all-trans-retinoic acid can be complicated by the development of a differentiation syndrome (DS). DS is a life-threatening complication, characterized by respiratory distress, unexplained fever, weight gain, interstitial lung infiltrates, pleural or pericardial effusions, hypotension and acute renal failure. The diagnosis of DS is made on clinical grounds and has proven to be difficult, because none of the symptoms is pathognomonic for the syndrome and there are no definitive diagnostic criteria. DS can have subtle signs and symptoms at presentation but progress rapidly, and the end-stage clinical picture resembles acute respiratory distress syndrome, with an extremely poor prognosis; it is therefore of absolute importance to be aware of these complications and to initiate therapy as soon as DS is suspected. The radiologic appearance resembles the typical features of cardiogenic pulmonary edema. Diagnosing DS remains a demanding task for radiologists and haematologists, and their cooperation is of the utmost importance to suspect DS, detect its early signs, examine the patients’ behaviour and rapidly detect the complications.

  20. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    Science.gov (United States)

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  1. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    Science.gov (United States)

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT 9110 and Gluconobacter japonicus CECT 8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains that oxidize d-glucose effectively. PMID:27365351

  2. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    Science.gov (United States)

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  3. Complete amino acid sequence of the myoglobin from the Atlantic bottlenosed dolphin, Tursiops truncatus.

    Science.gov (United States)

    Jones, B N; Vigna, R A; Dwulet, F E; Bogardt, R A; Lehman, L D; Gurd, F R

    1976-10-05

    The complete amino acid sequence of the major component myoglobin from the Atlantic bottlenosed dolphin, Tursiops truncatus, was determined by specific cleavage of the protein to obtain large peptides that are readily degraded by the automatic sequencer. Three easily separable peptides were obtained by cleaving the protein with cyanogen bromide at the 2 methionine residues and 4 peptides were obtained by cleaving the methyl acetimidated protein with trypsin at the 3 arginine residues. By subjecting 4 of these peptides and the apomyoglobin to automatic Edman degradation, over 80% of the covalent structure of the protein was obtained. The remainder of the primary structure was determined by further digestion of the central cyanogen bromide peptide with trypsin and staphylococcal protease. This myoglobin differs from that of the sperm whale, Physeter catodon, at 15 positions, from that of the California gray whale, Eschrichtius gibbosus, at 14 positions, from that of the common porpoise, Phocoena phocoena, at 6 positions, and from the myoglobin of the Black Sea dolphin, Delphinus delphis, and the Amazon River dolphin, Inia geoffrensis, at 5 and 7 positions, respectively. All substitutions observed in this sequence fit easily into the tertiary structure of sperm whale myoglobin.

  4. Purification, amino-acid sequence and partial characterization of two toxins with anti-insect activity from the venom of the South American scorpion Tityus bahiensis (Buthidae).

    OpenAIRE

    2001-01-01

    We report here the isolation by a two-step chromatographic procedure of two new toxins from the South American scorpion Tityus bahiensis. Their amino-acid sequences and some of their biological features were established. The two toxins have different biological properties. Toxin TbIT-I had almost no activity or pharmacological effects in vertebrate tissues whereas it was lethal to house flies (LD50 80.0 ng/house fly). In contrast, Tb2-II was active against both mammals (intracerebroventricular ...

  5. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    Directory of Open Access Journals (Sweden)

    Ashutosh Kumar

    2016-08-01

    Owing to the reports of microcephaly as a consistent outcome in the foetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favour of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker ‘retinoic acid’ in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause foetal brain defects (for which maternal-foetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although bioinformatic

  6. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and
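
    The screening step described in this abstract amounts to counting occurrences of the RARE consensus hexamer 5′–AGGTCA–3′ in a genome sequence. The following is a minimal sketch of such a count, not the authors' pipeline: the function names, the toy sequence, and the optional scan of the reverse-complement strand are all assumptions introduced for illustration (the abstract does not state which strand was searched or how overlaps were handled).

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def count_occurrences(seq: str, motif: str) -> int:
    """Count occurrences of motif in seq, allowing overlapping matches."""
    k = len(motif)
    return sum(1 for i in range(len(seq) - k + 1) if seq[i:i + k] == motif)


def count_rare_motif(seq: str, motif: str = "AGGTCA", both_strands: bool = False) -> int:
    """Count the RARE consensus hexamer in a DNA sequence.

    With both_strands=True the reverse complement of the motif is counted
    as well; this option is an assumption, not something stated in the source.
    """
    seq = seq.upper()
    motif = motif.upper()
    total = count_occurrences(seq, motif)
    if both_strands:
        rc = reverse_complement(motif)
        if rc != motif:  # avoid double counting palindromic motifs
            total += count_occurrences(seq, rc)
    return total


# Hypothetical toy sequence, not taken from any ZIKV genome.
toy_sequence = "TTAGGTCAGGCTGACCTAAGGTCA"

print(count_rare_motif(toy_sequence))
print(count_rare_motif(toy_sequence, both_strands=True))
```

    Applied to real data, the sequence would be read from a GenBank or FASTA record, and raw counts would need to be normalized for genome length before any comparison between viruses.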

  7. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    Energy Technology Data Exchange (ETDEWEB)

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L. (Univ. of Massachusetts Medical School, Worcester (USA))

    1988-06-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity.

  8. Sequence stratigraphic features of the Middle Permian Maokou Formation in the Sichuan Basin and their controls on source rocks and reservoirs

    Directory of Open Access Journals (Sweden)

    Wang Su

    2015-11-01

    Well Shuangyushi 1 and Well Nanchong 1, deployed in the NW and central Sichuan Basin, have obtained a high-yield industrial gas flow in the dolomite and karst reservoirs of the Middle Permian Maokou Formation, showing good exploration prospects of the Maokou Formation. In order to identify the sequence stratigraphic features of the Maokou Formation, its sequence stratigraphy was divided and a unified sequence stratigraphic framework applicable to the entire basin was established to analyze the stratigraphic denudation features within the sequence framework by using the spectral curve trend attribute analysis, together with drilling and outcrop data. On this basis, the controls of sequence on source rocks and reservoirs were analyzed. In particular, the Maokou Formation was divided into two third-order sequences: SQ1 and SQ2. SQ1 was composed of Mao 1 Member and Mao 3 Member, while SQ2 was composed of Mao 4 Member. Sequence stratigraphic correlation indicated that the Maokou Formation within the basin had experienced erosion to varying extent, forming “three intense and two weak” denuded regions, among which the upper part of SQ2 was slightly denuded in the two weak denuded regions (SW Sichuan Basin and locally Eastern Sichuan Basin), while SQ2 was denuded out in the three intense denuded regions (Southern Sichuan Basin–Central Sichuan Basin, NE and NW Sichuan Basin). The development of source rocks and reservoirs within the sequence stratigraphic framework was significantly affected by sequence boundary; the grain banks that can form effective reservoirs were predominantly distributed in the SQ1 highstand systems tract (HST), while effective source rocks were predominantly distributed in the SQ1 transgressive systems tract (TST). It is concluded that the sequence division method is objective and reasonable, and can effectively guide oil and gas exploration in this region.

  9. Self-organizing maps: A tool to ascertain taxonomic relatedness based on features derived from 16S rDNA sequence

    Indian Academy of Sciences (India)

    D V Raje; H J Purohit; Y P Badhe; S S Tambe; B D Kulkarni

    2010-12-01

    Exploitation of microbial wealth, of which almost 95% or more is still unexplored, is a growing need. The taxonomic placements of a new isolate based on phenotypic characteristics are now being supported by information preserved in the 16S rRNA gene. However, the analysis of 16S rDNA sequences retrieved from metagenome, by the available bioinformatics tools, is subject to limitations. In this study, the occurrences of nucleotide features in 16S rDNA sequences have been used to ascertain the taxonomic placement of organisms. The tetra- and penta-nucleotide features were extracted from the training data set of the 16S rDNA sequence, and were subjected to an artificial neural network (ANN) based tool known as self-organizing map (SOM), which helped in visualization of unsupervised classification. For selection of significant features, principal component analysis (PCA) or curvilinear component analysis (CCA) was applied. The SOM along with these techniques could discriminate the sample sequences with more than 90% accuracy, highlighting the relevance of features. To ascertain the confidence level in the developed classification approach, the test data set was specifically evaluated for Thiobacillus, with Acidiphilium, Paracoccus and Starkeya, which are taxonomically reassigned. The evaluation proved the excellent generalization capability of the developed tool. The topology of genera in SOM supported the conventional chemo-biochemical classification reported in Bergey's Manual.
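
    The tetra- and penta-nucleotide features mentioned in this abstract can be pictured as k-mer frequency vectors. The sketch below shows one plausible way to build such a vector; the function name `kmer_frequencies`, the normalization to relative frequencies, and the choice to ignore windows containing ambiguous bases are assumptions made for illustration, since the abstract does not specify how the features were encoded.

```python
from collections import Counter
from itertools import product
from typing import List


def kmer_frequencies(seq: str, k: int = 4) -> List[float]:
    """Return relative frequencies of all 4**k DNA k-mers in a fixed order.

    The vector is ordered lexicographically over the alphabet A, C, G, T.
    Windows containing any other symbol (for example N) are ignored.
    """
    seq = seq.upper()
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


# Hypothetical toy sequence, far shorter than a real 16S rDNA gene.
toy = "ACGTACGT"
tetra = kmer_frequencies(toy, k=4)
penta = kmer_frequencies(toy, k=5)

print(len(tetra), len(penta))
```

    Vectors of this kind (256 values for tetranucleotides, 1024 for pentanucleotides) could then be reduced by PCA and passed to a self-organizing map, as the abstract outlines.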

  10. Geometric Feature-Based Facial Expression Recognition in Image Sequences Using Multi-Class AdaBoost and Support Vector Machines

    Directory of Open Access Journals (Sweden)

    Joonwhoan Lee

    2013-06-01

    Facial expressions are widely used in the behavioral interpretation of emotions, cognitive science, and social interactions. In this paper, we present a novel method for fully automatic facial expression recognition in facial image sequences. As the facial expression evolves over time, facial landmarks are automatically tracked in consecutive video frames, using displacements based on elastic bunch graph matching displacement estimation. Feature vectors from individual landmarks, as well as from pairs of landmarks, are extracted from the tracking results and normalized with respect to the first frame in the sequence. The prototypical expression sequence for each class of facial expression is formed by taking the median of the landmark tracking results from the training facial expression sequences. Multi-class AdaBoost, with the dynamic time warping similarity distance between the feature vector of the input facial expression and the prototypical facial expression used as a weak classifier, selects the subset of discriminative feature vectors. Finally, two methods for facial expression recognition are presented, either by using multi-class AdaBoost with dynamic time warping, or by using support vector machine on the boosted feature vectors. The results on the Cohn-Kanade (CK+) facial expression database show a recognition accuracy of 95.17% and 97.35% using multi-class AdaBoost and support vector machines, respectively.
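
    The dynamic time warping (DTW) distance that this abstract uses to compare an input sequence with a prototypical one can be sketched as follows. This is the textbook dynamic-programming formulation for scalar sequences with an absolute-difference local cost; the function name and the scalar inputs are assumptions for illustration, whereas the paper works with landmark-based feature vectors whose exact cost function is not given in the abstract.

```python
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the classic dynamic time warping distance between two sequences.

    cost[i][j] holds the minimal accumulated cost of aligning the first i
    elements of a with the first j elements of b.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # local mismatch cost
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats its element
                cost[i][j - 1],      # b advances, a repeats its element
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical 1-D trajectories standing in for one landmark coordinate over time.
prototype = [0.0, 1.0, 2.0]
slower_copy = [0.0, 1.0, 1.0, 2.0]

print(dtw_distance(prototype, slower_copy))
print(dtw_distance([0.0, 0.0], [1.0, 1.0]))
```

    Because DTW tolerates differences in timing, the same expression performed at a different speed yields a small distance, which is what makes it usable as the similarity measure inside a weak classifier.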

  11. Amino acid sequences of neuropeptides in the sinus gland of the land crab Cardisoma carnifex: a novel neuropeptide proteolysis site.

    Science.gov (United States)

    Newcomb, R W

    1987-08-01

    The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly-Ser-Ala-Glu-Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.
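
    The two cleavage sites named in this abstract can be located directly in the reported peptide H sequence. The short sketch below does only that: it finds the positions of the excised residues and lists the segments that would remain if both residues were removed. The helper name `middle_positions` is hypothetical, and the sketch makes no claim about which segment corresponds to which of the labeled peptides.

```python
from typing import List, Sequence

# Peptide H as given in the abstract (three-letter residue codes).
PEPTIDE_H: List[str] = (
    "Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly-"
    "Ser-Ala-Glu-Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu"
).split("-")


def middle_positions(residues: Sequence[str], triplet: Sequence[str]) -> List[int]:
    """Return 1-based positions of the middle residue of each triplet match."""
    target = list(triplet)
    return [
        start + 2  # 0-based start of the match -> 1-based index of its middle residue
        for start in range(len(residues) - 2)
        if list(residues[start:start + 3]) == target
    ]


arg_sites = middle_positions(PEPTIDE_H, ["Leu", "Arg", "Gly"])
ser_sites = middle_positions(PEPTIDE_H, ["Glu", "Ser", "Leu"])

# Segments left after removing the excised residues (1-based positions shifted to slices).
excised = sorted(arg_sites + ser_sites)
segments = []
previous = 0
for position in excised:
    segments.append(PEPTIDE_H[previous:position - 1])
    previous = position
segments.append(PEPTIDE_H[previous:])

print(len(PEPTIDE_H), ser_sites, arg_sites)
print(["-".join(segment) for segment in segments])
```

    Each motif occurs exactly once in the sequence, so the excised serine and arginine are identified unambiguously from the sequence alone.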

  12. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    Science.gov (United States)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes, cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  13. The myoglobin of Emperor penguin (Aptenodytes forsteri): amino acid sequence and functional adaptation to extreme conditions.

    Science.gov (United States)

    Tamburrini, M; Romano, M; Giardina, B; di Prisco, G

    1999-02-01

    In the framework of a study on molecular adaptations of the oxygen-transport and storage systems to extreme conditions in Antarctic marine organisms, we have investigated the structure/function relationship in Emperor penguin (Aptenodytes forsteri) myoglobin, in search of correlation with the bird life style. In contrast with previous reports, the revised amino acid sequence contains one additional residue and 15 differences. The oxygen-binding parameters seem well adapted to the diving behaviour of the penguin and to the environmental conditions of the Antarctic habitat. Addition of lactate has no major effect on myoglobin oxygenation over a large temperature range. Therefore, metabolic acidosis does not impair myoglobin function under conditions of prolonged physical effort, such as diving.

  14. Repetitive sequence based polymerase chain reaction to differentiate close bacteria strains in acidic sites

    Institute of Scientific and Technical Information of China (English)

    XIE Ming; YIN Hua-qun; LIU Yi; LIU Jie; LIU Xue-duan

    2008-01-01

    To study the diversity of bacteria strains newly isolated from several acid mine drainage (AMD) sites in China, repetitive sequence based polymerase chain reaction (rep-PCR), a well established technology for diversity analysis of closely related bacteria strains, was conducted on 30 strains of bacteria Leptospirillum ferriphilum, 8 strains of bacteria Acidithiobacillus ferrooxidans, as well as the Acidithiobacillus ferrooxidans type strain ATCC (American Type Culture Collection) 23270. The results showed that, using ERIC and BOX primer sets, rep-PCR produced highly discriminatory banding patterns. Phylogenetic analysis based on ERIC-PCR banding types was made and the results indicated that rep-PCR could be used as a rapid and highly discriminatory screening technique in studying bacterial diversity, especially in differentiating bacteria within one species in AMD.

  15. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Indian Academy of Sciences (India)

    Meng-Jun Li; Ai-Qin Li; Han Xia; Chuan-Zhi Zhao; Chang-Sheng Li; Shu-Bo Wan; Yu-Ping Bi; Xing-Jun Wang

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, beta-ketoacyl-ACP synthase (I, II, III), beta-ketoacyl-ACP reductase, beta-hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  16. Application of peptide nucleic acids containing azobenzene self-assembled electrochemical biosensors in detecting DNA sequences

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Hybridization of a peptide nucleic acid probe containing azobenzene (NH2-TNT4, N-PNAs) with DNA was performed by covalently immobilizing NH2-TNT4 in sequence on a gold electrode modified with a 3-mercaptopropionic acid self-assembled monolayer, with the help of N-(3-dimethylaminopropyl)-N’-ethylcarbodiimide hydrochloride (EDC) and N-hydroxysuccinimide (NHS), and the hybrid was coded as N-PNAs/DNA. Using [Fe(CN)6]4-/3- (1:1) as the electrochemical indicator, the electrochemical properties of the N-PNAs self-assembled monolayer (N-PNAs-SAMs) and the N-PNAs/DNA hybridization system before and after UV light irradiation were characterized with cyclic voltammetry (CV), differential pulse voltammetry (DPV), and electrochemical impedance spectroscopy (EIS). Results showed that the redox currents decreased with increasing irradiation time, suggesting that charge transfer on the electrode surface was weakened and the conformation of the hybrid system had changed, and that PNAs/DNA hybridization could be controlled by UV light irradiation.

  17. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Science.gov (United States)

    Li, Meng-Jun; Li, Ai-Qin; Xia, Han; Zhao, Chuan-Zhi; Li, Chang-Sheng; Wan, Shu-Bo; Bi, Yu-Ping; Wang, Xing-Jun

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, beta-ketoacyl-ACP synthase (I, II, III), beta-ketoacyl-ACP reductase, beta-hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  18. Canine amino acid transport system Xc(-): cDNA sequence, distribution and cystine transport activity in lens epithelial cells.

    Science.gov (United States)

    Maruo, Takuya; Kanemaki, Nobuyuki; Onda, Ken; Sato, Reiichiro; Ichihara, Nobuteru; Ochiai, Hideharu

    2014-04-01

    The cystine transport activity of a lens epithelial cell line originating from a canine mature cataract was investigated. A distinct cystine transport activity was observed, which was inhibited to 28% by extracellular 1 mM glutamate. The cDNA sequences of canine cystine/glutamate exchanger (xCT) and 4F2hc were determined. The predicted amino acid sequences were 527 and 533 amino acid polypeptides, respectively. The amino acid sequences of canine xCT and 4F2hc showed high similarities (>80%) to those of humans. The expression of xCT in the lens epithelial cell line was confirmed by western blot analysis. RT-PCR analysis revealed high level expression only in the brain, and it was below the detectable level in other tissues.

  19. PSNO: Predicting Cysteine S-Nitrosylation Sites by Incorporating Various Sequence-Derived Features into the General Form of Chou’s PseAAC

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-06-01

    S-nitrosylation (SNO) is one of the most universal reversible post-translational modifications involved in many biological processes. Malfunction or dysregulation of SNO leads to a series of severe diseases, such as developmental abnormalities and various diseases. Therefore, the identification of SNO sites (SNOs) provides insights into disease progression and drug development. In this paper, a new bioinformatics tool, named PSNO, is proposed to identify SNOs from protein sequences. Firstly, we explore various promising sequence-derived discriminative features, including the evolutionary profile, the predicted secondary structure and the physicochemical properties. Secondly, rather than simply combining the features, which may bring about information redundancy and unwanted noise, we use the relative entropy selection and incremental feature selection approach to select the optimal feature subsets. Thirdly, we train our model by the technique of the k-nearest neighbor algorithm. Using both informative features and an elaborate feature selection scheme, our method, PSNO, achieves good prediction performance with a mean Matthews correlation coefficient (MCC) value of about 0.5119 on the training dataset using 10-fold cross-validation. These results indicate that PSNO can be used as a competitive predictor among the state-of-the-art SNOs prediction tools. A web-server, named PSNO, which implements the proposed method, is freely available at http://59.73.198.144:8088/PSNO/.
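
    For readers unfamiliar with the Matthews correlation coefficient quoted above, the sketch below computes it from the four cells of a binary confusion matrix using the standard formula. The function name and the counts are illustrative assumptions and are not taken from the PSNO paper.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (chance level) to +1
    (perfect prediction). Returns 0.0 when the denominator is zero, which
    is a common convention for degenerate confusion matrices.
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Hypothetical counts for one cross-validation fold.
print(matthews_corrcoef(tp=50, tn=50, fp=0, fn=0))     # 1.0
print(matthews_corrcoef(tp=25, tn=25, fp=25, fn=25))   # 0.0
```

    In a 10-fold cross-validation setting, a mean MCC such as the one reported would be the average of this quantity over the ten held-out folds.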

  20. A novel phytase with sequence similarity to purple acid phosphatases is expressed in cotyledons of germinating soybean seedlings.

    Science.gov (United States)

    Hegeman, C E; Grabau, E A

    2001-08-01

    Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases.

  1. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States); Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  2. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    Science.gov (United States)

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  3. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    Science.gov (United States)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charge-localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  4. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    Energy Technology Data Exchange (ETDEWEB)

    Chang, Soo-Ik (Harvard Medical School, Boston, MA (USA)); Hammes, G.G. (Univ. of California, Santa Barbara (USA))

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homology exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The NADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the NADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.
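
    The difference between "identical" and "matched when conservative substitutions are allowed" can be made concrete with a short calculation over two pre-aligned sequences. The sketch below is an assumption-laden illustration: the residue grouping in CONSERVATIVE_GROUPS is hypothetical (the abstract does not say which substitution scheme was used), and the toy sequences are not from the fatty acid synthases.

```python
# Hypothetical grouping of one-letter residues treated as conservative
# substitutions; the scheme used in the study is not given in the abstract.
CONSERVATIVE_GROUPS = ["ILVM", "FYW", "KRH", "DE", "NQ", "ST", "AG"]


def _group(residue):
    """Return the index of the group containing `residue`, or None."""
    for index, members in enumerate(CONSERVATIVE_GROUPS):
        if residue in members:
            return index
    return None


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Percent identity and percent similarity over gap-free alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue                      # skip columns containing a gap
        compared += 1
        if x == y:
            identical += 1
            similar += 1
        elif _group(x) is not None and _group(x) == _group(y):
            similar += 1                  # conservative substitution
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


print(identity_and_similarity("LKDE", "IKDQ"))   # (50.0, 75.0)
```

    In the toy example, K and D are identical, L/I counts as a conservative substitution, and E/Q does not under this grouping, so similarity exceeds identity in the same way that 78% exceeds 67% in the abstract.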

  5. Insights into Protein Sequence and Structure-Derived Features Mediating 3D Domain Swapping Mechanism using Support Vector Machine Based Approach

    Directory of Open Access Journals (Sweden)

    Khader Shameer

    2010-06-01

    3-dimensional domain swapping is a mechanism where two or more protein molecules form higher order oligomers by exchanging identical or similar subunits. Recently, this phenomenon has received much attention in the context of prions and neuro-degenerative diseases, due to its role in the functional regulation, formation of higher oligomers, protein misfolding, aggregation etc. While 3-dimensional domain swap mechanism can be detected from three-dimensional structures, it remains a formidable challenge to derive common sequence or structural patterns from proteins involved in swapping. We have developed an SVM-based classifier to predict domain swapping events using a set of features derived from sequence and structural data. The SVM classifier was trained on features derived from 150 proteins reported to be involved in 3D domain swapping and 150 proteins not known to be involved in swapped conformation or related to proteins involved in swapping phenomenon. The testing was performed using 63 proteins from the positive dataset and 63 proteins from the negative dataset. We obtained 76.33% accuracy from training and 73.81% accuracy from testing. Due to high diversity in the sequence, structure and functions of proteins involved in domain swapping, availability of such an algorithm to predict swapping events from sequence and structure-derived features will be an initial step towards identification of more putative proteins that may be involved in swapping or proteins involved in deposition disease. Further, the top features emerging in our feature selection method may be analysed further to understand their roles in the mechanism of domain swapping.

  6. A Matching Method for Vehicle-borne Panoramic Image Sequence Based on Adaptive Structure from Motion Feature

    Directory of Open Access Journals (Sweden)

    ZHANG Zhengpeng

    2015-10-01

    Panoramic image matching constrained by a local structure-from-motion similarity feature is an important method; the process requires multivariate kernel density estimation for the structure-from-motion feature using nonparametric mean shift. Proper selection of the kernel bandwidth is critical to the convergence speed and accuracy of the matching method. This work proposes a variable-bandwidth panoramic image matching method based on an adaptive structure-from-motion feature. First, the bandwidth matrix is defined using the locally adaptive spatial structure of the sampling point in the spatial domain and the optical flow domain. The relaxation diffusion process of the structure-from-motion similarity feature is described by distance weighting of the local optical flow feature vector. Then the expression of the adaptive multivariate kernel density function is given, and the solution of the mean shift vector, the termination conditions, and the seed point selection method are discussed. Finally, multi-scale SIFT features and structure features are fused to establish a unified panoramic image matching framework. Spherical panoramic images from a vehicle-borne mobile measurement system are used for a detailed comparison between fixed bandwidth and adaptive bandwidth. The results show that adaptive bandwidth performs well when the inlier ratio and the object space scale change. The proposed method realizes an adaptive similarity measure for the structure-from-motion feature, increases the number of correct matching points and the matching rate, and is shown by the experiments to be robust.
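
    To show what mean shift with a per-sample (variable) bandwidth looks like in its simplest form, the sketch below runs the iteration in one dimension with a Gaussian kernel. It is a generic sample-point mean shift, not the paper's bandwidth-matrix formulation; the function name, the bandwidth values, and the termination threshold are all assumptions made for this example.

```python
import math


def mean_shift_mode(samples, bandwidths, start, tol=1e-9, max_iter=1000):
    """Seek a density mode by 1-D mean shift with per-sample bandwidths.

    Each sample x_i contributes a Gaussian kernel of width h_i. In the
    sample-point estimator the weight also carries a factor h_i ** -(d + 2),
    which is h_i ** -3 for dimension d = 1.
    """
    x = float(start)
    for _ in range(max_iter):
        numerator = denominator = 0.0
        for xi, hi in zip(samples, bandwidths):
            weight = hi ** -3 * math.exp(-0.5 * ((x - xi) / hi) ** 2)
            numerator += weight * xi
            denominator += weight
        if denominator == 0.0:
            break                          # no sample has measurable influence
        new_x = numerator / denominator
        if abs(new_x - x) < tol:           # termination: shift is negligible
            return new_x
        x = new_x
    return x


# Hypothetical data: two clusters; the seed point decides which mode is found.
samples = [0.0, 0.1, 5.0, 5.1]
bandwidths = [0.3, 0.3, 0.3, 0.3]
print(mean_shift_mode(samples, bandwidths, start=4.8))
```

    The seed point plays the role described in the abstract: starting near 5 the iteration settles on the mode of the second cluster, while a seed near 0 would settle on the first.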

  7. Boolean map saliency combined with motion feature used for dim and small target detection in infrared video sequences

    Science.gov (United States)

    Wang, Xiaoyang; Peng, Zhenming; Zhang, Ping

    2016-10-01

    Infrared dim and small target detection plays an important role in infrared search and tracking systems. In this paper, a novel infrared dim and small target detection method based on Boolean map saliency and motion feature is proposed. Infrared targets are the most salient parts in images, with high gray level and continuous moving trajectory. Utilizing this property, we build a feature space containing gray level feature and motion feature. The gray level feature is the intensity of input images, while the motion feature is obtained by motion charge in consecutive frames. In the second step, the Boolean map saliency approach is implemented on the gray level feature and motion feature to obtain the gray saliency map and motion saliency map. In the third step, the two saliency maps are combined to get the final result. Numerical experiments have verified the effectiveness of the proposed method. The final detection is not only accurate but also produces fewer false alarms, which makes the method suitable for practical use.

  8. Predicting the expression of recombinant monoclonal antibodies in Chinese hamster ovary cells based on sequence features of the CDR3 domain.

    Science.gov (United States)

    Pybus, Leon P; James, David C; Dean, Greg; Slidel, Tim; Hardman, Colin; Smith, Andrew; Daramola, Olalekan; Field, Ray

    2014-01-01

    Despite the development of high-titer bioprocesses capable of producing >10 g L(-1) of recombinant monoclonal antibody (MAb), some so called "difficult-to-express" (DTE) MAbs only reach much lower process titers. For widely utilized "platform" processes the only discrete variable is the protein coding sequence of the recombinant product. However, there has been little systematic study to identify the sequence parameters that affect expression. This information is vital, as it would allow us to rationally design genetic sequence and engineering strategies for optimal bioprocessing. We have therefore developed a new computational tool that enables prediction of MAb titer in Chinese hamster ovary (CHO) cells based on the recombinant coding sequence of the expressed MAb. Model construction utilized a panel of MAbs, which following a 10-day fed-batch transient production process varied in titer 5.6-fold, allowing analysis of the sequence features that impact expression over a range of high and low MAb productivity. The model identified 18 light chain (LC)-specific sequence features within complementarity determining region 3 (CDR3) capable of predicting MAb titer with a root mean square error of 0.585 relative expression units. Furthermore, we identify that CDR3 variation influences the rate of LC-HC dimerization during MAb synthesis, which could be exploited to improve the production of DTE MAb variants via increasing the transfected LC:HC gene ratio. Taken together these data suggest that engineering intervention strategies to improve the expression of DTE recombinant products can be rationally implemented based on an identification of the sequence motifs that render a recombinant product DTE.
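
    The root mean square error used above to summarize model quality is straightforward to compute. The sketch below shows the standard definition; the function name and the numbers are made up for illustration and do not come from the study.

```python
import math


def rmse(predicted, observed):
    """Root mean square error between paired predictions and observations."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical relative expression units for three antibodies.
print(rmse([1.0, 2.5, 4.0], [1.5, 2.0, 4.5]))   # 0.5
```

    An RMSE of 0.585 relative expression units, as reported, would mean that predictions typically miss the measured titer by roughly that amount on the same scale.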

  9. Features of separation on polymeric reversed phase for two classes of higher saturated fatty acids esters

    Science.gov (United States)

    Deineka, V. I.; Lapshova, M. S.; Zakharenko, E. V.; Deineka, L. A.

    2013-11-01

    The principles of sorption on polymeric reversed phase (PRP) YMS C30 for members of the two classes of esters formed by higher saturated fatty acids, i.e., lutein diesters (I) and triacylglycerols (II), are investigated. It is shown that the logarithm of the retention factor increases nonlinearly with an increase of the length of the acid radical, although the retention on PRP is higher in the case of I and lower in the case of II, compared to their retention on traditional monomeric reversed phase (MRP) Kromasil-100 5C18; however, the equivalence of the contributions to the retention of I that correspond to an identical change in acids does not depend on the length of the hydrocarbon radical of the second acid. It is noted that the Van't Hoff plot for PRP contains a curve break, indicating a change in the retention mechanism upon a rise in temperature.
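
    For context, the Van't Hoff plot in chromatography is usually drawn from the standard relation below, where k is the retention factor, T the absolute temperature, R the gas constant, and \phi the phase ratio of the column. The symbols are the conventional ones and are not taken from the paper itself.

```latex
\ln k = -\frac{\Delta H^{\circ}}{R}\,\frac{1}{T} + \frac{\Delta S^{\circ}}{R} + \ln \phi
```

    Plotting \ln k against 1/T gives a straight line of slope -\Delta H^{\circ}/R as long as a single retention mechanism operates, so a break in the curve is commonly read as a change in the enthalpy of transfer, which is consistent with the change of mechanism noted in the abstract.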

  10. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    Science.gov (United States)

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  11. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  12. N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

    OpenAIRE

    Kuhn, H.; Fietzek, P P; Lampen, J O

    1982-01-01

    The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.

  13. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    Science.gov (United States)

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  14. A common class of transcripts with 5′-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification

    Science.gov (United States)

    Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P.; Palazzo, Alexander F.

    2017-01-01

    Introns are found in 5′ untranslated regions (5′UTRs) for 35% of all human transcripts. These 5′UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5′UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5′UTR intron status, we developed a classifier that can predict 5′UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5′ proximal-intron-minus-like-coding regions (“5IM” transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5′ cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5′ proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5′ proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. PMID:27994090

  15. Amino acid sequence of Coprinus macrorhizus peroxidase and cDNA sequence encoding Coprinus cinereus peroxidase. A new family of fungal peroxidases.

    Science.gov (United States)

    Baunsgaard, L; Dalbøge, H; Houen, G; Rasmussen, E M; Welinder, K G

    1993-04-01

    Sequence analysis and cDNA cloning of Coprinus peroxidase (CIP) were undertaken to expand the understanding of the relationships of structure, function and molecular genetics of the secretory heme peroxidases from fungi and plants. Amino acid sequencing of Coprinus macrorhizus peroxidase, and cDNA sequencing of Coprinus cinereus peroxidase showed that the mature proteins are identical in amino acid sequence, 343 residues in size and preceded by a 20-residue signal peptide. Their likely identity to peroxidase from Arthromyces ramosus is discussed. CIP has an 8-residue, glycine-rich N-terminal extension blocked with a pyroglutamate residue which is absent in other fungal peroxidases. The presence of pyroglutamate, formed by cyclization of glutamine, and the finding of a minor fraction of a variant form lacking the N-terminal residue, indicate that signal peptidase cleavage is followed by further enzymic processing. CIP is 40-45% identical in amino-acid sequence to 11 lignin peroxidases from four fungal species, and 42-43% identical to the two known Mn-peroxidases. Like these white-rot fungal peroxidases, CIP has an additional segment of approximately 40 residues at the C-terminus which is absent in plant peroxidases. Although CIP is much more similar to horseradish peroxidase (HRP C) in substrate specificity, specific activity and pH optimum than to white-rot fungal peroxidases, the sequences of CIP and HRP C showed only 18% identity. Hence, CIP qualifies as the first member of a new family of fungal peroxidases. The nine invariant residues present in all plant, fungal and bacterial heme peroxidases are also found in CIP. The present data support the hypothesis that only one chromosomal CIP gene exists. In contrast, a large number of secretory plant and fungal peroxidases are expressed from several peroxidase gene clusters. Analyses of three batches of CIP protein and of 49 CIP clones revealed the existence of only two highly similar alleles indicating less

  16. Arachidonic and eicosapentaenoic acids in Araucariaceae, a unique feature among seed plants

    Directory of Open Access Journals (Sweden)

    Wolff Robert L.

    2000-01-01

    It is generally admitted that seed plants (spermaphytes) are unable to synthesize either arachidonic or eicosapentaenoic acids (AA and EPA), the classic essential fatty acids in animals. We give here chromatographic and spectrometric data showing that species from the primitive family Araucariaceae (gymnosperms) are able to synthesize AA and/or EPA in their seeds and leaves. Agathis robusta, in particular, contains AA and EPA in small amounts in its seeds, with no Δ5-unsaturated polymethylene-interrupted fatty acids (Δ5-UPIFA) with 18 carbon atoms, whereas Araucaria spp. contain both AA and C18 Δ5-UPIFA. In both species, Δ5-UPIFA with 20 carbon atoms are present as in all other Coniferophytes. All metabolic intermediates necessary for the biosynthesis of AA and/or EPA have been characterized in Araucariaceae seeds. The relevance of these observations is discussed with regard to the phylogeny of Coniferophytes.

  17. Visualization of amino acid composition differences between processed protein from different animal species by self-organizing feature maps

    Directory of Open Access Journals (Sweden)

    Xingfan ZHOU,Zengling YANG,Longjian CHEN,Lujia HAN

    2016-06-01

    Amino acids are the dominant organic components of processed animal proteins; however, there has been limited investigation of differences in their composition between various protein sources. Information on these differences will not only be helpful for their further utilization but also provide fundamental information for developing species-specific identification methods. In this study, self-organizing feature maps (SOFM) were used to visualize the amino acid composition of fish meal, and of meat and bone meal (MBM) produced from poultry, ruminants and swine. SOFM display the similarities and differences in amino acid composition between protein sources and effectively improve data transparency. Amino acid composition was shown to be useful for distinguishing fish meal from MBM because of large differences in their glycine, lysine and proline concentrations. However, the amino acid composition of the three MBMs was quite similar. The SOFM results were consistent with those obtained by analysis of variance and principal component analysis but were more straightforward to interpret. SOFM was shown to have a robust sample linkage capacity and to be a powerful means of linking different samples for further data mining.

  18. Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Budzanivska I. G.

    2014-03-01

    Aim. Identification of the widespread Ukrainian isolate(s) of PVY (Potato virus Y) in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of the coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of our own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene) has a higher percentage of homology with the recombinant isolates (strains) of this pathogen (approx. 98.8–99.8% homology for both nucleotide and translated amino acid sequences of the CP gene). The Ukrainian isolate of PVY is positioned in a separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have a common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate, which has been attributed to the strain group O, subclade N:O.

  19. Clinical and biochemical features of aromatic L-amino acid decarboxylase deficiency.

    NARCIS (Netherlands)

    Brun, L.; Ngu, L.H.; Keng, W.T.; Ch'ng, G.S.; Choy, Y.S.; Hwu, W.L.; Lee, W.T.; Willemsen, M.A.A.P.; Verbeek, M.M.; Wassenberg, T.; Regal, L.; Orcesi, S.; Tonduti, D.; Accorsi, P.; Testard, H.; Abdenur, J.E.; Tay, S.; Allen, G.F.; Heales, S.; Kern, I.; Kato, M.; Burlina, A.; Manegold, C.; Hoffmann, G.F.; Blau, N.

    2010-01-01

    OBJECTIVE: To describe the current treatment; clinical, biochemical, and molecular findings; and clinical follow-up of patients with aromatic l-amino acid decarboxylase (AADC) deficiency. METHOD: Clinical and biochemical data of 78 patients with AADC deficiency were tabulated in a database of pediat

  20. Features of the Arabidopsis recombination landscape resulting from the combined loss of sequence variation and DNA methylation

    NARCIS (Netherlands)

    Colomé Tatché, Maria; Cortijo, Sandra; Wardenaar, Rene; Monteiro Morgado, Lionel; Lahouze, Benoit; Sarazin, Alexis; Etcheverry, Mathilde; Martin, Antoine; Feng, Suhua; Duvernois-Berthet, Evelyne; Labadie, Karine; Wincker, Patrick; Jacobsen, Steven E.; Jansen, Ritsert C.; Colot, Vincent; Johannes, Frank

    2012-01-01

    The rate of meiotic crossing over (CO) varies considerably along chromosomes, leading to marked distortions between physical and genetic distances. The causes underlying this variation are being unraveled, and DNA sequence and chromatin states have emerged as key factors. However, the extent to whic

  1. A Unique Sequence of Financial Accounting Courses Featuring Team Teaching, Linked Courses, Challenging Assignments, and Instruments for Evaluation and Assessment

    Science.gov (United States)

    Lundblad, Heidemarie; Wilson, Barbara A.

    2008-01-01

    The Department of Accounting at California State University Northridge (CSUN) has developed a unique sequence of courses designed to ensure that accounting students are trained not only in technical accounting, but also acquire critical thinking, research and communication skills. The courses have proven effective and have embedded assessment…

  2. Fine-structured multi-scaling long-range correlations in completely sequenced genomes - features, origin and classification.

    NARCIS (Netherlands)

    T.A. Knoch (Tobias); M. Göcker; R. Lohner (Rudolf); A. Abuseiris (Anis); F.G. Grosveld (Frank)

    2009-01-01

    The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation an

  3. Origin of Molar-Tooth Structure Based on Sequence-Stratigraphic Position and Macroscopic Features: Example from Mesoproterozoic Gaoyuzhuang Formation at Jixian Section, Tianjin, North China

    Institute of Scientific and Technical Information of China (English)

    Mei Mingxiang

    2006-01-01

    Both the macroscopic feature and the sequence-stratigraphic position of the molar-tooth structure developed in the third member of the Gaoyuzhuang (高于庄) Formation at the Jixian (蓟县) Section in Tianjin (天津) can provide some useful information about its origin and can reveal some problems to be further researched in the future. The Mesoproterozoic Gaoyuzhuang Formation is a set of ~1600 m thick carbonate strata. This formation can be divided into four members. The first member is mainly made up of stromatolitic dolomites; the second is marked by a set of manganese dolomites; the third is mainly composed of lamina limestones with the development of molar-tooth structures; the fourth is a set of stromatolitic-lithoherm dolomites. According to lithofacies and its succession, several types of meter-scale cycles can be discerned in the Gaoyuzhuang Formation: the L-M type, the subtidal type and the peritidal type. There is a regularly vertical stacking pattern for meter-scale cycles in the third-order sequence. Therefore, the Mesoproterozoic Gaoyuzhuang Formation can be divided into 13 third-order sequences (SQ1 to SQ13) and can further be grouped into 4 second-order sequences. The third member is marked by lamina limestones and can be grouped into three third-order sequences (SQ9 to SQ11). The molar-tooth structure is developed in the middle part of the third sequence, i.e. SQ11, in the third member. Several features of this kind of molar-tooth structure reflect some features of carbonate sedimentation in the Precambrian, such as the particular configuration, abundant organic matter, and easy silication. Stromatolites are chiefly formed in a shallow tidal-flat environment; lamina are mainly formed in the shallow ramp and molar-tooth structures are mainly generated in a relatively deeper-water environment from the middle to the deep ramp. Therefore, similar to stromatolite and lamina, the molar-tooth structure might also be a kind of bio

  4. EST sequences and their annotation (amino acid sequence and results of homology search) - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    and VS) derived from five developmental stages. Clone ID ID of cDNA clone Atlas ID ID of Atlas database ( ht...tp://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ) and link to Atlas database NBRP ID ID of cDNA c...ir annotations (amino acid sequence, homology search results (with target DBs: dicty EST-DB, DNA-DB and prot...ein-DB)). Links to the Atlas database ( http://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ), whic

  5. Differentiation of acetic acid bacteria based on sequence analysis of 16S-23S rRNA gene internal transcribed spacer sequences.

    Science.gov (United States)

    González, Angel; Mas, Albert

    2011-06-30

    The 16S-23S gene internal transcribed spacer sequence of sixty-four strains belonging to different acetic acid bacteria genera were analyzed, and phylogenetic trees were generated for each genera. The topologies of the different trees were in accordance with the 16S rRNA gene trees, although the similarity percentages obtained between the species was shown to be much lower. These values suggest the usefulness of including the 16S-23S gene internal transcribed spacer region as a part of the polyphasic approach required for the further classification of acetic acid bacteria. Furthermore, the region could be a good target for primer and probe design. It has also been validated for use in the identification of unknown samples of this bacterial group from wine vinegar and fruit condiments.

  6. Molecular cloning and chromosomal localization of the nucleic acid sequences encoding the cerebrovascular and plaque amyloid peptide

    Energy Technology Data Exchange (ETDEWEB)

    Robakis, N.K.; Ramakrishna, N.; Wolfe, G.; Wisniewski, H.M.

    1987-05-01

    Amyloid deposits in vessels and neuritic plaques are found in large numbers in the brains of Alzheimer's Disease (AD) and adult Down's Syndrome (DS) patients. The partial amino acid sequence of the amyloid peptide has been determined. The authors used this amino acid sequence to synthesize an oligonucleotide probe specific for the amyloid peptide gene. Screening of a human brain cDNA library with this probe yielded a clone which contained a 1.8 kb insert. This clone contains a long open reading frame including a region which encodes the 28 amino acids of the amyloid peptide. Northern blots of human brain mRNA detected a 3.3 kb transcript which hybridized to their cDNA clone. A similar mRNA was detected in hamster, mouse, sheep and rabbit brains. Southern blots under stringent hybridization conditions detected sequences homologous to the amyloid gene in the genomes of hamster, mouse, sheep and rabbit, suggesting that this gene has been conserved during mammalian evolution. Hybridization under reduced stringency revealed the presence of additional sequences related to the amyloid gene in the genomes of the above organisms. Hybridization analysis of DNA from human x Chinese hamster cell lines showed that the gene encoding the amyloid peptide is located on chromosome 21, suggesting a genetic relationship between AD and DS.

  7. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    Science.gov (United States)

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  8. Using random forest to classify T-cell epitopes based on amino acid properties and molecular features.

    Science.gov (United States)

    Huang, Jian-Hua; Xie, Hua-Lin; Yan, Jun; Lu, Hong-Mei; Xu, Qing-Song; Liang, Yi-Zeng

    2013-12-04

    The T-lymphocyte (T-cell) is a very important component of the human immune system. T-cell epitopes can be used for accurately monitoring immune responses activated by the major histocompatibility complex (MHC) and for rationally designing vaccines. Therefore, accurate prediction of T-cell epitopes is crucial for vaccine development and clinical immunology. In the current study, two types of peptide features, i.e., amino acid properties and chemical molecular features, were used to represent T-cell epitope peptides. Based on these features, the random forest (RF) algorithm, a powerful machine learning algorithm, was used to classify T-cell epitopes and non-T-cell epitopes. The classification accuracy, sensitivity, specificity, Matthews correlation coefficient (MCC), and area under the curve (AUC) values for the proposed method are 97.54%, 97.22%, 97.60%, 0.9193, and 0.9868, respectively. These results indicate that the current method, based on the combined features and RF, is effective for T-cell epitope prediction.

  9. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    Science.gov (United States)

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that the mundticin-encoding genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified: one involved in the initial metabolism of xylose and uptake of pentose, and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.
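
    GC content, as quoted for the chromosome above, is the percentage of G and C bases among the called bases of a sequence. A minimal sketch follows; the function name gc_content and the toy sequence are hypothetical, and ignoring ambiguous bases such as N is an assumption rather than something the abstract specifies.

```python
def gc_content(seq):
    """Return the GC percentage of a DNA sequence.

    Only unambiguous bases (A, C, G, T) are counted; other symbols such as N
    are ignored. That convention is an assumption made for this sketch.
    """
    called = [base for base in seq.upper() if base in "ACGT"]
    if not called:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in called if base in "GC")
    return 100.0 * gc / len(called)


# Toy sequence for illustration; it is not part of the QU 25 genome.
print(f"{gc_content('ATGCGCATTA'):.1f}%")
```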

  10. Structural and biochemical features of acidic α-amylase of Bacillus acidicola.

    Science.gov (United States)

    Sharma, Archana; Satyanarayana, T

    2013-10-01

    The investigation is aimed at understanding the structure-function relationship of the α-amylase of the acidophilic bacterium Bacillus acidicola (BAamy), which is Ca(2+)-independent and active at the acidic pH of native starch, and is thus better suited to the starch saccharification process. The CD spectroscopic data analysis revealed that the enzyme has 30% α-helices, 14.2% β-sheets, and 55.8% random coils at 60 °C and pH 4.0. Using Bacillus stearothermophilus α-amylase (BStA) as the template, a 3-D structure of rBAamy has been proposed. A complete loss of α-amylase activity was recorded when the amino acid residues D231, E261 and D328 were substituted, which confirmed their role in catalysis. The CD studies indicated a decrease in α-helix content below and above the optimum pH and temperature, which suggests a critical role of the α-helix in maintaining the structural conformation of the enzyme. Fluorescence quenching by N-bromosuccinimide (NBS) suggested a role for tryptophan in maintaining the structural integrity of the α-amylase, and quenching by acrylamide indicated interaction by a simple collision process.

  11. Interesting Features of n2D Rydberg Series Fine-Structure Splittings along the Sodium-Like Isoelectronic Sequence

    Institute of Scientific and Technical Information of China (English)

    WANG Xiao-Lu; LIU Ling-Tao; GAO Xiang; SHEN Chun; LI Jia-Ming

    2008-01-01

    Using a simplified multi-configuration Dirac-Fock (SMCDF) scheme based on the multi-configuration Dirac-Fock (MCDF) theory, we study the systematic variations of the fine-structure splittings of the n2D3/2,5/2 Rydberg series along the sodium-like isoelectronic sequence, i.e. how the fine-structure orderings vary with increasing atomic number Z. The competition between the spin-orbit interactions and the exchange interactions due to relativistic effects of the nd orbital wavefunctions explains such variations well. Furthermore, the effect of the Breit interactions, which play a secondary role, is studied.

  12. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Science.gov (United States)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  13. Molecular features of the L-type amino acid transporter 2 determine different import and export profiles for thyroid hormones and amino acids.

    Science.gov (United States)

    Hinz, Katrin M; Neef, Dominik; Rutz, Claudia; Furkert, Jens; Köhrle, Josef; Schülein, Ralf; Krause, Gerd

    2017-03-05

    The L-type amino acid transporter 2 (LAT2) imports amino acids (AA) and also certain thyroid hormones (TH), e.g. 3,3'-T2 and T3, but not rT3 and T4. We utilized LAT2 mutations (Y130A, N133S, F242W) that increase 3,3'-T2 import and focus here on import and export capacity for AA, T4, T3, BCH and derivatives thereof to delineate molecular features. Transport studies and analysis of competitive inhibition of import by radiolabelled TH and AA were performed in Xenopus laevis oocytes. Only Y130A, a pocket widening mutation, enabled import for T4 and increased it for T3. Mutant F242W showed increased 3,3'-T2 import but no import rates for other TH derivatives. No export was detected for any TH by LAT2-wild type (WT). Mutations Y130A and N133S enabled only the export of 3,3'-T2, while N133S also increased AA export. Thus, distinct molecular LAT2-features determine bidirectional AA transport but only unidirectional 3,3'-T2 and T3 import.

  14. An amphipathic trans-acting phosphorothioate DNA element delivers uncharged PNA and PMO nucleic acid sequences in mammalian cells.

    Science.gov (United States)

    Jain, Harsh V; Beaucage, Serge L

    An innovative approach to the delivery of uncharged peptide nucleic acids (PNA) and phosphorodiamidate morpholino (PMO) oligomers in mammalian cells is described and consists of extending the sequence of those oligomers with a short PNA-polyA or PMO-polyA tail. Recognition of the polyA-tailed PNA or PMO oligomers by an amphipathic trans-acting polythymidylic thiophosphate triester element (dTtaPS) results in efficient internalization of those oligomers in several cell lines. Our findings indicate that cellular uptake of the oligomers occurs through an energy-dependent mechanism and macropinocytosis appears to be the predominant endocytic pathway used for internalization. The functionality of the internalized oligomers is demonstrated by alternate splicing of the pre-mRNA encoding luciferase in HeLa pLuc 705 cells. Amphipathic phosphorothioate DNA elements may represent a unique class of cellular transporters for robust delivery of uncharged nucleic acid sequences in live mammalian cells.

  15. A Robust Approach for Action Recognition Based on Spatio-Temporal Features in RGB-D Sequences

    Directory of Open Access Journals (Sweden)

    Ly Quoc Ngoc

    2016-05-01

    Recognizing human action is an attractive research topic in computer vision, since it plays an important role in applications such as human-computer interaction, intelligent surveillance, human action retrieval systems, health care, smart homes, robotics and so on. The availability of the low-cost Microsoft Kinect sensor, which can capture real-time high-resolution RGB and visual depth information, has opened an opportunity to significantly increase the capabilities of many automated vision-based recognition tasks. In this paper, we propose a new framework for action recognition in RGB-D video. We extract spatiotemporal features from RGB-D data that capture visual, shape and motion information. Moreover, a segmentation technique is applied to represent the temporal structure of the action. Firstly, we use STIP to detect interest points in both the RGB and depth channels. Secondly, we apply the HOG3D descriptor to the RGB channel and the 3DS-HONV descriptor to the depth channel. In addition, we extract HOF2.5D by fusing RGB and depth to capture human motion. Thirdly, we divide the video into segments and apply GMM to create feature vectors for each segment, so that three feature vectors (HOG3D, 3DS-HONV, and HOF2.5D) represent each segment. Next, the max pooling technique is applied to create a final vector for each descriptor. Then, we concatenate the feature vectors from the previous step into the final vector for action representation. Lastly, we use the SVM method for the classification step. We evaluated our proposed method on three benchmark datasets to demonstrate generalizability. The experimental results were shown to be more accurate for action recognition than previous works. We obtain overall accuracies of 93.5%, 99.16% and 89.38% with our proposed method on the UTKinect-Action, 3D Action Pairs and MSR-Daily Activity 3D datasets, respectively. These results show that our method is feasible and superior performance over the
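
    The pooling and concatenation steps described above can be sketched as follows. This is an illustrative reading of the abstract only, assuming element-wise max pooling across segments; the function names and the tiny toy vectors are hypothetical, and the upstream descriptor extraction (STIP, HOG3D, 3DS-HONV, HOF2.5D, GMM) and the SVM classifier are not reproduced.

```python
def max_pool(segment_vectors):
    """Element-wise maximum over the per-segment vectors of one descriptor."""
    if not segment_vectors:
        raise ValueError("need at least one segment vector")
    length = len(segment_vectors[0])
    if any(len(vec) != length for vec in segment_vectors):
        raise ValueError("all segment vectors must have the same length")
    # zip(*...) groups the i-th component of every segment together.
    return [max(column) for column in zip(*segment_vectors)]


def action_representation(per_descriptor_segments):
    """Max-pool each descriptor across segments, then concatenate the results."""
    final_vector = []
    for segment_vectors in per_descriptor_segments:
        final_vector.extend(max_pool(segment_vectors))
    return final_vector


# Toy per-segment vectors (two segments per descriptor); values are made up.
hog3d = [[0.1, 0.5], [0.4, 0.2]]
honv = [[1.0, 0.0, 0.3], [0.2, 0.9, 0.1]]
hof25d = [[0.7], [0.8]]

print(action_representation([hog3d, honv, hof25d]))
```

    The resulting single vector per video is what a classifier such as an SVM would then be trained on.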

  16. Fusion protein predicted amino acid sequence of the first US avian pneumovirus isolate and lack of heterogeneity among other US isolates.

    Science.gov (United States)

    Seal, B S; Sellers, H S; Meinersmann, R J

    2000-02-01

    Avian pneumovirus (APV) was first isolated from turkeys in the west-central US following emergence of turkey rhinotracheitis (TRT) during 1996. Subsequently, several APV isolates were obtained from the north-central US. Matrix (M) and fusion (F) protein genes of these isolates were examined for sequence heterogeneity and compared with European APV subtypes A and B. Among US isolates the M gene shared greater than 98% nucleotide sequence identity with only one nonsynonymous change occurring in a single US isolate. Although the F gene among US APV isolates shared 98% nucleotide sequence identity, nine conserved substitutions were detected in the predicted amino acid sequence. The predicted amino acid sequence of the US APV isolate's F protein had 72% sequence identity to the F protein of APV subtype A and 71% sequence identity to the F protein of APV subtype B. This compares with 83% sequence identity between the APV subtype A and B predicted amino acid sequences of the F protein. The US isolates were phylogenetically distinguishable from their European counterparts based on F gene nucleotide or predicted amino acid sequences. Lack of sequence heterogeneity among US APV subtypes indicates these viruses have maintained a relatively stable population since the first outbreak of TRT. Phylogenetic analysis of the F protein among APV isolates supports classification of US isolates as a new APV subtype C.
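
    Percent sequence identity figures such as those above are computed from a pairwise alignment. The sketch below shows one common convention, counting identical residues over aligned columns that contain no gap; the abstract does not state which convention or software was used, so this choice, the function name percent_identity and the toy sequences are all assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where either sequence has a gap are skipped (an assumed
    convention; other definitions divide by the full alignment length).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned peptides for illustration; these are not APV F protein sequences.
print(f"{percent_identity('MKT-LLV', 'MKSALLI'):.1f}%")
```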

  17. Purification, amino acid sequence, and cDNA cloning of trypsin inhibitors from onion (Allium cepa L.) bulbs.

    Science.gov (United States)

    Deshimaru, Masanobu; Watanabe, Akira; Suematsu, Keiko; Hatano, Maki; Terada, Shigeyuki

    2003-08-01

    Three protease inhibitors (OTI-1-3) have been purified from onion (Allium cepa L.) bulbs. Molecular masses of these inhibitors were found to be 7,370.2, 7,472.2, and 7,642.6 Da by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), respectively. Based on amino acid composition and N-terminal sequence, OTI-1 and -2 are the N-terminal truncated proteins of OTI-3. All the inhibitors are stable to heat and extreme pH. OTI-3 inhibited trypsin, chymotrypsin, and plasmin with dissociation constants of 1.3 x 10(-9) M, 2.3 x 10(-7) M, and 3.1 x 10(-7) M, respectively. The complete amino acid sequence of OTI-3 showed a significant homology to Bowman-Birk family inhibitors, and the first reactive site (P1) was found to be Arg17 by limited proteolysis by trypsin. The second reactive site (P1) was estimated to be Leu46, that may inhibit chymotrypsin. OTI-3 lacks an S-S bond near the second reactive site, resulting in a low affinity for the enzyme. The sequence of OTI-3 was also ascertained by the nucleotide sequence of a cDNA clone encoding a 101-residue precursor of the onion inhibitor.

  18. TBX1 mutation identified by exome sequencing in a Japanese family with 22q11.2 deletion syndrome-like craniofacial features and hypocalcemia.

    Directory of Open Access Journals (Sweden)

    Tsutomu Ogata

    BACKGROUND: Although TBX1 mutations have been identified in patients with 22q11.2 deletion syndrome (22q11.2DS)-like phenotypes including characteristic craniofacial features, cardiovascular anomalies, hypoparathyroidism, and thymic hypoplasia, TBX1 mutations remain rare in deletion-negative patients. Thus, it would be reasonable to perform a comprehensive genetic analysis in deletion-negative patients with 22q11.2DS-like phenotypes. METHODOLOGY/PRINCIPAL FINDINGS: We studied three subjects with craniofacial features and hypocalcemia (group 1), two subjects with craniofacial features alone (group 2), and three subjects with normal phenotype within a single Japanese family. Fluorescence in situ hybridization analysis excluded chromosome 22q11.2 deletion, and genomewide array comparative genomic hybridization analysis revealed no copy number change specific to group 1 or groups 1+2. However, exome sequencing identified a heterozygous TBX1 frameshift mutation (c.1253delA, p.Y418fsX459) specific to groups 1+2, as well as six missense variants and two in-frame microdeletions specific to groups 1+2 and two missense variants specific to group 1. The TBX1 mutation resided at exon 9C and was predicted to produce a non-functional truncated protein missing the nuclear localization signal and most of the transactivation domain. CONCLUSIONS/SIGNIFICANCE: Clinical features in groups 1+2 are well explained by the TBX1 mutation, while the clinical effects of the remaining variants are largely unknown. Thus, the results exemplify the usefulness of exome sequencing in the identification of disease-causing mutations in familial disorders. Furthermore, the results, in conjunction with the previous data, imply that TBX1 isoform C is the biologically essential variant and that TBX1 mutations are associated with a wide phenotypic spectrum, including most of 22q11.2DS phenotypes.

  19. Coding potential and transcript analysis of fowl adenovirus 4: insight into upstream ORFs as common sequence features in adenoviral transcripts.

    Science.gov (United States)

    Griffin, Bryan D; Nagy, Eva

    2011-06-01

    Recombinant fowl adenoviruses (FAdVs) have been successfully used as veterinary vaccine vectors. However, insufficient definitions of the protein-coding and non-coding regions and an incomplete understanding of virus-host interactions limit the progress of next-generation vectors. FAdVs are known to cause several diseases of poultry. Certain isolates of species FAdV-C are the aetiological agent of inclusion body hepatitis/hydropericardium syndrome (IBH/HPS). In this study, we report the complete 45667 bp genome sequence of FAdV-4 of species FAdV-C. Assessment of the protein-coding potential of FAdV-4 was carried out with the Bio-Dictionary-based Gene Finder together with an evaluation of sequence conservation among species FAdV-A and FAdV-D. On this basis, 46 potentially protein-coding ORFs were identified. Of these, 33 and 13 ORFs were assigned high and low protein-coding potential, respectively. Homologues of the ancestral adenoviral genes were, with few exceptions, assigned high protein-coding potential. ORFs that were unique to the FAdVs were differentiated into high and low protein-coding potential groups. Notable putative genes with high protein-coding capacity included the previously unreported fiber 1, hypothetical 10.3K and hypothetical 10.5K genes. Transcript analysis revealed that several of the small ORFs less than 300 nt in length that were assigned low coding potential contributed to upstream ORFs (uORFs) in important mRNAs, including the ORF22 mRNA. Subsequent analysis of the previously reported transcripts of FAdV-1, FAdV-9, human adenovirus 2 and bovine adenovirus 3 identified widespread uORFs in AdV mRNAs that have the potential to act as important translational regulatory elements.
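
A minimal sketch of how upstream ORFs of the kind described above could be located in a transcript. This is not the authors' pipeline (the abstract names only the Bio-Dictionary-based Gene Finder and transcript analysis); the function name `find_uorfs`, the `main_start` argument, and the example sequence are hypothetical, and the 300 nt cutoff simply mirrors the "less than 300 nt" figure quoted in the abstract.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_uorfs(mrna, main_start, max_len=300):
    """Return (start, end, length) for each ORF that starts upstream of main_start.

    Coordinates are 0-based, end-exclusive; length includes the stop codon.
    Only ORFs shorter than max_len nucleotides are reported.
    """
    seq = mrna.upper().replace("U", "T")  # accept RNA or DNA alphabets
    uorfs = []
    for i in range(0, main_start):
        if seq[i:i + 3] != "ATG":
            continue
        # Walk in-frame from the start codon until the first stop codon.
        for j in range(i + 3, len(seq) - 2, 3):
            if seq[j:j + 3] in STOP_CODONS:
                length = j + 3 - i
                if length < max_len:
                    uorfs.append((i, j + 3, length))
                break
    return uorfs

# Hypothetical transcript: a short uORF (ATG AAA TAG) ahead of the main ATG at index 14.
example = "GGATGAAATAGCCCATGGCTGCTTAA"
print(find_uorfs(example, main_start=14))
```

An exact, in-frame scan like this only flags candidates; whether a uORF actually regulates translation needs experimental evidence.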

  20. Evaluation of codon biology in citrus and Poncirus trifoliata based on genomic features and frame corrected expressed sequence tags.

    Science.gov (United States)

    Ahmad, Touqeer; Sablok, Gaurav; Tatarinova, Tatiana V; Xu, Qiang; Deng, Xiu-Xin; Guo, Wen-Wu

    2013-04-01

    Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary processes in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A or T at the wobble position were preferred over C- or G-ending codons, indicating a close association with the AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionarily conserved optimal codons. Our observations suggest that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata, a plausible role of stress in relation to GC3, and a coevolution pattern of amino acids.
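
For readers unfamiliar with RSCU, the sketch below shows the usual definition: the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. It is illustrative only and not the authors' code; the `SYNONYMOUS` table is deliberately restricted to alanine and glycine (the families containing GCT and GGT), and the input sequence is made up.

```python
from collections import Counter

# Assumption: only two codon families are tabulated to keep the example short.
SYNONYMOUS = {
    "Ala": ["GCT", "GCC", "GCA", "GCG"],
    "Gly": ["GGT", "GGC", "GGA", "GGG"],
}

def rscu(cds):
    """Return {codon: RSCU} for the codon families listed in SYNONYMOUS."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    values = {}
    for codons in SYNONYMOUS.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue  # amino acid absent; RSCU undefined
        expected = total / len(codons)  # count per codon under equal usage
        for c in codons:
            values[c] = counts[c] / expected
    return values

# Hypothetical coding fragment: GCT x3, GCC x1, GGT x1, GGA x1.
print(rscu("GCTGCTGCTGCCGGTGGA"))
```

An RSCU above 1 means a codon is used more often than expected under equal usage; values below 1 mean it is under-used.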

  1. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Science.gov (United States)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  2. [Ten-years records of organic arsenic (diphenylarsinic acid) poisoning: epidemiology, clinical feature, metabolism, and toxicity].

    Science.gov (United States)

    Ishi, Kazuhiro; Tamaoka, Akira

    2015-01-01

    We report here the symptoms of diphenylarsinic acid (DPAA) poisoning recorded over 10 years since the DPAA contamination of the potable well water was first detected in Kamisu City, Ibaraki Prefecture, in 2003. The poisoning symptoms associated with the cerebellum and brainstem included nystagmus, tremors, myoclonus, and cerebellar ataxia, and the symptoms associated with the temporal and occipital lobes included memory impairment, sleep disorder, and visual disturbance. Some of the affected children exhibited mental retardation. Moreover, reduced blood flow and reduced glucose metabolism in the cerebellum, brainstem, and temporal and occipital lobes persisted for several years among the DPAA-exposed persons. Based on animal studies of DPAA intoxication, the target organs for DPAA toxicity were determined to be the central nervous system (CNS), liver, and biliary system. In particular, DPAA tends to persist in the brain for a long time, resulting in long-term impacts on the brain. The cerebral blood flow and brain glucose metabolism, which can be measured by positron emission tomography (PET) and single photon emission computed tomography (SPECT), respectively, are useful objective clinical markers to determine the effect of DPAA on the CNS. We believe that continuous monitoring of the DPAA-exposed people is needed, because DPAA may act as a carcinogen and accelerate brain aging.

  3. Permian Coal-bearing Strata Sequence Stratigraphic Features in Guangdong Province

    Institute of Scientific and Technical Information of China (English)

    董大啸; 刘特辉; 吴雷; 邵龙义

    2012-01-01

    Based on measured field sections, laboratory rock and mineral identification, and comprehensive mapping, the sequence stratigraphic features of the Permian coal-bearing strata in Guangdong Province were analyzed. Five types of surface were identified (regional unconformity, incised-valley scour surface, paleosol bed, event surface, and sedimentary facies transition surface), giving six third-order sequence boundaries in total and dividing the succession into five third-order sequences. The representative basal boundaries of S1 to S5 are, respectively, the bases of the K1 and K2 sandstones in the Guanghua area, the limestone unconformity surface and the base of the white limestone member in the Lianyang area, and the base of the thick sandstone of the Shengtang Formation in the Guanghua area; the top boundary of S5 is the structural event surface between the Permian and Triassic systems. Comparison of the different areas shows that sequence development in the Guanghua and Lianyang areas is rather similar. The five sequences in the Lianyang area record a cyclic process of marine transgression (S1, S2), fast regression, transgression (S3), and slow regression (S4, S5), while the five sequences in the Quren area are similar but vary little. Comparison of the 1990s sequence stratigraphic division scheme with this study shows that two sequence boundaries are identical.

  4. Genetic, molecular and expression features of the Pervenets mutant leading to high oleic acid content of seed oil in sunflower

    Directory of Open Access Journals (Sweden)

    Lacombe Séverine

    2002-01-01

    Pervenets is a sunflower population that displays seed oil with a high oleic acid content [HOAC]. Our aim is to reconcile all the data gathered on this mutant in a single explanatory mechanism. All Pervenets-derived [HOAC] lines display no accumulation, or a very reduced accumulation, of the Δ12-desaturase transcript in the embryos during the stages of oil accumulation. They also carry oleHOS-specific RFLP markers revealed by a Δ12-desaturase cDNA used as a probe. The linoleic, or [LO], genotypes do not carry this RFLP marker, but another allele: oleLOR (oleHL locus). Linkage disequilibrium between the oleHOS allele and [HOAC] was verified. We studied the mode of inheritance of [HOAC] in two segregating populations. F2 progenies revealed one dominant allele for [HOAC] that co-segregated with the oleHOS allele, showing that the Pervenets mutation and oleHOS were closely linked. F6 recombinant inbred lines showed that the [HOAC] trait is due to two independent loci: the locus carrying the oleHOS allele and another locus, sup. One allele, supole, at this second locus may suppress the effect of the oleHOS allele on the [HOAC] trait. Northern analyses performed on [HOAC] lines and F1 ([HOAC] x [LO]) hybrids revealed under-accumulation of the Δ12-desaturase transcript. Thus the Pervenets mutation acts in trans. The oleHOS genomic region that may carry the Pervenets mutation was cloned. A genomic library was constructed in lambdafixII with the DNA from the RHA345 [HOAC] line and screened with a Δ12-desaturase cDNA as a probe. Two overlapping clones were entirely sequenced and were found to carry a gene for a Δ12-desaturase probably located in the RE. This corresponds to the invariant part of the oleHL locus. Another clone (11.1) probably carries Δ12-desaturase repeated sequences that cause instability of the clone. We showed that the 11.1 clone carries most of the cDNA sequence, but owing to its organization it has not yet been sequenced.
A mutation mechanism

  5. Generation of novel cationic antimicrobial peptides from natural non-antimicrobial sequences by acid-amide substitution

    Directory of Open Access Journals (Sweden)

    Tamada Yasushi

    2011-03-01

    Background: Cationic antimicrobial peptides (CAMPs) are well recognized to be promising as novel antimicrobial and antitumor agents. To obtain novel skeletons of CAMPs, we propose a simple strategy using acid-amide substitution (i.e. Glu→Gln, Asp→Asn) to confer net positive charge on natural non-antimicrobial sequences that have structures distinct from known CAMPs. The potential of this strategy was verified by a trial study. Methods: The pro-regions of nematode cecropin P1-P3 (P1P-P3P) were selected as parent sequences. P1P-P3P and their acid-amide-substituted mutants (NP1P-NP3P) were chemically synthesized. Bactericidal and membrane-disruptive activities of these peptides were evaluated. Conformational changes were estimated from far-ultraviolet circular dichroism (CD) spectra. Results: NP1P-NP3P acquired potent bactericidal activities via membrane disruption although P1P-P3P were not antimicrobial. Far-ultraviolet CD spectra of NP1P-NP3P were similar to those of their parent peptides P1P-P3P, suggesting that NP1P-NP3P acquired microbicidal activity without remarkable conformational changes. NP1P-NP3P killed bacteria almost in parallel with their membrane-disruptive activities, suggesting that the mode of action of those peptides was membrane disruption. Interestingly, the membrane-disruptive activities of NP1P-NP3P were highly diversified against acidic liposomes, indicating that the acid-amide-substituted nematode cecropin pro-region could be a unique and promising skeleton for novel synthetic CAMPs with diversified membrane-discriminative properties. Conclusions: The acid-amide substitution successfully generated some novel CAMPs in our trial study. These novel CAMPs were derived from natural non-antimicrobial sequences, and their sequences were completely distinct from any category of known CAMPs, suggesting that such mutated natural sequences could be a promising source of novel skeletons of CAMPs.
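
The acid-amide substitution described above (Glu→Gln, Asp→Asn) can be sketched in a few lines. The peptide below is a made-up example, not one of the P1P-P3P sequences (which the abstract does not give), and `net_charge` is a deliberately crude estimate that counts only Lys/Arg as +1 and Asp/Glu as -1, ignoring histidine, the termini, and pH.

```python
# One-letter codes: E (Glu) -> Q (Gln), D (Asp) -> N (Asn).
ACID_TO_AMIDE = str.maketrans({"E": "Q", "D": "N"})

def acid_amide_substitute(peptide):
    """Replace every acidic residue with its amide counterpart."""
    return peptide.upper().translate(ACID_TO_AMIDE)

def net_charge(peptide):
    """Crude net charge near neutral pH: +1 per K/R, -1 per D/E (assumption)."""
    seq = peptide.upper()
    positive = sum(seq.count(res) for res in "KR")
    negative = sum(seq.count(res) for res in "DE")
    return positive - negative

parent = "GEDKLKEADR"  # hypothetical non-cationic parent sequence
mutant = acid_amide_substitute(parent)
print(parent, net_charge(parent))
print(mutant, net_charge(mutant))
```

Because the substitution removes negative charges without adding or removing residues, the net charge rises by exactly the number of Asp and Glu residues in the parent.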

  6. Using scores of amino acid topological descriptors for quantitative sequence-mobility modeling of peptides based on support vector machine

    Institute of Scientific and Technical Information of China (English)

    LIANG Guizhao; YANG Shanbin; ZHOU Yuan; ZHOU Peng; LI Zhiliang

    2006-01-01

    Scores of amino acid topological descriptors (SATD), derived from principal component analysis of a matrix of 1262 structural variables related to 23 amino acids, were employed to express the structure of 125 peptides of different lengths. Quantitative sequence-mobility models (QSMMs) were constructed using partial least squares (PLS) and support vector machine (SVM) regression, respectively. As new amino acid scales, SATD carry plentiful information related to biological activity and are easily manipulated. Better results were obtained with SVM than with PLS, which indicates that SVM offers robust stability and excellent predictive ability for electrophoretic mobilities. These results show that there are wide prospects for the application of SATD and SVM regression in QSMMs.

  7. Deep RNA sequencing reveals hidden features and dynamics of early gene transcription in Paramecium bursaria chlorella virus 1.

    Directory of Open Access Journals (Sweden)

    Guillaume Blanc

    Paramecium bursaria chlorella virus 1 (PBCV-1) is the prototype of the genus Chlorovirus (family Phycodnaviridae) that infects the unicellular, eukaryotic green alga Chlorella variabilis NC64A. The 331-kb PBCV-1 genome contains 416 major open reading frames. An mRNA-seq approach was used to analyze PBCV-1 transcriptomes at 6 progressive times during the first hour of infection. The alignment of 17 million reads to the PBCV-1 genome allowed the construction of single-base transcriptome maps. Significant transcription was detected for a subset of 50 viral genes as soon as 7 min after infection. By 20 min post infection (p.i.), transcripts were detected for most PBCV-1 genes, and transcript levels continued to increase globally up to 60 min p.i., at which time 41% of the poly(A)+-containing RNAs in the infected cells mapped to the PBCV-1 genome. For some viral genes, the number of transcripts in the later time points (20 to 60 min p.i.) was much higher than that of the most highly expressed host genes. RNA-seq data revealed putative polyadenylation signal sequences in PBCV-1 genes that were identical to the polyadenylation signal AAUAAA of green algae. Several transcripts have an RNA fragment excised. However, the frequency of excision and the resulting putative shortened protein products suggest that most of these excision events have no functional role but are probably the result of the activity of misled spliceosomes.

  8. Insight on how fishing bats discern prey and adjust their mechanic and sensorial features during the attack sequence.

    Science.gov (United States)

    Aizpurua, Ostaizka; Alberdi, Antton; Aihartza, Joxerra; Garin, Inazio

    2015-07-21

    Several insectivorous bats have included fish in their diet, yet little is known about the processes underlying this trophic shift. We performed three field experiments with wild fishing bats to address how they manage to discern fish from insects and adapt their hunting technique to capture fish. We show that bats react only to targets protruding above the water and discern fish from insects based on prey disappearance patterns. Stationary fish trigger short and shallow dips and a terminal echolocation pattern with an important component of the narrowband and low frequency calls. When the fish disappears during the attack process, bats regulate their attack increasing the number of broadband and high frequency calls in the last phase of the echolocation as well as by lengthening and deepening their dips. These adjustments may allow bats to obtain more valuable sensorial information and to perform dips adjusted to the level of uncertainty on the location of the submerged prey. The observed ultrafast regulation may be essential for enabling fishing to become cost-effective in bats, and demonstrates the ability of bats to rapidly modify and synchronise their sensorial and motor features as a response to last minute stimulus variations.

  9. Modulation of anti-endotoxin property of Temporin L by minor amino acid substitution in identified phenylalanine zipper sequence.

    Science.gov (United States)

    Srivastava, Saurabh; Kumar, Amit; Tripathi, Amit Kumar; Tandon, Anshika; Ghosh, Jimut Kanti

    2016-11-01

    A 13-residue frog antimicrobial peptide Temporin L (TempL) possesses versatile antimicrobial activities and is considered a lead molecule for the development of new antimicrobial agents. To find out the amino acid sequences that influence the anti-microbial property of TempL, a phenylalanine zipper-like sequence was identified in it which was not reported earlier. Several alanine-substituted analogs and a scrambled peptide having the same composition of TempL were designed for evaluating the role of this motif. To investigate whether leucine residues instead of phenylalanine residues at 'a' and/or 'd' position(s) of the heptad repeat sequence could alter its antimicrobial property, several TempL analogs were synthesized after replacing these phenylalanine residues with leucine residues. Replacing phenylalanine residues with alanine residues in the phenylalanine zipper sequence significantly compromised the anti-endotoxin property of TempL. This is evident from the higher production of tumor necrosis factor-α and interleukin-6 in lipopolysaccharide (LPS)-stimulated rat bone-marrow-derived macrophage cells in the presence of its alanine-substituted analogs than TempL itself. However, replacement of these phenylalanine residues with leucine residues significantly augmented anti-endotoxin property of TempL. A single alanine-substituted TempL analog (F8A-TempL) showed significantly reduced cytotoxicity but retained the antibacterial activity of TempL, while the two single leucine-substituted analogs (F5L-TempL and F8L-TempL), although exhibiting lower cytotoxicity, were able to retain the antibacterial activity of the parent peptide. The results demonstrate how minor amino acid substitutions in the identified phenylalanine zipper sequence in TempL could yield analogs with better antibacterial and/or anti-endotoxin properties with their plausible mechanism of action.

  10. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Science.gov (United States)

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortification, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of the folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to identify SNPs in selected gene fragments. The RT-qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time that industrial yeast strains have been analysed with regard to genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the amount of folic acid produced by industrial yeast strains and changes in the nucleotide sequence of the corresponding genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in the KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large number of SNPs contributes to impairment of the selected enzymes and S. cerevisiae and S

  11. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Science.gov (United States)

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter alia in silico prediction of primer specificity and GM target coverage. The new tool can assist laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patent documentation. Finally, it can help in annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.
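
To make "in silico determination of PCR amplification" concrete, here is a toy sketch of the idea: find a forward primer on one strand, find the reverse primer's binding site (its reverse complement) downstream, and report the span between them. This is not the JRC pipeline, whose matching rules are not described in the abstract; the function names, the exact-match requirement, the single-strand search, the `max_len` limit, and the sequences are all assumptions for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def in_silico_pcr(template, forward, reverse, max_len=1000):
    """Return amplicons predicted on the given strand using exact primer matches."""
    template = template.upper()
    fwd = forward.upper()
    rev_site = reverse_complement(reverse)  # where the reverse primer anneals
    amplicons = []
    start = template.find(fwd)
    while start != -1:
        # Nearest reverse-primer site downstream of this forward-primer hit.
        end = template.find(rev_site, start + len(fwd))
        if end != -1:
            amplicon = template[start:end + len(rev_site)]
            if len(amplicon) <= max_len:
                amplicons.append(amplicon)
        start = template.find(fwd, start + 1)
    return amplicons

# Hypothetical template and primer pair.
print(in_silico_pcr("TTTACGTACCCCGGGGAATCTTT", "ACGTAC", "GATTCC"))
```

A realistic screen would also search the opposite strand and tolerate primer mismatches, which this sketch omits.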

  12. Luminescence features from conical bubble collapse in 1,2 propanediol and its perturbation adding sulfuric acid

    Energy Technology Data Exchange (ETDEWEB)

    Navarrete, M; Godinez, F A [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360, Instituto de Ingenieria, Lab. de Fotofisica (Mexico); Sanchez, C [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360, Lab. de Fotonica y Microondas (Mexico); Mejia, E V; Villagran, M, E-mail: mnm@pumas.iingen.unam.mx [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360 (Mexico)

    2011-01-01

    A summary of experimental findings on the luminescence from bubble collapse (CBL) is presented, obtained by varying the inert gas content of the bubble and the driving pressure and by perturbing the liquid piston with small quantities of sulfuric acid. The temporal, spectral, and spatial characteristics of the luminescence in relation to the dynamic features of the collapse are also examined. CBL was reproduced using argon gas, with 1,2-propanediol as the liquid piston. In general, the pulse shape exhibits a large variety of profiles. The luminescence intensity increased two-fold and the pulse width decreased to almost half when the liquid was disturbed with sulfuric acid. Spectrally, the Swan, CH and CN lines were observed at low volumes of Ar gas and low driving pressure; lines of OH⁰, Na* and K* always appear superimposed on an underlying continuum background. De-excitation of the sodium atom at 589 nm and two diffuse satellite bands at ~554 nm and ~620 nm from alkali-metal-argon exciplexes were observed in both systems under certain conditions. All these findings point towards several sources of light emission that are generated along the compression time line, resulting in a temporally and spatially inhomogeneous pulse. A mechanism to explain the bright CBL is proposed.

  13. Characterization of DNA-binding sequences for CcaR in the cephamycin-clavulanic acid supercluster of Streptomyces clavuligerus.

    Science.gov (United States)

    Santamarta, I; López-García, M T; Kurt, A; Nárdiz, N; Alvarez-Álvarez, R; Pérez-Redondo, R; Martín, J F; Liras, P

    2011-08-01

    RT-PCR analysis of the genes in the clavulanic acid cluster revealed three transcriptional polycistronic units that comprised the ceaS2-bls2-pah2-cas2, cyp-fd-orf12-orf13 and oppA2-orf16 genes, whereas oat2, car, oppA1, claR, orf14, gcaS and pbpA were expressed as monocistronic transcripts. Quantitative RT-PCR of Streptomyces clavuligerus ATCC 27064 and the mutant S. clavuligerus ccaR::aph showed that, in the mutant, there was a 1000- to 10,000-fold lower transcript level for the ceaS2 to cas2 polycistronic transcript that encoded CeaS2, the first enzyme of the clavulanic acid pathway that commits arginine to clavulanic acid biosynthesis. Smaller decreases in expression were observed in the ccaR mutant for other genes in the cluster. Two-dimensional electrophoresis and MALDI-TOF analysis confirmed the absence in the mutant strain of proteins CeaS2, Bls2, Pah2 and Car that are required for clavulanic acid biosynthesis, and CefF and IPNS that are required for cephamycin biosynthesis. Gel shift electrophoresis using recombinant r-CcaR protein showed that it bound to the ceaS2 and claR promoter regions in the clavulanic acid cluster, and to the lat, cefF, cefD-cmcI and ccaR promoter regions in the cephamycin C gene cluster. Footprinting experiments indicated that triple heptameric conserved sequences were protected by r-CcaR, and allowed identification of heptameric sequences as CcaR binding sites.

  14. Nucleotide sequence and spatial expression pattern of a drought- and abscisic acid-induced gene of tomato.

    Science.gov (United States)

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicate that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  15. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    Science.gov (United States)

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    Aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through the shikimate pathway, except for mammals, which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps, and chorismate serves as a precursor for the synthesis of a variety of aromatic compounds. These enzymes have been shown to play a vital role in the viability of microorganisms and are thus suggested to be attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. An understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs can be based. Comparing the full-length amino acid sequences and the X-ray crystal structures of these enzymes from bacterial, fungal and plant sources would contribute to designing a specific drug and/or developing broad-spectrum compounds with efficacy against a variety of pathogens.

  16. The structural analysis of protein sequences based on the quasi-amino acids code

    Institute of Scientific and Technical Information of China (English)

    Zhu Ping; Tang Xu-Qing; Xu Zhen-Yuan

    2009-01-01

    Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project came the postgenome era, in which proteomics technology is emerging. This paper studies protein molecules from an algebraic point of view. The algebraic system (∑, +, *) is introduced, where ∑ is the set of 64 codons. According to the characteristics of (∑, +, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relations among the quasi-amino acids are revealed. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codons. All these correlations may play an important part in establishing the logical relationship between codons and the quasi-amino acids during the origin of life. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties of the amino acids code are very difficult to observe. The present paper shows that (ZU, +, ×) is a field. Furthermore, the operational results show that the codon tga has a different property from the other stop codons. In fact, in the human and ox mitochondrial genetic codes, tga codes for tryptophan rather than acting as a stop codon as in other genetic codes, which is the case reported by Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code; in other words, it solves the problem that 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3).

  17. Complete amino acid sequence of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris.

    Science.gov (United States)

    Lehman, L D; Jones, B N; Dwulet, F E; Bogardt, R A; Gurd, F R

    1980-10-21

    The complete primary structure of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at its two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage with trypsin of the citraconylated apomyoglobin at its three arginine residues. Further digestion of the central cyanogen bromide peptide with S. aureus strain V8 protease and the 1,2-cyclohexanedione-treated central cyanogen bromide peptide with trypsin enabled the determination of the remainder of the covalent structure. This myoglobin differs from the cetacean myoglobins determined to date at 12 to 17 positions. These large sequence differences reflect the distant taxonomic relationships between the goose-beaked whale and the other species of Cetacea the myoglobin sequences of which have previously been determined.

  18. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer

    Science.gov (United States)

    Zambanini, Thiemo; Buescher, Joerg M.; Meurer, Guido; Blank, Lars M.

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  19. Isolation and amino acid sequence of crustacean hyperglycemic hormone precursor-related peptides.

    Science.gov (United States)

    Tensen, C P; Verhoeven, A H; Gaus, G; Janssen, K P; Keller, R; Van Herp, F

    1991-01-01

    The crustacean hyperglycemic hormone (CHH) is synthesized as part of a larger preprohormone in which the sequence of CHH is N-terminally flanked by a peptide for which the name CPRP (CHH precursor-related peptide) is proposed. Both CHH and CPRP are present in the sinus gland, the neurohemal organ of neurosecretory cells located in the eyestalk of decapod crustaceans. This paper describes the isolation and sequence analysis of CPRPs isolated from sinus glands of the crab Carcinus maenas, the crayfish Orconectes limosus and the lobster Homarus americanus. The published sequence of "peptide H" isolated from the land crab, Cardisoma carnifex, has now been recognized as a CPRP in this species. Sequence comparison reveals a high level of identity for the N-terminal region (residues 1-13) between all four peptides, while identity in the C-terminal domain is high between lobster and crayfish CPRP on the one hand, and between both crab species on the other. Conserved N-terminal residues include a putative monobasic processing site at position 11, which suggests that CPRP may be a biosynthetic intermediate from which a potentially bioactive decapeptide can be derived.

  20. Complete genome sequence of the prototype lactic acid bacterium Lactococcus lactis subsp cremoris MG1363

    NARCIS (Netherlands)

    Wegmann, Udo; O'Connell-Motherwy, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 ps

  1. Features of intestinal absorption of salvianolic acid

    Institute of Scientific and Technical Information of China (English)

    陈贤春; 吴清; 李冀湘; 石红欣; 张玲

    2009-01-01

    Objective: To study the intestinal absorptive features of salvianolic acid, taking salvianic acid A sodium and protocatechuic aldehyde as the indexes. Methods: An in vivo intestinal absorption method in rats was applied, and a quantitative determination method for salvianic acid A sodium and protocatechuic aldehyde was established. The influences of different absorptive segments of the intestine, drug concentrations and pH conditions on the amount of salvianic acid A sodium and protocatechuic aldehyde absorbed were observed, and the absorption kinetic parameters were studied. Results: Both compounds were absorbed in all segments of the small intestine, with no specific absorptive segment. The amount absorbed had a good linear relationship with concentration within the tested range, without saturation. pH had no significant influence on the intestinal absorption of salvianic acid A sodium, but did influence that of protocatechuic aldehyde. The absorption rate constant (Ka) of salvianic acid A sodium was 0.3996 h⁻¹ with a T1/2 of 1.734 h, and that of protocatechuic aldehyde was 0.4019 h⁻¹ with a T1/2 of 1.724 h. Conclusion: Salvianolic acid is absorbed well in the intestine, and the absorption mechanism is passive diffusion.
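
    The reported rate constants and half-lives can be cross-checked against each other. A minimal sketch, assuming first-order absorption kinetics (an assumption suggested by the linear, non-saturating absorption described, but not stated explicitly in the abstract), for which T1/2 = ln 2 / Ka; the function name `half_life` is illustrative:

```python
import math


def half_life(ka_per_hour):
    """Half-life in hours for a first-order process with rate constant Ka (1/h)."""
    return math.log(2) / ka_per_hour


if __name__ == "__main__":
    reported = {
        "salvianic acid A sodium": (0.3996, 1.734),
        "protocatechuic aldehyde": (0.4019, 1.724),
    }
    for name, (ka, reported_t_half) in reported.items():
        print(f"{name}: computed {half_life(ka):.4f} h, reported {reported_t_half} h")
```

    Under this assumption the computed values agree with the reported half-lives to within about 0.001 h.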

  2. Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

    DEFF Research Database (Denmark)

    Radova, A.; Sebela, M.; Galuszka, P.

    2001-01-01

    This paper reports the first purification method developed for the isolation of a homogeneous polyamine oxidase (PAO) from etiolated barley seedlings. The crude enzyme preparation was obtained after initial precipitation of the extract with protamine sulphate and ammonium sulphate. The enzyme was further purified to a final homogeneity (by the criteria of isoelectric focusing and SDS-PAGE) using techniques of low pressure chromatography followed by two FPLC steps. The purified yellow enzyme showed visible absorption maxima of a flavoprotein at 380 and 450 nm; the presence of FAD as the cofactor was further confirmed by measuring the fluorescence spectra. Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars; its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...

  3. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Rhee, Mun Su [University of Florida, Gainesville; Moritz, Brelan E. [University of Florida, Gainesville; Xie, Gary [Los Alamos National Laboratory (LANL); Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Chertkov, Olga [Los Alamos National Laboratory (LANL); Brettin, Thomas S [ORNL; Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Patel, Milind [University of Florida, Gainesville; Ou, Mark [University of Florida, Gainesville; Harbrucker, Roberta [University of Florida, Gainesville; Ingram, Lonnie O. [University of Florida; Shanmugam, Keelnathan T. [University of Florida

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L(+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered a potential probiotic. The complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  4. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Xie, Gary [Los Alamos National Laboratory (LANL); Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L(+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered a potential probiotic. The complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  5. Prediction of the Occurrence of the ADP-binding βαβ-fold in Proteins, Using an Amino Acid Sequence Fingerprint

    NARCIS (Netherlands)

    Wierenga, Rik K.; Terpstra, Peter; Hol, Wim G.J.

    1986-01-01

    An amino acid sequence "fingerprint" has been derived that can be used to test if a particular sequence will fold into a βαβ-unit with ADP-binding properties. It was deduced from a careful analysis of the known three-dimensional structures of ADP-binding βαβ-folds. This fingerprint is in fact a set

  6. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Science.gov (United States)

    2010-07-01

    ... “Sequence Listing” file. (6) All computer readable forms must have a label permanently affixed thereto on...) Computer readable form files submitted may be in any of the following media: (1) Diskette: 3.50 inch, 1.44... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824...

  7. Diverse bacterial PKS sequences derived from okadaic acid-producing dinoflagellates.

    Science.gov (United States)

    Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S

    2008-05-22

    Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genera Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid-producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA-producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from the other Alveolates Karenia brevis or Cryptosporidium parvum.

  8. Amino acid sequence diversity within the family of antibodies bearing the major antiarsonate cross-reactive idiotype of the A strain mouse

    OpenAIRE

    1983-01-01

    VH region amino acid sequences are described for five A/J anti-p-azophenylarsonate (anti-Ars) hybridoma antibodies for which the VL region sequences have previously been determined, thus completing the V domain sequences of these molecules. These antibodies all belong to the family designated Ars-A which bears the major anti-arsonate cross-reactive idiotype (CRI) of the A strain mouse. However, they differ in the degree to which they express the CRI in standard competition radioimmunoassays...

  9. Global analysis of physical and functional RNA targets of hnRNP L reveals distinct sequence and epigenetic features of repressed and enhanced exons.

    Science.gov (United States)

    Cole, Brian S; Tapescu, Iulia; Allon, Samuel J; Mallory, Michael J; Qiu, Jinsong; Lake, Robert J; Fan, Hua-Ying; Fu, Xiang-Dong; Lynch, Kristen W

    2015-12-01

    HnRNP L is a ubiquitous splicing-regulatory protein that is critical for the development and function of mammalian T cells. Previous work has identified a few targets of hnRNP L-dependent alternative splicing in T cells and has described transcriptome-wide association of hnRNP L with RNA. However, a comprehensive analysis of the impact of hnRNP L on mRNA expression remains lacking. Here we use next-generation sequencing to identify transcriptome changes upon depletion of hnRNP L in a model T-cell line. We demonstrate that hnRNP L primarily regulates cassette-type alternative splicing, with minimal impact of hnRNP L depletion on transcript abundance, intron retention, or other modes of alternative splicing. Strikingly, we find that binding of hnRNP L within or flanking an exon largely correlates with exon repression by hnRNP L. In contrast, exons that are enhanced by hnRNP L generally lack proximal hnRNP L binding. Notably, these hnRNP L-enhanced exons share sequence and context features that correlate with poor nucleosome positioning, suggesting that hnRNP may enhance inclusion of a subset of exons via a cotranscriptional or epigenetic mechanism. Our data demonstrate that hnRNP L controls inclusion of a broad spectrum of alternative cassette exons in T cells and suggest both direct RNA regulation as well as indirect mechanisms sensitive to the epigenetic landscape.

  10. Diversity of the 47-kD HtrA nucleic acid and translated amino acid sequences from 17 recent human isolates of Orientia.

    Science.gov (United States)

    Jiang, Ju; Paris, Daniel H; Blacksell, Stuart D; Aukkanit, Nuntipa; Newton, Paul N; Phetsouvanh, Rattanaphone; Izzard, Leonard; Stenos, John; Graves, Stephen R; Day, Nicholas P J; Richards, Allen L

    2013-06-01

    Orientia tsutsugamushi, the etiologic agent of potentially fatal scrub typhus, is characterized by a high antigenic diversity, which complicates the development of a broadly protective vaccine. Efficacy studies in murine and nonhuman primate models demonstrated the DNA vaccine candidate pKarp47, based upon the O. tsutsugamushi Karp 47-kD HtrA protein gene, to be a successful immunoprophylactic against scrub typhus. To characterize 47-kD HtrA protein diversity among human isolates of Orientia, we sequenced the full open reading frame (ORF) of the 47-kD HtrA gene and analyzed the translated amino acid sequences of 17 patient isolates from Thailand (n=13), Laos (n=2), Australia (n=1), and the United Arab Emirates (UAE) (n=1) and 9 reference strains: Karp (New Guinea), Kato (Japan), Ikeda (Japan), Gilliam (Burma), Boryong (Korea), TA763, TH1811 and TH1817 (Thailand), and MAK243 (China). The percentage identity (similarity) of translated amino acid sequences between 16 new isolates and 9 reference strains of O. tsutsugamushi ranged from 96.4% to 100% (97.4% to 100%). However, inclusion of the recently identified Orientia chuto sp. nov. reduced identity (similarity) values to 82.2% to 83.3% (90.4% to 91.4%). These results demonstrate the diversity of Orientia 47-kD HtrA among isolates encountered by humans and therefore provide support for the necessity of developing a broadly protective scrub typhus vaccine that takes this diversity into account.
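
    The identity percentages quoted above are pairwise statistics over aligned sequences. A minimal sketch of one common convention (percentage of identical residues over alignment columns where neither sequence has a gap); the abstract does not say which convention or software the authors used, and the toy sequences below are hypothetical, not HtrA sequences. Similarity would additionally require a substitution matrix and is not shown.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identical residues between two pre-aligned sequences, ignoring gapped columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical 10-residue aligned fragments differing at one position.
    print(percent_identity("MKKTLLAVSA", "MKKSLLAVSA"))
```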

  11. Air-stable platinum and palladium complexes featuring bis[2,4-bis(trifluoromethyl)phenyl]phosphinous acid ligands.

    Science.gov (United States)

    Kurscheid, Boris; Neumann, Beate; Stammler, Hans-Georg; Hoge, Berthold

    2011-12-23

    Secondary phosphane oxides, R(2)P(O)H, are commonly used as preligands for transition-metal complexes of phosphinous acids, R(2)P-OH (R=alkyl, aryl), which are relevant as efficient catalysts in cross-coupling processes. In contrast to previous work by other groups, we are interested in the ligating properties of an electron-deficient phosphinous acid, (R(f))(2)P-OH, bearing the strongly electron-withdrawing and sterically demanding 2,4-bis(trifluoromethyl)phenyl group towards catalysis-relevant metals, such as palladium and platinum. The preligand bis[2,4-bis(trifluoromethyl)phenyl]phosphane oxide, (R(f))(2)P(O)H, reacts smoothly with solid platinum(II) dichloride yielding the trans-configured phosphinous acid platinum complex trans-[PtCl(2)({2,4-(CF(3))(2)C(6)H(3)}(2)POH)(2)]. The deprotonation of one phosphinous acid ligand with an appropriate base leads to the cis-configured monoanion complex cis-[PtCl(2)({2,4-(CF(3))(2)C(6)H(3)}(2)PO)(2)H](-), featuring the quasi-chelating phosphinous acid phosphinito unit, (R(f))(2)P-O-H···O=P(R(f))(2), which exhibits a strong hydrogen bridge substantiated by an O···O distance of 245.1(4) pm. The second deprotonation step is accompanied by a rearrangement to afford the trans-configured dianion trans-[PtCl(2)({2,4-(CF(3))(2)C(6)H(3)}(2)PO)(2)](2-). The reaction of (R(f))(2)P(O)H with solid palladium(II) dichloride initially yields a mononuclear palladium complex [PdCl(2)({2,4-(CF(3))(2)C(6)H(3)}(2)POH)(2)], which condenses under liberation of HCl to the neutral dinuclear palladium complex [Pd(2)(μ-Cl)(2){({2,4-(CF(3))(2)C(6)H(3)}(2)PO)(2)H}(2)]. The equilibrium between the mononuclear [PdCl(2)({2,4-(CF(3))(2)C(6)H(3)}(2)POH)(2)] and dinuclear [Pd(2)(μ-Cl)(2){({2,4-(CF(3))(2)C(6)H(3)}(2)PO)(2)H}(2)] palladium complexes is reversible and can be shifted in each direction by the addition of base or HCl, respectively. Treatment of palladium(II) hexafluoroacetylacetonate, [Pd(F(6)acac)(2)], with a slight excess of (R(f))(2)P

  12. Hybridization probe for femtomolar quantification of selected nucleic acid sequences on a disposable electrode.

    Science.gov (United States)

    Jenkins, Daniel M; Chami, Bilal; Kreuzer, Matthias; Presting, Gernot; Alvarez, Anne M; Liaw, Bor Yann

    2006-04-01

    Mixed monolayers of electroactive hybridization probes on gold surfaces of a disposable electrode were investigated as a technology for simple, sensitive, selective, and rapid gene identification. Hybridization to the ferrocene-labeled hairpin probes reproducibly diminished cyclic redox currents, presumably due to a displacement of the label from the electrode. Observed peak current densities were roughly 1000x greater than those observed in previous studies, such that results could easily be interpreted without the use of algorithms to correct for background polarization currents. Probes were sensitive to hybridization with a number of oligonucleotide sequences with varying homology, but target oligonucleotides could be distinguished from competing nontarget sequences based on unique "melting" profiles from the probe. Detection limits were demonstrated down to nearly 100 fM, which may be low enough to identify certain genetic conditions or infections without amplification. This technology has rich potential for use in field devices for gene identification as well as in gene microarrays.

  13. Amino acid sequence of the cold-active alkaline phosphatase from Atlantic cod (Gadus morhua)

    DEFF Research Database (Denmark)

    Asgeirsson, Bjarni; Nielsen, Berit Noesgaard; Højrup, Peter

    2003-01-01

    -linked glycosylation sites were found. The glycan structure was determined as complex biantennary in type with fucose and sialic acid attached, although a trace of complex tri-antennary structure was also observed. A three-dimensional model was obtained by homology modelling using the human placental AP scaffold. Cod...

  14. Sequence-specific inhibition of duck hepatitis B virus reverse transcription by peptide nucleic acids (PNA)

    DEFF Research Database (Denmark)

    Robaczewska, Magdalena; Narayan, Ramamurthy; Seigneres, Beatrice

    2005-01-01

    BACKGROUND/AIMS: Peptide nucleic acids (PNAs) appear as promising new antisense agents, that have not yet been examined as hepatitis B virus (HBV) inhibitors. Our aim was to study the ability of PNAs targeting the duck HBV (DHBV) encapsidation signal epsilon to inhibit reverse transcription (RT...

  15. KSHV 2.0: a comprehensive annotation of the Kaposi's sarcoma-associated herpesvirus genome using next-generation sequencing reveals novel genomic and functional features.

    Directory of Open Access Journals (Sweden)

    Carolina Arias

    2014-01-01

    Productive herpesvirus infection requires a profound, time-controlled remodeling of the viral transcriptome and proteome. To gain insights into the genomic architecture and gene expression control in Kaposi's sarcoma-associated herpesvirus (KSHV), we performed a systematic genome-wide survey of viral transcriptional and translational activity throughout the lytic cycle. Using mRNA-sequencing and ribosome profiling, we found that transcripts encoding lytic genes are promptly bound by ribosomes upon lytic reactivation, suggesting their regulation is mainly transcriptional. Our approach also uncovered new genomic features such as ribosome occupancy of viral non-coding RNAs, numerous upstream and small open reading frames (ORFs), and unusual strategies to expand the virus coding repertoire that include alternative splicing, dynamic viral mRNA editing, and the use of alternative translation initiation codons. Furthermore, we provide a refined and expanded annotation of transcription start sites, polyadenylation sites, splice junctions, and initiation/termination codons of known and new viral features in the KSHV genomic space which we have termed KSHV 2.0. Our results represent a comprehensive genome-scale image of gene regulation during lytic KSHV infection that substantially expands our understanding of the genomic architecture and coding capacity of the virus.

  16. Hyperspectral analysis for qualitative and quantitative features related to acid mine drainage at a remediated open-pit mine

    Science.gov (United States)

    Davies, G.; Calvin, W. M.

    2015-12-01

    The exposure of pyrite to oxygen and water in mine waste environments is known to generate acidity and the accumulation of secondary iron minerals. Sulfates and secondary iron minerals associated with acid mine drainage (AMD) exhibit diverse spectral properties in the ultraviolet, visible and near-infrared regions of the electromagnetic spectrum. The use of hyperspectral imagery for identification of AMD mineralogy and contamination has been well studied. Fewer studies have examined the impacts of hydrologic variations on mapping AMD or the unique spectral signatures of mine waters. Open-pit mine lakes are an additional environmental hazard which have not been widely studied using imaging spectroscopy. A better understanding of AMD variation related to climate fluctuations and the spectral signatures of contaminated surface waters will aid future assessments of environmental contamination. This study examined the ability of multi-season airborne hyperspectral data to identify the geochemical evolution of substances and contaminant patterns at the Leviathan Mine Superfund site. The mine is located 24 miles southeast of Lake Tahoe and contains remnant tailings piles and several AMD collection ponds. The objectives were to 1) distinguish temporal changes in mineralogy at the remediated open-pit sulfur mine, 2) identify the absorption features of mine-affected waters, and 3) quantitatively link water spectra to known dissolved iron concentrations. Images from NASA's AVIRIS instrument were collected in the spring, summer, and fall seasons for two consecutive years at Leviathan (HyspIRI campaign). Images had a spatial resolution of 15 meters at nadir. Ground-based surveys using the ASD FieldSpecPro spectrometer and laboratory spectral and chemical analysis complemented the remote sensing data. Temporal changes in surface mineralogy were difficult to distinguish. However, seasonal changes in pond water quality were identified. Dissolved ferric iron and chlorophyll

  17. Kinetic investigation of a solvent-free, chemoenzymatic reaction sequence towards enantioselective synthesis of a β-amino acid ester.

    Science.gov (United States)

    Strompen, Simon; Weiss, Markus; Ingram, Thomas; Smirnova, Irina; Gröger, Harald; Hilterhaus, Lutz; Liese, Andreas

    2012-06-01

    A solvent-free, chemoenzymatic reaction sequence for the enantioselective synthesis of β-amino acid esters has been kinetically and thermodynamically characterized. The coupled sequence comprises a thermal aza-Michael addition of cheap starting materials and a lipase catalyzed aminolysis for the kinetic resolution of the racemic ester. Excellent ee values of >99% were obtained for the β-amino acid ester at 60% conversion. Kinetic constants for the aza-Michael addition were obtained by straightforward numerical integration of second-order rate equations and nonlinear fitting of the progress curves. A different strategy had to be devised for the biocatalytic reaction. Initially, a simplified Michaelis-Menten model including product inhibition was developed for the reaction running in THF as an organic solvent. Activity based parameters were used instead of concentrations in order to facilitate the transfer of the kinetic model to the solvent-free system. Observed solvent effects not accounted for by the use of thermodynamic activities were incorporated into the kinetic model. Enzyme deactivation was observed to depend on the ratio of the applied substrates and also included in the kinetic model. The developed simple model is in very good agreement with the experimental data and allows the simulation and optimization of the solvent-free process.
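
    The abstract mentions numerical integration of second-order rate equations for the aza-Michael step. A minimal sketch of that idea for a generic bimolecular reaction A + B → P with rate k·[A]·[B], using a hand-written fourth-order Runge-Kutta step; the rate constant, concentrations and function name are hypothetical, and this is not the authors' model (which also covers the lipase step, product inhibition and enzyme deactivation).

```python
def simulate_second_order(a0, b0, k, t_end, steps=1000):
    """Integrate dA/dt = dB/dt = -k*A*B with classical RK4; returns (A, B, P) at t_end."""
    dt = t_end / steps
    a, b = a0, b0

    def rate(conc_a, conc_b):
        # Second-order rate law: first order in each reactant.
        return k * conc_a * conc_b

    for _ in range(steps):
        r1 = rate(a, b)
        r2 = rate(a - 0.5 * dt * r1, b - 0.5 * dt * r1)
        r3 = rate(a - 0.5 * dt * r2, b - 0.5 * dt * r2)
        r4 = rate(a - dt * r3, b - dt * r3)
        consumed = dt * (r1 + 2 * r2 + 2 * r3 + r4) / 6.0
        a -= consumed
        b -= consumed
    return a, b, a0 - a


if __name__ == "__main__":
    # Hypothetical values: equal 1.0 mol/L starting concentrations, k = 1.0 L/(mol*h).
    a, b, p = simulate_second_order(a0=1.0, b0=1.0, k=1.0, t_end=1.0)
    # For equal starting concentrations the analytical solution is A = A0 / (1 + k*A0*t).
    print(a, 1.0 / (1.0 + 1.0 * 1.0 * 1.0))
```

    In a fitting workflow such as the one described, a simulation like this would be wrapped in a nonlinear least-squares routine that adjusts k to match measured progress curves.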

  18. Phylogenetic analysis of dicyemid mesozoans (phylum Dicyemida) from innexin amino acid sequences: dicyemids are not related to Platyhelminthes.

    Science.gov (United States)

    Suzuki, Takahito G; Ogino, Kazutoyo; Tsuneki, Kazuhiko; Furuya, Hidetaka

    2010-06-01

    Dicyemid mesozoans are endoparasites, or endosymbionts, found only in the renal sac of benthic cephalopod molluscs. The body organization of dicyemids is very simple, consisting of usually 10 to 40 cells, with neither body cavities nor differentiated organs. Dicyemids were considered as primitive animals, and the out-group of all metazoans, or as occupying a basal position of lophotrochozoans close to flatworms. We cloned cDNAs encoding for the gap junction component proteins, innexin, from the dicyemids. Its expression pattern was observed by whole-mount in situ hybridization. In adult individuals, the innexin was expressed in calottes, infusorigens, and infusoriform embryos. The unique temporal pattern was observed in the developing infusoriform embryos. Innexin amino acid sequences had taxon-specific indels which enabled identification of the 3 major protostome lineages, i.e., 2 ecdysozoans (arthropods and nematodes) and the lophotrochozoans. The dicyemids show typical, lophotrochozoan-type indels. In addition, the Bayesian and maximum likelihood trees based on the innexin amino acid sequences suggested dicyemids to be more closely related to the higher lophotrochozoans than to the flatworms. Flatworms were the sister group, or consistently basal, to the other lophotrochozoan clade that included dicyemids, annelids, molluscs, and brachiopods.

  19. Identification of a novel HMW glutenin subunit and comparison of its amino acid sequence with those of homologous subunits

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Aegilops tauschii is the donor of the D genome of common wheat (Triticum aestivum). Genetic variation of HMW glutenin subunits encoded by the Glu-1Dt locus of Ae. tauschii has been found to be higher than that specified by the Glu-1D locus in common wheat. In the present note, we report the identification of a novel HMW glutenin subunit, Dy13t, from Ae. tauschii. The newly identified subunit possessed an electrophoretic mobility that was faster than that of the Dy12 subunit of common wheat. The complete ORF encoding the Dy13t subunit contained 624 codons (excluding the stop codons). The amino acid sequence deduced from the Dy13t gene ORF was the shortest among those of the previously reported subunits derived from the D genome. A further comparison of the Dy13t amino acid sequence with those of the subunits characterized from the A, B, D, R genomes of Triticeae showed that the smaller size of the Dy13t subunit was associated with a reduction in the size of its repetitive domain.

  20. Amino Acid sequence analysis of the two major outer Capsid Proteins (VP7 and VP4) from human-derived canine G3P[3] Rotavirus Strain Detected in Brazil

    Directory of Open Access Journals (Sweden)

    Adriana Luchs

    2013-12-01

    Introduction: A close look at the rotavirus group A (RVA) genotypes in Brazil revealed the detection of a rare G3P[3] strain closely related to canine strains. The aim of this study was to add to the already known genetic analysis by the description of the G3P[3] (IAL-R2638) strain amino acid characteristics. Methods: Amino acid sequence analysis and protein-based trees were conducted using BioEdit and MEGA 4.0. Results: The VP7 and VP4 proteins of the IAL-R2638 strain displayed the highest amino acid identity to the canine-derived human strain HCR3A (99.2%), and to the canine strain RV52/96 (96.4%), respectively. The IAL-R2638 strain did not possess the extra VP7 N-linked glycosylation site at amino acid 238 recently described for some G3 strains, as is also the case for the RotaTeq™ G3 vaccine strain. The topology exhibited by phylogenetic trees in previous analyses was maintained in the present amino acid-based trees, reinforcing a stable relationship between G3P[3] strains. Conclusions: Amino acid analysis data were consistent with the previous sequence data obtained for the IAL-R2638 strain, supporting its possible canine origin. Theoretically, the RotaTeq™ vaccine could efficiently protect against G3P[3] infections based on the lack of the extra VP7 N-linked glycosylation site at amino acid 238. Phylogenetic analysis hypothesizes that all features undergo evolution independently of each other; however, unfavorable effects of nucleotide substitutions may be compensated by substitutions in other positions. The present study raises the question as to whether the amino acid-based trees could be applied as an approach to the study of RVA evolution, avoiding incorrect phylogenetic reconstructions.

  1. The complete amino acid sequence of the major component myoglobin from the arctic minke whale, Balaenoptera acutorostrata.

    Science.gov (United States)

    Lehman, L D; Dwulet, F E; Bogardt, R A; Jones, B N; Gurd, F R

    1977-02-22

    The complete primary structure of the major component myoglobin from the Arctic minke whale, Balaenoptera acutorostrata, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at the two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage of the methylacetimidated apomyoglobin at the three arginine residues with trypsin. The further digestion of the central cyanogen bromide peptide with trypsin and S. aureus strain V8 protease enabled the determination of the remainder of the covalent structure. This myoglobin differs from that of the dwarf sperm whale, Kogia simus, at 16 positions, and the common dolphin, Delphinus delphis, at 14 positions, from that of the common porpoise, Phocaena phocaena, and the bottlenosed dolphin, Tursiops truncatus, at 13 positions, from that of the Amazon River dolphin, Inia geoffrensis, at 10 positions, and from that of the California gray whale, Eschrichtius gibbosus, at 3 positions. All of the substitutions observed in this sequence fit easily into the three-dimensional structure of the sperm whale myoglobin.

  2. Amino acid sequence and disulfide bond assignment of myotoxin a isolated from the venom of prairie rattlesnake (Crotalus viridis viridis)

    Energy Technology Data Exchange (ETDEWEB)

    Fox, J.W.; Elzinga, M.; Tu, A.T.

    1979-02-20

    The primary structure of myotoxin a, a myotoxin protein from the venom of the North American rattlesnake Crotalus viridis viridis, was determined and the position of the disulfide bonds assigned. The toxin was isolated, carboxymethylated, and cleaved by cyanogen bromide, and the resultant peptides were isolated. The cyanogen bromide peptides were subjected to amino acid sequence analysis. In order to assign the positions of the three disulfide bonds, the native toxin was cleaved sequentially with cyanogen bromide and trypsin. A two peptide unit connected by one disulfide bond was isolated and characterized, and a three-peptide unit connected by two disulfide bonds was isolated. One peptide in the three-peptide unit was identified as Cys-Cys-Lys. In order to establish the linkages between the peptides and Cys-Cys-Lys, one cycle of Edman degradation was carried out such that the Cys-Cys bond was cleaved. Upon isolation and analysis of the cleavage products, the disulfide bonds connecting the three peptides were determined. The positions of the disulfide bridges of myotoxin a were determined to be totally different from those of neurotoxins isolated from snake venoms. The sequence of myotoxin a was compared with the sequences of other snake venom toxins using the computer program RELATE to determine whether myotoxin a is similar to any other types of toxins. From the computer analysis, myotoxin a did not show any close relationship to other toxins except crotamine from the South American rattlesnake Crotalus durissus terrificus.

  3. Detection of DBD-carbamoyl amino acids in amino acid sequence and D/L configuration determination of peptides with fluorogenic Edman reagent 7-[(N,N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate.

    Science.gov (United States)

    Huang, Y; Matsunaga, H; Toriba, A; Santa, T; Fukushima, T; Imai, K

    1999-06-01

    A method for amino acid sequence and D/L configuration identification of peptides using the fluorogenic Edman reagent 7-[(N,N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate (DBD-NCS) has been developed. The method is based on the Edman degradation principle with some modifications. A peptide or protein was coupled with DBD-NCS under basic conditions and then cyclized/cleaved to produce a DBD-thiazolinone (TZ) derivative by BF3, a Lewis acid, which could significantly suppress amino acid racemization. The liberated DBD-TZ amino acid was hydrolyzed to the DBD-thiocarbamoyl (TC) amino acid under weakly acidic conditions and then oxidized by NaNO2/H+ to the DBD-carbamoyl (CA) amino acid, which was stable and had a strong fluorescence intensity. The individual DBD-CA amino acids were separated by reversed-phase high-performance liquid chromatography (RP-HPLC) for amino acid sequencing, and their enantiomers were resolved by chiral stationary-phase HPLC to identify their D/L configurations. By combining the two HPLC systems, the amino acid sequence and D/L configuration of peptides could be determined. This method will be useful for searching for D-amino-acid-containing peptides in animals.

  4. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    Directory of Open Access Journals (Sweden)

    Kathleen S. Rein

    2008-05-01

    Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genera Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid-producing dinoflagellates and have previously been implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA-producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or to those derived from the other Alveolates Karenia brevis and Cryptosporidium parvum.

  5. Researches on Sequence of Plant Cystatin: Phytocystatin

    Institute of Scientific and Technical Information of China (English)

    QIN Qingfeng; HE Wei; LIANG Jun; ZHANG Xingyao

    2005-01-01

    Plant cystatins, or phytocystatins, are cysteine proteinase inhibitors that exist widely in different plant species. Because they can kill insects by inhibiting the digestive function of cysteine proteinases in the gut, they are believed to play an important role in plant defense against pests. Phytocystatins contain the conserved QXVXG motif and show some sequence features that differ from those of animal cystatins. Through direct protein sequencing and sequencing of cDNA clones, a large number of plant cystatins have been characterized. A multiple alignment with BLAST software and a detailed analysis of 38 phytocystatins show that phytocystatins possess a specific conserved amino acid sequence, [LRVI]-[AGT]-[RQKE]-[FY]-[AS]-[VI]-X-[EGHDQV]-[HYFQ]-N, that differs from the conserved sequence demonstrated by Margis in 1998. This conserved sequence may be sufficient to detect phytocystatin sequences exclusively in protein data banks. The phytocystatins are classified into 3 groups according to their amino acid sequence features, and group I can be further divided into 3 subgroups based on features of their amino acid and genomic sequences. Using CLUSTALX software, the most conserved nucleotide sequences of phytocystatins were found, which could be used to design degenerate primers to search for new phytocystatins by PCR.
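
    The conserved sequence quoted in this abstract is written in a PROSITE-like notation, so it can be turned into a regular expression for a quick scan of protein sequences. The following is a minimal sketch, not the authors' software: the function names and the toy sequence are hypothetical, and 'X' is assumed to mean "any residue".

```python
import re

# Conserved phytocystatin pattern as quoted in the abstract (PROSITE-like notation).
PATTERN = "[LRVI]-[AGT]-[RQKE]-[FY]-[AS]-[VI]-X-[EGHDQV]-[HYFQ]-N"


def pattern_to_regex(pattern):
    """Translate the dash-separated pattern into a compiled regular expression.

    Assumption: 'X' stands for any single residue. Bracketed groups such as
    [LRVI] are already valid regex character classes and are kept unchanged.
    """
    parts = []
    for element in pattern.split("-"):
        parts.append("." if element.upper() == "X" else element)
    return re.compile("".join(parts))


def find_motif(sequence, regex):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in regex.finditer(sequence.upper())]


MOTIF_RE = pattern_to_regex(PATTERN)

# Hypothetical toy sequence built to contain one match; it is not a real protein.
toy_sequence = "MSTQ" + "LARFAVDEHN" + "KKEG"
hits = find_motif(toy_sequence, MOTIF_RE)
print(hits)
```

    A regular expression only reports exact pattern matches; whether the pattern is truly exclusive to phytocystatins would have to be checked against a real protein database.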

  6. Hyperintense HCC on hepatobiliary phase images of gadoxetic acid-enhanced MRI: Correlation with clinical and pathological features

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Ja Young [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Kim, Myeong-Jin, E-mail: kimnex@yuhs.ac [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Kim, Kyung Ah; Jeong, Hyeon Tae [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Park, Young Nyun [Department of Pathology, Yonsei University Severance Hospital, Seoul (Korea, Republic of)

    2012-12-15

    Purpose: To retrospectively determine whether the hyperintense hepatocellular carcinomas (HCCs) seen on the hepatobiliary phase of gadoxetic acid-enhanced MR imaging (EOB-MRI) might have different histologic characteristics from usual hypointense HCCs. Materials and methods: Two hundred three surgically proven HCCs from 192 patients who underwent preoperative EOB-MRI were analyzed. The demographic and histologic characteristics of hyperintense HCCs were compared with usual hypointense HCCs by using the t-test or Fisher's exact test. Results: By visual assessment, 18 (8.8%) tumors were classified as hyperintense HCCs. Patients with hyperintense HCC were significantly (p < 0.05) older (60.1 vs. 55.2 years) than those with hypointense HCCs. Hyperintense HCCs showed significantly lower rate of microvascular invasion (27.8% vs. 53.5%) and significantly higher rate of peliosis (61.1% vs. 30.8%). Hyperintense HCCs were more frequently expanding type, and none showed infiltrative type or scirrhous histologic pattern. Conclusions: Hyperintense HCCs seem to have clinical and histologic features that might be related with more favorable outcomes.

  7. Electron microscopic features of brain edema in rodent cerebral malaria in relation to glial fibrillary acidic protein expression.

    Science.gov (United States)

    Ampawong, Sumate; Chaisri, Urai; Viriyavejakul, Parnpen; Nontprasert, Apichart; Grau, Georges E; Pongponratn, Emsri

    2014-01-01

    The mechanisms leading to cerebral malaria (CM) are not completely understood. Brain edema has been suggested as having an important role in experimental CM. In this study, CBA/CaH mice were infected with blood-stage Plasmodium berghei ANKA, and when typical symptoms of CM developed on day 7, brain tissues were processed for electron-microscopic and immunohistochemical studies. The study demonstrated ultrastructural hallmarks of cerebral edema in the form of perivascular edema and astroglial dilatation, confirming existing evidence of vasogenic and cytogenic edema. This correlates closely with the clinical features of CM. An adaptive response of astrocytic activity, represented by increased glial fibrillary acidic protein (GFAP) expression in the perivascular area and increased numbers of large astrocyte clusters, was predominantly found in the CM mice. The presence of multivesicular and lamellar bodies indicates the severity of cerebral damage in experimental CM. Congestion of the microvessels with occluded white blood cells (WBCs), parasitized red blood cells (PRBCs) and platelets is also a crucial covariate in CM pathogenesis.

  8. iTriplet, a rule-based nucleic acid sequence motif finder

    Directory of Open Access Journals (Sweden)

    Gunderson Samuel I

    2009-10-01

    Background: With the advent of high-throughput sequencing techniques, large amounts of sequencing data are readily available for analysis. Natural biological signals are intrinsically highly variable, making their complete identification a computationally challenging problem. Many attempts using statistical or combinatorial approaches have been made with great success in the past. However, identifying highly degenerate and long (>20 nucleotides) motifs remains an unmet challenge, as high degeneracy diminishes the statistical significance of biological signals and increasing motif size causes combinatorial explosion. In this report, we present a novel rule-based method that is focused on finding degenerate and long motifs. Our proposed method, named iTriplet, avoids the costly enumeration present in existing combinatorial methods and is amenable to parallel processing. Results: We have conducted a comprehensive assessment of the performance and sensitivity-specificity of iTriplet in analyzing artificial and real biological sequences in various genomic regions. The results show that iTriplet is able to solve challenging cases. Furthermore, we have confirmed the utility of iTriplet by showing, with a dual luciferase reporter assay, that it accurately predicts polyA-site-related motifs. Conclusion: iTriplet is a novel rule-based combinatorial or enumerative motif-finding method that is able to process highly degenerate and long motifs that have resisted analysis by other methods. In addition, iTriplet is distinguished from other methods of the same family by its parallelizability, which allows it to leverage the power of today's readily available high-performance computing systems.

  9. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    Science.gov (United States)

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.
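
    The percentages in this abstract can be checked with a few lines of arithmetic. The sketch below recomputes the percent identity from the reported counts and then, under two assumptions that are not stated in the abstract (the quoted rate applies to each lineage separately, and no correction is made for multiple substitutions at one site), derives the divergence time that the quoted rate would imply.

```python
TOTAL_RESIDUES = 247  # sequence length reported in the abstract
IDENTICAL = 207       # residues identical to the rabbit muscle enzyme

# Rate quoted in the abstract: substitutions per 100 residues per 100 million years.
RATE_PER_100_RES_PER_100_MYR = 2.0


def percent_identity(identical, total):
    """Percentage of aligned positions that are identical."""
    return 100.0 * identical / total


def differences_per_100(identical, total):
    """Observed differences per 100 residues (uncorrected)."""
    return 100.0 * (total - identical) / total


identity = percent_identity(IDENTICAL, TOTAL_RESIDUES)
diff_per_100 = differences_per_100(IDENTICAL, TOTAL_RESIDUES)

# Assumption: differences accumulate along both lineages since the split,
# so the total elapsed branch length is twice the divergence time.
implied_split_myr = diff_per_100 / (2.0 * RATE_PER_100_RES_PER_100_MYR) * 100.0

print(f"identity: {identity:.1f}%")
print(f"differences per 100 residues: {diff_per_100:.1f}")
print(f"implied divergence time: {implied_split_myr:.0f} million years")
```

    The implied divergence time is only as good as the two assumptions above; it is shown to make the meaning of the quoted rate concrete, not as a result from the paper.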

  10. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    Science.gov (United States)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids, and evolutionary trees are derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins: ferredoxin and cytochrome c.

  11. Comparative RNA-Sequence Transcriptome Analysis of Phenolic Acid Metabolism in Salvia miltiorrhiza, a Traditional Chinese Medicine Model Plant

    Directory of Open Access Journals (Sweden)

    Zhenqiao Song

    2017-01-01

    Salvia miltiorrhiza Bunge is an important traditional Chinese medicine (TCM). In this study, two S. miltiorrhiza genotypes (BH18 and ZH23) with different phenolic acid concentrations were used for de novo RNA sequencing (RNA-seq). A total of 170,787 transcripts and 56,216 unigenes were obtained. There were 670 differentially expressed genes (DEGs) identified between BH18 and ZH23, 250 of which were upregulated in ZH23, with genes involved in the phenylpropanoid biosynthesis pathway being the most upregulated genes. Nine genes involved in the lignin biosynthesis pathway were upregulated in BH18 and thus result in higher lignin content in BH18. However, expression profiles of most genes involved in the core common upstream phenylpropanoid biosynthesis pathway were higher in ZH23 than that in BH18. These results indicated that genes involved in the core common upstream phenylpropanoid biosynthesis pathway might play an important role in downstream secondary metabolism and demonstrated that lignin biosynthesis was a putative partially competing pathway with phenolic acid biosynthesis. The results of this study expanded our understanding of the regulation of phenolic acid biosynthesis in S. miltiorrhiza.

  12. Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

    Directory of Open Access Journals (Sweden)

    Zijian Li

    2014-02-01

    Amino acid substitutions in influenza A virus, which result from non-synonymous mutations in the viral genome, are the main reasons for both antigenic shift and virulence change. Nucleocapsid protein (NP), one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digests of the nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1), which was isolated and purified by SDS polyacrylamide gel electrophoresis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues, R452K, T423A and N430T, in two tryptic peptides. The results provide experimental evidence that amino acid substitutions resulting from non-synonymous gene mutations can be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.
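
    A substitution becomes visible in a mass spectrum because the two residues differ in mass. As a rough illustration (not the authors' workflow), the sketch below parses the three substitution labels from the abstract and computes the expected monoisotopic mass shift of the affected peptide. The function names are hypothetical, and the table holds standard monoisotopic residue masses for only the five residues involved.

```python
import re

# Standard monoisotopic residue masses in daltons (amino acid minus water),
# listed only for the residues that appear in the three substitutions.
RESIDUE_MASS = {
    "R": 156.10111,  # arginine
    "K": 128.09496,  # lysine
    "T": 101.04768,  # threonine
    "A": 71.03711,   # alanine
    "N": 114.04293,  # asparagine
}


def parse_substitution(label):
    """Split a label such as 'R452K' into (original, position, replacement)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def mass_shift(label):
    """Expected change in peptide monoisotopic mass caused by the substitution."""
    original, _position, replacement = parse_substitution(label)
    return RESIDUE_MASS[replacement] - RESIDUE_MASS[original]


shifts = {label: mass_shift(label) for label in ("R452K", "T423A", "N430T")}
for label, delta in shifts.items():
    print(f"{label}: {delta:+.5f} Da")
```

    In practice the shift would be observed on a specific peptide ion and divided by its charge state; this sketch covers only the neutral mass difference.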

  13. Comparative RNA-Sequence Transcriptome Analysis of Phenolic Acid Metabolism in Salvia miltiorrhiza, a Traditional Chinese Medicine Model Plant

    Science.gov (United States)

    Song, Zhenqiao; Guo, Linlin; Liu, Tian; Lin, Caicai; Wang, Jianhua

    2017-01-01

    Salvia miltiorrhiza Bunge is an important traditional Chinese medicine (TCM). In this study, two S. miltiorrhiza genotypes (BH18 and ZH23) with different phenolic acid concentrations were used for de novo RNA sequencing (RNA-seq). A total of 170,787 transcripts and 56,216 unigenes were obtained. There were 670 differentially expressed genes (DEGs) identified between BH18 and ZH23, 250 of which were upregulated in ZH23, with genes involved in the phenylpropanoid biosynthesis pathway being the most upregulated genes. Nine genes involved in the lignin biosynthesis pathway were upregulated in BH18 and thus result in higher lignin content in BH18. However, expression profiles of most genes involved in the core common upstream phenylpropanoid biosynthesis pathway were higher in ZH23 than that in BH18. These results indicated that genes involved in the core common upstream phenylpropanoid biosynthesis pathway might play an important role in downstream secondary metabolism and demonstrated that lignin biosynthesis was a putative partially competing pathway with phenolic acid biosynthesis. The results of this study expanded our understanding of the regulation of phenolic acid biosynthesis in S. miltiorrhiza. PMID:28194403

  14. Main: Sequences [KOME

    Lifescience Database Archive (English)

    Sequences Amino Acid Sequence Amino Acid sequence of full length cDNA (Longest ORF) kome_ine_full_seq...uence_amino_db.fasta.zip kome_ine_full_sequence_amino_db.zip kome_ine_full_sequence_amino_db ...

  15. Large-Scale Conformational Transitions and Dimerization Are Encoded in the Amino-Acid Sequences of Hsp70 Chaperones

    Science.gov (United States)

    Malinverni, Duccio; Marsili, Simone; Barducci, Alessandro; De Los Rios, Paolo

    2015-01-01

    Hsp70s are a class of ubiquitous and highly conserved molecular chaperones playing a central role in the regulation of proteostasis in the cell. Hsp70s assist a myriad of cellular processes by binding unfolded or misfolded substrates during a complex biochemical cycle involving large-scale structural rearrangements. Here we show that an analysis of coevolution at the residue level fully captures the characteristic large-scale conformational transitions of this protein family, and predicts an evolutionarily conserved (and thus functional) homo-dimeric arrangement. Furthermore, we highlight that the features encoding the Hsp70 dimer are more conserved in bacterial than in eukaryotic sequences, suggesting that the known Hsp70/Hsp110 hetero-dimer is a eukaryotic specialization built on a pre-existing template. PMID:26046683

  16. Genome sequence of thermotolerant Bacillus methanolicus: features and regulation related to methylotrophy and production of L-lysine and L-glutamate from methanol.

    Science.gov (United States)

    Heggeset, Tonje M B; Krog, Anne; Balzer, Simone; Wentzel, Alexander; Ellingsen, Trond E; Brautaset, Trygve

    2012-08-01

    Bacillus methanolicus can utilize methanol as its sole carbon and energy source, and the scientific interest in this thermotolerant bacterium has focused largely on exploring its potential as a biocatalyst for the conversion of methanol into L-lysine and L-glutamate. We present here the genome sequences of the important B. methanolicus model strain MGA3 (ATCC 53907) and the alternative wild-type strain PB1 (NCIMB13113). The physiological diversity of these two strains was demonstrated by a comparative fed-batch methanol cultivation displaying highly different methanol consumption and respiration profiles, as well as major differences in their L-glutamate production levels (406 mmol liter(-1) and 11 mmol liter(-1), respectively). Both genomes are small (ca 3.4 Mbp) compared to those of other related bacilli, and MGA3 has two plasmids (pBM19 and pBM69), while PB1 has only one (pBM20). In particular, we focus here on genes representing biochemical pathways for methanol oxidation and concomitant formaldehyde assimilation and dissimilation, the important phosphoenol pyruvate/pyruvate anaplerotic node, the tricarboxylic acid cycle including the glyoxylate pathway, and the biosynthetic pathways for L-lysine and L-glutamate. Several unique findings were made, including the discovery of three different methanol dehydrogenase genes in each of the two B. methanolicus strains, and the genomic analyses were accompanied by gene expression studies. Our results provide new insight into a number of peculiar physiological and metabolic traits of B. methanolicus and open up possibilities for system-level metabolic engineering of this bacterium for the production of amino acids and other useful compounds from methanol.

  17. Complete amino acid sequence of Mytilus anterior byssus retractor paramyosin and its putative phosphorylation site.

    Science.gov (United States)

    Watabe, S; Iwasaki, K; Funabara, D; Hirayama, Y; Nakaya, M; Kikuchi, K

    2000-01-01

    A cDNA encoding the full-length paramyosin molecule was cloned from the mussel Mytilus galloprovincialis, a species closely related to Mytilus edulis. It contained 3,497 nucleotides (nt), with 79 and 826 nt for the 5' and 3' non-coding regions, respectively. The coding region was composed of 2,592 nt for 864 amino acid residues, a size typical of paramyosin. While genomic DNA digests with either HindIII or PstI exhibited a single band when hybridized with a SacI fragment of paramyosin cDNA, the digests with either EcoRV or EcoRI showed two bands, suggesting that the mussel has at least two genes encoding paramyosin. The mRNAs encoding paramyosin were most abundant in muscle tissues from byssus retractor and adductor muscles. Only traces of paramyosin transcripts were found in the tissue of foot, gill, inner mantle, and outer mantle. The same phosphorylatable peptide previously reported for paramyosin from the bivalve Mercenaria mercenaria, Ser-Arg-Ser-Met-Ser(P)-Val-Ser-Arg (Watabe et al. 1989. Comp Biochem Physiol 94B:813-821) was found in the C-terminal non-helical part of this Mytilus paramyosin. We predict that this particular paramyosin has a coiled-coil structure composed of two alpha-helices that show the heptad repeats (a-b-c-d-e-f-g) with further 28-amino acid repeat zones, where a and d tend to be occupied by nonpolar residues.
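
    Two statements in this abstract lend themselves to a quick check: the length bookkeeping of the cDNA, and the heptad rule that positions a and d of a coiled coil tend to be nonpolar. The sketch below verifies the first with the reported numbers and illustrates the second on a hypothetical idealized sequence. The residue set treated as nonpolar, the function names, and the toy sequence are all assumptions made for illustration; the real paramyosin sequence is not used.

```python
# Length bookkeeping with the values reported in the abstract.
UTR5_NT, CODING_NT, UTR3_NT = 79, 2592, 826
TOTAL_NT = UTR5_NT + CODING_NT + UTR3_NT   # should equal the reported 3,497 nt
RESIDUES = CODING_NT // 3                  # should equal the reported 864 residues

HEPTAD = "abcdefg"
NONPOLAR = set("AVLIMFWC")  # assumption: one common choice of nonpolar residues


def heptad_register(length, start="a"):
    """Assign heptad positions a..g to consecutive residues, starting at `start`."""
    offset = HEPTAD.index(start)
    return [HEPTAD[(offset + i) % 7] for i in range(length)]


def nonpolar_fraction_by_position(sequence, start="a"):
    """Fraction of nonpolar residues found at each heptad position."""
    hits = {p: 0 for p in HEPTAD}
    totals = {p: 0 for p in HEPTAD}
    for residue, position in zip(sequence, heptad_register(len(sequence), start)):
        totals[position] += 1
        if residue in NONPOLAR:
            hits[position] += 1
    return {p: (hits[p] / totals[p] if totals[p] else 0.0) for p in HEPTAD}


# Hypothetical idealized coiled-coil segment: Leu at a and d, charged elsewhere.
toy_segment = "LKKLEEK" * 4   # 28 residues, i.e. four heptads
fractions = nonpolar_fraction_by_position(toy_segment)
print(TOTAL_NT, RESIDUES)
print(fractions)
```

    On a real sequence the heptad register is not known in advance and can shift along the chain, so a practical analysis would have to search over start positions or use a dedicated coiled-coil predictor.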

  18. Using Triple Helix Forming Peptide Nucleic Acids for Sequence-selective Recognition of Double-stranded RNA

    Science.gov (United States)

    Hnedzko, Dziyana; Cheruiyot, Samwel K.; Rozners, Eriks

    2014-01-01

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple helix forming peptide nucleic acids (PNAs) that bind in the major groove of the RNA helix. The strategy uses chemically modified nucleobases, such as 2-aminopyridine (M), which enables strong triple helical binding at physiologically relevant conditions, and 2-pyrimidinone (P) and 3-oxo-2,3-dihydropyridazine (E), which enable recognition of isolated pyrimidines in the purine-rich strand of the RNA duplex. Detailed protocols for preparation of modified PNA monomers, solid-phase synthesis and HPLC purification of PNA oligomers, and measuring dsRNA binding affinity using isothermal titration calorimetry are included. PMID:25199637

  19. Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

    Science.gov (United States)

    Freimuth, Paul I.

    2010-04-06

    The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.

  20. Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins.

    Science.gov (United States)

    Carpena, Pedro; Bernaola-Galván, Pedro A; Carretero-Campos, Concepción; Coronado, Ana V

    2016-11-01

    Symbolic sequences have been extensively investigated in the past few years within the framework of statistical physics. Paradigmatic examples of such sequences are written texts, and deoxyribonucleic acid (DNA) and protein sequences. In these examples, the spatial distribution of a given symbol (a word, a DNA motif, an amino acid) is a key property usually related to the symbol importance in the sequence: The more uneven and far from random the symbol distribution, the higher the relevance of the symbol to the sequence. Thus, many techniques of analysis measure in some way the deviation of the symbol spatial distribution with respect to the random expectation. The problem is then to know the spatial distribution corresponding to randomness, which is typically considered to be either the geometric or the exponential distribution. However, these distributions are only valid for very large symbolic sequences and for many occurrences of the analyzed symbol. Here, we obtain analytically the exact, randomly expected spatial distribution valid for any sequence length and any symbol frequency, and we study its main properties. The knowledge of the distribution allows us to define a measure able to properly quantify the deviation from randomness of the symbol distribution, especially for short sequences and low symbol frequency. We apply the measure to the problem of keyword detection in written texts and to study amino acid clustering in protein sequences. In texts, we show how the results improve with respect to previous methods when short texts are analyzed. In proteins, which are typically short, we show how the measure quantifies unambiguously the amino acid clustering and characterize its spatial distribution.
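
    The contrast this abstract draws between the geometric approximation and an exact finite-size distribution can be illustrated with a simple counting model. The sketch below assumes a sequence of length N in which exactly n copies of a symbol sit at uniformly random positions, with no periodic boundary. Under that assumption a standard combinatorial argument gives P(d) = C(N-d, n-1) / C(N, n) for the distance d between consecutive occurrences. This is an illustrative derivation and may not match the form or the assumptions used by the authors; the function names are hypothetical.

```python
from math import comb


def exact_gap_pmf(d, N, n):
    """Probability that two consecutive occurrences are d positions apart.

    Model (an assumption of this sketch): exactly n occurrences placed uniformly
    at random among N positions. Counting all placements in which a pair of
    consecutive occurrences is separated by d gives C(N - d, n - 1) / C(N, n).
    """
    if n < 2 or d < 1 or d > N - n + 1:
        return 0.0
    return comb(N - d, n - 1) / comb(N, n)


def geometric_gap_pmf(d, N, n):
    """Large-sequence approximation: geometric distribution with p = n / N."""
    if d < 1:
        return 0.0
    p = n / N
    return p * (1.0 - p) ** (d - 1)


# Small example where finite-size effects are easy to see.
N, n = 5, 2
exact = [exact_gap_pmf(d, N, n) for d in range(1, N - n + 2)]
approx = [geometric_gap_pmf(d, N, n) for d in range(1, N - n + 2)]
print("exact:    ", exact)
print("geometric:", approx)
```

    For N = 5 and n = 2 the exact values can be confirmed by listing all ten placements by hand; the geometric approximation puts probability on distances that cannot occur in a sequence this short.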

  1. Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins

    Science.gov (United States)

    Carpena, Pedro; Bernaola-Galván, Pedro A.; Carretero-Campos, Concepción; Coronado, Ana V.

    2016-11-01

    Symbolic sequences have been extensively investigated in the past few years within the framework of statistical physics. Paradigmatic examples of such sequences are written texts, and deoxyribonucleic acid (DNA) and protein sequences. In these examples, the spatial distribution of a given symbol (a word, a DNA motif, an amino acid) is a key property usually related to the symbol importance in the sequence: The more uneven and far from random the symbol distribution, the higher the relevance of the symbol to the sequence. Thus, many techniques of analysis measure in some way the deviation of the symbol spatial distribution with respect to the random expectation. The problem is then to know the spatial distribution corresponding to randomness, which is typically considered to be either the geometric or the exponential distribution. However, these distributions are only valid for very large symbolic sequences and for many occurrences of the analyzed symbol. Here, we obtain analytically the exact, randomly expected spatial distribution valid for any sequence length and any symbol frequency, and we study its main properties. The knowledge of the distribution allows us to define a measure able to properly quantify the deviation from randomness of the symbol distribution, especially for short sequences and low symbol frequency. We apply the measure to the problem of keyword detection in written texts and to study amino acid clustering in protein sequences. In texts, we show how the results improve with respect to previous methods when short texts are analyzed. In proteins, which are typically short, we show how the measure quantifies unambiguously the amino acid clustering and characterize its spatial distribution.

  2. Anthropometric features extracted from calibrated image sequences by single camera

    Institute of Scientific and Technical Information of China (English)

    刘少华; 杜奎

    2013-01-01

    This paper proposes a method for extracting anthropometric parameters from calibrated image sequences captured by a single camera. The method reduces the mask image of the human body to a body line model in order to obtain the key points of the body, including the top of the head, the shoulder points, the center of gravity and the foot points. The real-world spatial coordinates of these key points are computed by back-projection, which yields the anthropometric parameters: stature, shoulder height, shoulder breadth and stride length. The experimental results show that the errors in the extracted parameters are small and that the method is practical.

  3. Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

    Science.gov (United States)

    Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

    2017-04-01

    The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes, among several other distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLP)-encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for a better understanding of the mechanisms involved in SLP functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.

  4. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

    Directory of Open Access Journals (Sweden)

    Leitner Dietmar

    2005-04-01

    Background: A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets, different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore, we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results: One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion: In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.
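
    A Chou-Fasman-style parameter is a ratio of frequencies: how often a state occurs in one context, divided by how often it occurs overall. The sketch below applies that idea to the cis conformation of Xaa-Pro bonds across secondary-structure classes. It is one plausible way to set up such a parameter, not necessarily the definition used in the paper, and the counts are invented purely to show the calculation.

```python
def cis_propensity(cis_in_class, total_in_class, cis_overall, total_overall):
    """Chou-Fasman-style propensity of the cis conformation in one class.

    Values above 1 mean cis Xaa-Pro bonds are over-represented in the class
    relative to the whole data set; values below 1 mean under-represented.
    """
    class_frequency = cis_in_class / total_in_class
    overall_frequency = cis_overall / total_overall
    return class_frequency / overall_frequency


# Hypothetical counts (cis bonds, all Xaa-Pro bonds) per secondary-structure
# class. These numbers are made up for illustration and are not from the paper.
counts = {
    "helix": (10, 1000),
    "strand": (60, 1000),
    "coil": (130, 2000),
}

cis_overall = sum(cis for cis, _total in counts.values())
total_overall = sum(total for _cis, total in counts.values())

propensities = {
    name: cis_propensity(cis, total, cis_overall, total_overall)
    for name, (cis, total) in counts.items()
}
for name, value in propensities.items():
    print(f"{name}: {value:.2f}")
```

    With real data the counts would come from a non-redundant set of Protein Data Bank structures, and a significance test would be needed before reading anything into a ratio.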

  5. Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

    Directory of Open Access Journals (Sweden)

    Nagib Ahsan

    2012-07-01

    The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.

  6. First case series of emerging Rickettsial neonatal sepsis identified by polymerase chain reaction-based deoxyribonucleic acid sequencing

    Directory of Open Access Journals (Sweden)

    P Aarthi

    2013-01-01

    Purpose: To detect and identify the aetiological agent in the peripheral blood from the cases of neonatal sepsis. Materials and Methods: Four neonates from geographically different regions of South India presented with signs of neonatal sepsis and all the routine clinical and laboratory investigations were performed. Blood culture by Bac T Alert 3D was negative. To establish the aetiology, polymerase chain reaction (PCR) for eubacterial genome and subsequent amplification with Gram positive and Gram negative primers were performed, followed by deoxyribonucleic acid (DNA) sequencing. Results: PCR for the detection of eubacterial genome was positive in all the four neonates and further amplification with designed Gram positive and Gram negative primers revealed the presence of Gram negative bacteria. The amplicons were identified as Orientia tsutsugamushi in three neonates and Coxiella burnetii in the other neonate. Multalin analysis was done to further characterise the strain variation among the three strains. Conclusion: PCR-based DNA sequencing is a rapid and reliable diagnostic tool to identify the aetiological agents of neonatal sepsis. This is the first case series of emerging Rickettsial neonatal sepsis in India.

  7. Introduction of Ca(2+)-binding amino-acid sequence into the T4 lysozyme.

    Science.gov (United States)

    Leontiev, V V; Uversky, V N; Permyakov, E A; Murzin, A G

    1993-03-05

    The 51-62 loop of T4 phage lysozyme was altered by site-directed mutagenesis to obtain maximal homology with the typical EF-hand motif. A Ca(2+)-binding site was designed and created by replacing both Gly-51 and Asn-53 with aspartic acid. The mutant T4 lysozyme (G51D/N53D) was expressed in Escherichia coli. The activity of the G51D/N53D-mutant was about 60% of that of the wild-type protein. This mutant can bind Ca2+ ions specifically, while the effective dissociation constant was essentially greater than that of the EF-hand proteins. Stability of the G51D/N53D-mutant apo-form to urea- or temperature-induced denaturation was the same as that of the wild-type protein. In the presence of Ca2+ ions in solution the stability of the mutant T4 phage lysozyme was less than that of the wild-type protein. It is suggested that the binding of Ca2+ by the mutant is accompanied by the considerable conformational changes in the 'corrected' loop, which can lead to the Ca(2+)-induced destabilization of the protein.

  8. Amino Acid Sequence and Structural Comparison of BACE1 and BACE2 Using Evolutionary Trace Method

    Directory of Open Access Journals (Sweden)

    Hoda Mirsafian

    2014-01-01

    Beta-amyloid precursor protein cleavage enzyme 1 (BACE1) and beta-amyloid precursor protein cleavage enzyme 2 (BACE2), members of the aspartyl protease family, are close homologues and have high similarity in their protein crystal structures. However, their enzymatic properties differ, leading to disparate clinical consequences. In order to identify the residues that are responsible for such differences, we used the evolutionary trace (ET) method to compare the amino acid conservation patterns of BACE1 and BACE2 in several mammalian species. We found that, in BACE1 and BACE2 structures, most of the ligand binding sites are conserved, which indicates their enzymatic property as aspartyl protease family members. The other conserved residues are more or less randomly localized in other parts of the structures. Four group-specific residues were identified at the ligand binding site of BACE1 and BACE2. We postulated that these residues would be essential for selectivity of BACE1 and BACE2 biological functions and could be sites of interest for the design of selective inhibitors targeting either BACE1 or BACE2.

  9. Data for amino acid alignment of Japanese stingray melanocortin receptors with other gnathostome melanocortin receptor sequences, and the ligand selectivity of Japanese stingray melanocortin receptors

    OpenAIRE

    Akiyoshi Takahashi; Perry Davis; Christina Reinick; Kanta Mizusawa; Tatsuya Sakamoto; Dores, Robert M.

    2016-01-01

    This article contains structure and pharmacological characteristics of melanocortin receptors (MCRs) related to research published in “Characterization of melanocortin receptors from stingray Dasyatis akajei, a cartilaginous fish” (Takahashi et al., 2016) [1]. The amino acid sequences of the stingray, D. akajei, MC1R, MC2R, MC3R, MC4R, and MC5R were aligned with the corresponding melanocortin receptor sequences from the elephant shark, Callorhinchus milii, the dogfish, Squalus acanthias, the ...

  10. Genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia

    Directory of Open Access Journals (Sweden)

    Anastasiia Kovaliova

    2017-03-01

    Here we report the draft genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia. The draft genome has a size of 4.9 Mb and encodes multiple K+-transporters and proton-consuming decarboxylases. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain DV clusters together with the acid-tolerant Desulfovibrio sp. TomC and Desulfovibrio magneticus. The draft genome sequence and annotation have been deposited at GenBank under the accession number MLBG00000000.

  11. Complete genome sequence of the actinobacterium Amycolatopsis japonica MG417-CF17T (=DSM 44213T) producing (S,S)-N,N′-ethylenediaminedisuccinic acid

    DEFF Research Database (Denmark)

    Stegmann, Evi; Albersmeier, Andreas; Spohn, Marius

    2014-01-01

    We report the complete genome sequence of Amycolatopsis japonica MG417-CF17T (=DSM 44213T) which was identified as the producer of (S,S)-N,N′-ethylenediaminedisuccinic acid during a screening for phospholipase C inhibitors. The genome of A. japonica MG417-CF17T consists of two replicons: the chro...

  12. Thimet oligopeptidase: similarity to 'soluble angiotensin II-binding protein' and some corrections to the published amino acid sequence of the rat testis enzyme.

    Science.gov (United States)

    McKie, N; Dando, P M; Rawlings, N D; Barrett, A J

    1993-01-01

    The deduced amino acid sequence of pig liver soluble angiotensin II-binding protein [Sugiura, Hagiwara and Hirose (1992) J. Biol. Chem. 267, 18067-18072] is similar over most of its length to that reported for rat testis thimet oligopeptidase (EC 3.4.24.15) by Pierotti, Dong, Glucksman, Orlowski and Roberts [(1990) Biochemistry 29, 10323-10329]. We have found that homogeneous rat testis thimet oligopeptidase binds angiotensin II with the same distinctive characteristics as the pig liver protein. Analysis of the nucleotide sequences reported for the two proteins pointed to the likelihood that sequencing errors had caused two segments of the amino acid sequence of the rat protein to be translated out of frame, and re-sequencing of selected parts of the clone (kindly provided by the previous authors) confirmed this. The revised deduced amino acid sequence of rat thimet oligopeptidase contains 687 residues, representing a protein of 78,308 Da, and is more closely related to those of the pig liver protein and other known homologues of thimet oligopeptidase than that described previously. PMID:8216239

  13. Generation of deviation parameters for amino acid singlets, doublets and triplets from three-dimensional structures of proteins and its implications for secondary structure prediction from amino acid sequences

    Indian Academy of Sciences (India)

    S A Mugilan; K Veluraja

    2000-03-01

    We present a new method, secondary structure prediction by deviation parameter (SSPDP), for predicting the secondary structure of proteins from amino acid sequence. Deviation parameters (DP) for amino acid singlets, doublets and triplets were computed with respect to secondary structural elements of proteins based on the dictionary of secondary structure prediction (DSSP)-generated secondary structure for 408 selected non-homologous proteins. Amino acid triplets that are not found in the selected dataset are assigned a DP value of zero with respect to the secondary structural elements of proteins. The total number of parameters generated is 15,432, out of a possible 25,260. The deviation parameter set is complete with respect to amino acid singlets and doublets, and partially complete with respect to amino acid triplets. These generated parameters were used to predict secondary structural elements from amino acid sequence. The secondary structure predicted by our method (SSPDP) was compared with that of single sequence (NNPREDICT) and multiple sequence (PHD) methods. The average prediction accuracy for α-helix by the SSPDP, NNPREDICT and PHD methods was found to be 57%, 44% and 69% respectively for the proteins in the selected dataset. For β-strand the prediction accuracy is found to be 69%, 21% and 53% respectively by the SSPDP, NNPREDICT and PHD methods. This clearly indicates that the secondary structure prediction by our method is as good as the PHD method but much better than the NNPREDICT method.
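
    The figure of 25,260 possible parameters is consistent with counting every singlet, doublet and triplet of the 20 standard amino acids once for each of three secondary-structure classes. The three-class breakdown is an assumption made here; the abstract does not state it. A quick arithmetic check in Python:

```python
N_AMINO_ACIDS = 20
N_CLASSES = 3  # assumed: helix, strand, coil (not stated in the abstract)

singlets = N_AMINO_ACIDS          # 20
doublets = N_AMINO_ACIDS ** 2     # 400
triplets = N_AMINO_ACIDS ** 3     # 8000

# One parameter per (sequence fragment, structural class) pair.
possible = N_CLASSES * (singlets + doublets + triplets)

generated = 15_432  # number of parameters reported in the abstract
coverage = generated / possible
```

    Under this assumption `possible` evaluates to 25,260, matching the abstract, and roughly 61% of the parameter space is populated by the 408-protein dataset.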

  14. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

    DEFF Research Database (Denmark)

    Adamala, K.; Anella, F.; Wieczorek, R.

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From...... sequences among a vast array of possible ones, the huge "sequence space", leading to the question "why these macromolecules, and not the others?" We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides...

  15. Next-generation re-sequencing of genes involved in increased platelet reactivity in diabetic patients on acetylsalicylic acid.

    Science.gov (United States)

    Postula, Marek; Janicki, Piotr K; Eyileten, Ceren; Rosiak, Marek; Kaplon-Cieslicka, Agnieszka; Sugino, Shigekazu; Wilimski, Radosław; Kosior, Dariusz A; Opolski, Grzegorz; Filipiak, Krzysztof J; Mirowska-Guzel, Dagmara

    2016-06-01

    The objective of this study was to investigate whether rare missense genetic variants in several genes related to platelet functions and acetylsalicylic acid (ASA) response are associated with the platelet reactivity in patients with diabetes type 2 (T2D) on ASA therapy. Fifty eight exons and corresponding introns of eight selected genes, including PTGS1, PTGS2, TXBAS1, PTGIS, ADRA2A, ADRA2B, TXBA2R, and P2RY1 were re-sequenced in 230 DNA samples from T2D patients by using a pooled PCR amplification and next-generation sequencing by Illumina HiSeq2000. The observed non-synonymous variants were confirmed by individual genotyping of 384 DNA samples comprising of the individuals from the original discovery pools and additional verification cohort of 154 ASA-treated T2DM patients. The association between investigated phenotypes (ASA induced changes in platelets reactivity by PFA-100, VerifyNow and serum thromboxane B2 level [sTxB2]), and accumulation of rare missense variants (genetic burden) in investigated genes was tested using statistical collapsing tests. We identified a total of 35 exonic variants, including 3 common missense variants, 15 rare missense variants, and 17 synonymous variants in 8 investigated genes. The rare missense variants exhibited statistically significant difference in the accumulation pattern between a group of patients with increased and normal platelet reactivity based on PFA-100 assay. Our study suggests that genetic burden of the rare functional variants in eight genes may contribute to differences in the platelet reactivity measured with the PFA-100 assay in the T2DM patients treated with ASA.

  16. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach.

    Science.gov (United States)

    Adamala, Katarzyna; Anella, Fabrizio; Wieczorek, Rafal; Stano, Pasquale; Chiarabelli, Cristiano; Luisi, Pier Luigi

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the chemical viewpoint, macromolecules are formed by chemical pathways leading to the condensation of building blocks (amino acids, or nucleotides) in long-chain copolymers (proteins and nucleic acids, respectively). The second difficulty deals with a conceptual problem, namely with the emergence of specific sequences among a vast array of possible ones, the huge "sequence space", leading to the question "why these macromolecules, and not the others?" We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides, like Ser-His, with respect to peptide- and nucleotides-condensation, as a realistic model of primitive organocatalysis. We have also set up a strategy for exploring the sequence space of random proteins and RNAs (the so-called "never born biopolymer" project) with respect to the production of folded structures. Being still far from solved, the main aspects of these "open questions" are discussed here, by commenting on recent results obtained in our groups and by providing a unifying view on the problem and possible solutions. In particular, we propose a general scenario for macromolecule formation via fragment-condensation, as a scheme for the emergence of specific sequences based on molecular growth and selection.

  17. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

    NARCIS (Netherlands)

    Adamala, K.; Anella, F.M.; Wieczorek, R.; Stano, P.; Chiarabelli, C.; Luisi, P.L.

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the c

  18. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule.

    Science.gov (United States)

    Stabler, Richard A; Negus, David; Pain, Arnab; Taylor, Peter W

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  19. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-D-Glutamic Acid Anthrax Capsule

    KAUST Repository

    Stabler, R. A.

    2013-01-24

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  20. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    NARCIS (Netherlands)

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which mi

  1. Insights into the supramolecular features in isopropylmalonic and n-butylmalonic acids: Inputs from PIXEL and Hirshfeld surface analysis

    Science.gov (United States)

    Dey, Dhananjay; Mondal, Ranjan Kumar; Dhibar, Subhendu; Lin, Chia-Her; Schollmeyer, Dieter; Chopra, Deepak; Dey, Biswajit

    2016-10-01

    In this study, we have investigated the supramolecular pattern in two dicarboxylic acids, namely isopropylmalonic acid (1) and n-butylmalonic acid (2), via the presence of different hydrogen bonding patterns and aliphatic chains associated with the respective acids. The crystal structure is formed via the presence of strong O-H⋯O and C-H⋯O H-bonds, in the solid-state. The nature and energetics of the different supramolecular architectures in these two molecules have been further analyzed quantitatively via the PIXEL method. Hirshfeld surface analysis and fingerprint plots help to evaluate the contribution of different types of intermolecular contacts in the crystal packing. MESP calculations have been done to delineate the negative and positive areas of the electrostatic potential in different regions of the molecule in the crystal environment.

  2. Using triple-helix-forming Peptide nucleic acids for sequence-selective recognition of double-stranded RNA.

    Science.gov (United States)

    Hnedzko, Dziyana; Cheruiyot, Samwel K; Rozners, Eriks

    2014-09-08

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple-helix-forming peptide nucleic acids (PNAs) that bind in the major groove of the RNA helix. The strategy developed uses chemically modified nucleobases, such as 2-aminopyridine (M), which enables strong triple-helical binding under physiologically relevant conditions, and 2-pyrimidinone (P) and 3-oxo-2,3-dihydropyridazine (E), which enable recognition of isolated pyrimidines in the purine-rich strand of the RNA duplex. Detailed protocols for preparation of modified PNA monomers, solid-phase synthesis, HPLC purification of PNA oligomers, and measuring dsRNA binding affinity using isothermal titration calorimetry are included.

  3. Comparison of nucleic acid sequence-based amplification and loop-mediated isothermal amplification for diagnosis of human African trypanosomiasis.

    Science.gov (United States)

    Mugasa, Claire M; Katiti, Diana; Boobo, Alex; Lubega, George W; Schallig, Henk D F H; Matovu, Enock

    2014-02-01

    Diagnosis of human African trypanosomiasis (HAT) using molecular tests should ideally achieve high sensitivity without compromising specificity. This study compared 2 simplified tests, nucleic acid sequence-based amplification (NASBA) combined with oligochromatography (OC) and loop-mediated isothermal amplification (LAMP), executed on 181 blood samples from 65 Trypanosoma brucei gambiense HAT patients, 86 controls, and 30 serological suspects from Uganda. Based on the composite reference standard, the diagnostic sensitivity and specificity of NASBA were 93.9% (95% confidence interval [CI] = 84.9-98.3%) and 100% (95% CI = 94.9-100%), respectively. The same parameters for LAMP were 76.9% (95% CI = 64.8-86.5%) and 100% (95% CI = 91.6-100%), respectively. The level of agreement between LAMP and microscopy was good with a kappa (κ) value of 79.2% (95% CI = 69.4-88.9%), while that of NASBA-OC/microscopy was very good (κ value 94.6%; 95% CI = 89.3-99.8%). The sensitivity of NASBA-OC was significantly higher than that of LAMP (Z = 2.723; P = 0.007). These tests have potential application to HAT surveillance.
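
    The sensitivity estimates, confidence intervals and Z test quoted above follow standard formulas for binomial proportions. The Python sketch below illustrates them under stated assumptions: the true-positive and false-negative counts are hypothetical, and the Wilson score interval is used as a stand-in because the abstract does not say which interval method the study applied. Only the Z statistic (2.723) is taken from the abstract.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple:
    """Wilson score interval for a binomial proportion (default ~95%)."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def two_sided_p(z: float) -> float:
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical counts, for illustration only.
true_pos, false_neg = 50, 15
sensitivity = true_pos / (true_pos + false_neg)
low, high = wilson_interval(true_pos, true_pos + false_neg)

# Z statistic reported in the abstract for NASBA-OC versus LAMP.
p_value = two_sided_p(2.723)
```

    With Z = 2.723 the two-sided p-value comes out at roughly 0.0065, in line with the reported P = 0.007.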

  4. The Acid Sphingomyelinase Sequence Variant p.A487V Is Not Associated With Decreased Levels of Enzymatic Activity.

    Science.gov (United States)

    Rhein, Cosima; Naumann, Julia; Mühle, Christiane; Zill, Peter; Adli, Mazda; Hegerl, Ulrich; Hiemke, Christoph; Mergl, Roland; Möller, Hans-Jürgen; Reichel, Martin; Kornhuber, Johannes

    2013-01-01

    Rare loss-of-function mutations in the sphingomyelin phosphodiesterase 1 (SMPD1) gene are known to dramatically decrease the catalytic activity of acid sphingomyelinase (ASM), resulting in an autosomal recessive lysosomal storage disorder known as Niemann-Pick disease (NPD) type A and B. In contrast to the general low frequency of those deleterious mutations, we found a relatively high frequency for the proposed type B NPD variant c.1460C>T (p.A487V) in our sample of 58 patients suffering from Major Depressive Disorder. We therefore investigated the biochemical consequences of this variant more closely. Our in vivo data derived from blood cell analyses indicated cellular ASM activity levels in the normal range. The secreted ASM activity levels in blood plasma were slightly lower, but still above those levels reported for type B NPD patients. In vitro expression studies of this ASM variant in different cell lines confirmed these results, showing cellular and secreted enzymatic activities equivalent to those of wild-type ASM and similar expression levels. Thus, we conclude that the ASM variant c.1460C>T (p.A487V) is not a rare missense mutation but an SMPD1 sequence variant that yields a protein with functional catalytic characteristics.

  5. Nucleotide sequence of a cDNA clone encoding a major allergenic protein in rice seeds. Homology of the deduced amino acid sequence with members of alpha-amylase/trypsin inhibitor family.

    Science.gov (United States)

    Izumi, H; Adachi, T; Fujii, N; Matsuda, T; Nakamura, R; Tanaka, K; Urisu, A; Kurosawa, Y

    1992-05-18

    A cDNA clone of rice major allergenic protein (RAP) was isolated from a cDNA library of maturing rice seeds. The cDNA had an open reading frame (486 nucleotides) which coded a 162 amino acid residue polypeptide comprising a 27-residue signal peptide and a 135-residue mature protein of M(r) 14,764. The deduced amino acid sequence of RAP showed a considerable similarity to barley trypsin inhibitor [1983, J. Biol. Chem. 258, 7998-8003] and wheat alpha-amylase inhibitor [1981, Phytochemistry 20, 1781-1784].

  6. Proteaselike sequence in hepatitis B virus core antigen is not required for e antigen generation and may not be part of an aspartic acid-type protease.

    Science.gov (United States)

    Nassal, M; Galle, P R; Schaller, H

    1989-01-01

    The hepatitis B virus (HBV) C gene directs the synthesis of two major gene products: HBV core antigen (HBcAg[p21c]), which forms the nucleocapsid, and HBV e antigen (HBeAg [p17e]), a secreted antigen that is produced by several processing events during its maturation. These proteins contain an amino acid sequence similar to the active-site residues of aspartic acid and retroviral proteases. On the basis of this sequence similarity, which is highly conserved among mammalian hepadnaviruses, a model has been put forward according to which processing to HBeAg is due to self-cleavage of p21c involving the proteaselike sequence. Using site-directed mutagenesis in conjunction with transient expression of HBV proteins in the human hepatoma cell line HepG2, we tested this hypothesis. Our results with HBV mutants in which one or two of the conserved amino acids have been replaced by others suggest strongly that processing to HBeAg does not depend on the presence of an intact proteaselike sequence in the core protein. Attempts to detect an influence of this sequence on the processing of HBV P gene products into enzymatically active viral polymerase also gave no conclusive evidence for the existence of an HBV protease. Mutations replacing the putatively essential aspartic acid showed little effect on polymerase activity. Additional substitution of the likewise conserved threonine residue by alanine, in contrast, almost abolished the activity of the polymerase. We conclude that an HBV protease, if it exists, is functionally different from aspartic acid and retroviral proteases. PMID:2657101

  7. Identities among actin-encoding cDNAs of the Nile tilapia (Oreochromis niloticus) and other eukaryote species revealed by nucleotide and amino acid sequence analyses

    Directory of Open Access Journals (Sweden)

    Andréia B. Poletto

    2008-01-01

    Actin-encoding cDNAs of Nile tilapia (Oreochromis niloticus) were isolated by RT-PCR using total RNA samples of different tissues and further characterized by nucleotide sequencing and in silico amino acid (aa) sequence analysis. Comparisons among the actin gene sequences of O. niloticus and those of other species showed that the isolated genes have a high similarity to other fish and other vertebrate actin genes. The highest nucleotide resemblance was observed between O. niloticus and O. mossambicus α-actin and β-actin genes. Analysis of the predicted aa sequences revealed two distinct types of cytoplasmic actins, one cardiac muscle actin type and one skeletal muscle actin type that were expressed in different tissues of Nile tilapia. The evolutionary relationships between the actin genes of Nile tilapia and those of diverse other organisms are discussed.

  8. Feature Fusion Based SVM Classifier for Protein Subcellular Localization Prediction.

    Science.gov (United States)

    Rahman, Julia; Mondal, Md Nazrul Islam; Islam, Md Khaled Ben; Hasan, Md Al Mehedi

    2016-12-18

    Because protein subcellular localization is important in many branches of life science and in drug discovery, researchers have focused their attention on protein subcellular localization prediction. Effective representation of features from protein sequences plays a vital role in protein subcellular localization prediction, especially with machine learning techniques. Single feature representations such as pseudo amino acid composition (PseAAC), physiochemical property models (PPM), and amino acid index distribution (AAID) contain insufficient information from protein sequences. To deal with this problem, we have proposed two feature fusion representations, AAIDPAAC and PPMPAAC, to work with Support Vector Machine classifiers, which fuse PseAAC with AAID and PPM, respectively. We have evaluated the performance of both single and fused feature representations on a Gram-negative bacterial dataset. We obtained at least 3% higher actual accuracy with AAIDPAAC and 2% higher locative accuracy with PPMPAAC than with single feature representations.
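
    Feature fusion of this kind typically amounts to concatenating two per-sequence feature vectors before they are passed to the classifier. The Python sketch below shows only that idea: it fuses a plain amino acid composition with a two-number summary of one physicochemical property (the Kyte-Doolittle hydropathy index, chosen here as an example). It is not the PseAAC, PPM or AAID representation from the paper, and the function names and example sequence are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index, used purely as an example property.
HYDROPATHY = {
    "A": 1.8, "C": 2.5, "D": -3.5, "E": -3.5, "F": 2.8,
    "G": -0.4, "H": -3.2, "I": 4.5, "K": -3.9, "L": 3.8,
    "M": 1.9, "N": -3.5, "P": -1.6, "Q": -3.5, "R": -4.5,
    "S": -0.8, "T": -0.7, "V": 4.2, "W": -0.9, "Y": -1.3,
}


def composition(seq: str) -> list:
    """20-dimensional amino acid composition (fractions summing to 1)."""
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def property_summary(seq: str) -> list:
    """Mean and variance of the example property along the sequence."""
    values = [HYDROPATHY[aa] for aa in seq]
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [mean, var]


def fuse(seq: str) -> list:
    """Feature fusion by concatenating the two representations."""
    return composition(seq) + property_summary(seq)


# Made-up sequence of standard residues, for illustration only.
fused = fuse("MKKLLPTAAAGLLLLAAQPAMA")
```

    The resulting 22-dimensional vector (20 composition values plus 2 property values) is the kind of input that would then be scaled and given to a Support Vector Machine.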

  9. Clavulanic acid production estimation based on color and structural features of Streptomyces clavuligerus bacteria using self-organizing map and genetic algorithm.

    Science.gov (United States)

    Nurmohamadi, Maryam; Pourghassem, Hossein

    2014-05-01

    The utilization of antibiotics produced with clavulanic acid (CA) is an increasing need in medicine and industry. Usually, CA is produced by fermentation of Streptomyces clavuligerus (SC) bacteria. Analysis of visual and morphological features of SC bacteria is an appropriate measure to estimate the production of CA. In this paper, an automatic and fast algorithm is proposed for estimating the CA production level from visual and structural features of SC bacteria, instead of statistical methods and experimental evaluation by a microbiologist. In this algorithm, structural features such as the number of newborn branches, hyphal thickness and bacterial density, and also color features such as acceptance color levels, are extracted from the SC bacteria. Moreover, the pH and biomass of the medium provided by microbiologists are considered as specified features. The level of CA production is estimated by using a new application of the Self-Organizing Map (SOM), and a hybrid model of a genetic algorithm with a back propagation network (GA-BPN). The proposed algorithm is evaluated on four carbon sources, including malt, starch, wheat flour and glycerol, that were used as different media for bacterial growth. Then, the obtained results are compared with the observations of a specialist. Finally, the Relative Error (RE) values for the SOM and GA-BPN were 14.97% and 16.63%, respectively.

  10. Genome sequence of Candidatus Nitrososphaera evergladensis from group I.1b enriched from Everglades soil reveals novel genomic features of the ammonia-oxidizing archaea.

    Directory of Open Access Journals (Sweden)

    Kateryna V Zhalnina

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases; seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of the soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and a versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group.

  11. Use of sourdough made with quinoa (Chenopodium quinoa) flour and autochthonous selected lactic acid bacteria for enhancing the nutritional, textural and sensory features of white bread.

    Science.gov (United States)

    Rizzello, Carlo Giuseppe; Lorusso, Anna; Montemurro, Marco; Gobbetti, Marco

    2016-06-01

    Lactic acid bacteria were isolated and identified from quinoa flour, spontaneously fermented quinoa dough, and type I quinoa sourdough. Strains were further selected based on acidification and proteolytic activities. Selected Lactobacillus plantarum T6B10 and Lactobacillus rossiae T0A16 were used as mixed starter to get quinoa sourdough. Compared to non-fermented flour, organic acids, free amino acids, soluble fibers, total phenols, phytase and antioxidant activities, and in vitro protein digestibility markedly increased during fermentation. A wheat bread was made using 20% (w/w) of quinoa sourdough, and compared to baker's yeast wheat breads manufactured with or without quinoa flour. The use of quinoa sourdough improved the chemical, textural, and sensory features of wheat bread, showing better performances compared to the use of quinoa flour. Protein digestibility and quality, and the rate of starch hydrolysis were also nutritional features that markedly improved using quinoa sourdough as an ingredient. This study exploited the potential of quinoa flour through sourdough fermentation. A number of advantages encouraged the manufacture of novel and healthy leavened baked goods.

  12. Identification of multiple lipid genes with modifications in expression and sequence associated with the evolution of hydroxy fatty acid accumulation in Physaria fendleri.

    Science.gov (United States)

    Horn, Patrick J; Liu, Jinjie; Cocuron, Jean-Christophe; McGlew, Kathleen; Thrower, Nicholas A; Larson, Matt; Lu, Chaofu; Alonso, Ana P; Ohlrogge, John

    2016-05-01

    Two Brassicaceae species, Physaria fendleri and Camelina sativa, are genetically very closely related to each other and to Arabidopsis thaliana. Physaria fendleri seeds contain over 50% hydroxy fatty acids (HFAs), while Camelina sativa and Arabidopsis do not accumulate HFAs. To better understand how plants evolved new biochemical pathways with the capacity to accumulate high levels of unusual fatty acids, transcript expression and protein sequences of developing seeds of Physaria fendleri, wild-type Camelina sativa, and Camelina sativa expressing a castor bean (Ricinus communis) hydroxylase were analyzed. A number of potential evolutionary adaptations within lipid metabolism that probably enhance HFA production and accumulation in Physaria fendleri, and, in their absence, limit accumulation in transgenic tissues were revealed. These adaptations occurred in at least 20 genes within several lipid pathways from the onset of fatty acid synthesis and its regulation to the assembly of triacylglycerols. Lipid genes of Physaria fendleri appear to have co-evolved through modulation of transcriptional abundances and alterations within protein sequences. Only a handful of genes showed evidence for sequence adaptation through gene duplication. Collectively, these evolutionary changes probably occurred to minimize deleterious effects of high HFA amounts and/or to enhance accumulation for physiological advantage. These results shed light on the evolution of pathways for novel fatty acid production in seeds, help explain some of the current limitations to accumulation of HFAs in transgenic plants, and may provide improved strategies for future engineering of their production.

  13. Mapping the molecular characteristics of Brazilian human T-cell lymphotropic virus type 1 Env (gp46) and Pol amino acid sequences for vaccine design

    Directory of Open Access Journals (Sweden)

    Aline Cristina Mota-Miranda

    2007-09-01

    This study was carried out to evaluate the molecular pattern of all available Brazilian human T-cell lymphotropic virus type 1 Env (n = 15) and Pol (n = 43) nucleotide sequences via epitope prediction, physico-chemical analysis, and identification of potential protein sites, giving support to the Brazilian AIDS vaccine program. In 12 previously described peptides of the Env sequences we found 12 epitopes, while in 4 peptides of the Pol sequences we found 4 epitopes. The total variation in amino acid composition was 9 and 17% for human leukocyte antigen (HLA) class I and class II Env epitopes, respectively. Analysis of the Pol sequences revealed a total amino acid variation of 0.75% for HLA-I and HLA-II epitopes. In 5 of the 12 Env epitopes the physico-chemical analysis demonstrated that the mutations magnified the antigenicity profile. The potential protein domain analysis of Env sequences showed the loss of a CK-2 phosphorylation site caused by the D197N mutation in one epitope, and of an N-glycosylation site caused by the S246Y and V247I mutations in another epitope. In addition, the analysis of selection pressure found 8 positively selected sites (w = 9.59) using codon-based substitution models and maximum-likelihood methods. These studies underscore the importance of this Env region for virus fitness, for the host immune response and, therefore, for the development of vaccine candidates.

  14. Isolation of a human anti-haemophilic factor IX cDNA clone using a unique 52-base synthetic oligonucleotide probe deduced from the amino acid sequence of bovine factor IX.

    Science.gov (United States)

    Jaye, M; de la Salle, H; Schamber, F; Balland, A; Kohli, V; Findeli, A; Tolstoshev, P; Lecocq, J P

    1983-04-25

    A unique 52mer oligonucleotide deduced from the amino acid sequence of bovine Factor IX was synthesized and used as a probe to screen a human liver cDNA bank. The Factor IX clone isolated shows 5 differences in nucleotide and deduced amino acid sequence as compared to a previously isolated clone. In addition, precisely one codon has been deleted.

  15. Ruthenium Hydride/Brønsted Acid-Catalyzed Tandem Isomerization/N-Acyliminium Cyclization Sequence for the Synthesis of Tetrahydro-β-carbolines

    DEFF Research Database (Denmark)

    Hansen, Casper Lykke; Clausen, Janie Regitse Waël; Ohm, Ragnhild Gaard;

    2013-01-01

    This paper describes an efficient tandem sequence for the synthesis of 1,2,3,4-tetrahydro-β-carbolines (THBCs) relying on a ruthenium hydride/Brønsted acid-catalyzed isomerization of allylic amides to N-acyliminium ion intermediates which are trapped by a tethered indole nucleophile. The methodol...... the Suzuki cross-coupling reaction to the isomerization/N-acyliminium cyclization sequence. Finally, diastereo- and enantioselective versions of the title reaction have been examined using substrate control (with dr >15:1) and asymmetric catalysis (ee up to 57%), respectively...

  16. Structural features of dilute acid, steam exploded, and alkali pretreated mustard stalk and their impact on enzymatic hydrolysis.

    Science.gov (United States)

    Kapoor, Manali; Raj, Tirath; Vijayaraj, M; Chopra, Anju; Gupta, Ravi P; Tuli, Deepak K; Kumar, Ravindra

    2015-06-25

    To overcome the recalcitrant nature of biomass, several pretreatment methodologies have been explored to make it amenable to enzymatic hydrolysis. These methodologies alter cell wall structure primarily by removing or altering hemicelluloses and lignin. In this work, alkali, dilute acid, and steam explosion pretreatments are systematically studied for mustard stalk. To assess the structural variability after pretreatment, chemical analysis, surface area, crystallinity index, accessibility of cellulose, FT-IR and thermal analysis are conducted. Although the extent of enzymatic hydrolysis varies with the methodology used, cellulose conversion increases after pretreatment in every case. Glucose yields at 2 and 72 h are well correlated with surface area and maximum adsorption capacity. However, no such relationship is observed for xylose yield. The mass balance of the process is also studied. Dilute acid pretreatment is the best methodology in terms of maximum sugar yield at lower enzyme loading.

  17. Development of microwave-assisted acid hydrolysis of proteins using a commercial microwave reactor and its combination with LC-MS for protein full-sequence analysis.

    Science.gov (United States)

    Chen, Lu; Wang, Nan; Li, Liang

    2014-11-01

    Microwave-assisted acid hydrolysis (MAAH) can be used to degrade a protein non-specifically into many peptides with overlapping sequences, which can be identified by mass spectrometry (MS) to produce a sequence map that covers the full sequence of a protein. The success of this method for protein sequence analysis depends on the proper control of the MAAH process, which is currently done using a household microwave oven. However, to meet the regulatory or good laboratory practice (GLP) requirements of a clinical or pharmaceutical laboratory, a commercial microwave device is often required. In this paper, we report a method of performing MAAH using a CEM Discover single-mode microwave reactor. It is shown that, using an optimized protocol for MAAH, reproducible results comparable to those obtained using a household microwave oven can be generated using the commercial reactor. To illustrate the potential applications of MAAH MS for characterizing clinically relevant proteins, this method was applied, for the first time, to map the amino acid sequences of normal and sickle-cell human hemoglobin as well as bovine hemoglobin. Full sequence coverage was readily achieved from 294 and 266 unique peptides matched to the alpha and beta subunits of normal hemoglobin, respectively, 334 and 265 unique peptides matched to the alpha and beta subunits of sickle-cell hemoglobin, and 377 and 224 unique peptides matched to the alpha and beta subunits of bovine hemoglobin. This method opens the possibility for any laboratory to use commercial laboratory equipment to perform MAAH MS for protein full-sequence analysis.
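
    The idea of building a sequence map from overlapping peptides can be sketched as follows. This is a minimal illustration, not the authors' software: the toy protein fragment, the peptide list and the function name `sequence_coverage` are all hypothetical, and real MAAH MS data would first require peptide identification from the spectra.

```python
def sequence_coverage(protein, peptides):
    """Return the fraction of residues covered by at least one matched peptide."""
    covered = [False] * len(protein)
    for pep in peptides:
        # Mark every occurrence of the peptide in the protein sequence.
        start = protein.find(pep)
        while start != -1:
            for i in range(start, start + len(pep)):
                covered[i] = True
            start = protein.find(pep, start + 1)
    return sum(covered) / len(protein)


# Hypothetical toy sequence and overlapping peptides (not data from the paper).
protein = "MVLSPADKTNVKAAWGKV"
peptides = ["MVLSPAD", "PADKTNVK", "NVKAAWG", "AWGKV"]

coverage = sequence_coverage(protein, peptides)
partial = sequence_coverage(protein, peptides[:2])
print(f"all peptides: {coverage:.0%}, first two only: {partial:.0%}")
```

    Because non-specific hydrolysis yields many overlapping peptides, each residue tends to be covered several times; the sketch only records whether a residue is covered at least once.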

  18. Sequence and sedimentary features of the Changxing Fm organic reefs and their control on reservoir development in the Yuanba Gas Field, Sichuan Basin

    Directory of Open Access Journals (Sweden)

    Hongtao Li

    2015-12-01

    In the Yuanba area, Sichuan Basin, the gas reservoirs in the Upper Permian Changxing Fm are now at the development stage. As development progresses, it is urgent to characterize the reservoir architectures accurately and summarize the controlling factors for reservoir development. In this paper, the Changxing Fm organic reefs were studied in terms of their sequence stratigraphy, sedimentary facies, and reservoir characteristics and architectures, based on core observation and thin section analysis combined with physical property data and logging curve analysis results. It is shown that the Changxing Fm can be divided into two third-order sequences and six fourth-order sequences; their electric logs are characterized by abrupt changes above and below the high-frequency sequence boundaries and are consistent with the sedimentary cycles controlled by high-frequency sequences. In addition, the Changxing Fm organic reefs mainly show a zonal distribution outside the SQ2 platform margin; vertically they are composed of two obvious reef sedimentary cycles, and laterally they developed in asymmetric patterns (early in the east and late in the west). Finally, organic reef (bank) reservoirs are in general mainly composed of low-porosity and moderate–low-permeability dissolved dolomite reservoirs, and they are mostly distributed at reef caps in the upper–middle parts of the two fourth-order sequences, with the characteristics of multiple beds, thin single beds, different types of reservoirs of different thickness interbedded with each other, strong heterogeneity and double-layer reservoir architectures. It is concluded that the distribution of organic reef microfacies in this area is controlled by high-frequency sequences, which are the key controlling factor for reservoir development and spatial distribution.

  19. Crystal structure of a 2:1 piroxicam–gentisic acid co-crystal featuring neutral and zwitterionic piroxicam molecules

    Directory of Open Access Journals (Sweden)

    Elizabeth M. Horstman

    2016-12-01

    A new 2:1 co-crystal of piroxicam and gentisic acid [systematic name: 4-hydroxy-1,1-dioxo-N-(pyridin-2-yl)-2H-1λ6,2-benzothiazine-3-carboxamide–2-(4-oxido-1,1-dioxo-2H-1λ6,2-benzothiazine-3-amido)pyridin-1-ium–2,5-dihydroxybenzoic acid, 2C15H13N3O4S·C7H6O4] has been synthesized using a microfluidic platform and initially identified using Raman spectroscopy. In the co-crystal, one piroxicam molecule is in its neutral form and an intramolecular O—H...O hydrogen bond is observed. The other piroxicam molecule is zwitterionic (proton transfer from the OH group to the pyridine N atom) and two intramolecular N—H...O hydrogen bonds occur. The gentisic acid molecule shows whole-molecule disorder over two sets of sites in a 0.809 (2):0.191 (2) ratio. In the crystal, extensive hydrogen bonding between the components forms layers propagating in the ab plane.

  20. Compositional and functional features of humic acid-like fractions from vermicomposting of sewage sludge and cow dung

    Energy Technology Data Exchange (ETDEWEB)

    Li Xiaowei [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China); Xing Meiyan, E-mail: xmy5000@163.com [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China); Yang Jian; Huang Zhidong [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China)

    2011-01-30

    The chemical changes occurring in five different substrates of sewage sludge spiked with different proportions of cow dung after vermicomposting with Eisenia foetida for 90 days were investigated. Their humic acid-like (HAL) fractions were isolated to determine the elemental and functional composition, and structural and functional characteristics using ultraviolet/visible, Fourier transform infrared (FT-IR) and fluorescence spectroscopies and scanning electron microscopy. After vermicomposting, the total organic C and C/N ratio decreased, and the total extractable C and humic acid (HA) C increased in all substrates. In the HAL fractions, the C and H contents, C/N and C/O and aliphatic structures, proteinaceous components and carbohydrates decreased, while the O and N and acidic functional group contents and C/H ratio, aromaticity and polycondensation structures increased. Further, the results suggest that the addition of cow dung to sewage sludge could improve the quality of organic matter humification of the substrates. The structures of HAL fractions in vermicomposts resembled those typical of soil HA, especially the vermicompost of cow dung alone. Scanning electron microscopy showed the microstructure of HAL fraction in final product became close-grained and lumpy. Overall results indicate that vermicomposting was an efficient technology for promoting organic matter (OM) humification in sewage sludge and cow dung alone, as well as in mixtures of both materials, improving their quality and environmental safety as a soil OM resource for utilization as soil amendments.

  1. DNA Sequencing

    Science.gov (United States)

    Tabor, Stanley; Richardson, Charles C.

    1995-04-25

    A method for sequencing a strand of DNA, including the steps of: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions favoring primer extension to form nucleic acid fragments complementary to the DNA to be sequenced; labelling the nucleic acid fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.

  2. The amino acid sequences of eleven tryptic peptides of papaya mosaic virus protein by electron ionization mass spectrometry.

    Science.gov (United States)

    Parente, A; Short, M N; Self, R; Parsley, K R

    1982-04-01

    Eleven of the fourteen tryptic peptides of papaya mosaic virus protein have been sequenced by electron ionization mass spectrometry using chemical and enzymic hydrolyses and mixture analysis as required. Mid-chain cleavages of N-C bonds produced secondary ion series which allowed up to 16 residues to be sequenced without further hydrolysis. Mixture analysis on hydrolysis products enabled a 24 residue tryptic peptide to be sequenced from the data recorded in a single mass spectrum.

  3. Some features of the effect the pH value and the physicochemical properties of boric acid have on mass transfer in a VVER reactor's core

    Science.gov (United States)

    Gavrilov, A. V.; Kritskii, V. G.; Rodionov, Yu. A.; Berezina, I. G.

    2013-07-01

    Certain features of the effect of boric acid in the reactor coolant of nuclear power installations equipped with a VVER-440 reactor on mass transfer in the reactor core are considered. It is determined that formation of boric acid polyborate complexes begins under field conditions at a temperature of 300°C when the boric acid concentration is equal to around 0.065 mol/L (4 g/L). Operations for decontaminating the reactor coolant system entail a growth of corrosion product concentration in the coolant, which gives rise to formation of iron borates in the zones where subcooled boiling of coolant takes place and to the effect of axial offset anomalies. A model for simulating variation of pressure drop in a VVER-440 reactor's core that has invariable parameters during the entire fuel campaign is developed by additionally taking into account the concentrations of boric acid polyborate complexes and the quantity of corrosion products (Fe, Ni) represented by the ratio of their solubilities.

  4. Terminal sequence importance of de novo proteins from binary-patterned library: stable artificial proteins with 11- or 12-amino acid alphabet.

    Science.gov (United States)

    Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu

    2012-06-01

    Successful approaches to de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared the stabilities of the mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with the polar/nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing parts of these sequences, we successfully obtained stable, smaller de novo protein mutants with smaller amino acid alphabets than the originals. However, these mutants showed differences in ANS binding properties and in stability against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers and not as secondary structure units, substantially affected these physicochemical details. This study has implications for adjusting protein stabilities by designing N- and C-terminal sequences.

  5. Regression and Sparse Regression Methods for Viscosity Estimation of Acid Milk From Its SLS Features

    DEFF Research Database (Denmark)

    Sharifzadeh, Sara; Skytte, Jacob Lercke; Nielsen, Otto Højager Attermann;

    2012-01-01

    Statistical solutions find widespread use in food and medicine quality control. We investigate the effect of different regression and sparse regression methods for a viscosity estimation problem using the spectro-temporal features from a new Sub-Surface Laser Scattering (SLS) vision system. From...... this investigation, we propose the optimal solution for regression estimation in case of noisy and inconsistent optical measurements, which is the case in many practical measurement systems. The principal component regression (PCR), partial least squares (PLS) and least angle regression (LAR) methods are compared...

  6. Data for amino acid alignment of Japanese stingray melanocortin receptors with other gnathostome melanocortin receptor sequences, and the ligand selectivity of Japanese stingray melanocortin receptors.

    Science.gov (United States)

    Takahashi, Akiyoshi; Davis, Perry; Reinick, Christina; Mizusawa, Kanta; Sakamoto, Tatsuya; Dores, Robert M

    2016-06-01

    This article contains structure and pharmacological characteristics of melanocortin receptors (MCRs) related to research published in "Characterization of melanocortin receptors from stingray Dasyatis akajei, a cartilaginous fish" (Takahashi et al., 2016) [1]. The amino acid sequences of the stingray, D. akajei, MC1R, MC2R, MC3R, MC4R, and MC5R were aligned with the corresponding melanocortin receptor sequences from the elephant shark, Callorhinchus milii, the dogfish, Squalus acanthias, the goldfish, Carassius auratus, and the mouse, Mus musculus. These alignments provide the basis for phylogenetic analysis of these gnathostome melanocortin receptor sequences. In addition, the Japanese stingray melanocortin receptors were separately expressed in Chinese Hamster Ovary cells, and stimulated with stingray ACTH, α-MSH, β-MSH, γ-MSH, δ-MSH, and β-endorphin. The dose response curves reveal the order of ligand selectivity for each stingray MCR.

  7. Data for amino acid alignment of Japanese stingray melanocortin receptors with other gnathostome melanocortin receptor sequences, and the ligand selectivity of Japanese stingray melanocortin receptors

    Directory of Open Access Journals (Sweden)

    Akiyoshi Takahashi

    2016-06-01

    This article contains structure and pharmacological characteristics of melanocortin receptors (MCRs) related to research published in “Characterization of melanocortin receptors from stingray Dasyatis akajei, a cartilaginous fish” (Takahashi et al., 2016) [1]. The amino acid sequences of the stingray, D. akajei, MC1R, MC2R, MC3R, MC4R, and MC5R were aligned with the corresponding melanocortin receptor sequences from the elephant shark, Callorhinchus milii, the dogfish, Squalus acanthias, the goldfish, Carassius auratus, and the mouse, Mus musculus. These alignments provide the basis for phylogenetic analysis of these gnathostome melanocortin receptor sequences. In addition, the Japanese stingray melanocortin receptors were separately expressed in Chinese Hamster Ovary cells, and stimulated with stingray ACTH, α-MSH, β-MSH, γ-MSH, δ-MSH, and β-endorphin. The dose response curves reveal the order of ligand selectivity for each stingray MCR.

  8. Geochemical features and sources of hydrocarbons and fatty acids in soils from the McMurdo Dry Valleys in the Antarctic

    Science.gov (United States)

    Matsumoto, Genki I.; Honda, Eisuke; Sonoda, Kazuhiko; Yamamoto, Shuichi; Takemura, Tetsuo

    2010-08-01

    We studied the geochemical features and compound-specific (CS)-δ13C of hydrocarbons and fatty acids in soil samples from the McMurdo Dry Valleys in the Antarctic to elucidate their source organisms and the characteristics of their environments. Total organic carbon contents in the soil samples were extremely low, reflecting extremely harsh environments for organisms. Normal-alkanes ranging in carbon chain length from n-C14 to n-C38 with a predominance of odd-carbon numbers were found, together with n-alkenes (n-C23:1 to n-C27:1). Normal-alkanoic acids ranging in carbon chain length from n-C10 to n-C30 with a predominance of even-carbon numbers were detected in the samples, along with small amounts of branched (iso and anteiso) and n-alkenoic acids. CS-δ13C values of long-chain n-alkanes (n-C20 to n-C29) ranged from -30.4 to -26.6‰. CS-δ13C values of short-chain n-alkanoic acids (n-C14 to n-C19), ranging from -27.7 to -21.7‰, were much higher than those of long-chain n-alkanoic acids (n-C20 to n-C30, -32.5 to -25.3‰). The geochemical features and CS-δ13C values of long-chain n-alkanes and n-alkanoic acids revealed that they originated from lichen and/or vascular plant debris from the pre- and inter-glacial periods in this region, whereas short-chain n-alkanoic acids come from microalgal and cyanobacterial debris. CS-δ13C values suggest that they are derived from gymnosperms and/or C4 plants in the cold and dry environments of the pre- and inter-glacial periods of the McMurdo Dry Valleys region.

  9. Cloning, DNA sequencing and heterologous expression of the gene for thermostable N-acylamino acid racemase from Amycolatopsis sp. TS-1-60 in Escherichia coli.

    Science.gov (United States)

    Tokuyama, S; Hatano, K

    1995-03-01

    The gene encoding the novel enzyme N-acylamino acid racemase (AAR) was cloned in recombinant phage lambda-4 from the DNA library of Amycolatopsis sp. TS-1-60, a rare actinomycete, using antiserum against the enzyme. The cloned gene was subcloned and transformed into Escherichia coli JM105 using pUC118 as a vector. The AAR gene consists of an open reading frame of 1104 nucleotides, which specifies a 368-amino-acid protein with a molecular mass of 39411 Da. The molecular mass deduced from the AAR gene is in good agreement with the subunit molecular mass (40 kDa) of AAR from Amycolatopsis sp. TS-1-60. The guanosine plus cytosine content of the AAR gene was about 70%. Although the AAR gene uses the unusual initiation codon GTG, the gene was expressed in Escherichia coli using the lac promoter of pUC118. The amount of the enzyme produced by the transformant was 16 times that produced by Amycolatopsis sp. TS-1-60. When the unusual initiation codon GTG was changed to ATG, the enzyme productivity of the transformant increased to more than 37 times that of Amycolatopsis sp. TS-1-60. In a comparison of the DNA sequence and the deduced amino acid sequence of AAR with those of known racemases and epimerases in databases, no significant sequence homology was found. However, AAR resembles mandelate racemase in that it requires metal ions for enzyme activity.(ABSTRACT TRUNCATED AT 250 WORDS)
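
    The figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers stated above (1104 nucleotides, 368 residues, 39411 Da); the helper names and the 12-nucleotide toy ORF are hypothetical and are not the AAR sequence, which the abstract does not give.

```python
ORF_NT = 1104        # open reading frame length reported in the abstract
PROTEIN_AA = 368     # protein length reported in the abstract
MASS_DA = 39411      # deduced molecular mass reported in the abstract

# 1104 nucleotides divide evenly into 368 codons. Whether the reported
# ORF length counts the stop codon is not stated in the abstract.
codons = ORF_NT // 3
mean_residue_mass = MASS_DA / PROTEIN_AA  # roughly 107 Da per residue


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def swap_start_codon(orf, new_start="ATG"):
    """Replace the first codon, mirroring the GTG -> ATG change described."""
    return new_start + orf[3:]


toy_orf = "GTGACCGGCTGA"  # hypothetical toy ORF with a GTG start codon
print(codons, round(mean_residue_mass, 1))
print(gc_content(toy_orf), swap_start_codon(toy_orf))
```

    A mean residue mass near 107 Da is in the usual range for proteins, which is consistent with the reported agreement between the deduced mass and the 40 kDa subunit.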

  10. Separation of small molecular peptides with the same amino acid composition but different sequences by high performance liquid chromatography-electrospray ionization-mass spectrometry

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Peptidomics has emerged as a new discipline in recent years. Mass spectrometry (MS) is the most universal and efficient tool for structure identification of proteins and peptides. However, there is a limitation in the identification of peptides with the same amino acid composition but different sequences, because these peptides have identical mass spectra of molecular ions. This paper presents a high performance liquid chromatography-electrospray ionization-mass spectrometry (HPLC-ESI-MS) method for the separation of small molecular peptides with the same amino acid composition but different sequences. Two tripeptides, Gly-Ser-Phe and Gly-Phe-Ser, were used as a model sample. The separation behavior has been investigated and the separation conditions have been optimized. Under the optimum conditions, good repeatability was achieved. The developed method could provide a helpful reference for the separation of other peptides with the same amino acid composition but different sequences in the study of proteomics and peptidomics.
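
    Why the molecular ions alone cannot distinguish the two model tripeptides can be shown with a short calculation. This is an illustrative sketch using standard monoisotopic residue masses; the function name `peptide_mass` is an assumption made for the example and does not come from the paper.

```python
import math

# Standard monoisotopic residue masses (Da) for the three residues involved.
RESIDUE_MASS = {"G": 57.02146, "S": 87.03203, "F": 147.06841}
WATER = 18.01056  # one water is added for the free N- and C-termini


def peptide_mass(seq):
    """Monoisotopic mass of a linear peptide given one-letter residue codes."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


gsf = peptide_mass("GSF")  # Gly-Ser-Phe
gfs = peptide_mass("GFS")  # Gly-Phe-Ser

# Same composition means the same mass, whatever the residue order.
print(math.isclose(gsf, gfs), round(gsf, 4))
```

    Since the sum does not depend on residue order, any pair of peptides with the same composition is isobaric, which is why a chromatographic separation step is needed before MS detection.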

  11. Different features of the MHC class I heterodimer have evolved at different rates. Chicken B-F and beta 2-microglobulin sequences reveal invariant surface residues

    DEFF Research Database (Denmark)

    Kaufman, J; Andersen, R; Avila, D;

    1992-01-01

    of small exons in the cytoplasmic region. The cDNA sequences were compared to turkey beta 2m, the apparent allele B-F12 alpha and other vertebrate homologs, using the 2.6 A structure of the human HLA-A2 molecule as a model. Both chicken alpha 1 and alpha 2 domains resemble mammalian classical class I...

  12. A Study on Korean Overseas Students' Expression of Prosodic Features of Chinese Trisyllabic Sequences

    Institute of Scientific and Technical Information of China (English)

    刘一杉

    2012-01-01

    This study examines how Korean overseas students express some of the prosodic features of Chinese trisyllabic sequences. It takes 100 trisyllabic sequences as the experimental corpus and 31 Korean overseas students from three grade levels as subjects. Statistical analysis of the experimental results yields three conclusions. First, the students' expression of the stress patterns of Chinese trisyllabic sequences did not improve as their Chinese proficiency increased. Second, their pronunciation is clearly influenced by the tone category of each syllable in the sequence. Third, their pronunciation is also generally affected by the position of each syllable within the trisyllabic sequence. Overall, Korean overseas students have many problems handling the stress patterns of Chinese trisyllabic sequences and the means of expressing stressed syllables. The study therefore argues that classroom teaching of Chinese prosody is necessary, and suggests emphasizing stress patterns and their means of expression and developing teaching methods targeted at the students' errors.

  13. Comparison of 5 monoclonal antibodies for immunopurification of human butyrylcholinesterase on Dynabeads: KD values, binding pairs, and amino acid sequences.

    Science.gov (United States)

    Peng, Hong; Brimijoin, Stephen; Hrabovska, Anna; Targosova, Katarina; Krejci, Eric; Blake, Thomas A; Johnson, Rudolph C; Masson, Patrick; Lockridge, Oksana

    2015-10-05

    Human butyrylcholinesterase (HuBChE) is a stoichiometric bioscavenger of nerve agents and organophosphorus pesticides. Mass spectrometry methods detect stable nerve agent adducts on the active site serine of HuBChE. The first step in sample preparation is immunopurification of HuBChE from plasma. Our goal was to identify monoclonal antibodies that could be used to immunopurify HuBChE on Dynabeads Protein G. Mouse anti-HuBChE monoclonal antibodies were obtained in the form of ascites fluid, dead hybridoma cells stored frozen at -80 °C for 30 years, or recently frozen hybridoma cells. RNA from 4 hybridoma cell lines was amplified by PCR for determination of their nucleotide and amino acid sequences. Full-length light and heavy chains were expressed, and the antibodies purified from culture medium. A fifth monoclonal was purchased. The 5 monoclonal antibodies were compared for ability to capture HuBChE from human plasma on Dynabeads Protein G. In addition, they were evaluated for binding affinity by Biacore and ELISA. Epitope mapping by pairing analysis was performed on the Octet Red96 instrument. The 5 monoclonal antibodies, B2 12-1, B2 18-5, 3E8, mAb2, and 11D8, had similar KD values of 10⁻⁹ M for HuBChE. Monoclonal B2 18-5 outperformed the others in the Dynabeads Protein G assay, where it captured 97% of the HuBChE in 0.5 ml plasma. Pairing analysis showed that 3E8 and B2 12-1 share the same epitope, 11D8 and B2 18-5 share the same epitope, but mAb2 and B2 12-1 or mAb2 and 3E8 bind to different epitopes on HuBChE. B2 18-5 was selected for establishment of a stable CHO cell line for production of mouse anti-HuBChE monoclonal.

  14. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    Directory of Open Access Journals (Sweden)

    Kager Piet A

    2006-10-01

    Full Text Available Abstract Background Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (Plasmodium antigen detection do often not allow for species discrimination as microscopy does, but also become insensitive at Methods This paper reports the development of sensitive and specific real-time Quantitative Nucleic Acid Sequence Based Amplification (real-time QT-NASBA) assays, based on the small-subunit 18S rRNA gene, to identify the four human Plasmodium species. Results The lower detection limit of the assay is 100 – 1000 molecules in vitro RNA for all species, which corresponds to 0.01 – 0.1 parasite per diagnostic sample (i.e. 50 μl of processed blood). The real-time QT-NASBA was further evaluated using 79 clinical samples from malaria patients: i.e. 11 Plasmodium falciparum, 37 Plasmodium vivax, seven Plasmodium malariae, four Plasmodium ovale and 20 mixed infections. The initial diagnosis of 69 out of the 79 samples was confirmed with the developed real-time QT-NASBA. Re-analysis of seven available original slides resolved five mismatches. Three of those were initially identified as P. malariae mono-infection, but after re-reading the slides P. falciparum was found, confirming the real-time QT-NASBA result. The other two slides were of poor quality, not allowing true species identification. The remaining five discordant results could not be explained by microscopy, but may be due to extremely low numbers of parasites present in the samples. In addition, 12 Plasmodium berghei isolates from mice and 20 blood samples from healthy donors did not show any reaction in the assay. Conclusion Real-time QT-NASBA is a very sensitive and specific technique with a detection limit of 0.1 Plasmodium parasite per diagnostic sample (50 μl of blood) and can be used for the detection, identification and quantitative measurement of low parasitaemia of Plasmodium species, thus

  15. Immunoglobulin V(H) gene sequence analysis of spontaneous murine immunoglobulin secreting B-cell tumours with clinical features of human disease

    NARCIS (Netherlands)

    Zhu, D.; Arkel, C. van; King, C.A.; Meirvenne, S. van; Greef, C. de; Thielemans, K.; Radl, J.; Stevenson, F.K.

    1998-01-01

    The 5T series of multiple myelomas (MM) and Waldenström's macroglobulinaemia-like lymphomas (WM), which developed spontaneously in ageing mice of the C57BL/KaLwRij strain, shows clinical and biological features that closely resemble their corresponding human diseases. In order to compare the patter

  16. Transcriptional analysis of the HeT-A retrotransposon in mutant and wild type stocks reveals high sequence variability at Drosophila telomeres and other unusual features

    Directory of Open Access Journals (Sweden)

    Piñeyro David

    2011-11-01

    Full Text Available Abstract Background Telomere replication in Drosophila depends on the transposition of a domesticated retroelement, the HeT-A retrotransposon. The sequence of the HeT-A retrotransposon changes rapidly resulting in differentiated subfamilies. This pattern of sequence change contrasts with the essential function with which the HeT-A is entrusted and brings about questions concerning the extent of sequence variability, the telomere contribution of different subfamilies, and whether wild type and mutant Drosophila stocks show different HeT-A scenarios. Results A detailed study on the variability of HeT-A reveals that both the level of variability and the number of subfamilies are higher than previously reported. Comparisons between GIII, a strain with longer telomeres, and its parental strain Oregon-R indicate that both strains have the same set of HeT-A subfamilies. Finally, the presence of a highly conserved splicing pattern only in its antisense transcripts indicates a putative regulatory, functional or structural role for the HeT-A RNA. Interestingly, our results also suggest that most HeT-A copies are actively expressed regardless of which telomere and where in the telomere they are located. Conclusions Our study demonstrates how the HeT-A sequence changes much faster than previously reported resulting in at least nine different subfamilies most of which could actively contribute to telomere extension in Drosophila. Interestingly, the only significant difference observed between Oregon-R and GIII resides in the nature and proportion of the antisense transcripts, suggesting a possible mechanism that would in part explain the longer telomeres of the GIII stock.

  17. Ethosomes® and transfersomes® containing linoleic acid: physicochemical and technological features of topical drug delivery carriers for the potential treatment of melasma disorders.

    Science.gov (United States)

    Celia, Christian; Cilurzo, Felisa; Trapasso, Elena; Cosco, Donato; Fresta, Massimo; Paolino, Donatella

    2012-02-01

    Two vesicular colloidal carriers, ethosomes® and transfersomes® were proposed for the topical delivery of linoleic acid, an active compound used in the therapeutic treatment of hyperpigmentation disorders, i.e. melasma, which is characterized by an increase of the melanin production in the epidermis. Dynamic light scattering was used for the physicochemical characterization of vesicles and mean size, size distribution and zeta potential were evaluated. The stability of formulations was also evaluated using the Turbiscan Lab® Expert based on the analysis of sample transmittance and photon backscattering. Ethosomes® and transfersomes® were prepared using Phospholipon 100 G®, as the lecithin component, and ethanol and sodium cholate, as edge activator agents, respectively. Linoleic acid at 0.05% and 0.1% (w/v) was used as the active ingredient and entrapped in colloidal vesicles. Technological parameters, i.e. entrapment efficacy, drug release and permeation profiles, were also investigated. Experimental findings showed that physicochemical and technological features of ethosomes® and transfersomes® were influenced by the lipid composition of the carriers. The percutaneous permeation experiments of linoleic acid-loaded ethosomes® and transfersomes® through human stratum corneum-epidermidis membranes showed that both carriers are accumulated in the skin membrane model as a function of their lipid compositions. The findings reported in this investigation showed that both vesicular carriers could represent a potential system for the topical treatment of hyperpigmentation disorders.

  18. A rapid method for characterization of protein relatedness using feature vectors.

    Directory of Open Access Journals (Sweden)

    Kareem Carr

    Full Text Available We propose a feature vector approach to characterize the variation in large data sets of biological sequences. Each candidate sequence produces a single feature vector constructed with the number and location of amino acids or nucleic acids in the sequence. The feature vector characterizes the distance between the actual sequence and a model of a theoretical sequence based on the binomial and uniform distributions. This method is distinctive in that it does not rely on sequence alignment for determining protein relatedness, allowing the user to visualize the relationships within a set of proteins without making a priori assumptions about those proteins. We apply our method to two large families of proteins: protein kinase C, and globins, including hemoglobins and myoglobins. We interpret the high-dimensional feature vectors using principal components analysis and agglomerative hierarchical clustering. We find that the feature vector retains much of the information about the original sequence. By using principal component analysis to extract information from collections of feature vectors, we are able to quickly identify the nature of variation in a collection of proteins. Where collections are phylogenetically or functionally related, this is easily detected. Hierarchical agglomerative clustering provides a means of constructing cladograms from the feature vector output.
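
    The alignment-free idea in this abstract can be illustrated with a minimal sketch. It is not the authors' construction: the abstract does not give the exact definition, so the sketch assumes two components per residue type, a z-score of the residue count against a Binomial(n, p) model with p = 1/20, and a z-score of the mean relative position against a uniform placement model. The names `feature_vector` and `distance` are hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def feature_vector(seq, p=1 / 20):
    """Return 2 numbers per amino acid: count z-score and position z-score.

    Assumed model (not taken from the paper): counts follow Binomial(n, p)
    and residue positions are uniform along the sequence.
    """
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    vec = []
    for aa in AMINO_ACIDS:
        positions = [i for i, c in enumerate(seq) if c == aa]
        k = len(positions)
        # Deviation of the observed count from the binomial expectation n*p.
        count_z = (k - n * p) / math.sqrt(n * p * (1 - p))
        if k:
            # Mean relative position in (0, 1); the uniform model expects 0.5
            # with variance 1/(12*k) for the mean of k placements.
            mean_pos = sum((i + 0.5) / n for i in positions) / k
            pos_z = (mean_pos - 0.5) / math.sqrt(1 / (12 * k))
        else:
            pos_z = 0.0
        vec.extend([count_z, pos_z])
    return vec


def distance(u, v):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```

    Because every sequence maps to a vector of the same length, sequences of different lengths can be compared without alignment, and the resulting vectors could be passed to principal components analysis or hierarchical clustering as the abstract describes.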

  19. The Origin and Features of the Two Intermediate-Acid Intrusive Series in Tongling Area, Anhui, China

    Institute of Scientific and Technical Information of China (English)

    WU Cailai; CHEN Songyong; HAO Meiying; SHI Rendeng

    2001-01-01

    The Mesozoic intermediate-acid intrusive rocks in Tongling area can be divided into two magmatic series, namely the shoshonitic series and the high-potassium (K) calc-alkaline series. The shoshonitic series is characterized by an association of pyroxene monzodiorites+monzodiorites+quartz monzonites; the Rittmann indices of these rocks are greater than 4 (ranging from 4.2 to 5.8), the total of rare earth elements (∑REE) ranges from 210×10⁻⁶ to 260×10⁻⁶, and the series is closely associated with gold deposits. The high-potassium calc-alkaline series is characterized by an association of diorite+quartz monzodiorites+granodiorites; its Rittmann indices are less than 4 (1.8-3.2), its ∑REE ranges from 130×10⁻⁶ to 180×10⁻⁶, and it is closely associated with copper deposits. Together with lithoenclave study, it can be concluded that the shoshonitic series magma is derived from the differentiation of the alkaline basic magma in the mantle and that the high-potassium (K) calc-alkaline series magma is a mixture of the differentiated mantle magma and crustal magma.

  20. Endonuclease restriction-mediated real-time polymerase chain reaction: a novel technique for rapid, sensitive and quantitative detection of nucleic-acid sequence

    Directory of Open Access Journals (Sweden)

    Yi Wang

    2016-07-01

    Full Text Available The article reported a novel methodology for real-time PCR analysis of nucleic acids, termed endonuclease restriction-mediated real-time polymerase chain reaction (ET-PCR). Just like PCR, ET-PCR only required one pair of primers. A short sequence (Ss), which was recognized by restriction enzyme BstUI, was attached to the 5’ end of the forward (F) or reverse (R) PCR primer, and the new F or R primer was named EF or ER. EF/ER was labeled at the 5’ end with a reporter dye and in the middle with a quenching dye. BstUI cleaves the newly synthesized double-stranded terminal sequences (5’ end recognition sequences and their complementary sequences) during the extension phase, which separates the reporter molecule from the quenching dye, leading to a gain of fluorescence signal. This process is repeated in each amplification cycle and does not affect the exponential synthesis of the PCR amplification. ET-PCR allowed real-time analysis of single or multiple targets in a single vessel, and provided the reproducible quantitation of nucleic acids. The analytical sensitivity and specificity of ET-PCR were successfully evaluated, detecting down to 250 fg of genomic DNA per tube of target pathogen DNA examined, and the positive results were generated in a relatively short period. Moreover, the practical application of ET-PCR for simultaneous detection of multiple target pathogens was also demonstrated in artificially contaminated blood samples. In conclusion, due to the technique’s simplicity of design, reproducible data and low contamination risk, ET-PCR assay is an appealing alternative to conventional approaches currently used for real-time nucleic acid analysis.

  1. Interactive Effect of Salicylic Acid on Some Physiological Features and Antioxidant Enzymes Activity in Ginger (Zingiber officinale Roscoe)

    Directory of Open Access Journals (Sweden)

    Hawa Z. E. Jaafar

    2013-05-01

    Full Text Available The effect of foliar salicylic acid (SA) applications (10⁻³ and 10⁻⁵ M) on activities of nitrate reductase, guaiacol peroxidase (POD), superoxide dismutases (SOD), catalase (CAT) and proline enzymes and physiological parameters was evaluated in two ginger varieties (Halia Bentong and Halia Bara) under greenhouse conditions. In both varieties, tested treatments generally enhanced photosynthetic rate and total dry weight. Photosynthetic rate increases were generally accompanied by increased or unchanged stomatal conductance levels, although intercellular CO2 concentrations of treated plants were typically lower than in controls. Lower SA concentrations were generally more effective in enhancing photosynthetic rate and plant growth. Exogenous application of SA increased antioxidant enzyme activities and proline content; the greatest responses were obtained in plants sprayed with 10⁻⁵ M SA, with significant increases observed in CAT (20.1%), POD (45.2%), SOD (44.1%) and proline (43.1%) activities. Increased CAT activity in leaves is naturally expected to increase photosynthetic efficiency and thus net photosynthesis by maintaining a constant CO2 supply. Our results support the idea that low SA concentrations (10⁻⁵ M) may induce nitrite reductase synthesis by mobilizing intracellular NO₃⁻ and can provide protection to nitrite reductase degradation in vivo in the absence of NO₃⁻. Observed positive correlations among proline, SOD, CAT and POD activities in the studied varieties suggest that increased SOD activity was accompanied by increases in CAT and POD activities because of the high demands of H2O2 quenching.

  2. Chemical reactive features of novel amino acids intercalated layered double hydroxides in As(III) and As(V) adsorption.

    Science.gov (United States)

    Shen, Liang; Jiang, Xiuli; Chen, Zheng; Fu, Dun; Li, Qingbiao; Ouyang, Tong; Wang, Yuanpeng

    2017-06-01

    Layered double hydroxides (LDHs) intercalated with amino acids such as methionine (Met) were synthesized as new adsorbents to remediate arsenic-polluted water. This Zn2Al-Met-LDHs, identified with the formula of Zn0.7Al0.3(OH)2(Met)0.3·0.32H2O, has good thermal stability. Adsorption experiments with Zn2Al-Met-LDHs showed that the residual arsenic in solution could be reduced below the regulation limit, and this adsorption process fitted Langmuir isotherm and the pseudo-second-order kinetics well. A remarkably high removal efficiency and the maximum adsorption capacity for As(III) were achieved, 96.7% and 94.1 mg/g, respectively, at 298 K. The desorption efficiency of As(III) from the arsenic-saturated Zn2Al-Met-LDHs (<8.7%), far less than that of As(V), promises a specific and reliable uptake of As(III) in sorts of solutions. More importantly, a complete and in-depth spectra analysis through FTIR, XPS and NMR was conducted to explain the excellent performance of Zn2Al-Met-LDHs in arsenic removal. Herein, two special chemical reactions were proposed as the dominant mechanisms, i.e., hydrogen bonding between the carboxyl group of the host Met and the hydroxyl group of As(III) or As(V), and the formation of a chelate ring between the guest As(III) and the S, N bidentate ligands of the intercalated Met in the LDHs.

  3. Axolotl hemoglobin: cDNA-derived amino acid sequences of two alpha globins and a beta globin from an adult Ambystoma mexicanum.

    Science.gov (United States)

    Shishikura, Fumio; Takeuchi, Hiro-aki; Nagai, Takatoshi

    2005-11-01

    Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.

  4. The complete genome sequence of Natrinema sp. J7-2, a haloarchaeon capable of growth on synthetic media without amino acid supplements.

    Directory of Open Access Journals (Sweden)

    Jie Feng

    Full Text Available Natrinema sp. J7-2 is an extreme haloarchaeon capable of growing on synthetic media without amino acid supplements. Here we report the complete genome sequence of Natrinema sp. J7-2 which is composed of a 3,697,626-bp chromosome and a 95,989-bp plasmid pJ7-I. This is the first complete genome sequence of a member of the genus Natrinema. We demonstrate that Natrinema sp. J7-2 can use gluconate, glycerol, or acetate as the sole carbon source and that its genome encodes complete metabolic pathways for assimilating these substrates. The biosynthetic pathways for all 20 amino acids have been reconstructed, and we discuss a possible evolutionary relationship between the haloarchaeal arginine synthetic pathway and the bacterial lysine synthetic pathway. The genome harbors the genes for assimilation of ammonium and nitrite, but not nitrate, and has a denitrification pathway to reduce nitrite to N2O. Comparative genomic analysis suggests that most sequenced haloarchaea employ the TrkAH system, rather than the Kdp system, to actively take up potassium. The genomic analysis also reveals that one of the three CRISPR loci in the Natrinema sp. J7-2 chromosome is located in an integrative genetic element and is probably propagated via horizontal gene transfer (HGT). Finally, our phylogenetic analysis of haloarchaeal genomes provides clues about evolutionary relationships of haloarchaea.

  5. Amino Acid Sequence of an Excitatory Insect-selective Toxin (BmK IT) From Venom of the Scorpion Buthus martensi Karsch

    Institute of Scientific and Technical Information of China (English)

    吉永华; P. Mansuelle; 徐科; C. Granier; C. Kopeyan; S. Terakawa; H. Rochat

    1994-01-01

    The insect-selective neurotoxin (BmK IT) of the scorpion Buthus martensi Karsch was first reduced and S-alkylated, and then digested by TPCK-trypsin and Staphylococcus aureus V-8 protease. The enzymatic peptides were purified on TLC plastic sheets and their amino acid compositions and sequences were determined. The sequence of the 70 amino acid residues of BmK IT was established with reference to the primary structure of AaH IT, another excitatory insect-selective toxin from the venom of the North African scorpion Androctonus australis Hector. About 75% sequence homology was found between BmK IT and AaH IT. It is obvious that the results contribute toward better understanding of the molecular structure characteristics, structure/activity relationship of scorpion insect-selective toxins, and they can serve as the molecular basis for utilizing the toxins as a tool to clarify molecular mechanism involved in channel gating, and to infer the possibility of developing them as new selective b

  6. Transcriptome sequencing revealed the transcriptional organization at ribosome-mediated attenuation sites in Corynebacterium glutamicum and identified a novel attenuator involved in aromatic amino acid biosynthesis.

    Science.gov (United States)

    Neshat, Armin; Mentz, Almut; Rückert, Christian; Kalinowski, Jörn

    2014-11-20

    The Gram-positive bacterium Corynebacterium glutamicum belongs to the order Corynebacteriales and is used as a producer of amino acids at industrial scales. Due to its economic importance, gene expression and particularly the regulation of amino acid biosynthesis has been investigated extensively. Applying the high-resolution technique of transcriptome sequencing (RNA-seq), recently a vast amount of data has been generated that was used to comprehensively analyze the C. glutamicum transcriptome. By analyzing RNA-seq data from a small RNA cDNA library of C. glutamicum, short transcripts in the known transcriptional attenuators sites of the trp operon, the ilvBNC operon and the leuA gene were verified. Furthermore, whole transcriptome RNA-seq data were used to elucidate the transcriptional organization of these three amino acid biosynthesis operons. In addition, we discovered and analyzed the novel attenuator aroR, located upstream of the aroF gene (cg1129). The DAHP synthase encoded by aroF catalyzes the first step in aromatic amino acid synthesis. The AroR leader peptide contains the amino acid sequence motif F-Y-F, indicating a regulatory effect by phenylalanine and tyrosine. Analysis by real-time RT-PCR suggests that the attenuator regulates the transcription of aroF in dependence of the cellular amount of tRNA loaded with phenylalanine when comparing a phenylalanine-auxotrophic C. glutamicum mutant fed with limiting and excess amounts of a phenylalanine-containing dipeptide. Additionally, the very interesting finding was made that all analyzed attenuators are leaderless transcripts.

  7. Identification in the human central nervous system, pituitary, and thyroid of a novel calcitonin gene-related peptide, and partial amino acid sequence in the spinal cord.

    Science.gov (United States)

    Petermann, J B; Born, W; Chang, J Y; Fischer, J A

    1987-01-15

    Two human genes encoding precursors for two calcitonin gene-related peptides (CGRP) I (or alpha) and II (or beta) have been identified (Steenbergh, P. H., Höppener, J. W. M., Zandberg, J., Lips, C. J. M., and Jansz, H. S. (1985) FEBS Lett. 183, 403-407). The amino acid sequence of CGRP-I was obtained in medullary thyroid carcinoma extracts (Morris, H. R., Panico, M., Etienne, T., Tippins, J., Girgis, S. I., and MacIntyre, I. (1984) Nature 308, 746-748), but not in normal human tissues. The human CGRP-II peptide remained to be discovered. Here we have determined in the human spinal cord the amino acid composition and the partial amino acid sequence of the DNA-predicted CGRP-I and -II. The data indicate for the first time the existence of a second CGRP different from the known CGRP-I. CGRP-II has been identified in the central nervous system, pituitary, thyroid, and in medullary thyroid carcinoma as a major CGRP form together with CGRP-I.

  8. Amino acid sequence of the oligomycin sensitivity-conferring protein (OSCP) of beef-heart mitochondria and its homology with the delta-subunit of the F1-ATPase of Escherichia coli.

    Science.gov (United States)

    Ovchinnikov, Y A; Modyanov, N N; Grinkevich, V A; Aldanova, N A; Trubetskaya, O E; Nazimov, I V; Hundal, T; Ernster, L

    1984-01-23

    The complete amino acid sequence of the oligomycin sensitivity-conferring protein (OSCP) of beef-heart mitochondria is reported. The protein contains 190 amino acids and has a molecular mass of 20 967. Its structure is characterized by a concentration of charged amino acids in the two terminal segments (N 1-77 and C 128-190) of the protein, whereas its central region is more hydrophobic. The earlier reported homology of the protein with the delta-subunit of E. coli F1, based on the terminal amino acid sequences of OSCP, is further substantiated.

  9. In-gel microwave-assisted acid hydrolysis of proteins combined with liquid chromatography tandem mass spectrometry for mapping protein sequences.

    Science.gov (United States)

    Sun, Difei; Wang, Nan; Li, Liang

    2014-01-07

    We report an enabling method for mapping the protein sequence with high sequence coverage. This method combines the high separation power of gel electrophoresis for protein separation with the high sequence coverage capability of microwave-assisted acid hydrolysis (MAAH) mass spectrometry (MS). In-gel MAAH using 25% trifluoroacetic acid was developed and optimized for degrading the gel-separated protein into small peptides suitable for tandem MS sequencing. For bovine serum albumin (BSA) (∼67 kDa), with 4 μg of protein loading onto a gel for separation, followed by excising the protein gel band for in-gel MAAH and then injecting ∼2 μg of the resultant peptides into a liquid chromatography quadrupole time-of-flight mass spectrometer for analysis, 689 ± 54 (n = 3) unique peptides were identified with a protein sequence coverage of 99 ± 1%. Both the number of peptides detected and sequence coverage decreased as the sample amount decreased, mainly due to background interference: 316 ± 59 peptides and 94 ± 3% coverage for 2 μg loading, 136 ± 19 and 76 ± 5% for 1 μg loading, and 30 ± 2 and 32 ± 2% for 0.5 μg loading. To demonstrate the general applicability of the method, 10 gel bands from gel electrophoresis of an albumin-depleted human plasma sample were excised for in-gel MAAH LC-MS analysis. In total, 19 relatively high abundance proteins with molecular weights ranging from ∼8 to ∼160 kD could be mapped with coverage of 100% for six proteins (MW 8759 to 68 425 Da), 96-98% for five proteins (MW 11 458 to 36 431 Da), 92% for three proteins (MW 15 971 to 36 431 Da), 80-87% for four proteins (MW 42 287 to 162 134 Da), and 56% for one protein (MW 51 358 Da). Finally, to demonstrate the applicability of the method for more detailed analysis of complex protein mixtures, two-dimensional (2D) gel electrophoresis was combined with in-gel MAAH, affinity purification, and LC-MS/MS to characterize six bovine alpha-S1-casein phosphoprotein

  10. Multiplex, rapid and sensitive isothermal detection of nucleic-acid sequence by endonuclease restriction-mediated real-time multiple cross displacement amplification

    Directory of Open Access Journals (Sweden)

    Yi Wang

    2016-05-01

    Full Text Available We have devised a novel isothermal amplification technology, termed endonuclease restriction-mediated real-time multiple cross displacement amplification (ET-MCDA), which facilitated multiplex, rapid, specific and sensitive detection of nucleic-acid sequences at a constant temperature. The ET-MCDA integrated a multiple cross displacement amplification strategy, restriction endonuclease cleavage and a real-time fluorescence detection technique. In the ET-MCDA system, the functional cross primer E-CP1 or E-CP2 was constructed by adding a short sequence at the 5’ end of CP1 or CP2, respectively, and the new E-CP1 or E-CP2 primer was labelled at the 5’ end with a fluorophore and in the middle with a dark quencher. The restriction endonuclease Nb.BsrDI specifically recognized the short sequence and digested the newly synthesized double-stranded terminal sequences (5’ end short sequences and their complementary sequences), which released the quenching, resulting in a gain of fluorescence signal. Thus, the ET-MCDA allowed real-time detection of single or multiple targets in only a single reaction, and the positive results were observed in as short as 12 minutes, detecting down to 3.125 fg of genomic DNA per tube. Moreover, the analytical specificity and the practical application of the ET-MCDA were also successfully evaluated in this study. Here we provided the details on the novel ET-MCDA technique and expounded the basic ET-MCDA amplification mechanism.

  11. A Recognition Algorithm for Radar PRI Modulation Mode Based on Extremum Sequence Features Set

    Institute of Scientific and Technical Information of China (English)

    周一鹏; 王星; 田元荣; 周东青; 程嗣怡

    2016-01-01

    Recognizing the pulse repetition interval (PRI) modulation mode of a radar signal is an important means of analyzing the operating state and task of a radar. Advanced radars can switch PRI modulation modes and change modulation parameters in real time, which makes them difficult to recognize; to address this problem, a recognition algorithm based on an extremum sequence feature set is proposed. The algorithm first extracts the extremum features of the PRI sequence and constructs the extremum sequence feature set. Decision criteria for constant, sine-like, sinusoidal and jittered modulation are then established from the PRI sequence and its feature set, giving a multi-layer recognition of the radar PRI modulation mode. Simulation analysis shows that the algorithm recognizes the PRI modulation modes of advanced radars with a correct rate of 95.3% and classifies them quickly, so it has good application prospects in electronic countermeasures.
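
    A layered decision of the kind described here can be sketched as follows. The abstract does not state the actual criteria, so every threshold and rule below is an assumption made for illustration: a PRI sequence is called constant when its spread is below 1% of its mean, jittered when at least half of its interior points are local extrema (an independent random sequence has about two thirds), and otherwise sinusoidal or sine-like. The sketch does not separate sine from sine-like modulation, and the function names are hypothetical.

```python
import math


def local_extrema(seq):
    """Indices of strict local maxima and minima of a sequence."""
    return [
        i
        for i in range(1, len(seq) - 1)
        if (seq[i] - seq[i - 1]) * (seq[i + 1] - seq[i]) < 0
    ]


def classify_pri(seq, const_tol=0.01, jitter_density=0.5):
    """Layered classification of a PRI sequence (illustrative thresholds)."""
    if len(seq) < 3:
        raise ValueError("need at least three PRI values")
    mean = sum(seq) / len(seq)
    # Layer 1: a nearly flat sequence is treated as constant PRI.
    if max(seq) - min(seq) <= const_tol * mean:
        return "constant"
    # Layer 2: extrema on most samples suggest random jitter.
    density = len(local_extrema(seq)) / (len(seq) - 2)
    if density >= jitter_density:
        return "jitter"
    # Layer 3: sparse extrema suggest a slowly varying modulation.
    return "sinusoidal or sine-like"


if __name__ == "__main__":
    sine = [100 + 10 * math.sin(2 * math.pi * i / 20) for i in range(100)]
    print(classify_pri(sine))
```

    Ordering the tests from cheapest to most specific keeps each layer simple; a fuller version would add further extremum features, for example the regularity of the spacing between extrema, to split the last class.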

  12. FeatureViewer, a BioJS component for visualization of position-based annotations in protein sequences [v1; ref status: indexed, http://f1000r.es/2u2]

    Directory of Open Access Journals (Sweden)

    Leyla Garcia

    2014-02-01

    Full Text Available Summary: FeatureViewer is a BioJS component that lays out, maps, orients, and renders position-based annotations for protein sequences. This component is highly flexible and customizable, allowing the presentation of annotations by rows, all centered, or distributed in non-overlapping tracks. It uses either lines or shapes for sites and rectangles for regions. The result is a powerful visualization tool that can be easily integrated into web applications as well as documents as it provides an export-to-image functionality. Availability: https://github.com/biojs/biojs/blob/master/src/main/javascript/Biojs.FeatureViewer.js; http://dx.doi.org/10.5281/zenodo.7719

  13. Comparison of amino acid sequences of the trypsin inhibitors from taro (Colocasia esculenta), giant taro (Alocasia macrorrhiza) and giant swamp taro (Cyrtosperma chamissonis).

    Science.gov (United States)

    Peng, L; Bradbury, J H; Hammer, B C; Shaw, D C

    1993-09-01

    The amino acid sequences of the trypsin inhibitors from taro Colocasia esculenta var. esculenta and giant swamp taro Cyrtosperma chamissonis have been determined and are compared with the protein sequence of the trypsin/chymotrypsin inhibitor from giant taro Alocasia macrorrhiza. Both inhibitors display polymorphism and there is evidence of two components in the giant swamp taro. The positional identity between the proteins is highest at 73-75% for the comparison of the giant taro (GT) with the polymorphic forms of the taro (T) inhibitors and lowest at 56-58% for the pairs of taro and giant swamp taro (GST) proteins. The comparisons show that the inhibitors from T and GT are more related to each other than to GST, which supports their taxonomic classification into different tribes. Location of the P1 site for the trypsin inhibitors of aroids is different from that of other Kunitz-type inhibitors and could be at Leu56.

  14. Development of an electrochemical DNA biosensor for detection of specific Mycobacterium tuberculosis sequence based on poly(L-glutamic acid) modified electrode

    Indian Academy of Sciences (India)

    MERVE YESIL; SONER DONMEZ; FATMA ARSLAN

    2016-11-01

    An electrochemical DNA biosensor was developed by avidin-biotin interaction of a biotinylated probe and an avidin-attached, poly(L-glutamic acid)-coated pencil graphite electrode (PGA/PGE) for detection of a specific Mycobacterium tuberculosis DNA sequence. Fully complementary hybridization was discriminated from mismatch hybridization by the electrochemical reduction current of Meldola’s Blue (MDB) in square wave voltammetry (SWV). The calibration graph of the DNA biosensor was linear between 1.5 and 12.5 nM and the detection limit was calculated as 1.3 nM. The proposed biosensor successfully discriminated short and long oligonucleotides related to the DNA sequence of Mycobacterium tuberculosis under optimal conditions.

  15. Amino acid sequence of toxin XI of the scorpion Buthus occitanus tunetanus. Evidence of a mutation having an important effect upon neurotoxic activity.

    Science.gov (United States)

    Sampieri, F; Habersetzer-Rochat, C; Martin, M F; Kopeyan, C; Rochat, H

    1987-02-01

    The complete amino acid sequence of toxin XI of the North African scorpion Buthus occitanus tunetanus has been elucidated by automatic sequencing of the reduced and alkylated toxin and of the peptides obtained after tryptic cleavage restricted to arginyl bonds. This toxin is structurally homologous to toxin II of Androctonus australis Hector, the most active among the alpha-toxins, but is far less potent, both in vivo and in vitro. This work points out 12 mutations, many of which are conservative. Nevertheless, the most striking difference is the replacement of the lysine residue at position 58, known to be important in the activity of AaH toxin II, by a valine residue. Thus, it seems that the presence of a positive charge at this location facilitates the interactions between the receptor on the sodium channel and the alpha-type toxins.

  16. Meta-Feature Sequencing of Aircraft Structural Part Based on Branch-and-Bound Algorithm

    Institute of Scientific and Technical Information of China (English)

    黄玲玲; 李迎光; 王伟; 汤立民

    2015-01-01

    The currently defined machining features of aircraft structural parts do not map one-to-one onto machining operations, so feature sequencing produces a large amount of non-cutting tool travel. To solve this problem, the machining features are decomposed into meta-features, each associated with a single machining operation. Rules abstracted from the machining process and a least-cost branch-and-bound algorithm are used to sequence the meta-features so as to minimize the non-cutting path.
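    The sequencing idea in this abstract can be sketched as a small least-cost branch-and-bound search. This is an illustrative toy, not the authors' implementation: meta-features are reduced to 2D points, non-cutting travel is taken as straight-line distance, machining rules are modelled as simple (before, after) precedence pairs, and the lower bound is just the cost accumulated so far. All names are hypothetical.

```python
import heapq
import math
from itertools import count


def sequence_features(points, start=(0.0, 0.0), precedence=()):
    """Return (order, cost) minimising total travel from `start` through all points.

    points     -- list of (x, y) positions, one per meta-feature
    precedence -- iterable of (before, after) index pairs that the order must respect
    """
    n = len(points)
    must_precede = {i: set() for i in range(n)}
    for before, after in precedence:
        must_precede[after].add(before)

    def dist(p, q):
        return math.hypot(p[0] - q[0], p[1] - q[1])

    best_cost, best_order = math.inf, None
    tie = count()  # unique tiebreaker so the heap never compares tuples of orders
    heap = [(0.0, next(tie), (), start)]  # (cost so far, tiebreak, partial order, position)
    while heap:
        cost, _, order, pos = heapq.heappop(heap)
        if cost >= best_cost:
            continue  # bound: this branch cannot beat the best complete order
        if len(order) == n:
            best_cost, best_order = cost, list(order)
            continue
        done = set(order)
        for i in range(n):
            # Branch only on unvisited features whose prerequisites are all done.
            if i in done or not must_precede[i] <= done:
                continue
            new_cost = cost + dist(pos, points[i])
            if new_cost < best_cost:
                heapq.heappush(heap, (new_cost, next(tie), order + (i,), points[i]))
    return best_order, best_cost


if __name__ == "__main__":
    pts = [(0.0, 5.0), (0.0, 1.0), (0.0, 3.0)]
    print(sequence_features(pts))
    print(sequence_features(pts, precedence=[(0, 1)]))
```

    A tighter lower bound (for instance, adding the nearest-neighbour distance of every unvisited feature) would prune more branches; the trivial bound is used here only to keep the sketch short.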

  17. Probing Structural Features and Binding Mode of 3-Arylpyrimidin-2,4-diones within Housefly γ-Aminobutyric Acid (GABA) Receptor

    Directory of Open Access Journals (Sweden)

    Xiangya Kong

    2011-09-01

    In order to obtain structural features of 3-arylpyrimidin-2,4-diones, which have emerged as promising inhibitors of the insect γ-aminobutyric acid (GABA) receptor, a set of ligand-/receptor-based 3D-QSAR models for 60 derivatives was generated using Comparative Molecular Field Analysis (CoMFA) and Comparative Molecular Similarity Index Analysis (CoMSIA). The statistically optimal CoMSIA model is produced with highest q2 of 0.62, r2ncv of 0.97, and r2pred of 0.95. A minor/bulky electronegative hydrophilic polar substituent at the 1-/6-position of the uracil ring, and bulky substituents at the 3'-, 4'- and 5'-positions of the benzene ring are beneficial for the enhanced potency of the inhibitors, as revealed by the obtained 3D-contour maps. Furthermore, homology modeling, molecular dynamics (MD) simulation and molecular docking are also carried out to gain a better understanding of the probable binding modes of these inhibitors, and the results show that residues Ala-183(C), Thr-187(B), Thr-187(D) and Thr-187(E) in the second transmembrane domains of the GABA receptor are responsible for the H-bonding interactions with the inhibitor. The good correlation between docking observations and 3D-QSAR analyses further supports the reasonableness of the model in probing the structural features and the binding mode of 3-arylpyrimidin-2,4-dione derivatives within the housefly GABA receptor.

  18. Analysis of nanopore arrangement and structural features of anodic alumina layers formed by two-step anodizing in oxalic acid using the dedicated executable software

    Science.gov (United States)

    Zaraska, Leszek; Stępniowski, Wojciech J.; Sulka, Grzegorz D.; Ciepiela, Eryk; Jaskuła, Marian

    2014-02-01

    Anodic porous alumina layers were fabricated by a two-step self-organized anodization in 0.3 M oxalic acid under various anodizing potentials ranging from 30 to 60 V at two different temperatures (10 and 17 °C). The effect of anodizing conditions on structural features and pore arrangement of AAO was investigated in detail by using the dedicated executable publication combined with ImageJ software. With increasing anodizing potential, a linear increase of the average pore diameter, interpore distance, wall thickness and barrier layer thickness, as well as a decrease of the pore density, were observed. In addition, higher pore diameter and porosity values were obtained for samples anodized at the elevated temperature, independently of the anodizing potential. The degree of pore order was investigated on the basis of Delaunay triangulations (defect maps) and calculation of pair distribution or angle distribution functions (PDF or ADF), respectively. All methods confirmed that in order to obtain nanoporous alumina with the best hexagonal pore arrangement, a potential of 40 V should be applied during anodization. It was confirmed that the dedicated executable publication can be used for fast and complex analysis of nanopore arrangement and structural features of nanoporous oxide layers.

  19. E-probe Diagnostic Nucleic acid Analysis (EDNA): A theoretical approach for handling of next generation sequencing data for diagnostics

    Science.gov (United States)

    There are many plant pathogen-specific diagnostic assays, based on PCR and immune-detection. However, the ability to test for large numbers of pathogens simultaneously is lacking. Next generation sequencing (NGS) allows one to detect all organisms within a given sample, but has computational limitat...

  20. FiveS rRNA sequences and fatty acid profiles of colourless sulfur-oxidising bacteria

    Digital Repository Service at National Institute of Oceanography (India)

    LokaBharathi, P.A.; Ortiz-conde, B.A.; Nair, S.; Chandramohan, D.; Colwell, R.R.

    was Pediococcus halophlus with an E.D. of 27.5 with very little similarity. Visual comparison of 5S rRNA sequences with other G-ve organisms that were intuitively chosen for their relatedness showed that both these isolates have the greatest homology (82...

  1. In silico prediction of exposure amino acid sequences of outer inflammatory protein A of Helicobacter pylori for surface display on Escherichia coli

    Directory of Open Access Journals (Sweden)

    Omid Teymournejad

    2012-01-01

    Conclusion: OipA can induce IL-8 from gastric epithelial cells, and some papers report that this outer membrane protein is involved in attachment and intracellular signaling. The receptor of OipA and the adhesion motifs on this protein are unknown. Detecting exposed motifs aids recognition of the adhesion motifs and of the OipA receptor on gastric epithelial cells. In this study, we predicted exposed amino acid sequences for insertion into the CstH subunit of CS3 pili of E. coli for surface display.

  2. Detection of Campylobacter jejuni and Campylobacter coli in chicken meat samples by real-time nucleic acid sequence-based amplification with molecular beacons.

    Science.gov (United States)

    Churruca, E; Girbau, C; Martínez, I; Mateo, E; Alonso, R; Fernández-Astorga, A

    2007-06-10

    A nucleic acid sequence-based amplification (NASBA) assay based on molecular beacons was used for real-time detection of Campylobacter jejuni and Campylobacter coli in samples of chicken meat. A set of specific primers and beacon probe were designed to target the 16S rRNA of both species. The real-time NASBA protocol including the RNA isolation was valid for both of the cell suspensions in buffered saline and the artificially contaminated chicken meat samples. The presence of rRNA could be correlated with cellular viability, following inactivation of the bacteria by heating, in inoculated chicken meat samples but not in RNase-free cell suspensions.

  3. Preparation of multihollow P(St-MAA) particles by sequence soap-free/soap emulsion polymerization and followed by stepwise alkali/acid posttreatment

    Institute of Scientific and Technical Information of China (English)

    Rong Long Li; Cheng You Kan; Ze Ping Li; Yi Du; Yan Nan Cui

    2007-01-01

    The effects of an ionic emulsifier, sodium dodecylbenzene sulfate (SDBS), on the formation of multihollow structures in sub-micron sized polymer particles produced by alkali/acid posttreatment were investigated. The original latex particles with narrow size distribution were synthesized by a new sequence emulsifier-free/emulsifier emulsion copolymerization of styrene (St) and methacrylic acid (MAA). Results indicated that the pore size decreased and the pore number increased with increasing SDBS amount. The morphology of the posttreated latex particles was also significantly influenced by the time at which SDBS was introduced during preparation of the original latex particles; a suitable introduction time was 3 h of polymerization.

  4. Amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by streptococcal IgA1 proteases

    DEFF Research Database (Denmark)

    Batten, MR; Senior, BW; Kilian, Mogens;

    2003-01-01

    The amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by IgA1 proteases of different species of Streptococcus were investigated. Recombinant IgA1 antibodies were generated with point mutations at proline 227 and threonine 228, the residues lying on either...... that proteases from different strains may have different amino acid requirements for cleavage. Remarkably, some streptococcal proteases appeared able to cleave the hinge at a distant alternative site if substitution prevented efficient cleavage of the original site. Hence, this study has identified key residues...... required for the recognition of the IgA1 hinge as a substrate by streptococcal IgA1 proteases, and it marks a preliminary step towards development of specific enzyme inhibitors....

  5. Localization of an O-glycosylated site in the recombinant barley alpha-amylase 1 produced in yeast and correction of the amino acid sequence using matrix-assisted laser desorption/ionization mass spectrometry of peptide mixtures

    DEFF Research Database (Denmark)

    Andersen, Jens S.; Søgaard, M; Svensson, B

    1994-01-01

    , and analyzed directly by MALDI-MS. Based on the three mass spectrometric peptide maps, an error in the sequence deduced from cDNA, resulting in a mass difference of 28 Da, was located to a sequence stretch of 5 amino acid residues; furthermore, a dihexose substituent was identified on Thr410. Subsequent Edman...

  6. A case study on the genetic origin of the high oleic acid trait through FAD2-1 DNA sequence variation in safflower (Carthamus tinctorius L.)

    Directory of Open Access Journals (Sweden)

    Sara Rapson

    2015-09-01

    The safflower (Carthamus tinctorius L.) is considered a strongly domesticated species with a long history of cultivation. The hybridization of safflower with its wild relatives has played an important role in the evolution of cultivars and is of particular interest with regards to their production of high quality edible oils. Original safflower varieties were all rich in linoleic acid, while varieties rich in oleic acid have risen to prominence in recent decades. The high oleic acid trait is controlled by a partially recessive allele ol at a single locus OL. The ol allele was found to be a defective microsomal oleate desaturase FAD2-1. Here we present DNA sequence data and DNA Southern blot analysis suggesting that there has been an ancient hybridization and introgression of the FAD2-1 gene into C. tinctorius from its wild relative C. palaestinus. It is from this gene that FAD2-1Δ was derived more recently. Identification and characterization of the genetic origin and diversity of FAD2-1 could aid safflower breeders in reducing population size and generations required for the development of new high oleic acid varieties by using perfect molecular marker-assisted selection.

  7. Identification of microRNAs actively involved in fatty acid biosynthesis in developing Brassica napus seeds using high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Jia Wang

    2016-10-01

    Seed development has a critical role during the spermatophyte life cycle. In Brassica napus, a major oil crop, fatty acids are synthesized and stored in specific tissues during embryogenesis, and understanding the molecular mechanism underlying fatty acid biosynthesis during seed development is an important research goal. In this study, we constructed three small RNA libraries from early seeds at 14, 21 and 28 days after flowering (DAF) and used high-throughput sequencing to examine microRNA (miRNA) expression. A total of 85 known miRNAs from 30 families and 1,160 novel miRNAs were identified, of which 24, including 5 known and 19 novel miRNAs, were found to be involved in fatty acid biosynthesis. bna-miR156b, bna-miR156c, bna-miR156g, novel_mir_1706, novel_mir_1407, novel_mir_173, and novel_mir_104 were significantly down-regulated at 21 DAF and 28 DAF, whereas bna-miR159, novel_mir_1081, novel_mir_19 and novel_mir_555 were significantly up-regulated. In addition, we found that some miRNAs regulate functional genes that are directly involved in fatty acid biosynthesis and that other miRNAs regulate the process of fatty acid biosynthesis by acting on a large number of transcription factors. The miRNAs and their corresponding predicted targets were partially validated by quantitative RT-PCR. Our data suggest that diverse and complex miRNAs are involved in the seed development process and that miRNAs play important roles in fatty acid biosynthesis during seed development.

  8. Research on Nucleic Acid Sequence Design Methods for DNA Computing

    Institute of Scientific and Technical Information of China (English)

    张凯; 耿修堂; 肖建华; 赵东明

    2008-01-01

    DNA computing maps the instances of a real-world problem onto specific nucleic acid molecules and protocols so that the result contains the answers to the problem's instances and these can be successfully extracted. Good DNA sequences prevent unwanted hybridization errors during the computation and make the answers easy to retrieve in the extraction phase. The paper first introduces two typical nucleic acid sequence design methods, the free-energy-based method and the Hamming-distance-based method, and analyzes the influence of their constraints on the quality of DNA sequences. It then compares the integrity and computational cost of the two methods in excluding non-specific hybridization sequences. Finally, it analyzes the efficiency of the two design methods. The comparison shows that the Hamming-distance-based method can only approximately estimate the thermodynamic stability of DNA hybridization and cannot be a perfect substitute for the minimum free energy method. Further analysis shows that when only the free energy of Watson-Crick base pairs is considered, the free energy calculation is equivalent to the Hamming distance method. So, if its precision satisfies the requirements of the experiment, the Hamming-distance-based DNA sequence design method can not only effectively distinguish specific from non-specific hybridization but also reduce the amount of calculation and improve the efficiency of DNA sequence design.
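    A Hamming-distance constraint of the kind discussed in this abstract can be sketched in a few lines. The sketch assumes equal-length sequences over the alphabet A, C, G, T and checks a candidate against both each pooled sequence and its reverse complement; the function names, the threshold parameter d_min and the exact pair of checks are illustrative assumptions rather than the constraints used in the paper.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def passes_hamming_constraint(candidate: str, pool: list, d_min: int) -> bool:
    """Accept `candidate` only if it is far enough from every pooled sequence.

    Two checks per pooled sequence s (both are assumptions of this sketch):
      * hamming(candidate, s) >= d_min, so the two codes stay distinguishable;
      * hamming(candidate, reverse_complement(s)) >= d_min, so candidate is
        unlikely to hybridize with s itself.
    """
    for s in pool:
        if hamming(candidate, s) < d_min:
            return False
        if hamming(candidate, reverse_complement(s)) < d_min:
            return False
    return True


if __name__ == "__main__":
    print(passes_hamming_constraint("ACAC", ["AAAA"], 2))
    print(passes_hamming_constraint("AAAA", ["TTTT"], 2))
```

    As the abstract notes, such a count of mismatches only approximates hybridization stability; a free-energy model would weight each base pair and its neighbours differently.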

  9. Combination of Whole Genome Sequencing, Linkage and Functional Studies Implicates a Missense Mutation in Titin as a Cause of Autosomal Dominant Cardiomyopathy with Features of Left Ventricular Non-Compaction

    Science.gov (United States)

    Hooper, Charlotte; Ormondroyd, Liz; Pagnamenta, Alistair; Lise, Stefano; Salatino, Silvia; Knight, Samantha JL; Taylor, Jenny C.; Thomson, Kate L.; Arnold, Linda; Chatziefthimiou, Spyros D.; Konarev, Petr V.; Wilmanns, Matthias; Ehler, Elisabeth; Ghisleni, Andrea; Gautel, Mathias; Blair, Edward; Watkins, Hugh; Gehmlich, Katja

    2016-01-01

    Background: High throughput next generation sequencing techniques have made whole genome sequencing accessible in clinical practice; however, the abundance of variation in the human genome makes the identification of a disease-causing mutation on a background of benign rare variants challenging. Methods and Results: Here we combine whole genome sequencing with linkage analysis in a three-generation family affected by cardiomyopathy with features of autosomal dominant left-ventricular non-compaction cardiomyopathy. A missense mutation in the giant protein titin is the only plausible disease-causing variant that segregates with disease amongst the eight surviving affected individuals, with interrogation of the entire genome excluding other potential causes. This A178D missense mutation, affecting a conserved residue in the second immunoglobulin-like domain of titin, was introduced in a bacterially expressed recombinant protein fragment and biophysically characterised in comparison to its wild-type counterpart. Multiple experiments, including size exclusion chromatography, small angle X-ray scattering and circular dichroism spectroscopy, suggest partial unfolding and domain destabilisation in the presence of the mutation. Moreover, binding experiments in mammalian cells show that the mutation markedly impairs binding to the titin ligand telethonin. Conclusions: Here we present genetic and functional evidence implicating the novel A178D missense mutation in titin as the cause of a highly penetrant familial cardiomyopathy with features of left-ventricular non-compaction. This expands the spectrum of titin’s roles in cardiomyopathies. It furthermore highlights that rare titin missense variants, currently often ignored or left un-interpreted, should be considered to be relevant for cardiomyopathies and can be identified by the approach presented here. PMID:27625337

  10. Isolation, amino acid sequence and biological activities of novel long-chain polyamine-associated peptide toxins from the sponge Axinyssa aculeata.

    Science.gov (United States)

    Matsunaga, Satoko; Jimbo, Mitsuru; Gill, Martin B; Wyhe, L Leanne Lash-Van; Murata, Michio; Nonomura, Ken'ichi; Swanson, Geoffrey T; Sakai, Ryuichi

    2011-09-19

    A novel family of functionalized peptide toxins, aculeines (ACUs), was isolated from the marine sponge Axinyssa aculeata. ACUs are polypeptides with N-terminal residues that are modified by the addition of long-chain polyamines (LCPA). Aculeines were present in the sponge extract as a complex mixture with differing polyamine chain lengths and peptide structures. ACU-A and B, which were purified in this study, share a common polypeptide chain but differ in their N-terminal residue modifications. The amino acid sequence of the polypeptide portion of ACU-A and B was deduced from 3' and 5' RACE, and supported by Edman degradation and mass spectral analysis of peptide fragments. ACU induced convulsions upon intracerebroventricular (i.c.v.) injection in mice, and disrupted neuronal membrane integrity in electrophysiological assays. ACU also lysed erythrocytes with a potency that differed between animal species. Here we describe the isolation, amino acid sequence, and biological activity of this new group of cytotoxic sponge peptides.

  11. Whole-Exome Sequencing in a South American Cohort Links ALDH1A3, FOXN1 and Retinoic Acid Regulation Pathways to Autism Spectrum Disorders.

    Science.gov (United States)

    Moreno-Ramos, Oscar A; Olivares, Ana María; Haider, Neena B; de Autismo, Liga Colombiana; Lattig, María Claudia

    2015-01-01

    Autism spectrum disorders (ASDs) are a range of complex neurodevelopmental conditions principally characterized by dysfunctions linked to mental development. Previous studies have shown that there are more than 1000 genes likely involved in ASD, expressed mainly in brain and highly interconnected among them. We applied whole exome sequencing in Colombian-South American trios. Two missense novel SNVs were found in the same child: ALDH1A3 (RefSeq NM_000693: c.1514T>C (p.I505T)) and FOXN1 (RefSeq NM_003593: c.146C>T (p.S49L)). Gene expression studies reveal that Aldh1a3 and Foxn1 are expressed in ~E13.5 mouse embryonic brain, as well as in adult piriform cortex (PC; ~P30). Conserved Retinoic Acid Response Elements (RAREs) upstream of human ALDH1A3 and FOXN1 and in mouse Aldh1a3 and Foxn1 genes were revealed using bioinformatic approximation. Chromatin immunoprecipitation (ChIP) assay using Retinoid Acid Receptor B (Rarb) as the immunoprecipitation target suggests RA regulation of Aldh1a3 and Foxn1 in mice. Our results frame a possible link of RA regulation in brain to ASD etiology, and a feasible non-additive effect of two apparently unrelated variants in ALDH1A3 and FOXN1 recognizing that every result given by next generation sequencing should be cautiously analyzed, as it might be an incidental finding.

  12. Distinct features of lamin A-interacting chromatin domains mapped by ChIP-sequencing from sonicated or micrococcal nuclease-digested chromatin.

    Science.gov (United States)

    Lund, Eivind G; Duband-Goulet, Isabelle; Oldenburg, Anja; Buendia, Brigitte; Collas, Philippe

    2015-01-01

    The nuclear lamina has been shown to interact with the genome through lamina-associated domains (LADs). LADs have been identified by DamID, a proximity labeling assay, and more recently by chromatin immunoprecipitation-sequencing (ChIP-seq) of A- and B-type lamins. LADs form megabase-size domains at the nuclear periphery, they are gene-poor and mostly heterochromatic. Here, we show that the mode of chromatin fragmentation for ChIP, namely bath sonication or digestion with micrococcal nuclease (MNase), leads to the discovery of common but also distinct sets of lamin-interacting domains, or LiDs. Using ChIP-seq, we show the existence of lamin A/C (LMNA) LiDs with distinct gene contents, histone composition enrichment and relationships to lamin B1-interacting domains. The extent of genome coverage of lamin A/C (LMNA) LiDs in sonicated or MNase-digested chromatin is similar (∼730 megabases); however over half of these domains are uniquely detected in sonicated or MNase-digested chromatin. Sonication-specific LMNA LiDs are gene-poor and devoid of a broad panel of histone modifications, while MNase-specific LMNA LiDs are of higher gene density and are enriched in H3K9me3, H3K27me3 and in histone variant H2A.Z. LMNB1 LiDs are gene-poor and show no or little enrichment in these marks. Comparison of published LMNB1 DamID LADs with LMNB1 and LMNA LiDs identified here by ChIP-seq further shows that LMNA can associate with 'open' chromatin domains displaying euchromatin characteristics, and which are not associated with LMNB1. The differential genomic and epigenetic properties of lamin-interacting domains reflect the existence of distinct LiD populations identifiable in different chromatin contexts, including nuclease-accessible regions presumably localized in the nuclear interior.

  13. Amino acid sequences recognized by T cells: studies on a merozoite surface antigen from the FCQ-27/PNG isolate of Plasmodium falciparum.

    Science.gov (United States)

    Rzepczyk, C M; Csurhes, P A; Baxter, E P; Doran, T J; Irving, D O; Kere, N

    1990-08-01

    Twenty-six overlapping peptides, spanning the entire FCQ-27/PNG sequence of the Plasmodium falciparum antigen known as merozoite surface antigen 2 were screened for their ability to induce the proliferation of peripheral blood lymphocytes (PBL) obtained from 12 donors living in Honiara, Solomon Islands where P. falciparum is endemic. A recombinant (r) form of MSA2, known as Ag 1609 was also screened in these assays and tetanus toxoid (TT) antigen was included as a control. The location of the predicted T cell determinants within MSA2 was examined using the algorithm, AMPHI and by scanning MSA2 for amino acid sequences showing the Rothbard motif. There were 13 predicted amphipathic helical sites and five examples of Rothbard sequences in the antigen. The location of these with regard to the peptides tested is shown. Nine of the 12 individuals responded to TT with high stimulation indices (greater than 4) being obtained in the majority of donors. Only three individuals responded to r-MSA2 with the stimulation indices (SI) in the range of 2.4-4.1. Peptides from both the constant and variable regions of MSA2 were recognized in the proliferative assays. However, the majority of the positive proliferative responses were to peptides which spanned the central variable region which included the two copies of the 32-amino-acid repeat occurring in the antigen. High SI comparable to those obtained to TT were seen in some individuals with some peptides. There was considerable variation between donors in number and nature of the peptides recognised and two donors did not respond to any of the antigens tested. The significance of these findings to vaccine development is discussed.

  14. DNA sequences encoding erythropoietin

    Energy Technology Data Exchange (ETDEWEB)

    Lin, F.K.

    1987-10-27

    A purified and isolated DNA sequence is described consisting essentially of a DNA sequence encoding a polypeptide having an amino acid sequence sufficiently duplicative of that of erythropoietin to allow possession of the biological property of causing bone marrow cells to increase production of reticulocytes and red blood cells, and to increase hemoglobin synthesis or iron uptake.

  15. Study on Application Features of Trichloro-isocyanuric Acid

    Institute of Scientific and Technical Information of China (English)

    王洪英; 魏新; 郦和生; 王岽

    2012-01-01

    Trichloro-isocyanuric acid (TCCA) generates hypochlorous acid and cyanuric acid when it hydrolyzes in water. The presence of cyanuric acid makes the hypochlorous acid more stable in water, so TCCA shows some features that differ from those of other chlorine-based bactericides. In this paper, the application features of TCCA are investigated. Laboratory static tests show that the mass concentration of residual chlorine first increases and then decreases as TCCA dissolves in water, reaching its maximum approximately when dissolution is complete. The higher the initial mass concentration of TCCA, the faster the residual chlorine decays. Organic substances in sewage consume hypochlorous acid, which reduces the residual chlorine. Under different water quality conditions, the same dosed mass concentration of TCCA may give different initial residual chlorine concentrations and hence different sterilization effects. TCCA causes almost no corrosion of stainless steel and corrodes brass more strongly than carbon steel. The study can provide technical support for the application of TCCA in circulating cooling water systems.

  16. Genotyping-by-Sequencing-Based Investigation of the Genetic Architecture Responsible for a ∼Sevenfold Increase in Soybean Seed Stearic Acid.

    Science.gov (United States)

    Heim, Crystal B; Gillman, Jason D

    2017-01-05

    Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats are conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1) to reduce or eliminate the need for chemical hydrogenation, and (2) to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines, one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene resulting in ∼11% seed stearic acid crossed to another mutant, A6, which has 24-28% seed stearic acid. Genotyping-by-sequencing (GBS)-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for largest effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s); loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability.

  17. Genotyping-by-Sequencing-Based Investigation of the Genetic Architecture Responsible for a ∼Sevenfold Increase in Soybean Seed Stearic Acid

    Directory of Open Access Journals (Sweden)

    Crystal B. Heim

    2017-01-01

    Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats are conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1) to reduce or eliminate the need for chemical hydrogenation, and (2) to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines, one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene resulting in ∼11% seed stearic acid crossed to another mutant, A6, which has 24–28% seed stearic acid. Genotyping-by-sequencing (GBS)-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for largest effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s); loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability.

  18. Solid phase sequencing of biopolymers

    Energy Technology Data Exchange (ETDEWEB)

    Cantor, Charles (Del Mar, CA); Koster, Hubert (La Jolla, CA)

    2010-09-28

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  19. Prediction of protein secondary structure using probability based features and a hybrid system.

    Science.gov (United States)

    Ghanty, Pradip; Pal, Nikhil R; Mudi, Rajani K

    2013-10-01

    In this paper, we propose some co-occurrence probability-based features for prediction of protein secondary structure. The features are extracted using occurrence/nonoccurrence of secondary structures in the protein sequences. We explore two types of features: position-specific (based on position of amino acid on fragments of protein sequences) as well as position-independent (independent of amino acid position on fragments of protein sequences). We use a hybrid system, NEUROSVM, consisting of neural networks and support vector machines for classification of secondary structures. We propose two schemes NSVMps and NSVM for protein secondary structure prediction. The NSVMps uses position-specific probability-based features and NEUROSVM classifier whereas NSVM uses the same classifier with position-independent probability-based features. The proposed method falls in the single-sequence category of methods because it does not use any sequence profile information such as position specific scoring matrices (PSSM) derived from PSI-BLAST. Two widely used datasets RS126 and CB513 are used in the experiments. The results obtained using the proposed features and NEUROSVM classifier are better than most of the existing single-sequence prediction methods. Most importantly, the results using NSVMps that are obtained using lower dimensional features, are comparable to those by other existing methods. The NSVMps and NSVM are finally tested on target proteins of the critical assessment of protein structure prediction experiment-9 (CASP9). A larger dataset is used to compare the performance of the proposed methods with that of two recent single-sequence prediction methods. We also investigate the impact of presence of different amino acid residues (in protein sequences) that are responsible for the formation of different secondary structures.

  20. Sequence-selective recognition of double-stranded RNA and enhanced cellular uptake of cationic nucleobase and backbone-modified peptide nucleic acids.

    Science.gov (United States)

    Hnedzko, Dziyana; McGee, Dennis W; Karamitas, Yannis A; Rozners, Eriks

    2017-01-01

    Sequence-selective recognition of complex RNAs in live cells could find broad applications in biology, biomedical research, and biotechnology. However, specific recognition of structured RNA is challenging, and generally applicable and effective methods are lacking. Recently, we found that peptide nucleic acids (PNAs) were unusually well-suited ligands for recognition of double-stranded RNAs. Herein, we report that 2-aminopyridine (M) modified PNAs and their conjugates with lysine and arginine tripeptides form strong (Ka = 9.4 to 17 × 10^7 M^-1) and sequence-selective triple helices with RNA hairpins at physiological pH and salt concentration. The affinity of PNA-peptide conjugates for the matched RNA hairpins was unusually high compared to the much lower affinity for DNA hairpins of the same sequence (Ka = 0.05 to 1.1 × 10^7 M^-1). The binding of double-stranded RNA by M-modified PNA-peptide conjugates was a relatively fast process (kon = 2.9 × 10^4 M^-1 s^-1) compared to the notoriously slow triple helix formation by oligodeoxynucleotides (kon ∼ 10^3 M^-1 s^-1). M-modified PNA-peptide conjugates were not cytotoxic and were efficiently delivered in the cytosol of HEK293 cells at 10 µM. Surprisingly, M-modified PNAs without peptide conjugation were also taken up by HEK293 cells, which, to the best of our knowledge, is the first example of heterocyclic base modification that enhances the cellular uptake of PNA. Our results suggest that M-modified PNA-peptide conjugates are promising probes for sequence-selective recognition of double-stranded RNA in live cells and other biological systems.

  1. Amino acid sequence and crystal structure of BaP1, a metalloproteinase from Bothrops asper snake venom that exerts multiple tissue-damaging activities

    Science.gov (United States)

    Watanabe, Leandra; Shannon, John D.; Valente, Richard H.; Rucavado, Alexandra; Alape-Girón, Alberto; Kamiguti, Aura S.; Theakston, R. David G.; Fox, Jay W.; Gutiérrez, José María; Arni, Raghuvir K.

    2003-01-01

    BaP1 is a 22.7-kD P-I-type zinc-dependent metalloproteinase isolated from the venom of the snake Bothrops asper, a medically relevant species in Central America. This enzyme exerts multiple tissue-damaging activities, including hemorrhage, myonecrosis, dermonecrosis, blistering, and edema. BaP1 is a single chain of 202 amino acids that shows highest sequence identity with metalloproteinases isolated from the venoms of snakes of the subfamily Crotalinae. It has six Cys residues involved in three disulfide bridges (Cys 117–Cys 197, Cys 159–Cys 181, Cys 157–Cys 164). It has the consensus sequence H142E143XXH146XXGXXH152, as well as the sequence C164I165M166, which characterize the “metzincin” superfamily of metalloproteinases. The active-site cleft separates a major subdomain (residues 1–152), comprising four α-helices and a five-stranded β-sheet, from the minor subdomain, which is formed by a single α-helix and several loops. The catalytic zinc ion is coordinated by the Nɛ2 nitrogen atoms of His 142, His 146, and His 152, in addition to a solvent water molecule, which in turn is bound to Glu 143. Several conserved residues contribute to the formation of the hydrophobic pocket, and Met 166 serves as a hydrophobic base for the active-site groups. Sequence and structural comparisons of hemorrhagic and nonhemorrhagic P-I metalloproteinases from snake venoms revealed differences in several regions. In particular, the loop comprising residues 153 to 176 has marked structural differences between metalloproteinases with very different hemorrhagic activities. Because this region lies in close proximity to the active-site microenvironment, it may influence the interaction of these enzymes with physiologically relevant substrates in the extracellular matrix. PMID:14500885
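
    The zinc-binding consensus quoted in this abstract, HEXXHXXGXXH (X being any residue), can be located in a one-letter protein sequence with a simple pattern search. The following is a minimal sketch, not the authors' method: the function name `find_zinc_motifs` and the toy sequence are hypothetical, and the toy string is not the BaP1 sequence.

    ```python
    import re

    # HEXXHXXGXXH, with "." standing for any residue (X).
    # The lookahead wrapper allows overlapping occurrences to be reported.
    ZINC_MOTIF = re.compile(r"(?=(HE..H..G..H))")

    def find_zinc_motifs(sequence):
        """Return 1-based start positions of every HEXXHXXGXXH occurrence."""
        return [m.start() + 1 for m in ZINC_MOTIF.finditer(sequence.upper())]

    # Hypothetical toy sequence used only for illustration.
    toy_sequence = "MKTAHELGHNLGMEHDAAA"
    positions = find_zinc_motifs(toy_sequence)
    print(positions)
    ```

    Positions are reported 1-based so that they read the same way as residue numbers in the abstract (for example His 142).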

  2. Cloning of the BPI gene from the Qianshao spotted pig and analysis of its amino acid sequence

    Institute of Scientific and Technical Information of China (English)

    魏麟; 陈斌; 张善文; 宋伸; 刘鹏

    2011-01-01

    The porcine bactericidal/permeability-increasing protein (BPI) gene was cloned from total RNA of porcine blood by reverse transcription-polymerase chain reaction (RT-PCR). Sequence analysis showed that the cloned porcine BPI cDNA was 1,874 bp in length (GenBank accession No. FJ810853) and contained a 1,452 bp open reading frame encoding 483 amino acid residues, of which 13.25% were leucine, including a signal peptide of 27 amino acids. Homology analysis showed that the porcine BPI amino acid sequence shared 64%, 74%, 59%, 67%, 53%, 51%, 35%, 44%, 28% and 27% homology with those of human, cattle, rabbit, dog, rat, mouse, carp, Xenopus laevis, Atlantic salmon and large yellow croaker, respectively. The amino-terminal and carboxyl-terminal portions formed two distinct functional domains, each containing an ultra-active domain, with a trypsin hydrolysis site between them, resembling the structural features of human BPI.

  3. Sequencing adsorption and degradation cycles towards the methanization of long chain fatty acids : comparison between granular and suspended sludge

    OpenAIRE

    Pereira, M A; Pires, O. C.; Mota, M.; Alves, M.M.

    2001-01-01

    The aim of the present work was to study the maximum potential methane production in batch assays of sludge samples taken along the operation of two EGSB reactors (RI inoculated with granular sludge and RII inoculated with suspended sludge) fed with increasing oleic acid concentrations between 2 and 8 gCOD/l (HRT=1 day). After removing the residual substrate, the sludge was incubated in batch vials without any added carbon source. A maximum methane production rate of 152±21 mlCH4(...

  4. Features of the Thermodynamics of Trivalent Lanthanide/Actinide Distribution Reactions by Tri-n-Octylphosphine Oxide and Bis(2-EthylHexyl) Phosphoric Acid

    Energy Technology Data Exchange (ETDEWEB)

    Travis S. Grimes; Peter R. Zalupski

    2014-11-01

    A new methodology has been developed to study the thermochemical features of the biphasic transfer reactions of trisnitrato complexes of lanthanides and americium by a mono-functional solvating ligand (tri-n-octyl phosphine oxide, TOPO). Stability constants for successive nitrato complexes (M(NO3)x^(3-x)(aq), where M is Eu3+, Am3+ or Cm3+) were determined to assist in the calculation of the extraction constant, Kex, for the metal ions under study. Enthalpies of extraction (ΔHextr) for the lanthanide series (excluding Pm3+) and Am3+ by TOPO have been measured using isothermal titration calorimetry. The observed ΔHextr were found to be constant at ~29 kJ mol^-1 across the series from La3+ to Er3+, with a slight decrease observed from Tm3+ to Lu3+. These heats were found to be consistent with enthalpies determined using van ’t Hoff analysis of temperature dependent extraction studies. A complete set of thermodynamic parameters (ΔG, ΔH, ΔS) was calculated for Eu(NO3)3, Am(NO3)3 and Cm(NO3)3 extraction by TOPO and Am3+ and Cm3+ extraction by bis(2-ethylhexyl) phosphoric acid (HDEHP). A discussion comparing the energetics of these systems is offered. The measured biphasic extraction heats for the transplutonium elements, ΔHextr, presented in these studies are the first ever direct measurements offered using two-phase calorimetric techniques.
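
    The van ’t Hoff analysis mentioned in this abstract rests on the relation ln K = -ΔH/(RT) + ΔS/R, so a straight-line fit of ln K against 1/T gives ΔH from the slope and ΔS from the intercept. The sketch below illustrates that fit under stated assumptions: the function name `vant_hoff_fit` is hypothetical, ΔH and ΔS are taken as temperature-independent, and the equilibrium constants are synthetic values generated from made-up parameters, not data from the study.

    ```python
    import math

    R = 8.314462618  # gas constant, J mol^-1 K^-1

    def vant_hoff_fit(temps_kelvin, equilibrium_constants):
        """Least-squares fit of ln K vs 1/T; returns (delta_H in J/mol, delta_S in J/mol/K)."""
        xs = [1.0 / t for t in temps_kelvin]
        ys = [math.log(k) for k in equilibrium_constants]
        n = len(xs)
        mean_x = sum(xs) / n
        mean_y = sum(ys) / n
        sxx = sum((x - mean_x) ** 2 for x in xs)
        sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        slope = sxy / sxx                  # equals -delta_H / R
        intercept = mean_y - slope * mean_x  # equals delta_S / R
        return -slope * R, intercept * R

    # Synthetic example: hypothetical parameters, chosen only to exercise the fit.
    true_dH = -20000.0  # J/mol (assumed)
    true_dS = -30.0     # J/mol/K (assumed)
    temps = [288.15, 298.15, 308.15, 318.15]
    ks = [math.exp(-true_dH / (R * t) + true_dS / R) for t in temps]

    fit_dH, fit_dS = vant_hoff_fit(temps, ks)
    print(fit_dH, fit_dS)
    ```

    With real extraction data the points scatter around the line, and a calorimetric ΔH such as the one reported here provides an independent check on the fitted slope.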

  5. Data on human neutrophil activation induced by pepducins with amino acid sequences derived from β2AR and CXCR4

    Directory of Open Access Journals (Sweden)

    André Holdfeldt

    2016-09-01

    The data described here are related to the research article (Gabl et al., 2016 [1]). Pepducins with a peptide sequence derived from one of the intracellular domains of a given G-protein coupled receptor (GPCR) can either activate or inhibit cell functions. Here we include data on human neutrophil function induced by pepducins derived from β2AR (ICL3-8) and CXCR4 (ATI-2341), respectively. ICL3-8 exerts neither a direct activating effect on the NADPH-oxidase, as measured by superoxide release, nor an inhibitory effect on FPR signaling. ATI-2341 dose-dependently triggers neutrophil activation, and these cells were subsequently desensitized in their response to the FPR2-specific agonists F2Pal10 and WKYMVM. Moreover, the ATI-2341 response is inhibited by PBP10 and the peptidomimetic Pam-(Lys-betaNSpe)6-NH2 (both are FPR2-specific inhibitors), but not by the FPR1-specific inhibitor cyclosporine H.

  6. Characterization of relative abundance of lactic acid bacteria species in French organic sourdough by cultural, qPCR and MiSeq high-throughput sequencing methods.

    Science.gov (United States)

    Michel, Elisa; Monfort, Clarisse; Deffrasnes, Marion; Guezenec, Stéphane; Lhomme, Emilie; Barret, Matthieu; Sicard, Delphine; Dousset, Xavier; Onno, Bernard

    2016-12-19

    In order to contribute to the description of sourdough LAB composition, MiSeq sequencing and qPCR methods were performed in association with cultural methods. A panel of 16 French organic bakers and farmer-bakers was selected for this work. The lactic acid bacteria (LAB) diversity of their organic sourdoughs was investigated quantitatively and qualitatively by combining (i) Lactobacillus sanfranciscensis-specific qPCR, (ii) global sequencing with MiSeq Illumina technology and (iii) molecular identification of isolates. In addition, LAB and yeast enumeration, pH, Total Titratable Acidity, organic acids and bread specific volume were analyzed. Microbial and physico-chemical data were statistically treated by Principal Component Analysis (PCA) and Hierarchical Ascendant Classification (HAC). Total yeast counts were 6 log10 to 7.6 log10 CFU/g, while LAB counts varied from 7.2 log10 to 9.6 log10 CFU/g. Values obtained by L. sanfranciscensis-specific qPCR were estimated between 7.2 and 10.3 log10 CFU/g, except for one sample at 4.4 log10 CFU/g. HAC and PCA clustered the sixteen sourdoughs into three classes described by their variables but without links to bakers' practices. L. sanfranciscensis was the dominant species in 13 of the 16 sourdoughs analyzed by Next Generation Sequencing (NGS), whereas by the culture-dependent method this species was dominant in only 10 samples. Based on isolate identification, LAB diversity was higher for 7 sourdoughs, with the recovery of L. curvatus, L. brevis, L. heilongjiangensis, L. xiangfangensis, L. koreensis, L. pontis, Weissella sp. and Pediococcus pentosaceus as the most representative species. L. koreensis, L. heilongjiangensis and L. xiangfangensis had been identified in traditional Asian food and are reported here for the first time as dominant in organic sourdough. This study highlighted that L. sanfranciscensis was not the major species in 6/16 sourdough samples and that a relatively high LAB diversity can be observed in French organic sourdough.

  7. Mutation of the aspartic acid residues of the GDD sequence motif of poliovirus RNA-dependent RNA polymerase results in enzymes with altered metal ion requirements for activity.

    Science.gov (United States)

    Jablonski, S A; Morrow, C D

    1995-01-01

    The poliovirus RNA-dependent RNA polymerase, 3Dpol, is known to share a region of sequence homology with all RNA polymerases centered at the GDD amino acid motif. The two aspartic acids have been postulated to be involved in the catalytic activity and metal ion coordination of the enzyme. To test this hypothesis, we have utilized oligonucleotide site-directed mutagenesis to generate defined mutations in the aspartic acids of the GDD motif of the 3Dpol gene. The codon for the first aspartate (3D-D-328 [D refers to the single amino acid change, and the number refers to its position in the polymerase]) was changed to that for glutamic acid, histidine, asparagine, or glutamine; the codons for both aspartic acids were simultaneously changed to those for glutamic acids; and the codon for the second aspartic acid (3D-D-329) was changed to that for glutamic acid or asparagine. The mutant enzymes were expressed in Escherichia coli, and the in vitro poly(U) polymerase activity was characterized. All of the mutant 3Dpol enzymes were enzymatically inactive in vitro when tested over a range of Mg2+ concentrations. However, when Mn2+ was substituted for Mg2+ in the in vitro assays, the mutant that substituted the second aspartic acid for asparagine (3D-N-329) was active. To further substantiate this finding, a series of different transition metal ions were substituted for Mg2+ in the poly(U) polymerase assay. The wild-type enzyme was active with all metals except Ca2+, while the 3D-N-329 mutant was active only when FeC6H7O5 was used in the reaction. To determine the effects of the mutations on poliovirus replication, the mutant 3Dpol genes were subcloned into an infectious cDNA of poliovirus. The cDNAs containing the mutant 3Dpol genes did not produce infectious virus when transfected into tissue culture cells under standard conditions. Because of the activity of the 3D-N-329 mutant in the presence of Fe2+ and Mn2+, transfections were also performed in the presence of the

  8. Permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, a thermoacidophilic sulfur-reducing crenarchaeon isolated from acidic hot springs of Hveravellir, Iceland.

    Science.gov (United States)

    Susanti, Dwi; Johnson, Eric F; Lapidus, Alla; Han, James; Reddy, T B K; Pilay, Manoj; Ivanova, Natalia N; Markowitz, Victor M; Woyke, Tanja; Kyrpides, Nikos C; Mukhopadhyay, Biswarup

    2016-01-01

    This report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic reconstruction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. The existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome, though this archaeon does not utilize carbohydrates for growth. The draft genome of D. mobilis provides an additional means for comparative genomic analysis of desulfurococci. In addition, our analysis of the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.

  9. Genome sequence of the copper resistant and acid-tolerant Desulfosporosinus sp. BG isolated from the tailings of a molybdenum-tungsten mine in the Transbaikal area

    Directory of Open Access Journals (Sweden)

    Olga V. Karnachuk

    2017-03-01

    Here, we report on the draft genome of a copper-resistant and acidophilic Desulfosporosinus sp. BG, isolated from the tailings of a molybdenum-tungsten mine in the Transbaikal area. The draft genome has a size of 4.52 Mb and encodes transporters of heavy metals. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain BG clusters together with the other acidophilic copper-resistant strains Desulfosporosinus sp. OT and Desulfosporosinus sp. I2. The K+-ATPase, Na+/H+ antiporter and amino acid decarboxylases may participate in enabling growth at low pH. The draft genome sequence and annotation have been deposited at GenBank under the accession number NZ_MASS00000000.

  10. De novo sequencing and transcriptome analysis of Pinellia ternata identify the candidate genes involved in the biosynthesis of benzoic acid and ephedrine

    Directory of Open Access Journals (Sweden)

    Zhang Guang Hui

    2016-08-01

    Background: The medicinal herb Pinellia ternata is purported to be an anti-emetic with analgesic and sedative effects. Alkaloids are the main biologically active compounds in P. ternata, especially ephedrine, a phenylpropylamino alkaloid specifically produced by Ephedra and Catha edulis. However, how ephedrine is synthesized in plants is uncertain. Only the phenylalanine ammonia lyase (PAL) and relevant genes in this pathway have been characterized. Genomic information on P. ternata is also unavailable. Results: We analyzed the transcriptome of the tuber of P. ternata with the Illumina HiSeq™ 2000 sequencing platform. A total of 66,813,052 high-quality reads were generated, and these reads were assembled de novo into 89,068 unigenes. Most known genes involved in benzoic acid biosynthesis were identified in the unigene dataset of P. ternata, and the expression patterns of some ephedrine biosynthesis-related genes were analyzed by reverse transcription quantitative real-time PCR (RT-qPCR). Also, 14,468 simple sequence repeats (SSRs) were identified from 12,000 unigenes. Twenty primer pairs for SSRs were randomly selected to validate their amplification. Conclusion: RNA-seq data were used for the first time to provide comprehensive gene information on P. ternata at the transcriptional level. These data will advance molecular genetics in this valuable medicinal plant.
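
    Simple sequence repeats of the kind counted in this abstract are tandem copies of a short nucleotide unit, and a basic scan for them can be written as a back-referencing regular expression. The sketch below is only an illustration: the abstract does not name the tool or thresholds used, so the function name `find_ssrs`, the unit lengths (2 to 6 nt) and the minimum of four copies are assumptions.

    ```python
    import re

    def find_ssrs(sequence, min_unit=2, max_unit=6, min_repeats=4):
        """Return (1-based start, repeat unit, copy number) for each tandem repeat found.

        Thresholds are illustrative assumptions, not the study's settings.
        """
        # Group 1 captures the shortest unit that repeats; \1{k,} demands the extra copies.
        pattern = re.compile(
            r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
        )
        hits = []
        for match in pattern.finditer(sequence.upper()):
            unit = match.group(1)
            copies = len(match.group(0)) // len(unit)
            hits.append((match.start() + 1, unit, copies))
        return hits

    # Hypothetical unigene fragment containing an (AT)6 repeat.
    example = "GGC" + "AT" * 6 + "CCG"
    print(find_ssrs(example))
    ```

    One simplification to keep in mind: a homopolymer run such as AAAAAAAA is reported here as a repeat of the unit AA, because mononucleotide units are excluded by `min_unit=2`.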

  11. High genetic diversity among strains of the unindustrialized lactic acid bacterium Carnobacterium maltaromaticum in dairy products as revealed by multilocus sequence typing.

    Science.gov (United States)

    Rahman, Abdur; Cailliez-Grimal, Catherine; Bontemps, Cyril; Payot, Sophie; Chaillou, Stéphane; Revol-Junelles, Anne-Marie; Borges, Frédéric

    2014-07-01

    Dairy products are colonized by three main classes of lactic acid bacteria (LAB): opportunistic bacteria, traditional starters, and industrial starters. Most population structure studies were previously performed with LAB species belonging to these three classes and give interesting knowledge about the population structure of LAB at the stage where they are already industrialized. However, these studies give little information about the population structure of LAB prior to their use as an industrial starter. Carnobacterium maltaromaticum is a LAB colonizing diverse environments, including dairy products. Since this bacterium was discovered relatively recently, it is not yet commercialized as an industrial starter, which makes C. maltaromaticum an interesting model for the study of unindustrialized LAB population structure in dairy products. A multilocus sequence typing scheme based on an analysis of fragments of the genes dapE, ddlA, glpQ, ilvE, pyc, pyrE, and leuS was applied to a collection of 47 strains, including 28 strains isolated from dairy products. The scheme detected 36 sequence types with a discriminatory index of 0.98. The whole population was clustered in four deeply branched lineages, in which the dairy strains were spread. Moreover, the dairy strains exhibited a high diversity within these lineages, leading to an overall dairy population with a diversity level as high as that of the nondairy population. These results are in agreement with the hypothesis according to which the industrialization of LAB leads to a diversity reduction in dairy products.
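
    A discriminatory index for a typing scheme is commonly computed as Simpson's index of diversity in the form used by Hunter and Gaston, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of strains assigned to type j and N is the total number of strains. The abstract does not state which formula was used, so the sketch below assumes this one; the function name and the strain labels are hypothetical.

    ```python
    from collections import Counter

    def discriminatory_index(type_assignments):
        """Simpson's index of diversity for a list with one type label per strain."""
        n_strains = len(type_assignments)
        if n_strains < 2:
            raise ValueError("at least two strains are required")
        counts = Counter(type_assignments).values()
        same_type_pairs = sum(c * (c - 1) for c in counts)
        return 1.0 - same_type_pairs / (n_strains * (n_strains - 1))

    # Hypothetical assignments: four strains, two of which share a sequence type.
    labels = ["ST1", "ST1", "ST2", "ST3"]
    print(discriminatory_index(labels))
    ```

    The value is the probability that two strains drawn at random without replacement fall into different types, which is why a scheme resolving 36 types among 47 strains can score close to 1.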

  12. Characterization, Genome Sequence, and Analysis of Escherichia Phage CICC 80001, a Bacteriophage Infecting an Efficient L-Aspartic Acid Producing Escherichia coli.

    Science.gov (United States)

    Xu, Youqiang; Ma, Yuyue; Yao, Su; Jiang, Zengyan; Pei, Jiangsen; Cheng, Chi

    2016-03-01

    Escherichia phage CICC 80001 was isolated from the bacteriophage-contaminated medium of an Escherichia coli strain, HY-05C (CICC 11022S), which could produce L-aspartic acid. The phage had a head diameter of 45-50 nm and a tail of about 10 nm. The one-step growth curve showed a latent period of 10 min and a rise period of about 20 min. The average burst size was about 198 phage particles per infected cell. Tests were conducted on the plaques, multiplicity of infection, and host range. The genome of CICC 80001 was sequenced (38,810 bp in length) and annotated. The key proteins leading to host-cell lysis were phylogenetically analyzed. One protein belonged to the class II holins, and the other two belonged to the endopeptidase family and the N-acetylmuramoyl-L-alanine amidase family, respectively. The genome showed a sequence identity of 82.7% with that of Enterobacteria phage T7, and carried ten unique open reading frames. A bacteriophage-resistant E. coli strain, designated CICC 11021S, was bred, and its L-aspartase activity was 84.4% of that of CICC 11022S.

  13. Purification, characterization and N-terminal amino acid sequence of a new major allergen from European chestnut pollen--Cas s 1.

    Science.gov (United States)

    Kos, T; Hoffmann-Sommergruber, K; Ferreira, F; Hirschwehr, R; Ahorn, H; Horak, F; Jager, S; Sperr, W; Kraft, D; Scheiner, O

    1993-11-15

    Pollens from trees of the order Fagales (e.g. birch, alder, hazel, and hornbeam) all contain one major allergen--the main cause for tree pollen allergy. So far the major allergens from birch (Bet v 1), alder (Aln g 1), hazel (Cor a 1), and hornbeam (Car b 1) have been characterized, showing high sequence similarity with each other (1-4). We present the molecular and immunologic characterization of Cas s 1, the major allergen from the European chestnut (Castanea sativa). From aqueous pollen extracts from European chestnut a protein was purified to homogeneity and was subjected to further investigation. The protein revealed a Mr of 22 kDa and was shown to represent the major allergen of the European chestnut (immunoblotting, histamine release) and designated Cas s 1. Despite a marked difference in Mr, Cas s 1 shows significant amino acid sequence similarity at the N-terminus and is antigenically closely related to the major birch pollen allergen Bet v 1 (17 kDa), as shown by binding to the anti-Bet v 1 monoclonal antibody BIP-1 and by IgE-inhibition tests using recombinant Bet v 1.

  14. RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

    Science.gov (United States)

    Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

    2015-04-09

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  15. RNA Sequencing Identifies Upregulated Kyphoscoliosis Peptidase and Phosphatidic Acid Signaling Pathways in Muscle Hypertrophy Generated by Transgenic Expression of Myostatin Propeptide

    Directory of Open Access Journals (Sweden)

    Yuanxin Miao

    2015-04-01

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  16. Production of bio-hydrogen by mesophilic anaerobic fermentation in an acid-phase sequencing batch reactor.

    Science.gov (United States)

    Cheong, Dae-Yeol; Hansen, Conly L; Stevens, David K

    2007-02-15

    The pH and hydraulic retention time (HRT) of an anaerobic sequencing batch reactor (ASBR) were varied to optimize the conversion of carbohydrate-rich synthetic wastewater into bio-hydrogen. A full factorial design using evolutionary operation (EVOP) was used to determine the effect of the factors and to find the optimum condition of each factor required for a high hydrogen production rate. Experimental results from 20 runs indicate that a maximum hydrogen production rate of 4,460-5,540 mL/L/day under the volumetric organic loading rate (VOLR) of 75 g-COD/L/day was obtained at an observed design point of HRT = 8 h and pH = 5.7. The hydrogen production rate was strongly dependent on the HRT, and the effect was statistically significant (P < 0.05), whereas no significant effect (P > 0.05) was found for the pH on the hydrogen production rate. When the ASBR conditions were set for a maximum hydrogen production rate, the hydrogen production yield and specific hydrogen production rate were 60-74 mL/g-COD and 330-360 mL/g-VSS/day, respectively. The hydrogen composition was 43-51%, and no methanogenesis was observed. Acetate, propionate, butyrate, valerate, caproate, and ethanol were major liquid intermediate metabolites during runs of this ASBR. The dominant fermentative types were butyrate-acetate or ethanol-acetate, representing the typical anaerobic pathway of Clostridium species. This hydrogen-producing ASBR had a higher hydrogen production rate than that obtained using continuous-flow stirred tank reactors (CSTRs). This study suggests that the hydrogen-producing ASBR is a promising bio-system for prolonged and stable hydrogen production.
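
    The reported yield range can be checked against the reported production rate and loading rate, since a volumetric rate (mL/L/day) divided by a volumetric organic loading rate (g-COD/L/day) gives a yield in mL/g-COD. This is a small arithmetic sketch; it assumes the yield is expressed per gram of COD loaded rather than per gram of COD removed, which the abstract does not specify, and the function name is hypothetical.

    ```python
    def hydrogen_yield(rate_ml_per_l_day, volr_g_cod_per_l_day):
        """Yield in mL H2 per g COD loaded: volumetric rate divided by organic loading rate."""
        return rate_ml_per_l_day / volr_g_cod_per_l_day

    volr = 75.0  # g-COD/L/day, from the abstract
    low = hydrogen_yield(4460.0, volr)   # lower bound of the reported rate
    high = hydrogen_yield(5540.0, volr)  # upper bound of the reported rate
    print(round(low, 1), round(high, 1))
    ```

    The two quotients come out near 59.5 and 73.9 mL/g-COD, in line with the 60-74 mL/g-COD range quoted in the abstract.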

  17. Driver Fatigue Features Extraction

    Directory of Open Access Journals (Sweden)

    Gengtian Niu

    2014-01-01

    Driver fatigue is the main cause of traffic accidents. Extracting effective fatigue features is important for recognition accuracy and traffic safety. To solve this problem, this paper proposes a new method of driver fatigue feature extraction based on the facial image sequence. In this method, each facial image in the sequence is first divided into nonoverlapping blocks of the same size, and Gabor wavelets are employed to extract multiscale and multiorientation features. Then the mean value and standard deviation of each block’s features are calculated. Considering that the facial expression of human fatigue is a dynamic process that develops over time, each block’s features are analyzed across the sequence. Finally, the Adaboost algorithm is applied to select the most discriminating fatigue features. The proposed method was tested on a self-built database which includes a wide range of human subjects of different genders, poses, and illuminations in real-life fatigue conditions. Experimental results show the effectiveness of the proposed method.
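
    The block-statistics step in this abstract (a mean and a standard deviation per nonoverlapping block) can be sketched as follows. This covers only that step: the Gabor filtering is assumed to have already produced a 2-D response map, the function name is hypothetical, and the use of the population standard deviation is an assumption, since the abstract does not say which estimator was used.

    ```python
    from statistics import mean, pstdev

    def block_features(response, block_size):
        """Return one (mean, std) pair per nonoverlapping block, in row-major block order.

        `response` is a 2-D list, e.g. one Gabor filter response for one face image.
        Rows or columns that do not fill a complete block are ignored.
        """
        n_rows, n_cols = len(response), len(response[0])
        features = []
        for r0 in range(0, n_rows - block_size + 1, block_size):
            for c0 in range(0, n_cols - block_size + 1, block_size):
                values = [
                    response[r][c]
                    for r in range(r0, r0 + block_size)
                    for c in range(c0, c0 + block_size)
                ]
                features.append((mean(values), pstdev(values)))
        return features

    # Hypothetical 2x4 response map split into two 2x2 blocks.
    toy_response = [
        [1, 1, 2, 4],
        [1, 1, 2, 4],
    ]
    print(block_features(toy_response, 2))
    ```

    Applying the same function to every frame gives, for each block, a time series of (mean, std) pairs, which is the per-block sequence the abstract describes analyzing before feature selection.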

  18. FASMA: A Service to Format and Analyze Sequences in Multiple Alignments

    Institute of Scientific and Technical Information of China (English)

    Susan Costantini; Giovanni Colonna; Angelo M. Facchiano

    2007-01-01

    Multiple sequence alignments are successfully applied in many studies for understanding the structural and functional relations among single nucleic acids and protein sequences as well as whole families. Because of the rapid growth of sequence databases, multiple sequence alignments can often be very large and difficult to visualize and analyze. We offer a new service aimed to visualize and analyze the multiple alignments obtained with different external algorithms, with new features useful for the comparison of the aligned sequences as well as for the creation of a final image of the alignment. The service is named FASMA and is available at http://bioinformatica.isa.cnr.it/FASMA/.

  19. Purification and partial amino acid sequences of a new presynaptic toxin and a cytotoxin from venom of pit viper Agkistrodon blomhoffii brevicaudus

    Institute of Scientific and Technical Information of China (English)

    吉永华; 陆丽芳; 徐科; Hiroyuki Hattori; Susumu Terakawa

    1995-01-01

    Reverse-phase high-performance liquid chromatography (RP-HPLC) was employed to separate and purify the toxic proteins from the venom of Agkistrodon blomhoffii brevicaudus collected in China. Three toxic proteins, designated AgTx-1, AgTx-2 and AgTx-3 and each consisting of about 122 amino acid residues, were screened. The toxicities (LD50) of AgTx-1, AgTx-2 and AgTx-3 were 0.075, 0.51 and 6.6 mg per kg body weight in mice, respectively. Toxicological experiments in the chick biventer cervicis nerve-muscle preparation showed that the acetylcholine (ACh) sensitivity of the preparation was unchanged after the total failure of the indirect contraction caused by AgTx-1 and AgTx-2, suggesting that they were presynaptic blockers, namely β-type snake toxins. However, the amplitude of indirect contraction of the preparation was gradually reduced due to its incomplete relaxation caused by AgTx-3, indicating that it should belong to the category of cytotoxins. The partial amino acid sequences of 3 toxins have been

  20. Amino acids

    Science.gov (United States)

    ... this page: //medlineplus.gov/ency/article/002222.htm Amino acids are organic compounds that combine to form proteins. ...

  1. Solid-Phase Purification of Synthetic DNA Sequences.

    Science.gov (United States)

    Grajkowski, Andrzej; Cieslak, Jacek; Beaucage, Serge L

    2016-08-05

    Although high-throughput methods for solid-phase synthesis of DNA sequences are currently available for synthetic biology applications and technologies for large-scale production of nucleic acid-based drugs have been exploited for various therapeutic indications, little has been done to develop high-throughput procedures for the purification of synthetic nucleic acid sequences. An efficient process for purification of phosphorothioate and native DNA sequences is described herein. This process consists of functionalizing commercial aminopropylated silica gel with aminooxyalkyl functions to enable capture of DNA sequences carrying a 5'-siloxyl ether linker with a "keto" function through an oximation reaction. Deoxyribonucleoside phosphoramidites functionalized with the 5'-siloxyl ether linker were prepared in yields of 75-83% and incorporated last into the solid-phase assembly of DNA sequences. Capture of nucleobase- and phosphate-deprotected DNA sequences released from the synthesis support is demonstrated to proceed near quantitatively. After shorter than full-length DNA sequences were washed from the capture support, the purified DNA sequences were released from this support upon treatment with tetra-n-butylammonium fluoride in dry DMSO. The purity of released DNA sequences exceeds 98%. The scalability and high-throughput features of the purification process are demonstrated without sacrificing purity of the DNA sequences.

  2. Interconnection between the protein solubility and amino acid and dipeptide compositions.

    Science.gov (United States)

    Niu, Xiaohui; Li, Nana; Chen, Dinyan; Wang, Zengzhen

    2013-01-01

    Obtaining soluble proteins in sufficient concentrations helps increase the overall success rate in various experimental studies. Protein solubility is an individual trait ultimately determined by the primary protein sequence. Exploring the interconnection between protein solubility and the composition of the protein sequence is instrumental for setting priorities on targets in large-scale proteomics projects. In this paper, the amino acid composition (20 dimensions) and the dipeptide composition (400 dimensions) were extracted to form the total candidate feature pool (420 dimensions). The features were sorted by the absolute value of their correlation coefficient and added to the feature vector one by one. Finally, we evaluated and recorded the 420 results obtained with a Support Vector Machine (SVM) as the prediction engine. According to the SVM results, the first 208 of the 420 features were chosen as the efficient ones. By analyzing the composition of these 208 features, we found that protein solubility was significantly influenced by the occurrence frequencies of the acidic amino acids, basic amino acids, non-polar hydrophobic amino acids and the two polar neutral amino acids (C, Q) in the protein sequences. Additionally, we found that the dipeptides composed of the acidic amino acids (D, E) and basic amino acids (K, R and H), especially the dipeptides composed of the acidic amino acids (D, E), had a strong interconnection with protein solubility.
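
    As an illustration of the 420-dimensional feature pool described above, the sketch below computes the 20 amino acid frequencies followed by the 400 dipeptide frequencies for one sequence. The function name `composition_features` and the ordering of the features are assumptions made for this example; the abstract does not specify an implementation.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"          # 20 standard residues
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs

def composition_features(seq):
    """Return 20 amino acid frequencies followed by 400 dipeptide frequencies.

    Assumes `seq` has at least two residues. Non-standard letters are
    simply not counted in any bin.
    """
    seq = seq.upper()
    n = len(seq)
    aa_freq = [seq.count(a) / n for a in AMINO_ACIDS]
    # Overlapping adjacent pairs: positions (0,1), (1,2), ...
    pairs = [seq[i:i + 2] for i in range(n - 1)]
    dp_freq = [pairs.count(d) / len(pairs) for d in DIPEPTIDES]
    return aa_freq + dp_freq

features = composition_features("ACDA")
```

    A feature-ranking step like the one in the abstract would then correlate each of these 420 columns with the solubility label across many proteins.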

  3. Solid phase sequencing of biopolymers

    Science.gov (United States)

    Cantor, Charles R.; Hubert, Koster

    2014-06-24

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Probes may be affixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  4. Human influenza A virus hemagglutinin distinguishes sialyloligosaccharides in membrane-associated gangliosides as its receptor which mediates the adsorption and fusion processes of virus infection. Specificity for oligosaccharides and sialic acids and the sequence to which sialic acid is attached.

    Science.gov (United States)

    Suzuki, Y; Nagao, Y; Kato, H; Matsumoto, M; Nerome, K; Nakajima, K; Nobusawa, E

    1986-12-25

    Human influenza A virus isolates bearing antigenically different H1 (A/PR/8/34), H2 (A/Japan/305/57), and H3 (A/Aichi/2/68, A/X-31) hemagglutinin serotypes caused extensive hemagglutination, low pH fusion, and hemolysis of asialoerythrocytes reconstituted with gangliosides. Sialylparaglobosides (IV3NeuAc-nLc4Cer, IV6NeuAc-nLc4Cer), I-active and i-active (VI3NeuAc-nLc6Cer) gangliosides, and GM3-NeuAc commonly exhibited significant specific receptor activity toward the viruses. A/PR/8/34 recognized IV3NeuAc-nLc4Cer containing the NeuAc alpha 2-3Gal sequence preferentially over IV6NeuAc-nLc4Cer containing NeuAc alpha 2-6Gal, whereas the other two recognized the NeuAc alpha 2-6Gal sequence preferentially over NeuAc alpha 2-3Gal. Responsiveness of erythrocytes labeled with gangliosides containing NeuGc to the viruses used was considerably lower than that of erythrocytes labeled with gangliosides containing NeuAc. The activities of GM1a, GM2, and GD1b bearing NeuAc on inner galactose of the ganglio series core were also very low. These results indicate that sialyloligosaccharides of IV3NeuAc-nLc4Cer, IV6NeuAc-nLc4Cer, I-active ganglioside, and VI3NeuAc-nLc6Cer in addition to GM3-NeuAc and GM1b-NeuAc (Suzuki, Y., Matsunaga, M., and Matsumoto, M. (1985), J. Biol. Chem. 260, 1362-1365; Suzuki, Y., Matsunaga, M., Nagao, Y., Taki, T., Hirabayashi, Y., and Matsumoto, M. (1985) Vaccine 3, 201-203) are functional receptor determinants toward hemagglutinin of human influenza A viruses, and the viruses differentiate microdomains of the gangliosides, such as the sialic acid species (NeuAc, NeuGc) and the sequence of sialic acid linkages (NeuAc alpha 2-3Gal, NeuAc alpha 2-6Gal).

  5. Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

    Science.gov (United States)

    Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

    2003-01-01

    A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171

  6. Prediction of human protein function from post-translational modifications and localization features

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Blom, Nikolaj;

    2002-01-01

    a number of functional attributes that are more directly related to the linear sequence of amino acids, and hence easier to predict, than protein structure. These attributes include features associated with post-translational modifications and protein sorting, but also much simpler aspects...

  7. Complete Genome Sequence of the Unclassified Iron-Oxidizing, Chemolithoautotrophic Burkholderiales Bacterium GJ-E10, Isolated from an Acidic River.

    Science.gov (United States)

    Fukushima, Jun; Tojo, Fuyumi; Asano, Ryoki; Kobayashi, Yayoi; Shimura, Yoichiro; Okano, Kunihiro; Miyata, Naoyuki

    2015-02-05

    Burkholderiales bacterium GJ-E10, isolated from the Tamagawa River in Akita Prefecture, Japan, is an unclassified, iron-oxidizing chemolithoautotrophic bacterium. Its single circular genome, consisting of 3,276,549 bp, was sequenced by using three types of next-generation sequencers and the sequences were then confirmed by PCR-based Sanger sequencing.

  8. [ANALYSIS OF ARACHIDONIC ACID RELATIVE CONTENT CHANGES IN ERYTHROCYTES AND PLATELETS PHOSPHOLIPIDS MEMBRANES FEATURES IN CORONARY HEART DISEASE WITH ATRIAL FIBRILLATION PATIENTS].

    Science.gov (United States)

    Lizogub, V G; Zavalska, T V; Merkulova, I O; Bryuzgina, T S

    2015-01-01

    The fatty acid spectrum of erythrocyte and platelet membrane phospholipids was determined in patients with coronary heart disease and atrial fibrillation and in patients with coronary heart disease without atrial fibrillation. In total, 87 patients were investigated. A significant decrease in the relative content of arachidonic acid was observed in coronary heart disease patients compared with healthy individuals. A significant decrease was also observed in patients with coronary heart disease and atrial fibrillation compared with coronary heart disease patients without atrial fibrillation. These data may indicate that a decrease in the relative content of arachidonic acid is a possible pathogenetic link in the development of arrhythmias.

  9. Sequencing of Intron 3 of Porcine Heart Fatty Acid-Binding Protein Gene

    Institute of Scientific and Technical Information of China (English)

    杨文平; 张家琦; 李彩桃; 王明艳; 张红梅; 李超; 曹果清; 周忠孝

    2012-01-01

    [Objective] The aim of this paper is to provide basic data for marker-assisted selection in pig breeding using the porcine heart fatty acid-binding protein (H-FABP) gene. [Method] According to the related sequences of the porcine H-FABP gene released in GenBank, specific primers were designed to amplify intron 3 of the gene, and the purified PCR product was sequenced directly. [Result] Intron 3 of the porcine H-FABP gene was amplified successfully. Its whole sequence was 1,350 bp in length and has been submitted to GenBank (accession no. DQ 002993). [Conclusion] The study lays a theoretical foundation for determination of the major genes affecting intramuscular fat deposition.

  10. Sequence ID and annotation information - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Number of data entries: -. Data items (field and description): Field 1, ID of amino acid sequence (Sequence ID); Field 2, Length of amino acid sequence; Field 3, Annotation of amino acid sequence.

  11. On the activation of bovine plasma factor XIII. Amino acid sequence of the peptide released by thrombin and the terminal residues of the subunit polypeptides.

    Science.gov (United States)

    Nakamura, S; Iwanaga, S; Suzuki, T

    1975-12-01

    A blood coagulation factor, Factor XIII, was highly purified from fresh bovine plasma by a method similar to those used for human plasma Factor XIII. The isolated Factor XIII consisted of two subunit polypeptides, a and b chains, with molecular weights of 79,000 +/- 2,000 and 75,000 +/- 2,000, respectively. In the conversion of Factor XIII to the active enzyme, Factor XIIIa, by bovine thrombin [EC 3.4.21.5], a peptide was liberated. This peptide, designated tentatively as "activation peptide," was isolated by gel filtration on a Sephadex G-75 column. It contained a total of 37 amino acid residues with a masked N-terminal residue and C-terminal arginine. The whole amino acid sequence of the "activation peptide" was established by the dansyl-Edman method and standard enzymatic techniques, and the masked N-terminal residue was identified as N-acetylserine by using a rat liver acylamino acid-releasing enzyme. This enzyme specifically cleaved the N-acetylserylglutamyl peptide bond, releasing N-acetylserine and the remaining peptide, which was now reactive to 1-dimethylamino-naphthalene-5-sulfonyl chloride. A comparison of the sequences of the human and bovine "activation peptide" revealed five amino acid replacements: Ser-3 to Thr, Gly-5 to Arg, Ile-14 to Val, Thr-18 to Asn, and Pro-26 to Leu. Another difference was the deletion of Leu-34 in the human peptide. Adsorption chromatography on a hydroxylapatite column in the presence of 0.1% sodium dodecyl sulfate was developed as a preparative procedure for the resolution of the two subunit polypeptides, a or a' chain and b chain, constituting the protein molecule of Factor XIII or Factor XIIIa. End group analyses on the isolated pure chains revealed that the structural change of Factor XIII during activation with thrombin occurs only in the N-terminal portion of the a chain, not in the N-terminal end of the b chain or in the C-terminal ends of the a and b chains. From these results, it was concluded that the activation of bovine plasma Factor XIII

  12. Higher specificity of nucleic acid sequence-based amplification isothermal technology than of real-time PCR for quantification of HIV-1 RNA on dried blood spots.

    Science.gov (United States)

    Mercier-Delarue, Severine; Vray, Muriel; Plantier, Jean Christophe; Maillard, Theodora; Adjout, Zidan; de Olivera, Fabienne; Schnepf, Nathalie; Maylin, Sarah; Simon, Francois; Delaugerre, Constance

    2014-01-01

    Dried blood spots (DBS) are widely proposed as a plasma surrogate for monitoring antiretroviral treatment efficacy based on the HIV-1 RNA level (viral load [VL]) in resource-limited settings. Interfering coamplification of cell-associated HIV-1 DNA during reverse transcription (RT)-PCR can be avoided by using nucleic acid sequence-based amplification (NASBA) technology, which is based on an RNA template and isothermic conditions. We analyzed VL values obtained with DBS and plasma samples by comparing isothermic NASBA (NucliSENS EasyQ HIV-1 V2.0; bioMérieux) with real-time RT-PCR (Cobas TaqMan HIV-1 V2.0; Roche). Samples from 197 HIV-1-infected patients were tested (non-B subtypes in 51% of the cases). Nucleic acid extractions were performed by use of NucliSENS EasyMAG (bioMérieux) and Cobas AmpliPrep (Roche) before the NASBA and RT-PCR quantifications, respectively. Both quantification assays have lower limits of detection of 20 copies/ml (1.3 log10 copies/ml [log]) in plasma and 800 copies/ml (2.9 log) in DBS. The mean (DBS minus plasma) differences were -0.39 and -0.46 log, respectively, for RT-PCR and NASBA. RT-PCR on DBS identified virological failure in 122 of 126 patients (sensitivity, 97%) and viral suppression in 58 of 70 patients (specificity, 83%), yielding 12 false-positive results (median, 3.2 log). NASBA on DBS identified virological failure in 85 of 96 patients (sensitivity, 89%) and viral suppression in 95 of 97 patients (specificity, 98%) and yielded 2 false-positive results (3.0 log for both). Both technologies detected HIV-1 RNA in DBS at a threshold of 800 copies/ml. This higher specificity of NASBA technology could avoid overestimation of poor compliance or the emergence of resistance when monitoring antiretroviral efficacy with the DBS method.
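
    The percentages and log values quoted in this abstract follow directly from the reported counts. The short sketch below reproduces that arithmetic; the dictionary layout and the helper name `percent` are illustrative choices, not part of the study.

```python
import math

def percent(hits, total):
    """Share of correctly classified patients, in percent."""
    return 100.0 * hits / total

# Counts quoted in the abstract: (correctly identified, total in that group).
dbs_results = {
    "RT-PCR": {"sensitivity": (122, 126), "specificity": (58, 70)},
    "NASBA": {"sensitivity": (85, 96), "specificity": (95, 97)},
}

for assay, metrics in dbs_results.items():
    for name, (hits, total) in metrics.items():
        print(f"{assay} {name}: {percent(hits, total):.1f}%")

# Detection limits converted from copies/ml to log10 copies/ml.
for copies in (20, 800):
    print(f"{copies} copies/ml = {math.log10(copies):.1f} log")
```

    The false-positive counts are the complements of the specificity numerators: 70 - 58 = 12 for RT-PCR and 97 - 95 = 2 for NASBA.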

  13. Evaluation of T-cell responses to peptides and lipopeptides with MHC class I binding motifs derived from the amino acid sequence of the 19-kDa lipoprotein of Mycobacterium tuberculosis

    NARCIS (Netherlands)

    Fonseca, DPAJ; Joosten, D; Snippe, H; Verheul, AFM

    2000-01-01

    Cytotoxic T-lymphocyte (CTL) epitopes on the 19-kDa lipoprotein from Mycobacterium tuberculosis were identified by the use of lipopeptides and their cytokine profile studied. Selection of candidate CTL epitopes was based on synthetic peptides derived from the amino acid sequence of the 19-kDa lipopr

  14. Genome sequences of the high-acetic acid-resistant bacteria Gluconacetobacter europaeus LMG 18890T and G. europaeus LMG 18494 (reference strains), G. europaeus 5P3, and Gluconacetobacter oboediens 174Bp2 (isolated from vinegar).

    Science.gov (United States)

    Andrés-Barrao, Cristina; Falquet, Laurent; Calderon-Copete, Sandra P; Descombes, Patrick; Ortega Pérez, Ruben; Barja, François

    2011-05-01

    Bacteria of the genus Gluconacetobacter are usually involved in the industrial production of vinegars with high acetic acid concentrations. We describe here the genome sequence of three Gluconacetobacter europaeus strains, a very common bacterial species from industrial fermentors, as well as of a Gluconacetobacter oboediens strain.

  15. Amino acid sequence requirements at residues 69 and 238 for the SME-1 beta-lactamase to confer resistance to beta-lactam antibiotics.

    Science.gov (United States)

    Majiduddin, Fahd K; Palzkill, Timothy

    2003-03-01

    Carbapenem antibiotics have been used to counteract resistant strains of bacteria harboring beta-lactamases and extended-spectrum beta-lactamases. Four enzymes from the class A group of beta-lactamases, NMC-A, IMI-1, SME-1, and KPC-1, efficiently hydrolyze carbapenem antibiotics. Sequence comparisons and structural information indicate that cysteines at amino acid residues 69 and 238, which are conserved in all four of these enzymes, form a disulfide bond that is unique to these beta-lactamases. To test whether this disulfide bond is required for catalytic activity, the codons for residues Cys69 and Cys238 were randomized individually and simultaneously by PCR-based mutagenesis to create random replacement libraries for these positions. Mutants that were able to confer resistance to ampicillin, imipenem, or cefotaxime were selected from these libraries. The results indicate that positions Cys69 and Cys238 are critical for hydrolysis of all of the antibiotics tested, suggesting that the disulfide bond is generally required for this enzyme to catalyze the hydrolysis of beta-lactam antibiotics.

  16. Domain structures and molecular evolution of class I and class II major histocompatibility gene complex (MHC) products deduced from amino acid and nucleotide sequence homologies.

    Science.gov (United States)

    Ohnishi, K

    1984-01-01

    Domain structures of class I and class II MHC products were analyzed from the viewpoint of amino acid and nucleotide sequence homologies. Alignment statistics revealed that class I (transplantation) antigen H chains consist of four mutually homologous domains, and that class II (HLA-DR) antigen beta and alpha chains are both composed of three mutually homologous ones. The N-terminal three and two domains of class I and class II (both beta and alpha) gene products, respectively, all of which are approximately 90 residues long, were concluded to be homologous to beta2-microglobulin (beta2M). The membrane-embedded C-terminal shorter domains of these MHC products were also found to be homologous to one another and to the third domain of class I H chains. Class I H chains were found to be more closely related to class II alpha chains than to class II beta chains. Based on these findings, an exon duplication history from a common ancestral gene encoding a beta2M-like primordial protein of one-domain length up to the contemporary MHC products was proposed.

  17. PhTX-II a Basic Myotoxic Phospholipase A2 from Porthidium hyoprora Snake Venom, Pharmacological Characterization and Amino Acid Sequence by Mass Spectrometry

    Directory of Open Access Journals (Sweden)

    Salomón Huancahuire-Vega

    2014-10-01

    A monomeric basic PLA2 (PhTX-II) of 14149.08 Da molecular weight was purified to homogeneity from Porthidium hyoprora venom. Amino acid sequencing by tandem mass spectrometry revealed that PhTX-II belongs to the Asp49 PLA2 enzyme class and displays conserved domains such as the catalytic network, the Ca2+-binding loop and the hydrophobic channel of access to the catalytic site, reflected in the high catalytic activity displayed by the enzyme. Moreover, PhTX-II PLA2 showed an allosteric behavior and its enzymatic activity was dependent on Ca2+. Examination of PhTX-II PLA2 by CD spectroscopy indicated a high content of alpha-helical structures, similar to the known structure of secreted phospholipase IIA group, suggesting a similar folding. PhTX-II PLA2 caused neuromuscular blockade in avian neuromuscular preparations with a significant direct action on skeletal muscle function, and induced local edema and myotoxicity in mice. Treatment of PhTX-II with BPB resulted in complete loss of its catalytic activity, accompanied by loss of its edematogenic effect. On the other hand, the enzymatic activity of PhTX-II contributes to the neuromuscular blockade, whereas local myotoxicity does not depend on enzymatic activity alone. These results show that PhTX-II is a myotoxic Asp49 PLA2 that contributes to the toxic actions caused by P. hyoprora venom.

  18. Analysis of amino acid sequences of penicillin-binding protein 2 in clinical isolates of Neisseria gonorrhoeae with reduced susceptibility to cefixime and ceftriaxone.

    Science.gov (United States)

    Osaka, Kazuyoshi; Takakura, Tadakazu; Narukawa, Kayo; Takahata, Masahiro; Endo, Katsuhisa; Kiyota, Hiroshi; Onodera, Shoichi

    2008-06-01

    Neisseria gonorrhoeae strains with reduced susceptibility to cefixime and ceftriaxone, with minimum inhibitory concentrations (MICs) of cefixime of 0.125-0.25 microg/ml and ceftriaxone of 0.031-0.125 microg/ml, were isolated from male urethritis patients in Tokyo, Japan, in 2006. The amino acid sequences of PenA, penicillin-binding protein 2, in these strains were of two types: PenA mosaic and nonmosaic strains. In the PenA mosaic strain, some regions in the transpeptidase-encoding domain in PenA were similar to those of Neisseria perflava/sicca, Neisseria cinerea, Neisseria flavescens, Neisseria polysaccharea, and Neisseria meningitidis. In the PenA nonmosaic strain, there was a mutation of Ala-501 to Val in PenA. In addition, we performed homology modeling of PenA wild-type and mosaic strains and compared them. The results of the modeling studies suggested that reduced susceptibility to cephems such as cefixime and ceftriaxone is due to a conformational alteration of the beta-lactam-binding pocket. These results also indicated that the mosaic structures and the above point mutation in PenA make a major contribution to the reduced susceptibility to cephem antibiotics.

  19. Indole acetic acid production by fluorescent Pseudomonas spp. from the rhizosphere of Plectranthus amboinicus (Lour.) Spreng. and their variation in extragenic repetitive DNA sequences.

    Science.gov (United States)

    Sethia, Bedhya; Mustafa, Mariam; Manohar, Sneha; Patil, Savita V; Jayamohan, Nellickal Subramanian; Kumudini, Belur Satyan

    2015-06-01

    Fluorescent Pseudomonas (FP) is a heterogeneous group of growth-promoting rhizobacteria that regulate plant growth by releasing secondary metabolic compounds, viz. indole acetic acid (IAA), siderophores, ammonia and hydrogen cyanide. In the present study, IAA-producing FPs from the rhizosphere of Plectranthus amboinicus were characterized morphologically, biochemically and at the molecular level. Molecular identification of the isolates was carried out using Pseudomonas-specific primers. The effects of varying time (24, 48, 72 and 96 h), Trp concentration (100, 200, 300, 400 and 500 μg/ml), temperature (10, 26, 37 and 50 ± 2 degrees C) and pH (6, 7 and 8) on IAA production by the 10 best isolates were studied. Results showed higher IAA production at 72 h incubation, at 300 μg/ml Trp concentration, temperature 26 ± 2 degrees C and pH 7. TLC with acidified ethyl acetate extract showed that the IAA produced has an Rf value similar to that of the standard IAA. The TLC results were confirmed by HPLC analysis. Genetic diversity of the isolates was also studied using 40 RAPD and 4 Rep primers. Genetic diversity parameters such as dominance, Shannon index and Simpson index were calculated. Out of 40 RAPD primers tested, 9 (2 OP-D series and 7 OP-E series) were shortlisted for further analysis. Studies using RAPD, ERIC, BOX, REP and GTG5 primers revealed that the isolates exhibit significant diversity in repetitive DNA sequences irrespective of the rhizosphere.
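
    The diversity parameters named in this abstract can be illustrated with a short sketch. It assumes one common convention, in which dominance is the sum of squared proportions and the Simpson index is one minus dominance; the abstract does not state which definitions or software the authors used, and the input counts below are made up for illustration.

```python
import math

def shannon_index(counts):
    """Shannon index H = -sum(p * ln p) over categories with p > 0."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

def dominance(counts):
    """Dominance D = sum(p^2); 1.0 means a single category holds everything."""
    total = sum(counts)
    return sum((c / total) ** 2 for c in counts)

def simpson_index(counts):
    """Simpson index taken here as 1 - D (assumed convention)."""
    return 1.0 - dominance(counts)

# Hypothetical counts, e.g. isolates per banding-pattern group.
even = [5, 5, 5, 5]
skewed = [17, 1, 1, 1]
print(shannon_index(even), simpson_index(even))
print(shannon_index(skewed), simpson_index(skewed))
```

    Under these definitions an evenly spread set of groups scores higher on both the Shannon and Simpson indices than a set dominated by one group.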

  20. Visual detection and differentiation of Classic Swine Fever Virus strains using nucleic acid sequence-based amplification (NASBA) and G-quadruplex DNAzyme assay

    Science.gov (United States)

    Lu, Xiaolu; Shi, Xueyao; Wu, Gege; Wu, Tiantian; Qin, Rui; Wang, Yi

    2017-01-01

    The split G-quadruplex DNAzyme has emerged as a valuable tool for visual DNA detection. Here, we successfully integrated a colorimetric split G-quadruplex DNAzyme assay with nucleic acid sequence-based amplification to generate a novel detection approach, allowing visual and rapid detection of the RNA of the Shimen and HCLV strains of Classic Swine Fever Virus (CSFV). CSFV is an RNA virus that causes a highly contagious disease in domestic pigs and wild boar. With this method, we were able to detect as little as 10 copies/ml of CSF viral RNA within 3 h in serum samples taken from the field. No interference was encountered in the amplification and detection of Classic Swine Fever Virus in the presence of non-target RNA or DNA. Moreover, the Shimen and HCLV strains of Classic Swine Fever Virus could be easily differentiated using the NASBA-DNAzyme system. These findings indicate that the NASBA-DNAzyme system is a rapid and practical technique for detecting and discriminating CSFV strains and may be applied to the detection of other RNA viruses. PMID:28287135

  1. Understanding Legacy Features with Featureous

    DEFF Research Database (Denmark)

    Olszak, Andrzej; Jørgensen, Bo Nørregaard

    2011-01-01

    Feature-centric comprehension of source code is essential during software evolution. However, such comprehension is oftentimes difficult to achieve due to the discrepancies between structural and functional units of object-oriented programs. We present a tool for feature-centric analysis of legacy...

  2. SIFT based algorithm for point feature tracking

    Directory of Open Access Journals (Sweden)

    Adrian BURLACU

    2007-12-01

    In this paper a tracking algorithm for SIFT features in image sequences is developed. For each point feature extracted using the SIFT algorithm, a descriptor is computed using information from its neighborhood. Point features are then tracked throughout the image sequence by an algorithm that minimizes the distance between two descriptors. Experimental results, obtained from image sequences that capture the scaling of objects of different geometrical types, reveal the performance of the tracking algorithm.
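
    A minimal sketch of tracking by descriptor distance is given below. It assumes descriptors are already extracted and stored as equal-length numeric tuples, and it uses plain Euclidean nearest-neighbor matching; the function name `match_descriptors` and the optional `max_dist` cutoff are hypothetical, and the paper's actual matching rule may differ.

```python
import math

def match_descriptors(prev, curr, max_dist=float("inf")):
    """Match each descriptor in `prev` to its nearest descriptor in `curr`.

    Returns a list of (prev_index, curr_index, distance) tuples. Matches
    farther apart than `max_dist` are dropped.
    """
    if not curr:
        return []
    matches = []
    for i, d in enumerate(prev):
        # Nearest neighbor under Euclidean distance.
        j, dist = min(((j, math.dist(d, e)) for j, e in enumerate(curr)),
                      key=lambda pair: pair[1])
        if dist <= max_dist:
            matches.append((i, j, dist))
    return matches

# Toy 2-D "descriptors" (real SIFT descriptors have 128 dimensions).
frame_a = [(0.0, 0.0), (10.0, 10.0)]
frame_b = [(9.0, 10.0), (0.0, 1.0)]
tracks = match_descriptors(frame_a, frame_b)
```

    Applying the same matching between each consecutive pair of frames chains the matches into tracks across the whole sequence.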

  3. Feature Extraction

    CERN Document Server

    CERN. Geneva

    2015-01-01

    Feature selection and reduction are key to robust multivariate analyses. In this talk I will focus on pros and cons of various variable selection methods and focus on those that are most relevant in the context of HEP.

  4. Site Features

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset consists of various site features from multiple Superfund sites in U.S. EPA Region 8. These data were acquired from multiple sources at different times...

  5. Solar Features

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Collection includes a variety of solar feature datasets contributed by a number of national and private solar observatories located worldwide.

  6. Molecular field analysis and 3D-quantitative structure-activity relationship study (MFA 3D-QSAR) unveil novel features of bile acid recognition at TGR5.

    Science.gov (United States)

    Macchiarulo, Antonio; Gioiello, Antimo; Thomas, Charles; Massarotti, Alberto; Nuti, Roberto; Rosatelli, Emiliano; Sabbatini, Paola; Schoonjans, Kristina; Auwerx, Johan; Pellicciari, Roberto

    2008-09-01

    Bile acids regulate nongenomic actions through the activation of TGR5, a membrane receptor that is G protein-coupled to the induction of adenylate cyclase. In this work, a training set of 43 bile acid derivatives is used to develop a molecular interaction field analysis (MFA) and a 3D-quantitative structure-activity relationship study (3D-QSAR) of TGR5 agonists. The predictive ability of the resulting model is evaluated using an external set of compounds with known TGR5 activity, and six bile acid derivatives whose unknown TGR5 activity is herein assessed with in vitro luciferase assay of cAMP formation. The results show a good predictive model and indicate a statistically relevant degree of correlation between the TGR5 activity and the molecular interaction fields produced by discrete positions of the bile acid scaffold. This information is instrumental to extend on a quantitative basis the current structure-activity relationships of bile acids as TGR5 modulators and will be fruitful to design new potent and selective agonists of the receptor.

  7. Temporal Feature Integration for Music Organisation

    OpenAIRE

    Meng, Anders; Larsen, Jan; Hansen, Lars Kai

    2006-01-01

    This Ph.D. thesis focuses on temporal feature integration for music organisation. Temporal feature integration is the process of combining all the feature vectors of a given time-frame into a single new feature vector in order to capture relevant information in the frame. Several existing methods for handling sequences of features are formulated in the temporal feature integration framework. Two datasets for music genre classification have been considered as valid test-beds for music organisa...
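
    The idea of collapsing the feature vectors of a time-frame into one new vector can be sketched as follows. This is a minimal illustration that assumes a simple mean-and-variance summary over a sliding window; the function name, window length and hop size are hypothetical, and the record above does not confirm which integration methods the thesis actually uses.

```python
from statistics import fmean, pvariance


def mean_var_integration(frames, win, hop):
    """Summarise short-time feature vectors over sliding windows.

    frames: list of equal-length feature vectors, one per short-time frame.
    win:    number of consecutive frames combined into one vector (assumed).
    hop:    step between successive windows, in frames (assumed).
    Returns one vector per window: per-dimension means, then variances.
    """
    integrated = []
    for start in range(0, len(frames) - win + 1, hop):
        block = frames[start:start + win]
        columns = list(zip(*block))  # one tuple per feature dimension
        means = [fmean(col) for col in columns]
        variances = [pvariance(col) for col in columns]
        integrated.append(means + variances)
    return integrated


# Toy input: 8 frames of 3-dimensional features (made-up numbers).
frames = [[float(3 * i + j) for j in range(3)] for i in range(8)]
integrated = mean_var_integration(frames, win=4, hop=2)
print(len(integrated), len(integrated[0]))
```

    Each output vector has twice the dimension of an input frame, so a classifier sees one fixed-size vector per window regardless of how many frames the window covers.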

  8. Design features of a sulphuric acid plant based on lead and zinc sintering machine off-gas % Research and development of new corrosion-resistant alloys for sulphuric acid service

    Institute of Scientific and Technical Information of China (English)

    刘焕安

    2001-01-01

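    Design features of a 150 kt/a sulphuric acid plant based on lead and zinc sintering machine off-gases are described. The plant adopted a single absorption technology including closed dilute-acid-scrubbing gas cleaning and ammonia-acid off-gas treatment. The dilute acid settling system, the cooling water circulation system, the installation of the electrostatic precipitator, the high-temperature absorption technology and acid distributor of the drying and absorption section, and the preheater, hot bypass and insulation of the conversion section are described in detail. % The particular nature of metal corrosion by sulphuric acid and the basic principles of alloy design are discussed. The research, development and range of application of high-silicon stainless steel HD-1 and alloyed ductile cast iron HD-3 for hot concentrated sulphuric acid, and of the high-molybdenum nitrogen-containing austenitic stainless steels HD-7 and HD-11 for dilute sulphuric acid, are introduced.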

  9. Pseudomonas lini Strain ZBG1 Revealed Carboxylic Acid Utilization and Copper Resistance Features Required for Adaptation to Vineyard Soil Environment: A Draft Genome Analysis

    Science.gov (United States)

    Chan, Kok-Gan; Chong, Teik-Min; Adrian, Tan-Guan-Sheng; Kher, Heng Leong; Grandclément, Catherine; Faure, Denis; Yin, Wai-Fong; Dessaux, Yves; Hong, Kar-Wai

    2016-01-01

    Pseudomonas lini strain ZBG1 was isolated from vineyard soil in Zellenberg, France, and its draft genome is reported in this study. Bioinformatics analyses of the genome revealed the presence of genes encoding tartaric and malic acid utilization as well as copper resistance, which correspond to the adaptation of this strain to the vineyard soil environment. PMID:27512520

  10. Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition.

    Science.gov (United States)

    Zhang, Lichao; Zhao, Xiqiang; Kong, Liang

    2014-08-21

    Knowledge of protein structural class plays an important role in characterizing the overall folding type of a given protein. At present, extracting information from the protein sequence alone for structural class prediction of low-similarity sequences remains a challenge in computational biology. In this study, a novel sequence representation method based on the position-specific scoring matrix is proposed for protein structural class prediction. Using a defined evolutionary difference formula, proteins of varying length are expressed as vectors of uniform dimension, which represent evolutionary difference information between adjacent residues of a given protein. To evaluate the proposed method, support vector machines and jackknife tests are employed on three widely used datasets, 25PDB, 1189 and 640, with sequence similarity lower than 25%, 40% and 25%, respectively. Comparison of our results with previous methods shows that our approach is a promising way to predict protein structural class, especially for low-similarity sequences.

  11. Crystal structure of a 2:1 piroxicam-gentisic acid co-crystal featuring neutral and zwitterionic piroxicam molecules.

    Science.gov (United States)

    Horstman, Elizabeth M; Bertke, Jeffery A; Woods, Toby J; Kenis, Paul J A

    2016-12-01

    A new 2:1 co-crystal of piroxicam and gentisic acid [systematic name: 4-hydroxy-1,1-dioxo-N-(pyridin-2-yl)-2H-1λ(6),2-benzothiazine-3-carboxamide-2-(4-oxido-1,1-dioxo-2H-1λ(6),2-benzothiazine-3-amido)pyridin-1-ium-2,5-dihydroxybenzoic acid, 2C15H13N3O4S·C7H6O4] has been synthesized using a microfluidic platform and initially identified using Raman spectroscopy. In the co-crystal, one piroxicam molecule is in its neutral form and an intramolecular O-H⋯O hydrogen bond is observed. The other piroxicam molecule is zwitterionic (proton transfer from the OH group to the pyridine N atom) and two intramolecular N-H⋯O hydrogen bonds occur. The gentisic acid molecule shows whole-molecule disorder over two sets of sites in a 0.809 (2):0.191 (2) ratio. In the crystal, extensive hydrogen bonding between the components forms layers propagating in the ab plane.

  12. Crystal structure of a 2:1 piroxicam–gentisic acid co-crystal featuring neutral and zwitterionic piroxicam molecules

    Science.gov (United States)

    Horstman, Elizabeth M.; Bertke, Jeffery A.; Woods, Toby J.; Kenis, Paul J. A.

    2016-01-01

    A new 2:1 co-crystal of piroxicam and gentisic acid [systematic name: 4-hydroxy-1,1-dioxo-N-(pyridin-2-yl)-2H-1λ6,2-benzothiazine-3-carboxamide–2-(4-oxido-1,1-dioxo-2H-1λ6,2-benzothiazine-3-amido)pyridin-1-ium–2,5-dihydroxybenzoic acid, 2C15H13N3O4S·C7H6O4] has been synthesized using a microfluidic platform and initially identified using Raman spectroscopy. In the co-crystal, one piroxicam molecule is in its neutral form and an intramolecular O-H⋯O hydrogen bond is observed. The other piroxicam molecule is zwitterionic (proton transfer from the OH group to the pyridine N atom) and two intramolecular N-H⋯O hydrogen bonds occur. The gentisic acid molecule shows whole-molecule disorder over two sets of sites in a 0.809 (2):0.191 (2) ratio. In the crystal, extensive hydrogen bonding between the components forms layers propagating in the ab plane. PMID:27980814

  13. Automatic sequences

    CERN Document Server

    Haeseler, Friedrich

    2003-01-01

    Automatic sequences are sequences produced by a finite automaton. Although they are not random, they may look random. They are complicated in the sense that they are not ultimately periodic, and they may also look complicated in the sense that it may not be easy to name the rule by which the sequence is generated; nevertheless, such a rule exists. The concept of automatic sequences has applications in algebra, number theory, finite automata and formal languages, and combinatorics on words. The text deals with different aspects of automatic sequences, in particular: a general introduction to automatic sequences; the basic (combinatorial) properties of automatic sequences; the algebraic approach to automatic sequences; and geometric objects related to automatic sequences.
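
    As a concrete illustration of a sequence produced by a finite automaton, the sketch below generates the Thue-Morse sequence, a standard example of a 2-automatic sequence. The record above does not say which examples the book uses, so the choice of sequence and all names in the code are assumptions made for illustration.

```python
# A two-state automaton with output: it reads the binary digits of n and
# ends in a state whose output is the n-th term of the sequence.
TRANSITIONS = {
    (0, "0"): 0, (0, "1"): 1,  # in state 0, a '1' digit moves to state 1
    (1, "0"): 1, (1, "1"): 0,  # in state 1, a '1' digit moves back to state 0
}
OUTPUT = {0: 0, 1: 1}  # output attached to each state


def automatic_term(n, transitions=TRANSITIONS, output=OUTPUT, start=0):
    """Return the n-th term by feeding the base-2 digits of n to the automaton."""
    state = start
    for digit in bin(n)[2:]:  # most significant digit first
        state = transitions[(state, digit)]
    return output[state]


prefix = [automatic_term(n) for n in range(16)]
print(prefix)
```

    The term for n is the parity of the number of 1 digits in the binary expansion of n, so a very small rule generates a sequence that never becomes periodic.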

  14. Assessment of Epstein-Barr virus nucleic acids in gastric but not in breast cancer by next-generation sequencing of pooled Mexican samples

    Science.gov (United States)

    Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier

    2016-01-01

    Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establishing virus-cancer associations as well as to discovering new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain an idea of cost-effective conditions for experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation platform Illumina GAIIx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline. PMID:26910355

  15. Sequence-specific nucleic acid mobility using a reversible block copolymer gel matrix and DNA amphiphiles (lipid-DNA) in capillary and microfluidic electrophoretic separations

    NARCIS (Netherlands)

    Wagler, Patrick; Minero, Gabriel Antonio S.; Tangen, Uwe; de Vries, Jan Willem; Prusty, Deepak; Kwak, Minseok; Herrmann, Andreas; McCaskill, John S.

    2015-01-01

    Reversible noncovalent but sequence-dependent attachment of DNA to gels is shown to allow programmable mobility processing of DNA populations. The covalent attachment of DNA oligomers to polyacrylamide gels using acrydite-modified oligonucleotides has enabled sequence-specific mobility assays for DN

  16. Assessment of Epstein-Barr virus nucleic acids in gastric but not in breast cancer by next-generation sequencing of pooled Mexican samples

    Directory of Open Access Journals (Sweden)

    Ezequiel M Fuentes-Pananá

    2016-01-01

    Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establishing virus-cancer associations as well as to discovering new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain an idea of cost-effective conditions for experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation platform Illumina GAIIx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline.

  17. Amino acid sequence and immunological characterization with monoclonal antibodies of two toxins from the venom of the scorpion Centruroides noxius Hoffmann.

    Science.gov (United States)

    Zamudio, F; Saavedra, R; Martin, B M; Gurrola-Briones, G; Hérion, P; Possani, L D

    1992-02-15

    Two toxins, which we propose to call toxins 2 and 3, were purified to homogeneity from the venom of the scorpion Centruroides noxius Hoffmann. The full primary structures of both peptides (66 amino acid residues each) were determined. Sequence comparison indicates that the two new toxins display 79% identity and present a high similarity to previously characterized Centruroides toxins, the most similar toxins being Centruroides suffusus toxin 2 and Centruroides limpidus tecomanus toxin 1. Six monoclonal antibodies (mAb) directed against purified fraction II-9.2 (which contains toxins 2 and 3) were isolated in order to carry out the immunochemical characterization of these toxins. mAb BCF2, BCF3, BCF7 and BCF9 reacted only with toxin 2, whereas BCF1 and BCF8 reacted with both toxins 2 and 3 with the same affinity. Simultaneous binding of mAb pairs to the toxin and cross-reactivity of the venoms of different scorpions with the mAb were examined. The results of these experiments showed that the mAb define four different epitopes (A-D). Epitope A (BCF8) is topographically unrelated to epitopes B (BCF2 and BCF7), C (BCF3) and D (BCF9) but the latter three appear to be more closely related or in close proximity to each other. Epitope A was found in all Centruroides venoms tested as well as on four different purified toxins of C. noxius, and thus seems to correspond to a highly conserved structure. Based on the cross-reactivity of their venoms with the mAb, Centruroides species could be classified in the following order: Centruroides elegans, Centruroides suffusus suffusus = Centruroides infamatus infamatus, Centruroides limpidus tecomanus, Centruroides limpidus limpidus, and Centruroides limpidus acatlanensis, according to increasing immunochemical relatedness of their toxins to those of Centruroides noxius. All six mAb inhibited the binding of toxin 2 to rat brain synaptosomal membranes, but only mAb BCF2, which belongs to the IgG2a subclass, displayed a clear

  18. A 30-residue-long "export initiation domain" adjacent to the signal sequence is critical for protein translocation across the inner membrane of Escherichia coli.

    OpenAIRE

    Andersson, H; von Heijne, G

    1991-01-01

    Signal sequences serve to target proteins to the secretory pathway in both prokaryotic and eukaryotic cells. However, although necessary, the presence of a signal sequence is not always sufficient to ensure efficient membrane translocation. One feature of the nascent chain that adversely affects secretion, at least in Escherichia coli, is the presence of positively charged amino acids immediately downstream of the signal sequence. We have exploited this sensitivity to positively charged resid...

  19. α-Lipoic acid treatment increases mitochondrial biogenesis and promotes beige adipose features in subcutaneous adipocytes from overweight/obese subjects.

    Science.gov (United States)

    Fernández-Galilea, Marta; Pérez-Matute, Patricia; Prieto-Hontoria, Pedro L; Houssier, Marianne; Burrell, María A; Langin, Dominique; Martínez, J Alfredo; Moreno-Aliaga, María J

    2015-03-01

    α-Lipoic acid (α-Lip) is a naturally occurring antioxidant with beneficial anti-obesity properties. The aim of this study was to investigate the putative effects of α-Lip on mitochondrial biogenesis and the acquisition of brown-like characteristics by subcutaneous adipocytes from overweight/obese subjects. Thus, fully differentiated human subcutaneous adipocytes were treated with α-Lip (100 and 250 μM) for 24 h for studies on mitochondrial content and morphology, mitochondrial DNA (mtDNA) copy number, fatty acid oxidation enzymes and brown/beige characteristic genes. The involvement of the Sirtuin1/Peroxisome proliferator-activated receptor gamma, coactivator 1 alpha (SIRT1/PGC-1α) pathway was also evaluated. Our results showed that α-Lip increased mitochondrial content in cultured human adipocytes as revealed by electron microscopy and by mitotracker green labeling. Moreover, an enhancement in mtDNA content was observed. This increase was accompanied by an up-regulation of SIRT1 protein levels, a decrease in PGC-1α acetylation and up-regulation of the transcription factors Nuclear respiratory factor 1 (Nrf1) and Mitochondrial transcription factor (Tfam). Enhanced oxygen consumption and increased levels of the fatty acid oxidation enzymes Carnitine palmitoyl transferase 1 and Acyl-coenzyme A oxidase (CPT-1 and ACOX) were also observed. Mitochondria from α-Lip-treated adipocytes exhibited some morphological characteristics of brown mitochondria, and α-Lip also induced up-regulation of some brown/beige adipocyte markers such as cell death-inducing DFFA-like effector a (Cidea) and T-box 1 (Tbx1). Moreover, α-Lip up-regulated PR domain containing 16 (Prdm16) mRNA levels in treated adipocytes. Therefore, our study suggests the ability of α-Lip to promote mitochondrial biogenesis and brown-like remodeling in cultured white subcutaneous adipocytes from overweight/obese donors.

  20. Progressive multiple sequence alignments from triplets

    Directory of Open Access Journals (Sweden)

    Stadler Peter F

    2007-07-01

    Background: The quality of progressive sequence alignments strongly depends on the accuracy of the individual pairwise alignment steps, since gaps that are introduced at one step cannot be removed at later aggregation steps. Adjacent insertions and deletions necessarily appear in arbitrary order in pairwise alignments and hence form an unavoidable source of errors. Research: Here we present a modified variant of progressive sequence alignments that addresses both issues. Instead of pairwise alignments we use exact dynamic programming to align sequence or profile triples. This avoids a large fraction of the ambiguities arising in pairwise alignments. In the subsequent aggregation steps we follow the logic of the Neighbor-Net algorithm, which constructs a phylogenetic network by stepwise replacement of triples by pairs instead of combining pairs to singletons. To this end the three-way alignments are subdivided into two partial alignments, at which stage all-gap columns are naturally removed. This alleviates the "once a gap, always a gap" problem of progressive alignment procedures. Conclusion: The three-way Neighbor-Net based alignment program aln3nn is shown to compare favorably on both protein sequences and nucleic acid sequences to other progressive alignment tools. In the latter case one can easily include scoring terms that consider secondary structure features. Overall, the quality of the resulting alignments in general exceeds that of clustalw or other multiple alignment tools, even though our software does not include heuristics for context-dependent (mis)match scores.
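
    The step in which a three-way alignment is subdivided into partial alignments and all-gap columns disappear can be sketched as below. This is a simplified illustration on plain sequences, not the aln3nn implementation; the function name and the toy alignment are hypothetical.

```python
def project(alignment, keep, gap="-"):
    """Restrict an alignment to the rows in `keep` and drop all-gap columns.

    alignment: list of equal-length aligned strings.
    keep:      indices of the rows that form the partial alignment.
    """
    rows = [alignment[i] for i in keep]
    # Keep a column only if at least one selected row has a residue there.
    columns = [col for col in zip(*rows) if any(ch != gap for ch in col)]
    if not columns:
        return ["" for _ in keep]
    return ["".join(chars) for chars in zip(*columns)]


# Toy three-way alignment (made-up sequences).
three_way = ["AC-GT",
             "A--GT",
             "ACTG-"]

first_pair = project(three_way, keep=(0, 1))
second_pair = project(three_way, keep=(1, 2))
print(first_pair, second_pair)
```

    In the first projection the third column contains only gaps once the third sequence is left out, so it is removed; a gap placed to accommodate one sequence therefore does not have to persist in every later partial alignment.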

  1. [Analysis on the impact of composting with different proportions of corn stalks and pig manure on humic acid fractions and IR spectral feature].

    Science.gov (United States)

    Sun, Xiang-Ping; Li, Guo-Xue; Xiao, Ai-Ping; Shi, Hong; Wang, Yi-Ming; Li, Yang-Yang

    2014-09-01

    Pig manure and corn straw were used as raw materials for high-temperature composting under three treatments: C/N 15, C/N 25 and C/N 35. The composting period was 120 days, including 30 days of forced continuous ventilation. Samples taken on days 0, 22, 30, 60, 90 and 120 were analyzed by elemental analysis and IR spectroscopy to study the effect of different lignin contents on the composition and molecular structure of compost humic acid (HA). The results showed that the change in the C content of compost humic acid was concentrated in the first 30 days, while after composting the O/C of compost HA increased, H/C decreased, and N content increased. The low C/N ratio (15) and the high C/N ratio (35) gave a higher degree of oxidation in compost HA than C/N 25. FTIR indicated that the shapes of the infrared spectra for the treatments with different lignin content were similar during the composting process, but the peak intensities were clearly different. The results showed that the composting stage is more conducive to enhancing the aromaticity of compost HA. After composting, C/N 15 had fewer polysaccharide and fat components and more aromatic structural components in compost HA than C/N 25 and 35. In addition, the compost HA of C/N 15 had a higher degree of humification and a more stable structure.

  2. Modular Architecture and Unique Teichoic Acid Recognition Features of Choline-Binding Protein L (CbpL) Contributing to Pneumococcal Pathogenesis

    Science.gov (United States)

    Gutiérrez-Fernández, Javier; Saleh, Malek; Alcorlo, Martín; Gómez-Mejía, Alejandro; Pantoja-Uceda, David; Treviño, Miguel A.; Voß, Franziska; Abdullah, Mohammed R.; Galán-Bartual, Sergio; Seinen, Jolien; Sánchez-Murcia, Pedro A.; Gago, Federico; Bruix, Marta; Hammerschmidt, Sven; Hermoso, Juan A.

    2016-12-01

    The human pathogen Streptococcus pneumoniae is decorated with a special class of surface proteins known as choline-binding proteins (CBPs) attached to phosphorylcholine (PCho) moieties from cell-wall teichoic acids. By a combination of X-ray crystallography, NMR, molecular dynamics techniques and in vivo virulence and phagocytosis studies, we provide structural information on choline-binding protein L (CbpL) and demonstrate its impact on pneumococcal pathogenesis and immune evasion. CbpL is a very elongated three-module protein composed of (i) an Excalibur Ca2+-binding domain (reported in this work for the very first time), (ii) an unprecedented anchorage module showing alternate disposition of canonical and non-canonical choline-binding sites that allows vine-like binding of fully PCho-substituted teichoic acids (with two choline moieties per unit), and (iii) a Ltp_Lipoprotein domain. Our structural and infection assays indicate an important role of the whole multimodular protein, allowing it both to locate CbpL at specific places on the cell wall and to interact with host components in order to facilitate pneumococcal lung infection and transmigration from the nasopharynx to the lungs and blood. The implication of CbpL in both resistance against killing by phagocytes and pneumococcal pathogenesis further postulates this surface protein as relevant among the pathogenic arsenal of the pneumococcus.

  3. Crystal Structures of Xanthomonas campestris OleA Reveal Features That Promote Head-to-Head Condensation of Two Long-Chain Fatty Acids

    Energy Technology Data Exchange (ETDEWEB)

    Goblirsch, Brandon R.; Frias, Janice A.; Wackett, Lawrence P.; Wilmot, Carrie M. (UMM)

    2012-10-25

    OleA is a thiolase superfamily enzyme that has been shown to catalyze the condensation of two long-chain fatty acyl-coenzyme A (CoA) substrates. The enzyme is part of a larger gene cluster responsible for generating long-chain olefin products, a potential biofuel precursor. In thiolase superfamily enzymes, catalysis is achieved via a ping-pong mechanism. The first substrate forms a covalent intermediate with an active site cysteine that is followed by reaction with the second substrate. For OleA, this conjugation proceeds by a nondecarboxylative Claisen condensation. The OleA from Xanthomonas campestris has been crystallized and its structure determined, along with inhibitor-bound and xenon-derivatized structures, to improve our understanding of substrate positioning in the context of enzyme turnover. OleA is the first characterized thiolase superfamily member that has two long-chain alkyl substrates that need to be bound simultaneously and therefore uniquely requires an additional alkyl binding channel. The location of the fatty acid biosynthesis inhibitor, cerulenin, that possesses an alkyl chain length in the range of known OleA substrates, in conjunction with a single xenon binding site, leads to the putative assignment of this novel alkyl binding channel. Structural overlays between the OleA homologues, 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA) synthase and the fatty acid biosynthesis enzyme FabH, allow assignment of the two remaining channels: one for the thioester-containing pantetheinate arm and the second for the alkyl group of one substrate. A short β-hairpin region is ordered in only one of the crystal forms, and that may suggest open and closed states relevant for substrate binding. Cys143 is the conserved catalytic cysteine within the superfamily, and the site of alkylation by cerulenin. The alkylated structure suggests that a glutamic acid residue (Glu117β) likely promotes Claisen condensation by acting as the catalytic base. Unexpectedly

  4. Crystal Structures of Xanthomonas campestris OleA Reveal Features That Promote Head-to-Head Condensation of Two Long-Chain Fatty Acids

    Energy Technology Data Exchange (ETDEWEB)

    Goblirsch, BR; Frias, JA; Wackett, LP; Wilmot, CM

    2012-05-22

    OleA is a thiolase superfamily enzyme that has been shown to catalyze the condensation of two long-chain fatty acyl-coenzyme A (CoA) substrates. The enzyme is part of a larger gene cluster responsible for generating long-chain olefin products, a potential biofuel precursor. In thiolase superfamily enzymes, catalysis is achieved via a ping-pong mechanism. The first substrate forms a covalent intermediate with an active site cysteine that is followed by reaction with the second substrate. For OleA, this conjugation proceeds by a nondecarboxylative Claisen condensation. The OleA from Xanthomonas campestris has been crystallized and its structure determined, along with inhibitor-bound and xenon-derivatized structures, to improve our understanding of substrate positioning in the context of enzyme turnover. OleA is the first characterized thiolase superfamily member that has two long-chain alkyl substrates that need to be bound simultaneously and therefore uniquely requires an additional alkyl binding channel. The location of the fatty acid biosynthesis inhibitor, cerulenin, that possesses an alkyl chain length in the range of known OleA substrates, in conjunction with a single xenon binding site, leads to the putative assignment of this novel alkyl binding channel. Structural overlays between the OleA homologues, 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA) synthase and the fatty acid biosynthesis enzyme FabH, allow assignment of the two remaining channels: one for the thioester-containing pantetheinate arm and the second for the alkyl group of one substrate. A short β-hairpin region is ordered in only one of the crystal forms, and that may suggest open and closed states relevant for substrate binding. Cys143 is the conserved catalytic cysteine within the superfamily, and the site of alkylation by cerulenin. The alkylated structure suggests that a glutamic acid residue (Glu117β) likely promotes Claisen condensation by acting as the catalytic base. Unexpectedly, Glu117

  5. Vibrational analysis of amino acids and short peptides in aqueous media. V. The effect of the disulfide bridge on the structural features of the peptide hormone somatostatin-14.

    Science.gov (United States)

    Hernández, Belén; Carelli, Claude; Coïc, Yves-Marie; De Coninck, Joël; Ghomi, Mahmoud

    2009-09-24

    To emphasize the role played by the S-S bridge in the structural features of somatostatin-14 (SST-14), newly recorded CD and Raman spectra of this cyclic peptide and its open analogue obtained by Cys→Ser substitution are presented. CD spectra of both peptides recorded in aqueous solutions in the 100-500 μM concentration range are strikingly similar. They reveal principally that random conformers constitute the major population in both peptides. Consequently, the S-S bridge has no structuring effect at submillimolar concentrations. In methanol, the CD spectrum of somatostatin-14 keeps globally the same spectral shape as that observed in water, whereas its open analogue presents a major population of helical conformers. Raman spectra recorded as a function of peptide concentration (5-20 mM) and also in the presence of 150 mM NaCl provide valuable conformational information. All Raman spectra present a mixture of random and β-hairpin structures for both cyclic and open peptides. More importantly, the presence or the absence of the disulfide bridge does not seem to influence considerably different populations of secondary structures within this range of concentrations. CD and Raman data obtained in the submillimolar and millimolar ranges of concentrations, respectively, lead us to accept the idea that SST-14 monomers aggregate upon increasing concentration, thus stabilizing β-hairpin conformations in solution. However, even at high concentrations, random conformers do not disappear. Raman spectra of SST-14 also reveal a concentration effect on the flexibility of the S-S linkage and consequently on that of its cyclic part. In conclusion, although the disulfide linkage does not seem to markedly influence the SST-14 conformational features in aqueous solutions, its presence seems to be necessary to ensure the flexibility of the cyclic part of this peptide and to maintain its closed structure in lower dielectric constant environments.

  6. X-ray studies of crystalline complexes involving amino acids and peptides. XLIV. Invariant features of supramolecular association and chiral effects in the complexes of arginine and lysine with tartaric acid.

    Science.gov (United States)

    Selvaraj, M; Thamotharan, S; Roy, Siddhartha; Vijayan, M

    2007-06-01

    The tartaric acid complexes with arginine and lysine exhibit two stoichiometries depending upon the ionization state of the anion. The structures reported here are DL-argininium DL-hydrogen tartrate, bis(L-argininium) L-tartrate, bis(DL-lysinium) DL-tartrate monohydrate, L-lysinium D-hydrogen tartrate and L-lysinium L-hydrogen tartrate. During crystallization, L-lysine preferentially interacts with D-tartaric acid to form a complex when DL-tartaric acid is used in the experiment. The anions and the cations aggregate into separate alternating layers in four of the five complexes. In bis(L-argininium) L-tartrate, the amino acid layers are interconnected by individual tartrate ions which do not interact among themselves. The aggregation of argininium ions in the DL- and the L-arginine complexes is remarkably similar, which is in turn similar to those observed in other dicarboxylic acid complexes of arginine. Thus, argininium ions have a tendency to assume similar patterns of aggregation, which are largely unaffected by a change in the chemistry of partner molecules such as the introduction of hydroxyl groups or a change in chirality or stoichiometry. On the contrary, the lysinium ions exhibit fundamentally different aggregation patterns in the DL-DL complexes on the one hand and L-D and L-L complexes on the other. Interestingly, the pattern in the L-D complex is similar to that in the L-L complex. The lysinium ions in the DL-DL complex exhibit an aggregation pattern similar to those observed in the DL-lysine complexes involving other dicarboxylic acids. Thus, the effect of change in the chirality of a subset of the component complexes could be profound or marginal, in an unpredictable manner. The relevant crystal structures appear to indicate that the preference of L-lysine for D-tartaric acid is perhaps caused by chiral discrimination resulting from the amplification of a small energy difference.

  7. Use of acid hydrolysis for preparation of porous starch and analysis of its properties

    Institute of Scientific and Technical Information of China (English)

    李梅; 刘可志; 徐雅雯; 孙平

    2012-01-01

    Porous starch was prepared from corn starch using hydrochloric acid solutions of different concentrations or a compound enzyme under defined conditions. The morphology and some properties of the prepared porous starch were determined, and the water absorption and oil absorption of the two products were compared. The results showed that porous starch prepared at a solid-to-liquid ratio of 1:4, a hydrochloric acid concentration of 8%, a temperature of 45 °C and a reaction time of 20 h was similar in water and oil absorption to that made by enzymatic hydrolysis, and that these properties were greatly improved relative to native starch. SEM showed that the surface of porous starch made by enzymatic hydrolysis was covered with honeycomb-like holes and pits, whereas the granules produced by acid hydrolysis were irregular, and the effect was poorer than with enzymatic hydrolysis. Compared with native starch, the gelatinization onset temperature of the acid-prepared porous starch was slightly higher, but its peak viscosity was greatly reduced. Acid hydrolysis is simple and cheaper than enzymatic hydrolysis, and therefore has value for industrial production.

  8. Adsorption Features of Cadmium by Humic Acid in Alkaline Conditions

    Institute of Scientific and Technical Information of China (English)

    曾祥峰; 王祖伟; 魏树和; 于晓曼

    2014-01-01

    To investigate the mechanism of cadmium adsorption by humic acid under alkaline conditions, and to understand the relationship between cadmium contamination mechanisms and the ecological environment in alkaline saline soils, the characteristics of cadmium adsorption on humic acid under alkaline conditions were studied experimentally. Batch adsorption tests were used to examine the effects of initial Cd concentration, reaction time, pH and ionic strength on the adsorption of cadmium by humic acid. The results indicated that humic acid has a strong capacity to adsorb cadmium, and that the isothermal adsorption process is well fitted by the Langmuir and Temkin adsorption models (r = 0.9809 and 0.9816, respectively). In the fast reaction stage within the first 60 min and the slow reaction stage between 60 min and 6 h, the amounts of cadmium adsorbed by humic acid were 2.895 mg·g⁻¹ and 3.342 mg·g⁻¹, respectively; the kinetics over the 6 h before adsorption equilibrium are well fitted by the Elovich equation (r = 0.9285). As pH increased, the adsorption rate rose gradually, with pH 4.5 and 8.5 as boundaries: it increased quickly at both ends and slowly in between. At lower ionic strength, increasing ionic strength promoted cadmium adsorption by humic acid, whereas at high ionic strength the opposite trend was observed. Under the same conditions, the effect of the different salts on cadmium adsorption by humic acid followed the order calcium chloride > magnesium chloride > potassium chloride > sodium chloride. During soil salinization, the increase in inorganic salt concentration increases the bioavailability of heavy metal ions and raises their ecological risk.
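
    For readers unfamiliar with the two isotherm models named in this record, the sketch below evaluates their usual textbook forms. The parameter names and the numerical values are hypothetical placeholders chosen for illustration; they are not fitted values from the study, which reports only correlation coefficients here.

```python
import math


def langmuir(c_eq, q_max, k_l):
    """Langmuir isotherm: q = q_max * K_L * C / (1 + K_L * C).

    c_eq:  equilibrium concentration of the solute.
    q_max: monolayer adsorption capacity (hypothetical value below).
    k_l:   Langmuir affinity constant (hypothetical value below).
    """
    return q_max * k_l * c_eq / (1.0 + k_l * c_eq)


def temkin(c_eq, a_t, b_t):
    """Temkin isotherm in the form q = B * ln(A * C), valid for C > 0."""
    return b_t * math.log(a_t * c_eq)


# Placeholder parameters, for illustration only.
Q_MAX, K_L = 4.0, 0.5
half_load = langmuir(1.0 / K_L, Q_MAX, K_L)      # uptake at C = 1 / K_L
near_saturation = langmuir(1.0e6, Q_MAX, K_L)    # very high concentration
print(half_load, near_saturation)
```

    The Langmuir curve reaches half of its capacity at C = 1/K_L and levels off at q_max, whereas the Temkin form grows logarithmically with concentration and has no plateau.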

  9. Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

    Science.gov (United States)

    Benyo, B; Biro, J C; Benyo, Z

    2004-01-01

    The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2nd) base in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes, and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (as of December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
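
    As an illustration of the role of the middle codon position, the following minimal sketch builds the standard genetic code and groups amino acids by the second base of their codons. It is not the authors' periodic table; the names `CODON_TABLE` and `amino_acids_by_middle_base` are hypothetical, and the sketch assumes the standard genetic code (NCBI translation table 1).

```python
from collections import defaultdict

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def amino_acids_by_middle_base():
    """Group one-letter amino acid codes by the second base of their codons."""
    groups = defaultdict(set)
    for codon, amino_acid in CODON_TABLE.items():
        groups[codon[1]].add(amino_acid)
    return {base: sorted(members) for base, members in groups.items()}


if __name__ == "__main__":
    for base, members in amino_acids_by_middle_base().items():
        print(base, " ".join(members))
```

    In this grouping, every codon with T in the middle position encodes a hydrophobic residue (F, I, L, M or V), which is one simple way to see how the second base tracks a biochemical property.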

  10. Featuring animacy

    Directory of Open Access Journals (Sweden)

    Elizabeth Ritter

    2015-01-01

    Algonquian languages are famous for their animacy-based grammatical properties: an animacy-based noun classification system and a direct/inverse system which gives rise to animacy hierarchy effects in the determination of verb agreement. In this paper I provide new evidence for the proposal that the distinctive properties of these languages are due to the use of participant-based features, rather than spatio-temporal ones, for both nominal and verbal functional categories (Ritter & Wiltschko 2009, 2014). Building on Wiltschko (2012), I develop a formal treatment of the Blackfoot aspectual system that assumes a category Inner Aspect (cf. MacDonald 2008, Travis 1991, 2010). Focusing on lexical aspect in Blackfoot, I demonstrate that the classification of both nouns (Seinsarten) and verbs (Aktionsarten) is based on animacy, rather than boundedness, resulting in a strikingly different aspectual system for both categories.

  11. Chaos game representation (CGR)-walk model for DNA sequences

    Institute of Scientific and Technical Information of China (English)

    Gao Jie; Xu Zhen-Yuan

    2009-01-01

    Chaos game representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to determine the coordinates of their positions in a continuous space. This distribution of positions has two features: it is unique, and the source sequence can be recovered from the coordinates, so that the distance between positions may serve as a measure of similarity between the corresponding sequences. A CGR-walk model based on CGR coordinates is proposed for DNA sequences. The CGR coordinates are converted into a time series, and a long-memory ARFIMA(p, d, q) model, where ARFIMA stands for autoregressive fractionally integrated moving average, is introduced into DNA sequence analysis. This model is applied to simulating real CGR-walk sequence data of ten genomic sequences. Remarkably long-range correlations are uncovered in the data, and the ARFIMA(p, d, q) model fits these data reasonably well.
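
    The iterative mapping can be sketched as follows. This is a minimal illustration of CGR coordinates only, not the authors' CGR-walk or ARFIMA procedure. The corner assignment (A, C, G, T at the corners of the unit square) and the starting point (1/2, 1/2) are assumptions following a common convention; the abstract does not state them. The function names are hypothetical. Exact fractions are used so that the source sequence can always be recovered from the coordinates.

```python
from fractions import Fraction

# Assumed corner assignment on the unit square (a common convention).
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}

# Each corner owns one quadrant of the square: (x > 1/2, y > 1/2) -> base.
QUADRANT_TO_BASE = {(cx > 0, cy > 0): base for base, (cx, cy) in CORNERS.items()}

HALF = Fraction(1, 2)


def cgr_coordinates(sequence):
    """Return the CGR point reached after each nucleotide of the sequence."""
    x, y = HALF, HALF
    points = []
    for base in sequence.upper():
        if base not in CORNERS:
            continue  # skip ambiguous symbols such as N
        cx, cy = CORNERS[base]
        # Move halfway from the current point toward the corner of this base.
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def recover_sequence(points):
    """Read each base back from the quadrant in which its point lies."""
    return "".join(QUADRANT_TO_BASE[(x > HALF, y > HALF)] for x, y in points)


if __name__ == "__main__":
    pts = cgr_coordinates("GATTACA")
    print([(float(x), float(y)) for x, y in pts])
    print(recover_sequence(pts))
```

    Each step halves the distance to a corner, so every point falls in the quadrant of the base that produced it; that property is what makes the sequence recoverable.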

  12. Molecular cloning of a beta-glucan pattern-recognition lipoprotein from the white shrimp Penaeus (Litopenaeus) vannamei: correlations between the deduced amino acid sequence and the native protein structure.

    Science.gov (United States)

    Romo-Figueroa, María Gabriela; Vargas-Requena, Claudia; Sotelo-Mundo, Rogerio R; Vargas-Albores, Francisco; Higuera-Ciapara, Inocencio; Söderhäll, Kenneth; Yepiz-Plascencia, Gloria

    2004-06-01

    The hemolymph pattern-recognition beta-glucan binding protein from the white shrimp Penaeus (Litopenaeus) vannamei is also a high density lipoprotein (betaGBP-HDL) involved in innate immunity. The betaGBP-HDL full-length cDNA sequence determined was 6.3 kb long and contains a long 3'UTR region with a polyadenylation signal and a poly-A+ tail. The open reading frame is 1454 amino acids long, and the N-terminal residue of the mature protein is located at position 198 of the ORF. Comparison of the betaGBP-HDL amino acid sequence against GenBank detected significant similarity only to betaGBP from the crayfish Pacifastacus leniusculus. betaGBP-HDL is expressed in hepatopancreas, muscle, pleopods and gills, but not in hemocytes, as determined by RT-PCR. We discuss the analysis of the deduced primary sequence in terms of the predicted secondary structure and the glucanase-like and RGD motifs relevant to its dual roles in defence and lipid transport.

  13. Co-registered positron emission tomography/computed tomography and gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid magnetic resonance imaging features of multiple angiosarcoma of the liver.

    Science.gov (United States)

    Kamatani, Takashi; Iguchi, Hiroyoshi; Okada, Takemichi; Yamazaki, Hitoshi; Tsunoda, Hidekazu; Watanabe, Masaaki; Oda, Masaya; Ohbu, Makoto; Yokomori, Hiroaki

    2014-10-01

    Hepatic angiosarcoma is a very rare disease, accounting for only 2% of primary liver malignancies. An 82-year-old man was admitted to our hospital because of jaundice and weight loss. Computed tomography (CT) and magnetic resonance imaging (MRI) showed diffuse and multiple space-occupying lesions. On gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid (Gd-EOB-DTPA)-enhanced MRI, the tumor was not enhanced intensely in the arterial phase following contrast injection, and was then gradually enhanced homogeneously. In the delayed phase and hepatobiliary phase, the tumor was completely washed out. Whole-body 18F-fluorodeoxyglucose positron emission tomography (FDG-PET)/CT fusion scanning confirmed metabolic activity with a maximum uptake value of 3.64 in the lesions. A liver biopsy showed spindle-shaped tumor cells proliferating along sinusoids, with elongated and hyperchromatic nuclei. Immunohistochemical studies showed tumor cells positive for von Willebrand factor and CD34. These findings were consistent with angiosarcoma of the liver. This case report is the first description of co-registered FDG-PET/CT images and Gd-EOB-DTPA-enhanced MRI of primary hepatic angiosarcoma.

  14. Enhanced virome sequencing using targeted sequence capture.

    Science.gov (United States)

    Wylie, Todd N; Wylie, Kristine M; Herter, Brandi N; Storch, Gregory A

    2015-12-01

    Metagenomic shotgun sequencing (MSS) is an important tool for characterizing viral populations. It is culture independent, requires no a priori knowledge of the viruses in the sample, and may provide useful genomic information. However, MSS can lack sensitivity and may yield insufficient data for detailed analysis. We have created a targeted sequence capture panel, ViroCap, designed to enrich nucleic acid from DNA and RNA viruses from 34 families that infect vertebrate hosts. A computational approach condensed ∼1 billion bp of viral reference sequence into <200 million bp of unique, representative sequence suitable for targeted sequence capture. We compared the effectiveness of detecting viruses in standard MSS versus MSS following targeted sequence capture. First, we analyzed two sets of samples, one derived from samples submitted to a diagnostic virology laboratory and one derived from samples collected in a study of fever in children. We detected 14 and 18 viruses in the two sets, comprising 19 genera from 10 families, with dramatic enhancement of genome representation following capture enrichment. The median fold-increases in percentage viral reads post-capture were 674 and 296. Median breadth of coverage increased from 2.1% to 83.2% post-capture in the first set and from 2.0% to 75.6% in the second set. Next, we analyzed samples containing a set of diverse anellovirus sequences and demonstrated that ViroCap could be used to detect viral sequences with up to 58% variation from the references used to select capture probes. ViroCap substantially enhances MSS for a comprehensive set of viruses and has utility for research and clinical applications.

  15. Cloning of an Erwinia herbicola gene necessary for gluconic acid production and enhanced mineral phosphate solubilization in Escherichia coli HB101: nucleotide sequence and probable involvement in biosynthesis of the coenzyme pyrroloquinoline quinone.

    Science.gov (United States)

    Liu, S T; Lee, L Y; Tai, C Y; Hung, C H; Chang, Y S; Wolfram, J H; Rogers, R; Goldstein, A H

    1992-09-01

    Escherichia coli is capable of synthesizing the apo-glucose dehydrogenase enzyme (GDH) but not the cofactor pyrroloquinoline quinone (PQQ), which is essential for formation of the holoenzyme. Therefore, in the absence of exogenous PQQ, E. coli does not produce gluconic acid. Evidence is presented to show that the expression of an Erwinia herbicola gene in E. coli HB101(pMCG898) resulted in the production of gluconic acid, which, in turn, implied PQQ biosynthesis. Transposon mutagenesis showed that the essential gene or locus was within a 1.8-kb region of a 4.5-kb insert of the plasmid pMCG898. This 1.8-kb region contained only one apparent open reading frame. In this paper, we present the nucleotide sequence of this open reading frame, a 1,134-bp DNA fragment coding for a protein with an M(r) of 42,160. The deduced sequence of this protein had a high degree of homology with that of gene III (M(r), 43,600) of a PQQ synthase gene complex from Acinetobacter calcoaceticus previously identified by Goosen et al. (J. Bacteriol. 171:447-455, 1989). In minicell analysis, pMCG898 encoded a protein with an M(r) of 41,000. These data indicate that E. coli HB101(pMCG898) produced the GDH-PQQ holoenzyme, which, in turn, catalyzed the oxidation of glucose to gluconic acid in the periplasmic space. As a result of the gluconic acid production, E. coli HB101(pMCG898) showed an enhanced mineral phosphate-solubilizing phenotype due to acid dissolution of the hydroxyapatite substrate.

  16. Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE – binding linear epitopes of allergens

    Directory of Open Access Journals (Sweden)

    Peijnenburg Ad ACM

    2002-12-01

    Background: Transgenic proteins expressed by genetically modified food crops are evaluated for their potential allergenic properties prior to marketing, among other means by identifying short identical amino acid sequences that occur in both the transgenic protein and allergenic proteins. A strategy is proposed in which the positive outcomes of the sequence comparison with a minimal length of six amino acids are further screened for the presence of potential linear IgE-epitopes. This double-track approach involves the use of literature data on IgE-epitopes and an antigenicity prediction algorithm. Results: Thirty-three transgenic proteins have been screened for identities of at least six contiguous amino acids shared with allergenic proteins. Twenty-two transgenic proteins showed positive results of six or seven contiguous amino acids in length. Only a limited number of identical stretches shared by transgenic proteins (papaya ringspot virus coat protein, acetolactate synthase GH50, and glyphosate oxidoreductase) and allergenic proteins could be identified as (part of) potential linear epitopes. Conclusion: Many transgenic proteins have identical stretches of six or seven amino acids in common with allergenic proteins. Most identical stretches are likely to be false positives. As shown in this study, identical stretches can be further screened for relevance by comparison with linear IgE-binding epitopes described in the literature. In the absence of literature data on epitopes, computer-based antigenicity prediction helps to select potential antibody binding sites that will need verification of IgE binding by sera binding tests. Finally, the positive outcomes of this approach warrant further clinical testing for potential allergenicity.

  17. Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE – binding linear epitopes of allergens

    Science.gov (United States)

    Kleter, Gijs A; Peijnenburg, Ad ACM

    2002-01-01

    Background: Transgenic proteins expressed by genetically modified food crops are evaluated for their potential allergenic properties prior to marketing, among other means by identifying short identical amino acid sequences that occur in both the transgenic protein and allergenic proteins. A strategy is proposed in which the positive outcomes of the sequence comparison with a minimal length of six amino acids are further screened for the presence of potential linear IgE-epitopes. This double-track approach involves the use of literature data on IgE-epitopes and an antigenicity prediction algorithm. Results: Thirty-three transgenic proteins have been screened for identities of at least six contiguous amino acids shared with allergenic proteins. Twenty-two transgenic proteins showed positive results of six or seven contiguous amino acids in length. Only a limited number of identical stretches shared by transgenic proteins (papaya ringspot virus coat protein, acetolactate synthase GH50, and glyphosate oxidoreductase) and allergenic proteins could be identified as (part of) potential linear epitopes. Conclusion: Many transgenic proteins have identical stretches of six or seven amino acids in common with allergenic proteins. Most identical stretches are likely to be false positives. As shown in this study, identical stretches can be further screened for relevance by comparison with linear IgE-binding epitopes described in the literature. In the absence of literature data on epitopes, computer-based antigenicity prediction helps to select potential antibody binding sites that will need verification of IgE binding by sera binding tests. Finally, the positive outcomes of this approach warrant further clinical testing for potential allergenicity. PMID:12477382
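
    The first step of the strategy, finding stretches of at least six contiguous identical amino acids shared by two proteins, can be sketched as below. This is a hedged illustration, not the authors' software; the function name `shared_stretches` and the two toy sequences are hypothetical and do not come from the study.

```python
def kmers(sequence, k):
    """Return the set of all contiguous stretches of length k in a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def shared_stretches(protein_a, protein_b, k=6):
    """Return stretches of k contiguous residues present in both proteins."""
    return kmers(protein_a.upper(), k) & kmers(protein_b.upper(), k)


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    transgenic = "MKTAYIAKQR"
    allergen = "GGTAYIAKLL"
    print(sorted(shared_stretches(transgenic, allergen)))
```

    A hit from this step only says that an identical stretch exists; as the abstract notes, most such hits are likely false positives and need the second screening step against known or predicted linear IgE epitopes.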

  18. Cloning and Sequence Analysis of Y-box Binding Protein Gene in Min Pig

    Institute of Scientific and Technical Information of China (English)

    Zhang Dong-jie; Liu Di; Wang Liang; He Xin-miao; Wang Wen-tao

    2014-01-01

    To study the Y-box binding protein (YB-1) gene of the Min pig, its complete coding sequence was cloned by RT-PCR, and the sequence features were analyzed with software and online tools. The results showed that the complete CDS of Min pig YB-1 was 975 bp long, encoding 324 amino acids. It contained a conserved cold shock domain and several phosphorylation sites but had no transmembrane domains, consistent with a protein found in the cytoplasm. The Min pig YB-1 nucleotide sequence shared high similarity (61.37%-97.66%) with those of other mammals.
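
    The two reported lengths agree if the 975 bp CDS is assumed to include the terminating stop codon, which the abstract does not state explicitly: 975 / 3 = 325 codons, one of which is the stop, leaving 324 amino acids. A small sketch of that check, with a hypothetical function name:

```python
def protein_length_from_cds(cds_bp, includes_stop=True):
    """Number of amino acids encoded by a CDS of the given length in bp.

    Assumes the CDS is in frame; if includes_stop is True, the final
    codon is taken to be a stop codon and is not counted.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    print(protein_length_from_cds(975))
```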

  19. Local amino acid sequence patterns dominate the heterogeneous phenotype for the collagen connective tissue disease Osteogenesis Imperfecta resulting from Gly mutations.

    Science.gov (United States)

    Xiao, Jianxi; Yang, Zhangfu; Sun, Xiuxia; Addabbo, Rayna; Baum, Jean

    2015-10-01

    Osteogenesis Imperfecta (OI), a hereditary connective tissue disease in collagen that arises from a single Gly → X mutation in the collagen chain, varies widely in phenotype from perinatal lethal to mild. It is unclear why there is such a large variation in the severity of the disease considering the repeating (Gly-X-Y)n sequence and the uniform rod-like structure of collagen. We systematically evaluate the effect of the local (Gly-X-Y)n sequence around the mutation site on OI phenotype using integrated bio-statistical approaches, including odds ratio analysis and decision tree modeling. We show that different Gly → X mutations have different local sequence patterns that are correlated with lethal and nonlethal phenotypes, providing a mechanism for understanding the sensitivity of local context in defining lethal and nonlethal OI. A number of important trends about which factors are related to OI phenotypes are revealed by the bio-statistical analyses; most striking is the complementary relationship between the placement of Pro residues and small residues and their correlation to OI phenotype. When Pro is present or small flexible residues are absent near a mutation site, the OI case tends to be lethal; when Pro is present or small flexible residues are absent further away from the mutation site, the OI case tends to be nonlethal. The analysis also reveals the dominant role of local sequence around mutation sites in the Major Ligand Binding Regions that are primarily responsible for collagen binding to its receptors, and shows that nonlethal mutations are highly predicted by local sequence considerations alone, whereas lethal mutations are not as easily predicted and may be a result of more complex interactions. Understanding the sequence determinants of OI mutations will enhance genetic counseling and help establish which steps in the collagen hierarchy to target for drug therapy.

  20. Analysis of technological features of the drying and absorption section in Guixi Smelter's No. 2 sulphuric acid plant

    Institute of Scientific and Technical Information of China (English)

    王希林

    2001-01-01

    The design features of the drying and absorption section of a 1333 t/d sulphuric acid plant based on copper smelter off-gas are described. A high-efficiency design of the drying and absorption towers, which saves investment, was achieved by raising the empty-tower gas velocity and increasing the number of acid distribution points and the spray density. The section uses a cooling-after-pump process with the drying and absorption towers arranged at low elevation, which reduces the energy consumption of the acid pumps. The pump tank is horizontal, which fundamentally solves the problem of erosion and corrosion of the tank by incoming sulphuric acid. The drying tower and the absorption towers share one pump tank; to prevent SO2 from the drying system from carrying over into the absorption system, the drying acid is fed into the acid feed pipe of the intermediate absorption tower. Five control loops are provided for the whole system.