WorldWideScience

Sample records for acid sequence features

  1. Protein location prediction using atomic composition and global features of the amino acid sequence

    Energy Technology Data Exchange (ETDEWEB)

    Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India); Nair, Achuthsankar S. [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India)

    2010-01-22

    Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectively used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.

  2. Extraction of Sequence Conservation Features for the Prioritization of Candidate Single Amino Acid Polymorphisms

    Directory of Open Access Journals (Sweden)

    Jiaxin Wu

    2011-03-01

    Full Text Available Although remarkable success has been achieved by genome-wide association (GWA studies over the past few years, genetic variants discovered in GWA studies can typically account for only a small fraction of heritability of most common diseases. As such, the identification of multiple rare variants that are associated with complex diseases has been receiving more and more attentions. However, most of the recently developed statistical approaches for detecting association of rare variants with diseases require the selection of functional variants before the successive analysis, making an effective bioinformatics method for filtering out non-relevant rare variants indispensible. In this paper, we focus on a specific type of genetic variants called single amino acid polymorphisms (SAAPs. We propose to prioritize candidate SAAPs for a specific disease according to their association scores that are calculated using a guilt-by-association model with a set of features derived from protein sequences. We validate the proposed approach in a systematic way and demonstrate that the proposed model is powerful in distinguishing disease-associated SAAPs for the specific disease of interest.

  3. Classifying Genomic Sequences by Sequence Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hua Liu; Dian Jiao; Xiao Sun

    2005-01-01

    Traditional sequence analysis depends on sequence alignment. In this study, we analyzed various functional regions of the human genome based on sequence features, including word frequency, dinucleotide relative abundance, and base-base correlation. We analyzed the human chromosome 22 and classified the upstream,exon, intron, downstream, and intergenic regions by principal component analysis and discriminant analysis of these features. The results show that we could classify the functional regions of genome based on sequence feature and discriminant analysis.

  4. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    Directory of Open Access Journals (Sweden)

    Selvaraj S

    2011-10-01

    Full Text Available Abstract Background Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching. Results We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix. Conclusions The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.

  5. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    OpenAIRE

    Selvaraj S; Jayaram B; Saranya N; Gromiha M; Fukui Kazuhiko

    2011-01-01

    Abstract Background Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such...

  6. Protein sequence classification using feature hashing.

    Science.gov (United States)

    Caragea, Cornelia; Silvescu, Adrian; Mitra, Prasenjit

    2012-06-21

    Recent advances in next-generation sequencing technologies have resulted in an exponential increase in the rate at which protein sequence data are being acquired. The k-gram feature representation, commonly used for protein sequence classification, usually results in prohibitively high dimensional input spaces, for large values of k. Applying data mining algorithms to these input spaces may be intractable due to the large number of dimensions. Hence, using dimensionality reduction techniques can be crucial for the performance and the complexity of the learning algorithms. In this paper, we study the applicability of feature hashing to protein sequence classification, where the original high-dimensional space is "reduced" by hashing the features into a low-dimensional space, using a hash function, i.e., by mapping features into hash keys, where multiple features can be mapped (at random) to the same hash key, and "aggregating" their counts. We compare feature hashing with the "bag of k-grams" approach. Our results show that feature hashing is an effective approach to reducing dimensionality on protein sequence classification tasks.

  7. Feature-by-feature--evaluating de novo sequence assembly.

    Directory of Open Access Journals (Sweden)

    Francesco Vezzi

    Full Text Available The whole-genome sequence assembly (WGSA problem is among one of the most studied problems in computational biology. Despite the availability of a plethora of tools (i.e., assemblers, all claiming to have solved the WGSA problem, little has been done to systematically compare their accuracy and power. Traditional methods rely on standard metrics and read simulation: while on the one hand, metrics like N50 and number of contigs focus only on size without proportionately emphasizing the information about the correctness of the assembly, comparisons performed on simulated dataset, on the other hand, can be highly biased by the non-realistic assumptions in the underlying read generator. Recently the Feature Response Curve (FRC method was proposed to assess the overall assembly quality and correctness: FRC transparently captures the trade-offs between contigs' quality against their sizes. Nevertheless, the relationship among the different features and their relative importance remains unknown. In particular, FRC cannot account for the correlation among the different features. We analyzed the correlation among different features in order to better describe their relationships and their importance in gauging assembly quality and correctness. In particular, using multivariate techniques like principal and independent component analysis we were able to estimate the "excess-dimensionality" of the feature space. Moreover, principal component analysis allowed us to show how poorly the acclaimed N50 metric describes the assembly quality. Applying independent component analysis we identified a subset of features that better describe the assemblers performances. We demonstrated that by focusing on a reduced set of highly informative features we can use the FRC curve to better describe and compare the performances of different assemblers. Moreover, as a by-product of our analysis, we discovered how often evaluation based on simulated data, obtained with state

  8. Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition.

    Science.gov (United States)

    Zhao, Xiaowei; Ning, Qiao; Ai, Meiyue; Chai, Haiting; Yang, Guifu

    2016-06-01

    As a selective and reversible protein post-translational modification, S-glutathionylation generates mixed disulfides between glutathione (GSH) and cysteine residues, and plays an important role in regulating protein activity, stability, and redox regulation. To fully understand S-glutathionylation mechanisms, identification of substrates and specific S-Glutathionylated sites is crucial. Experimental identification of S-glutathionylated sites is labor-intensive and time consuming, so establishing an effective computational method is much desirable due to their convenient and fast speed. Therefore, in this study, a new bioinformatics tool named SSGlu (Species-Specific identification of Protein S-glutathionylation Sites) was developed to identify species-specific protein S-glutathionylated sites, utilizing support vector machines that combine multiple sequence-derived features with a two-step feature selection. By 5-fold cross validation, the performance of SSGlu was measured with an AUC of 0.8105 and 0.8041 for Homo sapiens and Mus musculus, respectively. Additionally, SSGlu was compared with the existing methods, and the higher MCC and AUC of SSGlu demonstrated that SSGlu was very promising to predict S-glutathionylated sites. Furthermore, a site-specific analysis showed that S-glutathionylation intimately correlated with the features derived from its surrounding sites. The conclusions derived from this study might help to understand more of the S-glutathionylation mechanism and guide the related experimental validation. For public access, SSGlu is freely accessible at http://59.73.198.144:8080/SSGlu/. PMID:27025952

  9. Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition.

    Science.gov (United States)

    Zhao, Xiaowei; Ning, Qiao; Ai, Meiyue; Chai, Haiting; Yang, Guifu

    2016-06-01

    As a selective and reversible protein post-translational modification, S-glutathionylation generates mixed disulfides between glutathione (GSH) and cysteine residues, and plays an important role in regulating protein activity, stability, and redox regulation. To fully understand S-glutathionylation mechanisms, identification of substrates and specific S-Glutathionylated sites is crucial. Experimental identification of S-glutathionylated sites is labor-intensive and time consuming, so establishing an effective computational method is much desirable due to their convenient and fast speed. Therefore, in this study, a new bioinformatics tool named SSGlu (Species-Specific identification of Protein S-glutathionylation Sites) was developed to identify species-specific protein S-glutathionylated sites, utilizing support vector machines that combine multiple sequence-derived features with a two-step feature selection. By 5-fold cross validation, the performance of SSGlu was measured with an AUC of 0.8105 and 0.8041 for Homo sapiens and Mus musculus, respectively. Additionally, SSGlu was compared with the existing methods, and the higher MCC and AUC of SSGlu demonstrated that SSGlu was very promising to predict S-glutathionylated sites. Furthermore, a site-specific analysis showed that S-glutathionylation intimately correlated with the features derived from its surrounding sites. The conclusions derived from this study might help to understand more of the S-glutathionylation mechanism and guide the related experimental validation. For public access, SSGlu is freely accessible at http://59.73.198.144:8080/SSGlu/.

  10. Coding visual features extracted from video sequences.

    Science.gov (United States)

    Baroffio, Luca; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

    2014-05-01

    Visual features are successfully exploited in several applications (e.g., visual search, object recognition and tracking, etc.) due to their ability to efficiently represent image content. Several visual analysis tasks require features to be transmitted over a bandwidth-limited network, thus calling for coding techniques to reduce the required bit budget, while attaining a target level of efficiency. In this paper, we propose, for the first time, a coding architecture designed for local features (e.g., SIFT, SURF) extracted from video sequences. To achieve high coding efficiency, we exploit both spatial and temporal redundancy by means of intraframe and interframe coding modes. In addition, we propose a coding mode decision based on rate-distortion optimization. The proposed coding scheme can be conveniently adopted to implement the analyze-then-compress (ATC) paradigm in the context of visual sensor networks. That is, sets of visual features are extracted from video frames, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast to the traditional compress-then-analyze (CTA) paradigm, in which video sequences acquired at a node are compressed and then sent to a central unit for further processing. In this paper, we compare these coding paradigms using metrics that are routinely adopted to evaluate the suitability of visual features in the context of content-based retrieval, object recognition, and tracking. Experimental results demonstrate that, thanks to the significant coding gains achieved by the proposed coding scheme, ATC outperforms CTA with respect to all evaluation metrics. PMID:24818244

  11. Feature-based Image Sequence Compression Coding

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    A novel compressing method for video teleconference applications is presented. Semantic-based coding based on human image feature is realized, where human features are adopted as parameters. Model-based coding and the concept of vector coding are combined with the work on image feature extraction to obtain the result.

  12. Chip-based sequencing nucleic acids

    Science.gov (United States)

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  13. Prediction of novel archaeal enzymes from sequence-derived features

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Skovgaard, Marie; Brunak, Søren

    2002-01-01

    The completely sequenced archaeal genomes potentially encode, among their many functionally uncharacterized genes, novel enzymes of biotechnological interest. We have developed a prediction method for detection and classification of enzymes from sequence alone (available at http://www.cbs.dtu.dk/......The completely sequenced archaeal genomes potentially encode, among their many functionally uncharacterized genes, novel enzymes of biotechnological interest. We have developed a prediction method for detection and classification of enzymes from sequence alone (available at http......://www.cbs.dtu.dk/services/ArchaeaFun/). The method does not make use of sequence similarity; rather, it relies on predicted protein features like cotranslational and posttranslational modifications, secondary structure, and simple physical/chemical properties....

  14. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  15. Identifying features in biological sequences: Sixth workshop report

    Energy Technology Data Exchange (ETDEWEB)

    Burks, C. [Los Alamos National Lab., NM (United States); Myers, E. [Univ. of Arizona (United States); Pearson, W.R. [Univ. of Virginia (United States)

    1995-12-31

    This report covers the sixth of an annual series of workshops held at the Aspen Center for Physics concentrating particularly on the identification of features in DNA sequence, and more broadly on related topics in computational molecular biology. The workshop series originally focused primarily on discussion of current needs and future strategies for identifying and predicting the presence of complex functional units on sequenced, but otherwise uncharacterized, genomic DNA. We addressed the need for computationally-based, automatic tools for synthesizing available data about individual consensus sequences and local compositional patterns into the composite objects (e.g., genes) that are -- as composite entities -- the true object of interest when scanning DNA sequences. The workshop was structured to promote sustained informal contact and exchange of expertise between molecular biologists, computer scientists, and mathematicians.

  16. Enhanced regulatory sequence prediction using gapped k-mer features.

    Directory of Open Access Journals (Sweden)

    Mahmoud Ghandi

    2014-07-01

    Full Text Available Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.

  17. Quantifying sequence and structural features of protein-RNA interactions.

    Science.gov (United States)

    Li, Songling; Yamashita, Kazuo; Amada, Karlou Mar; Standley, Daron M

    2014-09-01

    Increasing awareness of the importance of protein-RNA interactions has motivated many approaches to predict residue-level RNA binding sites in proteins based on sequence or structural characteristics. Sequence-based predictors are usually high in sensitivity but low in specificity; conversely structure-based predictors tend to have high specificity, but lower sensitivity. Here we quantified the contribution of both sequence- and structure-based features as indicators of RNA-binding propensity using a machine-learning approach. In order to capture structural information for proteins without a known structure, we used homology modeling to extract the relevant structural features. Several novel and modified features enhanced the accuracy of residue-level RNA-binding propensity beyond what has been reported previously, including by meta-prediction servers. These features include: hidden Markov model-based evolutionary conservation, surface deformations based on the Laplacian norm formalism, and relative solvent accessibility partitioned into backbone and side chain contributions. We constructed a web server called aaRNA that implements the proposed method and demonstrate its use in identifying putative RNA binding sites. PMID:25063293

  18. Comparative Amino Acid Sequences of Dengue Viruses

    OpenAIRE

    Haishi, Shozo; TANAKA Mariko; Igarashi, Akira

    1990-01-01

    Amino acid (AA) sequences of 4 serotype of dengue viruses deduced from their nucleotide (nt) sequences of genomic RNA were analyzed for each genome segment and each stretch of 10 AA residues. Precursor of membrane protein (pM), and 4 nonstructural proteins (NS1, NS3, NS4B, NS5) were highly conserved, while another nonstructural protein (NS2A) was least conserved among 5 strains of dengue viruses. When homology was compared among heterotypic viruses, type 1 and type 3 dengue viruses showed clo...

  19. Sequence features contributing to chromosomal rearrangements in Neisseria gonorrhoeae.

    Directory of Open Access Journals (Sweden)

    Russell Spencer-Smith

    Full Text Available Through whole genome sequence alignments, breakpoints in chromosomal synteny can be identified and the sequence features associated with these determined. Alignments of the genome sequences of Neisseria gonorrhoeae strain FA1090, N.gonorrhoeae strain NCCP11945, and N. gonorrhoeae strain TCDC-NG08107 reveal chromosomal rearrangements that have occurred. Based on these alignments and dot plot pair-wise comparisons, the overall chromosomal arrangement of strain NCCP11945 and TCDC-NG08107 are very similar, with no large inversions or translocations. The insertion of the Gonococcal Genetic Island in strain NCCP11945 is the most prominent distinguishing feature differentiating these strains. When strain NCCP11945 is compared to strain FA1090, however, 14 breakpoints in chromosomal synteny are identified between these gonococcal strains. The majority of these, 11 of 14, are associated with a prophage, IS elements, or IS-like repeat enclosed elements which appear to have played a role in the rearrangements observed. Additional rearrangements of small regions of the genome are associated with pilin genes. Evidence presented here suggests that the rearrangements of blocks of sequence are mediated by activation of prophage and associated IS elements and reintegration elsewhere in the genome or by homologous recombination between IS-like elements that have generated inversions.

  20. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Science.gov (United States)

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  1. Detection of nucleic acid sequences by invader-directed cleavage

    Science.gov (United States)

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  2. Automatic discovery of cross-family sequence features associated with protein function

    Directory of Open Access Journals (Sweden)

    Krings Andrea

    2006-01-01

    Full Text Available Abstract Background Methods for predicting protein function directly from amino acid sequences are useful tools in the study of uncharacterised protein families and in comparative genomics. Until now, this problem has been approached using machine learning techniques that attempt to predict membership, or otherwise, to predefined functional categories or subcellular locations. A potential drawback of this approach is that the human-designated functional classes may not accurately reflect the underlying biology, and consequently important sequence-to-function relationships may be missed. Results We show that a self-supervised data mining approach is able to find relationships between sequence features and functional annotations. No preconceived ideas about functional categories are required, and the training data is simply a set of protein sequences and their UniProt/Swiss-Prot annotations. The main technical aspect of the approach is the co-evolution of amino acid-based regular expressions and keyword-based logical expressions with genetic programming. Our experiments on a strictly non-redundant set of eukaryotic proteins reveal that the strongest and most easily detected sequence-to-function relationships are concerned with targeting to various cellular compartments, which is an area already well studied both experimentally and computationally. Of more interest are a number of broad functional roles which can also be correlated with sequence features. These include inhibition, biosynthesis, transcription and defence against bacteria. Despite substantial overlaps between these functions and their corresponding cellular compartments, we find clear differences in the sequence motifs used to predict some of these functions. For example, the presence of polyglutamine repeats appears to be linked more strongly to the "transcription" function than to the general "nuclear" function/location. Conclusion We have developed a novel and useful approach for

  3. Sequence of MET protooncogene cDNA has features characteristic of the tyrosine kinase family of growth-factor receptors

    International Nuclear Information System (INIS)

    The authors isolated overlapping cDNA clones corresponding to the major MET protooncogene transcript. The cDNA nucleotide sequence contained an open reading frame of 1408 amino acids with features characteristic of the tyrosine kinase family of growth factor receptors. These features include a putative 24-amino acid signal peptide and a candidate, hybrophobic, membrane-spanning segment of 23 amino acids, which defines an extracellular domain of 926 amino acids that could serve as a ligand-binding domain. A putative intracellular domain 435 amino acids long shows high homology with the SRC family of tyrosine kinases and within the kinase domain is most homologous with the human insulin receptor (44%) and v-abl (41%). Despite these similarities, however, they found no apparent sequence homology to other growth factor receptors in the putative ligand-binding domain. They conclude from the results that the MET protooncogene is a cell-surface receptor for an as-yet-unknown ligand

  4. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    Science.gov (United States)

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  5. Intumescent features of nucleic acids and proteins

    Energy Technology Data Exchange (ETDEWEB)

    Alongi, Jenny, E-mail: jenny.alongi@polito.it; Cuttica, Fabio; Blasio, Alessandro Di; Carosio, Federico; Malucelli, Giulio

    2014-09-10

    Highlights: • The combustion resistance of DNA and caseins to different heat fluxes was studied. • Upon heating, DNA and caseins exhibited an intumescent behaviour. • The char derived from DNA was more stable and coherent than that from caseins. - Abstract: Are nucleic acids and proteins intumescent molecules? In order to get an answer, in the present manuscript, powders of deoxyribose nucleic acids (DNA) and caseins have been exposed to different heat fluxes under a cone calorimeter source and to the direct application of a propane flame. Under these conditions, DNA and caseins exhibited a typical intumescent behaviour, generating a coherent expanded cellular carbonaceous residue (char), extremely resistant to heat exposure. The resulting volumetric expansion as well as the resistance of the formed char turned out to be dependent on (i) the chemical structure of the chosen biomacromolecule, (ii) the evolution of ammonia and (iii) the adopted heat flux in cone calorimetry tests (namely, 25, 35, 50 and 75 kW/m{sup 2}). The presence of ribose units within the DNA backbone determined the formation of highly expanded and coherent residues as compared to those obtained from caseins. Indeed, under a heat flux of 35 kW/m{sup 2}, when a carbon source (i.e. common cane sugar) was added to caseins, the resulting char was similar to that formed by DNA. Furthermore, the char expansion was ascribed to the evolution of ammonia released by these biomacromolecules upon heating, as detected by thermogravimetry coupled to infrared spectroscopy, and confirmed by scanning electron microscopy experiments performed on the bubbles present in the residues of flammability tests.

  6. Inference of Global HIV-1 Sequence Patterns and Preliminary Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Yan Wang; Reda Rawi; Daniel Hoffmann; Binlian Sun; Rongge Yang

    2013-01-01

    The epidemiology of HIV-1 varies in different areas of the world,and it is possible that this complexity may leave unique footprints in the viral genome.Thus,we attempted to find significant patterns in global HIV-1 genome sequences.By applying the rule inference algorithm RIPPER (Repeated Incremental Pruning to Produce Error Reduction) to multiple sequence alignments of Env sequences from four classes of compiled datasets,we generated four sets of signature patterns.We found that these patterns were able to distinguish southeastern Asian from nonsoutheastern Asian sequences with 97.5% accuracy,Chinese from non-Chinese sequences with 98.3% accuracy,African from non-African sequences with 88.4% accuracy,and southern African from non-southern African sequences with 91.2% accuracy.These patterns showed different associations with subtypes and with amino acid positions.In addition,some signature patterns were characteristic of the geographic area from which the sample was taken.Amino acid features corresponding to the phylogenetic clustering of HIV-1 sequences were consistent with some of the deduced patterns.Using a combination of patterns inferred from subtypes B,C,and all subtypes chimeric with CRF01_AE worldwide,we found that signature patterns of subtype C were extremely common in some sampled countries (for example,Zambia in southern Africa),which may hint at the origin of this HIV-1 subtype and the need to pay special attention to this area of Africa.Signature patterns of subtype B sequences were associated with different countries.Even more,there are distinct patterns at single position 21 with glycine,leucine and isoleucine corresponding to subtype C,B and all possible recombination forms chimeric with CRF01_AE,which also indicate distinct geographic features.Our method widens the scope of inference of signature from geographic,genetic,and genomic viewpoints.These findings may provide a valuable reference for epidemiological research or vaccine design.

  7. Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence

    NARCIS (Netherlands)

    Al-Shahib, A.; Breitling, R.; Gilbert, D.

    2005-01-01

    Abstract: When the standard approach to predict protein function by sequence homology fails, other alternative methods can be used that require only the amino acid sequence for predicting function. One such approach uses machine learning to predict protein function directly from amino acid sequence

  8. Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites.

    Science.gov (United States)

    Lee, Tzong-Yi; Chen, Shu-An; Hung, Hsin-Yi; Ou, Yu-Yen

    2011-03-09

    Ubiquitin (Ub) is a small protein that consists of 76 amino acids about 8.5 kDa. In ubiquitin conjugation, the ubiquitin is majorly conjugated on the lysine residue of protein by Ub-ligating (E3) enzymes. Three major enzymes participate in ubiquitin conjugation. They are E1, E2 and E3 which are responsible for activating, conjugating and ligating ubiquitin, respectively. Ubiquitin conjugation in eukaryotes is an important mechanism of the proteasome-mediated degradation of a protein and regulating the activity of transcription factors. Motivated by the importance of ubiquitin conjugation in biological processes, this investigation develops a method, UbSite, which uses utilizes an efficient radial basis function (RBF) network to identify protein ubiquitin conjugation (ubiquitylation) sites. This work not only investigates the amino acid composition but also the structural characteristics, physicochemical properties, and evolutionary information of amino acids around ubiquitylation (Ub) sites. With reference to the pathway of ubiquitin conjugation, the substrate sites for E3 recognition, which are distant from ubiquitylation sites, are investigated. The measurement of F-score in a large window size (-20∼+20) revealed a statistically significant amino acid composition and position-specific scoring matrix (evolutionary information), which are mainly located distant from Ub sites. The distant information can be used effectively to differentiate Ub sites from non-Ub sites. As determined by five-fold cross-validation, the model that was trained using the combination of amino acid composition and evolutionary information performs best in identifying ubiquitin conjugation sites. The prediction sensitivity, specificity, and accuracy are 65.5%, 74.8%, and 74.5%, respectively. Although the amino acid sequences around the ubiquitin conjugation sites do not contain conserved motifs, the cross-validation result indicates that the integration of distant sequence features of Ub

  9. Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites.

    Directory of Open Access Journals (Sweden)

    Tzong-Yi Lee

    Full Text Available Ubiquitin (Ub is a small protein that consists of 76 amino acids about 8.5 kDa. In ubiquitin conjugation, the ubiquitin is majorly conjugated on the lysine residue of protein by Ub-ligating (E3 enzymes. Three major enzymes participate in ubiquitin conjugation. They are E1, E2 and E3 which are responsible for activating, conjugating and ligating ubiquitin, respectively. Ubiquitin conjugation in eukaryotes is an important mechanism of the proteasome-mediated degradation of a protein and regulating the activity of transcription factors. Motivated by the importance of ubiquitin conjugation in biological processes, this investigation develops a method, UbSite, which uses utilizes an efficient radial basis function (RBF network to identify protein ubiquitin conjugation (ubiquitylation sites. This work not only investigates the amino acid composition but also the structural characteristics, physicochemical properties, and evolutionary information of amino acids around ubiquitylation (Ub sites. With reference to the pathway of ubiquitin conjugation, the substrate sites for E3 recognition, which are distant from ubiquitylation sites, are investigated. The measurement of F-score in a large window size (-20∼+20 revealed a statistically significant amino acid composition and position-specific scoring matrix (evolutionary information, which are mainly located distant from Ub sites. The distant information can be used effectively to differentiate Ub sites from non-Ub sites. As determined by five-fold cross-validation, the model that was trained using the combination of amino acid composition and evolutionary information performs best in identifying ubiquitin conjugation sites. The prediction sensitivity, specificity, and accuracy are 65.5%, 74.8%, and 74.5%, respectively. Although the amino acid sequences around the ubiquitin conjugation sites do not contain conserved motifs, the cross-validation result indicates that the integration of distant sequence

  10. FeatureMap3D - a tool to map protein features and sequence conservation onto homologous structures in the PDB

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Rapacki, Krzysztof; Stærfeldt, Hans Henrik;

    2006-01-01

    with the selected features and sequence conservation enhanced. The results are also returned in a readily parsable text format as well as a PyMol (http://pymol.sourceforge.net/) script file, which allows the user to easily modify the protein structure image to suit a specific purpose. FeatureMap3D can also be used...

  11. New complete genome sequences of human rhinoviruses shed light on their phylogeny and genomic features

    Directory of Open Access Journals (Sweden)

    Zdobnov Evgeny M

    2007-07-01

    Full Text Available Abstract Background Human rhinoviruses (HRV, the most frequent cause of respiratory infections, include 99 different serotypes segregating into two species, A and B. Rhinoviruses share extensive genomic sequence similarity with enteroviruses and both are part of the picornavirus family. Nevertheless they differ significantly at the phenotypic level. The lack of HRV full-length genome sequences and the absence of analysis comparing picornaviruses at the whole genome level limit our knowledge of the genomic features supporting these differences. Results Here we report complete genome sequences of 12 HRV-A and HRV-B serotypes, more than doubling the current number of available HRV sequences. The whole-genome maximum-likelihood phylogenetic analysis suggests that HRV-B and human enteroviruses (HEV diverged from the last common ancestor after their separation from HRV-A. On the other hand, compared to HEV, HRV-B are more related to HRV-A in the capsid and 3B-C regions. We also identified the presence of a 2C cis-acting replication element (cre in HRV-B that is not present in HRV-A, and that had been previously characterized only in HEV. In contrast to HEV viruses, HRV-A and HRV-B share also markedly lower GC content along the whole genome length. Conclusion Our findings provide basis to speculate about both the biological similarities and the differences (e.g. tissue tropism, temperature adaptation or acid lability of these three groups of viruses.

  12. An improved classification of G-protein-coupled receptors using sequence-derived features

    Directory of Open Access Journals (Sweden)

    Peng Zhen-Ling

    2010-08-01

    Full Text Available Abstract Background G-protein-coupled receptors (GPCRs play a key role in diverse physiological processes and are the targets of almost two-thirds of the marketed drugs. The 3 D structures of GPCRs are largely unavailable; however, a large number of GPCR primary sequences are known. To facilitate the identification and characterization of novel receptors, it is therefore very valuable to develop a computational method to accurately predict GPCRs from the protein primary sequences. Results We propose a new method called PCA-GPCR, to predict GPCRs using a comprehensive set of 1497 sequence-derived features. The principal component analysis is first employed to reduce the dimension of the feature space to 32. Then, the resulting 32-dimensional feature vectors are fed into a simple yet powerful classification algorithm, called intimate sorting, to predict GPCRs at five levels. The prediction at the first level determines whether a protein is a GPCR or a non-GPCR. If it is predicted to be a GPCR, then it will be further predicted into certain family, subfamily, sub-subfamily and subtype by the classifiers at the second, third, fourth, and fifth levels, respectively. To train the classifiers applied at five levels, a non-redundant dataset is carefully constructed, which contains 3178, 1589, 4772, 4924, and 2741 protein sequences at the respective levels. Jackknife tests on this training dataset show that the overall accuracies of PCA-GPCR at five levels (from the first to the fifth can achieve up to 99.5%, 88.8%, 80.47%, 80.3%, and 92.34%, respectively. We further perform predictions on a dataset of 1238 GPCRs at the second level, and on another two datasets of 167 and 566 GPCRs respectively at the fourth level. The overall prediction accuracies of our method are consistently higher than those of the existing methods to be compared. Conclusions The comprehensive set of 1497 features is believed to be capable of capturing information about amino acid

  13. The origin of biased sequence depth in sequence-independent nucleic acid amplification and optimization for efficient massive parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Toon Rosseel

    Full Text Available Sequence Independent Single Primer Amplification is one of the most widely used random amplification approaches in virology for sequencing template preparation. This technique relies on oligonucleotides consisting of a 3' random part used to prime complementary DNA synthesis and a 5' defined tag sequence for subsequent amplification. Recently, this amplification method was combined with next generation sequencing to obtain viral sequences. However, these studies showed a biased distribution of the resulting sequence reads over the analyzed genomes. The aim of this study was to elucidate the mechanisms that lead to biased sequence depth when using random amplification. Avian paramyxovirus type 8 was used as a model RNA virus to investigate these mechanisms. We showed, based on in silico analysis of the sequence depth in relation to GC-content, predicted RNA secondary structure and sequence complementarity to the 3' part of the tag sequence, that the tag sequence has the main contribution to the observed bias in sequence depth. We confirmed this finding experimentally using both fragmented and non-fragmented viral RNAs as well as primers differing in random oligomer length (6 or 12 nucleotides and in the sequence of the amplification tag. The observed oligonucleotide annealing bias can be reduced by extending the random oligomer sequence and by in silico combining sequence data from SISPA experiments using different 5' defined tag sequences. These findings contribute to the optimization of random nucleic acid amplification protocols that are currently required for downstream applications such as viral metagenomics and microarray analysis.

  14. Peptide Nucleic Acids Having Enhanced Binding Affinity and Sequence Specificity

    DEFF Research Database (Denmark)

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and binding affinity. Methods of increasing binding affinity and sequence specificity of peptide nucleic aci...

  15. MEANS AND METHODS FOR CLONING NUCLEIC ACID SEQUENCES

    NARCIS (Netherlands)

    Geertsma, Eric Robin; Poolman, Berend

    2008-01-01

    The invention provides means and methods for efficiently cloning nucleic acid sequences of interest in micro-organisms that are less amenable to conventional nucleic acid manipulations, as compared to, for instance, E.coli. The present invention enables high-throughput cloning (and, preferably, expr

  16. Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data.

    Science.gov (United States)

    Pan, Yonglong; Wang, Xiaoming; Liu, Lin; Wang, Hao; Luo, Meizhong

    2016-01-01

    A good physical map is essential to guide sequence assembly in de novo whole genome sequencing, especially when sequences are produced by high-throughput sequencing such as next-generation-sequencing (NGS) technology. We here present a novel method, Feature sets-based Genome Mapping (FGM). With FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion. Method model was created and parameters were inspected by simulations using the Arabidopsis genome sequence. In the simulations, when ~4.8X genome BAC library including 4,096 clones was used to sequence the whole genome, ~90% of clones were successfully connected to physical contigs, and 91.58% of genome sequences were mapped and connected to chromosomes. This method was experimentally verified using the existing physical map and genome sequence of rice. Of 4,064 clones covering 115 Mb sequence selected from ~3 tiles of 3 chromosomes of a rice draft physical map, 3,364 clones were reconstructed into physical contigs and 98 Mb sequences were integrated into the 3 chromosomes. The physical map-integrated draft genome sequences can provide permanent frameworks for eventually obtaining high-quality reference sequences by targeted sequencing, gap filling and combining other sequences. PMID:27611682

  17. Two distinct ferredoxins from Rhodobacter capsulatus: complete amino acid sequences and molecular evolution.

    Science.gov (United States)

    Saeki, K; Suetsugu, Y; Yao, Y; Horio, T; Marrs, B L; Matsubara, H

    1990-09-01

    Two distinct ferredoxins were purified from Rhodobacter capsulatus SB1003. Their complete amino acid sequences were determined by a combination of protease digestion, BrCN cleavage and Edman degradation. Ferredoxins I and II were composed of 64 and 111 amino acids, respectively, with molecular weights of 6,728 and 12,549 excluding iron and sulfur atoms. Both contained two Cys clusters in their amino acid sequences. The first cluster of ferredoxin I and the second cluster of ferredoxin II had a sequence, CxxCxxCxxxCP, in common with the ferredoxins found in Clostridia. The second cluster of ferredoxin I had a sequence, CxxCxxxxxxxxCxxxCM, with extra amino acids between the second and third Cys, which has been reported for other photosynthetic bacterial ferredoxins and putative ferredoxins (nif-gene products) from nitrogen-fixing bacteria, and with a unique occurrence of Met. The first cluster of ferredoxin II had a CxxCxxxxCxxxCP sequence, with two additional amino acids between the second and third Cys, a characteristics feature of Azotobacter-[3Fe-4S] [4Fe-4S]-ferredoxin. Ferredoxin II was also similar to Azotobacter-type ferredoxins with an extended carboxyl (C-) terminal sequence compared to the common Clostridium-type. The evolutionary relationship of the two together with a putative one recently found to be encoded in nifENXQ region in this bacterium [Moreno-Vivian et al. (1989) J. Bacteriol. 171, 2591-2598] is discussed. PMID:2277040

  18. Amino acid sequences of proteins from Leptospira serovar pomona

    Directory of Open Access Journals (Sweden)

    Alves Selmo F

    2000-01-01

    Full Text Available This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  19. Representation of protein-sequence information by amino acid subalphabets

    DEFF Research Database (Denmark)

    Andersen, C.A.F.; Brunak, Søren

    2004-01-01

    Within computational biology, algorithms are constructed with the aim of extracting knowledge from biological data, in particular, data generated by the large genome projects, where gene and protein sequences are produced in high volume. In this article, we explore new ways of representing protein......-sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...

  20. On Quantum Algorithm for Multiple Alignment of Amino Acid Sequences

    Science.gov (United States)

    Iriyama, Satoshi; Ohya, Masanori

    2009-02-01

    The alignment of genome sequences or amino acid sequences is one of fundamental operations for the study of life. Usual computational complexity for the multiple alignment of N sequences with common length L by dynamic programming is O(LN). This alignment is considered as one of the NP problems, so that it is desirable to find a nice algorithm of the multiple alignment. Thus in this paper we propose the quantum algorithm for the multiple alignment based on the works12,1,2 in which the NP complete problem was shown to be the P problem by means of quantum algorithm and chaos information dynamics.

  1. Estimation of Position Specific Energy as a Feature of Protein Residues from Sequence Alone for Structural Classification.

    Science.gov (United States)

    Iqbal, Sumaiya; Hoque, Md Tamjidul

    2016-01-01

    A set of features computed from the primary amino acid sequence of proteins, is crucial in the process of inducing a machine learning model that is capable of accurately predicting three-dimensional protein structures. Solutions for existing protein structure prediction problems are in need of features that can capture the complexity of molecular level interactions. With a view to this, we propose a novel approach to estimate position specific estimated energy (PSEE) of a residue using contact energy and predicted relative solvent accessibility (RSA). Furthermore, we demonstrate PSEE can be reasonably estimated based on sequence information alone. PSEE is useful in identifying the structured as well as unstructured or, intrinsically disordered region of a protein by computing favorable and unfavorable energy respectively, characterized by appropriate threshold. The most intriguing finding, verified empirically, is the indication that the PSEE feature can effectively classify disorder versus ordered residues and can segregate different secondary structure type residues by computing the constituent energies. PSEE values for each amino acid strongly correlate with the hydrophobicity value of the corresponding amino acid. Further, PSEE can be used to detect the existence of critical binding regions that essentially undergo disorder-to-order transitions to perform crucial biological functions. Towards an application of disorder prediction using the PSEE feature, we have rigorously tested and found that a support vector machine model informed by a set of features including PSEE consistently outperforms a model with an identical set of features with PSEE removed. In addition, the new disorder predictor, DisPredict2, shows competitive performance in predicting protein disorder when compared with six existing disordered protein predictors. PMID:27588752

  2. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences.

    Science.gov (United States)

    Derr, Julien; Manapat, Michael L; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A; Chen, Irene A

    2012-05-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.

  3. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    Science.gov (United States)

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobichydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  4. Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis.

    Directory of Open Access Journals (Sweden)

    Lu-Lu Zheng

    Full Text Available Pyrrolidone carboxylic acid (PCA is formed during a common post-translational modification (PTM of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR and incremental feature selection (IFS. We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum of MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations.

  5. Effective automated feature construction and selection for classification of biological sequences.

    Directory of Open Access Journals (Sweden)

    Uday Kamath

    Full Text Available Many open problems in bioinformatics involve elucidating underlying functional signals in biological sequences. DNA sequences, in particular, are characterized by rich architectures in which functional signals are increasingly found to combine local and distal interactions at the nucleotide level. Problems of interest include detection of regulatory regions, splice sites, exons, hypersensitive sites, and more. These problems naturally lend themselves to formulation as classification problems in machine learning. When classification is based on features extracted from the sequences under investigation, success is critically dependent on the chosen set of features.We present an algorithmic framework (EFFECT for automated detection of functional signals in biological sequences. We focus here on classification problems involving DNA sequences which state-of-the-art work in machine learning shows to be challenging and involve complex combinations of local and distal features. EFFECT uses a two-stage process to first construct a set of candidate sequence-based features and then select a most effective subset for the classification task at hand. Both stages make heavy use of evolutionary algorithms to efficiently guide the search towards informative features capable of discriminating between sequences that contain a particular functional signal and those that do not.To demonstrate its generality, EFFECT is applied to three separate problems of importance in DNA research: the recognition of hypersensitive sites, splice sites, and ALU sites. Comparisons with state-of-the-art algorithms show that the framework is both general and powerful. In addition, a detailed analysis of the constructed features shows that they contain valuable biological information about DNA architecture, allowing biologists and other researchers to directly inspect the features and potentially use the insights obtained to assist wet-laboratory studies on retainment or modification

  6. Anti-peptide aptamers recognize amino acid sequence and bind a protein epitope.

    OpenAIRE

    Xu, W; Ellington, A. D.

    1996-01-01

    In vitro selection of nucleic acid binding species (aptamers) is superficially similar to the immune response. Both processes produce biopolymers that can recognize targets with high affinity and specificity. While antibodies are known to recognize the sequence and conformation of protein surface features (epitopes), very little is known about the precise interactions between aptamers and their epitopes. Therefore, aptamers that could recognize a particular epitope, a peptide fragment of huma...

  7. SeqVISTA: a graphical tool for sequence feature visualization and comparison

    Directory of Open Access Journals (Sweden)

    Niu Tianhua

    2003-01-01

    Full Text Available Abstract Background Many readers will sympathize with the following story. You are viewing a gene sequence in Entrez, and you want to find whether it contains a particular sequence motif. You reach for the browser's "find in page" button, but those darn spaces every 10 bp get in the way. And what if the motif is on the opposite strand? Subsequently, your favorite sequence analysis software informs you that there is an interesting feature at position 13982–14013. By painstakingly counting the 10 bp blocks, you are able to examine the sequence at this location. But now you want to see what other features have been annotated close by, and this information is buried several screenfuls higher up the web page. Results SeqVISTA presents a holistic, graphical view of features annotated on nucleotide or protein sequences. This interactive tool highlights the residues in the sequence that correspond to features chosen by the user, and allows easy searching for sequence motifs or extraction of particular subsequences. SeqVISTA is able to display results from diverse sequence analysis tools in an integrated fashion, and aims to provide much-needed unity to the bioinformatics resources scattered around the Internet. Our viewer may be launched on a GenBank record by a single click of a button installed in the web browser. Conclusion SeqVISTA allows insights to be gained by viewing the totality of sequence annotations and predictions, which may be more revealing than the sum of their parts. SeqVISTA runs on any operating system with a Java 1.4 virtual machine. It is freely available to academic users at http://zlab.bu.edu/SeqVISTA.

  8. Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains

    Directory of Open Access Journals (Sweden)

    Branislava Gemovic

    2013-01-01

    Full Text Available There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM, a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.

  9. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...... alignment by 'reverse translation' of the aligned protein sequences. In the resulting DNA alignment, gaps occur in groups of three corresponding to entire codons, and analogous codon positions are therefore always lined up. These features are useful when constructing multiple DNA alignments for phylogenetic...

  10. Analysis on n-gram statistics and linguistic features of whole genome protein sequences

    Institute of Scientific and Technical Information of China (English)

    DONG Qi-wen; WANG Xiao-long; LIN Lei

    2008-01-01

    To obtain the statistical sequence analysis on a large number of genomic and proteomie sequences available for different organisms,the n-grams of whole genome protein sequences from 20 organisms were extracted.Their linguistic features were analyzed by two tests:Zipf power law and Shannon entropy,developed for analysis of natural languages and symbolic sequences.The natural genome proteins and the artificial genome proteins were compared with each other and some statistical features of n-grams were discovered.The results show that:the n-grams of whole genome protein sequences approximately follow the Zipf law when n is larger than 4;the Shannon n-gram entropy of natural genome proteins is lower than that of artificial proteins;a simple unigram model can distinguish different organisms;there exist organism-specific usages of "phrases" in protein sequences.It is suggested that further detailed analysis on n-gram of whole genome protein sequences will result in a powerful model for mapping the relationship of protein sequence,structure and function.

  11. What are the basic modules of implicit sequence learning? A feature-based account

    OpenAIRE

    Eberhardt, Katharina

    2015-01-01

    According to the Theory of Event Coding (TEC; Hommel et al., 2001), action and perception are represented in a shared format in the cognitive system by means of feature codes. In implicit sequence learning research, it is still common to make a conceptual difference between independent motor and perceptual sequences. This supposedly independent learning takes place in encapsulated modules (Keele et al., 2003) which process information along single dimensions. These dimensions have remained un...

  12. An Improved Management Model for Tracking Missing Features in Computer Vision Long Image Sequences

    OpenAIRE

    Pinho, Raquel R.; João Manuel R. S. Tavares; Correia, Miguel V.

    2006-01-01

    In this paper we present a management model to deal with the problem of tracking missing features during long image sequences using Computational Vision. Some usual difficulties related with missing features are that they may be temporarily occluded or might even have disappeared definitively, and the computational cost involved should always be reduced to the strictly necessary. The proposed Net Present Value (NPV) model, based on the economic Theory of Capital, considers the tracking of eac...

  13. The amino acid sequence of Escherichia coli cyanase.

    Science.gov (United States)

    Chin, C C; Anderson, P M; Wold, F

    1983-01-10

    The amino acid sequence of the enzyme cyanase (cyanate hydrolase) from Escherichia coli has been determined by automatic Edman degradation of the intact protein and of its component peptides. The primary peptides used in the sequencing were produced by cyanogen bromide cleavage at the methionine residues, yielding 4 peptides plus free homoserine from the NH2-terminal methionine, and by trypsin cleavage at the 7 arginine residues after acetylation of the lysines. Secondary peptides required for overlaps and COOH-terminal sequences were produced by chymotrypsin or clostripain cleavage of some of the larger peptides. The complete sequence of the cyanase subunit consists of 156 amino acid residues (Mr 16,350). Based on the observation that the cysteine-containing peptide is obtained as a disulfide-linked dimer, it is proposed that the covalent structure of cyanase is made up of two subunits linked by a disulfide bond between the single cystine residue in each subunit. The native enzyme (Mr 150,000) then appears to be a complex of four or five such subunit dimers.

  14. Virtual Ribosome - a comprehensive DNA translation tool with support for integration of sequence feature annotation

    DEFF Research Database (Denmark)

    Wernersson, Rasmus

    2006-01-01

    of alternative start codons. ( ii) Integration of sequences feature annotation - in particular, native support for working with files containing intron/ exon structure annotation. The software is available for both download and online use at http://www.cbs.dtu.dk/services/VirtualRibosome/....

  15. Nucleotide sequence and corresponding amino acid sequence of the gene for the major antigen of foot and mouth disease virus.

    OpenAIRE

    Kurz, C; Forss, S; Küpper, H; K Strohmaier; Schaller, H

    1981-01-01

    A segment of 1160 nucleotides of the FMDV genome has been sequenced using three overlapping fragments of cloned cDNA from FMDV strain O1K. This sequence contains the coding sequence for the viral capsid protein VP1 as shown by its homology to known and newly determined amino acid sequences from this man antigenic polypeptide of the FMDV virion. The structural gene for VP1 comprises 639 nucleotides which specify a sequence of 213 amino acids for the VP1 protein. The coding sequence is not flan...

  16. Correlation between fibroin amino acid sequence and physical silk properties.

    Science.gov (United States)

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet. PMID:12816957

  17. Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework

    Directory of Open Access Journals (Sweden)

    Chen Shaokang

    2011-01-01

    Full Text Available Although automatic faces recognition has shown success for high-quality images under controlled conditions, for video-based recognition it is hard to attain similar levels of performance. We describe in this paper recent advances in a project being undertaken to trial and develop advanced surveillance systems for public safety. In this paper, we propose a local facial feature based framework for both still image and video-based face recognition. The evaluation is performed on a still image dataset LFW and a video sequence dataset MOBIO to compare 4 methods for operation on feature: feature averaging (Avg-Feature, Mutual Subspace Method (MSM, Manifold to Manifold Distance (MMS, and Affine Hull Method (AHM, and 4 methods for operation on distance on 3 different features. The experimental results show that Multi-region Histogram (MRH feature is more discriminative for face recognition compared to Local Binary Patterns (LBP and raw pixel intensity. Under the limitation on a small number of images available per person, feature averaging is more reliable than MSM, MMD, and AHM and is much faster. Thus, our proposed framework—averaging MRH feature is more suitable for CCTV surveillance systems with constraints on the number of images and the speed of processing.

  18. SoftSearch: integration of multiple sequence features to identify breakpoints of structural variations.

    Directory of Open Access Journals (Sweden)

    Steven N Hart

    Full Text Available BACKGROUND: Structural variation (SV represents a significant, yet poorly understood contribution to an individual's genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. RESULTS: We developed and validated SoftSearch using real and synthetic datasets. SoftSearch's key features are 1 not requiring secondary (or exhaustive primary alignment, 2 portability into established sequencing workflows, and 3 is applicable to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.. SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. CONCLUSIONS: We show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance.

  19. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    Directory of Open Access Journals (Sweden)

    Ruan Jishou

    2007-04-01

    Full Text Available Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP; the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are

  20. Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data

    DEFF Research Database (Denmark)

    Rask, Thomas Salhøj; Petersen, Bent; Chen, Donald S.;

    2016-01-01

    insertions and deletions, are on the other hand likely to disrupt open reading frames. Such an inverse relationship between errors and expectation based on prior knowledge can be used advantageously to guide the process known as basecalling, i.e. the inference of nucleotide sequence from raw sequencing data......Amplicon pyrosequencing targets a known genetic region and thus inherently produces reads highly anticipated to have certain features, such as conserved nucleotide sequence, and in the case of protein coding DNA, an open reading frame. Pyrosequencing errors, consisting mainly of nucleotide....... This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria...

  1. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Directory of Open Access Journals (Sweden)

    Meiler Arno

    2012-09-01

    Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  2. What Matters in Implicit Task Sequence Learning: Perceptual Stimulus Features, Task Sets, or Correlated Streams of Information?

    Science.gov (United States)

    Weiermann, Brigitte; Cock, Josephine; Meier, Beat

    2010-01-01

    Implicit task sequence learning may be attributed to learning the order of perceptual stimulus features associated with the task sequence, learning a series of automatic task set activations, or learning an integrated sequence that derives from 2 correlated streams of information. In the present study, our purpose was to distinguish among these 3…

  3. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    KAUST Repository

    Chen, Peng

    2013-07-23

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  4. Prediction of protein motions from amino acid sequence and its application to protein-protein interaction

    Directory of Open Access Journals (Sweden)

    Wako Hiroshi

    2010-07-01

    Full Text Available Abstract Background Structural flexibility is an important characteristic of proteins because it is often associated with their function. The movement of a polypeptide segment in a protein can be broken down into two types of motions: internal and external ones. The former is deformation of the segment itself, but the latter involves only rotational and translational motions as a rigid body. Normal Model Analysis (NMA can derive these two motions, but its application remains limited because it necessitates the gathering of complete structural information. Results In this work, we present a novel method for predicting two kinds of protein motions in ordered structures. The prediction uses only information from the amino acid sequence. We prepared a dataset of the internal and external motions of segments in many proteins by application of NMA. Subsequently, we analyzed the relation between thermal motion assessed from X-ray crystallographic B-factor and internal/external motions calculated by NMA. Results show that attributes of amino acids related to the internal motion have different features from those related to the B-factors, although those related to the external motion are correlated strongly with the B-factors. Next, we developed a method to predict internal and external motions from amino acid sequences based on the Random Forest algorithm. The proposed method uses information associated with adjacent amino acid residues and secondary structures predicted from the amino acid sequence. The proposed method exhibited moderate correlation between predicted internal and external motions with those calculated by NMA. It has the highest prediction accuracy compared to a naïve model and three published predictors. Conclusions Finally, we applied the proposed method predicting the internal motion to a set of 20 proteins that undergo large conformational change upon protein-protein interaction. Results show significant overlaps between the

  5. Prediction of peptide drift time in ion mobility mass spectrometry from sequence-based features

    KAUST Repository

    Wang, Bing

    2013-05-09

    Background: Ion mobility-mass spectrometry (IMMS), an analytical technique which combines the features of ion mobility spectrometry (IMS) and mass spectrometry (MS), can rapidly separates ions on a millisecond time-scale. IMMS becomes a powerful tool to analyzing complex mixtures, especially for the analysis of peptides in proteomics. The high-throughput nature of this technique provides a challenge for the identification of peptides in complex biological samples. As an important parameter, peptide drift time can be used for enhancing downstream data analysis in IMMS-based proteomics.Results: In this paper, a model is presented based on least square support vectors regression (LS-SVR) method to predict peptide ion drift time in IMMS from the sequence-based features of peptide. Four descriptors were extracted from peptide sequence to represent peptide ions by a 34-component vector. The parameters of LS-SVR were selected by a grid searching strategy, and a 10-fold cross-validation approach was employed for the model training and testing. Our proposed method was tested on three datasets with different charge states. The high prediction performance achieve demonstrate the effectiveness and efficiency of the prediction model.Conclusions: Our proposed LS-SVR model can predict peptide drift time from sequence information in relative high prediction accuracy by a test on a dataset of 595 peptides. This work can enhance the confidence of protein identification by combining with current protein searching techniques. 2013 Wang et al.; licensee BioMed Central Ltd.

  6. Sequencing of bovine herpesvirus 4 v.test strain reveals important genome features

    Directory of Open Access Journals (Sweden)

    Gillet Laurent

    2011-08-01

    Full Text Available Abstract Background Bovine herpesvirus 4 (BoHV-4 is a useful model for the human pathogenic gammaherpesviruses Epstein-Barr virus and Kaposi's Sarcoma-associated Herpesvirus. Although genome manipulations of this virus have been greatly facilitated by the cloning of the BoHV-4 V.test strain as a Bacterial Artificial Chromosome (BAC, the lack of a complete genome sequence for this strain limits its experimental use. Methods In this study, we have determined the complete sequence of BoHV-4 V.test strain by a pyrosequencing approach. Results The long unique coding region (LUR consists of 108,241 bp encoding at least 79 open reading frames and is flanked by several polyrepetitive DNA units (prDNA. As previously suggested, we showed that the prDNA unit located at the left prDNA-LUR junction (prDNA-G differs from the other prDNA units (prDNA-inner. Namely, the prDNA-G unit lacks the conserved pac-2 cleavage and packaging signal in its right terminal region. Based on the mechanisms of cleavage and packaging of herpesvirus genomes, this feature implies that only genomes bearing left and right end prDNA units are encapsulated into virions. Conclusions In this study, we have determined the complete genome sequence of the BAC-cloned BoHV-4 V.test strain and identified genome organization features that could be important in other herpesviruses.

  7. FASTERp: A Feature Array Search Tool for Estimating Resemblance of Protein Sequences

    Energy Technology Data Exchange (ETDEWEB)

    Macklin, Derek; Egan, Rob; Wang, Zhong

    2014-03-14

    Metagenome sequencing efforts have provided a large pool of billions of genes for identifying enzymes with desirable biochemical traits. However, homology search with billions of genes in a rapidly growing database has become increasingly computationally impractical. Here we present our pilot efforts to develop a novel alignment-free algorithm for homology search. Specifically, we represent individual proteins as feature vectors that denote the presence or absence of short kmers in the protein sequence. Similarity between feature vectors is then computed using the Tanimoto score, a distance metric that can be rapidly computed on bit string representations of feature vectors. Preliminary results indicate good correlation with optimal alignment algorithms (Spearman r of 0.87, ~;;1,000,000 proteins from Pfam), as well as with heuristic algorithms such as BLAST (Spearman r of 0.86, ~;;1,000,000 proteins). Furthermore, a prototype of FASTERp implemented in Python runs approximately four times faster than BLAST on a small scale dataset (~;;1000 proteins). We are optimizing and scaling to improve FASTERp to enable rapid homology searches against billion-protein databases, thereby enabling more comprehensive gene annotation efforts.

  8. Parallax Effect Free Mosaicing of Underwater Video Sequence Based on Texture Features

    Directory of Open Access Journals (Sweden)

    Nagaraja S

    2014-10-01

    Full Text Available In this paper, we present feature-based technique for construction of mosaic image from underwater video sequence, which suffers from parallax distortion due to propagation properties of light in the underwater environment. The most of the available mosaic tools and underwater image mosaicing techniques yields final result with some artifacts such as blurring, ghosting and seam due to presence of parallax in the input images. The removal of parallax from input images may not reduce its effects instead it must be corrected in successive steps of mosaicing. Thus, our approach minimizes the parallax effects by adopting an efficient local alignment technique after global registration. We extract texture features using Centre Symmetric Local Binary Pattern (CS-LBP descriptor in order to find feature correspondences, which are used further for estimation of homography through RANSAC. In order to increase the accuracy of global registration, we perform preprocessing such as colour alignment between two selected frames based on colour distribution adjustment. Because of existence of 100% overlap in consecutive frames of underwater video, we select frames with minimum overlap based on mutual offset in order to reduce the computation cost during mosaicing. Our approach minimizes the parallax effects considerably in final mosaic constructed using our own underwater video sequences.

  9. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome

    Science.gov (United States)

    Benoit, Joshua B.; Adelman, Zach N.; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C.; Szuter, Elise M.; Hagan, Richard W.; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M.; Nelson, David R.; Rosendale, Andrew J.; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M.; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R.; Ioannidis, Panagiotis; Waterhouse, Robert M.; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J. Spencer; Gondhalekar, Ameya D.; Scharf, Michael E.; Peterson, Brittany F.; Raje, Kapil R.; Hottel, Benjamin A.; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S. T.; Duncan, Elizabeth J.; Murali, Shwetha C.; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L.; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C.; Muzny, Donna M.; Wheeler, David; Panfilio, Kristen A.; Vargas Jentzsch, Iris M.; Vargo, Edward L.; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T.; Anderson, Michelle A. E.; Jones, Jeffery W.; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D.; Attardo, Geoffrey M.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Ribeiro, Jose M. C.; Gibbs, Richard A.; Werren, John H.; Palli, Subba R.; Schal, Coby; Richards, Stephen

    2016-01-01

    The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host–symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human–bed bug and symbiont–bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814

  10. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome.

    Science.gov (United States)

    Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus; Dolan, Amanda; Poelchau, Monica; Jennings, Emily C; Szuter, Elise M; Hagan, Richard W; Gujar, Hemant; Shukla, Jayendra Nath; Zhu, Fang; Mohan, M; Nelson, David R; Rosendale, Andrew J; Derst, Christian; Resnik, Valentina; Wernig, Sebastian; Menegazzi, Pamela; Wegener, Christian; Peschel, Nicolai; Hendershot, Jacob M; Blenau, Wolfgang; Predel, Reinhard; Johnston, Paul R; Ioannidis, Panagiotis; Waterhouse, Robert M; Nauen, Ralf; Schorn, Corinna; Ott, Mark-Christoph; Maiwald, Frank; Johnston, J Spencer; Gondhalekar, Ameya D; Scharf, Michael E; Peterson, Brittany F; Raje, Kapil R; Hottel, Benjamin A; Armisén, David; Crumière, Antonin Jean Johan; Refki, Peter Nagui; Santos, Maria Emilia; Sghaier, Essia; Viala, Sèverine; Khila, Abderrahman; Ahn, Seung-Joon; Childers, Christopher; Lee, Chien-Yueh; Lin, Han; Hughes, Daniel S T; Duncan, Elizabeth J; Murali, Shwetha C; Qu, Jiaxin; Dugan, Shannon; Lee, Sandra L; Chao, Hsu; Dinh, Huyen; Han, Yi; Doddapaneni, Harshavardhan; Worley, Kim C; Muzny, Donna M; Wheeler, David; Panfilio, Kristen A; Vargas Jentzsch, Iris M; Vargo, Edward L; Booth, Warren; Friedrich, Markus; Weirauch, Matthew T; Anderson, Michelle A E; Jones, Jeffery W; Mittapalli, Omprakash; Zhao, Chaoyang; Zhou, Jing-Jiang; Evans, Jay D; Attardo, Geoffrey M; Robertson, Hugh M; Zdobnov, Evgeny M; Ribeiro, Jose M C; Gibbs, Richard A; Werren, John H; Palli, Subba R; Schal, Coby; Richards, Stephen

    2016-01-01

    The bed bug, Cimex lectularius, has re-established itself as a ubiquitous human ectoparasite throughout much of the world during the past two decades. This global resurgence is likely linked to increased international travel and commerce in addition to widespread insecticide resistance. Analyses of the C. lectularius sequenced genome (650 Mb) and 14,220 predicted protein-coding genes provide a comprehensive representation of genes that are linked to traumatic insemination, a reduced chemosensory repertoire of genes related to obligate hematophagy, host-symbiont interactions, and several mechanisms of insecticide resistance. In addition, we document the presence of multiple putative lateral gene transfer events. Genome sequencing and annotation establish a solid foundation for future research on mechanisms of insecticide resistance, human-bed bug and symbiont-bed bug associations, and unique features of bed bug biology that contribute to the unprecedented success of C. lectularius as a human ectoparasite. PMID:26836814

  11. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  12. Topological features of proteins from amino acid residue networks

    CERN Document Server

    Alves, N A; Alves, Nelson Augusto; Martinez, Alexandre Souto

    2006-01-01

    Topological properties of native folds are obtained from statistical analysis of 160 low homology proteins covering the four structural classes. This is done analysing one, two and three-vertex joint distribution of quantities related to the corresponding network of amino acid residues. Emphasis on the amino acid residue hydrophobicity leads to the definition of their center of mass as vertices in this contact network model with interactions represented by edges. The network analysis helps us to interpret experimental results such as hydrophobic scales and fraction of buried accessible surface area in terms of the network connectivity. To explore the vertex type dependent correlations, we build a network of hydrophobic and polar vertices. This procedure presents the wiring diagram of the topological structure of globular proteins leading to the following attachment probabilities between hydrophobic-hydrophobic 0.424(5), hydrophobic-polar 0.419(2) and polar-polar 0.157(3) residues.

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Science.gov (United States)

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  14. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    Directory of Open Access Journals (Sweden)

    Pelletier Eric

    2010-10-01

    Full Text Available Abstract Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C

  15. Cloning and characterization of cDNAs encoding S-RNases from almond (Prunus dulcis): primary structural features and sequence diversity of the S-RNases in Rosaceae.

    Science.gov (United States)

    Ushijima, K; Sassa, H; Tao, R; Yamane, H; Dandekar, A M; Gradziel, T M; Hirano, H

    1998-11-01

    cDNAs encoding three S-RNases of almond (Prunus dulcis), which belongs to the family Rosaceae, were cloned and sequenced. The comparison of amino acid sequences between the S-RNases of almond and those of other rosaceous species showed that the amino acid sequences of the rosaceous S-RNases are highly divergent, and intra-subfamilial similarities are higher than inter-subfamilial similarities. Twelve amino acid sequences of the rosaceous S-RNases were aligned to characterize their primary structural features. In spite of their high level of diversification, the rosaceous S-RNases were found to have five conserved regions, C1, C2, C3, C5, and RC4 which is Rosaceae-specific conserved region. Many variable sites fall into one region, named RHV. RHV is located at a similar position to that of the hypervariable region a (HVa) of the solanaceous S-RNases, and is assumed to be involved in recognizing S-specificity of pollen. On the other hand, the region corresponding to another solanaceous hypervariable region (HVb) was not variable in the rosaceous S-RNases. In the phylogenetic tree of the T2/S type RNase, the rosaceous S-RNase fall into two subfamily-specific groups (Amygdaloideae and Maloideae). The results of sequence comparisons and phylogenetic analysis imply that the present S-RNases of Rosaceae have diverged again relatively recently, after the divergence of subfamilies. PMID:9862480

  16. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States); Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section; Wain-Hobson, S. [Pasteur Inst., Paris (France)] [ed.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  17. Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Gclust Server Amino acid sequences used for clusterintg (Multi FASTA format) Data detail Data name Amino acid sequence...s used for clusterintg (Multi FASTA format) Description of data contents Amino acid sequences of p...a entries - FASTA format header: ID and the annotation information of the sequence... Site Policy | Contact Us Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive ...

  18. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    Science.gov (United States)

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  19. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    Science.gov (United States)

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.

  20. Contig sequences and their annotation (amino acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us Dicty_cDB Contig sequence...s and their annotation (amino acid sequence and results of homology search), and ex...pression profile Data detail Data name Contig sequences and their annotation (amino acid sequence and result...s of homology search), and expression profile Description of data contents Contig sequences of cDNA sequen...ces of Dictyostelium discoideum and their annotation (amino acid sequence and resul

  1. Host-Associated Genomic Features of the Novel Uncultured Intracellular Pathogen Ca. Ichthyocystis Revealed by Direct Sequencing of Epitheliocysts.

    Science.gov (United States)

    Qi, Weihong; Vaughan, Lloyd; Katharios, Pantelis; Schlapbach, Ralph; Seth-Smith, Helena M B

    2016-01-01

    Advances in single-cell and mini-metagenome sequencing have enabled important investigations into uncultured bacteria. In this study, we applied the mini-metagenome sequencing method to assemble genome drafts of the uncultured causative agents of epitheliocystis, an emerging infectious disease in the Mediterranean aquaculture species gilthead seabream. We sequenced multiple cyst samples and constructed 11 genome drafts from a novel beta-proteobacterial lineage, Candidatus Ichthyocystis. The draft genomes demonstrate features typical of pathogenic bacteria with an obligate intracellular lifestyle: a reduced genome of up to 2.6 Mb, reduced G + C content, and reduced metabolic capacity. Reconstruction of metabolic pathways reveals that Ca Ichthyocystis genomes lack all amino acid synthesis pathways, compelling them to scavenge from the fish host. All genomes encode type II, III, and IV secretion systems, a large repertoire of predicted effectors, and a type IV pilus. These are all considered to be virulence factors, required for adherence, invasion, and host manipulation. However, no evidence of lipopolysaccharide synthesis could be found. Beyond the core functions shared within the genus, alignments showed distinction into different species, characterized by alternative large gene families. These comprise up to a third of each genome, appear to have arisen through duplication and diversification, encode many effector proteins, and are seemingly critical for virulence. Thus, Ca Ichthyocystis represents a novel obligatory intracellular pathogenic beta-proteobacterial lineage. The methods used: mini-metagenome analysis and manual annotation, have generated important insights into the lifestyle and evolution of the novel, uncultured pathogens, elucidating many putative virulence factors including an unprecedented array of novel gene families. PMID:27190004

  2. Yeast prions and human prion-like proteins: sequence features and prediction methods.

    Science.gov (United States)

    Cascarina, Sean M; Ross, Eric D

    2014-06-01

    Prions are self-propagating infectious protein isoforms. A growing number of prions have been identified in yeast, each resulting from the conversion of soluble proteins into an insoluble amyloid form. These yeast prions have served as a powerful model system for studying the causes and consequences of prion aggregation. Remarkably, a number of human proteins containing prion-like domains, defined as domains with compositional similarity to yeast prion domains, have recently been linked to various human degenerative diseases, including amyotrophic lateral sclerosis. This suggests that the lessons learned from yeast prions may help in understanding these human diseases. In this review, we examine what has been learned about the amino acid sequence basis for prion aggregation in yeast, and how this information has been used to develop methods to predict aggregation propensity. We then discuss how this information is being applied to understand human disease, and the challenges involved in applying yeast prediction methods to higher organisms.

  3. Accurate single-sequence prediction of solvent accessible surface area using local and global features.

    Science.gov (United States)

    Faraggi, Eshel; Zhou, Yaoqi; Kloczkowski, Andrzej

    2014-11-01

    We present a new approach for predicting the Accessible Surface Area (ASA) using a General Neural Network (GENN). The novelty of the new approach lies in not using residue mutation profiles generated by multiple sequence alignments as descriptive inputs. Instead we use solely sequential window information and global features such as single-residue and two-residue compositions of the chain. The resulting predictor is both highly more efficient than sequence alignment-based predictors and of comparable accuracy to them. Introduction of the global inputs significantly helps achieve this comparable accuracy. The predictor, termed ASAquick, is tested on predicting the ASA of globular proteins and found to perform similarly well for so-called easy and hard cases indicating generalizability and possible usability for de-novo protein structure prediction. The source code and a Linux executables for GENN and ASAquick are available from Research and Information Systems at http://mamiris.com, from the SPARKS Lab at http://sparks-lab.org, and from the Battelle Center for Mathematical Medicine at http://mathmed.org. PMID:25204636

  4. The complete amino acid sequence of the basic nuclear protein of bull spermatozoa

    NARCIS (Netherlands)

    Coelingh, J.P.; Monfoort, Cornelis H.; Rozijn, Thomas H.; Gevers Leuven, Jan A.; Schiphof, R.; Steyn-Parvé, Elizabeth P.; Braunitzer, Gerhard; Schrank, Barbara; Ruhfus, Annette

    1972-01-01

    The complete amino acid sequence of the basic nuclear protein of bull spermatozoa has been established. The sequence was partially deduced by characterization of peptides isolated from thermolysine and chymotryptic digests of the reduced and S-aminoethylated protein. The complete sequence of the fir

  5. NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.

    Directory of Open Access Journals (Sweden)

    Pu Wang

    Full Text Available Nuclear receptors (NRs are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1 thyroid hormone like (NR1, (2 HNF4-like (NR2, (3 estrogen like, (4 nerve growth factor IB-like (NR4, (5 fushi tarazu-F1 like (NR5, (6 germ cell nuclear factor like (NR6, and (7 knirps like (NR0. The identification was made by the Fuzzy K nearest neighbor (FK-NN classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. The less the number of query proteins is, the shorter the time will

  6. NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.

    Science.gov (United States)

    Wang, Pu; Xiao, Xuan; Chou, Kuo-Chen

    2011-01-01

    Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. The less the number of query proteins is, the shorter the time will

  7. Nucleotide Sequence of a Chicken Vitellogenin Gene and Derived Amino Acid Sequence of the Encoded Yolk Precursor Protein

    NARCIS (Netherlands)

    Schip, Fred D. van het; Samallo, John; Broos, Jaap; Ophuis, Jan; Mojet, Mart; Gruber, Max; AB, Geert

    1987-01-01

    The gene encoding the major vitellogenin from chicken has been completely sequenced and its exon-intron organization has been established. The gene is 20,342 base-pairs long and contains 35 exons with a combined length of 5787 base-pairs. They encode the 1850-amino acid pre-peptide of vitellogenin,

  8. The amino acid sequence of elephant (Elephas maximus) myoglobin and the phylogeny of Proboscidea.

    Science.gov (United States)

    Dene, H; Goodman, M; Romero-Herrera, A E

    1980-02-13

    The complete amino acid sequence of skeletal myoglobin from the Asian elephant (Elephas maximus) is reported. The functional significance of variations seen when this sequence is compared with that of sperm whale myoglobin is explored in the light of the crystallographic model available for the latter molecule. The phylogenetic implications of the elephant myoglobin amino acid sequence are evaluated by using the maximum parsimony technique. A similar analysis is also presented which incorporates all of the proteins sequenced from the elephant. These results are discussed with respect to current views on proboscidean phylogeny.

  9. A conversational system for the computer analysis of nucleic acid sequences.

    OpenAIRE

    Sege, R; Söll, D.; Ruddle, F H; Queen, C

    1981-01-01

    We present a conversational system for the computer analysis of nucleic acid and protein sequences based on the well-known Queen and Korn program (1). The system can be used by persons with only minimal knowledge of computers.

  10. Event-related potential indices of congruency sequence effects without feature integration or contingency learning confounds.

    Science.gov (United States)

    Larson, Michael J; Clayson, Peter E; Kirwan, C Brock; Weissman, Daniel H

    2016-06-01

    The congruency effect in Stroop-like tasks (i.e., increased response time and reduced accuracy in incongruent relative to congruent trials) is often smaller when the previous trial was incongruent as compared to congruent. This congruency sequence effect (CSE) is thought to reflect cognitive control processes that shift attention to the target and/or modulate the response engendered by the distracter differently after incongruent relative to congruent trials. The neural signatures of CSEs are therefore usually attributed to cognitive control processes that minimize distraction from irrelevant stimuli. However, CSEs in previous functional neuroimaging studies were ubiquitously confounded with feature integration and/or contingency learning processes. We therefore investigated whether a neural CSE can be observed without such confounds in a group of healthy young adults (n = 56). To this end, we combined a prime-probe task that lacks such confounds with high-density ERPs to identify, for the first time, the neural time course of confound-minimized CSEs. Replicating recent behavioral findings, we observed strong CSEs in this task for mean response time and mean accuracy. Critically, conceptually replicating prior ERP results from confounded tasks, we also observed a CSE in both the parietal conflict slow potential (conflict SP) and the frontomedial N450. These findings indicate for the first time that neural CSEs as indexed by ERPs can be observed without the typical confounds. More broadly, the present study provides a confound-minimized protocol that will help future researchers to better isolate the neural bases of control processes that minimize distraction from irrelevant stimuli. PMID:26854028

  11. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    Science.gov (United States)

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  12. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    Science.gov (United States)

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  13. Amino Acid Sequence - KOME | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us ...ll_sequence_amino_db.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST...ta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_ine_full_sequence_amino_db.fasta.zip Fi...date History of This Database Site Policy | Contact Us Amino Acid Sequence - KOME | LSDB Archive ...

  14. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid

    Science.gov (United States)

    Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid. PMID:27660792

  15. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Science.gov (United States)

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid. PMID:27660792

  16. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    Science.gov (United States)

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  17. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Science.gov (United States)

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid.

  18. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M;

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250-555 p...

  19. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    Science.gov (United States)

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  20. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    Science.gov (United States)

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  1. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    Science.gov (United States)

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  2. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    Science.gov (United States)

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  3. Protein chemotaxonomy. XIII. Amino acid sequence of ferredoxin from Panax ginseng.

    Science.gov (United States)

    Mino, Yoshiki

    2006-08-01

    The complete amino acid sequence of [2Fe-2S] ferredoxin from Panax ginseng (Araliaceae) has been determined by automated Edman degradation of the entire S-carboxymethylcysteinyl protein and of the peptides obtained by enzymatic digestion. This ferredoxin has a unique amino acid sequence, which includes an insertion of Tyr at the 3rd position from the amino-terminus and a deletion of two amino acid residues at the carboxyl terminus. This ferredoxin had 18 differences in its amino acid sequence compared to that of Petroselinum sativum (Umbelliferae). In contrast, 23-33 differences were observed compared to other dicotyledonous plants. This suggests that Panax ginseng is related taxonomically to umbelliferous plants. PMID:16880642

  4. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry.

    Science.gov (United States)

    Carter, Charles W; Wolfenden, Richard

    2016-01-01

    The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.

  5. PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.

    Directory of Open Access Journals (Sweden)

    Huilin Wang

    Full Text Available X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed 'PredPPCrys' using the support vector machine (SVM. Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I. Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II, which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. In addition, the predicted crystallization

  6. Amino acid sequence heterogeneity of the chromosomal encoded Borrelia burgdorferi sensu lato major antigen P100.

    Science.gov (United States)

    Fellinger, W; Farencena, A; Redl, B; Sambri, V; Cevenini, R; Stöffler, G

    1995-04-01

    The entire nucleotide sequence of the chromosomal encoded major antigen p100 of the European Borrelia garinii isolate B29 was determined and the deduced amino acid sequence was compared to the homologous antigen p83 of the North American Borrelia burgdorferi sensu stricto strain B31 and the p100 of the European Borrelia afzelii (group VS461) strain PKo. p100 of strain B29 shows 87% amino acid sequence identity to strain B31 and 79.2% to strain PKo, p100 of strain B31 and PKo shows 62.5% identity to each other. In addition, partial nucleotide sequences of the most heterogeneous region of the p100 gene of two other Borrelia garinii isolates (PBi and VS286) have been determined and the deduced amino acid sequences were compared with all p100 of Borrelia garinii published so far. We found an amino acid sequence identity between 88.6 and 100% within the same genospecies. The N-terminal part of the p100 proteins is highly conserved whereas a striking heterogeneous region within the C-terminal part of the proteins was observed.

  7. Prediction of GPCR-G Protein Coupling Specificity Using Features of Sequences and Biological Functions

    Institute of Scientific and Technical Information of China (English)

    Toshihide Ono; Haretsugu Hishigaki

    2006-01-01

    Understanding the coupling specificity between G protein-coupled receptors (GPCRs) and specific classes of G proteins is important for further elucidation of receptor functions within a cell. Increasing information on GPCR sequences and the G protein family would facilitate prediction of the coupling properties of GPCRs. In this study, we describe a novel approach for predicting the coupling specificity between GPCRs and G proteins. This method uses not only GPCR sequences but also the functional knowledge generated by natural language processing, and can achieve 92.2% prediction accuracy by using the C4.5 algorithm.Furthermore, rules related to GPCR-G protein coupling are generated. The combination of sequence analysis and text mining improves the prediction accuracy for GPCR-G protein coupling specificity, and also provides clues for understanding GPCR signaling.

  8. Introduction of restriction enzyme sites in protein-coding DNA sequences by site-specific mutagenesis not affecting the amino acid sequence: a computer program.

    OpenAIRE

    Arentzen, R; Ripka, W. C.

    1984-01-01

    Structure/function relationship studies of proteins are greatly facilitated by recombinant DNA technology which allows specific amino acid mutations to be made at the DNA sequence level by site-specific mutagenesis employing synthetic oligonucleotides. This technique has been successfully used to alter one or two amino acids in a protein. Replacement of existing DNA sequence coding for several amino acids with new synthetic DNA fragments would be facilitated by the presence of unique restrict...

  9. Infants' Sensitivity to the Causal Features of Means-End Support Sequences in Action and Perception

    Science.gov (United States)

    Sommerville, Jessica A.; Woodward, Amanda L.

    2005-01-01

    Current work has yielded differential findings regarding infants' ability to perceptually detect the causal structure of a means-end support sequence. Resolving this debate has important implications for perception-action dissociations in this domain of object knowledge. In Study 1, 12-month-old infants' ability to perceive the causal structure of…

  10. Amino acid sequence of phospholipase A/sub 2/-. cap alpha. from the venom of Crotalus adamanteus

    Energy Technology Data Exchange (ETDEWEB)

    Heinrikson, R.L.; Krueger, E.T.; Keim, P.S.

    1977-07-25

    The complete amino acid sequence of Crotalus adamanteus venom phospholipase A/sub 2/-..cap alpha.. has been determined by analysis of the five tryptic peptides from the citraconylated, reduced, and S-(/sup 14/C)carboxamidomethylated enzyme. Earlier studies provided the information necessary to align the tryptic fragments so that secondary cleavage procedures to establish overlaps were unnecessary. The subunit in the phospholipase A/sub 2/-..cap alpha.. dimer is a single polypeptide chain containing 122 amino acids and seven disulfide bonds. The histidine residue implicated in the active site of mammalian phospholipases is at position 47 in the C. adamanteus enzyme and is located in a domain of the molecule which is highly homologous in sequence with corresponding regions of phospholipases from a variety of venom and pancreatic sources. Comparative sequence analysis has revealed insights with regard to the function and evolution of phospholipases A/sub 2/. Primary structural relationships observed among the snake venom enzymes parallel the phylogenetic classification of the venomous reptiles from which they were derived. It is proposed that phospholipases A/sub 2/ of this general type be divided into two groups depending upon the presence or absence of distinctive structural features elucidated in this study.

  11. Coding Local and Global Binary Visual Features Extracted From Video Sequences.

    Science.gov (United States)

    Baroffio, Luca; Canclini, Antonio; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

    2015-11-01

    Binary local features represent an effective alternative to real-valued descriptors, leading to comparable results for many visual analysis tasks while being characterized by significantly lower computational complexity and memory requirements. When dealing with large collections, a more compact representation based on global features is often preferred, which can be obtained from local features by means of, e.g., the bag-of-visual word model. Several applications, including, for example, visual sensor networks and mobile augmented reality, require visual features to be transmitted over a bandwidth-limited network, thus calling for coding techniques that aim at reducing the required bit budget while attaining a target level of efficiency. In this paper, we investigate a coding scheme tailored to both local and global binary features, which aims at exploiting both spatial and temporal redundancy by means of intra- and inter-frame coding. In this respect, the proposed coding scheme can conveniently be adopted to support the analyze-then-compress (ATC) paradigm. That is, visual features are extracted from the acquired content, encoded at remote nodes, and finally transmitted to a central controller that performs the visual analysis. This is in contrast with the traditional approach, in which visual content is acquired at a node, compressed and then sent to a central unit for further processing, according to the compress-then-analyze (CTA) paradigm. In this paper, we experimentally compare the ATC and the CTA by means of rate-efficiency curves in the context of two different visual analysis tasks: 1) homography estimation and 2) content-based retrieval. Our results show that the novel ATC paradigm based on the proposed coding primitives can be competitive with the CTA, especially in bandwidth limited scenarios. PMID:26080384

  12. Sequence Pattern Correlation of Amino Acid in Collision-induced Dissociation Electrospray Ionization Mass Spectrometry

    Institute of Scientific and Technical Information of China (English)

    SONG,Hao-Wei(宋浩威); YUE,Gui-Hua(岳贵花); LU,Yu(陆宇); YANG,Peng-Yuan(杨芃原); WANG,Hong-Hai(王洪海)

    2002-01-01

    A novel approach of sequence pattern correlation has been applied to predict an expected amino acid sequence from CID ESI-MS spectra. The proposed approach deduces sequence patterns with no help from known protein database such that it is useful to identify an unknown peptide or new protein. The algorithm applies a cross-correlation to match an experimental CID spectrum with predicted sequence pattern generated from fragmentation information. The fragmentation knowledge of both y-series and other non y-series are utilized to generate the predicted sequence patterns. In contrast to the normal de novo approach, the proposed approach is insensitive to mass tolerance and non-susceptive to spectral integrality with no need for selection of a starting point.

  13. Structural and Morphological Features of Acid-Bearing Polymers for PEM Fuel Cells

    DEFF Research Database (Denmark)

    Yang, Yunsong; Siu, Ana; Peckham, Timothy J.;

    2008-01-01

    Chemical structure, polymer microstructure, sequence distribution, and morphology of acid-bearing polymers are important factors in the design of polymer electrolyte membranes (PEMs) for fuel cells. The roles of ion aggregation and phase separation in vinylic- and aromatic-based polymers in proton...... significantly enhance the transport of water and protons. For membranes that contain a high content of water, phase separation is less influential. Continuity of ionic aggregates is influential on the diffusion of water and electroosmotic drag within a membrane. A balance of these properties must be considered...

  14. Prediction of protein structural features from sequence data based on Shannon entropy and Kolmogorov complexity.

    Science.gov (United States)

    Bywater, Robert Paul

    2015-01-01

    While the genome for a given organism stores the information necessary for the organism to function and flourish it is the proteins that are encoded by the genome that perhaps more than anything else characterize the phenotype for that organism. It is therefore not surprising that one of the many approaches to understanding and predicting protein folding and properties has come from genomics and more specifically from multiple sequence alignments. In this work I explore ways in which data derived from sequence alignment data can be used to investigate in a predictive way three different aspects of protein structure: secondary structures, inter-residue contacts and the dynamics of switching between different states of the protein. In particular the use of Kolmogorov complexity has identified a novel pathway towards achieving these goals.

  15. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    OpenAIRE

    Hiroshi Katoh; Shin-Ichi Miyata; Hiromitsu Inoue; Toru Iwanami

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative 'C...

  16. Characterization of soybean genomic features by analysis of its expressed sequence tags

    DEFF Research Database (Denmark)

    Tian, Ai-Guo; Wang, Jun; Cui, Peng;

    2004-01-01

    to be fast-evolving. Soybean unigenes with no match to genes within the Arabidopsis genome were identified as soybean-specific genes. These genes were mainly involved in nodule development and the synthesis of seed storage proteins. In addition, we also identified 61 genes regulated by salicylic acid, 1...

  17. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker "retinoic acid" in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5'-AGGTCA-3') in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and embryological

  18. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker "retinoic acid" in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5'-AGGTCA-3') in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and embryological

  19. Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework

    OpenAIRE

    Shaokang Chen; Sandra Mau; Harandi, Mehrtash T.; Conrad Sanderson; Abbas Bigdeli; Lovell, Brian C.

    2011-01-01

    Although automatic faces recognition has shown success for high-quality images under controlled conditions, for video-based recognition it is hard to attain similar levels of performance. We describe in this paper recent advances in a project being undertaken to trial and develop advanced surveillance systems for public safety. In this paper, we propose a local facial feature based framework for both still image and video-based face recognition. The evaluation is performed on a still image d...

  20. ProViz-a web-based visualization tool to investigate the functional and evolutionary features of protein sequences.

    Science.gov (United States)

    Jehl, Peter; Manguy, Jean; Shields, Denis C; Higgins, Desmond G; Davey, Norman E

    2016-07-01

    Low-throughput experiments and high-throughput proteomic and genomic analyses have created enormous quantities of data that can be used to explore protein function and evolution. The ability to consolidate these data into an informative and intuitive format is vital to our capacity to comprehend these distinct but complementary sources of information. However, existing tools to visualize protein-related data are restricted by their presentation, sources of information, functionality or accessibility. We introduce ProViz, a powerful browser-based tool to aid biologists in building hypotheses and designing experiments by simplifying the analysis of functional and evolutionary features of proteins. Feature information is retrieved in an automated manner from resources describing protein modular architecture, post-translational modification, structure, sequence variation and experimental characterization of functional regions. These features are mapped to evolutionary information from precomputed multiple sequence alignments. Data are displayed in an interactive and information-rich yet intuitive visualization, accessible through a simple protein search interface. This allows users with limited bioinformatic skills to rapidly access data pertinent to their research. Visualizations can be further customized with user-defined data either manually or using a REST API. ProViz is available at http://proviz.ucd.ie/.

  1. ProViz-a web-based visualization tool to investigate the functional and evolutionary features of protein sequences.

    Science.gov (United States)

    Jehl, Peter; Manguy, Jean; Shields, Denis C; Higgins, Desmond G; Davey, Norman E

    2016-07-01

    Low-throughput experiments and high-throughput proteomic and genomic analyses have created enormous quantities of data that can be used to explore protein function and evolution. The ability to consolidate these data into an informative and intuitive format is vital to our capacity to comprehend these distinct but complementary sources of information. However, existing tools to visualize protein-related data are restricted by their presentation, sources of information, functionality or accessibility. We introduce ProViz, a powerful browser-based tool to aid biologists in building hypotheses and designing experiments by simplifying the analysis of functional and evolutionary features of proteins. Feature information is retrieved in an automated manner from resources describing protein modular architecture, post-translational modification, structure, sequence variation and experimental characterization of functional regions. These features are mapped to evolutionary information from precomputed multiple sequence alignments. Data are displayed in an interactive and information-rich yet intuitive visualization, accessible through a simple protein search interface. This allows users with limited bioinformatic skills to rapidly access data pertinent to their research. Visualizations can be further customized with user-defined data either manually or using a REST API. ProViz is available at http://proviz.ucd.ie/. PMID:27085803

  2. Multimodal emotional state recognition using sequence-dependent deep hierarchical features.

    Science.gov (United States)

    Barros, Pablo; Jirak, Doreen; Weber, Cornelius; Wermter, Stefan

    2015-12-01

    Emotional state recognition has become an important topic for human-robot interaction in the past years. By determining emotion expressions, robots can identify important variables of human behavior and use these to communicate in a more human-like fashion and thereby extend the interaction possibilities. Human emotions are multimodal and spontaneous, which makes them hard to be recognized by robots. Each modality has its own restrictions and constraints which, together with the non-structured behavior of spontaneous expressions, create several difficulties for the approaches present in the literature, which are based on several explicit feature extraction techniques and manual modality fusion. Our model uses a hierarchical feature representation to deal with spontaneous emotions, and learns how to integrate multiple modalities for non-verbal emotion recognition, making it suitable to be used in an HRI scenario. Our experiments show that a significant improvement of recognition accuracy is achieved when we use hierarchical features and multimodal information, and our model improves the accuracy of state-of-the-art approaches from 82.5% reported in the literature to 91.3% for a benchmark dataset on spontaneous emotion expressions.

  3. Multimodal emotional state recognition using sequence-dependent deep hierarchical features.

    Science.gov (United States)

    Barros, Pablo; Jirak, Doreen; Weber, Cornelius; Wermter, Stefan

    2015-12-01

    Emotional state recognition has become an important topic for human-robot interaction in the past years. By determining emotion expressions, robots can identify important variables of human behavior and use these to communicate in a more human-like fashion and thereby extend the interaction possibilities. Human emotions are multimodal and spontaneous, which makes them hard to be recognized by robots. Each modality has its own restrictions and constraints which, together with the non-structured behavior of spontaneous expressions, create several difficulties for the approaches present in the literature, which are based on several explicit feature extraction techniques and manual modality fusion. Our model uses a hierarchical feature representation to deal with spontaneous emotions, and learns how to integrate multiple modalities for non-verbal emotion recognition, making it suitable to be used in an HRI scenario. Our experiments show that a significant improvement of recognition accuracy is achieved when we use hierarchical features and multimodal information, and our model improves the accuracy of state-of-the-art approaches from 82.5% reported in the literature to 91.3% for a benchmark dataset on spontaneous emotion expressions. PMID:26548943

  4. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Science.gov (United States)

    2010-07-01

    ... Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of WIPO... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences... Listing” referred to in paragraph (c) of this section must also be submitted in computer readable...

  5. Nucleic acid sequence-based amplification with oligochromatography for detection of Trypanosoma brucei in clinical samples

    NARCIS (Netherlands)

    C.M. Mugasa; T. Laurent; G.J. Schoone; P.A. Kager; G.W. Lubega; H.D.F.H. Schallig

    2009-01-01

    Molecular tools, such as real-time nucleic acid sequence-based amplification (NASBA) and PCR, have been developed to detect Trypanosoma brucei parasites in blood for the diagnosis of human African trypanosomiasis (HAT). Despite good sensitivity, these techniques are not implemented in HAT control pr

  6. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    OpenAIRE

    Katoh, Hiroshi; Miyata, Shin-ichi; Inoue, Hiromitsu; Iwanami, Toru

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, ‘Candidatus Liberibacter asiaticus’, ‘Ca. L. americanus’, and ‘Ca. L. africanus’. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative ‘C...

  7. Sequence Features of Drosha and Dicer Cleavage Sites Affect the Complexity of IsomiRs

    OpenAIRE

    Julia Starega-Roslan; Witkos, Tomasz M.; Paulina Galka-Marciniak; Krzyzosiak, Wlodzimierz J.

    2015-01-01

    The deep-sequencing of small RNAs has revealed that different numbers and proportions of miRNA variants called isomiRs are formed from single miRNA genes and that this effect is attributable mainly to imprecise cleavage by Drosha and Dicer. Factors that influence the degree of cleavage precision of Drosha and Dicer are under investigation, and their identification may improve our understanding of the mechanisms by which cells modulate the regulatory potential of miRNAs. In this study, we focu...

  8. Global features of sequences of bacterial chromosomes, plasmids and phages revealed by analysis of oligonucleotide usage patterns

    Directory of Open Access Journals (Sweden)

    Tümmler Burkhard

    2004-07-01

    Full Text Available Abstract Background Oligonucleotide frequencies were shown to be conserved signatures for bacterial genomes, however, the underlying constraints have yet not been resolved in detail. In this paper we analyzed oligonucleotide usage (OU biases in a comprehensive collection of 155 completely sequenced bacterial chromosomes, 316 plasmids and 104 phages. Results Two global features were analyzed: pattern skew (PS and variance of OU deviations normalized by mononucleotide content of the sequence (OUV. OUV reflects the strength of OU biases and taxonomic signals. PS denotes asymmetry of OU in direct and reverse DNA strands. A trend towards minimal PS was observed for almost all complete sequences of bacterial chromosomes and plasmids, however, PS was substantially higher in separate genomic loci and several types of plasmids and phages characterized by long stretches of non-coding DNA and/or asymmetric gene distribution on the two DNA strands. Five of the 155 bacterial chromosomes have anomalously high PS, of which the chromosomes of Xylella fastidiosa 9a5c and Prochlorococcus marinus MIT9313 exhibit extreme PS values suggesting an intermediate unstable state of these two genomes. Conclusions Strand symmetry as indicated by minimal PS is a universally conserved feature of complete bacterial genomes that results from the matching mutual compensation of local OU biases on both replichors while OUV is more a taxon specific feature. Local events such as inversions or the incorporation of genome islands are balanced by global changes in genome organization to minimize PS that may represent one of the leading evolutionary forces driving bacterial genome diversification.

  9. Emotion Recognition based on 2D-3D Facial Feature Extraction from Color Image Sequences

    Directory of Open Access Journals (Sweden)

    Robert Niese

    2010-10-01

    Full Text Available Normal 0 21 false false false DE X-NONE X-NONE /* Style Definitions */ table.MsoNormalTable {mso-style-name:"Normale Tabelle"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-qformat:yes; mso-style-parent:""; mso-padding-alt:0cm 5.4pt 0cm 5.4pt; mso-para-margin:0cm; mso-para-margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:11.0pt; font-family:"Calibri","sans-serif"; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-fareast-font-family:"Times New Roman"; mso-fareast-theme-font:minor-fareast; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin; mso-bidi-font-family:"Times New Roman"; mso-bidi-theme-font:minor-bidi;} In modern human computer interaction systems, emotion recognition from video is becoming an imperative feature. In this work we propose a new method for automatic recognition of facial expressions related to categories of basic emotions from image data. Our method incorporates a series of image processing, low level 3D computer vision and pattern recognition techniques. For image feature extraction, color and gradient information is used. Further, in terms of 3D processing, camera models are applied along with an initial registration step, in which person specific face models are automatically built from stereo. Based on these face models, geometric feature measures are computed and normalized using photogrammetric techniques. For recognition this normalization leads to minimal mixing between different emotion classes, which are determined with an artificial neural network classifier. Our framework achieves robust and superior classification results, also across a variety of head poses with resulting perspective foreshortening and changing face size. Results are presented for domestic and publicly available databases.

  10. MRI Sequence and Characteristic Features in ‘Giant Cell Tumor’ of Clivus

    OpenAIRE

    Mahale, Ajit; K.V.N, Dhananjaya; Pai, Muralidhar; Poornima, Vinaya; Sahu, Kausalya Kumari

    2013-01-01

    Giant cell tumours of the clivus are rare. These tumours present in the second and third decades of life and they are slightly more frequent in women than in men. We are presenting a case of a 20 years young patient who came with the complaints of headache, retro-orbital pain and recurrent transient bleeding from the nose since two and half months. MRI of the brain with contrast was done and its features were suggestive of a Giant cell tumour of the clivus. A transnasal endoscopic biopsy was ...

  11. MRI sequence and characteristic features in 'giant cell tumor' of clivus.

    Science.gov (United States)

    Mahale, Ajit; K V N, Dhananjaya; Pai, Muralidhar; Poornima, Vinaya; Sahu, Kausalya Kumari

    2013-06-01

    Giant cell tumours of the clivus are rare. These tumours present in the second and third decades of life and they are slightly more frequent in women than in men. We are presenting a case of a 20 years young patient who came with the complaints of headache, retro-orbital pain and recurrent transient bleeding from the nose since two and half months. MRI of the brain with contrast was done and its features were suggestive of a Giant cell tumour of the clivus. A transnasal endoscopic biopsy was done under general anaesthesia and the histopathology report suggested that the features were of a giant cell tumour. Excision of the mass was done by Transnasal endoscopy. Post operatively, the patient did not recover from the lateral rectus palsy which was there on the right side. The patient was discharged with an advice of a follow up and radiotherapy. Radiation therapy and chemotherapy may be effective as adjuvant treatments. Even though a recurrence usually occurs within 4 years of the initial treatment, these patients will need to be carefully followed for the remainder of their lives. PMID:23905141

  12. Structural features of conopeptide genes inferred from partial sequences of the Conus tribblei genome.

    Science.gov (United States)

    Barghi, Neda; Concepcion, Gisela P; Olivera, Baldomero M; Lluisma, Arturo O

    2016-02-01

    The evolvability of venom components (in particular, the gene-encoded peptide toxins) in venomous species serves as an adaptive strategy allowing them to target new prey types or respond to changes in the prey field. The structure, organization, and expression of the venom peptide genes may provide insights into the molecular mechanisms that drive the evolution of such genes. Conus is a particularly interesting group given the high chemical diversity of their venom peptides, and the rapid evolution of the conopeptide-encoding genes. Conus genomes, however, are large and characterized by a high proportion of repetitive sequences. As a result, the structure and organization of conopeptide genes have remained poorly known. In this study, a survey of the genome of Conus tribblei was undertaken to address this gap. A partial assembly of C. tribblei genome was generated; the assembly, though consisting of a large number of fragments, accounted for 2160.5 Mb of sequence. A large number of repetitive genomic elements consisting of 642.6 Mb of retrotransposable elements, simple repeats, and novel interspersed repeats were observed. We characterized the structural organization and distribution of conotoxin genes in the genome. A significant number of conopeptide genes (estimated to be between 148 and 193) belonging to different superfamilies with complete or nearly complete exon regions were observed, ~60 % of which were expressed. The unexpressed conopeptide genes represent hidden but significant conotoxin diversity. The conotoxin genes also differed in the frequency and length of the introns. The interruption of exons by long introns in the conopeptide genes and the presence of repeats in the introns may indicate the importance of introns in facilitating recombination, evolution and diversification of conotoxins. These findings advance our understanding of the structural framework that promotes the gene-level molecular evolution of venom peptides.

  13. Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features

    Science.gov (United States)

    Li, Yuan; Wang, Mingjun; Wang, Huilin; Tan, Hao; Zhang, Ziding; Webb, Geoffrey I.; Song, Jiangning

    2014-07-01

    Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed as SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates functional features, applied for lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that are not previously identified. The results along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation.

  14. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    Science.gov (United States)

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided. PMID:11414222

  15. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    DEFF Research Database (Denmark)

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active...... related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein...

  16. Deep sequencing of the transcriptome reveals inflammatory features of porcine visceral adipose tissue.

    Science.gov (United States)

    Wang, Tao; Jiang, Anan; Guo, Yanqin; Tan, Ya; Tang, Guoqing; Mai, Miaomiao; Liu, Haifeng; Xiao, Jian; Li, Mingzhou; Li, Xuewei

    2013-01-01

    Functional differences in the different types of adipose tissue and the impact of their dysfunction on metabolism are associated with the regional distribution of adipose depots. Here we show a genome-wide comparison between the transcriptomes of one source of subcutaneous and two sources of visceral adipose tissue in the pig using an RNA-seq approach. We obtained ~32.3 million unique mapped reads which covered ~80.2% of the current annotated transcripts across these three sources of adipose tissue. We identified various genes differentially expressed between subcutaneous and visceral adipose tissue, which are potentially associated with the inflammatory features of visceral adipose tissue. These results are of benefit for understanding the phenotypic, metabolic and functional differences between different types of adipose tissue that are deposited in different body sites.

  17. Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

    Science.gov (United States)

    Rodrigue, Nicolas; Philippe, Hervé; Lartillot, Nicolas

    2010-03-01

    Modeling the interplay between mutation and selection at the molecular level is key to evolutionary studies. To this end, codon-based evolutionary models have been proposed as pertinent means of studying long-range evolutionary patterns and are widely used. However, these approaches have not yet consolidated results from amino acid level phylogenetic studies showing that selection acting on proteins displays strong site-specific effects, which translate into heterogeneous amino acid propensities across the columns of alignments; related codon-level studies have instead focused on either modeling a single selective context for all codon columns, or a separate selective context for each codon column, with the former strategy deemed too simplistic and the latter deemed overparameterized. Here, we integrate recent developments in nonparametric statistical approaches to propose a probabilistic model that accounts for the heterogeneity of amino acid fitness profiles across the coding positions of a gene. We apply the model to a dozen real protein-coding gene alignments and find it to produce biologically plausible inferences, for instance, as pertaining to site-specific amino acid constraints, as well as distributions of scaled selection coefficients. In their account of mutational features as well as the heterogeneous regimes of selection at the amino acid level, the modeling approaches studied here can form a backdrop for several extensions, accounting for other selective features, for variable population size, or for subtleties of mutational features, all with parameterizations couched within population-genetic theory. PMID:20176949

  18. New features in the genus Ilarvirus revealed by the nucleotide sequence of Fragaria chiloensis latent virus.

    Science.gov (United States)

    Tzanetakis, Ioannis E; Martin, Robert R

    2005-09-01

    Fragaria chiloensis latent virus (FClLV), a member of the genus Ilarvirus was first identified in the early 1990s. Double-stranded RNA was extracted from FClLV infected plants and cloned. The complete nucleotide sequence of the virus has been elucidated. RNA 1 encodes a protein with methyltransferase and helicase enzymatic motifs while RNA 2 encodes the viral RNA dependent RNA polymerase and an ORF, that shares no homology with other Ilarvirus genes. RNA 3 codes for movement and coat proteins and an additional ORF, making FClLV possibly the first Ilarvirus encoding a third protein in RNA 3. Phylogenetic analysis reveals that FClLV is most closely related to Prune dwarf virus, the type member of subgroup 4 of the Ilarvirus genus. FClLV is also closely related to Alfalfa mosaic virus (AlMV), a virus that shares many properties with Ilarviruses . We propose the reclassification of AlMV as a member of the Ilarvirus genus instead of being a member of a distinct genus. PMID:15878214

  19. Unique features of a Japanese 'Candidatus Liberibacter asiaticus' strain revealed by whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    Hiroshi Katoh

    Full Text Available Citrus greening (huanglongbing is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol, in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative 'Ca. L. asiaticus' Japanese isolate Ishi-1 was determined by metagenomic analysis of DNA extracted from 'Ca. L. asiaticus'-infected psyllids and leaf midribs. The 1.19-Mb genome has an average 36.32% GC content. Annotation revealed 13 operons encoding rRNA and 44 tRNA genes, but no typical bacterial pathogenesis-related genes were located within the genome, similar to the Floridian psy62 and Chinese gxpsy. In contrast to other 'Ca. L. asiaticus' strains, the genome of the Japanese Ishi-1 strain lacks a prophage-related region.

  20. The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae.

    Science.gov (United States)

    Choi, Kyoung Su; Park, SeonJoo

    2015-11-10

    Aster spathulifolius, a member of the Asteraceae family, is distributed along the coast of Japan and Korea. This plant is used for medicinal and ornamental purposes. The complete chloroplast (cp) genome of A. sphathulifolius consists of 149,473 bp that include a pair of inverted repeats of 24,751 bp separated by a large single copy region of 81,998 bp and a small single copy region of 17,973 bp. The chloroplast genome contains 78 coding genes, four rRNA genes and 29 tRNA genes. When compared to other cpDNA sequences of Asteraceae, A. spathulifolius showed the closest relationship with Jacobaea vulgaris, and its atpB gene was found to be a pseudogene, unlike J. vulgaris. Furthermore, evaluation of the gene compositions of J. vulgaris, Helianthus annuus, Guizotia abyssinica and A. spathulifolius revealed that 13.6-kb showed inversion from ndhF to rps15, unlike Lactuca of Asteraceae. Comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates with J. vulgaris revealed that synonymous genes related to a small subunit of the ribosome showed the highest value (0.1558), while nonsynonymous rates of genes related to ATP synthase genes were highest (0.0118). These findings revealed that substitution has occurred at similar rates in most genes, and the substitution rates suggested that most genes is a purified selection. PMID:26164759

  1. Grain-size features of a Miocene loess-soil sequence at Qinan: Implications on its origin

    Institute of Scientific and Technical Information of China (English)

    QIAO; Yansong; GUO; Zhengtang; HAO; Qingzhen; YIN; Qiuzhen; YUAN; Baoyin; LIU; Tungsheng

    2006-01-01

    In this study, grain-size of 507 bulk samples from the QA-I Miocene loess-soil sequenceat Qinan were analyzed, and the grain-size features are compared with those of typical Quaternary loess and soil samples, representative lacustrine and fluvial samples. The results indicate that the grain-size distribution pattern of the Miocene loess is essentially similar to that of Quaternary loess,but greatly differs from the lacustrine and fluvial sediments. Loess layers are regularly coarser than soil layers, indicating cyclical climate changes. Median grain-size along the 253.1 m sequence varies from 6 to 13 μm and the >63 μm fraction represents only 5.3% in maximum, 0.9% in average.Long-term grain-size variations are consistent with the loess accumulation rate at Qinan and with the eolian mass accumulation rate in the North Pacific. These features firmly indicate an eolian origin of the studied sequence, and also reveal a coeval changes between the long-term changes of eolian grain-size and continental aridity in the dust source regions.

  2. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    Science.gov (United States)

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  3. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    Science.gov (United States)

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand. PMID:21402111

  4. Chromatin features of plant telomeric sequences at terminal versus internal positions

    Directory of Open Access Journals (Sweden)

    Eva eMajerová

    2014-11-01

    Full Text Available Epigenetic mechanisms are involved in regulation of crucial cellular processes in eukaryotic organisms. Data on the epigenetic features of plant telomeres and their epigenetic regulation were published mostly for Arabidopsis thaliana, in which the presence of interstitial telomeric repeats (ITRs may interfere with genuine telomeres in most analyses. Here, we studied the epigenetic landscape and transcription of telomeres and ITRs in Nicotiana tabacum with long telomeres and no detectable ITRs, and in Ballantinia antipoda with large blocks of pericentromeric ITRs and relatively short telomeres. Chromatin of genuine telomeres displayed heterochromatic as well as euchromatic marks, while ITRs were just heterochromatic. Methylated cytosines were present at telomeres and ITRs, but showed a bias with more methylation towards distal telomere positions and different blocks of B. antipoda ITRs methylated to different levels. Telomeric transcripts TERRA (G-rich and ARRET (C-rich were identified in both plants and their levels varied among tissues with a maximum in blossoms. Plants with substantially different proportions of internally and terminally located telomeric repeats are instrumental in clarifying the chromatin status of telomeric repeats at distinct chromosome locations.

  5. A Systematic Evaluation of Feature Selection and Classification Algorithms Using Simulated and Real miRNA Sequencing Data.

    Science.gov (United States)

    Yang, Sheng; Guo, Li; Shao, Fang; Zhao, Yang; Chen, Feng

    2015-01-01

    Sequencing is widely used to discover associations between microRNAs (miRNAs) and diseases. However, the negative binomial distribution (NB) and high dimensionality of data obtained using sequencing can lead to low-power results and low reproducibility. Several statistical learning algorithms have been proposed to address sequencing data, and although evaluation of these methods is essential, such studies are relatively rare. The performance of seven feature selection (FS) algorithms, including baySeq, DESeq, edgeR, the rank sum test, lasso, particle swarm optimistic decision tree, and random forest (RF), was compared by simulation under different conditions based on the difference of the mean, the dispersion parameter of the NB, and the signal to noise ratio. Real data were used to evaluate the performance of RF, logistic regression, and support vector machine. Based on the simulation and real data, we discuss the behaviour of the FS and classification algorithms. The Apriori algorithm identified frequent item sets (mir-133a, mir-133b, mir-183, mir-937, and mir-96) from among the deregulated miRNAs of six datasets from The Cancer Genomics Atlas. Taking these findings altogether and considering computational memory requirements, we propose a strategy that combines edgeR and DESeq for large sample sizes. PMID:26508990

  6. A Systematic Evaluation of Feature Selection and Classification Algorithms Using Simulated and Real miRNA Sequencing Data

    Directory of Open Access Journals (Sweden)

    Sheng Yang

    2015-01-01

    Full Text Available Sequencing is widely used to discover associations between microRNAs (miRNAs and diseases. However, the negative binomial distribution (NB and high dimensionality of data obtained using sequencing can lead to low-power results and low reproducibility. Several statistical learning algorithms have been proposed to address sequencing data, and although evaluation of these methods is essential, such studies are relatively rare. The performance of seven feature selection (FS algorithms, including baySeq, DESeq, edgeR, the rank sum test, lasso, particle swarm optimistic decision tree, and random forest (RF, was compared by simulation under different conditions based on the difference of the mean, the dispersion parameter of the NB, and the signal to noise ratio. Real data were used to evaluate the performance of RF, logistic regression, and support vector machine. Based on the simulation and real data, we discuss the behaviour of the FS and classification algorithms. The Apriori algorithm identified frequent item sets (mir-133a, mir-133b, mir-183, mir-937, and mir-96 from among the deregulated miRNAs of six datasets from The Cancer Genomics Atlas. Taking these findings altogether and considering computational memory requirements, we propose a strategy that combines edgeR and DESeq for large sample sizes.

  7. The amino acid sequence of cytochrome c from Cucurbita maxima L. (pumpkin)

    Science.gov (United States)

    Thompson, E. W.; Richardson, M.; Boulter, D.

    1971-01-01

    The amino acid sequence of pumpkin cytochrome c was determined on 2μmol of protein. Some evidence was found for the occurrence of two forms of cytochrome c, whose sequences differed in three positions. Pumpkin cytochrome c consists of 111 residues and is homologous with mitochondrial cytochromes c from other plants. Experimental details are given in a supplementary paper that has been deposited as Supplementary Publication SUP 50005 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1971), 121, 7. PMID:5131733

  8. Human apolipoprotein C-II: complete nucleic acid sequence of preapolipoprotein C-II.

    OpenAIRE

    Fojo, S S; Law, S W; Brewer, H B

    1984-01-01

    Apolipoprotein (apo) C-II is a cofactor for lipoprotein lipase, the enzyme that catalyzes the hydrolysis of triglycerides on plasma triglyceride-rich lipoproteins. The complete coding sequence of apoC-II mRNA has been determined from an apoC-II clone isolated from a human liver cDNA library. A 17-base-long synthetic oligonucleotide based on amino acid residues 5-10 of apoC-II was utilized as a hybridization probe to select recombinant plasmids containing the apoC-II sequence. Two thousand fou...

  9. Nucleotide and amino acid sequences of human intestinal alkaline phosphatase: close homology to placental alkaline phosphatase

    International Nuclear Information System (INIS)

    A cDNA clone for human adult intestinal alkaline phosphatase (ALP) [orthophosphoric-monoester phosphohydrolase (alkaline optimum); EC 3.1.3.1] was isolated from a λgt11 expression library. The cDNA insert of this clone is 2513 base pairs in length and contains an open reading frame that encodes a 528-amino acid polypeptide. This deduced polypeptide contains the first 40 amino acids of human intestinal ALP, as determined by direct protein sequencing. Intestinal ALP shows 86.5% amino acid identity to placental (type 1) ALP and 56.6% amino acid identity to liver/bone/kidney ALP. In the 3'-untranslated regions, intestinal and placental ALP cDNAs are 73.5% identical (excluding gaps). The evolution of this multigene enzyme family is discussed

  10. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Energy Technology Data Exchange (ETDEWEB)

    Aklujkar, Muktak [University of Massachusetts, Amherst; Krushkal, Julia [University of Texas, Austin; DiBartolo, Genevieve [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Lovley, Derek [University of Massachusetts, Amherst

    2009-01-01

    Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results. The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion. The genomic evidence suggests that metabolism, physiology Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and

  11. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Energy Technology Data Exchange (ETDEWEB)

    Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve; Lapidus, Alla; Land, Miriam L.; Lovley, Derek R.

    2008-12-01

    Background: The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results: The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion: The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.

  12. The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

    Directory of Open Access Journals (Sweden)

    Aklujkar Muktak

    2009-05-01

    Full Text Available Abstract Background The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second putative succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.

  13. Sequence-specific nucleic acid detection from binary pore conductance measurement

    OpenAIRE

    Esfandiari, Leyla; Monbouquette, Harold G.; Jacob J. Schmidt

    2012-01-01

    We describe a platform for sequence-specific nucleic acid (NA) detection utilizing a micropipette tapered to a 2 μm diameter pore and 3 μm diameter polystyrene beads to which uncharged peptide nucleic acid (PNA) probe molecules have been conjugated. As the target NAs hybridize to the complementary PNA-beads, the beads acquire negative charge and become electrophoretically mobile. An applied electric field guides these NA-PNA-beads toward the pipette tip, which they obstruct, leading to an ind...

  14. Self-Sequencing of Amino Acids and Origins of Polyfunctional Protocells

    Science.gov (United States)

    Fox, Sidney W.

    1984-12-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells.

  15. NetTurnP--neural network prediction of beta-turns by use of evolutionary information and predicted protein sequence features.

    Directory of Open Access Journals (Sweden)

    Bent Petersen

    Full Text Available UNLABELLED: β-turns are the most common type of non-repetitive structures, and constitute on average 25% of the amino acids in proteins. The formation of β-turns plays an important role in protein folding, protein stability and molecular recognition processes. In this work we present the neural network method NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC=0.50, Qtotal=82.1%, sensitivity=75.6%, PPV=68.8% and AUC=0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17-0.47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively. CONCLUSION: The NetTurnP method has been implemented as a webserver, which is freely available at http://www.cbs.dtu.dk/services/NetTurnP/. NetTurnP is the only available webserver that allows submission of multiple sequences.

  16. Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

    Directory of Open Access Journals (Sweden)

    Michael Andrew Meyer

    2013-07-01

    Full Text Available The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4 to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3. For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  17. Identification of Amino Acid Sequences with Good Folding Properties in an Off-Lattice Model

    CERN Document Server

    Irbäck, Anders; Potthast, Frank

    2008-01-01

    Folding properties of a two-dimensional toy protein model containing only two amino-acid types, hydrophobic and hydrophilic, respectively, are analyzed. An efficient Monte Carlo procedure is employed to ensure that the ground states are found. The thermodynamic properties are found to be strongly sequence dependent in contrast to the kinetic ones. Hence, criteria for good folders are defined entirely in terms of thermodynamic fluctuations. With these criteria sequence patterns that fold well are isolated. For 300 chains with 20 randomly chosen binary residues approximately 10% meet these criteria. Also, an analysis is performed by means of statistical and artificial neural network methods from which it is concluded that the folding properties can be predicted to a certain degree given the binary numbers characterizing the sequences.

  18. Sequence-selective targeting of duplex DNA by peptide nucleic acids

    DEFF Research Database (Denmark)

    Nielsen, Peter E

    2010-01-01

    Sequence-selective gene targeting constitutes an attractive drug-discovery approach for genetic therapy, with the aim of reducing or enhancing the activity of specific genes at the transcriptional level, or as part of a methodology for targeted gene repair. The pseudopeptide DNA mimic peptide...... nucleic acid (PNA) can recognize duplex DNA with high sequence specificity and affinity in triplex, duplex and double-duplex invasive modes or non-invasive triplex modes. Novel PNA modification has improved the affinity for DNA recognition via duplex invasion, double-duplex invasion and triplex...... recognition considerably. Such modifications have also resulted in new approaches to targeted gene repair and sequence-selective double-strand cleavage of genomic DNA....

  19. Structure of the fully modified left-handed cyclohexene nucleic acid sequence GTGTACAC.

    Science.gov (United States)

    Robeyns, Koen; Herdewijn, Piet; Van Meervelt, Luc

    2008-02-13

    CeNA oligonucleotides consist of a phosphorylated backbone where the deoxyribose sugars are replaced by cyclohexene moieties. The X-ray structure determination and analysis of a fully modified octamer sequence GTGTACAC, which is the first crystal structure of a carbocyclic-based nucleic acid, is presented. This particular sequence was built with left-handed building blocks and crystallizes as a left-handed double helix. The helix can be characterized as belonging to the (mirrored) A-type family. Crystallographic data were processed up to 1.53 A, and the octamer sequence crystallizes in the space group R32. The sugar puckering is found to adopt the 3H2 half-chair conformation which mimics the C3'-endo conformation of the ribose sugar. The double helices stack on top of each other to form continuous helices, and static disorder is observed due to this end-to-end stacking.

  20. Random amino acid mutations and protein misfolding lead to Shannon limit in sequence-structure communication.

    Directory of Open Access Journals (Sweden)

    Andreas Martin Lisewski

    Full Text Available The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close to native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions and in structure (structural defects trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a sensitive to random errors and (b restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials.

  1. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    Science.gov (United States)

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  2. Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

    Directory of Open Access Journals (Sweden)

    Samat Kozhakhmetov

    2014-01-01

    Full Text Available Introduction: Lactobacilli are a bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microoganism, generally regarded as safe (GRAS. It is also a homofermentative L-(+-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of the phenotypic and genotypic diversity. However, the studies of the genus were conducted mostly with the unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of Center for life sciences. After screaning isolates for probiotic properties, a highly active Lactobacillus spp strain was chosen. Genomic DNA was extracted according to the manufacturing protocol (Wizard® Genomic DNA Purification Kit. The Lactobacillus rhamnosus strain was identified as the highly active Lactobacillus strain accoridng to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX platforms. The initial draft assembly was prepared from 14 large contigs (20 all contigs by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT. Conclusion: A full genome-sequencing of selected strains of lactic acid bacteria was made during the study.

  3. CloudAligner: A fast and full-featured MapReduce based tool for sequence mapping

    Directory of Open Access Journals (Sweden)

    Shi Weisong

    2011-06-01

    Full Text Available Abstract Background Research in genetics has developed rapidly recently due to the aid of next generation sequencing (NGS. However, massively-parallel NGS produces enormous amounts of data, which leads to storage, compatibility, scalability, and performance issues. The Cloud Computing and MapReduce framework, which utilizes hundreds or thousands of shared computers to map sequencing reads quickly and efficiently to reference genome sequences, appears to be a very promising solution for these issues. Consequently, it has been adopted by many organizations recently, and the initial results are very promising. However, since these are only initial steps toward this trend, the developed software does not provide adequate primary functions like bisulfite, pair-end mapping, etc., in on-site software such as RMAP or BS Seeker. In addition, existing MapReduce-based applications were not designed to process the long reads produced by the most recent second-generation and third-generation NGS instruments and, therefore, are inefficient. Last, it is difficult for a majority of biologists untrained in programming skills to use these tools because most were developed on Linux with a command line interface. Results To urge the trend of using Cloud technologies in genomics and prepare for advances in second- and third-generation DNA sequencing, we have built a Hadoop MapReduce-based application, CloudAligner, which achieves higher performance, covers most primary features, is more accurate, and has a user-friendly interface. It was also designed to be able to deal with long sequences. The performance gain of CloudAligner over Cloud-based counterparts (35 to 80% mainly comes from the omission of the reduce phase. In comparison to local-based approaches, the performance gain of CloudAligner is from the partition and parallel processing of the huge reference genome as well as the reads. The source code of CloudAligner is available at http

  4. Guanine-Rich Sequences Are a Dominant Feature of Exosomal microRNAs across the Mammalian Species and Cell Types.

    Science.gov (United States)

    Momose, Fumiyasu; Seo, Naohiro; Akahori, Yasushi; Sawada, Shin-Ichi; Harada, Naozumi; Ogura, Toru; Akiyoshi, Kazunari; Shiku, Hiroshi

    2016-01-01

    Exosome is an extracellular vesicle released from multivesicular endosomes and contains micro (mi) RNAs and functional proteins derived from the donor cells. Exosomal miRNAs act as an effector during communication with appropriate recipient cells, this can aid in the utilization of the exosomes in a drug delivery system for various disorders including malignancies. Differences in the miRNA distribution pattern between exosomes and donor cells indicate the active translocation of miRNAs into the exosome cargos in a miRNA sequence-dependent manner, although the molecular mechanism is little known. In this study, we statistically analyzed the miRNA microarray data and revealed that the guanine (G)-rich sequence is a dominant feature of exosome-dominant miRNAs, across the mammalian species-specificity and the cell types. Our results provide important information regarding the potential use of exosome cargos to develop miRNA-based drugs for the treatment of human diseases. PMID:27101102

  5. tRNAfeature: An algorithm for tRNA features to identify tRNA genes in DNA sequences.

    Science.gov (United States)

    Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh

    2016-09-01

    The identification of transfer RNAs (tRNAs) is critical for a detailed understanding of the evolution of biological organisms and viruses. However, some tRNAs are difficult to recognize due to their unusual sub-structures and may result in the detection of the wrong anticodon. Therefore, the detection of unusual sub-structures of tRNA genes remains an important challenge. In this study, we propose a method to identify tRNA genes based on tRNA features. tRNAfeature attempts to refold the sequence with single-stranded regions longer than those found in the canonical and conventional structural models for tRNA. We predicted a set of 53926 archaeal, eubacterial and eukaryotic tRNA genes annotated in tRNADB-CE and scanned the tRNA genes in whole genome sequencing. The results indicate that tRNAfeature is more powerful than other existing methods for identifying tRNAs. PMID:27291467

  6. Amino acid selective unlabeling for sequence specific resonance assignments in proteins

    Energy Technology Data Exchange (ETDEWEB)

    Krishnarjuna, B.; Jaipuria, Garima; Thakur, Anushikha [Indian Institute of Science, NMR Research Centre (India); D' Silva, Patrick, E-mail: patrick@biochem.iisc.ernet.in [Indian Institute of Science, Department of Biochemistry (India); Atreya, Hanudatta S., E-mail: hsatreya@sif.iisc.ernet.in [Indian Institute of Science, NMR Research Centre (India)

    2011-01-15

    Sequence specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective 'unlabeling' or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly {sup 13}C/{sup 15}N labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named, {l_brace}{sup 12}CO{sub i}-{sup 15}N{sub i+1}{r_brace}-filtered HSQC, which aids in linking the {sup 1}H{sup N}/{sup 15}N resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues: i - 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, applicable to {sup 2}H labeled protein and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of {sup 14}N at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence specific resonance assignments and opens up new avenues to using this methodology in protein structural studies.

  7. N-terminal amino acid sequences and some characteristics of fibrinolytic/hemorrhagic metalloproteinases purified from Bothrops jararaca venom.

    Science.gov (United States)

    Maruyama, Masugi; Sugiki, Masahiko; Anai, Keita; Yoshida, Etsuo

    2002-08-01

    We determined the N-terminal amino acid sequences of the fibrinolytic/hemorrhagic metalloproteinases (jararafibrases I, III and IV) purified from Bothrops jararaca venom. The N-terminal amino acid sequences of jararafibrase I and its degradation products were identical to those of jararhagin, another hemorrhagic metalloproteinase purified from the same snake venom. Together with enzymatic and immunological properties, we concluded that those two enzymes are identical. The N-terminal amino acid sequence of jararafibrase III was quite similar to C-type lectin isolated from Crotalus atrox, and the protein had a hemagglutinating activity on intact rat red blood cells. PMID:12165326

  8. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    Science.gov (United States)

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-01

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. PMID:27375218

  9. SigWin-detector: a Grid-enabled workflow for discovering enriched windows of genomic features related to DNA sequences

    Directory of Open Access Journals (Sweden)

    Wibisono Adianto

    2008-08-01

    Full Text Available Abstract Background Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Findings Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic features in a (DNA sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. Conclusion As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the

  10. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    Directory of Open Access Journals (Sweden)

    Yan Koon-Kiu

    2007-11-01

    Full Text Available Abstract Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw" duplication and deletion rates rdup∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsgaKjabbwha1jabbchaWbqaaiabgEHiQaaaaaa@3283@, rdel∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsga

  11. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    Science.gov (United States)

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS)

  12. Development of a SCAR (sequence-characterised amplified region) marker for acid resistance-related gene in Lactobacillus plantarum.

    Science.gov (United States)

    Liu, Shu-Wen; Li, Kai; Yang, Shi-Ling; Tian, Shu-Fen; He, Ling

    2015-03-01

    A sequence characterised amplified region marker was developed to determine an acid resistance-related gene in Lactobacillus plantarum. A random amplified polymorphic DNA marker named S116-680 was reported to be closely related to the acid resistance of the strains. The DNA band corresponding to this marker was cloned and sequenced with the induction of specific designed PCR primers. The results of PCR test helped to amplify a clear specific band of 680 bp in the tested acid-resistant strains. S116-680 marker would be useful to explore the acid-resistant mechanism of L. plantarum and to screen desirable malolactic fermentation strains.

  13. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    Energy Technology Data Exchange (ETDEWEB)

    Croteau, Rodney Bruce (Pullman, WA); Wildung, Mark Raymond (Colfax, WA); Lange, Bernd Markus (Pullman, WA); McCaskill, David G. (Pullman, WA)

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  14. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    Directory of Open Access Journals (Sweden)

    Xiaoyu Wang

    Full Text Available Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  15. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    Science.gov (United States)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  16. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    Directory of Open Access Journals (Sweden)

    A.E. El Hakim

    2015-12-01

    Full Text Available Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including; gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were, 60, 56 kDa and 60, 53 kDa for LAAOI and LAAOII, respectively. The tryptic peptides of the two isoforms share high sequence homology with other snake venom l-amino acid oxidases. The optimal pH and temperature values of Cc-LAAOI and Cc-LAAOII were 7.8, 50 °C and 7, 60 °C, respectively. The two isoenzymes were thermally stable up to 70 °C. The Km and Vmax values were 0.67 mM, 0.135 μmol/min for LAAOI and 0.82 mM, 0.087 μmol/min for LAAOII. Both isoenzymes displayed high catalytic preference to long-chain, hydrophobic and aromatic amino acids. The Mn2+ ion markedly increased the LAAO activity for both purified isoforms, while Na+, K+, Ca2+, Mg2+ and Ba2+ ions showed a non-significant increase in the enzymatic activity of both isoforms. Furthermore, Zn2+, Ni2+, Co2+, Cu2+ and AL3+ ions markedly inhibited the LAAOI and LAAOII activities. l-Cysteine and reduced glutathione completely inhibited the LAAO activity of both isoenzymes, whereas, β-mercaptoethanol, O-phenanthroline and PMSF completely inhibited the enzymatic activity of LAAOII. Furthermore, iodoacitic acid inhibited the enzymatic activity of LAAOII by 46% and had no effect on the LAAOI activity.

  17. Amino acid sequence and variant forms of favin, a lectin from Vicia faba.

    Science.gov (United States)

    Hopp, T P; Hemperly, J J; Cunningham, B A

    1982-04-25

    We have determined the complete amino acid sequence (182 residues) of the beta chain of favin, the glucose-binding lectin from fava beans (Vicia faba), and have established that the carbohydrate moiety is attached to Asn 168. Together with the sequence of the alpha chain previously reported (Hemperly, J. J., Hopp, T. P., Becker, J. W., and Cunningham, B. A. (1979) J. Biol. Chem. 254, 6803-6810), these data complete the analysis of the primary structure of the lectin. We have also examined minor polypeptides that appear in all preparations of favin. Two lower molecular weight species (Mr = 9,500-11,600) appear to be fragments of the beta chain resulting from cleavage following Asn 76, whereas six high molecular weight forms (Mr = 25,000 or greater) appear to include aggregates of the beta chain and possibly some alternative products of chain processing. PMID:7068646

  18. Pathophysiology, clinical features and radiological findings of differentiation syndrome/all-trans-retinoic acid syndrome

    Institute of Scientific and Technical Information of China (English)

    Luciano; Cardinale; Francesco; Asteggiano; Federica; Moretti; Federico; Torre; Stefano; Ulisciani; Carmen; Fava; Giovanna; Rege-Cambrin

    2014-01-01

    In acute promyelocytic leukemia, differentiation thera-py based on all-trans-retinoic acid can be complicated by the development of a differentiation syndrome(DS). DS is a life-threatening complication, characterized by respiratory distress, unexplained fever, weight gain, interstitial lung infiltrates, pleural or pericardial effusions, hypotension and acute renal failure. The diagnosis of DS is made on clinical grounds and has proven to be difficult, because none of the symptoms is pathognomonic for the syndrome without any definitive diagnostic criteria. As DS can have subtle signs and symptoms at presentation but progress rapidly, end-stage DS clinical picture resembles the acute respiratory distress syndrome with extremely poor prognosis; so it is of absolute importance to be conscious of these complications and initiate therapy as soon as it was suspected. The radiologic appearance resembles the typical features of cardiogenic pulmonary edema. Diagnosis of DS remains a great skill for radiologists and haematologist but it is of an utmost importance the cooperation in suspect DS, detect the early signs of DS, examine the patients’ behaviour and rapidly detect the complications.

  19. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    Energy Technology Data Exchange (ETDEWEB)

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  20. Geochemical features and effects on deep-seated fluids during the May-June 2012 southern Po Valley seismic sequence

    Directory of Open Access Journals (Sweden)

    Francesco Italiano

    2012-10-01

    Full Text Available A periodic sampling of the groundwaters and dissolved and free gases in selected deep wells located in the area affected by the May-June 2012 southern Po Valley seismic sequence has provided insight into seismogenic-induced changes of the local aquifer systems. The results obtained show progressive changes in the fluid geochemistry, allowing it to be established that deep-seated fluids were mobilized during the seismic sequence and reached surface layers along faults and fractures, which generated significant geochemical anomalies. The May-June 2012 seismic swarm (mainshock on May 29, 2012, M 5.8; 7 shocks M >5, about 200 events 3 > M > 5 induced several modifications in the circulating fluids. This study reports the preliminary results obtained for the geochemical features of the waters and gases collected over the epicentral area from boreholes drilled at different depths, thus intercepting water and gases with different origins and circulation. The aim of the investigations was to improve our knowledge of the fluids circulating over the seismic area (e.g. origin, provenance, interactions, mixing of different components, temporal changes. This was achieved by collecting samples from both shallow and deep-drilled boreholes, and then, after the selection of the relevant sites, we looked for temporal changes with mid-to-long-term monitoring activity following a constant sampling rate. This allowed us to gain better insight into the relationships between the fluid circulation and the faulting activity. The sampling sites are listed in Table 1, along with the analytical results of the gas phase. […

  1. Complete Genome Sequence of Lactococcus lactis IO-1, a Lactic Acid Bacterium That Utilizes Xylose and Produces High Levels of l-Lactic Acid

    OpenAIRE

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-01-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly l-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  2. The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

    DEFF Research Database (Denmark)

    Behrendt, N; Rønne, E; Ploug, M;

    1990-01-01

    The receptor for human urokinase-type plasminogen activator (u-PA) was purified from phorbol 12-myristate 13-acetate-stimulated U937 cells by temperature-induced phase separation of detergent extracts, followed by affinity chromatography with immobilized diisopropyl fluorophosphate-treated u......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported...... amino-terminal fragment; 2) the identical electrophoretic mobilities observed for cross-linked conjugates, formed between either the purified protein or the u-PA receptor on intact U937 cells and the above ligands; 3) the identity of the apparent molecular weight of the purified protein...

  3. WAViS server for handling, visualization and presentation of multiple alignments of nucleotide or amino acids sequences.

    Science.gov (United States)

    Zika, Radek; Paces, Jan; Pavlícek, Adam; Paces, Václav

    2004-07-01

    Web Alignment Visualization Server contains a set of web-tools designed for quick generation of publication-quality color figures of multiple alignments of nucleotide or amino acids sequences. It can be used for identification of conserved regions and gaps within many sequences using only common web browsers. The server is accessible at http://wavis.img.cas.cz.

  4. WAViS server for handling, visualization and presentation of multiple alignments of nucleotide or amino acids sequences

    OpenAIRE

    Zika, Radek; Pačes, Jan; Pavlíček, Adam; Pačes, Václav

    2004-01-01

    Web Alignment Visualization Server contains a set of web-tools designed for quick generation of publication-quality color figures of multiple alignments of nucleotide or amino acids sequences. It can be used for identification of conserved regions and gaps within many sequences using only common web browsers. The server is accessible at http://wavis.img.cas.cz.

  5. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    OpenAIRE

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of cons...

  6. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Directory of Open Access Journals (Sweden)

    James B Howard

    Full Text Available Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification

  7. Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

    DEFF Research Database (Denmark)

    Radova, A.; Sebela, M.; Galuszka, P.;

    2001-01-01

    was further confirmed by measuring the fluorescence spectra, Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars: its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...

  8. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    Science.gov (United States)

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains to oxidize effectively d-glucose. PMID:27365351

  9. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    International Nuclear Information System (INIS)

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species

  10. Complete amino acid sequence of the myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani.

    Science.gov (United States)

    Jones, B N; Wang, C C; Dwulet, F E; Lehman, L D; Meuth, J L; Bogardt, R A; Gurd, F R

    1979-04-25

    The complete amino acid sequence of the major component myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani, was determined by the automated Edman degradation of several large peptides obtained by specific cleavage of the protein. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. By subjecting four of these peptides and the apomyoglobin to automated Edman degradation, over 80% of the primary structure of the protein was obtained. The remainder of the covalent structure was determined by the sequence analysis of peptides that resulted from further digestion of the central cyanogen bromide fragment. This fragment was cleaved at its glutamyl residues with staphylococcal protease and its lysyl residues with trypsin. The action of trypsin was restricted to the lysyl residues by chemical modification of the single arginyl residue of the fragment with 1,2-cyclohexanedione. The primary structure of this myoglobin proved to be identical with that from the Atlantic bottlenosed dolphin and Pacific common dolphin but differs from the myoglobins of the killer whale and pilot whale at two positions. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea. PMID:454657

  11. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE Consensus Sequence Repeats in the Viral Genome.

    Directory of Open Access Journals (Sweden)

    Ashutosh Kumar

    2016-08-01

    Full Text Available Owing to the reports of microcephaly as a consistent outcome in the foetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV - microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favour of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1 sequence homology between ZIKV genome and the response element of an early neural tube developmental marker ‘retinoic acid’ in human DNA and (2 comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE consensus sequence (5′–AGGTCA–3′ in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly and other viruses available in National Institute of Health genetic sequence database (GenBank for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause foetal brain defects (for which maternal-foetal transmission during developing stage may be required. The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although bioinformatic

  12. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and

  13. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    Energy Technology Data Exchange (ETDEWEB)

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L. (Univ. of Massachusetts Medical School, Worcester (USA))

    1988-06-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity.

  14. Sequence stratigraphic features of the Middle Permian Maokou Formation in the Sichuan Basin and their controls on source rocks and reservoirs

    Directory of Open Access Journals (Sweden)

    Wang Su

    2015-11-01

    Full Text Available Well Shuangyushi 1 and Well Nanchong l deployed in the NW and central Sichuan Basin have obtained a high-yield industrial gas flow in the dolomite and karst reservoirs of the Middle Permian Maokou Formation, showing good exploration prospects of the Maokou Formation. In order to identify the sequence stratigraphic features of the Maokou Formation, its sequence stratigraphy was divided and a unified sequence stratigraphic framework applicable for the entire basin was established to analyze the stratigraphic denudation features within the sequence framework by using the spectral curve trend attribute analysis, together with drilling and outcrop data. On this basis, the controls of sequence on source rocks and reservoirs were analyzed. In particular, the Maokou Formation was divided into two third-order sequences – SQ1 and SQ2. SQ1 was composed of members Mao 1 Member and Mao 3, while SQ2 was composed of Mao 4 Member. Sequence stratigraphic correlation indicated that the Maokou Formation within the basin had experienced erosion to varying extent, forming “three intense and two weak” denuded regions, among which, the upper part of SQ2 was slightly denuded in the two weak denuded regions (SW Sichuan Basin and locally Eastern Sichuan Basin, while SQ2 was denuded out in the three intense denuded regions (Southern Sichuan Basin–Central Sichuan Basin, NE and NW Sichuan Basin. The development of source rocks and reservoirs within sequence stratigraphic framework was significantly affected by sequence boundary; the grain banks that can form effective reservoir were predominately distributed in SQ1 highstand systems tract (HST, while effective source rocks were predominately distributed in SQ1 transgressive system tract (TST. It is concluded that the sequence division method is objective and reasonable, which can effectively guide oil and gas exploration in this region.

  15. Geometric Feature-Based Facial Expression Recognition in Image Sequences Using Multi-Class AdaBoost and Support Vector Machines

    Directory of Open Access Journals (Sweden)

    Joonwhoan Lee

    2013-06-01

    Full Text Available Facial expressions are widely used in the behavioral interpretation of emotions, cognitive science, and social interactions. In this paper, we present a novel method for fully automatic facial expression recognition in facial image sequences. As the facial expression evolves over time facial landmarks are automatically tracked in consecutive video frames, using displacements based on elastic bunch graph matching displacement estimation. Feature vectors from individual landmarks, as well as pairs of landmarks tracking results are extracted, and normalized, with respect to the first frame in the sequence. The prototypical expression sequence for each class of facial expression is formed, by taking the median of the landmark tracking results from the training facial expression sequences. Multi-class AdaBoost with dynamic time warping similarity distance between the feature vector of input facial expression and prototypical facial expression, is used as a weak classifier to select the subset of discriminative feature vectors. Finally, two methods for facial expression recognition are presented, either by using multi-class AdaBoost with dynamic time warping, or by using support vector machine on the boosted feature vectors. The results on the Cohn-Kanade (CK+ facial expression database show a recognition accuracy of 95.17% and 97.35% using multi-class AdaBoost and support vector machines, respectively.

  16. Self-organizing maps: A tool to ascertain taxonomic relatedness based on features derived from 16S rDNA sequence

    Indian Academy of Sciences (India)

    D V Raje; H J Purohit; Y P Badhe; S S Tambe; B D Kulkarni

    2010-12-01

    Exploitation of microbial wealth, of which almost 95% or more is still unexplored, is a growing need. The taxonomic placements of a new isolate based on phenotypic characteristics are now being supported by information preserved in the 16S rRNA gene. However, the analysis of 16S rDNA sequences retrieved from metagenome, by the available bioinformatics tools, is subject to limitations. In this study, the occurrences of nucleotide features in 16S rDNA sequences have been used to ascertain the taxonomic placement of organisms. The tetra- and penta-nucleotide features were extracted from the training data set of the 16S rDNA sequence, and was subjected to an artificial neural network (ANN) based tool known as self-organizing map (SOM), which helped in visualization of unsupervised classification. For selection of significant features, principal component analysis (PCA) or curvilinear component analysis (CCA) was applied. The SOM along with these techniques could discriminate the sample sequences with more than 90% accuracy, highlighting the relevance of features. To ascertain the confidence level in the developed classification approach, the test data set was specifically evaluated for Thiobacillus, with Acidiphilium, Paracocus and Starkeya, which are taxonomically reassigned. The evaluation proved the excellent generalization capability of the developed tool. The topology of genera in SOM supported the conventional chemo-biochemical classification reported in the Bergey manual.

  17. Amino acid sequences of neuropeptides in the sinus gland of the land crab Cardisoma carnifex: a novel neuropeptide proteolysis site.

    Science.gov (United States)

    Newcomb, R W

    1987-08-01

    The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly- Ser-Ala-Glu- Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.

  18. Enzyme-free translation of DNA into sequence-defined synthetic polymers structurally unrelated to nucleic acids

    Science.gov (United States)

    Niu, Jia; Hili, Ryan; Liu, David R.

    2013-04-01

    The translation of DNA sequences into corresponding biopolymers enables the production, function and evolution of the macromolecules of life. In contrast, methods to generate sequence-defined synthetic polymers with similar levels of control have remained elusive. Here, we report the development of a DNA-templated translation system that enables the enzyme-free translation of DNA templates into sequence-defined synthetic polymers that have no necessary structural relationship with nucleic acids. We demonstrate the efficiency, sequence-specificity and generality of this translation system by oligomerizing building blocks including polyethylene glycol, α-(D)-peptides, and β-peptides in a DNA-programmed manner. Sequence-defined synthetic polymers with molecular weights of 26 kDa containing 16 consecutively coupled building blocks and 90 densely functionalized β-amino acid residues were translated from DNA templates using this strategy. We integrated the DNA-templated translation system developed here into a complete cycle of translation, coding sequence replication, template regeneration and re-translation suitable for the iterated in vitro selection of functional sequence-defined synthetic polymers unrelated in structure to nucleic acids.

  19. Repetitive sequence based polymerase chain reaction to differentiate close bacteria strains in acidic sites

    Institute of Scientific and Technical Information of China (English)

    XIE Ming; YIN Hua-qun; LIU Yi; LIU Jie; LIU Xue-duan

    2008-01-01

    To study the diversity of bacteria strains newly isolated from several acid mine drainage(AMD) sites in China,repetitive sequence based polymerase chain reaction (rep-PCR),a well established technology for diversity analysis of closely related bacteria strains,was conducted on 30 strains of bacteria Leptospirillum ferriphilium,8 strains of bacteria Acidithiobacillus ferrooxidans,as well as the Acidithiobacillus ferrooxidans type strain ATCC (American Type Culture Collection) 23270.The results showed that,using ERIC and BOX primer sets,rep-PCR produced highly discriminatory banding patterns.Phylogenetic analysis based on ERIC-PCR banding types was made and the results indicated that rep-PCR could be used as a rapid and highly discriminatory screening technique in studying bacterial diversity,especially in differentiating bacteria within one species in AMD.

  20. Trypsin inhibitors from ridged gourd (Luffa acutangula Linn.) seeds: purification, properties, and amino acid sequences.

    Science.gov (United States)

    Haldar, U C; Saha, S K; Beavis, R C; Sinha, N K

    1996-02-01

    Two trypsin inhibitors, LA-1 and LA-2, have been isolated from ridged gourd (Luffa acutangula Linn.) seeds and purified to homogeneity by gel filtration followed by ion-exchange chromatography. The isoelectric point is at pH 4.55 for LA-1 and at pH 5.85 for LA-2. The Stokes radius of each inhibitor is 11.4 A. The fluorescence emission spectrum of each inhibitor is similar to that of the free tyrosine. The biomolecular rate constant of acrylamide quenching is 1.0 x 10(9) M-1 sec-1 for LA-1 and 0.8 x 10(9) M-1 sec-1 for LA-2 and that of K2HPO4 quenching is 1.6 x 10(11) M-1 sec-1 for LA-1 and 1.2 x 10(11) M-1 sec-1 for LA-2. Analysis of the circular dichroic spectra yields 40% alpha-helix and 60% beta-turn for La-1 and 45% alpha-helix and 55% beta-turn for LA-2. Inhibitors LA-1 and LA-2 consist of 28 and 29 amino acid residues, respectively. They lack threonine, alanine, valine, and tryptophan. Both inhibitors strongly inhibit trypsin by forming enzyme-inhibitor complexes at a molar ratio of unity. A chemical modification study suggests the involvement of arginine of LA-1 and lysine of LA-2 in their reactive sites. The inhibitors are very similar in their amino acid sequences, and show sequence homology with other squash family inhibitors. PMID:8924202

  1. Canine amino acid transport system Xc(-): cDNA sequence, distribution and cystine transport activity in lens epithelial cells.

    Science.gov (United States)

    Maruo, Takuya; Kanemaki, Nobuyuki; Onda, Ken; Sato, Reiichiro; Ichihara, Nobuteru; Ochiai, Hideharu

    2014-04-01

    The cystine transport activity of a lens epithelial cell line originated from a canine mature cataract was investigated. The distinct cystine transport activity was observed, which was inhibited to 28% by extracellular 1 mM glutamate. The cDNA sequences of canine cysteine/glutamate exchanger (xCT) and 4F2hc were determined. The predicted amino acid sequences were 527 and 533 amino acid polypeptides, respectively. The amino acid sequences of canine xCT and 4F2hc showed high similarities (>80%) to those of humans. The expression of xCT in lens epithelial cell line was confirmed by western blot analysis. RT-PCR analysis revealed high level expression only in the brain, and it was below the detectable level in other tissues.

  2. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States); Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  3. PSNO: Predicting Cysteine S-Nitrosylation Sites by Incorporating Various Sequence-Derived Features into the General Form of Chou’s PseAAC

    Directory of Open Access Journals (Sweden)

    Jian Zhang

    2014-06-01

    Full Text Available S-nitrosylation (SNO is one of the most universal reversible post-translational modifications involved in many biological processes. Malfunction or dysregulation of SNO leads to a series of severe diseases, such as developmental abnormalities and various diseases. Therefore, the identification of SNO sites (SNOs provides insights into disease progression and drug development. In this paper, a new bioinformatics tool, named PSNO, is proposed to identify SNOs from protein sequences. Firstly, we explore various promising sequence-derived discriminative features, including the evolutionary profile, the predicted secondary structure and the physicochemical properties. Secondly, rather than simply combining the features, which may bring about information redundancy and unwanted noise, we use the relative entropy selection and incremental feature selection approach to select the optimal feature subsets. Thirdly, we train our model by the technique of the k-nearest neighbor algorithm. Using both informative features and an elaborate feature selection scheme, our method, PSNO, achieves good prediction performance with a mean Mathews correlation coefficient (MCC value of about 0.5119 on the training dataset using 10-fold cross-validation. These results indicate that PSNO can be used as a competitive predictor among the state-of-the-art SNOs prediction tools. A web-server, named PSNO, which implements the proposed method, is freely available at http://59.73.198.144:8088/PSNO/.

  4. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    Science.gov (United States)

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using set of 0.8L SBRs using gelatinized PPW at a solids content range from 30 to 50 g L(-1), solids retention time of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. PMID:25708409

  5. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Indian Academy of Sciences (India)

    Meng-Jun Li; Ai-Qin Li; Han Xia; Chuan-Zhi Zhao; Chang-Sheng Li; Shu-Bo Wan; Yu-Ping Bi; Xing-Jun Wang

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, -ketoacyl-ACP synthase (I, II, III), -ketoacyl-ACP reductase, -hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  6. Application of peptide nucleic acids containing azobenzene self-assembled electrochemical biosensors in detecting DNA sequences

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Hybridization of peptide nucleic acids probe containing azobenzene (NH2-TNT4, N-PNAs) with DNA was performed by covalently immobilizing of NH2-TNT4 in sequence on the 3-mercaptopropionic acid self-assembled monolayer modified gold electrode with the helps of N-(3-dimethylaminopropy1)-N’-ethylcarbodiimide hydrochloride (EDC) and N-hydroxysuccinimide (NHS), and the hybrid was coded as N-PNAs/DNA. Using [Fe(CN)6]4-/3- (1:1) as the electrochemical indicator, the electrochemical properties of the N-PNAs self-assembled monolayer (N-PNAs-SAMs) and N-PNAs/DNA hybridization system under the conditions of before and after UV light irradiation were characterized with cyclic voltammetry (CV), differential pulse voltammetry (DPV), and electrochemical impedance spectra (EIS). Results showed that the redox currents decreased with the increase of irradiation time, suggesting that the ability of the charge transfer on the electrode surface was weakened and the conformation of hybrid system had been changed, and the control of PNAs/DNA hybridization could be realized by UV light irradiation.

  7. Site-directed gene mutation at mixed sequence targets by psoralen-conjugated pseudo-complementary peptide nucleic acids

    OpenAIRE

    Kim, Ki-Hyun; Nielsen, Peter E.; Glazer, Peter M.

    2007-01-01

    Sequence-specific DNA-binding molecules such as triple helix-forming oligonucleotides (TFOs) provide a means for inducing site-specific mutagenesis and recombination at chromosomal sites in mammalian cells. However, the utility of TFOs is limited by the requirement for homopurine stretches in the target duplex DNA. Here, we report the use of pseudo-complementary peptide nucleic acids (pcPNAs) for intracellular gene targeting at mixed sequence sites. Due to steric hindrance, pcPNAs are unable ...

  8. Nucleotide and amino acid sequence coding for polypeptides of foot-and-mouth disease virus type A12.

    OpenAIRE

    Robertson, B H; Grubman, M J; Weddell, G N; Moore, D.M.; Welsh, J D; Fischer, T.; Dowbenko, D J; Yansura, D G; Small, B.; Kleid, D G

    1985-01-01

    The coding region for the structural and nonstructural polypeptides of the type A12 foot-and-mouth disease virus genome has been identified by nucleotide sequencing of cloned DNA derived from the viral RNA. In addition, 704 nucleotides in the 5' untranslated region between the polycytidylic acid tract and the probable initiation codon of the first translated gene, P16-L, have been sequenced. This region has several potential initiation codons, one of which appears to be a low-frequency altern...

  9. Identification of Protein-Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information.

    Science.gov (United States)

    Ding, Yijie; Tang, Jijun; Guo, Fei

    2016-01-01

    Identification of protein-protein interactions (PPIs) is a difficult and important problem in biology. Since experimental methods for predicting PPIs are both expensive and time-consuming, many computational methods have been developed to predict PPIs and interaction networks, which can be used to complement experimental approaches. However, these methods have limitations to overcome. They need a large number of homology proteins or literature to be applied in their method. In this paper, we propose a novel matrix-based protein sequence representation approach to predict PPIs, using an ensemble learning method for classification. We construct the matrix of Amino Acid Contact (AAC), based on the statistical analysis of residue-pairing frequencies in a database of 6323 protein-protein complexes. We first represent the protein sequence as a Substitution Matrix Representation (SMR) matrix. Then, the feature vector is extracted by applying algorithms of Histogram of Oriented Gradient (HOG) and Singular Value Decomposition (SVD) on the SMR matrix. Finally, we feed the feature vector into a Random Forest (RF) for judging interaction pairs and non-interaction pairs. Our method is applied to several PPI datasets to evaluate its performance. On the S . c e r e v i s i a e dataset, our method achieves 94 . 83 % accuracy and 92 . 40 % sensitivity. Compared with existing methods, and the accuracy of our method is increased by 0 . 11 percentage points. On the H . p y l o r i dataset, our method achieves 89 . 06 % accuracy and 88 . 15 % sensitivity, the accuracy of our method is increased by 0 . 76 % . On the H u m a n PPI dataset, our method achieves 97 . 60 % accuracy and 96 . 37 % sensitivity, and the accuracy of our method is increased by 1 . 30 % . In addition, we test our method on a very important PPI network, and it achieves 92 . 71 % accuracy. In the Wnt-related network, the accuracy of our method is increased by 16 . 67 % . The source code and all datasets are available

  10. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    Science.gov (United States)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  11. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    International Nuclear Information System (INIS)

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution

  12. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    Energy Technology Data Exchange (ETDEWEB)

    Chang, Soo-Ik (Harvard Medical School, Boston, MA (USA)); Hammes, G.G. (Univ. of California, Santa Barbara (USA))

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the {beta}-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.

  13. Insights into Protein Sequence and Structure-Derived Features Mediating 3D Domain Swapping Mechanism using Support Vector Machine Based Approach

    Directory of Open Access Journals (Sweden)

    Khader Shameer

    2010-06-01

    Full Text Available 3-dimensional domain swapping is a mechanism where two or more protein molecules form higher order oligomers by exchanging identical or similar subunits. Recently, this phenomenon has received much attention in the context of prions and neuro-degenerative diseases, due to its role in the functional regulation, formation of higher oligomers, protein misfolding, aggregation etc. While 3-dimensional domain swap mechanism can be detected from three-dimensional structures, it remains a formidable challenge to derive common sequence or structural patterns from proteins involved in swapping. We have developed a SVM-based classifier to predict domain swapping events using a set of features derived from sequence and structural data. The SVM classifier was trained on features derived from 150 proteins reported to be involved in 3D domain swapping and 150 proteins not known to be involved in swapped conformation or related to proteins involved in swapping phenomenon. The testing was performed using 63 proteins from the positive dataset and 63 proteins from the negative dataset. We obtained 76.33% accuracy from training and 73.81% accuracy from testing. Due to high diversity in the sequence, structure and functions of proteins involved in domain swapping, availability of such an algorithm to predict swapping events from sequence and structure-derived features will be an initial step towards identification of more putative proteins that may be involved in swapping or proteins involved in deposition disease. Further, the top features emerging in our feature selection method may be analysed further to understand their roles in the mechanism of domain swapping.

  14. The complete amino acid sequence of a mannose-binding lectin from "Kidachi Aloe" (Aloe arborescens Miller var. natalensis Berger).

    Science.gov (United States)

    Koike, T; Titani, K; Suzuki, M; Beppu, H; Kuzuya, H; Maruta, K; Shimpo, K; Fujita, K

    1995-09-01

    The complete amino acid sequence of a mannose-binding lectin purified from the leaf skin of "Kidachi Aloe" (Aloe arborescens Miller var. natalensis Berger) is presented. The 109-residue sequence of the subunit was determined by analysis of peptides of the intact or S-pyridylethylated protein generated by digestion with cyanogen bromide, BNPS-skatole, Achromobacter protease I, or trypsin. The subunit contains an intrachain disulfide bridge. The sequence is highly homologous to that of a mannose-binding lectin from snowdrop bulb. PMID:7669035

  15. An Matching Method for Vehicle-borne Panoramic Image Sequence Based on Adaptive Structure from Motion Feature

    Directory of Open Access Journals (Sweden)

    ZHANG Zhengpeng

    2015-10-01

    Full Text Available Panoramic image matching method with the constraint condition of local structure from motion similarity feature is an important method, the process requires multivariable kernel density estimations for the structure from motion feature used nonparametric mean shift. Proper selection of the kernel bandwidth is a critical step for convergence speed and accuracy of matching method. Variable bandwidth with adaptive structure from motion feature for panoramic image matching method has been proposed in this work. First the bandwidth matrix is defined using the locally adaptive spatial structure of the sampling point in spatial domain and optical flow domain. The relaxation diffusion process of structure from motion similarity feature is described by distance weighting method of local optical flow feature vector. Then the expression form of adaptive multivariate kernel density function is given out, and discusses the solution of the mean shift vector, termination conditions, and the seed point selection method. The final fusions of multi-scale SIFT the features and structure features to establish a unified panoramic image matching framework. The sphere panoramic images from vehicle-borne mobile measurement system are chosen such that a comparison analysis between fixed bandwidth and adaptive bandwidth is carried out in detail. The results show that adaptive bandwidth is good for case with the inlier ratio changes and the object space scale changes. The proposed method can realize the adaptive similarity measure of structure from motion feature, improves the correct matching points and matching rate, experimental results have shown our method to be robust.

  16. Predicting the expression of recombinant monoclonal antibodies in Chinese hamster ovary cells based on sequence features of the CDR3 domain.

    Science.gov (United States)

    Pybus, Leon P; James, David C; Dean, Greg; Slidel, Tim; Hardman, Colin; Smith, Andrew; Daramola, Olalekan; Field, Ray

    2014-01-01

    Despite the development of high-titer bioprocesses capable of producing >10 g L(-1) of recombinant monoclonal antibody (MAb), some so called "difficult-to-express" (DTE) MAbs only reach much lower process titers. For widely utilized "platform" processes the only discrete variable is the protein coding sequence of the recombinant product. However, there has been little systematic study to identify the sequence parameters that affect expression. This information is vital, as it would allow us to rationally design genetic sequence and engineering strategies for optimal bioprocessing. We have therefore developed a new computational tool that enables prediction of MAb titer in Chinese hamster ovary (CHO) cells based on the recombinant coding sequence of the expressed MAb. Model construction utilized a panel of MAbs, which following a 10-day fed-batch transient production process varied in titer 5.6-fold, allowing analysis of the sequence features that impact expression over a range of high and low MAb productivity. The model identified 18 light chain (LC)-specific sequence features within complementarity determining region 3 (CDR3) capable of predicting MAb titer with a root mean square error of 0.585 relative expression units. Furthermore, we identify that CDR3 variation influences the rate of LC-HC dimerization during MAb synthesis, which could be exploited to improve the production of DTE MAb variants via increasing the transfected LC:HC gene ratio. Taken together these data suggest that engineering intervention strategies to improve the expression of DTE recombinant products can be rationally implemented based on an identification of the sequence motifs that render a recombinant product DTE.

  17. Hydroxycinnamic acid bound arabinoxylans from millet brans-structural features and antioxidant activity.

    Science.gov (United States)

    Bijalwan, Vandana; Ali, Usman; Kesarwani, Atul Kumar; Yadav, Kamalendra; Mazumder, Koushik

    2016-07-01

    Hydroxycinnamic acid bound arabinoxylans (HCA-AXs) were extracted from brans of five Indian millet varieties and response surface methodology was used to optimize the extraction conditions. The optimal condition to obtain highest yield of millet HCA-AXs was determined as follows: time 61min, temperature 66°C, ratio of solvent to sample 12ml/g. Linkage analysis indicated that hydroxycinnamic acid bound arabinoxylan from kodo millet (KM-HCA-AX) contained comparatively low branched arabinoxylan consisting of 14.6% mono-substituted, 1.2% di-substituted and 41.2% un-substituted Xylp residues. The HPLC analysis of millet HCA-AXs showed significant variation in the content of three major bound hydroxycinnamic acids (caffeic, p-coumaric and ferulic acid). The antioxidant activity of millet HCA-AXs were evaluated using three in vitro assay methods (DPPH, FRAP and β-carotene linoleate emulsion assays) which suggested both phenolic acid composition and structural characteristics of arabinoxylans could be correlated to their antioxidant potential, the detailed structural analysis revealed that low substituted KM-HCA-AX exhibited relatively higher antioxidant activity compared to other medium and highly substituted HCA-AXs from finger (FM), proso (PM), barnyard (BM) and foxtail (FOXM) millet. PMID:27050114

  18. The developmental transcriptome landscape of bovine skeletal muscle defined by Ribo-Zero ribonucleic acid sequencing.

    Science.gov (United States)

    Sun, X; Li, M; Sun, Y; Cai, H; Li, R; Wei, X; Lan, X; Huang, Y; Lei, C; Chen, H

    2015-12-01

    Ribonucleic acid sequencing (RNA-Seq) libraries are normally prepared with oligo(dT) selection of poly(A)+ mRNA, but it depends on intact total RNA samples. Recent studies have described Ribo-Zero technology, a novel method that can capture both poly(A)+ and poly(A)- transcripts from intact or fragmented RNA samples. We report here the first application of Ribo-Zero RNA-Seq for the analysis of the bovine embryonic, neonatal, and adult skeletal muscle whole transcriptome at an unprecedented depth. Overall, 19,893 genes were found to be expressed, with a high correlation of expression levels between the calf and the adult. Hundreds of genes were found to be highly expressed in the embryo and decreased at least 10-fold after birth, indicating their potential roles in embryonic muscle development. In addition, we present for the first time the analysis of global transcript isoform discovery in bovine skeletal muscle and identified 36,694 transcript isoforms. Transcriptomic data were also analyzed to unravel sequence variations; 185,036 putative SNP and 12,428 putative short insertions-deletions (InDel) were detected. Specifically, many stop-gain, stop-loss, and frameshift mutations were identified that probably change the relative protein production and sequentially affect the gene function. Notably, the numbers of stage-specific transcripts, alternative splicing events, SNP, and InDel were greater in the embryo than in the calf and the adult, suggesting that gene expression is most active in the embryo. The resulting view of the transcriptome at a single-base resolution greatly enhances the comprehensive transcript catalog and uncovers the global trends in gene expression during bovine skeletal muscle development. PMID:26641174

  19. Geometric Feature-Based Facial Expression Recognition in Image Sequences Using Multi-Class AdaBoost and Support Vector Machines

    OpenAIRE

    Joonwhoan Lee; Deepak Ghimire

    2013-01-01

    Facial expressions are widely used in the behavioral interpretation of emotions, cognitive science, and social interactions. In this paper, we present a novel method for fully automatic facial expression recognition in facial image sequences. As the facial expression evolves over time facial landmarks are automatically tracked in consecutive video frames, using displacements based on elastic bunch graph matching displacement estimation. Feature vectors from individual landmarks, as well as pa...

  20. Clinical and biochemical features of aromatic L-amino acid decarboxylase deficiency.

    NARCIS (Netherlands)

    Brun, L.; Ngu, L.H.; Keng, W.T.; Ch'ng, G.S.; Choy, Y.S.; Hwu, W.L.; Lee, W.T.; Willemsen, M.A.A.P.; Verbeek, M.M.; Wassenberg, T.; Regal, L.; Orcesi, S.; Tonduti, D.; Accorsi, P.; Testard, H.; Abdenur, J.E.; Tay, S.; Allen, G.F.; Heales, S.; Kern, I.; Kato, M.; Burlina, A.; Manegold, C.; Hoffmann, G.F.; Blau, N.

    2010-01-01

    OBJECTIVE: To describe the current treatment; clinical, biochemical, and molecular findings; and clinical follow-up of patients with aromatic l-amino acid decarboxylase (AADC) deficiency. METHOD: Clinical and biochemical data of 78 patients with AADC deficiency were tabulated in a database of pediat

  1. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    Science.gov (United States)

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  2. Genome Sequence of Corynebacterium glutamicum ATCC 14067, Which Provides Insight into Amino Acid Biosynthesis in Coryneform Bacteria

    OpenAIRE

    Lv, Yangyong; Liao, Juanjun; Wu, Zhanhong; Han, Shuangyan; Lin, Ying; Zheng, Suiping

    2012-01-01

    We report the genome sequence of Corynebacterium glutamicum ATCC 14067 (once named Brevibacterium flavum), which is useful for taxonomy research and further molecular breeding in amino acid production. Preliminary comparison with those of the reported coryneform strains revealed some notable differences that might be related to the difficulties in molecular manipulation.

  3. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    Science.gov (United States)

    Meneghel, Julie; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  4. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  5. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  6. N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

    OpenAIRE

    Kuhn, H; Fietzek, P P; Lampen, J. O.

    1982-01-01

    The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.

  7. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing.

    Science.gov (United States)

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G; Baylis, Sally A

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  8. Amino acid sequence of Coprinus macrorhizus peroxidase and cDNA sequence encoding Coprinus cinereus peroxidase. A new family of fungal peroxidases.

    Science.gov (United States)

    Baunsgaard, L; Dalbøge, H; Houen, G; Rasmussen, E M; Welinder, K G

    1993-04-01

    Sequence analysis and cDNA cloning of Coprinus peroxidase (CIP) were undertaken to expand the understanding of the relationships of structure, function and molecular genetics of the secretory heme peroxidases from fungi and plants. Amino acid sequencing of Coprinus macrorhizus peroxidase, and cDNA sequencing of Coprinus cinereus peroxidase showed that the mature proteins are identical in amino acid sequence, 343 residues in size and preceded by a 20-residue signal peptide. Their likely identity to peroxidase from Arthromyces ramosus is discussed. CIP has an 8-residue, glycine-rich N-terminal extension blocked with a pyroglutamate residue which is absent in other fungal peroxidases. The presence of pyroglutamate, formed by cyclization of glutamine, and the finding of a minor fraction of a variant form lacking the N-terminal residue, indicate that signal peptidase cleavage is followed by further enzymic processing. CIP is 40-45% identical in amino-acid sequence to 11 lignin peroxidases from four fungal species, and 42-43% identical to the two known Mn-peroxidases. Like these white-rot fungal peroxidases, CIP has an additional segment of approximately 40 residues at the C-terminus which is absent in plant peroxidases. Although CIP is much more similar to horseradish peroxidase (HRP C) in substrate specificity, specific activity and pH optimum than to white-rot fungal peroxidases, the sequences of CIP and HRP C showed only 18% identity. Hence, CIP qualifies as the first member of a new family of fungal peroxidases. The nine invariant residues present in all plant, fungal and bacterial heme peroxidases are also found in CIP. The present data support the hypothesis that only one chromosomal CIP gene exists. In contrast, a large number of secretory plant and fungal peroxidases are expressed from several peroxidase gene clusters. Analyses of three batches of CIP protein and of 49 CIP clones revealed the existence of only two highly similar alleles indicating less

  9. Pathophysiology, clinical features and radiological findings of differentiation syndrome/all-trans-retinoic acid syndrome

    OpenAIRE

    Cardinale, Luciano; Asteggiano, Francesco; Moretti, Federica; Torre, Federico; Ulisciani, Stefano; Fava, Carmen; Rege-Cambrin, Giovanna

    2014-01-01

    In acute promyelocytic leukemia, differentiation therapy based on all-trans-retinoic acid can be complicated by the development of a differentiation syndrome (DS). DS is a life-threatening complication, characterized by respiratory distress, unexplained fever, weight gain, interstitial lung infiltrates, pleural or pericardial effusions, hypotension and acute renal failure. The diagnosis of DS is made on clinical grounds and has proven to be difficult, because none of the symptoms is pathognom...

  10. A Unique Sequence of Financial Accounting Courses Featuring Team Teaching, Linked Courses, Challenging Assignments, and Instruments for Evaluation and Assessment

    Science.gov (United States)

    Lundblad, Heidemarie; Wilson, Barbara A.

    2008-01-01

    The Department of Accounting at California State University Northridge (CSUN) has developed a unique sequence of courses designed to ensure that accounting students are trained not only in technical accounting, but also acquire critical thinking, research and communication skills. The courses have proven effective and have embedded assessment…

  11. Features of the Arabidopsis recombination landscape resulting from the combined loss of sequence variation and DNA methylation

    NARCIS (Netherlands)

    Colomé Tatché, Maria; Cortijo, Sandra; Wardenaar, Rene; Monteiro Morgado, Lionel; Lahouze, Benoit; Sarazin, Alexis; Etcheverry, Mathilde; Martin, Antoine; Feng, Suhua; Duvernois-Berthet, Evelyne; Labadie, Karine; Wincker, Patrick; Jacobsen, Steven E.; Jansen, Ritsert C.; Colot, Vincent; Johannes, Frank

    2012-01-01

    The rate of meiotic crossing over (CO) varies considerably along chromosomes, leading to marked distortions between physical and genetic distances. The causes underlying this variation are being unraveled, and DNA sequence and chromatin states have emerged as key factors. However, the extent to whic

  12. Origin of Molar-Tooth Structure Based on Sequence-Stratigraphic Position and Macroscopic Features:Example from Mesoproterozoic Gaoyuzhuang Formation at Jixian Section, Tianjin, North China

    Institute of Scientific and Technical Information of China (English)

    Mei Mingxiang

    2006-01-01

    Both the macroscopic feature and the sequence-stratigraphic position of the molar-tooth structure developed in the third member of the Gaoyuzhuang (高于庄) Formation at the Jixian (蓟县)Section in Tianjin (天津) can provide some useful information about its origin and can reveal some problems to be further researched in the future. The Mesoproterozoic Gaoyuzhuang Formation is a set of ~ 1 600 m thick carbonate strata. This formation can be divided into four members. The first member is mainly made up of stromatolitic dolomites; the second is marked by a set of manganese dolomites; the third is mainly composed of lamina limestones with the development of molar-tooth strcutures; the fourth is a set of stromatolitic-lithoherm dolomites. According to lithofacies and its succession, several types of meter-scale cycles can be discerned in the Gaoyuzhuang Formation: the L-M type, the subtidal type and the peritidal type. There is a regularly vertical stacking pattern for meter-scale cycles in the third-order sequence. Therefore, the Mesoproterozoic Gaoyuzhuang Formation can be divided into 13 third-order sequences (SQ1 to SQ13 ) and can further be grouped into 4 second-order sequences. The third member is marked by lamina limestones and can be grouped into three third-order sequences (SQ9 to SQ11 ). The molar-tooth structure is developed in the middle part of the third sequence, I.e. SQ11, in the third member. Several features of this kind of molar-tooth structure reflect some features of carbonate sedimentation in the Precambrian, such as the particular configuration, abundant organic matter, and easy silication. Stromatolites are chiefly formed in a shallow tidal-flat environment; lamina are mainly formed in the shallow ramp and molar-tooth structures are mainly generated in a relatively more deep-water environment from the middle to the deep ramp. Therefore, similar to stromatolite and lamina, the molartooth structure might also be a kind of bio

  13. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    Science.gov (United States)

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved into each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolution tree based on the phylogenic parsimony analysis of the C mu 4 sequences. PMID:1382992

  14. EST sequences and their annotation (amino acid sequence and results of homology search) - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available and VS) derived from five developmental stages. Clone ID ID of cDNA clone Atlas ID ID of Atlas database ( ht...tp://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ) and link to Atlas database NBRP ID ID of cDNA c...ir annotations (amino acid sequence, homology search results (with target DBs: dicty EST-DB, DNA-DB and prot...ein-DB)). Links to the Atlas database ( http://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ), whic

  15. Investigation of Antifouling Properties of Surfaces Featuring Zwitterionic α-Aminophosphonic Acid Moieties.

    Science.gov (United States)

    Wagner, Natalie; Zimmermann, Phyllis; Heisig, Peter; Klitsche, Franziska; Maison, Wolfgang; Theato, Patrick

    2015-12-01

    Zwitterionic thin films containing α-amino phosphonic acid moieties were successfully introduced on silicon surfaces and their antifouling properties were investigated. Initially, the substrates were modified with a hybrid polymer, composed of poly(methylsilsesquioxane) (PMSSQ) and poly(4-vinyl benzaldehyde) (PStCHO). Next, a Kabachnik-Fields post-polymerization modification (sur-KF-PMR) of the functionalized aldehyde surfaces was conducted with different amines and dialkyl phosphonates. After subsequent deprotection reaction of dialkyl phosphonates, the obtained zwitterionic surfaces were characterized by various techniques and we found excellent antifouling properties of the resulting films. PMID:26332285

  16. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    Science.gov (United States)

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  17. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    Science.gov (United States)

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  18. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    Science.gov (United States)

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered. PMID:11281267

  19. PASS2 database for the structure-based sequence alignment of distantly related SCOP domain superfamilies: update to version 5 and added features.

    Science.gov (United States)

    Gandhimathi, Arumugam; Ghosh, Pritha; Hariharaputran, Sridhar; Mathew, Oommen K; Sowdhamini, R

    2016-01-01

    Structure-based sequence alignment is an essential step in assessing and analysing the relationship of distantly related proteins. PASS2 is a database that records such alignments for protein domain superfamilies and has been constantly updated periodically. This update of the PASS2 version, named as PASS2.5, directly corresponds to the SCOPe 2.04 release. All SCOPe structural domains that share less than 40% sequence identity, as defined by the ASTRAL compendium of protein structures, are included. The current version includes 1977 superfamilies and has been assembled utilizing the structure-based sequence alignment protocol. Such an alignment is obtained initially through MATT, followed by a refinement through the COMPARER program. The JOY program has been used for structural annotations of such alignments. In this update, we have automated the protocol and focused on inclusion of new features such as mapping of GO terms, absolutely conserved residues among the domains in a superfamily and inclusion of PDBs, that are absent in SCOPe 2.04, using the HMM profiles from the alignments of the superfamily members and are provided as a separate list. We have also implemented a more user-friendly manner of data presentation and options for downloading more features. PASS2.5 version is available at http://caps.ncbs.res.in/pass2/. PMID:26553811

  20. Amino acid alcohols: growth inhibition and induction of differentiated features in melanoma cells.

    Science.gov (United States)

    Landau, O; Wasserman, L; Deutsch, A A; Reiss, R; Panet, H; Novogrodsky, A; Nordenberg, J

    1993-05-14

    The effects of a series of D- and L-amino acid alcohols on the proliferation and phenotypic expression of B16 mouse melanoma cells were evaluated. B16 melanoma cells were incubated for different time intervals in the presence of D- or L-phenylalaninol (PHE), D- or L-alaninol (AL), D- or L-leucinol (LE), L-histidinol (HIS), L-tyrosinol (TYR) and L-methioninol (MET). All agents, including the D or L configuration, induced an anti-proliferative effect, although of considerably different magnitude. D-PHE was the most active growth inhibitor. The growth inhibitory effects were accompanied by phenotypic alterations, which included morphological changes and enhancement in the activities of NADPH cytochrome c reductase and tau-glutamyl transpeptidase. These phenotypic alterations correlated with the growth inhibitory effects of the different agents and seem to reflect a higher differentiated state. PMID:8099846

  1. Identification and Characterization of Second-Generation Invader Locked Nucleic Acids (LNAs) for Mixed-Sequence Recognition of Double-Stranded DNA

    DEFF Research Database (Denmark)

    Sau, Sujay P; Madsen, Andreas S; Podbevsek, Peter;

    2013-01-01

    , but substantial efforts are currently devoted to the development of alternative strategies that overcome the limitations observed with the classic approaches. In 2005, we introduced Invader locked nucleic acids (LNAs), i.e., double-stranded probes that are activated for mixed-sequence recognition of dsDNA through...... modification with "+1 interstrand zippers" of 2'-N-(pyren-1-yl)methyl-2'-amino-α-l-LNA monomers. Despite promising preliminary results, progress has been slow because of the synthetic complexity of the building blocks. Here we describe a study that led to the identification of two simpler classes of Invader...... monomers. We compare the thermal denaturation characteristics of double-stranded probes featuring different interstrand zippers of pyrene-functionalized monomers based on 2'-amino-α-l-LNA, 2'-N-methyl-2'-amino-DNA, and RNA scaffolds. Insights from fluorescence spectroscopy, molecular modeling, and NMR...

  2. JRC GMO-Amplicons, a collection of nucleic acid sequences related to genetically modified organisms

    OpenAIRE

    PETRILLO MAURO; ANGERS ALEXANDRE; HENRIKSSON PETER; Bonfini, Laura; PATAK DENNSTEDT Alexandre; KREYSA JOACHIM

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico...

  3. Structural features of the human C3 gene: Intron/exon organization, transcriptional start site, and promoter region sequence

    International Nuclear Information System (INIS)

    The third component of human complement (C3) is a key molecule in the activation of the complement cascade. C3 cDNA fragments were used to identify seven cosmid clones that covered all but 1 kilobase pair (kb) of the C3 gene. The remainder of the gene was cloned by using the polymerase chain reaction. These clones were used to identify the interon/exon boundaries and to map the gene. The C3 gene is 42 kb in length and comprises 41 exons ranging in size from 52 to 213 base pairs (bp). The transcription start site was identified by primer extension, and approximately 1 kb of DNA upstream of this site was sequenced. Putative TATA and CAAT boxes were identified along with a number of regions that shared homology with known regulatory sequences. These include responsive elements for interferon-γ, interleukin-6, nuclear factor kB, estrogen, glucocorticoids and thyroid hormone. Several of these agents have been shown to affect C3 synthesis and mRNA levels. The sizes of the exons in C3 were compared to those of C4 and α2-macroglobulin (α2M). Thirty-nine of 41 exons in C4 were found to be of similar size to the analogous ones in C3, and two-thirds of those in α2M were also similarly sized, supporting the hypothesis that these genes arose from a common ancestor

  4. NetTurnP – Neural Network Prediction of Beta-turns by Use of Evolutionary Information and Predicted Protein Sequence Features

    DEFF Research Database (Denmark)

    Petersen, Bent; Lundegaard, Claus; Petersen, Thomas Nordahl

    2010-01-01

    NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which......-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC = 0.50, Qtotal = 82.1%, sensitivity = 75.6%, PPV = 68.8% and AUC = 0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17 – 0.......47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively....

  5. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Science.gov (United States)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  6. [Ten-years records of organic arsenic (diphenylarsinic acid) poisoning: epidemiology, clinical feature, metabolism, and toxicity].

    Science.gov (United States)

    Ishi, Kazuhiro; Tamaoka, Akira

    2015-01-01

    We report here the symptoms of diphenylarsinic acid (DPAA) poisoning recorded over 10 years since the DPAA contamination of the potable well water was first detected in the Kamisu City, Ibaraki Prefecture, in 2003. The poisoning symptoms associated with the cerebellum and brainstem included nystagmus, tremors, myoclonus, and cerebellar ataxia as well as the symptoms associated with the temporal and occipital lobes such as memory impairment, sleep disorder, and visual disturbance. Some of the affected children exhibited mental retardation. Moreover, reduced blood flow and reduced glucose metabolism in the cerebella, brainstem, and temporal and occipital lobes persisted for several years among the DPAA-exposed persons. Based on the animal studies for DPAA intoxication, the target organs for the DPAA toxicity were determined to be the central nervous system (CNS), liver, and biliary system. In particular, DPAA tends to persist in the brain for a long time, resulting in long-term impacts on the brain. The cerebral blood flow and brain glucose metabolism, which can be measured by positron emission tomography (PET) and single photon emission computed tomography (SPECT), respectively, are useful objective clinical markers to determine the effect of DPAA on CNS. We believe that continuous monitoring of the DPAA-exposed people may promote the effect of carcinogen and accelerate brain aging.

  7. Purification, characterization, and amino acid sequencing of a. delta. /sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B

    Energy Technology Data Exchange (ETDEWEB)

    Linden, K.G.

    1986-01-01

    Studies were performed on the ..delta../sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B. The studies have involved three broad areas: improvement in the purification of the enzyme, further characterization of the purified enzyme, and completion of the amino acid sequence of the enzyme. For the purification of the enzyme, techniques for removing the isomerase from whole cells were studied, the effects of ionic strength on the binding of the isomerase to steroidal affinity resins was explored, and a new affinity resin was developed. Absorption spectra and the proton NMR spectra of the isomerase were obtained. Amino acid sequencing of the oxosteroid isomerase indicates that the enzyme is a dimeric protein consisting of two identical subunits each consisting of a polypeptide chain of 131 residues and a M/sub r/ = 14,536.

  8. A Robust Approach for Action Recognition Based on Spatio-Temporal Features in RGB-D Sequences

    Directory of Open Access Journals (Sweden)

    Ly Quoc Ngoc

    2016-05-01

    Full Text Available Recognizing human action is attractive research topic in computer vision since it plays an important role on the applications such as human-computer interaction, intelligent surveillance, human actions retrieval system, health care, smart home, robotics and so on. The availability the low-cost Microsoft Kinect sensor, which can capture real-time high-resolution RGB and visual depth information, has opened an opportunity to significantly increase the capabilities of many automated vision based recognition tasks. In this paper, we propose new framework for action recognition in RGB-D video. We extract spatiotemporal features from RGB-D data that capture both visual, shape and motion information. Moreover, the segmentation technique is applied to present the temporal structure of action. Firstly, we use STIP to detect interest points both of RGB and depth channels. Secondly, we apply HOG3D descriptor for RGB channel and 3DS-HONV descriptor for depth channel. In addition, we also extract HOF2.5D from fusing RGB and Depth to capture human’s motion. Thirdly, we divide the video into segments and apply GMM to create feature vectors for each segment. So, we have three feature vectors (HOG3D, 3DS-HONV, and HOF2.5D that represent for each segment. Next, the max pooling technique is applied to create a final vector for each descriptor. Then, we concatenate the feature vectors from the previous step into the final vector for action representation. Lastly, we use SVM method for classification step. We evaluated our proposed method on three benchmark datasets to demonstrate generalizability. And, the experimental results shown to be more accurate for action recognition compared to the previous works. We obtain overall accuracies of 93.5%, 99.16% and 89.38% with our proposed method on the UTKinect-Action, 3D Action Pairs and MSR-Daily Activity 3D dataset, respectively. These results show that our method is feasible and superior performance over the

  9. Real-Time Nucleic Acid Sequence-Based Amplification Using Molecular Beacons for Detection of Enterovirus RNA in Clinical Specimens

    OpenAIRE

    Landry, Marie L.; Garner, Robin; Ferguson, David

    2005-01-01

    Real-time nucleic acid sequence-based amplification (NASBA) using molecular beacon technology (NASBA-beacon) was compared to standard NASBA with postamplification hybridization using electrochemiluminescently labeled probes (NASBA-ECL) for detection of enteroviruses (EV) in 133 cerebrospinal fluid and 27 stool samples. NASBA-ECL and NASBA-beacon were similar in sensitivity, detecting 55 (100%) and 52 (94.5%) EV-positive samples, respectively. There were no false positives. Both NASBA assays w...

  10. Using Triple Helix Forming Peptide Nucleic Acids for Sequence-selective Recognition of Double-stranded RNA

    OpenAIRE

    Hnedzko, Dziyana; Cheruiyot, Samwel K.; Rozners, Eriks

    2014-01-01

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple helix forming peptide nucleic acids (PNAs) that bind in the major grov...

  11. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids

    OpenAIRE

    Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

    2015-01-01

    Background Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. Results The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of thre...

  12. Genetic, molecular and expression features of the Pervenets mutant leading to high oleic acid content of seed oil in sunflower

    Directory of Open Access Journals (Sweden)

    Lacombe Séverine

    2002-01-01

    Full Text Available Pervenets is a sunflower population that displays seed oil with a high oleic acid content [HOAC]. Our aim is to reconcile all the data gathered on this mutant in a unique explanatory mechanism. All Pervenets-derived [HOAC] lines display no accumulation or a very reduced accumulation of the DELTA12-desaturase transcript in the embryos during the stages for oil accumulation. They also carry oleHOS specific RFLP markers revealed by an DELTA12-desaturase cDNA used as a probe. The linoleic or [LO] genotypes do not carry this RFLP marker, but another allele: oleLOR (oleHL locus. Linkage disequilibrium between the oleHOS allele and [HOAC] was verified. We studied the mode of inheritance of [HOAC] in two segregating populations. A F2 progenies revealed one dominant allele for [HOAC] that co-segregated with the oleHOS allele showing that the Pervenets mutation and oleHOS were closely linked. F6 recombinant inbred lines, showed the [HOAC] trait due to two independent loci: the locus carrying the oleHOS allele and another locus sup. One allele, supole, at this second locus may suppress the effect of the oleHOS allele on the [HOAC] trait. Northern analyses performed on [HOAC] lines and F1 ([HOAC] x [LO] hybrids revealed under-accumulation of DELTA12-desaturase transcript. Thus Pervenets mutation acts in trans. The oleHOS genomic region that may carry the Pervenets mutation was cloned. A genomic library was constructed in lambdafixII with the DNA from the RHA345 [HOAC] line and screened with a DELTA12-desaturase cDNA as a probe. Two overlapping clones were entirely sequenced and revealed carrying a gene for an DELTA12-desaturase probably located in the RE. This corresponds to the invariant part of the oleHL locus. Another clone (11.1 probably carries DELTA12-desaturase repeated sequences that cause instability of the clone. We showed that the 11.1 clone carries most of cDNA sequence, but due to its organization it is not yet sequenced. A mutation mechanism

  13. Fusion protein predicted amino acid sequence of the first US avian pneumovirus isolate and lack of heterogeneity among other US isolates.

    Science.gov (United States)

    Seal, B S; Sellers, H S; Meinersmann, R J

    2000-02-01

    Avian pneumovirus (APV) was first isolated from turkeys in the west-central US following emergence of turkey rhinotracheitis (TRT) during 1996. Subsequently, several APV isolates were obtained from the north-central US. Matrix (M) and fusion (F) protein genes of these isolates were examined for sequence heterogeneity and compared with European APV subtypes A and B. Among US isolates the M gene shared greater than 98% nucleotide sequence identity with only one nonsynonymous change occurring in a single US isolate. Although the F gene among US APV isolates shared 98% nucleotide sequence identity, nine conserved substitutions were detected in the predicted amino acid sequence. The predicted amino acid sequence of the US APV isolate's F protein had 72% sequence identity to the F protein of APV subtype A and 71% sequence identity to the F protein of APV subtype B. This compares with 83% sequence identity between the APV subtype A and B predicted amino acid sequences of the F protein. The US isolates were phylogenetically distinguishable from their European counterparts based on F gene nucleotide or predicted amino acid sequences. Lack of sequence heterogeneity among US APV subtypes indicates these viruses have maintained a relatively stable population since the first outbreak of TRT. Phylogenetic analysis of the F protein among APV isolates supports classification of US isolates as a new APV subtype C.

  14. cDNA-derived amino-acid sequence of a land turtle (Geochelone carbonaria) beta-chain hemoglobin.

    Science.gov (United States)

    Bordin, S; Meza, A N; Saad, S T; Ogo, S H; Costa, F F

    1997-06-01

    The cDNA sequence encoding the turtle Geochelone carbonaria beta-chain was determinated. The isolation of hemoglobin mRNA was based on degenerate primers' PCR in combination with 5'- and 3'-RACE protocol. The full length cDNA is 615 bp with the ATG start codon at position 53 and TGA stop codon at position 495; The AATAAA polyadenylation signal is found at position 599. The deduced polypeptyde contains 146 amino-acid residues. The predicted amino acid sequence shares 83% identity with the beta-globin of a related specie, the aquatic turtle C. p. belli. Otherwise, identity is higher when compared with chicken beta-Hb (80%) than with other reptilian orders (Squamata, 69%, and Crocodilia, 61%). Compared with human HbA, there is 67% identity, and at least three amino acid substitutions could be of some functional significance (Glu43 beta-->Ser, His116 beta-->Thr and His143 beta-->Leu). To our knowledge this represents the first cDNA sequence of a reptile globin gene described. PMID:9238523

  15. Amino acid sequence of the serine-repeat antigen (SERA) of Plasmodium falciparum determined from cloned cDNA.

    Science.gov (United States)

    Bzik, D J; Li, W B; Horii, T; Inselburg, J

    1988-09-01

    We report the isolation of cDNA clones for a Plasmodium falciparum gene that encodes the complete amino acid sequence of a previously identified exported blood stage antigen. The Mr of this antigen protein had been determined by sodium dodecylsulphate-polyacrylamide gel electrophoresis analysis, by different workers, to be 113,000, 126,000, and 140,000. We show, by cDNA nucleotide sequence analysis, that this antigen gene encodes a 989 amino acid protein (111 kDa) that contains a potential signal peptide, but not a membrane anchor domain. In the FCR3 strain the serine content of the protein was 11%, of which 57% of the serine residues were localized within a 201 amino acid sequence that included 35 consecutive serine residues. The protein also contained three possible N-linked glycosylation sites and numerous possible O-linked glycosylation sites. The mRNA was abundant during late trophozoite-schizont parasite stages. We propose to identity this antigen, which had been called p126, by the acronym SERA, serine-repeat antigen, based on its complete structure. The usefulness of the cloned cDNA as a source of a possible malaria vaccine is considered in view of the previously demonstrated ability of the antigen to induce parasite-inhibitory antibodies and a protective immune response in Saimiri monkeys. PMID:2847041

  16. TBX1 mutation identified by exome sequencing in a Japanese family with 22q11.2 deletion syndrome-like craniofacial features and hypocalcemia.

    Directory of Open Access Journals (Sweden)

    Tsutomu Ogata

    Full Text Available BACKGROUND: Although TBX1 mutations have been identified in patients with 22q11.2 deletion syndrome (22q11.2DS-like phenotypes including characteristic craniofacial features, cardiovascular anomalies, hypoparathyroidism, and thymic hypoplasia, the frequency of TBX1 mutations remains rare in deletion-negative patients. Thus, it would be reasonable to perform a comprehensive genetic analysis in deletion-negative patients with 22q11.2DS-like phenotypes. METHODOLOGY/PRINCIPAL FINDINGS: We studied three subjects with craniofacial features and hypocalcemia (group 1, two subjects with craniofacial features alone (group 2, and three subjects with normal phenotype within a single Japanese family. Fluorescence in situ hybridization analysis excluded chromosome 22q11.2 deletion, and genomewide array comparative genomic hybridization analysis revealed no copy number change specific to group 1 or groups 1+2. However, exome sequencing identified a heterozygous TBX1 frameshift mutation (c.1253delA, p.Y418fsX459 specific to groups 1+2, as well as six missense variants and two in-frame microdeletions specific to groups 1+2 and two missense variants specific to group 1. The TBX1 mutation resided at exon 9C and was predicted to produce a non-functional truncated protein missing the nuclear localization signal and most of the transactivation domain. CONCLUSIONS/SIGNIFICANCE: Clinical features in groups 1+2 are well explained by the TBX1 mutation, while the clinical effects of the remaining variants are largely unknown. Thus, the results exemplify the usefulness of exome sequencing in the identification of disease-causing mutations in familial disorders. Furthermore, the results, in conjunction with the previous data, imply that TBX1 isoform C is the biologically essential variant and that TBX1 mutations are associated with a wide phenotypic spectrum, including most of 22q11.2DS phenotypes.

  17. Coding potential and transcript analysis of fowl adenovirus 4: insight into upstream ORFs as common sequence features in adenoviral transcripts.

    Science.gov (United States)

    Griffin, Bryan D; Nagy, Eva

    2011-06-01

    Recombinant fowl adenoviruses (FAdVs) have been successfully used as veterinary vaccine vectors. However, insufficient definitions of the protein-coding and non-coding regions and an incomplete understanding of virus-host interactions limit the progress of next-generation vectors. FAdVs are known to cause several diseases of poultry. Certain isolates of species FAdV-C are the aetiological agent of inclusion body hepatitis/hydropericardium syndrome (IBH/HPS). In this study, we report the complete 45667 bp genome sequence of FAdV-4 of species FAdV-C. Assessment of the protein-coding potential of FAdV-4 was carried out with the Bio-Dictionary-based Gene Finder together with an evaluation of sequence conservation among species FAdV-A and FAdV-D. On this basis, 46 potentially protein-coding ORFs were identified. Of these, 33 and 13 ORFs were assigned high and low protein-coding potential, respectively. Homologues of the ancestral adenoviral genes were, with few exceptions, assigned high protein-coding potential. ORFs that were unique to the FAdVs were differentiated into high and low protein-coding potential groups. Notable putative genes with high protein-coding capacity included the previously unreported fiber 1, hypothetical 10.3K and hypothetical 10.5K genes. Transcript analysis revealed that several of the small ORFs less than 300 nt in length that were assigned low coding potential contributed to upstream ORFs (uORFs) in important mRNAs, including the ORF22 mRNA. Subsequent analysis of the previously reported transcripts of FAdV-1, FAdV-9, human adenovirus 2 and bovine adenovirus 3 identified widespread uORFs in AdV mRNAs that have the potential to act as important translational regulatory elements.

  18. Evaluation of codon biology in citrus and Poncirus trifoliata based on genomic features and frame corrected expressed sequence tags.

    Science.gov (United States)

    Ahmad, Touqeer; Sablok, Gaurav; Tatarinova, Tatiana V; Xu, Qiang; Deng, Xiu-Xin; Guo, Wen-Wu

    2013-04-01

    Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary process in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A- or T- at the wobble position were preferred in contrast to C- or G- ending codons, indicating a close association with AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of a set of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionary conserved optimal codons. Our observation suggested that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata and plausible role of the stress with GC3 and coevolution pattern of amino acid. PMID:23315666

  19. Evaluation of codon biology in citrus and Poncirus trifoliata based on genomic features and frame corrected expressed sequence tags.

    Science.gov (United States)

    Ahmad, Touqeer; Sablok, Gaurav; Tatarinova, Tatiana V; Xu, Qiang; Deng, Xiu-Xin; Guo, Wen-Wu

    2013-04-01

    Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary process in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A- or T- at the wobble position were preferred in contrast to C- or G- ending codons, indicating a close association with AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of a set of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionary conserved optimal codons. Our observation suggested that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata and plausible role of the stress with GC3 and coevolution pattern of amino acid.

  20. Amino acid sequence and glycosylation of functional unit RtH2-e from Rapana thomasiana (gastropod) hemocyanin.

    Science.gov (United States)

    Stoeva, Stanka; Idakieva, Krasimira; Betzel, Christian; Genov, Nicolay; Voelter, Wolfgang

    2002-03-15

    The complete amino acid sequence of Rapana thomasiana hemocyanin functional unit RtH2-e was determined by direct sequencing and matrix-assisted laser desorption ionization mass spectrometry of peptides obtained by cleavage with EndoLysC proteinase, chymotrypsin, and trypsin. The single-polypeptide chain of RtH2-e consists of 413 amino acid residues and contains two consensus sequences NXS/T (positions 11-19 and 127-129), potential sites for N-glycosylation. Monosaccharide analysis of RtH2-e revealed a carbohydrate content of about 1.1% and the presence of xylose, fucose, mannose, and N-acetylglucosamine, demonstrating that only N-linked carbohydrate chains of high-mannose type seem to be present. On basis of the monosaccharide composition and MALDI-MS analysis of native and PNGase-F-treated chymotryptic glycopeptide fragment of RtH2-e the oligosaccharide Man(5)GlcNAc(2), attached to Asn(127), is suggested. Multiple sequence alignments with other molluscan hemocyanin e functional units revealed an identity of 63% to the cephalopod Octopus dofleini and of 69% to the gastropod Haliotis tuberculata. The present results are discussed in view of the recently determined X-ray structure of the functional unit g of the O. dofleini hemocyanin. PMID:11888200

  1. Cloning, Sequence Analysis, and Expression in Escherichia coli of the Gene Encoding an α-Amino Acid Ester Hydrolase from Acetobacter turbidans

    NARCIS (Netherlands)

    Polderman-Tijmes, Jolanda J.; Jekel, P; de Vries, Erik; van Merode, Annet; Floris, René; Laan, Jan-Metske van der; Sonke, Theo; Janssen, Dick B.

    2002-01-01

    The α-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing β-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified α-amino acid ester hydrolase allowed cloning and genetic characterization of the

  2. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    Science.gov (United States)

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  3. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Science.gov (United States)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  4. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans

    NARCIS (Netherlands)

    Polderman-Tijmes, JJ; Jekel, PA; de Vries, EJ; van Merode, Annet; Floris, R; van der Laan, JM; Sonke, T; Janssen, DB

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterizat

  5. 广东省二叠纪含煤岩系层序地层特征%Permian Coal-bearing Strata Sequence Stratigraphic Features in Guangdong Province

    Institute of Scientific and Technical Information of China (English)

    董大啸; 刘特辉; 吴雷; 邵龙义

    2012-01-01

    Based on field measured sections, indoor rock and mineral identification and comprehensive mapping, have analyzed Permian coal-bearing strata sequence stratigraphic features. 5 types of regional surface of unconformity, incised valley scour surface, fossil soil bed, event interface and sedimentary facies transformation base have been identified, totally 6 third -order sequence boundaries and divided into 5 third-order sequences. Representative interfaces of S1~S5 bottom have K1 and K2 sandstone bottoms in the Guanghua area, limestone unconformity surface and white limestone member bottom in the Lianyang area, as well as the Shengtang Formation Datao sandstone bottom in Guanghua area; top interface of S5 is the structural event interface between Permian and Triassic systems. The contrast of different areas has found that the sequence development features in Guanghua and Lianyang two areas are rather similar. The variation of 5 sequences in Lianyang area presents cyclic process of marine transgression (S1, S2)-fast regression-transgression (S3)-slow regression (S4, S5), while 5 sequences features in the Quren area are similar but little varying. The contrast between 1990s' sequence stratigraphic division scheme and this study, 2 sequence interfaces are identical.%根据野外剖面实测、室内岩矿鉴定及综合作图等,对广东省二叠纪含煤岩系层序地层特征进行分析,识别出区域不整合面、下切谷冲刷面、古土壤层、事件界面、沉积相转换面5个类型、共6个三级层序界面,划分为5个三级层序.S1~S5各层序底界面的代表界面分别为广花地区K1砂岩底面、广花地区K2砂岩底面、连阳地区灰岩不整合面、连阳地区白灰岩段底面和广花地区圣堂组大套砂岩底面;S5的顶界面为二叠系与三叠系之间的构造事件面.对比不同地区后发现,广花和兴梅两个地区的层序发育特征较为类似,连阳地区5个层序的变化呈现了海侵(S1、S2)-

  6. Site-directed gene mutation at mixed sequence targets by psoralen-conjugated pseudo-complementary peptide nucleic acids.

    Science.gov (United States)

    Kim, Ki-Hyun; Nielsen, Peter E; Glazer, Peter M

    2007-01-01

    Sequence-specific DNA-binding molecules such as triple helix-forming oligonucleotides (TFOs) provide a means for inducing site-specific mutagenesis and recombination at chromosomal sites in mammalian cells. However, the utility of TFOs is limited by the requirement for homopurine stretches in the target duplex DNA. Here, we report the use of pseudo-complementary peptide nucleic acids (pcPNAs) for intracellular gene targeting at mixed sequence sites. Due to steric hindrance, pcPNAs are unable to form pcPNA-pcPNA duplexes but can bind to complementary DNA sequences by Watson-Crick pairing via double duplex-invasion complex formation. We show that psoralen-conjugated pcPNAs can deliver site-specific photoadducts and mediate targeted gene modification within both episomal and chromosomal DNA in mammalian cells without detectable off-target effects. Most of the induced psoralen-pcPNA mutations were single-base substitutions and deletions at the predicted pcPNA-binding sites. The pcPNA-directed mutagenesis was found to be dependent on PNA concentration and UVA dose and required matched pairs of pcPNAs. Neither of the individual pcPNAs alone had any effect nor did complementary PNA pairs of the same sequence. These results identify pcPNAs as new tools for site-specific gene modification in mammalian cells without purine sequence restriction, thereby providing a general strategy for designing gene targeting molecules. PMID:17977869

  7. Phylogenetic analysis of evolutionary relationships of the planctomycete division of the domain bacteria based on amino acid sequences of elongation factor Tu.

    Science.gov (United States)

    Jenkins, C; Fuerst, J A

    2001-05-01

    Sequences from the tuf gene coding for the elongation factor EF-Tu were amplified and sequenced from the genomic DNA of Pirellula marina and Isosphaera pallida, two species of bacteria within the order Planctomycetales. A near-complete (1140-bp) sequence was obtained from Pi. marina and a partial (759-bp) sequence was obtained for I. pallida. Alignment of the deduced Pi. marina EF-Tu amino acid sequence against reference sequences demonstrated the presence of a unique 11-amino acid sequence motif not present in any other division of the domain Bacteria. Pi. marina shared the highest percentage amino acid sequence identity with I. pallida but showed only a low percentage identity with other members of the domain Bacteria. This is consistent with the concept of the planctomycetes as a unique division of the Bacteria. Neither primary sequence comparison of EF-Tu nor phylogenetic analysis supports any close relationship between planctomycetes and the chlamydiae, which has previously been postulated on the basis of 16S rRNA. Phylogenetic analysis of aligned EF-Tu amino acid sequences performed using distance, maximum-parsimony, and maximum-likelihood approaches yielded contradictory results with respect to the position of planctomycetes relative to other bacteria. It is hypothesized that long-branch attraction effects due to unequal evolutionary rates and mutational saturation effects may account for some of the contradictions. PMID:11443344

  8. 视频序列中基于LBP特征的人体行为识别%Human Action Recognition Based on Local Binary Pattern Feature in Video Sequences

    Institute of Scientific and Technical Information of China (English)

    王宪; 慕鑫; 宋书林; 陈向阳

    2013-01-01

    Human action recognition in the video sequence have become a hot research topic in computer vision field. In order to extract the contour feature of the human’s behavior sequence more effectively, a new algorithm for human action recognition based on Local Binary Pattern (LBP) is proposed. Firstly, background subtraction algorithm is used to extract the complete human motion sequence in the video, and the LBP operators are used to calculate the samples’ LBP feature space which is composed of the motion sequences’ LBP feature spectrum. Then, the behavior feature is generated by k-means clustering method. Finally, the Hidden Markov Model (HMM) is adopted for the classification. During the experiment, the test experiment is performed in the two public behavior databases respectively, and the average recognition rate can reach more than 85%. The intersection of the two databases experimental results shows that the proposed algorithm has certain robustness.%  视频序列中的行为分析与识别已经成为当前计算机视觉领域的研究热点。为了更加有效地提取人体行为序列中的轮廓特征的信息,提出了一种基于局部二值模式(Local Binary Pattern,LBP)特征的人体行为识别的算法。通过背景差分法从视频中提取完整的人体运动序列,利用LBP算子计算运动序列的LBP特征谱,组成样本的LBP轮廓特征空间,接着将特征空间通过K-means聚类的方法生成行为特征。最后,采用隐马尔可夫模型(HMM)对特征进行识别。实验过程中,分别在两个公共行为数据库上进行了测试实验,平均识别率能达到85%以上,并且在两个数据库的交叉实验结果表明了本文算法具有一定的鲁棒性。

  9. Biomechanical features of Poly-D,L-lactic acid (PDLLA) rods through the degradation in vitro and in vivo

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Objective:To observe the changing of biomechanical features during the degradation course of poly-D,L-lactic acid (PDLLA) rods in vivo and in vitro and to evaluate its value as an internal fixation material. Methods :PDLLA rods were emerged into PBS simultaneous body fluid with constant temperature of 37C and the rods were embedded into muscle tissue of 20 rabbits for degradation in vitro and in vivo . The rods were taken out in 2, 4, 6, 8 and 12 weeks. Biomechanical features of bending, shearing and axial compression strength, rigidity and elastic modulus were observed during the degradation course. Statistical method was used to test the changes of biomechanical parameters. Results: (1)There was similar changes of bending, compressive, shearing strength and bending, compressive and shearing rigidity of the PDLLA rods between in vivo and in vitro. (2)Bending, compressive, shearing strength decreased 33%,18 % and 43 % respectively within the first stage of the degradation, and after 6 weeks of degradation, they decreased slowly. (3)Elastic modulus, bending, compressive and shearing rigiditydecreased sharply during the 6 weeks of degradation, with a drop of 22%, 39% and 30%00 respectively, and after 8 weeks, they decreased slowly. Even after 12 weeks of degradation, the strength of the rods was still higher than that of sponge bone. Conclusion: During the degradation of the material, the strength and rigidity of PDLLA rods can meet the need of fracture fixation of cancellous bones.

  10. Purification and amino acid sequence of a bacteriocins produced by Lactobacillus salivarius K7 isolated from chicken intestine

    Directory of Open Access Journals (Sweden)

    Kenji Sonomoto

    2006-03-01

    Full Text Available A bacteriocin-producing strain, Lactobacillus K7, was isolated from a chicken intestine. The inhibitory activity was determined by spot-on-lawn technique. Identification of the strain was performed by morphological, biochemical (API 50 CH kit and molecular genetic (16S rDNA basis. Bacteriocin purification processes were carried out by amberlite adsorption, cation exchange and reverse-phase high perform- ance liquid chromatography. N-terminal amino acid sequences were performed by Edman degradation. Molecular mass was determined by electrospray-ionization (ESI mass spectrometry (MS. Lactobacillus K7 showed inhibitory activity against Lactobacillus sakei subsp. sakei JCM 1157T, Leuconostoc mesenteroides subsp. mesenteroides JCM 6124T and Bacillus coagulans JCM 2257T. This strain was identified as Lb. salivarius. The antimicrobial substance was destroyed by proteolytic enzymes, indicating its proteinaceous structure designated as a bacteriocin type. The purification of bacteriocin by amberlite adsorption, cation exchange, and reverse-phase chromatography resulted in only one single active peak, which was designated FK22. Molecular weight of this fraction was 4331.70 Da. By amino acid sequence, this peptide was homology to Abp 118 beta produced by Lb. salivarius UCC118. In addition, Lb. salivarius UCC118 produced 2-peptide bacteriocin, which was Abp 118 alpha and beta. Based on the partial amino acid sequences of Abp 118 beta, specific primers were designed from nucleotide sequences according to data from GenBank. The result showed that the deduced peptide was high homology to 2-peptide bacteriocin, Abp 118 alpha and beta.

  11. Insight on how fishing bats discern prey and adjust their mechanic and sensorial features during the attack sequence.

    Science.gov (United States)

    Aizpurua, Ostaizka; Alberdi, Antton; Aihartza, Joxerra; Garin, Inazio

    2015-07-21

    Several insectivorous bats have included fish in their diet, yet little is known about the processes underlying this trophic shift. We performed three field experiments with wild fishing bats to address how they manage to discern fish from insects and adapt their hunting technique to capture fish. We show that bats react only to targets protruding above the water and discern fish from insects based on prey disappearance patterns. Stationary fish trigger short and shallow dips and a terminal echolocation pattern with an important component of the narrowband and low frequency calls. When the fish disappears during the attack process, bats regulate their attack increasing the number of broadband and high frequency calls in the last phase of the echolocation as well as by lengthening and deepening their dips. These adjustments may allow bats to obtain more valuable sensorial information and to perform dips adjusted to the level of uncertainty on the location of the submerged prey. The observed ultrafast regulation may be essential for enabling fishing to become cost-effective in bats, and demonstrates the ability of bats to rapidly modify and synchronise their sensorial and motor features as a response to last minute stimulus variations.

  12. Gene expression classification using epigenetic features and DNA sequence composition in the human embryonic stem cell line H1.

    Science.gov (United States)

    Su, Wen-Xia; Li, Qian-Zhong; Zhang, Lu-Qiang; Fan, Guo-Liang; Wu, Cheng-Yan; Yan, Zhen-He; Zuo, Yong-Chun

    2016-10-30

    Epigenetic factors are known to correlate with gene expression in the existing studies. However, quantitative models that accurately classify the highly and lowly expressed genes based on epigenetic factors are currently lacking. In this study, a new machine learning method combines histone modifications, DNA methylation, DNA accessibility, transcription factors, and trinucleotide composition with support vector machines (SVM) is developed in the context of human embryonic stem cell line (H1). The results indicate that the predictive accuracy will be markedly improved when the epigenetic features are considered. The predictive accuracy and Matthews correlation coefficient of the best model are as high as 95.96% and 0.92 for 10-fold cross-validation test, and 95.58% and 0.92 for independent dataset test, respectively. Our model provides a good way to judge a gene is either highly or lowly expressed gene by using genetic and epigenetic data, when the expression data of the gene is lacking. And a web-server GECES for our analysis method is established at http://202.207.14.87:8032/fuwu/GECES/index.asp, so that other scientists can easily get their desired results by our web-server, without going through the mathematical details. PMID:27468948

  13. Luminescence features from conical bubble collapse in 1,2 propanediol and its perturbation adding sulfuric acid

    Energy Technology Data Exchange (ETDEWEB)

    Navarrete, M; Godinez, F A [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360, Instituto de Ingenieria, Lab. de Fotofisica (Mexico); Sanchez, C [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360, Lab. de Fotonica y Microondas (Mexico); Mejia, E V; Villagran, M, E-mail: mnm@pumas.iingen.unam.mx [Universidad Nacional Autonoma de Mexico, Ciudad Universitaria No. 3000, Col. Copilco Universidad, Delegacion de Coyoacan, Mexico, D. F. Codigo Postal 04360 (Mexico)

    2011-01-01

    A summary of experimental findings on the luminescence from bubble collapse, CBL, varying the gas inert bubble content, the driving pressure and perturbing the liquid piston with small quantities of sulfuric acid is presented. The temporal, spectral, and spatial characteristics of the luminescence regarding with dynamic features of collapse are also examinees. CBL was reproduced using Argon gas, and 1, 2-propanediol as liquid piston. In general, the pulse shape exhibits a large variety of profiles. The luminescence intensity was increased two-fold and the pulse width decreased almost to half when the liquid was disturbed with sulfuric acid. Spectrally, the Swan, CH and CN lines were observed at low volume of Ar gas and low driving pressure, lines of OH{sup 0}, Na*, K* always appear superimposed on an underlying continuum background. De-excitation of sodium atom at 589 nm and two satellites diffuse bands at {approx}554 nm and {approx}620 nm from alkali-metal-argon exciplexes was observed in both systems under certain conditions. All these findings point towards several sources of light emission that are generated during the compression time line, resulting in temporally and spatially inhomogeneous pulse. A mechanism for explain the bright CBL is broached.

  14. IDEPI: rapid prediction of HIV-1 antibody epitopes and other phenotypic features from sequence data using a flexible machine learning platform.

    Science.gov (United States)

    Hepler, N Lance; Scheffler, Konrad; Weaver, Steven; Murrell, Ben; Richman, Douglas D; Burton, Dennis R; Poignard, Pascal; Smith, Davey M; Kosakovsky Pond, Sergei L

    2014-09-01

    Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals--a cure and a vaccine--remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with some genes differing at as many as 40% of nucleotide positions among circulating strains. Because of this, the genetic bases of many viral phenotypes, most notably the susceptibility to neutralization by a particular antibody, are difficult to identify computationally. Drawing upon open-source general-purpose machine learning algorithms and libraries, we have developed a software package IDEPI (IDentify EPItopes) for learning genotype-to-phenotype predictive models from sequences with known phenotypes. IDEPI can apply learned models to classify sequences of unknown phenotypes, and also identify specific sequence features which contribute to a particular phenotype. We demonstrate that IDEPI achieves performance similar to or better than that of previously published approaches on four well-studied problems: finding the epitopes of broadly neutralizing antibodies (bNab), determining coreceptor tropism of the virus, identifying compartment-specific genetic signatures of the virus, and deducing drug-resistance associated mutations. The cross-platform Python source code (released under the GPL 3.0 license), documentation, issue tracking, and a pre-configured virtual machine for IDEPI can be found at https://github.com/veg/idepi. PMID:25254639

  15. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    OpenAIRE

    Yan Koon-Kiu; Axelsen Jacob; Maslov Sergei

    2005-01-01

    Abstract Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model...

  16. Transition Metal Complexes of Phosphinous Acids Featuring a Quasichelating Unit: Synthesis, Characterization, and Hetero-bimetallic Complexes.

    Science.gov (United States)

    Allefeld, Nadine; Bader, Julia; Neumann, Beate; Stammler, Hans-Georg; Ignat'ev, Nikolai; Hoge, Berthold

    2015-08-17

    Diorganophosphane oxides were employed as preligands for the synthesis of catalytically active transition metal complexes of the phosphinous acids (CF3)2POH and (C2F5)2POH. Their reactions with solid PtCl2 and PdCl2 led to the formation of mononuclear phosphinous acid complexes [Cl2M{P(R(f))2OH}2] (M = Pd, Pt; R(f) = C2F5, CF3), which can be crystallized, for example, as its pyridinium salts, 2[HPy](+)[Cl2Pd{P(CF3)2O}2](2-). In vacuo HCl is liberated from the neutral palladium complexes affording mixtures of di- and polynuclear complexes. Moreover, (C2F5)2POH was reacted with several β-diketonato complexes of palladium, platinum, and nickel yielding air- and moisture-stable complexes [(acac)M{[P(R(f))2O]2H}], featuring a quasichelating phosphinous acid phosphinito unit {P(R(f))2O···H···O(R(f))2P}(-). Treatment of [Ni(Cp)2] (Cp = cyclopentadienyl) and [(cod)RhCl]2 (cod = 1,5-cyclooctadiene) with (C2F5)2POH leads to the substitution of one Cp or chloro ligand by a quasichelating unit. The novel coordination compounds were characterized by NMR and IR spectroscopies, mass spectrometry, and X-ray diffraction analysis. The platinum complex [(acac)Pt{[P(C2F5)2O]2H}] (acac = acetylacetonato) was used for the construction of hetero-bimetallic complexes by the treatment with [(cod)RhCl]2 and [Ni(Cp)2]. The trinuclear bimetallic complex [{(acac)Pt[P(C2F5)2O]2}2Ni] is the first structurally characterized hetero-bimetallic species containing a bis(perfluoroalkyl)phosphinito bridge. PMID:26242286

  17. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Science.gov (United States)

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  18. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Science.gov (United States)

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  19. "De-novo" amino acid sequence elucidation of protein G'e by combined "Top-Down" and "Bottom-Up" mass spectrometry

    Science.gov (United States)

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F. M.; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L.; Glocker, Michael O.

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein Ǵ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α- N-gluconoylation and α- N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α- N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant ( K d ) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.

  20. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Science.gov (United States)

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  1. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Science.gov (United States)

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/. PMID:26424080

  2. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    International Nuclear Information System (INIS)

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of ≅ 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH2-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung

  3. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Warr, R.G.; Hawgood, S.; Buckley, D.I.; Crisp, T.M.; Schilling, J.; Benson, B.J.; Ballard, P.L.; Clements, J.A.; White, R.T.

    1987-11-01

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of approx. = 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH/sub 2/-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung.

  4. Amino acid sequence and posttranslational modifications of human factor VIIa from plasma and transfected baby hamster kidney cells

    International Nuclear Information System (INIS)

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VIIa, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca2+ and tissue factor. Three types of potential posttranslational modifications exist in the human factor VIIa molecule, namely, 10 γ-carboxylated, N-terminally located glutamic acid residues, 1 β-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VIIa as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VIIa. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VIIa was found to be identical with human factor VIIa. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VIIa. In the recombinant factor VIIa, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VIIa and human plasma factor VIIa. These results show that factor VIIa as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VIIa and that this cell line thus might represent an alternative source for human factor VIIa

  5. Characterization of DNA-binding sequences for CcaR in the cephamycin-clavulanic acid supercluster of Streptomyces clavuligerus.

    Science.gov (United States)

    Santamarta, I; López-García, M T; Kurt, A; Nárdiz, N; Alvarez-Álvarez, R; Pérez-Redondo, R; Martín, J F; Liras, P

    2011-08-01

    RT-PCR analysis of the genes in the clavulanic acid cluster revealed three transcriptional polycistronic units that comprised the ceaS2-bls2-pah2-cas2, cyp-fd-orf12-orf13 and oppA2-orf16 genes, whereas oat2, car, oppA1, claR, orf14, gcaS and pbpA were expressed as monocistronic transcripts. Quantitative RT-PCR of Streptomyces clavuligerus ATCC 27064 and the mutant S. clavuligerus ccaR::aph showed that, in the mutant, there was a 1000- to 10,000-fold lower transcript level for the ceaS2 to cas2 polycistronic transcript that encoded CeaS2, the first enzyme of the clavulanic acid pathway that commits arginine to clavulanic acid biosynthesis. Smaller decreases in expression were observed in the ccaR mutant for other genes in the cluster. Two-dimensional electrophoresis and MALDI-TOF analysis confirmed the absence in the mutant strain of proteins CeaS2, Bls2, Pah2 and Car that are required for clavulanic acid biosynthesis, and CefF and IPNS that are required for cephamycin biosynthesis. Gel shift electrophoresis using recombinant r-CcaR protein showed that it bound to the ceaS2 and claR promoter regions in the clavulanic acid cluster, and to the lat, cefF, cefD-cmcI and ccaR promoter regions in the cephamycin C gene cluster. Footprinting experiments indicated that triple heptameric conserved sequences were protected by r-CcaR, and allowed identification of heptameric sequences as CcaR binding sites.

  6. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    OpenAIRE

    A.E. El Hakim; W.H. Salama; M.B. Hamed; Ali, A. A.; N.M. Ibrahim

    2015-01-01

    Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including; gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were, 60, 56 kDa and 60, 53 kDa for LAAOI and LAAOII, respectively. The t...

  7. The nucleotide sequence of HLA-B{sup *}2704 reveals a new amino acid substitution in exon 4 which is also present in HLA-B{sup *}2706

    Energy Technology Data Exchange (ETDEWEB)

    Rudwaleit, M.; Bowness, P.; Wordsworth, P. [John Radcliffe Hospital, Oxford (United Kingdom)

    1996-12-31

    The HLA-B27 subtype HLA-B{sup *}2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B{sup *}2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B{sup *}2705, HLA-B{sup *}2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.

  8. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    Science.gov (United States)

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  9. Main: Sequences [KOME

    Lifescience Database Archive (English)

    Full Text Available Sequences Amino Acid Sequence Amino Acid sequence of full length cDNA (Longest ORF) kome_ine_full_sequence..._amino_db.fasta.zip kome_ine_full_sequence_amino_db.zip kome_ine_full_sequence_amino_db ...

  10. The structural analysis of protein sequences based on the quasi-amino acids code

    Institute of Scientific and Technical Information of China (English)

    Zhu Ping; Tang Xu-Qing; Xu Zhen-Yuan

    2009-01-01

    Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project, it comes the postgenome era when the proteomics technology is emerging. This paper studies protein molecule from the algebraic point of view. The algebraic system (∑, +, *) is introduced, where ∑ is the set of 64 codons. According to the characteristics of (∑,+, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation is revealed about quasi-amino acids. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlation relationships may play an important part in establishing the logic relationship between codons and the quasi-amino acids during the course of life origination. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties about amino acids code are very difficult to observe. The present paper shows that (ZU, +, ×) is a field. Furthermore, the operational results display that the codon tga has different property from other stop codons. In fact, in the mitochondrion from human and ox genomic codon, tga is just tryptophane, is not the stop codon like in other genetic code, it is the case of the Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code, in other words it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3).

  11. Complete amino acid sequence of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris.

    Science.gov (United States)

    Lehman, L D; Jones, B N; Dwulet, F E; Bogardt, R A; Gurd, F R

    1980-10-21

    The complete primary structure of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at its two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage with trypsin of the citraconylated apomyoglobin at its three arginine residues. Further digestion of the central cyanogen bromide peptide with S. aureus strain V8 protease and the 1,2-cyclohexanedione-treated central cyanogen bromide peptide with trypsin enabled the determination of the remainder of the covalent structure. This myoglobin differs from the cetacean myoglobins determined to date at 12 to 17 positions. These large sequence differences reflect the distant taxonomic relationships between the goose-beaked whale and the other species of Cetacea the myoglobin sequences of which have previously been determined.

  12. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ID ID of a sequence Cluster ID ID of cluster. gclust_cluster is referenced. Annotation in original... database Annotation at the original website Species Species name Length Amino acid sequen

  13. Construction Strategy for an Internal Amplification Control for Real-Time Diagnostic Assays Using Nucleic Acid Sequence-Based Amplification: Development and Clinical Application

    OpenAIRE

    Rodríguez-Lázaro, David; D'Agostino, Martin; Pla, Maria; Cook, Nigel

    2005-01-01

    An important analytical control in molecular amplification-based methods is an internal amplification control (IAC), which should be included in each reaction mixture. An IAC is a nontarget nucleic acid sequence which is coamplified simultaneously with the target sequence. With negative results for the target nucleic acid, the absence of an IAC signal indicates that amplification has failed. A general strategy for the construction of an IAC for inclusion in molecular beacon-based real-time nu...

  14. Isolation and amino acid sequence of crustacean hyperglycemic hormone precursor-related peptides.

    Science.gov (United States)

    Tensen, C P; Verhoeven, A H; Gaus, G; Janssen, K P; Keller, R; Van Herp, F

    1991-01-01

    The crustacean hyperglycemic hormone (CHH) is synthesized as part of a larger preprohormone in which the sequence of CHH is N-terminally flanked by a peptide for which the name CPRP (CHH precursor-related peptide) is proposed. Both CHH and CPRP are present in the sinus gland, the neurohemal organ of neurosecretory cells located in the eyestalk of decapod crustaceans. This paper describes the isolation and sequence analysis of CPRPs isolated from sinus glands of the crab Carcinus maenas, the crayfish Orconectes limosus and the lobster Homarus americanus. The published sequence of "peptide H" isolated from the land crab, Cardisoma carnifex, has now been recognized as a CPRP in this species. Sequence comparison reveals a high level of identity for the N-terminal region (residues 1-13) between all four peptides, while identity in the C-terminal domain is high between lobster and crayfish CPRP on the one hand, and between both crab species on the other. Conserved N-terminal residues include a putative monobasic processing site at position 11, which suggests that CPRP may be a biosynthetic intermediate from which a potentially bioactive decapeptide can be derived.

  15. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer

    Science.gov (United States)

    Zambanini, Thiemo; Buescher, Joerg M.; Meurer, Guido; Blank, Lars M.

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  16. Amino Acid Sequence Requirements of the Transmembrane and Cytoplasmic Domains of Influenza Virus Hemagglutinin for Viable Membrane Fusion

    OpenAIRE

    Melikyan, Grigory B.; Lin, Sasa; Roth, Michael G.; Cohen, Fredric S.

    1999-01-01

    The amino acid sequence requirements of the transmembrane (TM) domain and cytoplasmic tail (CT) of the hemagglutinin (HA) of influenza virus in membrane fusion have been investigated. Fusion properties of wild-type HA were compared with those of chimeras consisting of the ectodomain of HA and the TM domain and/or CT of polyimmunoglobulin receptor, a nonviral integral membrane protein. The presence of a CT was not required for fusion. But when a TM domain and CT were present, fusion activity w...

  17. Hyperspectral analysis for qualitative and quantitative features related to acid mine drainage at a remediated open-pit mine

    Science.gov (United States)

    Davies, G.; Calvin, W. M.

    2015-12-01

    The exposure of pyrite to oxygen and water in mine waste environments is known to generate acidity and the accumulation of secondary iron minerals. Sulfates and secondary iron minerals associated with acid mine drainage (AMD) exhibit diverse spectral properties in the ultraviolet, visible and near-infrared regions of the electromagnetic spectrum. The use of hyperspectral imagery for identification of AMD mineralogy and contamination has been well studied. Fewer studies have examined the impacts of hydrologic variations on mapping AMD or the unique spectral signatures of mine waters. Open-pit mine lakes are an additional environmental hazard which have not been widely studied using imaging spectroscopy. A better understanding of AMD variation related to climate fluctuations and the spectral signatures of contaminated surface waters will aid future assessments of environmental contamination. This study examined the ability of multi-season airborne hyperspectral data to identify the geochemical evolution of substances and contaminant patterns at the Leviathan Mine Superfund site. The mine is located 24 miles southeast of Lake Tahoe and contains remnant tailings piles and several AMD collection ponds. The objectives were to 1) distinguish temporal changes in mineralogy at a the remediated open-pit sulfur mine, 2) identify the absorption features of mine affected waters, and 3) quantitatively link water spectra to known dissolved iron concentrations. Images from NASA's AVIRIS instrument were collected in the spring, summer, and fall seasons for two consecutive years at Leviathan (HyspIRI campaign). Images had a spatial resolution of 15 meters at nadir. Ground-based surveys using the ASD FieldSpecPro spectrometer and laboratory spectral and chemical analysis complemented the remote sensing data. Temporal changes in surface mineralogy were difficult to distinguish. However, seasonal changes in pond water quality were identified. Dissolved ferric iron and chlorophyll

  18. Prediction of the Occurrence of the ADP-binding βαβ-fold in Proteins, Using an Amino Acid Sequence Fingerprint

    NARCIS (Netherlands)

    Wierenga, Rik K.; Terpstra, Peter; Hol, Wim G.J.

    1986-01-01

    An amino acid sequence "fingerprint” has been derived that can be used to test if a particular sequence will fold into a βαβ-unit with ADP-binding properties. It was deduced from a careful analysis of the known three-dimensional structures of ADP-binding βαβ-folds. This fingerprint is in fact a set

  19. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Rhee, Mun Su [University of Florida, Gainesville; Moritz, Brelan E. [University of Florida, Gainesville; Xie, Gary [Los Alamos National Laboratory (LANL); Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Chertkov, Olga [Los Alamos National Laboratory (LANL); Brettin, Thomas S [ORNL; Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Patel, Milind [University of Florida, Gainesville; Ou, Mark [University of Florida, Gainesville; Harbrucker, Roberta [University of Florida, Gainesville; Ingram, Lonnie O. [University of Florida; Shanmugam, Keelnathan T. [University of Florida

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer- ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this spo- rogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attrac- tive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi- cellulose. This bacterium is also considered as a potential probiotic. Complete genome se- quence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  20. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Xie, Gary [Los Alamos National Laboratory (LANL); Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer-ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi-cellulose. This bacterium is also considered as a potential probiotic. Complete genome squence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  1. Global analysis of physical and functional RNA targets of hnRNP L reveals distinct sequence and epigenetic features of repressed and enhanced exons.

    Science.gov (United States)

    Cole, Brian S; Tapescu, Iulia; Allon, Samuel J; Mallory, Michael J; Qiu, Jinsong; Lake, Robert J; Fan, Hua-Ying; Fu, Xiang-Dong; Lynch, Kristen W

    2015-12-01

    HnRNP L is a ubiquitous splicing-regulatory protein that is critical for the development and function of mammalian T cells. Previous work has identified a few targets of hnRNP L-dependent alternative splicing in T cells and has described transcriptome-wide association of hnRNP L with RNA. However, a comprehensive analysis of the impact of hnRNP L on mRNA expression remains lacking. Here we use next-generation sequencing to identify transcriptome changes upon depletion of hnRNP L in a model T-cell line. We demonstrate that hnRNP L primarily regulates cassette-type alternative splicing, with minimal impact of hnRNP L depletion on transcript abundance, intron retention, or other modes of alternative splicing. Strikingly, we find that binding of hnRNP L within or flanking an exon largely correlates with exon repression by hnRNP L. In contrast, exons that are enhanced by hnRNP L generally lack proximal hnRNP L binding. Notably, these hnRNP L-enhanced exons share sequence and context features that correlate with poor nucleosome positioning, suggesting that hnRNP may enhance inclusion of a subset of exons via a cotranscriptional or epigenetic mechanism. Our data demonstrate that hnRNP L controls inclusion of a broad spectrum of alternative cassette exons in T cells and suggest both direct RNA regulation as well as indirect mechanisms sensitive to the epigenetic landscape.

  2. Hyperintense HCC on hepatobiliary phase images of gadoxetic acid-enhanced MRI: Correlation with clinical and pathological features

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Ja Young [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Kim, Myeong-Jin, E-mail: kimnex@yuhs.ac [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Kim, Kyung Ah; Jeong, Hyeon Tae [Department of Radiology and Research Institute of Radiological Science, Yonsei University Severance Hospital, Seoul (Korea, Republic of); Park, Young Nyun [Department of Pathology, Yonsei University Severance Hospital, Seoul (Korea, Republic of)

    2012-12-15

    Purpose: To retrospectively determine whether the hyperintense hepatocellular carcinomas (HCCs) seen on the hepatobiliary phase of gadoxetic acid-enhanced MR imaging (EOB-MRI) might have different histologic characteristics from usual hypointense HCCs. Materials and methods: Two hundred three surgically proven HCCs from 192 patients who underwent preoperative EOB-MRI were analyzed. The demographic and histologic characteristics of hyperintense HCCs were compared with usual hypointense HCCs by using the t-test or Fisher's exact test. Results: By visual assessment, 18 (8.8%) tumors were classified as hyperintense HCCs. Patients with hyperintense HCC were significantly (p < 0.05) older (60.1 vs. 55.2 years) than those with hypointense HCCs. Hyperintense HCCs showed significantly lower rate of microvascular invasion (27.8% vs. 53.5%) and significantly higher rate of peliosis (61.1% vs. 30.8%). Hyperintense HCCs were more frequently expanding type, and none showed infiltrative type or scirrhous histologic pattern. Conclusions: Hyperintense HCCs seem to have clinical and histologic features that might be related with more favorable outcomes.

  3. Sequence-specific inhibition of duck hepatitis B virus reverse transcription by peptide nucleic acids (PNA)

    DEFF Research Database (Denmark)

    Robaczewska, Magdalena; Narayan, Ramamurthy; Seigneres, Beatrice;

    2005-01-01

    BACKGROUND/AIMS: Peptide nucleic acids (PNAs) appear as promising new antisense agents, that have not yet been examined as hepatitis B virus (HBV) inhibitors. Our aim was to study the ability of PNAs targeting the duck HBV (DHBV) encapsidation signal epsilon to inhibit reverse transcription (RT...

  4. Amino acid sequence of the cold-active alkaline phosphatase from Atlantic cod (Gadus morhua)

    DEFF Research Database (Denmark)

    Asgeirsson, Bjarni; Nielsen, Berit Noesgaard; Højrup, Peter

    2003-01-01

    -linked glycosylation sites were found. The glycan structure was determined as complex biantennary in type with fucose and sialic acid attached, although a trace of complex tri-antennary structure was also observed. A three-dimensional model was obtained by homology modelling using the human placental AP scaffold. Cod...

  5. Complete Genome Sequence of Moraxella osloensis Strain KMC41, a Producer of 4-Methyl-3-Hexenoic Acid, a Major Malodor Compound in Laundry.

    Science.gov (United States)

    Goto, Takatsugu; Hirakawa, Hideki; Morita, Yuji; Tomida, Junko; Sato, Jun; Matsumura, Yuta; Mitani, Asako; Niwano, Yu; Takeuchi, Kohei; Kubota, Hiromi; Kawamura, Yoshiaki

    2016-01-01

    We report the complete genome sequence of Moraxella osloensis strain KMC41, isolated from laundry with malodor. The KMC41 genome comprises a 2,445,556-bp chromosome and three plasmids. A fatty acid desaturase and at least four β-oxidation-related genes putatively associated with 4-methyl-3-hexenoic acid generation were detected in the KMC41 chromosome. PMID:27445387

  6. Identification of a novel HMW glutenin subunit and comparison of its amino acid sequence with those of homologous subunits

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Aegilops tauschii is the donor of the D genome of common wheat (Triticum aestivum). Genetic variation of HMW glutenin subunits encoded by the Glu-1Dt locus of Ae. tauschii has been found to be higher than that specified by the Glu-1D locus in common wheat. In the present note, we report the identification of a novel HMW glutenin subunit, Dy13t, from Ae. tauschii. The newly identified subunit possessed an electrophoretic mobility that was faster than that of the Dy12 subunit of common wheat. The complete ORF of encoding the Dy13t subunit contained 624 codons (excluding the stop codons). The amino acid sequence deduced from the Dy13t gene ORF was the shortest among those of the previously reported subunits derived by the D genome. A further comparison of Dy13t amino acid sequence with those of the subunits characterized from the A, B, D, R genomes of Triticeae showed that the smaller size of the Dy13t subunit was associated with a reduction in the size of its repetitive domain.

  7. Amino Acid sequence analysis of the two major outer Capsid Proteins (VP7 and VP4 from human-derived canine G3P[3] Rotavirus Strain Detected in Brazil

    Directory of Open Access Journals (Sweden)

    Adriana Luchs

    2013-12-01

    Full Text Available Introduction: A close look at the rotavirus group A (RVA genotypes in Brazil revealed the detection of a rare G3P[3] strain close related to canine strains. The aim of this study was to add to the already known genetic analysis by the description of the G3P[3] (IAL-R2638 strain amino acid characteristics. Methods: Amino acid sequence analysis and protein based trees were conducted using BioEdit and MEGA 4.0. Results: The VP7 and VP4 protein of the IAL-R2638 strain displayed the highest amino acid identity to the canine-derived human strain HCR3A (99.2%, and to the canine strain RV52/96 (96.4%, respectively. IAL-R2638 strain did not possess an extra VP7 N-linked glycosylation site at amino acid 238 recently described for some G3 strains, as well as RotaTeqTM G3 vaccine strain. The topology exhibited by phylogenetic trees in previous analysis were maintained in the present amino acid-based trees, reinforcing a stable relationship between G3P[3] strains. Conclusions: Amino acid analysis data were consistent with the previous sequence of data obtained for the IAL-R2638 strain, supporting its possible canine origin. Theoretically, RotaTeqTM vaccine could efficiently protect against G3P[3] infections based on the lack of the extra VP7 N-linked glycosylation site at amino acid 238. Phylogenetic analysis hypothesizes that all features undergo evolution independently of each other; however, unfavorable effects of nucleotide substitutions may be compensated by substitutions in other positions. The present study raises the question as to whether the amino acid-based trees could be applied as an approach to the study of RVA evolution, avoiding incorrect phylogenetic reconstructions.

  8. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    Directory of Open Access Journals (Sweden)

    Kathleen S. Rein

    2008-05-01

    Full Text Available Okadaic acid (OA and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum.

  9. Complete genome sequence of probiotic Bacillus coagulans HM-08: A potential lactic acid producer.

    Science.gov (United States)

    Yao, Guoqiang; Gao, Pengfei; Zhang, Wenyi

    2016-06-20

    Bacillus coagulans HM-08 is a commercialized probiotic strain in China. Its genome contains a 3.62Mb circular chromosome with an average GC content of 46.3%. In silico analysis revealed the presence of one xyl operon as well as several other genes that are correlated to xylose utilization. The genetic information provided here may help to expand its future biotechnology potential in lactic acid production. PMID:27130497

  10. Researches on Sequence of Plant Cystatin: Phytocystatin

    Institute of Scientific and Technical Information of China (English)

    QINQingfeng; HEWei; LIANGJun; ZHANGXingyao

    2005-01-01

    Plant cystatins or phytocystatins are cysteine proteinase inhibitors exist widely in different plant species. Because they can kill insects by inhibiting the digestive function of the cysteine proteinase in gut, they are believed to play an important role in plant's defense against pests. Phytocystatins contain the conserved QXVXG motif and show some features on their sequence different to animal cystatins.After sequencing the protein directly and the cDNA clone, a large number of plant cystatins have been characterized. A multialignment with BLAST software and a detail analysis of 38 phytocystatins show that phytocystatins possess a specific conserved amino acid sequence [LRVI]-[AGT]-[RQKE]-[FY]-[AS]-[VI]-X-[EGHDQV]-[HYFQ]-N different to the conserved sequence demonstrated by Margis in 1998. This conserved sequence can be enough to detect with exclusivity phytocystatin sequences on protein data banks. A classification of these phytocystatins is performed and they can be divided into 3 groups according to their features on amino acid sequence, and the group-I can be still divided into 3 subgroups based on the feature of their amino acid and genomic sequence. By the CLUSTALX software,the most conserved nucleotide sequences of phytocystatins were found, which could be used to design the degenerate premiers to search new phytocystatins with PCR reaction.

  11. A study of archaeal enzymes involved in polar lipid synthesis linking amino acid sequence information, genomic contexts and lipid composition

    Directory of Open Access Journals (Sweden)

    Hiromi Daiyasu

    2005-01-01

    Full Text Available Cellular membrane lipids, of which phospholipids are the major constituents, form one of the characteristic features that distinguish Archaea from other organisms. In this study, we focused on the steps in archaeal phospholipid synthetic pathways that generate polar lipids such as archaetidylserine, archaetidylglycerol, and archaetidylinositol. Only archaetidylserine synthase (ASS, from Methanothermobacter thermautotrophicus, has been experimentally identified. Other enzymes have not been fully examined. Through database searching, we detected many archaeal hypothetical proteins that show sequence similarity to members of the CDP alcohol phosphatidyltransferase family, such as phosphatidylserine synthase (PSS, phosphatidylglycerol synthase (PGS and phosphatidylinositol synthase (PIS derived from Bacteria and Eukarya. The archaeal hypothetical proteins were classified into two groups, based on the sequence similarity. Members of the first group, including ASS from M. thermautotrophicus, were closely related to PSS. The rough agreement between PSS homologue distribution within Archaea and the experimentally identified distribution of archaetidylserine suggested that the hypothetical proteins are ASSs. We found that an open reading frame (ORF tends to be adjacent to that of ASS in the genome, and that the order of the two ORFs is conserved. The sequence similarity of phosphatidylserine decarboxylase to the product of the ORF next to the ASS gene, together with the genomic context conservation, suggests that the ORF encodes archaetidylserine decarboxylase, which may transform archaetidylserine to archaetidylethanolamine. The second group of archaeal hypothetical proteins was related to PGS and PIS. The members of this group were subjected to molecular phylogenetic analysis, together with PGSs and PISs and it was found that they formed two distinct clusters in the molecular phylogenetic tree. The distribution of members of each cluster within Archaea

  12. Cloning of NruI and Sbo13I restriction and modification sstems in E. coli and amino acid sequence comparison of M.NruI and M.Sbo13I with other amino-methyltransferases

    Directory of Open Access Journals (Sweden)

    Benner Jack

    2010-05-01

    Full Text Available Abstract Background NruI and Sbo13I are restriction enzyme isoschizomers with the same recognition sequence 5' TCG↓CGA 3' (cleavage as indicated↓. Here we report the cloning of NruI and Sbo13I restriction-modification (R-M systems in E. coli. The NruI restriction endonuclease gene (nruIR was cloned by PCR and inverse PCR using primers designed from the N-terminal amino acid sequence. The NruI methylase gene (nruIM was derived by inverse PCR walking. Results The amino acid sequences of NruI endonuclease and methylase are very similar to the Sbo13I R-M system which has been cloned and expressed in E. coli by phage selection of a plasmid DNA library. Dot blot analysis using rabbit polyclonal antibodies to N6mA- or N4mC-modified DNA indicated that M.NruI is possibly a N6mA-type amino-methyltransferase that most likely modifies the external A in the 5' TCGCGA 3' sequence. M.Sbo13I, however, is implicated as a probable N4mC-type methylase since plasmid carrying sbo13IM gene is not restricted by Mrr endonuclease and Sbo13I digestion is not blocked by Dam methylation of the overlapping site. The amino acid sequence of M.NruI and M.Sbo13I did not show significant sequence similarity to many known amino-methyltransferases in the α, β, and γ groups, except to a few putative methylases in sequenced microbial genomes. Conclusions The order of the conserved amino acid motifs (blocks in M.NruI/M.Sbo13I is similar to the γ. group amino-methyltranferases, but with two distinct features: In motif IV, the sequence is DPPY instead of NPPY; there are two additional conserved motifs, IVa and Xa as extension of motifs IV and X, in this family of enzymes. We propose that M.NruI and M.Sbo13I form a subgroup in the γ group of amino-methyltransferases.

  13. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    Science.gov (United States)

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  14. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    Science.gov (United States)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids and evolutionary trees will be derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins; ferredoxin and cytochrome c.

  15. Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

    Directory of Open Access Journals (Sweden)

    Zijian Li

    2014-02-01

    Full Text Available Amino acid substitutions in influenza A virus are the main reasons for both antigenic shift and virulence change, which result from non-synonymous mutations in the viral genome. Nucleocapsid protein (NP, one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digestion of nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1, which was isolated and purified by SDS poly-acrylamide gel electrophoresis. Thus, LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues R452K, T423A and N430T in two tryptic peptides. The obtained results provided experimental evidence that amino acid substitutions resulted from non-synonymous gene mutations could be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.

  16. Non-Esterified Fatty Acids Profiling in Rheumatoid Arthritis: Associations with Clinical Features and Th1 Response

    Science.gov (United States)

    Rodríguez-Carrio, Javier; Alperi-López, Mercedes; López, Patricia; Ballina-García, Francisco Javier; Suárez, Ana

    2016-01-01

    Objectives Since lipid compounds are known to modulate the function of CD4+ T-cells and macrophages, we hypothesize that altered levels of serum non-esterified fatty acids (NEFA) may underlie rheumatoid arthritis (RA) pathogenesis. Methods Serum levels of NEFA (palmitic, stearic, palmitoleic, oleic, linoleic, γ-linoleic, arachidonic –AA–, linolenic, eicosapentaenoic –EPA– and docosahexaenoic –DHA–) were quantified by LC-MS/MS after methyl-tert-butylether (MTBE)-extraction in 124 RA patients and 56 healthy controls (HC). CD4+ phenotype was studied by flow cytometry. TNFα, IL-8, VEGF, GM-CSF, IFNγ, IL-17, CCL2, CXCL10, leptin and resistin serum levels were quantified by immunoassays. The effect of FA on IFNγ production by PBMC was evaluated in vitro. Results Lower levels of palmitic (p<0.0001), palmitoleic (p = 0.002), oleic (p = 0.010), arachidonic (p = 0.027), EPA (p<0.0001) and DHA (p<0.0001) were found in RA patients, some NEFA being altered at onset. Cluster analysis identified a NEFA profile (hallmarked by increased stearic and decreased EPA and DHA) overrepresented in RA patients compared to HC (p = 0.002), being associated with clinical features (RF, shared epitope and erosions), increased IFNγ expression in CD4+ T-cells (p = 0.002) and a Th1-enriched serum milieu (IFNγ, CCL2 and CXCL10, all p<0.005). In vitro assays demonstrated that imbalanced FA could underlie IFNγ production by CD4+ T-cells. Finally, changes on NEFA levels were associated with clinical response upon TNFα-blockade. Conclusion An altered NEFA profile can be found in RA patients associated with clinical characteristics of aggressive disease and enhanced Th1 response. These results support the relevance of lipidomic studies in RA and provide a rationale for new therapeutic targets. PMID:27487156

  17. Evolution of early life inferred from protein and ribonucleic acid sequences

    Science.gov (United States)

    Dayhoff, M. O.; Schwartz, R. M.

    1978-01-01

    The chemical structures of ferredoxin, 5S ribosomal RNA, and c-type cytochrome sequences have been employed to construct a phylogenetic tree which connects all major photosynthesizing organisms: the three types of bacteria, blue-green algae, and chloroplasts. Anaerobic and aerobic bacteria, eukaryotic cytoplasmic components and mitochondria are also included in the phylogenetic tree. Anaerobic nonphotosynthesizing bacteria similar to Clostridium were the earliest organisms, arising more than 3.2 billion years ago. Bacterial photosynthesis evolved nearly 3.0 billion years ago, while oxygen-evolving photosynthesis, originating in the blue-green algal line, came into being about 2.0 billion years ago. The phylogenetic tree supports the symbiotic theory of the origin of eukaryotes.

  18. Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

    Science.gov (United States)

    Freimuth, Paul I.

    2010-04-06

    The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.

  19. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

    Directory of Open Access Journals (Sweden)

    Leitner Dietmar

    2005-04-01

    Full Text Available Abstract Background A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.

  20. Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

    Directory of Open Access Journals (Sweden)

    Nagib eAhsan

    2012-07-01

    Full Text Available The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.

  1. Amino Acid Sequence and Structural Comparison of BACE1 and BACE2 Using Evolutionary Trace Method

    Directory of Open Access Journals (Sweden)

    Hoda Mirsafian

    2014-01-01

    Full Text Available Beta-amyloid precursor protein cleavage enzyme 1 (BACE1 and beta-amyloid precursor protein cleavage enzyme 2 (BACE2, members of aspartyl protease family, are close homologues and have high similarity in their protein crystal structures. However, their enzymatic properties differ leading to disparate clinical consequences. In order to identify the residues that are responsible for such differences, we used evolutionary trace (ET method to compare the amino acid conservation patterns of BACE1 and BACE2 in several mammalian species. We found that, in BACE1 and BACE2 structures, most of the ligand binding sites are conserved which indicate their enzymatic property of aspartyl protease family members. The other conserved residues are more or less randomly localized in other parts of the structures. Four group-specific residues were identified at the ligand binding site of BACE1 and BACE2. We postulated that these residues would be essential for selectivity of BACE1 and BACE2 biological functions and could be sites of interest for the design of selective inhibitors targeting either BACE1 or BACE2.

  2. Thimet oligopeptidase: similarity to 'soluble angiotensin II-binding protein' and some corrections to the published amino acid sequence of the rat testis enzyme.

    Science.gov (United States)

    McKie, N; Dando, P M; Rawlings, N D; Barrett, A J

    1993-01-01

    The deduced amino acid sequence of pig liver soluble angiotensin II-binding protein [Sugiura, Hagiwara and Hirose (1992) J. Biol. Chem. 267, 18067-18072] is similar over most of its length to that reported for rat testis thimet oligopeptidase (EC 3.4.24.15) by Pierotti, Dong, Glucksman, Orlowski and Roberts [(1990) (Biochemistry 29, 10323-10329]. We have found that homogeneous rat testis thimet oligopeptidase binds angiotensin II with the same distinctive characteristics as the pig liver protein. Analysis of the nucleotide sequences reported for the two proteins pointed to the likelihood that sequencing errors had caused two segments of the amino acid sequence of the rat protein to be translated out of frame, and re-sequencing of selected parts of the clone (kindly provided by the previous authors) confirmed this. The revised deduced amino acid sequence of rat thimet oligopeptidase contains 687 residues, representing a protein of 78,308 Da, and is more closely related to those of the pig liver protein and other known homologues of thimet oligopeptidase than that described previously. Images Figure 1 PMID:8216239

  3. Generation of deviation parameters for amino acid singlets, doublets and triplets from three-dimensional structures of proteins and its implications for secondary structure prediction from amino acid sequences

    Indian Academy of Sciences (India)

    S A Mugilan; K Veluraja

    2000-03-01

    We present a new method, secondary structure prediction by deviation parameter (SSPDP) for predicting the secondary structure of proteins from amino acid sequence. Deviation parameters (DP) for amino acid singlets, doublets and triplets were computed with respect to secondary structural elements of proteins based on the dictionary of secondary structure prediction (DSSP)-generated secondary structure for 408 selected non-homologous proteins. To the amino acid triplets which are not found in the selected dataset, a DP value of zero is assigned with respect to the secondary structural elements of proteins. The total number of parameters generated is 15,432, in the possible parameters of 25,260. Deviation parameter is complete with respect to amino acid singlets, doublets, and partially complete with respect to amino acid triplets. These generated parameters were used to predict secondary structural elements from amino acid sequence. The secondary structure predicted by our method (SSPDP) was compared with that of single sequence (NNPREDICT) and multiple sequence (PHD) methods. The average value of the percentage of prediction accuracy for α-helix by SSPDP, NNPREDICT and PHD methods was found to be 57%, 44% and 69% respectively for the proteins in the selected dataset. For -strand the prediction accuracy is found to be 69%, 21% and 53% respectively by SSPDP, NNPREDICT and PHD methods. This clearly indicates that the secondary structure prediction by our method is as good as PHD method but much better than NNPREDICT method.

  4. Prediction of Golgi-resident protein types using general form of Chou's pseudo-amino acid compositions: Approaches with minimal redundancy maximal relevance feature selection.

    Science.gov (United States)

    Jiao, Ya-Sen; Du, Pu-Feng

    2016-08-01

    Recently, several efforts have been made in predicting Golgi-resident proteins. However, it is still a challenging task to identify the type of a Golgi-resident protein. Precise prediction of the type of a Golgi-resident protein plays a key role in understanding its molecular functions in various biological processes. In this paper, we proposed to use a mutual information based feature selection scheme with the general form Chou's pseudo-amino acid compositions to predict the Golgi-resident protein types. The positional specific physicochemical properties were applied in the Chou's pseudo-amino acid compositions. We achieved 91.24% prediction accuracy in a jackknife test with 49 selected features. It has the best performance among all the present predictors. This result indicates that our computational model can be useful in identifying Golgi-resident protein types. PMID:27155042

  5. Complete Genome Sequence of the d-Amino Acid Catabolism Bacterium Phaeobacter sp. Strain JL2886, Isolated from Deep Seawater of the South China Sea.

    Science.gov (United States)

    Fu, Yingnan; Wang, Rui; Zhang, Zilian; Jiao, Nianzhi

    2016-01-01

    Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. PMID:27587825

  6. A root chicory MADS box sequence and the Arabidopsis flowering repressor FLC share common features that suggest conserved function in vernalization and de-vernalization responses.

    Science.gov (United States)

    Périlleux, Claire; Pieltain, Alexandra; Jacquemin, Guillaume; Bouché, Frédéric; Detry, Nathalie; D'Aloia, Maria; Thiry, Laura; Aljochim, Pierre; Delansnay, Martin; Mathieu, Anne-Sophie; Lutts, Stanley; Tocquin, Pierre

    2013-08-01

    Root chicory (Cichorium intybus var. sativum) is a biennial crop, but is harvested to obtain root inulin at the end of the first growing season before flowering. However, cold temperatures may vernalize seeds or plantlets, leading to incidental early flowering, and hence understanding the molecular basis of vernalization is important. A MADS box sequence was isolated by RT-PCR and named FLC-LIKE1 (CiFL1) because of its phylogenetic positioning within the same clade as the floral repressor Arabidopsis FLOWERING LOCUS C (AtFLC). Moreover, over-expression of CiFL1 in Arabidopsis caused late flowering and prevented up-regulation of the AtFLC target FLOWERING LOCUS T by photoperiod, suggesting functional conservation between root chicory and Arabidopsis. Like AtFLC in Arabidopsis, CiFL1 was repressed during vernalization of seeds or plantlets of chicory, but repression of CiFL1 was unstable when the post-vernalization temperature was favorable to flowering and when it de-vernalized the plants. This instability of CiFL1 repression may be linked to the bienniality of root chicory compared with the annual lifecycle of Arabidopsis. However, re-activation of AtFLC was also observed in Arabidopsis when a high temperature treatment was used straight after seed vernalization, eliminating the promotive effect of cold on flowering. Cold-induced down-regulation of a MADS box floral repressor and its re-activation by high temperature thus appear to be conserved features of the vernalization and de-vernalization responses in distant species. PMID:23581257

  7. Origin of the 900 cm{sup −1} broad double-hump OH vibrational feature of strongly hydrogen-bonded carboxylic acids

    Energy Technology Data Exchange (ETDEWEB)

    Van Hoozen, Brian L.; Petersen, Poul B. [Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853 (United States)

    2015-03-14

    Medium and strong hydrogen bonds are common in biological systems. Here, they provide structural support and can act as proton transfer relays to drive electron and/or energy transfer. Infrared spectroscopy is a sensitive probe of molecular structure and hydrogen bond strength but strongly hydrogen-bonded structures often exhibit very broad and complex vibrational bands. As an example, strong hydrogen bonds between carboxylic acids and nitrogen-containing aromatic bases commonly display a 900 cm{sup −1} broad feature with a remarkable double-hump structure. Although previous studies have assigned this feature to the OH, the exact origin of the shape and width of this unusual feature is not well understood. In this study, we present ab initio calculations of the contributions of the OH stretch and bend vibrational modes to the vibrational spectrum of strongly hydrogen-bonded heterodimers of carboxylic acids and nitrogen-containing aromatic bases, taking the 7-azaindole—acetic acid and pyridine—acetic acid dimers as examples. Our calculations take into account coupling between the OH stretch and bend modes as well as how both of these modes are affected by lower frequency dimer stretch modes, which modulate the distance between the monomers. Our calculations reproduce the broadness and the double-hump structure of the OH vibrational feature. Where the spectral broadness is primarily caused by the dimer stretch modes strongly modulating the frequency of the OH stretch mode, the double-hump structure results from a Fermi resonance between the out of the plane OH bend and the OH stretch modes.

  8. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach.

    Science.gov (United States)

    Adamala, Katarzyna; Anella, Fabrizio; Wieczorek, Rafal; Stano, Pasquale; Chiarabelli, Cristiano; Luisi, Pier Luigi

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the chemical viewpoint, macromolecules are formed by chemical pathways leading to the condensation of building blocks (amino acids, or nucleotides) in long-chain copolymers (proteins and nucleic acids, respectively). The second difficulty deals with a conceptual problem, namely with the emergence of specific sequences among a vast array of possible ones, the huge "sequence space", leading to the question "why these macromolecules, and not the others?" We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides, like Ser-His, with respect to peptide- and nucleotides-condensation, as a realistic model of primitive organocatalysis. We have also set up a strategy for exploring the sequence space of random proteins and RNAs (the so-called "never born biopolymer" project) with respect to the production of folded structures. Being still far from solved, the main aspects of these "open questions" are discussed here, by commenting on recent results obtained in our groups and by providing a unifying view on the problem and possible solutions. In particular, we propose a general scenario for macromolecule formation via fragment-condensation, as a scheme for the emergence of specific sequences based on molecular growth and selection.

  9. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly- -D-Glutamic Acid Anthrax Capsule

    KAUST Repository

    Stabler, R. A.

    2013-01-24

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  10. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Science.gov (United States)

    2010-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... PRACTICE IN PATENT CASES Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet...

  11. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    NARCIS (Netherlands)

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which mi

  12. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

    NARCIS (Netherlands)

    Adamala, K.; Anella, F.M.; Wieczorek, R.; Stano, P.; Chiarabelli, C.; Luisi, P.L.

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the c

  13. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

    DEFF Research Database (Denmark)

    Adamala, K.; Anella, F.; Wieczorek, R.;

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From...

  14. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    OpenAIRE

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which might shed light onto the enzymatic machineries that are involved in its production of biogenic amines.

  15. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule.

    Science.gov (United States)

    Stabler, Richard A; Negus, David; Pain, Arnab; Taylor, Peter W

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  16. Clavulanic acid production estimation based on color and structural features of Streptomyces clavuligerus bacteria using self-organizing map and genetic algorithm.

    Science.gov (United States)

    Nurmohamadi, Maryam; Pourghassem, Hossein

    2014-05-01

    The utilization of antibiotics produced by Clavulanic acid (CA) is an increasing need in medicine and industry. Usually, the CA is created from the fermentation of Streptomycen Clavuligerus (SC) bacteria. Analysis of visual and morphological features of SC bacteria is an appropriate measure to estimate the growth of CA. In this paper, an automatic and fast CA production level estimation algorithm based on visual and structural features of SC bacteria instead of statistical methods and experimental evaluation by microbiologist is proposed. In this algorithm, structural features such as the number of newborn branches, thickness of hyphal and bacterial density and also color features such as acceptance color levels are extracted from the SC bacteria. Moreover, PH and biomass of the medium provided by microbiologists are considered as specified features. The level of CA production is estimated by using a new application of Self-Organizing Map (SOM), and a hybrid model of genetic algorithm with back propagation network (GA-BPN). The proposed algorithm is evaluated on four carbonic resources including malt, starch, wheat flour and glycerol that had used as different mediums of bacterial growth. Then, the obtained results are compared and evaluated with observation of specialist. Finally, the Relative Error (RE) for the SOM and GA-BPN are achieved 14.97% and 16.63%, respectively.

  17. Identities among actin-encoding cDNAs of the Nile tilapia (Oreochromis niloticus and other eukaryote species revealed by nucleotide and amino acid sequence analyses

    Directory of Open Access Journals (Sweden)

    Andréia B. Poletto

    2008-01-01

    Full Text Available Actin-encoding cDNAs of Nile tilapia (Oreochromis niloticus were isolated by RT-PCR using total RNA samples of different tissues and further characterized by nucleotide sequencing and in silico amino acid (aa sequence analysis. Comparisons among the actin gene sequences of O. niloticus and those of other species evidenced that the isolated genes present a high similarity to other fish and other vertebrate actin genes. The highest nucleotide resemblance was observed between O. niloticus and O. mossambicus a-actin and b-actin genes. Analysis of the predicted aa sequences revealed two distinct types of cytoplasmic actins, one cardiac muscle actin type and one skeletal muscle actin type that were expressed in different tissues of Nile tilapia. The evolutionary relationships between the Nile tilapia actin genes and diverse other organisms is discussed.

  18. Use of sourdough made with quinoa (Chenopodium quinoa) flour and autochthonous selected lactic acid bacteria for enhancing the nutritional, textural and sensory features of white bread.

    Science.gov (United States)

    Rizzello, Carlo Giuseppe; Lorusso, Anna; Montemurro, Marco; Gobbetti, Marco

    2016-06-01

    Lactic acid bacteria were isolated and identified from quinoa flour, spontaneously fermented quinoa dough, and type I quinoa sourdough. Strains were further selected based on acidification and proteolytic activities. Selected Lactobacillus plantarum T6B10 and Lactobacillus rossiae T0A16 were used as mixed starter to get quinoa sourdough. Compared to non-fermented flour, organic acids, free amino acids, soluble fibers, total phenols, phytase and antioxidant activities, and in vitro protein digestibility markedly increased during fermentation. A wheat bread was made using 20% (w/w) of quinoa sourdough, and compared to baker's yeast wheat breads manufactured with or without quinoa flour. The use of quinoa sourdough improved the chemical, textural, and sensory features of wheat bread, showing better performances compared to the use of quinoa flour. Protein digestibility and quality, and the rate of starch hydrolysis were also nutritional features that markedly improved using quinoa sourdough as an ingredient. This study exploited the potential of quinoa flour through sourdough fermentation. A number of advantages encouraged the manufacture of novel and healthy leavened baked goods.

  19. Use of sourdough made with quinoa (Chenopodium quinoa) flour and autochthonous selected lactic acid bacteria for enhancing the nutritional, textural and sensory features of white bread.

    Science.gov (United States)

    Rizzello, Carlo Giuseppe; Lorusso, Anna; Montemurro, Marco; Gobbetti, Marco

    2016-06-01

    Lactic acid bacteria were isolated and identified from quinoa flour, spontaneously fermented quinoa dough, and type I quinoa sourdough. Strains were further selected based on acidification and proteolytic activities. Selected Lactobacillus plantarum T6B10 and Lactobacillus rossiae T0A16 were used as mixed starter to get quinoa sourdough. Compared to non-fermented flour, organic acids, free amino acids, soluble fibers, total phenols, phytase and antioxidant activities, and in vitro protein digestibility markedly increased during fermentation. A wheat bread was made using 20% (w/w) of quinoa sourdough, and compared to baker's yeast wheat breads manufactured with or without quinoa flour. The use of quinoa sourdough improved the chemical, textural, and sensory features of wheat bread, showing better performances compared to the use of quinoa flour. Protein digestibility and quality, and the rate of starch hydrolysis were also nutritional features that markedly improved using quinoa sourdough as an ingredient. This study exploited the potential of quinoa flour through sourdough fermentation. A number of advantages encouraged the manufacture of novel and healthy leavened baked goods. PMID:26919812

  20. Iris recognition algorithm based on extracting stable feature points of sequence images%基于序列图像提取稳定特征点的虹膜识别算法

    Institute of Scientific and Technical Information of China (English)

    苑玮琦; 张开营; 杨冉冉; 刘博

    2011-01-01

    为了尽可能降低不稳定特征点对识别率的影响,研究提出了基于序列图像提取稳定特征点的虹膜识别算法.该算法首先用二维Gabor滤波器对序列虹膜图像提取特征编码,然后对该序列特征编码求交集以提取稳定特征点,并利用这些稳定特征点建立虹膜的特征模板库,最后通过计算相似度,获得识别结果.在实验室采集的序列虹膜图库上,当等错率为0.3017%,分类阈值为0.6402时,正确识别率可以达到99.73%.实验证明该算法是有效、可行的,并更好地提高了虹膜的分类精度和改善了虹膜的识别性能.%In order to reduce the influence of unstable feature points on iris recognition, an iris recognition algorithm based on extracting stable feature points of sequence images is proposed. Firstly, this algorithm extracts feature codes from sequence iris images with two-dimensional Gabor filters. Secondly, the stable feature points can be extracted by calculating the intersection of the feature codes of the sequence images. At the same time, iris feature templates can be built based on these stable feature points; and the iris feature templates can be used for iris classification. At last, the identification results can be obtained with pattern recognition. When the equal error rate is 0.3017% and the distance threshold equals to 0.6402 , the identification accuracy is 99.73% for the iris database acquired in laboratory by oneself. Experimental results demonstrate that the iris recognition algorithm based on extracting stable feature points of sequence images is effective and feasible, and the proposed algorithm can achieve high classification accuracy and improve iris recognition performance effectively.

  1. Genome sequence of Candidatus Nitrososphaera evergladensis from group I.1b enriched from Everglades soil reveals novel genomic features of the ammonia-oxidizing archaea.

    Directory of Open Access Journals (Sweden)

    Kateryna V Zhalnina

    Full Text Available The activity of ammonia-oxidizing archaea (AOA leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group.

  2. Evolution of alpha-lactalbumins. The complete amino acid sequence of the alpha-lactalbumin from a marsupial (Macropus rufogriseus) and corrections to regions of sequence in bovine and goat alpha-lactalbumins.

    Science.gov (United States)

    Shewale, J G; Sinha, S K; Brew, K

    1984-04-25

    alpha-Lactalbumin was purified from a whey protein fraction of the milk of the red-necked wallaby (Macropus rufogriseus). The complete amino acid sequence was determined from the results of automatic sequenator analyses of the intact protein, the three cyanogen bromide fragments, and of peptides generated from the larger, COOH-terminal CNBr fragment by digestion with trypsin or staphylococcal protease. This is the first sequence to be determined of an alpha-lactalbumin from a marsupial and differs from known eutherian alpha-lactalbumins in size and locations of deletions in alignments with the homologous type c lysozymes, as well as in having amino acid substitutions at 8 sites that are invariant in known eutherian proteins. Some corrections are also reported for two regions of sequence in both bovine and goat alpha-lactalbumins. The new and previously published information on alpha-lactalbumin sequences is analyzed in relation to the evolutionary history of the alpha-lactalbumin line as well as the relationship of structure to function in these proteins. PMID:6715332

  3. Sequence-specific nucleic acid mobility using a reversible block copolymer gel matrix and DNA amphiphiles (lipid-DNA) in capillary and microfluidic electrophoretic separations.

    Science.gov (United States)

    Wagler, Patrick; Minero, Gabriel Antonio S; Tangen, Uwe; de Vries, Jan Willem; Prusty, Deepak; Kwak, Minseok; Herrmann, Andreas; McCaskill, John S

    2015-10-01

    Reversible noncovalent but sequence-dependent attachment of DNA to gels is shown to allow programmable mobility processing of DNA populations. The covalent attachment of DNA oligomers to polyacrylamide gels using acrydite-modified oligonucleotides has enabled sequence-specific mobility assays for DNA in gel electrophoresis: sequences binding to the immobilized DNA are delayed in their migration. Such a system has been used for example to construct complex DNA filters facilitating DNA computations. However, these gels are formed irreversibly and the choice of immobilized sequences is made once off during fabrication. In this work, we demonstrate the reversible self-assembly of gels combined with amphiphilic DNA molecules, which exhibit hydrophobic hydrocarbon chains attached to the nucleobase. This amphiphilic DNA, which we term lipid-DNA, is synthesized in advance and is blended into a block copolymer gel to induce sequence-dependent DNA retention during electrophoresis. Furthermore, we demonstrate and characterize the programmable mobility shift of matching DNA in such reversible gels both in thin films and microchannels using microelectrode arrays. Such sequence selective separation may be employed to select nucleic acid sequences of similar length from a mixture via local electronics, a basic functionality that can be employed in novel electronic chemical cell designs and other DNA information-processing systems. PMID:26095642

  4. Structural features of dilute acid, steam exploded, and alkali pretreated mustard stalk and their impact on enzymatic hydrolysis.

    Science.gov (United States)

    Kapoor, Manali; Raj, Tirath; Vijayaraj, M; Chopra, Anju; Gupta, Ravi P; Tuli, Deepak K; Kumar, Ravindra

    2015-06-25

    To overcome the recalcitrant nature of biomass several pretreatment methodologies have been explored to make it amenable to enzymatic hydrolysis. These methodologies alter cell wall structure primarily by removing/altering hemicelluloses and lignin. In this work, alkali, dilute acid, steam explosion pretreatment are systematically studied for mustard stalk. To assess the structural variability after pretreatment, chemical analysis, surface area, crystallinity index, accessibility of cellulose, FT-IR and thermal analysis are conducted. Although the extent of enzymatic hydrolysis varies upon the methodologies used, nevertheless, cellulose conversion increases from pretreatment. Glucose yield at 2 and 72h are well correlated with surface area and maximum adsorption capacity. However, no such relationship is observed for xylose yield. Mass balance of the process is also studied. Dilute acid pretreatment is the best methodology in terms of maximum sugar yield at lower enzyme loading.

  5. Compositional and functional features of humic acid-like fractions from vermicomposting of sewage sludge and cow dung

    International Nuclear Information System (INIS)

    The chemical changes occurring in five different substrates of sewage sludge spiked with different proportions of cow dung after vermicomposting with Eisenia foetida for 90 days were investigated. Their humic acid-like (HAL) fractions were isolated to determine the elemental and functional composition, and structural and functional characteristics using ultraviolet/visible, Fourier transform infrared (FT-IR) and fluorescence spectroscopies and scanning electron microscopy. After vermicomposting, the total organic C and C/N ratio decreased, and the total extractable C and humic acid (HA) C increased in all substrates. In the HAL fractions, the C and H contents, C/N and C/O and aliphatic structures, proteinaceous components and carbohydrates decreased, while the O and N and acidic functional group contents and C/H ratio, aromaticity and polycondensation structures increased. Further, the results suggest that the addition of cow dung to sewage sludge could improve the quality of organic matter humification of the substrates. The structures of HAL fractions in vermicomposts resembled those typical of soil HA, especially the vermicompost of cow dung alone. Scanning electron microscopy showed the microstructure of HAL fraction in final product became close-grained and lumpy. Overall results indicate that vermicomposting was an efficient technology for promoting organic matter (OM) humification in sewage sludge and cow dung alone, as well as in mixtures of both materials, improving their quality and environmental safety as a soil OM resource for utilization as soil amendments.

  6. Compositional and functional features of humic acid-like fractions from vermicomposting of sewage sludge and cow dung

    Energy Technology Data Exchange (ETDEWEB)

    Li Xiaowei [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China); Xing Meiyan, E-mail: xmy5000@163.com [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China); Yang Jian; Huang Zhidong [State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092 (China)

    2011-01-30

    The chemical changes occurring in five different substrates of sewage sludge spiked with different proportions of cow dung after vermicomposting with Eisenia foetida for 90 days were investigated. Their humic acid-like (HAL) fractions were isolated to determine the elemental and functional composition, and structural and functional characteristics using ultraviolet/visible, Fourier transform infrared (FT-IR) and fluorescence spectroscopies and scanning electron microscopy. After vermicomposting, the total organic C and C/N ratio decreased, and the total extractable C and humic acid (HA) C increased in all substrates. In the HAL fractions, the C and H contents, C/N and C/O and aliphatic structures, proteinaceous components and carbohydrates decreased, while the O and N and acidic functional group contents and C/H ratio, aromaticity and polycondensation structures increased. Further, the results suggest that the addition of cow dung to sewage sludge could improve the quality of organic matter humification of the substrates. The structures of HAL fractions in vermicomposts resembled those typical of soil HA, especially the vermicompost of cow dung alone. Scanning electron microscopy showed the microstructure of HAL fraction in final product became close-grained and lumpy. Overall results indicate that vermicomposting was an efficient technology for promoting organic matter (OM) humification in sewage sludge and cow dung alone, as well as in mixtures of both materials, improving their quality and environmental safety as a soil OM resource for utilization as soil amendments.

  7. Identification of multiple lipid genes with modifications in expression and sequence associated with the evolution of hydroxy fatty acid accumulation in Physaria fendleri.

    Science.gov (United States)

    Horn, Patrick J; Liu, Jinjie; Cocuron, Jean-Christophe; McGlew, Kathleen; Thrower, Nicholas A; Larson, Matt; Lu, Chaofu; Alonso, Ana P; Ohlrogge, John

    2016-05-01

    Two Brassicaceae species, Physaria fendleri and Camelina sativa, are genetically very closely related to each other and to Arabidopsis thaliana. Physaria fendleri seeds contain over 50% hydroxy fatty acids (HFAs), while Camelina sativa and Arabidopsis do not accumulate HFAs. To better understand how plants evolved new biochemical pathways with the capacity to accumulate high levels of unusual fatty acids, transcript expression and protein sequences of developing seeds of Physaria fendleri, wild-type Camelina sativa, and Camelina sativa expressing a castor bean (Ricinus communis) hydroxylase were analyzed. A number of potential evolutionary adaptations within lipid metabolism that probably enhance HFA production and accumulation in Physaria fendleri, and, in their absence, limit accumulation in transgenic tissues were revealed. These adaptations occurred in at least 20 genes within several lipid pathways from the onset of fatty acid synthesis and its regulation to the assembly of triacylglycerols. Lipid genes of Physaria fendleri appear to have co-evolved through modulation of transcriptional abundances and alterations within protein sequences. Only a handful of genes showed evidence for sequence adaptation through gene duplication. Collectively, these evolutionary changes probably occurred to minimize deleterious effects of high HFA amounts and/or to enhance accumulation for physiological advantage. These results shed light on the evolution of pathways for novel fatty acid production in seeds, help explain some of the current limitations to accumulation of HFAs in transgenic plants, and may provide improved strategies for future engineering of their production.

  8. Some features of the effect the pH value and the physicochemical properties of boric acid have on mass transfer in a VVER reactor's core

    Science.gov (United States)

    Gavrilov, A. V.; Kritskii, V. G.; Rodionov, Yu. A.; Berezina, I. G.

    2013-07-01

    Certain features of the effect of boric acid in the reactor coolant of nuclear power installations equipped with a VVER-440 reactor on mass transfer in the reactor core are considered. It is determined that formation of boric acid polyborate complexes begins under field conditions at a temperature of 300°C when the boric acid concentration is equal to around 0.065 mol/L (4 g/L). Operations for decontaminating the reactor coolant system entail a growth of corrosion product concentration in the coolant, which gives rise to formation of iron borates in the zones where subcooled boiling of coolant takes place and to the effect of axial offset anomalies. A model for simulating variation of pressure drop in a VVER-440 reactor's core that has invariable parameters during the entire fuel campaign is developed by additionally taking into account the concentrations of boric acid polyborate complexes and the quantity of corrosion products (Fe, Ni) represented by the ratio of their solubilities.

  9. Myeloid neoplasm demonstrating a STAT5B-RARA rearrangement and genetic alterations associated with all-trans retinoic acid resistance identified by a custom next-generation sequencing assay.

    Science.gov (United States)

    Kluk, Michael J; Abo, Ryan P; Brown, Ronald D; Kuo, Frank C; Dal Cin, Paola; Pozdnyakova, Olga; Morgan, Elizabeth A; Lindeman, Neal I; DeAngelo, Daniel J; Aster, Jon C

    2015-10-01

    We describe the case of a patient presenting with several weeks of symptoms related to pancytopenia associated with a maturation arrest at the late promyelocyte/early myelocyte stage of granulocyte differentiation. A diagnosis of acute promyelocytic leukemia was considered, but the morphologic features were atypical for this entity and conventional tests for the presence of a PML-RARA fusion gene were negative. Additional analysis using a custom next-generation sequencing assay revealed a rearrangement producing a STAT5B-RARA fusion gene, which was confirmed by reverse transcription polymerase chain reaction (RT-PCR) and supplementary cytogenetic studies, allowing the diagnosis of a morphologically atypical form of acute promyelocytic leukemia to be made. Analysis of the sequencing data permitted characterization of both chromosomal breakpoints and revealed two additional alterations, a small deletion in RARA exon 9 and a RARA R276W substitution, that have been linked to resistance to all-trans retinoic acid. This case highlights how next-generation sequencing can augment currently standard testing to establish diagnoses in difficult cases, and in doing so help guide selection of therapy. PMID:27148563

  10. Complete genome sequence of the actinobacterium Amycolatopsis japonica MG417-CF17(T) (=DSM 44213T) producing (S,S)-N,N'-ethylenediaminedisuccinic acid.

    Science.gov (United States)

    Stegmann, Evi; Albersmeier, Andreas; Spohn, Marius; Gert, Helena; Weber, Tilmann; Wohlleben, Wolfgang; Kalinowski, Jörn; Rückert, Christian

    2014-11-10

    We report the complete genome sequence of Amycolatopsis japonica MG417-CF17(T) (=DSM 44213(T)) which was identified as the producer of (S,S)-N,N'-ethylenediaminedisuccinic acid during a screening for phospholipase C inhibitors. The genome of A. japonica MG417-CF17(T) consists of two replicons: the chromosome (8,961,318 bp, 68.89% G+C content) and the plasmid pAmyja1 (92,539 bp, 68.23% G+C content), encoding a total of 8422 protein coding genes. Analysis of the sequence data revealed 30 clusters encoding the biosynthesis of secondary metabolites. PMID:25193710

  11. Ruthenium Hydride/Brønsted Acid-Catalyzed Tandem Isomerization/N-Acyliminium Cyclization Sequence for the Synthesis of Tetrahydro-β-carbolines

    DEFF Research Database (Denmark)

    Hansen, Casper Lykke; Clausen, Janie Regitse Waël; Ohm, Ragnhild Gaard;

    2013-01-01

    This paper describes an efficient tandem sequence for the synthesis of 1,2,3,4-tetrahydro-β-carbolines (THBCs) relying on a ruthenium hydride/Brønsted acid- catalyzed isomerization of allylic amides to N-acyliminium ion intermediates which are trapped by a tethered indolenucleophile. The methodol...... the Suzuki cross-coupling reaction to the isomerization/N-acyliminium cyclization sequence. Finally, diastereo- and enantioselective versions of the title reaction have been examined using substrate control (with dr >15: 1) and asymmetric catalysis (ee up to 57%), respectively...

  12. Transcriptome-Wide Cleavage Site Mapping on Cellular mRNAs Reveals Features Underlying Sequence-Specific Cleavage by the Viral Ribonuclease SOX.

    Directory of Open Access Journals (Sweden)

    Marta Maria Gaglia

    2015-12-01

    Full Text Available Many viruses express factors that reduce host gene expression through widespread degradation of cellular mRNA. An example of this class of proteins is the mRNA-targeting endoribonuclease SOX from the gamma-herpesvirus Kaposi's sarcoma-associated herpesvirus (KSHV. Previous studies indicated that cleavage of messenger RNAs (mRNA by SOX occurs at specific locations defined by the sequence of the target RNA, which is at odds with the down-regulation of a large portion of cellular transcripts. In this study, we address this paradox by using high-throughput sequencing of cleavage intermediates combined with a custom bioinformatics-based analysis pipeline to identify SOX cleavage sites across the mRNA transcriptome. These data, coupled with targeted mutagenesis, reveal that while cleavage sites are specific and reproducible, they are defined by a degenerate sequence motif containing a small number of conserved residues rather than a strong consensus sequence. This degenerate element is well represented in both human and KSHV mRNA, and its presence correlates with RNA destabilization by SOX. This represents a new endonuclease targeting strategy, in which use of a degenerate targeting element enables RNA cleavage at specific locations without restricting the range of targets. Furthermore, it shows that strong target selectivity can be achieved without a high degree of sequence specificity.

  13. Geochemical features and sources of hydrocarbons and fatty acids in soils from the McMurdo Dry Valleys in the Antarctic

    Science.gov (United States)

    Matsumoto, Genki I.; Honda, Eisuke; Sonoda, Kazuhiko; Yamamoto, Shuichi; Takemura, Tetsuo

    2010-08-01

    We studied the geochemical features and compound-specific (CS)-δ 13C of hydrocarbons and fatty acids in soil samples from the McMurdo Dry Valleys in the Antarctic to elucidate their source organisms and characteristics of their environments. Total organic carbon contents in soil samples were extremely low reflecting extremely harsh environments for organisms. Normal-alkanes ranging in carbon chain length from n-C 14 to n-C 38 with the predominance of odd-carbon numbers were found, together with n-alkenes ( n-C 23:1 to n-C 27:1). Normal-alkanoic acids ranging in carbon chain length from n-C 10 to n-C 30 with the predominance of even-carbon numbers were detected in the samples, along with small amounts of branched ( iso and anteiso) and n-alkenoic acids. CS-δ 13C values of long-chain n-alkanes ( n-C 20 to n-C 29) ranged from -30.4 to -26.6‰. CS-δ 13C values of n-alkanoic acids with short-chain carbon numbers ( n-C 14 to n-C 19) ranging from -27.7 to -21.7‰ were much higher than those of long-chain carbon numbers ( n-C 20 to n-C 30, -32.5 to -25.3‰). The geochemical features and CS-δ 13C values of long-chain n-alkanes and n-alkanoic acids revealed that they are originated from lichen and/or vascular plant debris from the pre- and inter-glacial periods in this region, whereas short-chain n-alkanoic acids are come from microalgae and cyanobacterial debris. CS-δ 13C values suggest that they are derived from gymnosperms and/or C 4 plants in the cold and dry environments of the pre- and inter-glacial periods of the McMurdo Dry Valleys region.

  14. Dna Sequencing

    Science.gov (United States)

    Tabor, Stanley; Richardson, Charles C.

    1995-04-25

    A method for sequencing a strand of DNA, including the steps off: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions in favoring primer extension to form nucleic acid fragments complementory to the DNA to be sequenced; labelling the nucleic and fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.

  15. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    OpenAIRE

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which might shed light onto the enzymatic machineries that are involved in its production of biogenic amines.

  16. Data for amino acid alignment of Japanese stingray melanocortin receptors with other gnathostome melanocortin receptor sequences, and the ligand selectivity of Japanese stingray melanocortin receptors

    Directory of Open Access Journals (Sweden)

    Akiyoshi Takahashi

    2016-06-01

    Full Text Available This article contains structure and pharmacological characteristics of melanocortin receptors (MCRs related to research published in “Characterization of melanocortin receptors from stingray Dasyatis akajei, a cartilaginous fish” (Takahashi et al., 2016 [1]. The amino acid sequences of the stingray, D. akajei, MC1R, MC2R, MC3R, MC4R, and MC5R were aligned with the corresponding melanocortin receptor sequences from the elephant shark, Callorhinchus milii, the dogfish, Squalus acanthias, the goldfish, Carassius auratus, and the mouse, Mus musculus. These alignments provide the basis for phylogenetic analysis of these gnathostome melanocortin receptor sequences. In addition, the Japanese stingray melanocortin receptors were separately expressed in Chinese Hamster Ovary cells, and stimulated with stingray ACTH, α-MSH, β-MSH, γ-MSH, δ-MSH, and β-endorphin. The dose response curves reveal the order of ligand selectivity for each stingray MCR.

  17. Cloning, DNA sequencing and heterologous expression of the gene for thermostable N-acylamino acid racemase from Amycolatopsis sp. TS-1-60 in Escherichia coli.

    Science.gov (United States)

    Tokuyama, S; Hatano, K

    1995-03-01

    The gene encoding the novel enzyme N-acylamino acid racemase (AAR) was cloned in recombinant phage lambda-4 from the DNA library of Amycolatopsis sp. TS-1-60, a rare actinomycete, using antiserum against the enzyme. The cloned gene was subcloned and transformed in Escherichia coli JM105 using pUC118 as a vector. The AAR gene consists of an open-reading frame of 1104 nucleotides, which specifies a 368-amino-acid protein with a molecular mass of 39411Da. The molecular mass deduced from the AAR gene is in good agreement with the subunit molecular mass (40kDa) of AAR from Amycolatopsis sp. TS-1-60. The guanosine plus cytosine content of the AAR gene was about 70%. Although the AAR gene uses the unusual initiation codon GTG, the gene was expressed in Escherichia coli using the lac promoter of pUC118. The amount of the enzyme produced by the transformant was 16 times that produced by Amycolatopsis sp. TS-1-60. When the unusual initiation codon GTG was changed to ATG, the enzyme productivity of the transformant increased to more than 37 times that of Amycolatopsis sp. TS-1-60. In the comparison of the DNA sequence and the deduced amino acid sequence of AAR with those of known racemases and epimerases in data bases, no significant sequence homology was found. However, AAR resembles mandelate racemase in that requires metal ions for enzyme activity.(ABSTRACT TRUNCATED AT 250 WORDS)

  18. Separation of small molecular peptides with the same amino acid composition but different sequences by high performance liquid chromatography-electrospray ionization-mass spectrometry

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Peptidomics has emerged as a new discipline in recent years. Mass spectrometry (MS) is the most universal and efficient tool for structure identification of proteins and peptides. However,there is a limitation for the identification of peptides with the same amino acid composition but different se-quences because these peptides have identical mass spectra of molecular ions. This paper presents a high performance liquid chromatography-electrospray ionization-mass spectrometry (HPLC-ESI-MS) method for the separation of small molecular peptides with the same amino acid composition but dif-ferent sequences. Two tripeptides of Gly-Ser-Phe and Gly-Phe-Ser were used as a model sample. The separation behavior has been investigated and the separation conditions have been optimized. Under the optimum conditions,good repeatability was achieved. The developed method could provide a helpful reference for the separation of other peptides with the same amino acid composition but different sequences in the study of proteomics and peptidomics.

  19. Amino acid sequence of the alpha subunit and computer modelling of the alpha and beta subunits of echicetin from the venom of Echis carinatus (saw-scaled viper).

    Science.gov (United States)

    Polgár, J; Magnenat, E M; Peitsch, M C; Wells, T N; Saqi, M S; Clemetson, K J

    1997-04-15

    Echicetin, a heterodimeric protein from the venom of Echis carinatus, binds to platelet glycoprotein Ib (GPIb) and so inhibits platelet aggregation or agglutination induced by various platelet agonists acting via GPIb. The amino acid sequence of the beta subunit of echicetin has been reported and found to belong to the recently identified snake venom subclass of the C-type lectin protein family. Echicetin alpha and beta subunits were purified. N-terminal sequence analysis provided direct evidence that the protein purified was echicetin. The paper presents the complete amino acid sequence of the alpha subunit and computer models of the alpha and beta subunits. The sequence of alpha echicetin is highly similar to the alpha and beta chains of various heterodimeric and homodimeric C-type lectins. Neither of the fully reduced and alkylated alpha or beta subunits of echicetin inhibited the platelet agglutination induced by von Willebrand factor-ristocetin or alpha-thrombin. Earlier reports about the inhibitory activity of reduced and alkylated echicetin beta subunit might have been due to partial reduction of the protein. PMID:9163349

  20. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    Directory of Open Access Journals (Sweden)

    Kager Piet A

    2006-10-01

    Full Text Available Abstract Background Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (Plasmodium antigen detection do often not allow for species discrimination as microscopy does, but also become insensitive at Methods This paper reports the development of a sensitive and specific real-time Quantitative Nucleic Acid Sequence Based Amplification (real-time QT-NASBA assays, based on the small-subunit 18S rRNA gene, to identify the four human Plasmodium species. Results The lower detection limit of the assay is 100 – 1000 molecules in vitro RNA for all species, which corresponds to 0.01 – 0.1 parasite per diagnostic sample (i.e. 50 μl of processed blood. The real-time QT-NASBA was further evaluated using 79 clinical samples from malaria patients: i.e. 11 Plasmodium. falciparum, 37 Plasmodium vivax, seven Plasmodium malariae, four Plasmodium ovale and 20 mixed infections. The initial diagnosis of 69 out of the 79 samples was confirmed with the developed real-time QT-NASBA. Re-analysis of seven available original slides resolved five mismatches. Three of those were initially identified as P. malariae mono-infection, but after re-reading the slides P. falciparum was found, confirming the real-time QT-NASBA result. The other two slides were of poor quality not allowing true species identification. The remaining five discordant results could not be explained by microscopy, but may be due to extreme low numbers of parasites present in the samples. In addition, 12 Plasmodium berghei isolates from mice and 20 blood samples from healthy donors did not show any reaction in the assay. Conclusion Real-time QT-NASBA is a very sensitive and specific technique with a detection limit of 0.1 Plasmodium parasite per diagnostic sample (50 μl of blood and can be used for the detection, identification and quantitative measurement of low parasitaemia of Plasmodium species, thus

  1. Ethosomes® and transfersomes® containing linoleic acid: physicochemical and technological features of topical drug delivery carriers for the potential treatment of melasma disorders.

    Science.gov (United States)

    Celia, Christian; Cilurzo, Felisa; Trapasso, Elena; Cosco, Donato; Fresta, Massimo; Paolino, Donatella

    2012-02-01

    Two vesicular colloidal carriers, ethosomes® and transfersomes® were proposed for the topical delivery of linoleic acid, an active compound used in the therapeutic treatment of hyperpigmentation disorders, i.e. melasma, which is characterized by an increase of the melanin production in the epidermis. Dynamic light scattering was used for the physicochemical characterization of vesicles and mean size, size distribution and zeta potential were evaluated. The stability of formulations was also evaluated using the Turbiscan Lab® Expert based on the analysis of sample transmittance and photon backscattering. Ethosomes® and transfersomes® were prepared using Phospholipon 100 G®, as the lecithin component, and ethanol and sodium cholate, as edge activator agents, respectively. Linoleic acid at 0.05% and 0.1% (w/v) was used as the active ingredient and entrapped in colloidal vesicles. Technological parameters, i.e. entrapment efficacy, drug release and permeation profiles, were also investigated. Experimental findings showed that physicochemical and technological features of ethosomes® and transfersomes® were influenced by the lipid composition of the carriers. The percutaneous permeation experiments of linoleic acid-loaded ethosomes® and transfersomes® through human stratum corneum-epidermidis membranes showed that both carriers are accumulated in the skin membrane model as a function of their lipid compositions. The findings reported in this investigation showed that both vesicular carriers could represent a potential system for the topical treatment of hyperpigmentation disorders. PMID:21960035

  2. Ethosomes® and transfersomes® containing linoleic acid: physicochemical and technological features of topical drug delivery carriers for the potential treatment of melasma disorders.

    Science.gov (United States)

    Celia, Christian; Cilurzo, Felisa; Trapasso, Elena; Cosco, Donato; Fresta, Massimo; Paolino, Donatella

    2012-02-01

    Two vesicular colloidal carriers, ethosomes® and transfersomes® were proposed for the topical delivery of linoleic acid, an active compound used in the therapeutic treatment of hyperpigmentation disorders, i.e. melasma, which is characterized by an increase of the melanin production in the epidermis. Dynamic light scattering was used for the physicochemical characterization of vesicles and mean size, size distribution and zeta potential were evaluated. The stability of formulations was also evaluated using the Turbiscan Lab® Expert based on the analysis of sample transmittance and photon backscattering. Ethosomes® and transfersomes® were prepared using Phospholipon 100 G®, as the lecithin component, and ethanol and sodium cholate, as edge activator agents, respectively. Linoleic acid at 0.05% and 0.1% (w/v) was used as the active ingredient and entrapped in colloidal vesicles. Technological parameters, i.e. entrapment efficacy, drug release and permeation profiles, were also investigated. Experimental findings showed that physicochemical and technological features of ethosomes® and transfersomes® were influenced by the lipid composition of the carriers. The percutaneous permeation experiments of linoleic acid-loaded ethosomes® and transfersomes® through human stratum corneum-epidermidis membranes showed that both carriers are accumulated in the skin membrane model as a function of their lipid compositions. The findings reported in this investigation showed that both vesicular carriers could represent a potential system for the topical treatment of hyperpigmentation disorders.

  3. Intracellular traffic of the lysine and glutamic acid rich protein KERP1 reveals features of endomembrane organization in Entamoeba histolytica.

    Science.gov (United States)

    Perdomo, Doranda; Manich, Maria; Syan, Sylvie; Olivo-Marin, Jean-Christophe; Dufour, Alexandre C; Guillén, Nancy

    2016-08-01

    The development of amoebiasis is influenced by the expression of the lysine and glutamic acid rich protein 1 (KERP1), a virulence factor involved in Entamoeba histolytica adherence to human cells. Up to date, it is unknown how the protein transits the parasite cytoplasm towards the plasma membrane, specially because this organism lacks a well-defined endoplasmic reticulum (ER) and Golgi apparatus. In this work we demonstrate that KERP1 is present at the cell surface and in intracellular vesicles which traffic in a pathway that is independent of the ER-Golgi anterograde transport. The intracellular displacement of vesicles enriched in KERP1 relies on the actin-rich cytoskeleton activities. KERP1 is also present in externalized vesicles deposited on the surface of human cells. We further report the interactome of KERP1 with its association to endomembrane components and lipids. The model for KERP1 traffic here proposed hints for the first time elements of the endocytic and exocytic paths of E. histolytica. PMID:26857352

  4. The Origin and Features of the Two Intermediate-Acid Intrusive Series in Tongling Area, Anhui, China

    Institute of Scientific and Technical Information of China (English)

    WU Cailai; CHEN Songyong; HAO Meiying; SHI Rendeng

    2001-01-01

    The Mesozoic intermediate-acid intrusive rocks in Tongling area can be divided into two magmatic series, namely the shoshonitic series and the high-potassium (K) calc-alkaline series. The shoshonitic series is characterized by an association of pyroxene monzodiorites+monzodiorites+quartz monzonites, the Rittmann indices of these rocks are greater than 4 (ranging from 4.2 to 5.8), the total of rare earth elements (∑REE) ranges from 210×10-6 to 260×10-6, and has good relationship with gold deposits. The high-potassium calc-alkaline series is by an association of diorite+quartz monzodiorites+granodiorites, the Rittmann indices are less than 4 (1.8-3.2), ∑REE 130×l0-6-180×10-6,and has good relationship with copper deposits. Together with lithoenclave study, it can be concluded that the shoshonitic series magma is derived from the differentiation of the alkaline basic magma in mantle and the high-potassium (K) calc-alkaline series magma is mixture between the differentiated mantle magma and crustal magma.

  5. Interactive Effect of Salicylic Acid on Some Physiological Features and Antioxidant Enzymes Activity in Ginger (Zingiber officinale Roscoe

    Directory of Open Access Journals (Sweden)

    Hawa Z. E. Jaafar

    2013-05-01

    Full Text Available The effect of foliar salicylic acid (SA applications (10−3 and 10−5 M on activities of nitrate reductase, guaiacol peroxidase (POD, superoxide dismutases (SOD, catalase (CAT and proline enzymes and physiological parameters was evaluated in two ginger varieties (Halia Bentong and Halia Bara under greenhouse conditions. In both varieties, tested treatments generally enhanced photosynthetic rate and total dry weight. Photosynthetic rate increases were generally accompanied by increased or unchanged stomatal conductance levels, although intercellular CO2 concentrations of treated plants were typically lower than in controls. Lower SA concentrations were generally more effective in enhancing photosynthetic rate and plant growth. Exogenous application of SA increased antioxidant enzyme activities and proline content; the greatest responses were obtained in plants sprayed with 10–5 M SA, with significant increases observed in CAT (20.1%, POD (45.2%, SOD (44.1% and proline (43.1% activities. Increased CAT activity in leaves is naturally expected to increase photosynthetic efficiency and thus net photosynthesis by maintaining a constant CO2 supply. Our results support the idea that low SA concentrations (10–5 M may induce nitrite reductase synthesis by mobilizing intracellular NO3− and can provide protection to nitrite reductase degradation in vivo in the absence of NO3–. Observed positive correlations among proline, SOD, CAT and POD activities in the studied varieties suggest that increased SOD activity was accompanied by increases in CAT and POD activities because of the high demands of H2O2 quenching.

  6. Interactive effect of salicylic acid on some physiological features and antioxidant enzymes activity in ginger (Zingiber officinale Roscoe).

    Science.gov (United States)

    Ghasemzadeh, Ali; Jaafar, Hawa Z E

    2013-01-01

    The effect of foliar salicylic acid (SA) applications (10⁻³ and 10⁻⁵ M) on activities of nitrate reductase, guaiacol peroxidase (POD), superoxide dismutases (SOD), catalase (CAT) and proline enzymes and physiological parameters was evaluated in two ginger varieties (Halia Bentong and Halia Bara) under greenhouse conditions. In both varieties, tested treatments generally enhanced photosynthetic rate and total dry weight. Photosynthetic rate increases were generally accompanied by increased or unchanged stomatal conductance levels, although intercellular CO₂ concentrations of treated plants were typically lower than in controls. Lower SA concentrations were generally more effective in enhancing photosynthetic rate and plant growth. Exogenous application of SA increased antioxidant enzyme activities and proline content; the greatest responses were obtained in plants sprayed with 10⁻⁵ M SA, with significant increases observed in CAT (20.1%), POD (45.2%), SOD (44.1%) and proline (43.1%) activities. Increased CAT activity in leaves is naturally expected to increase photosynthetic efficiency and thus net photosynthesis by maintaining a constant CO₂ supply. Our results support the idea that low SA concentrations (10⁻⁵ M) may induce nitrite reductase synthesis by mobilizing intracellular NO³⁻ and can provide protection to nitrite reductase degradation in vivo in the absence of NO³⁻. Observed positive correlations among proline, SOD, CAT and POD activities in the studied varieties suggest that increased SOD activity was accompanied by increases in CAT and POD activities because of the high demands of H₂O₂ quenching. PMID:23698049

  7. KSHV 2.0: A Comprehensive Annotation of the Kaposi's Sarcoma-Associated Herpesvirus Genome Using Next-Generation Sequencing Reveals Novel Genomic and Functional Features

    OpenAIRE

    Carolina Arias; Ben Weisburd; Noam Stern-Ginossar; Alexandre Mercier; Alexis S Madrid; Priya Bellare; Meghan Holdorf; Jonathan S Weissman; Don Ganem

    2014-01-01

    Productive herpesvirus infection requires a profound, time-controlled remodeling of the viral transcriptome and proteome. To gain insights into the genomic architecture and gene expression control in Kaposi's sarcoma-associated herpesvirus (KSHV), we performed a systematic genome-wide survey of viral transcriptional and translational activity throughout the lytic cycle. Using mRNA-sequencing and ribosome profiling, we found that transcripts encoding lytic genes are promptly bound by ribosomes...

  8. Transcriptional analysis of the HeT-A retrotransposon in mutant and wild type stocks reveals high sequence variability at Drosophila telomeres and other unusual features

    Directory of Open Access Journals (Sweden)

    Piñeyro David

    2011-11-01

    Full Text Available Abstract Background Telomere replication in Drosophila depends on the transposition of a domesticated retroelement, the HeT-A retrotransposon. The sequence of the HeT-A retrotransposon changes rapidly resulting in differentiated subfamilies. This pattern of sequence change contrasts with the essential function with which the HeT-A is entrusted and brings about questions concerning the extent of sequence variability, the telomere contribution of different subfamilies, and whether wild type and mutant Drosophila stocks show different HeT-A scenarios. Results A detailed study on the variability of HeT-A reveals that both the level of variability and the number of subfamilies are higher than previously reported. Comparisons between GIII, a strain with longer telomeres, and its parental strain Oregon-R indicate that both strains have the same set of HeT-A subfamilies. Finally, the presence of a highly conserved splicing pattern only in its antisense transcripts indicates a putative regulatory, functional or structural role for the HeT-A RNA. Interestingly, our results also suggest that most HeT-A copies are actively expressed regardless of which telomere and where in the telomere they are located. Conclusions Our study demonstrates how the HeT-A sequence changes much faster than previously reported resulting in at least nine different subfamilies most of which could actively contribute to telomere extension in Drosophila. Interestingly, the only significant difference observed between Oregon-R and GIII resides in the nature and proportion of the antisense transcripts, suggesting a possible mechanism that would in part explain the longer telomeres of the GIII stock.

  9. Moving Away from the Reference Genome: Evaluating a Peptide Sequencing Tagging Approach for Single Amino Acid Polymorphism Identifications in the Genus Populus

    Energy Technology Data Exchange (ETDEWEB)

    Abraham, Paul E [ORNL; Adams, Rachel M [ORNL; Tuskan, Gerald A [ORNL; Hettich, Robert {Bob} L [ORNL

    2013-01-01

    The genetic diversity across natural populations of the model organism, Populus, is extensive, containing a single nucleotide polymorphism roughly every 200 base pairs. When deviations from the reference genome occur in coding regions, they can impact protein sequences. Rather than relying on a static reference database to profile protein expression, we employed a peptide sequence tagging (PST) approach capable of decoding the plasticity of the Populus proteome. Using shotgun proteomics data from two genotypes of P. trichocarpa, a tag-based approach enabled the detection of 6,653 unexpected sequence variants. Through manual validation, our study investigated how the most abundant chemical modification (methionine oxidation) could masquerade as a sequence variant (AlaSer) when few site-determining ions existed. In fact, precise localization of an oxidation site for peptides with more than one potential placement was indeterminate for 70% of the MS/MS spectra. We demonstrate that additional fragment ions made available by high energy collisional dissociation enhances the robustness of the peptide sequence tagging approach (81% of oxidation events could be exclusively localized to a methionine). We are confident that augmenting fragmentation processes for a PST approach will further improve the identification of single amino acid polymorphism in Populus and potentially other species as well.

  10. Assignment of fatty acid-beta-oxidizing syntrophic bacteria to Syntrophomonadaceae fam. nov. on the basis of 16S rRNA sequence analyses

    Science.gov (United States)

    Zhao, H.; Yang, D.; Woese, C. R.; Bryant, M. P.

    1993-01-01

    After enrichment from Chinese rural anaerobic digestor sludge, anaerobic, sporing and nonsporing, saturated fatty acid-beta-oxidizing syntrophic bacteria were isolated as cocultures with H2- and formate-utilizing Methanospirillum hungatei or Desulfovibrio sp. strain G-11. The syntrophs degraded C4 to C8 saturated fatty acids, including isobutyrate and 2-methylbutyrate. They were adapted to grow on crotonate and were isolated as pure cultures. The crotonate-grown pure cultures alone did not grow on butyrate in either the presence or the absence of some common electron acceptors. However, when they were reconstituted with M. hungatei, growth on butyrate again occurred. In contrast, crotonate-grown Clostridium kluyveri and Clostridium sticklandii, as well as Clostridium sporogenes, failed to grow on butyrate when these organisms were cocultured with M. hungatei. The crotonate-grown pure subcultures of the syntrophs described above were subjected to 16S rRNA sequence analysis. Several previously documented fatty acid-beta-oxidizing syntrophs grown in pure cultures with crotonate were also subjected to comparative sequence analyses. The sequence analyses revealed that the new sporing and nonsporing isolates and other syntrophs that we sequenced, which had either gram-negative or gram-positive cell wall ultrastructure, all belonged to the phylogenetically gram-positive phylum. They were not closely related to any of the previously known subdivisions in the gram-positive phylum with which they were compared, but were closely related to each other, forming a new subdivision in the phylum. We recommend that this group be designated Syntrophomonadaceae fam. nov.; a description is given.

  11. Amino acid sequence and carbohydrate-binding analysis of the N-acetyl-D-galactosamine-specific C-type lectin, CEL-I, from the Holothuroidea, Cucumaria echinata.

    Science.gov (United States)

    Hatakeyama, Tomomitsu; Matsuo, Noriaki; Shiba, Kouhei; Nishinohara, Shoichi; Yamasaki, Nobuyuki; Sugawara, Hajime; Aoyagi, Haruhiko

    2002-01-01

    CEL-I is one of the Ca2+-dependent lectins that has been isolated from the sea cucumber, Cucumaria echinata. This protein is composed of two identical subunits held by a single disulfide bond. The complete amino acid sequence of CEL-I was determined by sequencing the peptides produced by proteolytic fragmentation of S-pyridylethylated CEL-I. A subunit of CEL-I is composed of 140 amino acid residues. Two intrachain (Cys3-Cys14 and Cys31-Cys135) and one interchain (Cys36) disulfide bonds were also identified from an analysis of the cystine-containing peptides obtained from the intact protein. The similarity between the sequence of CEL-I and that of other C-type lectins was low, while the C-terminal region, including the putative Ca2+ and carbohydrate-binding sites, was relatively well conserved. When the carbohydrate-binding activity was examined by a solid-phase microplate assay, CEL-I showed much higher affinity for N-acetyl-D-galactosamine than for other galactose-related carbohydrates. The association constant of CEL-I for p-nitrophenyl N-acetyl-beta-D-galactosaminide (NP-GalNAc) was determined to be 2.3 x 10(4) M(-1), and the maximum number of bound NP-GalNAc was estimated to be 1.6 by an equilibrium dialysis experiment. PMID:11866098

  12. A comparative study of 2',3'-cyclic-nucleotide 3'-phosphodiesterase in vertebrates: cDNA cloning and amino acid sequences for chicken and bullfrog enzymes.

    Science.gov (United States)

    Kasama-Yoshida, H; Tohyama, Y; Kurihara, T; Sakuma, M; Kojima, H; Tamai, Y

    1997-10-01

    In mammalian brain, two 2',3'-cyclic-nucleotide 3'-phosphodiesterase (EC 3.1.4.37) isoforms, CNP1 and CNP2, are translated, respectively, from the two mRNAs, which have been transcribed and processed by alternative use of the two transcription start points and by differential splicing. In the present study, the cDNAs encoding chicken CNP2 and bullfrog CNP1, respectively, were isolated, and the amino acid sequences of chicken CNP2 and bullfrog CNP1 were deduced. Western blot analysis showed that chicken brain contains a major CNP2-type protein together with a minor unidentified isoform, and bullfrog brain contains only a CNP1-type protein. All available amino acid sequences of vertebrate 2',3'-cyclic-nucleotide 3'-phosphodiesterases were aligned and compared. Three conserved motif sequences were noted: (a) an ATP-binding site near the amino terminus, (b) an isoprenylation site at the carboxyl terminus, and (c) a probable catalytic site resembling the active site of beta-ketoacyl synthase (EC 2.3.1.41). The second and the third motifs are conserved also in goldfish RICH (regeneration-induced 2',3'-cyclic-nucleotide 3'-phosphodiesterase homologue), which has been shown recently to have 2',3'-cyclic-nucleotide 3'-phosphodiesterase activity. The third motif (probably catalytic site) was assigned for the first time in the present report. PMID:9326261

  13. The complete genome sequence of Natrinema sp. J7-2, a haloarchaeon capable of growth on synthetic media without amino acid supplements.

    Directory of Open Access Journals (Sweden)

    Jie Feng

    Full Text Available Natrinema sp. J7-2 is an extreme haloarchaeon capable of growing on synthetic media without amino acid supplements. Here we report the complete genome sequence of Natrinema sp. J7-2 which is composed of a 3,697,626-bp chromosome and a 95,989-bp plasmid pJ7-I. This is the first complete genome sequence of a member of the genus Natrinema. We demonstrate that Natrinema sp. J7-2 can use gluconate, glycerol, or acetate as the sole carbon source and that its genome encodes complete metabolic pathways for assimilating these substrates. The biosynthetic pathways for all 20 amino acids have been reconstructed, and we discuss a possible evolutionary relationship between the haloarchaeal arginine synthetic pathway and the bacterial lysine synthetic pathway. The genome harbors the genes for assimilation of ammonium and nitrite, but not nitrate, and has a denitrification pathway to reduce nitrite to N(2O. Comparative genomic analysis suggests that most sequenced haloarchaea employ the TrkAH system, rather than the Kdp system, to actively uptake potassium. The genomic analysis also reveals that one of the three CRISPR loci in the Natrinema sp. J7-2 chromosome is located in an integrative genetic element and is probably propagated via horizontal gene transfer (HGT. Finally, our phylogenetic analysis of haloarchaeal genomes provides clues about evolutionary relationships of haloarchaea.

  14. Comparison of nucleic acid hybridization and nucleic acid amplification using conserved sequences from the 5' noncoding region for detection of bovine viral diarrhea virus.

    OpenAIRE

    Ridpath, J F; Bolin, S R; Katz, J

    1993-01-01

    Primers and probes derived from conserved sequences located in the 5' noncoding region of pestiviruses were evaluated for detection of bovine viral diarrhea virus. With these reagents, hybridization and polymerase chain reaction tests detected 62 of 90 and 90 of 90 bovine viral diarrhea virus isolates, respectively. A quick lysis method for preparing RNA for use in polymerase chain reaction amplification also was evaluated.

  15. Transcriptome sequencing revealed the transcriptional organization at ribosome-mediated attenuation sites in Corynebacterium glutamicum and identified a novel attenuator involved in aromatic amino acid biosynthesis.

    Science.gov (United States)

    Neshat, Armin; Mentz, Almut; Rückert, Christian; Kalinowski, Jörn

    2014-11-20

    The Gram-positive bacterium Corynebacterium glutamicum belongs to the order Corynebacteriales and is used as a producer of amino acids at industrial scales. Due to its economic importance, gene expression and particularly the regulation of amino acid biosynthesis has been investigated extensively. Applying the high-resolution technique of transcriptome sequencing (RNA-seq), recently a vast amount of data has been generated that was used to comprehensively analyze the C. glutamicum transcriptome. By analyzing RNA-seq data from a small RNA cDNA library of C. glutamicum, short transcripts in the known transcriptional attenuators sites of the trp operon, the ilvBNC operon and the leuA gene were verified. Furthermore, whole transcriptome RNA-seq data were used to elucidate the transcriptional organization of these three amino acid biosynthesis operons. In addition, we discovered and analyzed the novel attenuator aroR, located upstream of the aroF gene (cg1129). The DAHP synthase encoded by aroF catalyzes the first step in aromatic amino acid synthesis. The AroR leader peptide contains the amino acid sequence motif F-Y-F, indicating a regulatory effect by phenylalanine and tyrosine. Analysis by real-time RT-PCR suggests that the attenuator regulates the transcription of aroF in dependence of the cellular amount of tRNA loaded with phenylalanine when comparing a phenylalanine-auxotrophic C. glutamicum mutant fed with limiting and excess amounts of a phenylalanine-containing dipeptide. Additionally, the very interesting finding was made that all analyzed attenuators are leaderless transcripts. PMID:24910972

  16. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    Science.gov (United States)

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar and the compound identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella querinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae, tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov. PMID:26071988

  17. FiveS rRNA sequences and fatty acid profiles of colourless sulfur-oxidising bacteria

    Digital Repository Service at National Institute of Oceanography (India)

    LokaBharathi, P.A.; Ortiz-conde, B.A.; Nair, S.; Chandramohan, D.; Colwell, R.R.

    of these organisms have defied identification by API (Appareils et Procedes d'Identification) and biology systems. Hence, computer-aided MIS (Microbial Identification System) has been used to define their relatedness through fatty acid profiles. To characterize...

  18. FeatureViewer, a BioJS component for visualization of position-based annotations in protein sequences [v1; ref status: indexed, http://f1000r.es/2u2

    Directory of Open Access Journals (Sweden)

    Leyla Garcia

    2014-02-01

    Full Text Available Summary: FeatureViewer is a BioJS component that lays out, maps, orients, and renders position-based annotations for protein sequences. This component is highly flexible and customizable, allowing the presentation of annotations by rows, all centered, or distributed in non-overlapping tracks. It uses either lines or shapes for sites and rectangles for regions. The result is a powerful visualization tool that can be easily integrated into web applications as well as documents as it provides an export-to-image functionality. Availability: https://github.com/biojs/biojs/blob/master/src/main/javascript/Biojs.FeatureViewer.js; http://dx.doi.org/10.5281/zenodo.7719

  19. Comparison of amino acid sequences of the trypsin inhibitors from taro (Colocasia esculenta), giant taro (Alocasia macrorrhiza) and giant swamp taro (Cyrtosperma chamissonis).

    Science.gov (United States)

    Peng, L; Bradbury, J H; Hammer, B C; Shaw, D C

    1993-09-01

    The amino acid sequences of the trypsin inhibitors from taro Colocasia esculenta var. esculenta and giant swamp taro Cyrtosperma chamissonis have been determined and are compared with the protein sequence of the trypsin/chymotrypsin inhibitor from giant taro Alocasia macrorrhiza. Both inhibitors display polymorphism and there is evidence of two components in the giant swamp taro. The positional identity between the proteins is highest at 73-75% for the comparison of the giant taro (GT) with the polymorphic forms of the taro (T) inhibitors and lowest at 56-58% for the pairs of taro and giant swamp taro (GST) proteins. The comparisons show that the inhibitors from T and GT are more related to each other than to GST, which supports their taxonomic classification into different tribes. Location of the P1 site for the trypsin inhibitors of aroids is different from that of other Kunitz-type inhibitors and could be at Leu56.

  20. Development of an electrochemical DNA biosensor for detection of specific Mycobacterium tuberculosis sequence based on poly(L-glutamic acid) modified electrode

    Indian Academy of Sciences (India)

    MERVE YESIL; SONER DONMEZ; FATMA ARSLAN

    2016-11-01

    An electrochemical DNA biosensor was developed by avidin-biotin interaction of a biotinylated probe and avidin-attached, poly(L-glutamic) acid coated pencil graphite electrode (PGA/PGE) for detection of specific Mycobacterium tuberculosis DNA sequence. The discrimination of fully complementary hybridization and mismatch hybridization was carried out by electrochemical reduction current of Meldola’s Blue (MDB) in square wave voltammetry (SWV). The calibration graph of the DNA biosensor was linear between 1.5–12.5 nM and the detection limit was calculated as 1.3 nM. The proposed biosensor successfully discriminated short andlong oligonucleotides related to DNA sequence of Mycobacterium tuberculosis in optimal condition.

  1. Amino acid sequence of toxin XI of the scorpion Buthus occitanus tunetanus. Evidence of a mutation having an important effect upon neurotoxic activity.

    Science.gov (United States)

    Sampieri, F; Habersetzer-Rochat, C; Martin, M F; Kopeyan, C; Rochat, H

    1987-02-01

    The complete amino acid sequence of toxin XI of the North African scorpion Buthus occitanus tunetanus has been elucidated by automatic sequencing of the reduced and alkylated toxin and of the peptides obtained after tryptic cleavage restricted to arginyl bonds. This toxin is structurally homologous to toxin II of Androctonus australis Hector, the most active among the alpha-toxins, but is far less potent, both in vivo and in vitro. This work points out 12 mutations, many of which are conservative. Nevertheless, the most striking difference is the replacement of the lysine residue at position 58, known to be important in the activity of AaH toxin II, by a valine residue. Thus, it seems that the presence of a positive charge at this location facilitates the interactions between the receptor on the sodium channel and the alpha-type toxins.

  2. Amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by streptococcal IgA1 proteases

    DEFF Research Database (Denmark)

    Batten, MR; Senior, BW; Kilian, Mogens;

    2003-01-01

    The amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by IgA1 proteases of different species of Streptococcus were investigated. Recombinant IgA1 antibodies were generated with point mutations at proline 227 and threonine 228, the residues lying on either...... that proteases from different strains may have different amino acid requirements for cleavage. Remarkably, some streptococcal proteases appeared able to cleave the hinge at a distant alternative site if substitution prevented efficient cleavage of the original site. Hence, this study has identified key residues...... required for the recognition of the IgA1 hinge as a substrate by streptococcal IgA1 proteases, and it marks a preliminary step towards development of specific enzyme inhibitors....

  3. Genotyping by RAD sequencing enables mapping of fatty acid composition traits in perennial ryegrass (Lolium perenne (L.)).

    Science.gov (United States)

    Hegarty, Matthew; Yadav, Rattan; Lee, Michael; Armstead, Ian; Sanderson, Ruth; Scollan, Nigel; Powell, Wayne; Skøt, Leif

    2013-06-01

    Perennial ryegrass (Lolium perenne L.) is the most important forage crop in temperate livestock agriculture. Its nutritional quality has significant impact on the quality of meat and milk for human consumption. Evidence suggests that higher energy content in forage can assist in reducing greenhouse gas emissions from ruminants. Increasing the fatty acid content (especially α-linolenic acid, an omega-3 fatty acid) may thus contribute to better forage, but little is known about the genetic basis of variation for this trait. To this end, quantitative trait loci (QTLs) were identified associated with major fatty acid content in perennial ryegrass using a population derived from a cross between the heterozygous and outbreeding high-sugar grass variety AberMagic and an older variety, Aurora. A genetic map with 434 restriction-associated DNA (RAD) and SSR markers was generated. Significant QTLs for the content of palmitic (C16:0) on linkage groups (LGs) 2 and 7; stearic (C18:0) on LGs 3, 4 and 7; linoleic (C18:2n-6) on LGs 2 and 5; and α-linolenic acids (C18:3n-3) on LG 1 were identified. Two candidate genes (a lipase and a beta-ketoacyl CoA synthase), both associated with C16:0, and separately with C18:2n-6 and C18:0 contents, were identified. The physical positions of these genes in rice and their genetic positions in perennial ryegrass were consistent with established syntenic relationships between these two species. Validation of these associations is required, but the utility of RAD markers for rapid generation of genetic maps and QTL analysis has been demonstrated for fatty acid composition in a global forage crop. PMID:23331642

  4. A case study on the genetic origin of the high oleic acid trait through FAD2-1 DNA sequence variation in safflower (Carthamus tinctorius L.

    Directory of Open Access Journals (Sweden)

    Sara eRapson

    2015-09-01

    Full Text Available The safflower (Carthamus tinctorius L. is considered a strongly domesticated species with a long history of cultivation. The hybridization of safflower with its wild relatives has played an important role in the evolution of cultivars and is of particular interest with regards to their production of high quality edible oils. Original safflower varieties were all rich in linoleic acid, while varieties rich in oleic acid have risen to prominence in recent decades. The high oleic acid trait is controlled by a partially recessive allele ol at a single locus OL. The ol allele was found to be a defective microsomal oleate desaturase FAD2-1. Here we present DNA sequence data and DNA Southern blot analysis suggesting that there has been an ancient hybridization and introgression of the FAD2-1 gene into C. tinctorius from its wild relative C. palaestinus. It is from this gene that FAD2-1Δ was derived more recently. Identification and characterization of the genetic origin and diversity of FAD2-1 could aid safflower breeders in reducing population size and generations required for the development of new high oleic acid varieties by using perfect molecular marker-assisted selection.

  5. Research on Nucleic Acid Sequence Design Methods for DNA Computing%DNA计算中核酸序列设计方法比较研究

    Institute of Scientific and Technical Information of China (English)

    张凯; 耿修堂; 肖建华; 赵东明

    2008-01-01

    DNA computing maps the instances of a reality problem onto specific nucleic acid mol-ecules and protocols so that the result contains the answers to the problem's instances to enable successful extraction.Good DNA sequences prevent unwanted hybridization errors during the computation and enable easy retrieval the answers in the extraction phase.At first,the paper in-troduces two typical nucleic acid sequence methods,free energy based method and Hamming dis-tance based method,and analyzes the influence of constraints on the quality of DNA sequences.In addition,the paper has compared the integrity and computational cost of two methods for ex-cluding the non-specific hybridization sequence.At last,the paper analyzes the efficiency of two nucleic acid sequence design methods.Through the comparison we can see that the Hamming dis-tance based DNA sequence design method can only approximately estimate the thermodynamic stability of DNA hybridization,and can not be a perfect substitute for the minimum free energy method.Further analysis shows that when only consider the free energy of Watson-crick base pairs,the calculation of free energy will be equal to the Hamming distance method.So,if the precision satisfied the requirement of experiment,Hamming distance based DNA sequence design method,not only can effectively distinguish between specific and non-specific hybridization,and effectively reduce the amount of calculation,improve the efficiency of the DNA sequence designing.%DNA计算是将现实问题进行编码,映射到DNA分子上,然后通过分子生物实验产生出代表问题解的DNA分子.最后通过检测技术提取出该DNA分子.高质量的DNA编码可以尽可能避免或减少计算过程中出现的错误,并使检测阶段易于提取出代表问题解的DNA分子.文中对基于汉明距离和基于自由能的DNA核酸编码方法进行研究,分析了两类方法的约束条件对DNA编码质量的影响,比较了两类方法排除非特异性

  6. Draft genome sequence of extremely acidophilic bacterium Acidithiobacillus ferrooxidans DLC-5 isolated from acid mine drainage in Northeast China

    Directory of Open Access Journals (Sweden)

    Peng Chen

    2015-12-01

    Full Text Available Acidithiobacillus ferrooxidans type strain DLC-5, isolated from Wudalianchi in Heihe of Heilongjiang Province, China. Here, we present the draft genome of strain DLC-5 which contains 4,232,149 bp in 2745 contigs with 57.628% GC content and includes 32,719 protein-coding genes and 64 tRNA-encoding genes. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession no. JNNH00000000.1.

  7. Combination of Whole Genome Sequencing, Linkage and Functional Studies Implicates a Missense Mutation in Titin as a Cause of Autosomal Dominant Cardiomyopathy with Features of Left Ventricular Non-Compaction

    Science.gov (United States)

    Hooper, Charlotte; Ormondroyd, Liz; Pagnamenta, Alistair; Lise, Stefano; Salatino, Silvia; Knight, Samantha JL; Taylor, Jenny C.; Thomson, Kate L.; Arnold, Linda; Chatziefthimiou, Spyros D.; Konarev, Petr V.; Wilmanns, Matthias; Ehler, Elisabeth; Ghisleni, Andrea; Gautel, Mathias; Blair, Edward; Watkins, Hugh; Gehmlich, Katja

    2016-01-01

    Background High throughput next generation sequencing techniques have made whole genome sequencing accessible in clinical practice, however, the abundance of variation in the human genomes makes the identification of a disease-causing mutation on a background of benign rare variants challenging. Methods and Results Here we combine whole genome sequencing with linkage analysis in a three-generation family affected by cardiomyopathy with features of autosomal dominant left-ventricular non-compaction cardiomyopathy. A missense mutation in the giant protein titin is the only plausible disease-causing variant that segregates with disease amongst the eight surviving affected individuals, with interrogation of the entire genome excluding other potential causes. This A178D missense mutation, affecting a conserved residue in the second immunoglobulin-like domain of titin, was introduced in a bacterially expressed recombinant protein fragment and biophysically characterised in comparison to its wild-type counterpart. Multiple experiments, including size exclusion chromatography, small angle X-ray scattering and circular dichroism spectroscopy suggest partial unfolding and domain destabilisation in the presence of the mutation. Moreover, binding experiments in mammalian cells show that the mutation markedly impairs binding to the titin ligand telethonin. Conclusions Here we present genetic and functional evidence implicating the novel A178D missense mutation in titin as the cause of a highly penetrant familial cardiomyopathy with features of left-ventricular non-compaction. This expands the spectrum of titin’s roles in cardiomyopathies. It furthermore highlights that rare titin missense variants, currently often ignored or left un-interpreted, should be considered to be relevant for cardiomyopathies and can be identified by the approach presented here. PMID:27625337

  8. Efficacy and Safety of a Hyaluronic Acid Filler to Correct Aesthetically Detracting or Deficient Features of the Asian Nose: A Prospective, Open-Label, Long-Term Study

    Science.gov (United States)

    Liew, Steven; Scamp, Terrence; de Maio, Mauricio; Halstead, Michael; Johnston, Nicole; Silberberg, Michael; Rogers, John D.

    2016-01-01

    Background There is increasing interest among patients and plastic surgeons for alternatives to rhinoplasty, a common surgical procedure performed in Asia. Objectives To evaluate the safety, efficacy, and longevity of a hyaluronic acid filler in the correction of aesthetically detracting or deficient features of the Asian nose. Methods Twenty-nine carefully screened Asian patients had their noses corrected with the study filler (Juvéderm VOLUMA [Allergan plc, Dublin, Ireland] with lidocaine injectable gel), reflecting individualized treatment goals and utilizing a standardized injection procedure, and were followed for over 12 months. Results A clinically meaningful correction (≥1 grade improvement on the Assessment of Aesthetic Improvement Scale) was achieved in 27 (93.1%) patients at the first follow-up visit. This was maintained in 28 (96.6%) patients at the final visit, based on the independent assessments of a central non-injecting physician and the patients. At this final visit, 23 (79.3%) patients were satisfied or very satisfied with the study filler and 25 (86.2%) would recommend it to others. In this small series of patients, there were no serious adverse events (AEs), with all treatment-related AEs being mild to moderate, transient injection site reactions, unrelated to the study filler. Conclusions Using specific eligibility criteria, individualized treatment goals, and a standardized injection procedure, the study filler corrected aesthetically detracting or deficient features of the Asian nose, with the therapeutic effects lasting for over 12 months, consistent with a high degree of patient satisfaction. This study supports the safety and efficacy of this HA filler for specific nose augmentation procedures in selected Asian patients. Level of Evidence: 3 Therapeutic PMID:27301371

  9. Structural features of the ribonucleotide reductase of Aujeszky's disease virus.

    Science.gov (United States)

    Kaliman, A V; Boldogköi, Z; Fodor, I

    1994-01-01

    A gene construct of the Aujeszky's disease virus (ADV) genome was prepared and the DNA fragment encoding the ribonucleotide reductase was structurally characterized. We determined the entire DNA sequence of two adjacent open reading frames of the ribonucleotide reductase genes with the intergenic sequence of nine base pairs. From the sequence analysis we predict that Aujeszky's disease virus encodes a ribonucleotide reductase which comprises two polypeptides--large and small subunits, with sizes of 835 and 303 amino acids, respectively. Nucleotide and amino acid sequences of the large and small subunits of the Aujeszky's disease virus ribonucleotide reductase have been compared with that of other herpesviruses, and structural features of both proteins have been characterized. PMID:7810419

  10. Whole-Exome Sequencing in a South American Cohort Links ALDH1A3, FOXN1 and Retinoic Acid Regulation Pathways to Autism Spectrum Disorders.

    Science.gov (United States)

    Moreno-Ramos, Oscar A; Olivares, Ana María; Haider, Neena B; de Autismo, Liga Colombiana; Lattig, María Claudia

    2015-01-01

    Autism spectrum disorders (ASDs) are a range of complex neurodevelopmental conditions principally characterized by dysfunctions linked to mental development. Previous studies have shown that there are more than 1000 genes likely involved in ASD, expressed mainly in brain and highly interconnected among them. We applied whole exome sequencing in Colombian-South American trios. Two missense novel SNVs were found in the same child: ALDH1A3 (RefSeq NM_000693: c.1514T>C (p.I505T)) and FOXN1 (RefSeq NM_003593: c.146C>T (p.S49L)). Gene expression studies reveal that Aldh1a3 and Foxn1 are expressed in ~E13.5 mouse embryonic brain, as well as in adult piriform cortex (PC; ~P30). Conserved Retinoic Acid Response Elements (RAREs) upstream of human ALDH1A3 and FOXN1 and in mouse Aldh1a3 and Foxn1 genes were revealed using bioinformatic approximation. Chromatin immunoprecipitation (ChIP) assay using Retinoid Acid Receptor B (Rarb) as the immunoprecipitation target suggests RA regulation of Aldh1a3 and Foxn1 in mice. Our results frame a possible link of RA regulation in brain to ASD etiology, and a feasible non-additive effect of two apparently unrelated variants in ALDH1A3 and FOXN1 recognizing that every result given by next generation sequencing should be cautiously analyzed, as it might be an incidental finding. PMID:26352270

  11. Whole-Exome Sequencing in a South American Cohort Links ALDH1A3, FOXN1 and Retinoic Acid Regulation Pathways to Autism Spectrum Disorders.

    Science.gov (United States)

    Moreno-Ramos, Oscar A; Olivares, Ana María; Haider, Neena B; de Autismo, Liga Colombiana; Lattig, María Claudia

    2015-01-01

    Autism spectrum disorders (ASDs) are a range of complex neurodevelopmental conditions principally characterized by dysfunctions linked to mental development. Previous studies have shown that there are more than 1000 genes likely involved in ASD, expressed mainly in brain and highly interconnected among them. We applied whole exome sequencing in Colombian-South American trios. Two missense novel SNVs were found in the same child: ALDH1A3 (RefSeq NM_000693: c.1514T>C (p.I505T)) and FOXN1 (RefSeq NM_003593: c.146C>T (p.S49L)). Gene expression studies reveal that Aldh1a3 and Foxn1 are expressed in ~E13.5 mouse embryonic brain, as well as in adult piriform cortex (PC; ~P30). Conserved Retinoic Acid Response Elements (RAREs) upstream of human ALDH1A3 and FOXN1 and in mouse Aldh1a3 and Foxn1 genes were revealed using bioinformatic approximation. Chromatin immunoprecipitation (ChIP) assay using Retinoid Acid Receptor B (Rarb) as the immunoprecipitation target suggests RA regulation of Aldh1a3 and Foxn1 in mice. Our results frame a possible link of RA regulation in brain to ASD etiology, and a feasible non-additive effect of two apparently unrelated variants in ALDH1A3 and FOXN1 recognizing that every result given by next generation sequencing should be cautiously analyzed, as it might be an incidental finding.

  12. Purification, N-terminal amino acid sequence, and some properties of Cu, Zn-superoxide dismutase from Japanese flounder (Paralichthys olivaceus) hepato-pancreas.

    Science.gov (United States)

    Osatomi, K; Masuda, Y; Hara, K; Ishihara, T

    2001-04-01

    Cu, Zn-superoxide dismutase (SOD) has been purified to homogeneity from Japanese flounder Paralichthys olivaceus hepato-pancreas. The purification of the enzyme was carried out by an ethanol/chloroform treatment and acetone precipitation, and then followed by column chromatographies on Q-Sepharose, S-Sepharose and Ultrogel AcA 54. On SDS-PAGE, the purified enzyme gave a single protein band with molecular mass of 17.8 kDa under reducing conditions, and showed approximately equal proportions of 17.8 and 36 kDa molecular mass under non-reducing conditions. Three bands were obtained when the purified enzyme was subjected to native-PAGE, both on protein and activity staining, but the electrophoretic mobility of the purified enzyme differed from that of bovine erythrocyte Cu, Zn-SOD. Isoelectric point values of 5.9, 6.0 and 6.2, respectively, were obtained for the three components. The N-terminal amino acid sequence of the purified enzyme was determined for 25 amino acid residues, and the sequence was compared with other Cu, Zn-SODs. The N-terminal alanine residue was unacetylated, as in the case of swordfish SOD. Above 60 degrees C, the thermostability of the enzyme was much lower than that of bovine Cu, Zn-SOD. PMID:11290457

  13. Distinct features of lamin A-interacting chromatin domains mapped by ChIP-sequencing from sonicated or micrococcal nuclease-digested chromatin.

    Science.gov (United States)

    Lund, Eivind G; Duband-Goulet, Isabelle; Oldenburg, Anja; Buendia, Brigitte; Collas, Philippe

    2015-01-01

    The nuclear lamina has been shown to interact with the genome through lamina-associated domains (LADs). LADs have been identified by DamID, a proximity labeling assay, and more recently by chromatin immunoprecipitation-sequencing (ChIP-seq) of A- and B-type lamins. LADs form megabase-size domains at the nuclear periphery, they are gene-poor and mostly heterochromatic. Here, we show that the mode of chromatin fragmentation for ChIP, namely bath sonication or digestion with micrococcal nuclease (MNase), leads to the discovery of common but also distinct sets of lamin-interacting domains, or LiDs. Using ChIP-seq, we show the existence of lamin A/C (LMNA) LiDs with distinct gene contents, histone composition enrichment and relationships to lamin B1-interacting domains. The extent of genome coverage of lamin A/C (LMNA) LiDs in sonicated or MNase-digested chromatin is similar (∼730 megabases); however over half of these domains are uniquely detected in sonicated or MNase-digested chromatin. Sonication-specific LMNA LiDs are gene-poor and devoid of a broad panel of histone modifications, while MNase-specific LMNA LiDs are of higher gene density and are enriched in H3K9me3, H3K27me3 and in histone variant H2A.Z. LMNB1 LiDs are gene-poor and show no or little enrichment in these marks. Comparison of published LMNB1 DamID LADs with LMNB1 and LMNA LiDs identified here by ChIP-seq further shows that LMNA can associate with 'open' chromatin domains displaying euchromatin characteristics, and which are not associated with LMNB1. The differential genomic and epigenetic properties of lamin-interacting domains reflect the existence of distinct LiD populations identifiable in different chromatin contexts, including nuclease-accessible regions presumably localized in the nuclear interior.

  14. CSTX-9, a toxic peptide from the spider Cupiennius salei: amino acid sequence, disulphide bridge pattern and comparison with other spider toxins containing the cystine knot structure.

    Science.gov (United States)

    Schalle, J; Kämpfer, U; Schürch, S; Kuhn-Nentwig, L; Haeberli, S; Nentwig, W

    2001-09-01

    CSTX-9 (68 residues, 7530.9 Da) is one of the most abundant toxic polypeptides in the venom of the wandering spider Cupiennius salei. The amino acid sequence was determined by Edman degradation using reduced and alkylated CSTX-9 and peptides generated by cleavages with endoproteinase Asp-N and trypsin, respectively. Sequence comparison with CSTX-1, the most abundant and the most toxic polypeptide in the crude spider venom, revealed a high degree of similarity (53% identity). By means of limited proteolysis with immobilised trypsin and RP-HPLC, the cystine-containing peptides of CSTX-9 were isolated and the disulphide bridges were assigned by amino acid analysis, Edman degradation and nanospray tandem mass spectrometry. The four disulphide bonds present in CSTX-9 are arranged in the following pattern: 1-4, 2-5, 3-8 and 6-7 (Cys6-Cys21, Cys13-Cys30, Cys20-Cys48, Cys32-Cys46). Sequence comparison of CSTX-1 with CSTX-9 clearly indicates the same disulphide bridge pattern, which is also found in other spider polypeptide toxins, e.g. agatoxins (omega-AGA-IVA, omega-AGA-IVB, mu-AGA-I and mu-AGA-VI) from Agelenopsis aperta, SNX-325 from Segestria florentina and curtatoxins (CT-I, CT-II and CT-III) from Hololena curta. CSTX-1/CSTX-9 belong to the family of ion channel toxins containing the inhibitor cystine knot structural motif. CSTX-9, lacking the lysine-rich C-terminal tail of CSTX-1, exhibits a ninefold lower toxicity to Drosophila melanogaster than CSTX-1. This is in accordance with previous observations of CSTX-2a and CSTX-2b, two truncated forms of CSTX-1 which, like CSTX-9, also lack the C-terminal lysine-rich tail. PMID:11693532

  15. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe).

    OpenAIRE

    Glasser, S W; Korfhagen, T R; Weaver, T.; Pilot-Matias, T; Fox, J L; Whitsett, J A

    1987-01-01

    Hydrophobic surfactant-associated protein of Mr 6000-14,000 was isolated from ether/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Tyr-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) ...

  16. DNA sequences encoding erythropoietin

    Energy Technology Data Exchange (ETDEWEB)

    Lin, F.K.

    1987-10-27

    A purified and isolated DNA sequence is described consisting essentially of a DNA sequence encoding a polypeptide having an amino acid sequence sufficiently duplicative of that of erythropoietin to allow possession of the biological property of causing bone marrow cells to increase production of reticulocytes and red blood cells, and to increase hemoglobin synthesis or iron uptake.

  17. Solid phase sequencing of biopolymers

    Energy Technology Data Exchange (ETDEWEB)

    Cantor, Charles (Del Mar, CA); Koster, Hubert (La Jolla, CA)

    2010-09-28

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  18. 编码序列和非编码序列的3-tuple分布特征%Features of Coding and Noncoding Sequences Based on 3-tuple Distributions

    Institute of Scientific and Technical Information of China (English)

    傅强; 钱敏平; 陈良标; 朱玉贤

    2005-01-01

    reaches 2/100. The fact that ( a total of 25 introns) from all three different genomes S. pombe, C. elegans and H. sapiens searched are found to share high sequence identity with coding regions indicates that at least some introns may have come directly from CDS(coding sequences). We suggest that SREs may be a useful feature for evolutionary study.

  19. A New Approach to Sequence Analysis Exemplified by Identification of cis-Elements in Abscisic Acid Inducible Promoters

    DEFF Research Database (Denmark)

    Busk, Peter Kamp; Hallin, Peter Fischer; Salomon, Jesper;

    Identification of cis-regulatory elements in DNA promoters by computational approaches are a great help in genome-wide elucidation of signal transduction, transcriptional regulation and gene expression. However, to accomplish this has proven a difficult problem in computational genomics in part due...... the sequences of known cis-elements for promoter prediction4. This is a computational alternative to genome-wide DNA array analysis of gene expression and can be used to confirm the importance of known promoter elements on a genome level. Unfortunately, this approach does not yield information about new cis...... to the redundant nature of cis-regulatory elements1. Moreover, promoter context and chromatin structure have an important influence on the function of cis-elements and does further complicate the identification of functional cis-elements2,3. A related computational approach to promoter studies is to use...

  20. Understanding Legacy Features with Featureous

    DEFF Research Database (Denmark)

    Olszak, Andrzej; Jørgensen, Bo Nørregaard

    2011-01-01

    Java programs called Featureous that addresses this issue. Featureous allows a programmer to easily establish feature-code traceability links and to analyze their characteristics using a number of visualizations. Featureous is an extension to the NetBeans IDE, and can itself be extended by third...

  1. Soluble forms of tumor necrosis factor receptors (TNF-Rs). The cDNA for the type I TNF-R, cloned using amino acid sequence data of its soluble form, encodes both the cell surface and a soluble form of the receptor

    DEFF Research Database (Denmark)

    Nophar, Y; Kemper, O; Brakebusch, C;

    1990-01-01

    in the extracellular domains of the nerve growth factor receptor and the B lymphocytes surface antigen CDw40. The amino acid composition and size of the extracellular domain of the type I TNF-R closely resemble those of TBPI. The COOH-terminal amino acid sequence of the four cysteine rich repeats within...... the extracellular domain of the type I TNF-R matches the COOH-terminal sequence of TBPI. Amino acid sequences in the extracellular domain also fully match other sequences found in TBPI. On the other hand, amino acid sequences in the soluble form of the type II TNF-R (TBPII), while indicating a marked homology...

  2. Permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, a thermoacidophilic sulfur-reducing crenarchaeon isolated from acidic hot springs of Hveravellir, Iceland.

    Science.gov (United States)

    Susanti, Dwi; Johnson, Eric F; Lapidus, Alla; Han, James; Reddy, T B K; Pilay, Manoj; Ivanova, Natalia N; Markowitz, Victor M; Woyke, Tanja; Kyrpides, Nikos C; Mukhopadhyay, Biswarup

    2016-01-01

    This report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional mean for comparative genomic analysis of desulfurococci. In addition, our analysis on the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.

  3. RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

    Science.gov (United States)

    Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

    2015-01-01

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition. PMID:25860951

  4. RNA Sequencing Identifies Upregulated Kyphoscoliosis Peptidase and Phosphatidic Acid Signaling Pathways in Muscle Hypertrophy Generated by Transgenic Expression of Myostatin Propeptide

    Directory of Open Access Journals (Sweden)

    Yuanxin Miao

    2015-04-01

    Full Text Available Myostatin (MSTN, a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph, and zinc metallopeptidase STE24 (Zmpste24. In addition, kyphoscoliosis peptidase (Ky, which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA pathways (Dgki, Dgkz, Plcd4 were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  5. RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

    Science.gov (United States)

    Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

    2015-04-09

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  6. Uses of Phage Display in Agriculture: Sequence Analysis and Comparative Modeling of Late Embryogenesis Abundant Client Proteins Suggest Protein-Nucleic Acid Binding Functionality

    Directory of Open Access Journals (Sweden)

    Rekha Kushwaha

    2013-01-01

    Full Text Available A group of intrinsically disordered, hydrophilic proteins—Late Embryogenesis Abundant (LEA proteins—has been linked to survival in plants and animals in periods of stress, putatively through safeguarding enzymatic function and prevention of aggregation in times of dehydration/heat. Yet despite decades of effort, the molecular-level mechanisms defining this protective function remain unknown. A recent effort to understand LEA functionality began with the unique application of phage display, wherein phage display and biopanning over recombinant Seed Maturation Protein homologs from Arabidopsis thaliana and Glycine max were used to retrieve client proteins at two different temperatures, with one intended to represent heat stress. From this previous study, we identified 21 client proteins for which clones were recovered, sometimes repeatedly. Here, we use sequence analysis and homology modeling of the client proteins to ascertain common sequence and structural properties that may contribute to binding affinity with the protective LEA protein. Our methods uncover what appears to be a predilection for protein-nucleic acid interactions among LEA client proteins, which is suggestive of subcellular residence. The results from this initial computational study will guide future efforts to uncover the protein protective mechanisms during heat stress, potentially leading to phage-display-directed evolution of synthetic LEA molecules.

  7. Evaluation of crop residues on potassium kinetics in an acid soil and potassium use efficiency in potato-garlic sequence using tracer 86Rb

    International Nuclear Information System (INIS)

    Greenhouse and laboratory studies were conducted on an acid soil in order to evaluate the role of two crop residues i.e. paddy and wheat along with farmyard manure on potassium kinetics and its availability in the potato-garlic sequence using tracer 86Rb. Under rapid equilibrium, application of crop residues of paddy, wheat straw and FYM were able to enhance soil pH and organic carbon content. In addition, their application helped in enhancing soil K availability indices like water soluble, available and non-exchangeable -K. This was further augmented by the Q/I studies using 86Rb where application of organic residues helped in lowering the potassium buffering capacity of the soil. Greenhouse study supplemented the results obtained from laboratory study where application of crop residues/FYM were able to improve the potato yield significantly and maintained higher concentration of K in potato leaf at early growth stages. A significant correlation was obtained between leaf K and haulms-K with that of 86Rb activities in potato leaf at 35 days and 86Rb absorbed in the haulms, respectively. Residues/ FYM and PK application to potato left sufficient residual effect on succeeding garlic crop. In potato-garlic sequence, K recovery was highest with FYM while N and P recoveries were higher with wheat residues. The nutrient recoveries with PK application followed law of diminishing returns. (author)

  8. Purification, characterization and N-terminal amino acid sequence of a new major allergen from European chestnut pollen--Cas s 1.

    Science.gov (United States)

    Kos, T; Hoffmann-Sommergruber, K; Ferreira, F; Hirschwehr, R; Ahorn, H; Horak, F; Jager, S; Sperr, W; Kraft, D; Scheiner, O

    1993-11-15

    Pollens from trees of the order Fagales (e.g. birch, alder, hazel, and hornbeam) all contain one major allergen--the main cause for tree pollen allergy. So far the major allergens from birch (Bet v 1), alder (Aln g 1), hazel (Cor a 1), and hornbeam (Car b 1) have been characterized, showing high sequence similarity with each other (1-4). We present the molecular and immunologic characterization of Cas s 1, the major allergen from the European chestnut (Castanea sativa). From aqueous pollen extracts from European chestnut a protein was purified to homogeneity and was subjected to further investigation. The protein revealed a Mr of 22 kDa and was shown to represent the major allergen of the European chestnut (immunoblotting, histamine release) and designated Cas s 1. Despite a marked difference in Mr, Cas s 1 shows significant amino acid sequence similarity at the N-terminus and is antigenically closely related to the major birch pollen allergen Bet v 1 (17 kDa), as shown by binding to the anti-Bet v 1 monoclonal antibody BIP-1 and by IgE-inhibition tests using recombinant Bet v 1.

  9. Identification and sequence analysis of pWcMBF8-1, a bacteriocin-encoding plasmid from the lactic acid bacterium Weissella confusa.

    Science.gov (United States)

    Malik, Amarila; Sumayyah, Sumayyah; Yeh, Chia-Wen; Heng, Nicholas C K

    2016-04-01

    Members of the Gram-positive lactic acid bacteria (LAB) are well-known for their beneficial properties as starter cultures and probiotics. Many LAB species produce ribosomally synthesized proteinaceous antibiotics (bacteriocins). Weissella confusa MBF8-1 is a strain isolated from a fermented soybean product that not only produces useful exopolysaccharides but also exhibits bacteriocin activity, which we call weissellicin MBF. Here, we show that bacteriocin production by W. confusa MBF8-1 is specified by a large plasmid, pWcMBF8-1. Plasmid pWcMBF8-1 (GenBank accession number KR350502), which was identified from the W. confusa MBF8-1 draft genome sequence, is 17 643 bp in length with a G + C content of 34.8% and contains 25 open reading frames (ORFs). Six ORFs constitute the weissellicin MBF locus, encoding three putative double-glycine-motif peptides (Bac1, Bac2, Bac3), an ABC transporter complex (BacTE) and a putative immunity protein (BacI). Two ORFs encode plasmid partitioning and mobilization proteins, suggesting that pWcMBF8-1 is transferable to other hosts. To the best of our knowledge, plasmid pWcMBF8-1 not only represents the first large Weissella plasmid to be sequenced but also the first to be associated with bacteriocin production in W. confusa. PMID:26976853

  10. Clinical Features of Hepatocellular Carcinoma and Observation on the Images of MRI Sequence Scanning%肝细胞肝癌临床特征及MRI各序列扫描的图像观察

    Institute of Scientific and Technical Information of China (English)

    孙永华; 陈江; 梁志刚; 腾云

    2016-01-01

    Objective To analyze the clinical features of hepatocellular carcinoma and to evaluate the characteristics of images and diagnostic value of MRI sequence scanning in hepatocellular carcinoma.Methods The clinical data of 46 patients with hepatocellular carcinoma treated in our hospital from April 2013 to April 2015 were analyzed retrospectively. After admission, all patients underwent MRI three phase scan and examination and the imaging data were complete. The clinical features of hepatocellular carcinoma and the application value of MRI in clinical diagnosis were summarized.Results There were 50 lesions in 46 patients with hepatocellular carcinoma, including 23 highly differentiated lesions, 13 moderately differentiated lesions and 14 poorly differentiated lesions; 30 lesions were located in the right lobe of liver and 20 in the left lobe of the liver; T1WI sequence scan showed 36 slightly lower signal, 8 equal signal and 6 slightly higher signal; T2WI sequence showed 44 slightly higher signal, including 4 of the lesions showed higher signal and 6 lesions showed equal signal; DWI sequence showed 30 higher signal, 15 slightly higher signal and 5 equal signal.Conclusion MRI sequence scan has its own advantages in showing the size, internal structure, edge and blood supply of hepatocellular carcinoma lesions. It is of important value in determination of the differentiation degree of hepatocellular carcinoma tissues. It can also show the characteristics of lesion enhancement and micro-morphological characteristics and provide reference for the treatment and prognosis.%目的 分析肝细胞肝癌的临床特征,评估MRI各序列扫描肝细胞肝癌的图像特点及诊断价值.方法 回顾性分析我院自2013年4月-2015年4月收治的46例肝细胞肝癌患者的临床资料,所有患者入院后均接受MRI三期扫描与检查,且影像学资料完整,总结肝细胞肝癌临床特点及MRI在其临床诊断中的应用价值.结果 46

  11. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe)

    Energy Technology Data Exchange (ETDEWEB)

    Glasser, S.W.; Korfhagen, T.R.; Weaver, T.; Pilot-Matias, T.; Fox, J.L.; Whitsett, J.A.

    1987-06-01

    Hydrophobic surfactant-associated protein of M/sub r/ 6000-14,000 was isolated from either/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Try-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) from bovine pulmonary surfactant recognized protein of M/sub r/ 6000-14,000 in immunoblot analysis and was used to screen a lambdagt11 expression library constructed from adult human lung poly(A)/sup +/ RNA. This resulted in identification of a 1.4-kilobase cDNA clone that was shown to encode the N-terminus of the surfactant polypeptide SPL(Phe) (Phe-Pro-Ile-Pro-Leu-Pro-) within an open reading frame for a larger protein. Expression of a fused ..beta..-galactosidase-SPL (Phe) gene in Escherichia coli yielded an immunoreactive M/sub r/ 34,000 fusion peptide. Hybrid-arrested translation with the cDNA and immunoprecipitation of (/sup 35/S)methionine-labeled in vitro translation products of human poly(A)/sup +/ RNA with a surfactant polyclonal antibody resulted in identification of a M/sub r/ 40,000 precursor protein. Blot hybridization analysis of electrophoretically fractionated RNA from human lung detected a 2.0-kilobase RNA that was more abundant in adult lung than in fetal lung. These proteins, and specifically SPL(Phe), may therefore be useful for synthesis of replacement surfactants for treatment of hyaline membrane disease in newborn infants or of other surfactant-deficient states.

  12. Production of bio-hydrogen by mesophilic anaerobic fermentation in an acid-phase sequencing batch reactor.

    Science.gov (United States)

    Cheong, Dae-Yeol; Hansen, Conly L; Stevens, David K

    2007-02-15

    The pH and hydraulic retention time (HRT) of an anaerobic sequencing batch reactor (ASBR) were varied to optimize the conversion of carbohydrate-rich synthetic wastewater into bio-hydrogen. A full factorial design using evolutionary operation (EVOP) was used to determine the effect of the factors and to find the optimum condition of each factor required for high hydrogen production rate. Experimental results from 20 runs indicate that a maximum hydrogen production rate of 4,460-5,540 mL/L/day under the volumetric organic loading rate (VOLR) of 75 g-COD/L/day obtained at an observed design point of HRT = 8 h and pH = 5.7. The hydrogen production rate was strongly dependent on the HRT, and the effect was statistically significant (P 0.05) was found for the pH on the hydrogen production rate. When the ASBR conditions were set for a maximum hydrogen production rate, the hydrogen production yield and specific hydrogen production rate were 60-74 mL/g-COD and 330-360 mL/g-VSS/day, respectively. The hydrogen composition was 43-51%, and no methanogenesis was observed. Acetate, propionate, butyrate, valerate, caproate, and ethanol were major liquid intermediate metabolites during runs of this ASBR. The dominant fermentative types were butyrate-acetate or ethanol-acetate, representing the typical anaerobic pathway of Clostridium species. This hydrogen-producing ASBR had a higher hydrogen production rate, compared with that produced using continuous-flow stirred tank reactors (CSTRs). This study suggests that the hydrogen-producing ASBR is a promising bio-system for prolonged and stable hydrogen production.

  13. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe)

    International Nuclear Information System (INIS)

    Hydrophobic surfactant-associated protein of M/sub r/ 6000-14,000 was isolated from either/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Try-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) from bovine pulmonary surfactant recognized protein of M/sub r/ 6000-14,000 in immunoblot analysis and was used to screen a λgt11 expression library constructed from adult human lung poly(A)+ RNA. This resulted in identification of a 1.4-kilobase cDNA clone that was shown to encode the N-terminus of the surfactant polypeptide SPL(Phe) (Phe-Pro-Ile-Pro-Leu-Pro-) within an open reading frame for a larger protein. Expression of a fused β-galactosidase-SPL (Phe) gene in Escherichia coli yielded an immunoreactive M/sub r/ 34,000 fusion peptide. Hybrid-arrested translation with the cDNA and immunoprecipitation of [35S]methionine-labeled in vitro translation products of human poly(A)+ RNA with a surfactant polyclonal antibody resulted in identification of a M/sub r/ 40,000 precursor protein. Blot hybridization analysis of electrophoretically fractionated RNA from human lung detected a 2.0-kilobase RNA that was more abundant in adult lung than in fetal lung. These proteins, and specifically SPL(Phe), may therefore be useful for synthesis of replacement surfactants for treatment of hyaline membrane disease in newborn infants or of other surfactant-deficient states

  14. FASMA: A Service to Format and Analyze Sequences in Multiple Alignments

    Institute of Scientific and Technical Information of China (English)

    Susan; Costantini; Giovanni; Colonna; Angelo; M.; Facchiano

    2007-01-01

    Multiple sequence alignments are successfully applied in many studies for under- standing the structural and functional relations among single nucleic acids and pro- tein sequences as well as whole families. Because of the rapid growth of sequence databases, multiple sequence alignments can often be very large and difficult to visualize and analyze. We offer a new service aimed to visualize and analyze the multiple alignments obtained with different external algorithms, with new features useful for the comparison of the aligned sequences as well as for the creation of a final image of the alignment. The service is named FASMA and is available at http: //bioinformatica.isa.cnr.it /FASMA /.

  15. Purification and partial amino acid sequences of a new presynaplic toxin and a cytotoxin from venom of pit veper Agkistrodon blomhoffii brevicaudus

    Institute of Scientific and Technical Information of China (English)

    吉永华; 陆丽芳; 徐科; Hiroyuki Hattori; Susumu Terakawa

    1995-01-01

    The technique of the reverse-phase performance liquid chromatography (RP-HPLC) was employed to separate and purify the toxic proteins from the venom of Agkistrodon blomhoffii brevicaudus collected in China 3 toxic proteins marked as AgTx-1, AgTx-2 and AgTx-3 consisting of about 122 amino acid residues were screened The toxicities (LD50,) of the AgTx-1, AgTx-2 and AgTx-3 were 0.075, 0.51 and 6.6 mg per kg weight of mice respectively. Toxicological experiment in the chick biventer cervicis nerve-muscle preparation showed that the acetylcholine (Ach) sensitivity of the preparation was unchanged after the total failure of the indirect contraction caused by AgTx-1 and AgTx-2. suggesting that they were presynaptic blockers, namely β-type of snake toxins. However, the amplitude of indirect contraction of the preparation was gradually reduced due to its incomplete relaxation caused by AgTx-3, indicating that it should belong to the category of cytotoxins. The partial amino acid sequences of 3 toxins have been

  16. Interconnection between the protein solubility and amino acid and dipeptide compositions.

    Science.gov (United States)

    Niu, Xiaohui; Li, Nana; Chen, Dinyan; Wang, Zengzhen

    2013-01-01

    Obtaining soluble proteins in sufficient concentrations helps increase the overall success rate in various experimental studies. Protein solubility is an individual trait ultimately determined by its primary protein sequence. Exploring the interconnection between the protein solubility and the compositions of protein sequence is instrumental for setting priorities on targets in large scale proteomics projects. In this paper, amino acid composition (20 dimensions) and the dipeptide composition (400 dimensions) were extracted to form the total candidate feature pool (420 dimensions), and each feature was selected into the feature vectors one by one, which were sorted by the absolute value of the correlation coefficient. Finally, we evaluated and recorded the 420 results of Support Vector Machine (SVM) as the prediction engine. According to the results of SVM, the first 208 features were chosen from the 420 dimensions, which were considered as the efficient ones. By analyzing the composition of the former 208 features, we found that the protein solubility was significantly influenced by the occurrence frequencies of the acidic amino acids, basic amino acids, non-polar hydrophobic amino acids and the two polar neutral amino acids(C, Q) in the protein sequences. Additionally, we detected that the dipeptides composed by the acidic amino acids (D, E) and basic amino acids (K, R and H), especially the dipeptide composed by the acidic amino acids (D, E), had strong interconnection with the protein solubility.

  17. Amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by streptococcal IgA1 proteases.

    Science.gov (United States)

    Batten, Margaret R; Senior, Bernard W; Kilian, Mogens; Woof, Jenny M

    2003-03-01

    The amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by IgA1 proteases of different species of Streptococcus were investigated. Recombinant IgA1 antibodies were generated with point mutations at proline 227 and threonine 228, the residues lying on either side of the peptide bond at which all streptococcal IgA1 proteases cleave wild-type human IgA1. The amino acid substitutions produced no major effect upon the structure of the mutant IgA1 antibodies or their functional ability to bind to Fcalpha receptors. However, the substitutions had a substantial effect upon sensitivity to cleavage with some streptococcal IgA1 proteases, with, in some cases, a single point mutation rendering the antibody resistant to a particular IgA1 protease. This effect was least marked with the IgA1 protease from Streptococcus pneumoniae, which showed no absolute requirement for either proline or threonine at residues 227 to 228. By contrast, the IgA1 proteases of Streptococcus oralis, Streptococcus sanguis, and Streptococcus mitis had an absolute requirement for proline at 227 but not for threonine at 228, which could be replaced by valine. There was evidence in S. mitis that proteases from different strains may have different amino acid requirements for cleavage. Remarkably, some streptococcal proteases appeared able to cleave the hinge at a distant alternative site if substitution prevented efficient cleavage of the original site. Hence, this study has identified key residues required for the recognition of the IgA1 hinge as a substrate by streptococcal IgA1 proteases, and it marks a preliminary step towards development of specific enzyme inhibitors. PMID:12595464

  18. The Chaotic Structure of Bacterial Virulence Protein Sequences

    Directory of Open Access Journals (Sweden)

    Sevdanur Genc

    2015-01-01

    Full Text Available Bacterial virulence proteins, which have been class ified on structure of virulence, causes several diseases. For instance, Adhesins play an important role in th e host cells. They are inserted DNA sequences for a variety of virulence properties. Several important methods conducted for the prediction of bacterial virulence proteins for finding new drugs or vaccines. In this study, we propose a method for feature sele ction about classification of bacterial virulence protein. The features are constituted dir ectly from the amino acid sequence of a given protein. Amino acids form proteins, which are criti cal to life, and have many important functions in living cells. They occurring with diff erent physicochemical properties by a vector of 20 numerical values, and collected in AAIndex datab ases of known 544 indices. For all that, this approach have two steps. Firstly , the amino acid sequence of a given protein analysed with Lyapunov Exponents that they have a chaotic structure in accordance wi th the chaos theory. After that, if the results show chara cterization over the complete distribution in the phase space from the point of deterministic sys tem, it means related protein will show a chaotic structure. Empirical results revealed that generated feature v ectors give the best performance with chaotic structure of physicochemical features of amino acid s with Adhesins and non-Adhesins data sets.

  19. [ANALYSIS OF ARACHIDONIC ACID RELATIVE CONTENT CHANGES IN ERYTHROCYTES AND PLATELETS PHOSPHOLIPIDS MEMBRANES FEATURES IN CORONARY HEART DISEASE WITH ATRIAL FIBRILLATION PATIENTS].

    Science.gov (United States)

    Lizogub, V G; Zavalska, T V; Merkulova, I O; Bryuzgina, T S

    2015-01-01

    Erythrocytes and platelets phospholipid membranes fatty acid spectrum was detected in coronary heart disease and atrial fibrillation patients and in patients with coronary heart disease without atrial fibrillation. 87 patients were investigated. Significant decrease in the arachidonic acid relative content in coronary heart disease patients compared with healthy individuals was related. As well as a significant decrease in the arachidonic acid relative content in coronary heart disease and atrial fibrillation patients compared with coronary heart disease patients without atrial fibrillation was related too. These dates may indicate that decreasing relative content arachidonic acid can be possible pathogenetic link in the development of arrhythmias.

  20. Improving the classification of nuclear receptors with feature selection.

    Science.gov (United States)

    Gao, Qing-Bin; Jin, Zhi-Chao; Ye, Xiao-Fei; Wu, Cheng; Lu, Jian; He, Jia

    2009-01-01

    Nuclear receptors are involved in multiple cellular signaling pathways that affect and regulate processes. Because of their physiology and pathophysiology significance, classification of nuclear receptors is essential for the proper understanding of their functions. Bhasin and Raghava have shown that the subfamilies of nuclear receptors are closely correlated with their amino acid composition and dipeptide composition [29]. They characterized each protein by a 400 dimensional feature vector. However, using high dimensional feature vectors for characterization of protein sequences will increase the computational cost as well as the risk of overfitting. Therefore, using only those features that are most relevant to the present task might improve the prediction system, and might also provide us with some biologically useful knowledge. In this paper a feature selection approach was proposed to identify relevant features and a prediction engine of support vector machines was developed to estimate the prediction accuracy of classification using the selected features. A reduced subset containing 30 features was accepted to characterize the protein sequences in view of its good discriminative power towards the classes, in which 18 are of amino acid composition and 12 are of dipeptide composition. This reduced feature subset resulted in an overall accuracy of 98.9% in a 5-fold cross-validation test, higher than 88.7% of amino acid composition based method and almost as high as 99.3% of dipeptide composition based method. Moreover, an overall accuracy of 93.7% was reached when it was evaluated on a blind data set of 63 nuclear receptors. On the other hand, an overall accuracy of 96.1% and 95.2% based on the reduced 12 dipeptide compositions was observed simultaneously in the 5-fold cross-validation test and the blind data set test, respectively. These results demonstrate the effectiveness of the present method. PMID:19601913

  1. Prediction of human protein function from post-translational modifications and localization features

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Blom, Nikolaj;

    2002-01-01

    a number of functional attributes that are more directly related to the linear sequence of amino acids, and hence easier to predict, than protein structure. These attributes include features associated with post-translational modifications and protein sorting, but also much simpler aspects...

  2. Solid phase sequencing of biopolymers

    Energy Technology Data Exchange (ETDEWEB)

    Cantor, Charles R.; Hubert, Koster

    2014-06-24

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Probes may be affixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  3. Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

    Science.gov (United States)

    Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

    2003-01-01

    A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171

  4. Specific features complexation of copper(2), manganese(2) and gadolinium(3) with salicylic, benzoic and sulfosalicylic acids in aqueous solutions of nonionic surfactant

    International Nuclear Information System (INIS)

    Impact of nonionic surfactant Triton-X-100 (TX) on acid-basic and complex-forming properties of benzoic (HR) and sulfosalicylic (H3X) acids is studied through the pH-metric titration and NMR-relaxation methods. The H3X acidic properties in water and in the H surfactant solution are practically similar. Significant increase in the proton relaxation rate values is observed in the solutions of the salicylic acid and TX mixtures by ions presence. The complexes of the [Gd(HL)3(TX)2] composition with lgK 0.22±0.05 are established for the Gd(3). The benzoic acid forms ternary particles in presence of Gadolinium(3): [GdR(TX)] with lgK = 2.17±0.04

  5. On the activation of bovine plasma factor XIII. Amino acid sequence of the peptide released by thrombin and the terminal residues of the subunit polypeptides.

    Science.gov (United States)

    Nakamura, S; Iwanaga, S; Suzuki, T

    1975-12-01

    A blood coagulation factor, Factor XIII, was highly purified from bovine fresh plasma by a method similar to those used for human plasma Factor XIII. The isolated Factor XIII consisted of two subunit polypeptides, a and b chains, with molecular weights of 79,000 +/- 2,000 and 75,000 +/- 2,000, respectively. In the conversion of Factor XIII to the active enzyme, Factor XIIIa, by bovine thrombin [EC 3.4.21.5], a peptide was liberated. This peptide, designated tentatively as "activation peptide," was isolated by gel-filtration on a Sephadex G-75 column. It contained a total of 37 amino acid residues with a masked N-terminal residue and C-terminal arginine. The whole amino acid sequence of "Activation peptide" was established by the dansyl-Edman method and standard enzymatic techniques, and the masked N-terminal residue was identified as N-acetylserine by using a rat liver acylamino acid-releasing enzyme. This enzyme specifically cleaved the N-acetylserylglutamyl peptide bond serine and the remaining peptide, which was now reactive to 1-dimethylamino-naphthalene-5-sulfonyl chloride. A comparison of the sequences of human and bovine "Activation peptide" revealed five amino acids replacements, Ser-3 to Thr; Gly-5 to Arg; Ile-14 to Val; Thr-18 to Asn, and Pro-26 to Leu. Another difference was the deletion of Leu-34 in the human peptide. Adsorption chromatography on a hydroxylapatite column in the presence of 0.1% sodium dodecyl sulfate was developed as a preparative procedure for the resolution of the two subunit polypeptides, a or a' chain and b chain, constituting the protein molecule of Factor XIII or Factor XIIIa. End group analyses on the isolated pure chains revealed that the structural change of Factor XIII during activation with thrombin occurs only in the N-terminal portion of the a chain, not in the N-terminal end of the b chain or in the C-terminal ends of the a and b chains. From these results, it was concluded that the activation of bovine plasma Factor XIII

  6. Sequence Searcher: A Java tool to perform regular expression and fuzzy searches of multiple DNA and protein sequences

    Directory of Open Access Journals (Sweden)

    Upton Chris

    2009-01-01

    Full Text Available Abstract Background Many sequence-searching tools have limiting factors for their use. For example, they may be platform specific, enforce restrictive size limits and sequences to be searched, or only allow searches of one of DNA or protein. Findings We present an easy-to-use, fast, platform-independent tool to search for amino acid or nucleotide patterns within one or many protein or nucleic acid sequences. The user can choose to search for regular expressions or perform a fuzzy search in which a particular number of errors is accepted during matching of a sequence. Positions of mismatches in fuzzy searches are displayed graphically the user. Conclusion SeqS provides an improved feature set and functions as a stand-alone tool or could be integrated into other bioinformatics platforms.

  7. Genome Sequences of the High-Acetic Acid-Resistant Bacteria Gluconacetobacter europaeus LMG 18890T and G. europaeus LMG 18494 (Reference Strains), G. europaeus 5P3, and Gluconacetobacter oboediens 174Bp2 (Isolated from Vinegar) ▿

    OpenAIRE

    Andrés-Barrao, Cristina; Falquet, Laurent; Sandra P Calderon-Copete; Descombes, Patrick; Ortega Pérez, Ruben; Barja, François

    2011-01-01

    Bacteria of the genus Gluconacetobacter are usually involved in the industrial production of vinegars with high acetic acid concentrations. We describe here the genome sequence of three Gluconacetobacter europaeus strains, a very common bacterial species from industrial fermentors, as well as of a Gluconacetobacter oboediens strain.

  8. Domain structures and molecular evolution of class I and class II major histocompatibility gene complex (MHC) products deduced from amino acid and nucleotide sequence homologies

    Science.gov (United States)

    Ohnishi, Koji

    1984-12-01

    Domain structures of class I and class II MHC products were analyzed from a viewpoint of amino acid and nucleotide sequence homologies. Alignment statistics revealed that class I (transplantation) antigen H chains consist of four mutually homologous domains, and that class II (HLA-DR) antigen β and α chains are both composed of three mutually homologous ones. The N-terminal three and two domains of class I and class II (both β and α) gene products, respectively, all of which being ˜90 residues long, were concluded to be homologous to β2-microglobulin (β2M). The membraneembedded C-terminal shorter domains of these MHC products were also found to be homologous to one another and to the third domain of class I H chains. Class I H chains were found to be more closely related to class II α chains than to class II β chains. Based on these findings, an exon duplication history from a common ancestral gene encoding a β2M-like primodial protein of one-domain-length up to the contemporary MHC products was proposed.

  9. PhTX-II a Basic Myotoxic Phospholipase A2 from Porthidium hyoprora Snake Venom, Pharmacological Characterization and Amino Acid Sequence by Mass Spectrometry

    Directory of Open Access Journals (Sweden)

    Salomón Huancahuire-Vega

    2014-10-01

    Full Text Available A monomeric basic PLA2 (PhTX-II of 14149.08 Da molecular weight was purified to homogeneity from Porthidium hyoprora venom. Amino acid sequence by in tandem mass spectrometry revealed that PhTX-II belongs to Asp49 PLA2 enzyme class and displays conserved domains as the catalytic network, Ca2+-binding loop and the hydrophobic channel of access to the catalytic site, reflected in the high catalytic activity displayed by the enzyme. Moreover, PhTX-II PLA2 showed an allosteric behavior and its enzymatic activity was dependent on Ca2+. Examination of PhTX-II PLA2 by CD spectroscopy indicated a high content of alpha-helical structures, similar to the known structure of secreted phospholipase IIA group suggesting a similar folding. PhTX-II PLA2 causes neuromuscular blockade in avian neuromuscular preparations with a significant direct action on skeletal muscle function, as well as, induced local edema and myotoxicity, in mice. The treatment of PhTX-II by BPB resulted in complete loss of their catalytic activity that was accompanied by loss of their edematogenic effect. On the other hand, enzymatic activity of PhTX-II contributes to this neuromuscular blockade and local myotoxicity is dependent not only on enzymatic activity. These results show that PhTX-II is a myotoxic Asp49 PLA2 that contributes with toxic actions caused by P. hyoprora venom.

  10. Indole acetic acid production by fluorescent Pseudomonas spp. from the rhizosphere of Plectranthus amboinicus (Lour.) Spreng. and their variation in extragenic repetitive DNA sequences.

    Science.gov (United States)

    Sethia, Bedhya; Mustafa, Mariam; Manohar, Sneha; Patil, Savita V; Jayamohan, Nellickal Subramanian; Kumudini, Belur Satyan

    2015-06-01

    Fluorescent Pseudomonas (FP) is a heterogenous group of growth promoting rhizobacteria that regulate plant growth by releasing secondary metabolic compounds viz., indole acetic acid (IAA), siderophores, ammonia and hydrogen cyanide. In the present study, IAA producing FPs from the rhizosphere of Plectranthus amboinicus were characterized morphologically, biochemically and at the molecular level. Molecular identification of the isolates were carried out using Pseudomonas specific primers. The effect of varying time (24, 48, 72 and 96 h), Trp concentrations (100, 200, 300, 400 and 500 μg x ml(-1)), temperature (10, 26, 37 and 50 ± 2 degrees C) and pH (6, 7 and 8) on IAA production by 10 best isolates were studied. Results showed higher IAA production at 72 h incubation, at 300 μg x ml(-1) Trp concentration, temperature 26 ± 2 degrees C and pH 7. TLC with acidified ethyl acetate extract showed that the IAA produced has a similar Rf value to that of the standard IAA. Results of TLC were confirmed by HPLC analysis. Genetic diversity of the isolates was also studied using 40 RAPD and 4 Rep primers. Genetic diversity parameters such as dominance, Shannon index and Simpson index were calculated. Out of 40 RAPD primers tested, 9 (2 OP-D series and 7 OP-E series) were shortlisted for further analysis. Studies using RAPD, ERIC, BOX, REP and GTG5 primers revealed that isolates exhibit significant diversity in repetitive DNA sequences irrespective of the rhizosphere. PMID:26155673

  11. Triplex formation at single-stranded nucleic acid target sites of unrestricted sequence by two added strands of oligonucleotides: A proposed model

    Energy Technology Data Exchange (ETDEWEB)

    Trapane, T.L.; Ts' o, P.O.P. (John Hopkins Univ., Baltimore, MD (United States))

    1994-11-16

    By using the standard purine nucleosides, guanosine and adenosine, and the pyrimidine C-nucleosides, pseudoisocytidine and pseudouridine, as complements on a probe strand, it is possible to construct a regular Watson-Crick helix with a single-stranded target sequence having any arrangement of the four naturally-occurring bases found in nucleic acids. The major groove of this helix will have a unique configuration of hydrogen-bonding sites on the probe strand for each of these four base pairs. By using this duplex as a framework, an ensemble of recognition patterns composed of base triads may be constructed. In these patterns, either a homopyrimidine or homopurine third strand binds in the major groove of the duplex formed by the target and probe strands. Ten distinct geometries, or motifs, are shown, each one consisting of four isomorphic base triads built upon recognition of C, G, A, or U(T) residues in the target strand. In order to maintain specific hydrogen bonding and to construct isomorphous triads, the use of several nonstandard bases is proposed. 33 refs., 9 figs., 3 tabs.

  12. Solar Features

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Collection includes a variety of solar feature datasets contributed by a number of national and private solar observatories located worldwide.

  13. Further considerations about the ophthalmic features of the Möbius sequence, with data of 28 cases Novas considerações sobre o quadro clínico da seqüência de Möbius com dados de 28 casos

    OpenAIRE

    Carlos Ramos Souza-Dias; Mauro Goldchmit

    2007-01-01

    PURPOSE: There is no uniformity in the literature about the core features required to make the diagnosis of Möbius sequence. Originally, the minimum requirements were the bilateral paralysis of the VI and the VII cranial nerves. The bilateral facial nerve paralysis or paresis, often asymmetric, is common to all patients but some facts show that the isolated VI nerve palsy in the Möbius sequence is not the rule. 1) When there is an esotropia in Möbius sequence, it is often too small to be caus...

  14. Design features of a sulphuric acid plant based on lead and zinc sintering machine off-gas%硫酸用新型耐蚀合金的研究与开发

    Institute of Scientific and Technical Information of China (English)

    刘焕安

    2001-01-01

    Design features of a 150kt/a sulphuric acid plant based on lead and zinc sintering machine off-gases are described. The plant adopted a single absorption technology including closed dilute-acid-scrubbing gas cleaning and ammonia-acid off-gas treatment. The dilute acid settling system,cooling water circulation system, installation of electrostatic precipitator, high-temperature absorption technology and acid distributor of drying and absorption section, and preheater, hot bypass and insulation of conversion section are emphasized in detail.%论述硫酸对金属腐蚀的特殊性和合金设计的基本原理。介绍高温浓硫酸用高硅不锈钢HD-1、合金球墨铸铁HD-3以及稀硫酸用高钼含氮奥氏体不锈钢HD-7、HD-11的研究开发和应用范围。

  15. Temporal Feature Integration for Music Organisation

    OpenAIRE

    Meng, Anders; Larsen, Jan; Hansen, Lars Kai

    2006-01-01

    This Ph.D. thesis focuses on temporal feature integration for music organisation. Temporal feature integration is the process of combining all the feature vectors of a given time-frame into a single new feature vector in order to capture relevant information in the frame. Several existing methods for handling sequences of features are formulated in the temporal feature integration framework. Two datasets for music genre classification have been considered as valid test-beds for music organisa...

  16. Pseudomonas lini Strain ZBG1 Revealed Carboxylic Acid Utilization and Copper Resistance Features Required for Adaptation to Vineyard Soil Environment: A Draft Genome Analysis

    Science.gov (United States)

    Chan, Kok-Gan; Chong, Teik-Min; Adrian, Tan-Guan-Sheng; Kher, Heng Leong; Grandclément, Catherine; Faure, Denis; Yin, Wai-Fong; Dessaux, Yves; Hong, Kar-Wai

    2016-01-01

    Pseudomonas lini strain ZBG1 was isolated from the soil of vineyard in Zellenberg, France and the draft genome was reported in this study. Bioinformatics analyses of the genome revealed presence of genes encoding tartaric and malic acid utilization as well as copper resistance that correspond to the adaptation this strain in vineyard soil environment.

  17. Pseudomonas lini Strain ZBG1 Revealed Carboxylic Acid Utilization and Copper Resistance Features Required for Adaptation to Vineyard Soil Environment: A Draft Genome Analysis

    Science.gov (United States)

    Chan, Kok-Gan; Chong, Teik-Min; Adrian, Tan-Guan-Sheng; Kher, Heng Leong; Grandclément, Catherine; Faure, Denis; Yin, Wai-Fong; Dessaux, Yves; Hong, Kar-Wai

    2016-01-01

    Pseudomonas lini strain ZBG1 was isolated from the soil of vineyard in Zellenberg, France and the draft genome was reported in this study. Bioinformatics analyses of the genome revealed presence of genes encoding tartaric and malic acid utilization as well as copper resistance that correspond to the adaptation this strain in vineyard soil environment. PMID:27512520

  18. A unique insertion of low complexity amino acid sequence underlies protein-protein interaction in human malaria parasite orotate phosphoribosyltransferase and orotidine 5'-monophosphate decarboxylase

    Institute of Scientific and Technical Information of China (English)

    Waranya Imprasittichai; Sittiruk Roytrakul; Sudaratana R Krungkrai; Jerapan Krungkrai

    2014-01-01

    Objective:To investigate the multienzyme complex formation of human malaria parasite Plasmodium falciparum(P. falciparum) orotate phosphoribosyltransferase(OPRT) and orotidine 5'-monophosphate decarboxylase(OMPDC), the fifth and sixth enzyme of the de novo pyrimidine biosynthetic pathway.Previously, we have clearly established that the two enzymes in the malaria parasite exist physically as a heterotetrameric(OPRT)2(OMPDC)2 complex containing two subunits each ofOPRT andOMPDC, and that the complex have catalytic kinetic advantages over the monofunctional enzyme.Methods:Both enzymes were cloned and expressed as recombinant proteins.The protein-protein interaction in the enzyme complex was identified using bifunctional chemical cross-linker, liquid chromatography-mass spectrometric analysis and homology modeling.Results:The unique insertions of low complexity region at the α2 and α5 helices of the parasiteOMPDC, characterized by single amino acid repeat sequence which was not found in homologous proteins from other organisms, was located on theOPRT-OMPDC interface.The structural models for the protein-protein interaction of the heterotetrameric(OPRT)2(OMPDC)2 multienzyme complex were proposed.Conclusions:Based on the proteomic data and structural modeling, it is surmised that the human malaria parasite low complexity region is responsible for theOPRT-OMPDC interaction.The structural complex of the parasite enzymes, thus, represents an efficient functional kinetic advantage, which in line with co-localization principles of evolutional origin, and allosteric control in protein-protein-interactions.

  19. Hydrate and alkoxide forms of molybdophosphoric heteropoly acid and specific features of their formation under conditions of isothermal absorption of water, methanol, and ethanol vapors

    International Nuclear Information System (INIS)

    By the gravimetry method it was ascertained that interaction of vapors of water, methanol and ethanol with solid molybdophosphoric acid at room temperature is accompanied by formation of volumetric hydrates and alcoholates proceeding in several stages, clearly defined during isotherm recording under sorption and desorption conditions. Formation of alcoholates gives rise to loosening of the acid lattice, which dictates largely more active absorption of alcohols at the first stages of sorption compared with water and, when the compositions H3PMo12O40 · 11.5 CH3OH and H3PMo12O40 · 14.7 C2H5OH is attained, it results in their liquefying

  20. Determination of structural and vibrational spectroscopic features of neutral and anion forms of dinicotinic acid by using NMR, infrared and Raman experimental methods combined with DFT and HF

    Science.gov (United States)

    Kose, E.; Bardak, F.; Atac, A.; Karabacak, M.; Cipiloglu, M. A.

    2013-10-01

    In this study; the experimental (NMR, infrared and Raman) and theoretical (HF and DFT) analysis of dinicotinic acid were presented. 1H and 13C NMR spectra were recorded in DMSO solution and chemical shifts were calculated by using the gauge-invariant atomic orbital (GIAO) method. The vibrational spectra of dinicotinic acid were recorded by FT-Raman and FT-IR spectra in the range of 4000-10 cm-1 and 4000-400 cm-1, respectively. To determine the most stable neutral conformer of molecule, the selected torsion angle was changed every 10° and molecular energy profile was calculated from 0° to 360°. The geometrical parameters and energies were obtained for all conformers form from density functional theory (DFT/B3LYP) and HF with 6-311++G(d,p) basis set calculations. However, the results of the most stable neutral and two anion forms (anion-1 and anion-2 forms) of dinicotinic acid are reported here. The complete assignments were performed on the basis of the total energy distribution (TED) of the vibrational wavenumbers, calculated with scaled quantum mechanics (SQM) method and PQS program.

  1. Loop Sequence Context Influences the Formation and Stability of the i-Motif for DNA Oligomers of Sequence (CCCXXX)4, where X = A and/or T, under Slightly Acidic Conditions.

    Science.gov (United States)

    McKim, Mikeal; Buxton, Alexander; Johnson, Courtney; Metz, Amanda; Sheardy, Richard D

    2016-08-11

    The structure and stability of DNA is highly dependent upon the sequence context of the bases (A, G, C, and T) and the environment under which the DNA is prepared (e.g., buffer, temperature, pH, ionic strength). Understanding the factors that influence structure and stability of the i-motif conformation can lead to the design of DNA sequences with highly tunable properties. We have been investigating the influence of pH and temperature on the conformations and stabilities for all permutations of the DNA sequence (CCCXXX)4, where X = A and/or T, using spectroscopic approaches. All oligomers undergo transitions from single-stranded structures at pH 7.0 to i-motif conformations at pH 5.0 as evidenced by circular dichroism (CD) studies. These folded structures possess stacked C:CH(+) base pairs joined by loops of 5'-XXX-3'. Although the pH at the midpoint of the transition (pHmp) varies slightly with loop sequence, the linkage between pH and log K for the proton induced transition is highly loop sequence dependent. All oligomers also undergo the thermally induced i-motif to single-strand transition at pH 5.0 as the temperature is increased from 25 to 95 °C. The temperature at the midpoint of this transition (Tm) is also highly dependent on loop sequence context effects. For seven of eight possible permutations, the pH induced, and thermally induced transitions appear to be highly cooperative and two state. Analysis of the CD optical melting profiles via a van't Hoff approach reveals sequence-dependent thermodynamic parameters for the unfolding as well. Together, these data reveal that the i-motif conformation exhibits exquisite sensitivity to loop sequence context with respect to formation and stability. PMID:27438583

  2. Localization of an O-glycosylated site in the recombinant barley alpha-amylase 1 produced in yeast and correction of the amino acid sequence using matrix-assisted laser desorption/ionization mass spectrometry of peptide mixtures

    DEFF Research Database (Denmark)

    Andersen, Jens S.; Søgaard, M; Svensson, B;

    1994-01-01

    , and analyzed directly by MALDI-MS. Based on the three mass spectrometric peptide maps, an error in the sequence deduced from cDNA, resulting in a mass difference of 28 Da, was located to a sequence stretch of 5 amino acid residues; furthermore, a dihexose substituent was identified on Thr410. Subsequent Edman...... degradation of two selected peptides isolated from the endoproteinase Lys-C digest corrected the sequence to be Val instead of Ala in position 284 and confirmed the O-glycosylation. These results demonstrate that the direct peptide mixture analysis by MALDI-MS is a rapid and sensitive method for protein......Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) of peptide mixtures was used to characterize recombinant barley alpha-amylase 1, produced in yeast. Three peptide mixtures were generated by cleavage with CNBr, digestion with endoproteinase Lys-C and Asp-N, respectively...

  3. Molecular cloning of the α-subunit of human prolyl 4-hydroxylase: The complete cDNA-derived amino acid sequence and evidence for alternative splicing of RNA transcripts

    International Nuclear Information System (INIS)

    Prolyl 4-hydroxylase an α2β2 tetramer, catalyzes the formation of 4-hydroxyproline in collagens by the hydroxylation of proline residues in peptide linkages. The authors report here on the isolation of cDNA clones encoding the α-subunit of the enzyme from human tumor HT-1080, placenta, and fibroblast cDNA libraries. Eight overlapping clones covering almost all of the corresponding 3,000-nucleotide mRNA, including all the coding sequences, were characterized. These clones encode a polypeptide of 517 amino acid residues and a signal peptide of 17 amino acids. Previous characterization of cDNA clones for the β-subunit of prolyl 4-hydroxylase has indicated that its C terminus has the amino acid sequence Lys-Asp-Gly-Leu, which, it has been suggested, is necessary for the retention of a polypeptide within the lumen of the endoplasmic reticulum. The α-subunit does not have this C-terminal sequence, and thus one function of the β-subunit in the prolyl 4-hydroxylase tetramer appears to be to retain the enzyme within this cell organelle. Southern blot analyses of human genomic DNA with a cDNA probe for the α-subunit suggested the presence of only one gene encoding the two types of mRNA, which appear to result from mutually exclusive alternative splicing of primary transcripts of one gene

  4. Sequence of echinochloa hoja blanca tenuivirus RNA-5.

    Science.gov (United States)

    de Miranda, J R; Muñoz, M; Wu, R; Espinoza, A M

    1996-01-01

    The sequence is presented of RNA-5 of Echinochloa hoja blanca tenuivirus, a second tenuivirus associated with rice cultivation in Latin America (after rice hoja blanca virus). The RNA is 1334 nucleotides long and contains in the complementary sense RNA a single long open reading frame. The deduced amino acid sequence of this open reading frame shows that it encodes a highly basic and hydrophilic 44 kD protein (pc5) with about 50% similarity to the pc5 protein of maize stripe virus (MStV). This and other features of the RNA are discussed. PMID:8879129

  5. Assessment of Epstein-Barr virus nucleic acids in gastric but not in breast cancer by next-generation sequencing of pooled Mexican samples

    Science.gov (United States)

    Fuentes-Pananá, Ezequiel M; Larios-Serrato, Violeta; Méndez-Tenorio, Alfonso; Morales-Sánchez, Abigail; Arias, Carlos F; Torres, Javier

    2016-01-01

    Gastric (GC) and breast (BrC) cancer are two of the most common and deadly tumours. Different lines of evidence suggest a possible causative role of viral infections for both GC and BrC. Wide genome sequencing (WGS) technologies allow searching for viral agents in tissues of patients with cancer. These technologies have already contributed to establish virus-cancer associations as well as to discovery new tumour viruses. The objective of this study was to document possible associations of viral infection with GC and BrC in Mexican patients. In order to gain idea about cost effective conditions of experimental sequencing, we first carried out an in silico simulation of WGS. The next-generation-platform IlluminaGallx was then used to sequence GC and BrC tumour samples. While we did not find viral sequences in tissues from BrC patients, multiple reads matching Epstein-Barr virus (EBV) sequences were found in GC tissues. An end-point polymerase chain reaction confirmed an enrichment of EBV sequences in one of the GC samples sequenced, validating the next-generation sequencing-bioinformatics pipeline. PMID:26910355

  6. Sequence-specific nucleic acid mobility using a reversible block copolymer gel matrix and DNA amphiphiles (lipid-DNA) in capillary and microfluidic electrophoretic separations

    NARCIS (Netherlands)

    Wagler, Patrick; Minero, Gabriel Antonio S.; Tangen, Uwe; de Vries, Jan Willem; Prusty, Deepak; Kwak, Minseok; Herrmann, Andreas; McCaskill, John S.

    2015-01-01

    Reversible noncovalent but sequence-dependent attachment of DNA to gels is shown to allow programmable mobility processing of DNA populations. The covalent attachment of DNA oligomers to polyacrylamide gels using acrydite-modified oligonucleotides has enabled sequence-specific mobility assays for DN

  7. Automatic sequences

    CERN Document Server

    Haeseler, Friedrich

    2003-01-01

    Automatic sequences are sequences which are produced by a finite automaton. Although they are not random they may look as being random. They are complicated, in the sense of not being not ultimately periodic, they may look rather complicated, in the sense that it may not be easy to name the rule by which the sequence is generated, however there exists a rule which generates the sequence. The concept automatic sequences has special applications in algebra, number theory, finite automata and formal languages, combinatorics on words. The text deals with different aspects of automatic sequences, in particular:· a general introduction to automatic sequences· the basic (combinatorial) properties of automatic sequences· the algebraic approach to automatic sequences· geometric objects related to automatic sequences.

  8. Vibrational analysis of amino acids and short peptides in aqueous media. V. The effect of the disulfide bridge on the structural features of the peptide hormone somatostatin-14.

    Science.gov (United States)

    Hernández, Belén; Carelli, Claude; Coïc, Yves-Marie; De Coninck, Joël; Ghomi, Mahmoud

    2009-09-24

    To emphasize the role played by the S-S bridge in the structural features of somatostatin-14 (SST-14), newly recorded CD and Raman spectra of this cyclic peptide and its open analogue obtained by Cys-->Ser substitution are presented. CD spectra of both peptides recorded in aqueous solutions in the 100-500 microM concentration range are strikingly similar. They reveal principally that random conformers constitute the major population in both peptides. Consequently, the S-S bridge has no structuring effect at submillimolar concentrations. In methanol, the CD spectrum of somatostatin-14 keeps globally the same spectral shape as that observed in water, whereas its open analogue presents a major population of helical conformers. Raman spectra recorded as a function of peptide concentration (5-20 mM) and also in the presence of 150 mM NaCl provide valuable conformational information. All Raman spectra present a mixture of random and beta-hairpin structures for both cyclic and open peptides. More importantly, the presence or the absence of the disulfide bridge does not seem to influence considerably different populations of secondary structures within this range of concentrations. CD and Raman data obtained in the submillimolar and millimolar ranges of concentrations, respectively, lead us to accept the idea that SST-14 monomers aggregate upon increasing concentration, thus stabilizing beta-hairpin conformations in solution. However, even at high concentrations, random conformers do not disappear. Raman spectra of SST-14 also reveal a concentration effect on the flexibility of the S-S linkage and consequently on that of its cyclic part. In conclusion, although the disulfide linkage does not seem to markedly influence the SST-14 conformational features in aqueous solutions, its presence seems to be necessary to ensure the flexibility of the cyclic part of this peptide and to maintain its closed structure in lower dielectric constant environments.

  9. X-ray studies of crystalline complexes involving amino acids and peptides. XLIV. Invariant features of supramolecular association and chiral effects in the complexes of arginine and lysine with tartaric acid.

    Science.gov (United States)

    Selvaraj, M; Thamotharan, S; Roy, Siddhartha; Vijayan, M

    2007-06-01

    The tartaric acid complexes with arginine and lysine exhibit two stoichiometries depending upon the ionization state of the anion. The structures reported here are DL-argininium DL-hydrogen tartrate, bis(L-argininium) L-tartrate, bis(DL-lysinium) DL-tartrate monohydrate, L-lysinium D-hydrogen tartrate and L-lysinium L-hydrogen tartrate. During crystallization, L-lysine preferentially interacts with D-tartaric acid to form a complex when DL-tartaric acid is used in the experiment. The anions and the cations aggregate into separate alternating layers in four of the five complexes. In bis(L-argininium) L-tartrate, the amino acid layers are interconnected by individual tartrate ions which do not interact among themselves. The aggregation of argininium ions in the DL- and the L-arginine complexes is remarkably similar, which is in turn similar to those observed in other dicarboxylic acid complexes of arginine. Thus, argininium ions have a tendency to assume similar patterns of aggregation, which are largely unaffected by a change in the chemistry of partner molecules such as the introduction of hydroxyl groups or a change in chirality or stoichiometry. On the contrary, the lysinium ions exhibit fundamentally different aggregation patterns in the DL-DL complexes on the one hand and L-D and L-L complexes on the other. Interestingly, the pattern in the L-D complex is similar to that in the L-L complex. The lysinium ions in the DL-DL complex exhibit an aggregation pattern similar to those observed in the DL-lysine complexes involving other dicarboxylic acids. Thus, the effect of change in the chirality of a subset of the component complexes could be profound or marginal, in an unpredictable manner. The relevant crystal structures appear to indicate that the preference of L-lysine for D-tartaric acid is perhaps caused by chiral discrimination resulting from the amplification of a small energy difference.

  10. Using of acid hydrolysis for preparation of porous starch and feature analysis%酸法制备多孔淀粉及其特性研究

    Institute of Scientific and Technical Information of China (English)

    李梅; 刘可志; 徐雅雯; 孙平

    2012-01-01

    Porous starch was made of corn starch by using of different concentrations of hydrochloric acid or compound enzyme under certain conditions.The morphology and some properties of the prepared porous starch were determined.Water absorption and oil absorption of the two products were compared.The results showed that the porous starch which made in the conditions of solid to liquid ratio 1:4,the concentration of hydrochloric acid 8% ,temperature 45~(3 and time 20h was similar to that made by enzymatic hydrolysis on water absorption and oil absorption,and the property was greatly improved on the basis of the native starch.The SEM results showed that the surface of porous starch made by enzymatic hydrolysis was covered with holes and pits. But the starch granules made by acid hydrolysis were random, and the effect was worse than enzymatic hydrolysis. Compared with native starch,gelatinization start temperature of porous starch was slightly increased but peak viscosity was decreased in a great degree.With high price advantage than enzymatic hydrolysis,acid hydrolysis was simple and inexpensive,and had industrial production value.%以玉米淀粉为原料,分别用不同浓度的盐酸溶液和复合酶在一定条件下制备多孔淀粉。对制得的多孔淀粉进行形态结构和部分性质的测定,并对两种方式处理所得产品的吸水性和吸油性进行对比。结果显示,料液比1:4、8%浓度的盐酸溶液、45℃下水浴反应20h制备的多孔淀粉在吸水和吸油性能上与酶法制备的多孔淀粉较为相似,同时相对于原淀粉性能有很大的提高。扫描电子显微镜(SEM)显示,酶法制备的多孔淀粉表面布有类似蜂窝状的孔洞或凹坑,而酸法制得的多孔淀粉颗粒具有随机性,效果不如酶法处理。与原淀粉相比,酸法制备的多孔淀粉糊化开始温度略微升高,而峰值黏度有很大降低。酸法制备多孔淀粉的方法简单廉价,相对酶

  11. Cleavage of nucleic acids

    Science.gov (United States)

    Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

    2000-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.

  12. The gene clone and sequence analysis of FAD2 from different erucic acid content rapeseed lines%不同芥酸含量油菜品系FAD2基因克隆测序分析

    Institute of Scientific and Technical Information of China (English)

    陈光尧; 王国槐; 张振乾; 官春云; 陈社员

    2011-01-01

    对高芥酸品系M13、60Coy辐照得到的高芥酸油菜突变体H27和低芥酸品系742的FAD2基因克隆并测序.结果发现,FAD2基因存在两个拷贝FAD2.1(1155bp)、FAD2.2(1140bp).H27的FAD2.1与M13序列相同.但FAD2.2核苷酸序列中M13在409位点A(腺嘌呤)变成了T(鸟嘌呤),导致密码子由AGA变换为TGA(终止密码子),使H27中的FAD2.2基因的功能丧失,这可能是芥酸含量的提高原因.%In order to better understand the molecular mechanism of erucic acid mutation, the FAD2 genes of high erucic acid M13 strain, high-erucic acid rapeseed mutant H27 by 60Coγ irradiation and low erucic acid strain 742 were sequenced by our laboratory. The results showed that there were two copies of FAD2 gene FAD2. 1 ( 1155bp) , FAD2. 2 (1140bp) .Compared with the FAD2. 1 and FAD2. 2 gene sequences of H27 and M13 , both of them had the same FAD2. 1 nucleotide sequence and the different FAD2. 2 nucleotide sequence. The FAD2. 2 nucleotide sequence of M13 in the 409 point A ( adenine) was replaced by T ( guanine) in H27 , then the AGA codon transformed into TGA ( stop codon) , which resulted in FAD2. 2 gene some function loss. It might be the reason why H27 had higher erucic acid content than M13.

  13. Featuring animacy

    Directory of Open Access Journals (Sweden)

    Elizabeth Ritter

    2015-01-01

    Full Text Available Algonquian languages are famous for their animacy-based grammatical properties—an animacy based noun classification system and direct/inverse system which gives rise to animacy hierarchy effects in the determination of verb agreement. In this paper I provide new evidence for the proposal that the distinctive properties of these languages is due to the use of participant-based features, rather than spatio-temporal ones, for both nominal and verbal functional categories (Ritter & Wiltschko 2009, 2014. Building on Wiltschko (2012, I develop a formal treatment of the Blackfoot aspectual system that assumes a category Inner Aspect (cf. MacDonald 2008, Travis 1991, 2010. Focusing on lexical aspect in Blackfoot, I demonstrate that the classification of both nouns (Seinsarten and verbs (Aktionsarten is based on animacy, rather than boundedness, resulting in a strikingly different aspectual system for both categories. 

  14. Draft Genome Sequence of D-Branched-Chain Amino Acid Producer Lactobacillus otakiensis JCM 15040T, Isolated from a Traditional Japanese Pickle.

    Science.gov (United States)

    Doi, Katsumi; Mori, Kazuki; Mutaguchi, Yuta; Tashiro, Kosuke; Fujino, Yasuhiro; Ohmori, Taketo; Kuhara, Satoru; Ohshima, Toshihisa

    2013-01-01

    Lactobacillus otakiensis strain JCM 15040(T) was isolated from an unsalted pickling solution used in the production of sunki, a traditional Japanese pickle. Here, we prepared a draft genome sequence for this strain consisting of 40 contigs containing a total of 2,347,132 bp, 2,310 predicted coding sequences, and a G+C content of 42.4%. PMID:23929467

  15. Detection of Differences in the Nucleotide and Amino Acid Sequences of Diphtheria Toxin from Corynebacterium diphtheriae and Corynebacterium ulcerans Causing Extrapharyngeal Infections

    OpenAIRE

    Sing, Andreas; Hogardt, Michael; Bierschenk, Suse; Heesemann, Jürgen

    2003-01-01

    While Corynebacterium ulcerans can mimic classical diphtheria, extrapharyngeal infections are extremely rare. Sequencing of the diphtheria toxin (DT)-encoding tox gene of two C. ulcerans isolates from extrapharyngeal infections revealed differences from C. diphtheriae DT sequences, mainly in the translocation and receptor-binding domains. C. ulcerans supernatants were much less potent than supernatant from C. diphtheriae. A C. ulcerans DT-specific PCR is described below.

  16. Determination of structural, spectrometric and nonlinear optical features of 2-(4-hydroxyphenylazo)benzoic acid by experimental techniques and quantum chemical calculations

    Science.gov (United States)

    Cinar, Mehmet; Yildiz, Nihat; Karabacak, Mehmet; Kurt, Mustafa

    2013-03-01

    The optimized geometrical structure, vibrational and electronic transitions, chemical shifts and nonlinear optical properties of 2-(4-hydroxyphenylazo)benzoic acid (HABA) compound were presented in this study. The ground state geometrical structure and vibrational wavenumbers were carried out by using density functional (DFT/B3LYP) method with 6-311++G(d,p) basis set. The vibrational spectra of title compound were recorded in solid state with FT-IR and FT-Raman spectrometry in the range of 4000-400 cm-1 and 4000-10 cm-1; respectively. The fundamental assignments were done on the basis of the recorded spectra and total energy distribution (TED) of the vibrational modes. The 1H and 13C NMR spectra were recorded in deuterated DMSO solution, and gauge-invariant atomic orbitals (GIAOs) method was used to predict the isotropic chemical shifts. The UV-Vis absorption spectra of the compound were observed in the range of 200-800 nm in ethanol, methanol and water solvents. To investigate the nonlinear optical properties, the polarizability, anisotropy of polarizability and molecular first hyperpolarizability were computed. A detailed description of spectroscopic behaviors of compound was given based on the comparison of experimental measurements and theoretical computations.

  17. Molecular level understanding of resistance to nalidixic acid in Salmonella enteric serovar typhimurium associates with the S83F sequence type.

    Science.gov (United States)

    Preethi, B; Ramanathan, K

    2016-01-01

    Nalidixic acid is an antibiotic drug used for treatment of Salmonellosis, a gastrointestinal infection. DNA gyrase subunit A (GyrA) of Salmonella typhimurium is the drug target for nalidixic acid. Resistance of GyrA to nalidixic acid, because of a point mutation in S. typhimurium, was recently reported. Substitution of Phe in place of Ser at locus 83 in GyrA of S. typhimurium has been experimentally associated with nalidixic acid resistance. Despite recent efforts, the mechanism of this resistance is not well understood. In this investigation we used computational techniques to address this shortcoming. Our results showed that contact with residue Arg 91 is certainly important for efficient binding of nalidixic acid to the target protein, and that mutation of this residue results in 180° rotation of the antibiotic in its binding pocket, around its own long axis. It is hoped these findings may enable development of new antibiotics against resistant forms of Salmonella. PMID:26329667

  18. Bipotential precursors of putative fibrous astrocytes and oligodendrocytes in rat cerebellar cultures express distinct surface features and neuron-like γ-aminobutyric acid transport

    International Nuclear Information System (INIS)

    When postnatal rat cerebellar cells were cultured in a chemically defined, serum-free medium, the only type of astrocyte present was unable to accumulate γ-[3H]aminobutyric acid (GABA), did not express surface antigens recognized by two monoclonal antibodies, A2B5 and LB1, and showed minimal proliferation. In these cultures, nonneuronal A2B5+, LB1+ stellate cells exhibiting neuron-like [3H]GABA uptake formed cell colonies of increasing size and were GFAP-. After about one week of culturing, the A2B5+, LB1+, GABA-uptake positive cell groups became galactocerebroside (GalCer) positive. Immunocytolysis of the A2B5+ cells at 3 and 4 days in vitro prevented the appearance of the A2B5+, LB1+, GABA-uptake positive cell colonies, and also of the GalCer+ cell groups. If 10% (vol/vol) fetal calf serum was added to 6-day cultures, the A2B5+, LB1+, GABA-uptake positive cell groups expressed GFAP and not GalCer. If the serum was added to the cultures 2 days after lysing the A2B5+ cells, only A2B5-, LB1-, GABA-uptake negative astrocytes proliferated. It is concluded that the putative fibrous astrocytes previously described in serum-containing cultures derive from bipotential precursors that differentiate into oligodendrocytes (GalCer+) in serum-free medium or into astrocytes (GFAP+) in the presence of serum, while the epithelioid A2B5-, LB1-, GABA-uptake negative astrocytes originate from a different precursor not yet identified

  19. Chaos game representation (CGR)-walk model for DNA sequences

    Institute of Scientific and Technical Information of China (English)

    Gao Jie; Xu Zhen-Yuan

    2009-01-01

    Chaos game representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to determine the coordinates of their positions in a continuous space. This distribution of positions has two features: one is unique, and the other is source sequence that can be recovered from the coordinates so that the distance between positions may serve as a measure of similarity between the corresponding sequences. A CGR-walk model is proposed based on CGR coordinates for the DNA sequences. The CGR coordinates are converted into a time series, and a long-memory ARFIMA (p, d, q) model, where ARFIMA stands for autoregressive fractionally integrated moving average, is introduced into the DNA sequence analysis. This model is applied to simulating real CGR-walk sequence data of ten genomic sequences. Remarkably long-range correlations are uncovered in the data, and the results from these models are reasonably fitted with those from the ARFIMA (p, d, q) model.

  20. Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE – binding linear epitopes of allergens

    Directory of Open Access Journals (Sweden)

    Peijnenburg Ad ACM

    2002-12-01

    Full Text Available Abstract Background Transgenic proteins expressed by genetically modified food crops are evaluated for their potential allergenic properties prior to marketing, among others by identification of short identical amino acid sequences that occur both in the transgenic protein and allergenic proteins. A strategy is proposed, in which the positive outcomes of the sequence comparison with a minimal length of six amino acids are further screened for the presence of potential linear IgE-epitopes. This double track approach involves the use of literature data on IgE-epitopes and an antigenicity prediction algorithm. Results Thirty-three transgenic proteins have been screened for identities of at least six contiguous amino acids shared with allergenic proteins. Twenty-two transgenic proteins showed positive results of six- or seven-contiguous amino acids length. Only a limited number of identical stretches shared by transgenic proteins (papaya ringspot virus coat protein, acetolactate synthase GH50, and glyphosate oxidoreductase and allergenic proteins could be identified as (part of potential linear epitopes. Conclusion Many transgenic proteins have identical stretches of six or seven amino acids in common with allergenic proteins. Most identical stretches are likely to be false positives. As shown in this study, identical stretches can be further screened for relevance by comparison with linear IgE-binding epitopes described in literature. In the absence of literature data on epitopes, antigenicity prediction by computer aids to select potential antibody binding sites that will need verification of IgE binding by sera binding tests. Finally, the positive outcomes of this approach warrant further clinical testing for potential allergenicity.

  1. Cloning of an Erwinia herbicola gene necessary for gluconic acid production and enhanced mineral phosphate solubilization in Escherichia coli HB101: nucleotide sequence and probable involvement in biosynthesis of the coenzyme pyrroloquinoline quinone.

    Science.gov (United States)

    Liu, S T; Lee, L Y; Tai, C Y; Hung, C H; Chang, Y S; Wolfram, J H; Rogers, R; Goldstein, A H

    1992-09-01

    Escherichia coli is capable of synthesizing the apo-glucose dehydrogenase enzyme (GDH) but not the cofactor pyrroloquinoline quinone (PQQ), which is essential for formation of the holoenzyme. Therefore, in the absence of exogenous PQQ, E. coli does not produce gluconic acid. Evidence is presented to show that the expression of an Erwinia herbicola gene in E. coli HB101(pMCG898) resulted in the production of gluconic acid, which, in turn, implied PQQ biosynthesis. Transposon mutagenesis showed that the essential gene or locus was within a 1.8-kb region of a 4.5-kb insert of the plasmid pMCG898. This 1.8-kb region contained only one apparent open reading frame. In this paper, we present the nucleotide sequence of this open reading frame, a 1,134-bp DNA fragment coding for a protein with an M(r) of 42,160. The deduced sequence of this protein had a high degree of homology with that of gene III (M(r), 43,600) of a PQQ synthase gene complex from Acinetobacter calcoaceticus previously identified by Goosen et al. (J. Bacteriol. 171:447-455, 1989). In minicell analysis, pMCG898 encoded a protein with an M(r) of 41,000. These data indicate that E. coli HB101(pMCG898) produced the GDH-PQQ holoenzyme, which, in turn, catalyzed the oxidation of glucose to gluconic acid in the periplasmic space. As a result of the gluconic acid production, E. coli HB101(pMCG898) showed an enhanced mineral phosphate-solubilizing phenotype due to acid dissolution of the hydroxyapatite substrate.

  2. Cloning and Sequence Analysis of Y-box Binding Protein Gene in Min Pig

    Institute of Scientific and Technical Information of China (English)

    Zhang Dong-jie; Liu Di; Wang Liang; He Xin-miao; Wang Wen-tao

    2014-01-01

    In order to study the gene sequence of Min pig Y-box binding protein (YB-1) gene, the complete coding sequence of Min pig YB-1 gene was cloned by RT-PCR, the sequence features were analyzed by some software and online website. The results showed that the complete CDS of Min pig Y-box was found to be 975 bp long, encoding 324 amino acids. It contained a conserved cold shock domain and several phosphorylation sites, but had no transmembrane domains, and was consistent with a protein found in the cytoplasm. Min pig YB-1 nucleotides shared high similarity (61.37%-97.66%) with other mammals.

  3. Abalone (Haliotis tuberculata) hemocyanin type 1 (HtH1). Organization of the approximately 400 kDa subunit, and amino acid sequence of its functional units f, g and h.

    Science.gov (United States)

    Keller, H; Lieb Bp6; Altenhein, B; Gebauer, D; Richter, S; Stricker, S; Markl, J

    1999-08-01

    We have identified two separate hemocyanin types (HtH1 and HtH2) in the European abalone Haliotis tuberculata. HtH1/HtH2 hybrid molecules were not found. By selective dissociation of HtH2 we isolated HtH1 which, as revealed by electron microscopy and SDS/PAGE, is present as didecamers of a approximately 400 kDa subunit. Immunologically, HtH1 and HtH2 correspond to keyhole limpet hemocyanin (KLH)1 and KLH2, respectively, the two well-studied hemocyanin types of the closely related marine gastropod Megathura crenulata. On the basis of limited proteolytic cleavage, two-dimensional immunoelectrophoresis, SDS/PAGE and N-terminal sequencing, we identified eight different 40-60 kDa functional units in HtH1, termed HtH1-a to HtH1-h, and determined their linear arrangement within the elongated subunit. From Haliotis mantle tissue, rich in hemocyanin-producing pore cells, we isolated mRNA and constructed a cDNA library. By expression screening with HtH-specific rabbit antibodies, a cDNA clone was isolated and sequenced which codes for the three C-terminal functional units f, g and h of HtH1. Their sequences were aligned to those available from other molluscs, notably to functional unit f and functional unit g from the cephalopod Octopus dofleini. HtH1-f, which is the first sequenced functional unit of type f from a gastropod hemocyanin, corresponds to functional unit f from Octopus. Also functional unit g from Haliotis and Octopus correspond to each other. HtH1-h is a gastropod hemocyanin functional unit type which is absent in cephalopods and has not been sequenced previously. It exhibits a unique tail extension of approximately 95 amino acids, which is lacking in functional units a to g and aligns with a published peptide sequence of 48 amino acids from functional unit h of Helix pomatia hemocyanin. The new Haliotis sequences are discussed with respect to their counterparts in Octopus, the 15 A three-dimensional reconstruction of the KLH1 didecamer from electron

  4. Local amino acid sequence patterns dominate the heterogeneous phenotype for the collagen connective tissue disease Osteogenesis Imperfecta resulting from Gly mutations.

    Science.gov (United States)

    Xiao, Jianxi; Yang, Zhangfu; Sun, Xiuxia; Addabbo, Rayna; Baum, Jean

    2015-10-01

    Osteogenesis Imperfecta (OI), a hereditary connective tissue disease in collagen that arises from a single Gly → X mutation in the collagen chain, varies widely in phenotype from perinatal lethal to mild. It is unclear why there is such a large variation in the severity of the disease considering the repeating (Gly-X-Y)n sequence and the uniform rod-like structure of collagen. We systematically evaluate the effect of local (Gly-X-Y)n sequence around the mutation site on OI phenotype using integrated bio-statistical approaches, including odds ratio analysis and decision tree modeling. We show that different Gly → X mutations have different local sequence patterns that are correlated with lethal and nonlethal phenotypes providing a mechanism for understanding the sensitivity of local context in defining lethal and non-lethal OI. A number of important trends about which factors are related to OI phenotypes are revealed by the bio-statistical analyses; most striking is the complementary relationship between the placement of Pro residues and small residues and their correlation to OI phenotype. When Pro is present or small flexible residues are absent nearby a mutation site, the OI case tends to be lethal; when Pro is present or small flexible residues are absent further away from the mutation site, the OI case tends to be nonlethal. The analysis also reveals the dominant role of local sequence around mutation sites in the Major Ligand Binding Regions that are primarily responsible for collagen binding to its receptors and shows that non-lethal mutations are highly predicted by local sequence considerations alone whereas lethal mutations are not as easily predicted and may be a result of more complex interactions. Understanding the sequence determinants of OI mutations will enhance genetic counseling and help establish which steps in the collagen hierarchy to target for drug therapy.

  5. 多位点序列分型技术以及在乳酸菌分类鉴定中的应用%Multi-locus sequence typing technology and its application in Lactic acid bacteria identification

    Institute of Scientific and Technical Information of China (English)

    徐海燕; 张和平

    2013-01-01

    Multi-locus sequence typing (MLST) is a molecular typing method based on the DNA partial nucleotide sequencing of multiple housekeeping genes,the differences of alleles at each loci were defined by the allelic profile or sequence type,it is an effective tool for analysis genetic relationship among strains.MLST schemes have been developed for a variety of bacterial research,in this paper,the method of MLST and its application on classification and identification of Lactic acid bacteria are introduced.%多位点序列分型(Multi-Locus Sequence Typing,MLST)技术是通过测定同种细菌管家基因(housekeeping gene)的序列,来确定等位基因的差异,进而比较分析菌株之间进化关系的一种分子分型方法.该方法在细菌生物学基础研究方面已经发挥了重要的作用,本文主要阐述MLST技术的方法和其在乳酸菌分类鉴定中的应用.

  6. Deduced amino acid sequence of the small hydrophobic protein of US avian pneumovirus has greater identity with that of human metapneumovirus than those of non-US avian pneumoviruses.

    Science.gov (United States)

    Yunus, Abdul S; Govindarajan, Dhanasekaran; Huang, Zhuhui; Samal, Siba K

    2003-05-01

    We report here the nucleotide and deduced amino acid (aa) sequences of the small hydrophobic (SH) gene of the avian pneumovirus strain Colorado (APV/CO). The SH gene of APV/CO is 628 nucleotides in length from gene-start to gene-end. The longest ORF of the SH gene encoded a protein of 177 aas in length. Comparison of the deduced aa sequence of the SH protein of APV/CO with the corresponding published sequences of other members of genera metapneumovirus showed 28% identity with the newly discovered human metapneumovirus (hMPV), but no discernable identity with the APV subgroup A or B. Collectively, this data supports the hypothesis that: (i) APV/CO is distinct from European APV subgroups and belongs to the novel subgroup APV/C (APV/US); (ii) APV/CO is more closely related to hMPV, a mammalian metapneumovirus, than to either APV subgroup A or B. The SH gene of APV/CO was cloned using a genomic walk strategy which initiated cDNA synthesis from genomic RNA that traversed the genes in the order 3'-M-F-M2-SH-G-5', thus confirming that gene-order of APV/CO conforms in the genus Metapneumovirus. We also provide the sequences of transcription-signals and the M-F, F-M2, M2-SH and SH-G intergenic regions of APV/CO.

  7. Independence divergence-generated binary trees of amino acids.

    Science.gov (United States)

    Tusnády, G E; Tusnády, G; Simon, I

    1995-05-01

    The discovery of the relationship between amino acids is important in terms of the replacement ability, as used in protein engineering homology studies, and gaining a better understanding of the roles which various properties of the residues play in the creation of a unique, stable, 3-D protein structure. Amino acid sequences of proteins edited by evolution are anything but random. The measure of nonrandomness, i.e. the level of editing, can be characterized by an independence divergence value. This parameter is used to generate binary tree relationships between amino acids. The relationships of residues presented in this paper are based on protein building features and not on the physico-chemical characteristics of amino acids. This approach is not biased by the tautology present in all sequence similarity-based relationship studies. The roles which various physico-chemical characteristics play in the determination of the relationships between amino acids are also discussed.

  8. In vitro nucleosome positioning features of DNA repeats sequence associated with human genetic disease%与人遗传病相关的DNA重复序列的体外核小体定位特性

    Institute of Scientific and Technical Information of China (English)

    柴荣; 赵宏宇; 蔡禄

    2013-01-01

    Objective To investigate the nucleosome positioning of DNA repeats sequence ire vitro which can cause human genetic disease. Methods The recombinant plasmids containing (GAA)42, (ATTCT)43, (GCCT)18 and 601 sequence were cloned. The histone and plasmids were used to assemble chromatin structure ire vitro,and then analyzed by agarose gel electrophoresis after micrococcal nuclease digestion. Results The plasmid containing ATTCT repeats sequence was easier to form nucleosome than GAA containing repeats sequence ire vitro. Conclusions The recombinant plasmids' ability to form chromatin structure was changed because of the insert of the different repeats sequence fragment.%目的 研究与人遗传病相关的DNA重复序列的体外核小体定位.方法 构建含有(GAA)42、(ATTCT)43、(GCCT)18和601序列的重组质粒,体外利用盐透析将质粒与组蛋白八聚体组装形成染色质结构,微球菌核酸酶消化后,用琼脂糖凝胶电泳分析染色质的结构.结果 含有ATTCT重复序列的质粒较含GAA重复序列质粒在体外易于形成核小体.结论 在重组质粒中,由于引入的重复序列片段形成核小体能力的不同会影响其局部染色质结构.

  9. ORFer – retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files

    Directory of Open Access Journals (Sweden)

    Sievert Volker

    2002-12-01

    Full Text Available Abstract Background Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite of the cloning and recombinant expression of these proteins. Results A Java program was developed for retrieval of protein and nucleic acid sequences and annotations from NCBI GenBank, using the XML sequence format. Annotations retrieved by ORFer include sequence name, organism and also the completeness of the sequence. The program has a graphical user interface, although it can be used in a non-interactive mode. For protein sequences, the program also extracts the open reading frame sequence, if available, and checks its correct translation. ORFer accepts user input in the form of single or lists of GenBank GI identifiers or accession numbers. It can be used to extract complete sets of open reading frames and protein sequences from any kind of GenBank sequence entry, including complete genomes or chromosomes. Sequences are either stored with their features in a relational database or can be exported as text files in Fasta or tabulator delimited format. The ORFer program is freely available at http://www.proteinstrukturfabrik.de/orfer. Conclusion The ORFer program allows for fast retrieval of DNA sequences, protein sequences and their open reading frames and sequence annotations from GenBank. Furthermore, storage of sequences and features in a relational database is supported. Such a database can supplement a laboratory information system (LIMS with appropriate sequence information.

  10. Genome Sequence of "Candidatus Microthrix parvicella" Bio17-1, a Long-Chain-Fatty-Acid-Accumulating Filamentous Actinobacterium from a Biological Wastewater Treatment Plant

    OpenAIRE

    Muller, Emilie; Pinel, Nicolás; Gillece, John D.; Schupp, James M.; Lance B Price; Engelthaler, David M.; Levantesi, Caterina; Tandoi, Valter; Luong, Kkai; Baliga, Nitin S.; Korlach, Jonas; Keim, Paul S.; Wilmes, Paul

    2012-01-01

    “Candidatus Microthrix” bacteria are deeply branching filamentous actinobacteria which occur at the water-air interface of biological wastewater treatment plants, where they are often responsible for foaming and bulking. Here, we report the first draft genome sequence of a strain from this genus: “Candidatus Microthrix parvicella” strain Bio17-1.

  11. Electrochemical deoxyribonucleic acid biosensor based on electrodeposited graphene and nickel oxide nanoparticle modified electrode for the detection of salmonella enteritidis gene sequence.

    Science.gov (United States)

    Sun, Wei; Wang, Xiuli; Lu, Yongxi; Gong, Shixing; Qi, Xiaowei; Lei, Bingxin; Sun, Zhenfan; Li, Guangjiu

    2015-04-01

    In this paper a new electrochemical DNA biosensor was prepared by using graphene (GR) and nickel oxide (NiO) nanocomposite modified carbon ionic liquid electrode (CILE) as the substrate electrode. GR and NiO nanoparticles were electrodeposited on the CILE surface step-by-step to get the nanocomposite. Due to the strong affinity of NiO with phosphate groups of ssDNA, oligonucleotide probe with a terminal 5'-phosphate group could be attached on the surface of NiO/GR/CILE, which could further hybridize with the target ssDNA sequence. Methylene blue (MB) was used as the electrochemical indicator for monitoring the hybridization reaction. Under the optimal conditions the reduction peak current of MB was proportional to the concentration of salmonella enteritidis gene sequence in the range from 1.0×10(-13) to 1.0×10(-6)molL(-1) with a detection limit as 3.12×10(-14)molL(-1). This electrochemical DNA sensor exhibited good discrimination ability to one-base and three-base mismatched ssDNA sequences, and the polymerase chain reaction amplification product of salmonella enteritidis gene sequences were further detected with satisfactory results.

  12. Studies on deoxyribonucleic acids and related compounds. V. Synthesis of pentadecanucleotide duplex containing the ideal Pribnow sequence of promoter by the phosphotriester method using a new phosphorylating reagent.

    OpenAIRE

    Ohtsuka, E; Taniyama, Y; Marumoto, R; Sato, H.; Hirosaki, H; Ikehara, M

    1982-01-01

    A phosphorylating reagent o-chlorophenyl phosphoro-p-anisi-dochloridate was synthesized to phosphorylate the 3'-hydroxyl group of N, 5'-protected deoxynucleosides. These nucleotides served as 3'-terminal units for the synthesis of oligonucleotide blocks. By condensation of these oligonucleotide blocks the partially complementary deoxypentadecanucleotides dAGCTTATAATGC-TCG and dAGCTCGAGCATTATA, which contained the ideal Pribnow sequence TATAATG, were synthesized.

  13. Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS%LC-MS/MS法对融合蛋白FP3的氨基酸全序列测定

    Institute of Scientific and Technical Information of China (English)

    李响; 高向东; 陶磊; 裴德宁; 郭莹; 饶春明; 王军志

    2012-01-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein ammo acid sequence, they are important approach to structure analysis and identification of recombinant protein.%运用液质联用、两种串联质谱对融合蛋白FP3的氨基酸全序列测定,确证其一级结构.将样品还原烷基化后,通过胰蛋白酶酶解蛋白,PNGase F去除多肽混合物中糖肽的糖基化,将去糖后的总肽用于液质联用系统,通过液相分离后,运用Q-TOF和线性离子阱两种串联质谱测定各个肽段的b,y碎片离子,分析测定融合蛋白FP3的氨基酸全序列.通过LC-ESI-Q-TOF完成了融合蛋白FP3的76%氨基酸序列测定,通过LC-ESI-Trap完成余下24%氨基酸序列测定.液质联用、串联质谱法测定蛋白质氨基酸序列快速、灵敏、准确,是对重组蛋白结构分析和确证的重要手段.

  14. Development of Electrochemical Biosensorfor Sequence-Specific Deoxyribonucleic Acid%特定序列脱氧核糖核酸电化学生物传感器进展

    Institute of Scientific and Technical Information of China (English)

    杨丽菊; 彭图治

    2001-01-01

    对当今电分析化学中的研究热点之一——脱氧核糖核酸(DNA)的电化学生物传感技术的最新进展进行了综述,评述了其发展前景。%This is a review of electrochemical biosensor for sequence-specific deoxyribonucleic acid (DNA) with 51 references. Resent Progress in designation of the sensor, immobilization of DNA, indicator of hybridizatio, applications and some new concepts of the sensor are discussed in detail.

  15. Sequence diversity and functional conformity

    OpenAIRE

    de Lange, Orlando

    2015-01-01

    At least four phylogenetically distinct groups of bacteria encode repeat proteins with the common ability to bind specific DNA sequences with a unique but conserved code. Each repeat binds a single DNA base, and specificity is determined by the amino acid residue at position 13 of each repeat. Repeats are typically 33-35 amino acids long. Comparing repeat sequences across all groups reveals that only three positions are hyper-conserved. Repeats are in most cases functionally compatible such t...

  16. Application of difference sequences theory

    OpenAIRE

    Khantarzhiev, Georgii

    2013-01-01

    The results of difference sequences theory are applied to analytic function theory and Diophantine equations. As a result we have the equation which connects the $n$-th derivative of a function with the difference sequence for the values of this function. Also the results of difference sequences theory helps to discover some features of the whole kind of Diophantine equations. The method presented allows to find limits where Diophantine equation does not have integer solutions. The higher pow...

  17. Genome Sequencing

    DEFF Research Database (Denmark)

    Sato, Shusei; Andersen, Stig Uggerhøj

    2014-01-01

    The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based on transcr......The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based...

  18. OPEN QUESTIONS IN ORIGIN OF LIFE: EXPERIMENTAL STUDIES ON THE ORIGIN OF NUCLEIC ACIDS AND PROTEINS WITH SPECIFIC AND FUNCTIONAL SEQUENCES BY A CHEMICAL SYNTHETIC BIOLOGY APPROACH

    Directory of Open Access Journals (Sweden)

    Katarzyna Adamala

    2014-02-01

    We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides, like Ser-His, with respect to peptide- and nucleotides-condensation, as a realistic model of primitive organocatalysis. We have also set up a strategy for exploring the sequence space of random proteins and RNAs (the so-called “never born biopolymer” project with respect to the production of folded structures. Being still far from solved, the main aspects of these “open questions” are discussed here, by commenting on recent results obtained in our groups and by providing a unifying view on the problem and possible solutions. In particular, we propose a general scenario for macromolecule formation via fragment-condensation, as a scheme for the emergence of specific sequences based on molecular growth and selection.

  19. The amino acid sequence of a 20 kDa bifunctional subtilisin/alpha-amylase inhibitor from bran [correction of brain] of rice (Oryza sativa L.) seeds.

    Science.gov (United States)

    Ohtsubo, K; Richardson, M

    1992-08-31

    A 20 kDa bifunctional inhibitor of the microbial proteinase, subtilisin, and the alpha-amylase from the larvae of the red flour beetle (Tribolium castaneum) was purified from bran of rice seeds by saline extraction, precipitation with ammonium sulphate, ion-exchange chromatography on DEAE-Cellulose and Toyopearl CM-650, and preparative HPLC on Vydac C18. The complete primary structure was determined by automatic degradation of the intact, reduced and S-alkylated protein, and by manual DABITC/PITC micro-sequencing of peptides obtained from the protein following separate enzymic digestions with trypsin, pepsin, chymotrypsin, elastase and the protease from S. aureus V8. The protein sequence, which contained 176 residues, showed strong homology with similar bifunctional inhibitors previously isolated from wheat and barley which are related to the Kunitz family of proteinase inhibitors from legume seeds. PMID:1511747

  20. Nanoelectronic three-dimensional (3D) nanotip sensing array for real-time, sensitive, label-free sequence specific detection of nucleic acids.

    Science.gov (United States)

    Esfandyarpour, Rahim; Yang, Lu; Koochak, Zahra; Harris, James S; Davis, Ronald W

    2016-02-01

    The improvements in our ability to sequence and genotype DNA have opened up numerous avenues in the understanding of human biology and medicine with various applications, especially in medical diagnostics. But the realization of a label free, real time, high-throughput and low cost biosensing platforms to detect molecular interactions with a high level of sensitivity has been yet stunted due to two factors: one, slow binding kinetics caused by the lack of probe molecules on the sensors and two, limited mass transport due to the planar structure (two-dimensional) of the current biosensors. Here we present a novel three-dimensional (3D), highly sensitive, real-time, inexpensive and label-free nanotip array as a rapid and direct platform to sequence-specific DNA screening. Our nanotip sensors are designed to have a nano sized thin film as their sensing area (~ 20 nm), sandwiched between two sensing electrodes. The tip is then conjugated to a DNA oligonucleotide complementary to the sequence of interest, which is electrochemically detected in real-time via impedance changes upon the formation of a double-stranded helix at the sensor interface. This 3D configuration is specifically designed to improve the biomolecular hit rate and the detection speed. We demonstrate that our nanotip array effectively detects oligonucleotides in a sequence-specific and highly sensitive manner, yielding concentration-dependent impedance change measurements with a target concentration as low as 10 pM and discrimination against even a single mismatch. Notably, our nanotip sensors achieve this accurate, sensitive detection without relying on signal indicators or enhancing molecules like fluorophores. It can also easily be scaled for highly multiplxed detection with up to 5000 sensors/square centimeter, and integrated into microfluidic devices. The versatile, rapid, and sensitive performance of the nanotip array makes it an excellent candidate for point-of-care diagnostics, and high

  1. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    OpenAIRE

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIK...

  2. Amino acid sequence requirements in the human IgA1 hinge for cleavage by streptococcal IgA1 proteases

    DEFF Research Database (Denmark)

    Senior, BW; Batten, MR; Kilian, Mogens;

    2002-01-01

    All the IgA1 proteases of the different pathogenic species of Streptococcus cleave the hinge of the alpha chain of human IgA1 only at one proline-threonine peptide bond. In order to study the importance of these amino acids for cleavage, several hinge mutant recombinant IgA1 antibodies were...

  3. Promoter sequence of 3-phosphoglycerate kinase gene 2 of lactic acid-producing fungus rhizopus oryzae and a method of expressing a gene of interest in fungal species

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Johnway [Richland, WA; Skeen, Rodney S [Pendleton, OR

    2003-03-04

    The present invention provides the promoter clone discovery of phosphoglycerate kinase gene 2 of a lactic acid-producing filamentous fungal strain, Rhizopus oryzae. The isolated promoter can constitutively regulate gene expression under various carbohydrate conditions. In addition, the present invention also provides a design of an integration vector for the transformation of a foreign gene in Rhizopus oryzae.

  4. Promoter sequence of 3-phosphoglycerate kinase gene 1 of lactic acid-producing fungus rhizopus oryzae and a method of expressing a gene of interest in fungal species

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Johnway [Richland, WA; Skeen, Rodney S [Pendleton, OR

    2002-10-15

    The present invention provides the promoter clone discovery of phosphoglycerate kinase gene 1 of a lactic acid-producing filamentous fungal strain, Rhizopus oryzae. The isolated promoter can constitutively regulate gene expression under various carbohydrate conditions. In addition, the present invention also provides a design of an integration vector for the transformation of a foreign gene in Rhizopus oryzae.

  5. REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era

    Directory of Open Access Journals (Sweden)

    Guy Leonard

    2009-01-01

    Full Text Available The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment fi le, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree fi les (with a user-defined combination of species name and/or database accession number. Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file and generation of species and accession number lists for use in supplementary materials or figure legends.

  6. Mechanism analysis of acid tolerance response of bifidobacterium longum subsp. longum BBMN 68 by gene expression profile using RNA-sequencing.

    Directory of Open Access Journals (Sweden)

    Junhua Jin

    Full Text Available To analyze the mechanism of the acid tolerance response (ATR in Bifidobacterium longum subsp. longum BBMN68, we optimized the acid-adaptation condition to stimulate ATR effectively and analyzed the change of gene expression profile after acid-adaptation using high-throughput RNA-Seq. After acid-adaptation at pH 4.5 for 2 hours, the survival rate of BBMN68 at lethal pH 3.5 for 120 min was increased by 70 fold and the expression of 293 genes were upregulated by more than 2 fold, and 245 genes were downregulated by more than 2 fold. Gene expression profiling of ATR in BBMN68 suggested that, when the bacteria faced acid stress, the cells strengthened the integrity of cell wall and changed the permeability of membrane to keep the H(+ from entering. Once the H(+ entered the cytoplasm, the cells showed four main responses: First, the F(0F(1-ATPase system was initiated to discharge H(+. Second, the ability to produce NH(3 by cysteine-cystathionine-cycle was strengthened to neutralize excess H(+. Third, the cells started NER-UVR and NER-VSR systems to minimize the damage to DNA and upregulated HtpX, IbpA, and γ-glutamylcysteine production to protect proteins against damage. Fourth, the cells initiated global response signals ((pppGpp, polyP, and Sec-SRP to bring the whole cell into a state of response to the stress. The cells also secreted the quorum sensing signal (AI-2 to communicate between intraspecies cells by the cellular signal system, such as two-component systems, to improve the overall survival rate. Besides, the cells varied the pathways of producing energy by shifting to BCAA metabolism and enhanced the ability to utilize sugar to supply sufficient energy for the operation of the mechanism mentioned above. Based on these reults, it was inferred that, during industrial applications, the acid resistance of bifidobacteria could be improved by adding BCAA, γ-glutamylcysteine, cysteine, and cystathionine into the acid-stress environment.

  7. Sequence assembly

    DEFF Research Database (Denmark)

    Scheibye-Alsing, Karsten; Hoffmann, S.; Frankel, Annett Maria;

    2009-01-01

    and plays an important role in processing the information generated by these methods. Here, we provide a comprehensive overview of the current publicly available sequence assembly programs. We describe the basic principles of computational assembly along with the main concerns, such as repetitive sequences...

  8. An Matching Method for Vehicle-borne Panoramic Image Sequence Based on Adaptive Structure from Motion Feature%自适应运动结构特征的车载全景序列影像匹配方法

    Institute of Scientific and Technical Information of China (English)

    张正鹏; 江万寿; 张靖

    2015-01-01

    Panoramic image matching method with the constraint condition of local structure from motion similarity feature is an important method ,the process requires multivariable kernel density estimations for the structure frommotion feature used nonparametric mean shift .Proper selection of the kernel bandwidth is a critical step for convergence speed and accuracy of matching method .Variable bandwidth with adaptive structure from motion feature for panoramic image matching method has been proposed in this work .First the bandwidth matrix is defined using the locally adaptive spatial structure of the sampling point in spatial domain and optical flow domain .The rel axation diffusion process of structure from motion simil arity feature is described by distance weighting method of local optical flow feature vector .Then the expression form of adaptive multivari ate kernel density function is given out , and discusses the solution of the mean shift vector ,termination conditions ,and the seed point selection method .The final fusions of multi‐scale SIFT the features and structure features to establish a unified panoramic image matching framework .The sphere panoramic images from vehicle‐borne mobile measurement system are chosen such that a comparison analysis between fixed bandwidth and adaptive bandwidth is carried out in detail .The results show that adaptive bandwidth is good for case with the inlier ratio changes and the object space scale changes .The proposedmethodcanrealizetheadaptivesimilaritymeasureofstructurefrommotionfeature,improvesthe correct matching points and matching rate ,experimental results have shown our method to be robust .%以运动结构特征为约束条件的序列影像匹配,是基于多变量核密度函数,采用非参数均值漂移方法估计最优局部运动相似性结构特征的过程。核密度函数的带宽大小决定了匹配方法的收敛速度和精度。本文提出了一种可变带宽的自适应运动结构特征

  9. Guanine nucleotide-binding proteins that enhance choleragen ADP-ribosyltransferase activity: nucleotide and deduced amino acid sequence of an ADP-ribosylation factor cDNA.

    OpenAIRE

    Price, S R; Nightingale, M.; Tsai, S C; Williamson, K. C.; Adamik, R; H. C. Chen; Moss, J; M. Vaughan

    1988-01-01

    Three (two soluble and one membrane) guanine nucleotide-binding proteins (G proteins) that enhance ADP-ribosylation of the Gs alpha stimulatory subunit of the adenylyl cyclase (EC 4.6.1.1) complex by choleragen have recently been purified from bovine brain. To further define the structure and function of these ADP-ribosylation factors (ARFs), we isolated a cDNA clone (lambda ARF2B) from a bovine retinal library by screening with a mixed heptadecanucleotide probe whose sequence was based on th...

  10. Existing and emerging detection technologies for DNA (Deoxyribonucleic Acid) finger printing, sequencing, bio- and analytical chips: a multidisciplinary development unifying molecular biology, chemical and electronics engineering.

    Science.gov (United States)

    Kumar Khanna, Vinod

    2007-01-01

    The current status and research trends of detection techniques for DNA-based analysis such as DNA finger printing, sequencing, biochips and allied fields are examined. An overview of main detectors is presented vis-à-vis these DNA operations. The biochip method is explained, the role of micro- and nanoelectronic technologies in biochip realization is highlighted, various optical and electrical detection principles employed in biochips are indicated, and the operational mechanisms of these detection devices are described. Although a diversity of biochips for diagnostic and therapeutic applications has been demonstrated in research laboratories worldwide, only some of these chips have entered the clinical market, and more chips are awaiting commercialization. The necessity of tagging is eliminated in refractive-index change based devices, but the basic flaw of indirect nature of most detection methodologies can only be overcome by generic and/or reagentless DNA sensors such as the conductance-based approach and the DNA-single electron transistor (DNA-SET) structure. Devices of the electrical detection-based category are expected to pave the pathway for the next-generation DNA chips. The review provides a comprehensive coverage of the detection technologies for DNA finger printing, sequencing and related techniques, encompassing a variety of methods from the primitive art to the state-of-the-art scenario as well as promising methods for the future.

  11. Complementary DNA and derived amino acid sequence of the β subunit of human complement protein C8: identification of a close structural and ancestral relationship to the α subunit and C9

    International Nuclear Information System (INIS)

    A cDNA clone encoding the β subunit (M/sub r/ 64,000) of the eighth component of complement (C8) has been isolated from a human liver cDNA library. This clone has a cDNA insert of 1.95 kilobases (kb) and contains the entire β sequence [1608 base pairs (bp)]. Analysis of total cellular RNA isolated from the hepatoma cell line HepG2 revealed the mRNA for β to be ∼ 2.5 kb. This is similar to the message size for the α subunit of C8 and confirms the existence of different mRNAs for α and β. This finding supports genetic evidence that α and β are encoded at different loci. Analysis of the derived amino acid sequence revealed several membrane surface seeking segments that may facilitate β interaction with target membranes during complement-mediated cytolysis. Determined of the carbohydrate composition indicated 1 or 2 asparagine-linked but no O-linked oligosaccharide chains. Comparison of the β sequence to that reported earlier and to that of human C9 revealed a striking homology between all three proteins. For β and α, the overall homology is 33% on the basis of identity and 53% when conserved substitutions are allowed. For β and C9, the values are 26% and 475, respectively. All three have a large internal domain that is nearly cysteine free and N- and C-termini that are cysteine-rich and homologous to the low-density lipoprotein receptor repeat and epidermal growth factor type sequences, respectively. The overall homology and similarities in size and structural organization are indicative of a close ancestral relationship. It is concluded that α, β and C9 are members of a family of structurally related proteins that are capable of interacting to produce a hydrophilic to amphiphilic transition and membrane association

  12. Thermoresponsive physical hydrogels of poly(lactic acid)/poly(ethylene glycol) stereoblock copolymers tuned by stereostructure and hydrophobic block sequence.

    Science.gov (United States)

    Mao, Hailiang; Shan, Guorong; Bao, Yongzhong; Wu, Zi Liang; Pan, Pengju

    2016-05-18

    CBABC-type poly(lactic acid) (PLA)/poly(ethylene glycol) (PEG) pentablock copolymers composed of a central PEG block (A) and enantiomeric poly(l-lactic acid) (PLLA, B), poly(d-lactic acid) (PDLA, C) blocks were synthesized. Such pentablock copolymers form physical hydrogels at high concentrations in an aqueous solution, which stem from the aggregation and physical bridging of copolymer micelles. These gels are thermoresponsive and turn into sols upon heating. Physical gelation, gel-to-sol transition, crystalline state, microstructure, rheological behavior, biodegradation, and drug release behavior of PLA/PEG pentablock copolymers and their gels were investigated; they were also compared with PLA-PEG-PLA triblock copolymers containing the isotactic PLLA or atactic poly(d,l-lactide) (PDLLA) endblocks and PLLA-PEG-PLLA/PDLA-PEG-PDLA enantiomeric mixtures. PLA hydrophobic domains in pentablock copolymer gels changed from a homocrystalline to stereocomplexed structure as the PLLA/PDLA block length ratio approached 1/1. The gel of symmetric pentablock copolymer exhibited a wider gelation region, higher gel-to-sol transition temperature, higher hydrophobic domain crystallinity, larger intermicellar distance, higher storage modulus, and slower degradation and drug release rate compared to those of the asymmetric PLA/PEG pentablock copolymers or triblock copolymers. SAXS results indicated that the PLLA/PDLA blocks stereocomplexation in pentablock copolymers facilitated the intermicellar aggregation and bridging. Cylindrical ordered structures were observed in all the gels formed from the PLA/PEG pentablock and triblock copolymers. The stereocomplexation degree and intermicellar distance of the pentablock copolymer gels increased with heating. PMID:27121732

  13. Learning Sequence Neighbourhood Metrics

    CERN Document Server

    Bayer, Justin; van der Smagt, Patrick

    2011-01-01

    Recurrent neural networks (RNNs) in combination with a pooling operator and the neighbourhood components analysis (NCA) objective function are able to detect the characterizing dynamics of sequences and embed them into a fixed-length vector space of arbitrary dimensionality. Subsequently, the resulting features are meaningful and can be used for visualization or nearest neighbour classification in linear time. This kind of metric learning for sequential data enables the use of algorithms tailored towards fixed length vector spaces such as R^n.

  14. Amino acid sequence requirements in the human IgA1 hinge for cleavage by streptococcal IgA1 proteases.

    Science.gov (United States)

    Senior, B W; Batten, M R; Kilian, M; Woof, J M

    2002-08-01

    All the IgA1 proteases of the different pathogenic species of Streptococcus cleave the hinge of the alpha chain of human IgA1 only at one proline-threonine peptide bond. In order to study the importance of these amino acids for cleavage, several hinge mutant recombinant IgA1 antibodies were constructed. The mutations were found to be without major effect upon the structure or functional abilities of the antibodies. However, they had a major effect upon their sensitivity to cleavage by some of the IgA1 proteases. PMID:12196126

  15. Combination of acid labile detergent and C18 Empore™ disks for improved identification and sequence coverage of in-gel digested proteins.

    Science.gov (United States)

    Koehn, Henning; Lau, Benjamin; Clerens, Stefan; Plowman, Jeffrey E; Dyer, Jolon M; Ramli, Umi Salamah; Deb-Choudhury, Santanu

    2011-04-01

    A protocol for improved extraction of peptides from in-gel protein digests, using a combination of the acid labile surfactant, sodium deoxycholate (SDC) and C18 Empore™ membranes, is presented. This approach results in better mass spectrum quality, higher numbers of identified peptide peaks and improved identification scores compared to standard tryptic digestion protocols, or protocols using only SDC or only C18 Empore™ disks. The advantages of the new protocol are demonstrated for two different types of samples: Merino wool intermediate filament proteins and Elaeis guineensis (oil palm) mesocarp proteins. PMID:21327873

  16. Common interruptions in the repeating tripeptide sequence of non-fibrillar collagens: Sequence analysis and structural studies on triple-helix peptide models

    OpenAIRE

    Thiagarajan, Geetha; Li, Yingjie; Mohs, Angela; Strafaci, Christopher; Popiel, Magdalena; Baum, Jean; Brodsky, Barbara

    2007-01-01

    Interruptions in the repeating (Gly-X1-X2)n amino acid sequence pattern are found in the triple-helix domains of all non-fibrillar collagens, and perturbations to the triple-helix at such sites are likely to play a role in collagen higher order structure and function. This report defines the sequence features and structural consequences of the most common interruption, where one residue is missing in the tripeptide pattern, Gly-X1-X2-Gly-AA1-Gly-X1-X2, designated as G1G interruptions. Residue...

  17. Uric acid is associated with features of insulin resistance syndrome in obese children at prepubertal stage El ácido úrico se asocia con características de un síndrome de resistencia insulínica en los niños obesos en etapas prepuberales

    OpenAIRE

    M. Gil-Campos; C. M.ª Aguilera; R Cañete; Gil, A.

    2009-01-01

    Elevated plasma uric acid levels are associated with obesity and could be an expression of insulin-resistant state. The aim of the present study was to evaluate plasma uric acid in obese and normal-weight children exclusively at prepubertal stage and its relationship with anthropometric measurements, intake, and features of the insulin resistance syndrome. A study was performed in 34 obese and 20 normal-weight prepubertal children. Nutrient intake was determined using a 72 h recall questionna...

  18. Sequence-specific {sup 1}H, {sup 13}C, and {sup 15}N resonance assignments for intestinal fatty-acid-binding protein complexed with palmitate (15.4 kDA)

    Energy Technology Data Exchange (ETDEWEB)

    Hodsdon, M.E.; Toner, J.J.; Cistola, D.P. [Washington Univ. School of Medicine, St. Louis, MO (United States)

    1994-12-01

    Intestinal fatty-acid-binding protein (I-FABP) belongs to a family of soluble, cytoplasmic proteins that are thought to function in the intracellular transport and trafficking of polar lipids. Individual members of this protein family have distinct specificities and affinities for fatty acids, cholesterol, bile salts, and retinoids. We are comparing several retinol- and fatty-acid-binding proteins from intestine in order to define the factors that control molecular recognition in this family of proteins. We have established sequential resonance assignments for uniformly {sup 13}C/{sup 15}N-enriched I-FABP complexed with perdeuterated palmitate at pH7.2 and 37{degrees}C. The assignment strategy was similar to that introduced for calmodulin. We employed seven three-dimensional NMR experiments to establish scalar couplings between backbone and sidechain atoms. Backbone atoms were correlated using triple-resonance HNCO, HNCA, TOCSY-HMQC, HCACO, and HCA(CO)N experiments. Sidechain atoms were correlated using CC-TOCSY, HCCH-TOCSY, and TOCSY-HMQC. The correlations of peaks between three-dimensional spectra were established in a computer-assisted manner using NMR COMPASS (Molecular Simulations, Inc.) Using this approach, {sup 1}H, {sup 13}C, and {sup 15}N resonance assignments have been established for 120 of the 131 residues of I-FABP. For 18 residues, amide {sup 1}H and {sup 15}N resonances were unobservable, apparently because of the rapid exchange of amide protons with bulk water at pH 7.2. The missing amide protons correspond to distinct amino acid patterns in the protein sequence, which will be discussed. During the assignment process, several sources of ambiguity in spin correlations were observed. To overcome this ambiguity, the additional inter-residue correlations often observed in the HNCA experiment were used as cross-checks for the sequential backbone assignments.

  19. Novel algorithms for protein sequence analysis

    NARCIS (Netherlands)

    Ye, Kai

    2008-01-01

    Each protein is characterized by its unique sequential order of amino acids, the so-called protein sequence. Biology”s paradigm is that this order of amino acids determines the protein”s architecture and function. In this thesis, we introduce novel algorithms to analyze protein sequences. Chapter 1

  20. Covariance of charged amino acids at positions 322 and 440 of HIV-1 Env contributes to coreceptor specificity of subtype B viruses, and can be used to improve the performance of V3 sequence-based coreceptor usage prediction algorithms.

    Directory of Open Access Journals (Sweden)

    Kieran Cashin

    Full Text Available The ability to determine coreceptor usage of patient-derived human immunodeficiency virus type 1 (HIV-1 strains is clinically important, particularly for the administration of the CCR5 antagonist maraviroc. The envelope glycoprotein (Env determinants of coreceptor specificity lie primarily within the gp120 V3 loop region, although other Env determinants have been shown to influence gp120-coreceptor interactions. Here, we determined whether conserved amino acid alterations outside the V3 loop that contribute to coreceptor usage exist, and whether these alterations improve the performance of V3 sequence-based coreceptor usage prediction algorithms. We demonstrate a significant covariant association between charged amino acids at position 322 in V3 and position 440 in the C4 Env region that contributes to the specificity of HIV-1 subtype B strains for CCR5 or CXCR4. Specifically, positively charged Lys/Arg at position 322 and negatively charged Asp/Glu at position 440 occurred more frequently in CXCR4-using viruses, whereas negatively charged Asp/Glu at position 322 and positively charged Arg at position 440 occurred more frequently in R5 strains. In the context of CD4-bound gp120, structural models suggest that covariation of amino acids at Env positions 322 and 440 has the potential to alter electrostatic interactions that are formed between gp120 and charged amino acids in the CCR5 N-terminus. We further demonstrate that inclusion of a "440 rule" can improve the sensitivity of several V3 sequence-based genotypic algorithms for predicting coreceptor usage of subtype B HIV-1 strains, without compromising specificity, and significantly improves the AUROC of the geno2pheno algorithm when set to its recommended false positive rate of 5.75%. Together, our results provide further mechanistic insights into the intra-molecular interactions within Env that contribute to coreceptor specificity of subtype B HIV-1 strains, and demonstrate that incorporation