acid sequence shows: Topics by WorldWideScience.org

Sample records for acid sequence shows

Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

International Nuclear Information System (INIS)

Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

1987-01-01

Apolipoprotein(a) [apo(a)] is a glycoprotein with Mr ∼ 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain.
SAAS: Short Amino Acid Sequence - A Promising Protein Secondary Structure Prediction Method of Single Sequence

Directory of Open Access Journals (Sweden)

Zhou Yuan Wu

2013-07-01

Statistical methods for predicting protein secondary structure usually focus on the frequencies of single amino acids in α-helices, β-sheets, and so on, or on the influence of neighbouring amino acids on the secondary structure that a given amino acid forms. This paper instead treats a short sequence of amino acids (3, 4, 5 or 6 residues) as a single unit and compiles statistics on the probability that each short sequence forms a given secondary structure. In addition, whereas many researchers build their statistical database from sequences of low homology, this paper uses the whole PDB database. We propose a strategy for predicting protein secondary structure with this simple statistical method. Numerical computation shows that treating short amino acid sequences as units makes the tendency of a short sequence to form a secondary structure easy to see, and that a large statistical database (the whole PDB, without regard to homology) works well: the Q3 accuracy of the proposed method is ca. 74%, whereas the accuracy of other statistical methods is less than 70%.
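
The short-sequence statistics described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the three-state alphabet (H, E, C), the per-offset voting scheme and the toy training pair are all assumptions made for illustration, and the real method is trained on the whole PDB.

```python
from collections import Counter, defaultdict


def train_kmer_table(sequences, structures, k=3):
    """Count which secondary-structure state is observed at each offset
    of every k-residue window (hypothetical helper, not from the paper)."""
    table = defaultdict(Counter)
    for seq, ss in zip(sequences, structures):
        for start in range(len(seq) - k + 1):
            kmer = seq[start:start + k]
            for offset in range(k):
                table[(kmer, offset)][ss[start + offset]] += 1
    return table


def predict_states(seq, table, k=3, default="C"):
    """Let every window covering a residue vote with its observed state
    frequencies; residues with no votes get the default state."""
    votes = [Counter() for _ in seq]
    for start in range(len(seq) - k + 1):
        kmer = seq[start:start + k]
        for offset in range(k):
            counts = table.get((kmer, offset))
            if counts:
                total = sum(counts.values())
                for state, n in counts.items():
                    votes[start + offset][state] += n / total
    return "".join(v.most_common(1)[0][0] if v else default for v in votes)


def q3_accuracy(predicted, observed):
    """Percentage of residues whose three-state label is predicted correctly."""
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Toy training data (invented for illustration only).
toy_seqs = ["AEELLKKGVVVTG"]
toy_ss = ["HHHHHHHCEEEEC"]
toy_table = train_kmer_table(toy_seqs, toy_ss, k=3)
```

With a table built this way, `predict_states(new_sequence, toy_table)` returns one state letter per residue, and `q3_accuracy` compares the result with a known assignment.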
Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

Science.gov (United States)

Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

2017-04-01

The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes among other several distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLPs) encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP-genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for better understanding of the mechanisms involved in SLPs functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.
Proteomics shows Hsp70 does not bind peptide sequences indiscriminately in vivo

International Nuclear Information System (INIS)

Grossmann, Michael E.; Madden, Benjamin J.; Gao, Fan; Pang, Yuan-Ping; Carpenter, John E.; McCormick, Daniel; Young, Charles Y.F.

2004-01-01

Heat shock protein 70 (Hsp70) binds peptide and has several functions that include protein folding, protein trafficking, and involvement with immune function. However, endogenous Hsp70-binding peptides had not previously been identified. Therefore, we eluted and identified several hundred endogenously bound peptides from Hsp70 using liquid chromatography ion trap mass spectrophotometry (LC-ITMS). Our work shows that the peptides are capable of binding Hsp70 as previously described. They are generally 8-26 amino acids in length and correspond to specific regions of many proteins. Through computationally assisted analysis of peptides eluted from Hsp70 we determined variable amino acid sequences, including a 5 amino acid core sequence that Hsp70 favorably binds. We also developed a computer algorithm that predicts Hsp70 binding within proteins. This work helps to define what peptides are bound by Hsp70 in vivo and suggests that Hsp70 facilitates peptide selection by aiding a funneling mechanism that is flexible but allows only a limited number of peptides to be processed
Detection of nucleic acid sequences by invader-directed cleavage

Science.gov (United States)

Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

Science.gov (United States)

Haggarty, N W; Dunbar, B; Fothergill, L A

1983-01-01

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356
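
A figure such as "47% identical" is typically computed from a pairwise alignment. The sketch below shows one common convention in Python, counting matches over columns where neither sequence has a gap. The abstract does not say which denominator the authors used, so this convention, the function name and the gap character are assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns in which both sequences
    have a residue (one of several conventions in use)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != gap and b != gap]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)
```

For the toy alignment `ACDE-G` against `ACDFHG`, four of the five gap-free columns match, so the function returns 80.0.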
Hybridization and sequencing of nucleic acids using base pair mismatches

Science.gov (United States)

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

Science.gov (United States)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique "electronic fingerprints" for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.
Optimization of short amino acid sequences classifier

Science.gov (United States)

Barcz, Aleksy; Szymański, Zbigniew

This article describes processing methods used for short amino acid sequences classification. The data processed are 9-symbols string representations of amino acid sequences, divided into 49 data sets - each one containing samples labeled as reacting or not with given enzyme. The goal of the classification is to determine for a single enzyme, whether an amino acid sequence would react with it or not. Each data set is processed separately. Feature selection is performed to reduce the number of dimensions for each data set. The method used for feature selection consists of two phases. During the first phase, significant positions are selected using Classification and Regression Trees. Afterwards, symbols appearing at the selected positions are substituted with numeric values of amino acid properties taken from the AAindex database. In the second phase the new set of features is reduced using a correlation-based ranking formula and Gram-Schmidt orthogonalization. Finally, the preprocessed data is used for training LS-SVM classifiers. SPDE, an evolutionary algorithm, is used to obtain optimal hyperparameters for the LS-SVM classifier, such as error penalty parameter C and kernel-specific hyperparameters. A simple score penalty is used to adapt the SPDE algorithm to the task of selecting classifiers with best performance measures values.
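
The substitution step in this abstract, where symbols at selected positions are replaced by numeric amino acid properties, can be sketched in Python as follows. The abstract does not name the AAindex properties used, so the Kyte-Doolittle hydropathy scale stands in here as an assumed example; the function name and the chosen positions are likewise hypothetical.

```python
# Kyte-Doolittle hydropathy values, used as a stand-in property scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode_peptide(peptide, positions, scale=KYTE_DOOLITTLE):
    """Replace the residues at the selected 0-based positions with their
    numeric property values, giving a feature vector for a classifier."""
    return [scale[peptide[p]] for p in positions]


# A 9-symbol peptide with three positions assumed to be significant.
features = encode_peptide("ACDEFGHIK", [0, 4, 8])
```

In the pipeline the abstract describes, the positions would come from the tree-based selection phase, and the resulting vectors would be reduced further before classifier training.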
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

Science.gov (United States)

Pietrowski, D; Förster, M

2000-01-01

The complete cDNA sequence of a ribonuclease k6 gene of Bos taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acid exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

Science.gov (United States)

Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

2015-01-01

Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.
MEANS AND METHODS FOR CLONING NUCLEIC ACID SEQUENCES

NARCIS (Netherlands)

Geertsma, Eric Robin; Poolman, Berend

2008-01-01

The invention provides means and methods for efficiently cloning nucleic acid sequences of interest in micro-organisms that are less amenable to conventional nucleic acid manipulations, as compared to, for instance, E.coli. The present invention enables high-throughput cloning (and, preferably,
Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

DEFF Research Database (Denmark)

Thomsen, Martin Christen Frølund; Nielsen, Morten

2012-01-01

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally... related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein...
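
The "information content" that a sequence logo displays can be illustrated with the classical Shannon formulation, in which a column over a 20-letter alphabet carries log2(20) minus its entropy, in bits. The Python sketch below is a simplified illustration only: it applies none of the sequence weighting, pseudo counts or depletion display that the title attributes to Seq2Logo, and the function names are hypothetical.

```python
import math
from collections import Counter


def column_information(column, alphabet_size=20, gap="-"):
    """Shannon information content (bits) of one alignment column."""
    residues = [c for c in column if c != gap]
    if not residues:
        return 0.0
    n = len(residues)
    entropy = -sum((k / n) * math.log2(k / n)
                   for k in Counter(residues).values())
    return math.log2(alphabet_size) - entropy


def logo_heights(msa, alphabet_size=20, gap="-"):
    """Per-column letter heights: residue frequency times column information."""
    heights = []
    for column in zip(*msa):
        residues = [c for c in column if c != gap]
        info = column_information(column, alphabet_size, gap)
        counts = Counter(residues)
        heights.append({aa: info * k / len(residues)
                        for aa, k in counts.items()})
    return heights
```

A fully conserved column reaches the maximum of log2(20), about 4.32 bits, while a column split evenly between two residues carries one bit less.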
Amino acid sequence analysis of the annexin super-gene family of proteins.

Science.gov (United States)

Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

1991-06-15

The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of
An alignment-free method to find similarity among protein sequences via the general form of Chou's pseudo amino acid composition.

Science.gov (United States)

Gupta, M K; Niyogi, R; Misra, M

2013-01-01

In this paper, we propose a method to create the 60-dimensional feature vector for protein sequences via the general form of pseudo amino acid composition. The construction of the feature vector is based on the contents of amino acids, total distance of each amino acid from the first amino acid in the protein sequence and the distribution of 20 amino acids. The obtained cosine distance metric (also called the similarity matrix) is used to construct the phylogenetic tree by the neighbour joining method. In order to show the applicability of our approach, we tested it on three proteins: 1) ND5 protein sequences from nine species, 2) ND6 protein sequences from eight species, and 3) 50 coronavirus spike proteins. The results are in agreement with known history and the output from the multiple sequence alignment program ClustalW, which is widely used. We have also compared our phylogenetic results with six other recently proposed alignment-free methods. These comparisons show that our proposed method gives a more consistent biological relationship than the others. In addition, the time complexity is linear and space required is less as compared with other alignment-free methods that use graphical representation. It should be noted that the multiple sequence alignment method has exponential time complexity.
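
A rough Python sketch of such a 60-dimensional vector and the cosine distance between two vectors is given below. The abstract names the three components (content, total distance from the first amino acid, distribution) but not their formulas, so the definitions used here, namely the count, the sum of 0-based positions and the variance of those positions, are assumptions and may differ from the paper.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def feature_vector(seq):
    """Three numbers per amino acid (assumed definitions): occurrence count,
    summed distance from the first residue, and variance of those distances."""
    features = []
    for aa in AMINO_ACIDS:
        positions = [i for i, residue in enumerate(seq) if residue == aa]
        n = len(positions)
        total = sum(positions)
        if n:
            mean = total / n
            variance = sum((p - mean) ** 2 for p in positions) / n
        else:
            variance = 0.0
        features.extend([n, total, variance])
    return features


def cosine_distance(u, v):
    """One minus the cosine of the angle between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    if norm == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / norm
```

Each vector is built in a fixed number of passes over the sequence, which is consistent with the linear time complexity claimed in the abstract. A matrix of pairwise distances computed this way could then be passed to a neighbour-joining implementation.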
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.

Science.gov (United States)

Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S

1985-07-01

The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
[Complete genome sequencing of polymalic acid-producing strain Aureobasidium pullulans CCTCC M2012223].

Science.gov (United States)

Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang

2017-01-04

To explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide genetic background for metabolic engineering, the complete genome of A. pullulans CCTCC M2012223 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Fragment assembly, gene prediction, functional annotation, and GO/COG clustering were then carried out and compared with those of five other A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30756831 bp with an average GC content of 47.49%, and 9452 genes were successfully predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the largest genome assembly. Protein sequences involved in the pullulan and polymalic acid pathways were highly conserved in all six A. pullulans varieties. Although A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum are closely related, some point mutations and insertions occurred in protein sequences involved in melanin biosynthesis. The genome of A. pullulans CCTCC M2012223 was annotated and genes involved in the melanin, pullulan and polymalic acid pathways were compared, which provides a theoretical basis for genetic modification of metabolic pathways in A. pullulans.
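
The average GC content quoted in this abstract is the share of G and C among the called bases of the assembly. A minimal Python sketch is shown below; ignoring ambiguous symbols such as N is an assumption of this sketch, since the abstract does not say how they were handled, and the function name is hypothetical.

```python
def gc_content(sequence):
    """Percentage of G and C among the unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * gc / (gc + at)
```

For example, `gc_content("atNNgc")` counts two G/C bases among four called bases and returns 50.0.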
Hydroquinone: O-glucosyltransferase from cultivated Rauvolfia cells: enrichment and partial amino acid sequences.

Science.gov (United States)

Arend, J; Warzecha, H; Stöckigt, J

2000-01-01

Plant cell suspension cultures of Rauvolfia are able to produce a high amount of arbutin by glucosylation of exogenously added hydroquinone. A four-step purification procedure using anion exchange, hydrophobic interaction, hydroxyapatite chromatography and chromatofocusing gave an approximately 390-fold enrichment of the glucosyltransferase involved, in a yield of 0.5%. SDS-PAGE showed an Mr of 52 kDa for the enzyme. Proteolysis of the pure enzyme with endoproteinase LysC revealed six peptide fragments of 9-23 amino acids, which were sequenced. Sequence alignment of the six peptides showed high homologies to glycosyltransferases from other higher plants.

Recent advances in nanopore-based nucleic acid analysis and sequencing

International Nuclear Information System (INIS)

Shi, Jidong; Fang, Ying; Hou, Junfeng

2016-01-01

Nanopore-based sequencing platforms are transforming the field of genomic science. This review (containing 116 references) highlights some recent progress on nanopore-based nucleic acid analysis and sequencing. These studies are classified into three categories, biological, solid-state, and hybrid nanopores, according to their nanoporous materials. We begin with a brief description of the translocation-based detection mechanism of nanopores. Next, specific examples are given in nanopore-based nucleic acid analysis and sequencing, with an emphasis on identifying strategies that can improve the resolution of nanopores. This review concludes with a discussion of future research directions that will advance the practical applications of nanopore technology. (author)
WEB-server for search of a periodicity in amino acid and nucleotide sequences

Science.gov (United States)

Frenkel, F. E.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based on a new mathematical method of searching for multiple alignments, which is founded on the optimization of position weight matrices and on two-dimensional dynamic programming. This approach allows the construction of multiple alignments of weakly similar amino acid and nucleotide sequences that have accumulated more than 1.5 substitutions per amino acid or nucleotide, without performing pairwise comparisons of the sequences. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Leishmania-specific surface antigens show sub-genus sequence variation and immune recognition.

Directory of Open Access Journals (Sweden)

Daniel P Depledge

2010-09-01

A family of hydrophilic acylated surface (HASP) proteins, containing extensive and variant amino acid repeats, is expressed at the plasma membrane in infective extracellular (metacyclic) and intracellular (amastigote) stages of Old World Leishmania species. While HASPs are antigenic in the host and can induce protective immune responses, the biological functions of these Leishmania-specific proteins remain unresolved. Previous genome analysis has suggested that parasites of the sub-genus Leishmania (Viannia) have lost HASP genes from their genomes. We have used molecular and cellular methods to analyse HASP expression in New World Leishmania mexicana complex species and show that, unlike in L. major, these proteins are expressed predominantly following differentiation into amastigotes within macrophages. Further genome analysis has revealed that the L. (Viannia) species, L. (V.) braziliensis, does express HASP-like proteins of low amino acid similarity but with similar biochemical characteristics, from genes present on a region of chromosome 23 that is syntenic with the HASP/SHERP locus in Old World Leishmania species and the L. (L.) mexicana complex. A related gene is also present in Leptomonas seymouri and this may represent the ancestral copy of these Leishmania-genus specific sequences. The L. braziliensis HASP-like proteins (named the orthologous (o) HASPs) are predominantly expressed on the plasma membrane in amastigotes and are recognised by immune sera taken from 4 out of 6 leishmaniasis patients tested in an endemic region of Brazil. Analysis of the repetitive domains of the oHASPs has shown considerable genetic variation in parasite isolates taken from the same patients, suggesting that antigenic change may play a role in immune recognition of this protein family. These findings confirm that antigenic hydrophilic acylated proteins are expressed from genes in the same chromosomal region in species across the genus Leishmania. These proteins are
Representation of protein-sequence information by amino acid subalphabets

DEFF Research Database (Denmark)

Andersen, C.A.F.; Brunak, Søren

2004-01-01

-sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...
Single-Labeled Oligonucleotides Showing Fluorescence Changes upon Hybridization with Target Nucleic Acids

Directory of Open Access Journals (Sweden)

Gil Tae Hwang

2018-01-01

Sequence-specific detection of nucleic acids has been intensively studied in the field of molecular diagnostics. In particular, the detection and analysis of single-nucleotide polymorphisms (SNPs) is crucial for the identification of disease-causing genes and diagnosis of diseases. Sequence-specific hybridization probes, such as molecular beacons bearing the fluorophore and quencher at both ends of the stem, have been developed to enable DNA mutation detection. Interestingly, DNA mutations can be detected using fluorescently labeled oligonucleotide probes with only one fluorophore. This review summarizes recent research on single-labeled oligonucleotide probes that exhibit fluorescence changes after encountering target nucleic acids, such as guanine-quenching probes, cyanine-containing probes, probes containing a fluorophore-labeled base, and microenvironment-sensitive probes.
Soil amino acid composition across a boreal forest successional sequence

Science.gov (United States)

Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone

2009-01-01

Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

Science.gov (United States)

Nishizawa, M; Nishizawa, K

2000-10-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Human acid β-glucosidase: isolation and amino acid sequence of a peptide containing the catalytic site

International Nuclear Information System (INIS)

Dinur, T.; Osiecki, K.M.; Legler, G.; Gatt, S.; Desnick, R.J.; Grabowski, G.A.

1986-01-01

Human acid β-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase, EC 3.2.1.45) cleaves the glucosidic bonds of glucosylceramide and synthetic β-glucosides. The deficient activity of this hydrolase is the enzymatic defect in the subtypes and variants of Gaucher disease, the most prevalent lysosomal storage disease. To isolate and characterize the catalytic site of the normal enzyme, brominated 3H-labeled conduritol B epoxide (3H-Br-CBE), which inhibits the enzyme by binding covalently to this site, was used as an affinity label. Under optimal conditions 1 mol of 3H-Br-CBE bound to 1 mol of pure enzyme protein, indicating the presence of a single catalytic site per enzyme subunit. After V8 protease digestion of the 3H-Br-CBE-labeled homogeneous enzyme, three radiolabeled peptides, designated peptides A, B, and C, were resolved by reverse-phase HPLC. The partial amino acid sequence (37 residues) of peptide A (Mr, 5000) was determined. The sequence of this peptide, which contained the catalytic site, had exact homology to the sequence near the carboxyl terminus of the protein, as predicted from the nucleotide sequence of the full-length cDNA encoding acid β-glucosidase
Amino acid sequences and structures of chicken and turkey beta 2-microglobulin

DEFF Research Database (Denmark)

Welinder, K G; Jespersen, H M; Walther-Rasmussen, J

1991-01-01

The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11...
Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

International Nuclear Information System (INIS)

Chang, Soo-Ik; Hammes, G.G.

1989-01-01

Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homology exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The NADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the NADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution
The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

DEFF Research Database (Denmark)

Behrendt, N; Rønne, E; Ploug, M

1990-01-01

-PA. The purified protein shows a single 55-60 kDa band after sodium dodecyl sulfate-polyacrylamide gel electrophoresis and silver staining. It is a heavily glycosylated protein, the deglycosylated polypeptide chain comprising only 35 kDa. The glycosylated protein contains N-acetyl-D-glucosamine and sialic acid......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported...
The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

Science.gov (United States)

Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

1986-07-01

Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.
Metazoan Remaining Genes for Essential Amino Acid Biosynthesis: Sequence Conservation and Evolutionary Analyses

Directory of Open Access Journals (Sweden)

Igor R. Costa

2014-12-01

Essential amino acids (EAA) consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS) and betaine-homocysteine S-methyltransferase (BHMT) diverged from the expected Tree of Life (ToL) relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.
Purification and amino acid sequence of a bacteriocins produced by Lactobacillus salivarius K7 isolated from chicken intestine

Directory of Open Access Journals (Sweden)

Kenji Sonomoto

2006-03-01

A bacteriocin-producing strain, Lactobacillus K7, was isolated from a chicken intestine. The inhibitory activity was determined by the spot-on-lawn technique. Identification of the strain was performed on a morphological, biochemical (API 50 CH kit), and molecular genetic (16S rDNA) basis. Bacteriocin purification was carried out by amberlite adsorption, cation exchange, and reverse-phase high performance liquid chromatography. N-terminal amino acid sequences were determined by Edman degradation. Molecular mass was determined by electrospray-ionization (ESI) mass spectrometry (MS). Lactobacillus K7 showed inhibitory activity against Lactobacillus sakei subsp. sakei JCM 1157T, Leuconostoc mesenteroides subsp. mesenteroides JCM 6124T and Bacillus coagulans JCM 2257T. This strain was identified as Lb. salivarius. The antimicrobial substance was destroyed by proteolytic enzymes, indicating a proteinaceous structure, and it was therefore designated a bacteriocin. The purification of the bacteriocin by amberlite adsorption, cation exchange, and reverse-phase chromatography resulted in a single active peak, which was designated FK22. The molecular weight of this fraction was 4331.70 Da. By amino acid sequence, this peptide was homologous to Abp 118 beta produced by Lb. salivarius UCC118. In addition, Lb. salivarius UCC118 produces a two-peptide bacteriocin consisting of Abp 118 alpha and beta. Based on the partial amino acid sequences of Abp 118 beta, specific primers were designed from nucleotide sequences according to data from GenBank. The result showed that the deduced peptide was highly homologous to the two-peptide bacteriocin, Abp 118 alpha and beta.
Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

Energy Technology Data Exchange (ETDEWEB)

Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.]; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States)]; Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section]; Wain-Hobson, S. [ed.] [Pasteur Inst., Paris (France)]

1997-04-01

This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nucleic Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.
RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

DEFF Research Database (Denmark)

Wernersson, Rasmus; Pedersen, Anders Gorm

2003-01-01

The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...... proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...
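The record above is truncated, so the following is only a minimal sketch of one plausible reading of the final step: once the peptides have been aligned, each aligned residue is replaced by its original codon and each gap by three gap characters. The function name, the gap symbol, and the toy sequences are assumptions for illustration and are not taken from RevTrans itself.

```python
def thread_dna_onto_protein_alignment(aligned_protein: str, dna: str, gap: str = "-") -> str:
    """Map an ungapped coding DNA sequence onto its gapped protein alignment row.

    Each residue becomes its codon; each protein gap becomes a three-column gap.
    Assumes the DNA is in frame and encodes exactly the ungapped residues.
    """
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    n_residues = sum(1 for ch in aligned_protein if ch != gap)
    if len(dna) % 3 != 0 or n_residues != len(codons):
        raise ValueError("DNA length must be 3 x the number of ungapped residues")
    codon_iter = iter(codons)
    return "".join(gap * 3 if ch == gap else next(codon_iter) for ch in aligned_protein)


# Hypothetical input: a protein alignment (steps i and ii are assumed done elsewhere)
aligned = {"seq1": "MK-L", "seq2": "MKAL"}
coding = {"seq1": "ATGAAACTG", "seq2": "ATGAAGGCTCTG"}

dna_alignment = {
    name: thread_dna_onto_protein_alignment(aligned[name], coding[name])
    for name in aligned
}
for name, row in dna_alignment.items():
    print(name, row)
```

Because gaps are always inserted in multiples of three, the reading frame of every sequence is preserved in the resulting DNA alignment.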
The amino acid sequence of snapping turtle (Chelydra serpentina) ribonuclease

NARCIS (Netherlands)

Beintema, Jacob; Broos, Jaap; Meulenberg, Janneke; Schüller, Cornelis

1985-01-01

Snapping turtle (Chelydra serpentina) ribonuclease was isolated from pancreatic tissue. Turtle ribonuclease binds much more weakly to the affinity chromatography matrix used than mammalian ribonucleases. The amino acid sequence was determined from overlapping peptides obtained from three different
Amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui.

Science.gov (United States)

Hatakeyama, T; Hatakeyama, T

1990-07-06

The complete amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui were determined. Protein HL30 was found to be acetylated at its N-terminal amino acid and shows homology to the eukaryotic ribosomal proteins YL34 from yeast and RL31 from rat. Protein HmaL5 was homologous to the protein L5 from Escherichia coli and Bacillus stearothermophilus as well as to YL16 from yeast. HmaL5 shows more similarities to its eukaryotic counterpart than to eubacterial ones.
N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

OpenAIRE

Kuhn, H; Fietzek, P P; Lampen, J O

1982-01-01

The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.
PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.

Science.gov (United States)

Kuznetsov, Igor B; McDuffie, Michael

2015-05-07

Alignment of amino acid sequences is the main sequence comparison method used in computational molecular biology. The selection of the amino acid substitution matrix best suitable for a given alignment problem is one of the most important decisions the user has to make. In a conventional amino acid substitution matrix all elements are fixed and their values cannot be easily adjusted. Moreover, most existing amino acid substitution matrices account for the average (dis)similarities between amino acid types and do not distinguish the contribution of a specific biochemical property to these (dis)similarities. PR2ALIGN is a stand-alone software program and a web-server that provide the functionality for implementing flexible user-specified alignment scoring functions and aligning pairs of amino acid sequences based on the comparison of the profiles of biochemical properties of these sequences. Unlike the conventional sequence alignment methods that use 20x20 fixed amino acid substitution matrices, PR2ALIGN uses a set of weighted biochemical properties of amino acids to measure the distance between pairs of aligned residues and to find an optimal minimal distance global alignment. The user can provide any number of amino acid properties and specify a weight for each property. The higher the weight for a given property, the more this property affects the final alignment. We show that in many cases the approach implemented in PR2ALIGN produces better quality pair-wise alignments than the conventional matrix-based approach. PR2ALIGN will be helpful for researchers who wish to align amino acid sequences by using flexible user-specified alignment scoring functions based on the biochemical properties of amino acids instead of the amino acid substitution matrix. To the best of the authors' knowledge, there are no existing stand-alone software programs or web-servers analogous to PR2ALIGN. The software is freely available from http://pr2align.rit.albany.edu.
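The idea of a weighted-property distance feeding a minimal-distance global alignment can be sketched as below. This is not the PR2ALIGN implementation: the property table holds made-up toy values, the distance is assumed to be a weighted sum of absolute differences, and the linear gap cost is an arbitrary choice.

```python
from typing import Sequence

# Toy, illustrative property values per residue (NOT measured data):
# (hydropathy-like value, size-like value)
PROPS = {
    "A": (1.8, 0.3),
    "L": (3.8, 0.9),
    "K": (-3.9, 0.7),
    "D": (-3.5, 0.4),
    "G": (-0.4, 0.0),
}


def residue_distance(a: str, b: str, weights: Sequence[float]) -> float:
    """Weighted sum of absolute property differences (an assumed distance form)."""
    return sum(w * abs(x - y) for w, x, y in zip(weights, PROPS[a], PROPS[b]))


def min_distance_global(s: str, t: str, weights: Sequence[float], gap: float = 2.0) -> float:
    """Needleman-Wunsch style dynamic programme that minimises total distance."""
    prev = [j * gap for j in range(len(t) + 1)]
    for i in range(1, len(s) + 1):
        cur = [i * gap]
        for j in range(1, len(t) + 1):
            cur.append(min(
                prev[j - 1] + residue_distance(s[i - 1], t[j - 1], weights),  # align pair
                prev[j] + gap,                                                 # gap in t
                cur[j - 1] + gap,                                              # gap in s
            ))
        prev = cur
    return prev[-1]


# Raising the weight of the first property makes A-vs-K mismatches costlier.
print(residue_distance("A", "K", (1.0, 1.0)))
print(residue_distance("A", "K", (2.0, 1.0)))
print(min_distance_global("ALKD", "ALD", (1.0, 1.0)))
```

Only the optimal total distance is returned here; recovering the alignment itself would require keeping traceback pointers.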

Correlation between fibroin amino acid sequence and physical silk properties.

Science.gov (United States)

Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

2003-09-12

The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Science.gov (United States)

2010-07-01

... mature protein, with the number 1. When presented, the amino acids preceding the mature protein, e.g... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter... data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
TaALMT1 promoter sequence compositions, acid tolerance, and Al tolerance in wheat cultivars and landraces from Sichuan in China.

Science.gov (United States)

Han, C; Dai, S F; Liu, D C; Pu, Z J; Wei, Y M; Zheng, Y L; Wen, D J; Zhao, L; Yan, Z H

2013-11-18

Previous genetic studies on wheat from various sources have indicated that aluminum (Al) tolerance may have originated independently in USA, Brazil, and China. Here, TaALMT1 promoter sequences of 92 landraces and cultivars from Sichuan, China, were sequenced. Five promoter types (I', II, III, IV, and V) were observed in 39 cultivars, and only three promoter types (I, II, and III) were observed in 53 landraces. Among the wheat collections worldwide, only the Chinese Spring (CS) landrace native to Sichuan, China, carried the TaALMT1 promoter type III. Besides CS, two other Sichuan-bred landraces and six cultivars with TaALMT1 promoter type III were identified in this study. In the phylogenetic tree constructed based on the TaALMT1 promoter sequences, type III formed a separate branch, which was supported by a high bootstrap value. It is likely that TaALMT1 promoter type III originated from Sichuan-bred wheat landraces of China. In addition, the landraces with promoter type I showed the lowest Al tolerance among all landraces and cultivars. Furthermore, the cultivars with promoter type IV showed better Al tolerance than landraces with promoter type II. A comparison of acid tolerance and Al tolerance between cultivars and landraces showed that the landraces had better acid tolerance than the cultivars, whereas the cultivars showed better Al tolerance than the landraces. Moreover, significant difference in Al tolerance was also observed between the cultivars raised by the National Ministry of Agriculture and by Sichuan Province. Among the landraces from different regions, those from the East showed better acid tolerance and Al tolerance than those from the South and West of Sichuan. Additional Al-tolerant and acid-tolerant wheat lines were also identified.
fCCAC: functional canonical correlation analysis to evaluate covariance between nucleic acid sequencing datasets.

Science.gov (United States)

Madrigal, Pedro

2017-03-01

Computational evaluation of variability across DNA or RNA sequencing datasets is a crucial step in genomic science, as it allows both evaluation of the reproducibility of biological or technical replicates and comparison of different datasets to identify their potential correlations. Here we present fCCAC, an application of functional canonical correlation analysis to assess covariance of nucleic acid sequencing datasets such as chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We show how this method differs from other measures of correlation, and exemplify how it can reveal shared covariance between histone modifications and DNA binding proteins, such as the relationship between the H3K4me3 chromatin mark and its epigenetic writers and readers. An R/Bioconductor package is available at http://bioconductor.org/packages/fCCAC/.
Nonlinear analysis of sequence repeats of multi-domain proteins

Energy Technology Data Exchange (ETDEWEB)

Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

2007-11-15

Many multi-domain proteins have repetitive three-dimensional structures but nearly random amino acid sequences. In the present paper, by using a modified recurrence plot that we proposed previously, we show that these amino acid sequences in fact contain hidden repetitions. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.
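As a rough illustration of the general technique, the sketch below builds a plain recurrence matrix for a sequence and scores each diagonal offset by the fraction of matching positions. The abstract does not describe the authors' modification, so this is the textbook form only; the function names and the toy sequence are assumptions.

```python
from typing import List


def recurrence_matrix(seq: str) -> List[List[int]]:
    """R[i][j] is 1 when residues i and j are identical, else 0."""
    return [[1 if a == b else 0 for b in seq] for a in seq]


def diagonal_match_fraction(seq: str, offset: int) -> float:
    """Fraction of positions i with seq[i] == seq[i + offset] (one plot diagonal)."""
    n = len(seq) - offset
    return sum(1 for i in range(n) if seq[i] == seq[i + offset]) / n


def best_repeat_offset(seq: str, min_offset: int = 1) -> int:
    """Offset (candidate repeat period) whose diagonal is most densely populated."""
    offsets = range(min_offset, len(seq) // 2 + 1)
    return max(offsets, key=lambda k: diagonal_match_fraction(seq, k))


toy = "ACDEFGHK" * 3  # hypothetical sequence with an exact period of 8
matrix = recurrence_matrix(toy)
print(best_repeat_offset(toy))
```

A densely populated off-diagonal line at offset k suggests a repeat of period k; real protein repeats are degenerate, so a practical version would score similar rather than only identical residues.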
Isolation and amino acid sequence of a short-chain neurotoxin from an Australian elapid snake, Pseudechis australis.

OpenAIRE

Takasaki, C; Tamiya, N

1985-01-01

A short-chain neurotoxin Pseudechis australis a (toxin Pa a) was isolated from the venom of an Australian elapid snake Pseudechis australis (king brown snake) by sequential chromatography on CM-cellulose, Sephadex G-50 and CM-cellulose columns. Toxin Pa a has an LD50 (intravenous) value of 76 micrograms/kg body wt. in mice and consists of 62 amino acid residues. The amino acid sequence of Pa a shows considerable homology with those of short-chain neurotoxins of elapid snakes, especially of tr...
Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

International Nuclear Information System (INIS)

Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

1987-01-01

The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

Science.gov (United States)

McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Secondary structure classification of amino-acid sequences using state-space modeling

OpenAIRE

Brunnert, Marcus; Krahnke, Tillmann; Urfer, Wolfgang

2001-01-01

The secondary structure classification of amino acid sequences can be carried out by a statistical analysis of sequence and structure data using state-space models. Aiming at this classification, a modified filter algorithm programmed in S is applied to data of three proteins. The application leads to correct classifications of two proteins even when using relatively simple estimation methods for the parameters of the state-space models. Furthermore, it has been shown that the assumed initial...
Amino-acid sequence of two trypsin isoinhibitors, ITD I and ITD III from squash seeds (Cucurbita maxima).

Science.gov (United States)

Wilusz, T; Wieczorek, M; Polanowski, A; Denton, A; Cook, J; Laskowski, M

1983-01-01

The amino-acid sequences of two trypsin isoinhibitors, ITD I and ITD III, from squash seeds (Cucurbita maxima) were determined. Both isoinhibitors contain 29 amino-acid residues, including 6 half cystine residues. They differ only by one amino acid. Lysine in position 9 of ITD III is substituted by glutamic acid in ITD I. Arginine in position 5 is present at the reactive site of both isoinhibitors. The previously published sequence of ITD III has been shown to be incorrect.
Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

Directory of Open Access Journals (Sweden)

Michael Andrew Meyer

2013-07-01

The JC polyoma viral coat protein VP1 was analyzed for amino acid sequence homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid-containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.
Prediction of beta-turns from amino acid sequences using the residue-coupled model.

Science.gov (United States)

Guruprasad, K; Shukla, S

2003-04-01

We evaluated the prediction of beta-turns from amino acid sequences using the residue-coupled model with an enlarged representative protein data set selected from the Protein Data Bank. Our results show that the probability values derived from a data set comprising 425 protein chains yielded an overall beta-turn prediction accuracy of 68.74%, compared with 94.7% reported earlier on a data set of 30 proteins using the same method. However, we noted that the overall beta-turn prediction accuracy using probability values derived from the 30-protein data set falls to 40.74% when tested on the data set comprising 425 protein chains. In contrast, using probability values derived from the 425-chain data set used in this analysis, the overall beta-turn prediction accuracy yielded consistent results when tested on either the 30-protein data set (64.62%) used earlier or a more recent representative data set comprising 619 protein chains (64.66%) or on a jackknife data set comprising 476 representative protein chains (63.38%). We therefore recommend the use of probability values derived from the 425 representative protein chains data set reported here, which gives more realistic and consistent predictions of beta-turns from amino acid sequences.
Predicting protein amidation sites by orchestrating amino acid sequence features

Science.gov (United States)

Zhao, Shuqiu; Yu, Hua; Gong, Xiujun

2017-08-01

Amidation is the fourth major category of post-translational modifications and plays an important role in physiological and pathological processes. Identifying amidation sites can help us understand amidation and recognize the underlying causes of many kinds of diseases. However, the traditional experimental methods for identifying amidation sites are often time-consuming and expensive. In this study, we propose a computational method for predicting amidation sites by orchestrating amino acid sequence features. Three kinds of feature extraction methods are used to build a feature vector that captures not only the physicochemical properties but also the position-related information of the amino acids. An extremely randomized trees algorithm is applied to choose the optimal features and to remove redundancy and dependence among components of the feature vector in a supervised fashion. Finally, a support vector machine classifier is used to label the amidation sites. When tested on an independent data set, the proposed method performs better than all the previous ones, with a prediction accuracy of 0.962, a Matthews correlation coefficient of 0.89, and an area under the curve of 0.964.
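For readers unfamiliar with the two threshold-based metrics quoted above, the sketch below computes accuracy and the Matthews correlation coefficient from a binary confusion matrix. The counts in the usage example are hypothetical and are not the study's data.

```python
import math


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_cc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient; returns 0.0 when a marginal total is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical confusion-matrix counts for a balanced test set of 100 sites
tp, tn, fp, fn = 45, 45, 5, 5
print(accuracy(tp, tn, fp, fn), matthews_cc(tp, tn, fp, fn))
```

Unlike accuracy, the coefficient stays informative on imbalanced data because it uses all four cells of the confusion matrix.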
Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

International Nuclear Information System (INIS)

Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L.

1988-01-01

Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity
ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

Science.gov (United States)

Meiler, Arno; Klinger, Claudia; Kaufmann, Michael

2012-09-08

The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
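The kinds of sequence statistics named in this abstract (GC-content and codon frequencies) are simple to compute for a single coding sequence. The sketch below is a generic illustration, not ANCAC's implementation; the function names and the toy sequence are assumptions.

```python
from collections import Counter


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def codon_counts(cds):
    """Count in-frame codons, ignoring any trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


# Toy coding sequence (hypothetical): ATG AAA AAA TAA
toy = "ATGAAAAAATAA"
print(gc_content(toy))
print(codon_counts(toy))
```

Aggregating such counts over all members of a COG, grouped by species or function, would give the kind of bias profile the tool is described as reporting.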
Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

Energy Technology Data Exchange (ETDEWEB)

Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States)]; Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.]; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology]; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility]

1993-12-31

This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.
Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

Directory of Open Access Journals (Sweden)

Samat Kozhakhmetov

2014-01-01

Introduction: Lactobacilli are bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microorganism generally regarded as safe (GRAS). It is also a homofermentative L-(+)-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of phenotypic and genotypic diversity. However, studies of the genus were conducted mostly with an unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of the Center for Life Sciences. After screening isolates for probiotic properties, a highly active Lactobacillus spp. strain was chosen. Genomic DNA was extracted according to the manufacturer's protocol (Wizard® Genomic DNA Purification Kit). The highly active Lactobacillus strain was identified as Lactobacillus rhamnosus according to its morphological, cultural, physiological, and biochemical properties and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX) platform. The initial draft assembly was prepared from 14 large contigs (20 contigs in all) by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT). Conclusion: Full genome sequencing of selected strains of lactic acid bacteria was carried out during the study.
Isolation and amino acid sequence of corticotropin-releasing factor from pig hypothalami.

OpenAIRE

Patthy, M; Horvath, J; Mason-Garcia, M; Szoke, B; Schlesinger, D H; Schally, A V

1985-01-01

A polypeptide was isolated from acid extracts of porcine hypothalami on the basis of its high ability to stimulate the release of corticotropin from superfused rat pituitary cells. After an initial separation by gel filtration on Sephadex G-25, further purification was carried out by reversed-phase HPLC. The isolated material was homogeneous chromatographically and by N-terminal sequencing. Based on automated gas-phase sequencing of the intact and CNBr-cleaved peptide and on carboxypeptidase ...
Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

Science.gov (United States)

Kimura, M; Kimura, J; Hatakeyama, T

1988-11-21

The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
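Percent-identity figures such as those quoted above come from counting matching positions in a pairwise alignment. The sketch below assumes the two sequences are already aligned to equal length with `-` as the gap character, and it divides by the number of gap-free columns; other denominators (for example the shorter sequence length) are also in common use, so this is one convention rather than the one the authors necessarily applied.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identical residues over gap-free columns of an alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned fragments, not real S11 sequences.
print(percent_identity("ACDE", "ACDF"))
```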
Amino acid sequence and biological characterization of BlatPLA₂, a non-toxic acidic phospholipase A₂ from the venom of the arboreal snake Bothriechis lateralis from Costa Rica.

Science.gov (United States)

Van der Laat, Marco; Fernández, Julián; Durban, Jordi; Villalobos, Eva; Camacho, Erika; Calvete, Juan J; Lomonte, Bruno

2013-10-01

Bothriechis is considered a monophyletic, basal genus of arboreal Neotropical pitvipers distributed across Middle America. The four species found in Costa Rica (B. lateralis, B. schlegeli, B. nigroviridis, B. supraciliaris) differ in their venom proteomic profiles, suggesting that different Bothriechis taxa have evolved diverse trophic strategies. In this study, we isolated a phospholipase A₂ (PLA₂) from B. lateralis venom, aiming at increasing our knowledge on the structural and functional characteristics of group II acidic PLA₂s, whose toxic actions are generally more restricted than those displayed by basic PLA₂s. The new acidic enzyme, BlatPLA₂, occurs as a monomer of 13,917 Da, in contrast to many basic group II PLA₂s which associate into dimers and often display myotoxicity and/or neurotoxicity. Its amino acid sequence of 122 residues predicts an isoelectric point of 4.7, and displays significant differences with previously characterized acidic PLA₂s, with which it shows a maximum sequence identity of 78%. BlatPLA₂ is catalytically active but appears to be devoid of major toxic activities, lacking intravenous or intracerebroventricular lethality, myotoxicity, in vitro anticoagulant activity, and platelet aggregation or inhibition effects. Phylogenetic relationships with similar group II enzymes suggest that BlatPLA₂ may represent a basal sequence to other acidic PLA₂s. Due to the metabolic cost of venom protein synthesis, the presence of a relatively abundant (9%) but non-toxic component is somewhat puzzling. Nevertheless, we hypothesize that BlatPLA₂ could have a role in the pre-digestion of prey, possibly having retained characteristics of ancestral PLA₂s without evolving towards potent toxicity. Copyright © 2013 Elsevier Ltd. All rights reserved.

Mitochondrial genome sequencing helps show the evolutionary mechanism of mitochondrial genome formation in Brassica

Science.gov (United States)

2011-01-01

Background: Angiosperm mitochondrial genomes are more complex than those of other organisms. Analyses of the mitochondrial genome sequences of at least 11 angiosperm species have shown several common properties; these cannot easily explain, however, how the diverse mitotypes evolved within each genus or species. We analyzed the evolutionary relationships of Brassica mitotypes by sequencing. Results: We sequenced the mitotypes of cam (Brassica rapa), ole (B. oleracea), jun (B. juncea), and car (B. carinata) and analyzed them together with two previously sequenced mitotypes of B. napus (pol and nap). The sizes of whole single circular genomes of cam, jun, ole, and car are 219,747 bp, 219,766 bp, 360,271 bp, and 232,241 bp, respectively. The mitochondrial genome of ole is the largest as a result of the duplication of a 141.8 kb segment. The jun mitotype is the result of an inherited cam mitotype, and pol is also derived from the cam mitotype with evolutionary modifications. Genes with known functions are conserved in all mitotypes, but clear variation in open reading frames (ORFs) with unknown functions among the six mitotypes was observed. Sequence relationship analysis showed that there has been genome compaction and inheritance in the course of Brassica mitotype evolution. Conclusions: We have sequenced four Brassica mitotypes, compared six Brassica mitotypes and suggested a mechanism for mitochondrial genome formation in Brassica, including evolutionary events such as inheritance, duplication, rearrangement, genome compaction, and mutation. PMID:21988783
Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata]

Lifescience Database Archive (English)

Gclust Server. Data name: Amino acid sequences of predicted proteins and their annotation for 95 organism species. DOI: 10.18908/lsdba.nbdc00464-001. Description of data contents: Amino acid sequences of predicted proteins ...
Non-declarative sequence learning does not show savings in relearning.

Science.gov (United States)

Keisler, Aysha; Willingham, Daniel T

2007-04-01

Researchers have utilized the savings in relearning paradigm in a variety of settings since Ebbinghaus developed the tool over a century ago. In spite of its widespread use, we do not yet understand what type(s) of memory are measurable by savings. Specifically, can savings measure both declarative and non-declarative memories? The lack of conscious recollection of the encoded material in some studies indicates that non-declarative memories may show savings effects, but as all studies to date have used declarative tasks, we cannot be certain. Here, we administer a non-declarative task and then measure savings in relearning the material declaratively. Our results show that while material outside of awareness may show savings effects, non-declarative sequence memory does not. These data highlight the important distinction between memory without awareness and non-declarative memory.
Quantum-Sequencing: Fast electronic single DNA molecule sequencing

Science.gov (United States)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective single-molecule sequencing method. Here, we present the first demonstration of a unique "electronic fingerprint" of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic states of the nucleobases shift depending on the pH, with the most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of the beta-lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Differences in acid tolerance between Bifidobacterium breve BB8 and its acid-resistant derivative B. breve BB8dpH, revealed by RNA-sequencing and physiological analysis.

Science.gov (United States)

Yang, Xu; Hang, Xiaomin; Tan, Jing; Yang, Hong

2015-06-01

Bifidobacteria are common inhabitants of the human gastrointestinal tract, and their application has increased dramatically in recent years due to their health-promoting effects. The ability of bifidobacteria to tolerate acidic environments is particularly important for their function as probiotics because they encounter such environments in food products and during passage through the gastrointestinal tract. In this study, we generated a derivative, Bifidobacterium breve BB8dpH, which displayed a stable, acid-resistant phenotype. To investigate the possible reasons for the higher acid tolerance of B. breve BB8dpH, as compared with its parental strain B. breve BB8, a combined transcriptome and physiological approach was used to characterize differences between the two strains. An analysis of the transcriptome by RNA-sequencing indicated that the expression of 121 genes was increased by more than 2-fold, while the expression of 146 genes was reduced more than 2-fold, in B. breve BB8dpH. Validation of the RNA-sequencing data using real-time quantitative PCR analysis demonstrated that the RNA-sequencing results were highly reliable. The comparison analysis, based on differentially expressed genes, suggested that the acid tolerance of B. breve BB8dpH was enhanced by regulating the expression of genes involved in carbohydrate transport and metabolism, energy production, synthesis of cell envelope components (peptidoglycan and exopolysaccharide), synthesis and transport of glutamate and glutamine, and histidine synthesis. Furthermore, an analysis of physiological data showed that B. breve BB8dpH displayed higher production of exopolysaccharide and lower H(+)-ATPase activity than B. breve BB8. The results presented here will improve our understanding of acid tolerance in bifidobacteria, and they will lead to the development of new strategies to enhance the acid tolerance of bifidobacterial strains. Copyright © 2015 Elsevier Ltd. All rights reserved.
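The "more than 2-fold" criterion used above to call genes up- or down-regulated can be expressed as a threshold on the log2 ratio of expression between the derivative and the parental strain. The sketch below is a generic illustration: the gene names, expression values, and the pseudocount of 1 are assumptions, and it omits the statistical testing that a real RNA-sequencing analysis would also apply.

```python
import math


def classify_genes(expression, threshold=2.0, pseudocount=1.0):
    """Split genes into up- and down-regulated lists by fold change.

    expression maps gene name -> (parent_value, derivative_value).
    A pseudocount is added to both values to avoid division by zero.
    """
    cutoff = math.log2(threshold)
    up, down = [], []
    for gene, (parent, derivative) in expression.items():
        log2_fc = math.log2((derivative + pseudocount) / (parent + pseudocount))
        if log2_fc > cutoff:
            up.append(gene)
        elif log2_fc < -cutoff:
            down.append(gene)
    return up, down


# Hypothetical normalized expression values (parent, derivative).
example = {"geneA": (10, 50), "geneB": (40, 5), "geneC": (20, 22)}
up, down = classify_genes(example)
print(up, down)
```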
Cloning and sequence analysis of putative type II fatty acid synthase ...

Indian Academy of Sciences (India)

Prakash

Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L. ... acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, β-ketoacyl-ACP .... Helix II plays a dominant role in the interaction ... main distinguishing features of plant ACPs in plastids and ..... synthase component; J. Biol.
Isolation and complete amino acid sequence of human thymopoietin and splenin

International Nuclear Information System (INIS)

Audhya, T.; Schlesinger, D.H.; Goldstein, G.

1987-01-01

Human thymopoietin and splenin were isolated from human thymus and spleen, respectively, by monitoring tissue fractionation with a bovine thymopoietin RIA cross-reactive with human thymopoietin and splenin. Bovine thymopoietin and splenin are 49-amino acid polypeptides that differ by only 2 amino acids at positions 34 and 43; the change at position 34 in the active-site region changes the receptor specificities and biological activities. The complete amino acid sequences of purified human thymopoietin and splenin were determined and shown to be 48-amino acid polypeptides differing at four positions. Ten amino acids, constant within each species for thymopoietin and splenin, differ between the human and bovine polypeptides. The pentapeptide active site of thymopoietin (residues 32-36) is constant between the human and bovine thymopoietins, but position 34 in the active site of splenin has changed from glutamic acid in bovine splenin to alanine in human splenin, accounting for the biological activity of the human but not the bovine splenin on the human T-cell line MOLT-4.
Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

Science.gov (United States)

Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

2016-10-07

In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. Copyright © 2016 Elsevier Ltd. All rights reserved.
Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66

Directory of Open Access Journals (Sweden)

Bin Liu

2016-06-01

Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp and 11,683 predicted protein-coding sequences. The data have been deposited at DDBJ/EMBL/GenBank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.
Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

OpenAIRE

Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.; Almstrand, Robert

2016-01-01

Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome.
ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

Directory of Open Access Journals (Sweden)

Meiler Arno

2012-09-01

Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

Science.gov (United States)

2012-01-01

Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
Molecular cloning and sequence analysis of complementary DNA encoding rat mammary gland medium-chain S-acyl fatty acid synthetase thio ester hydrolase

International Nuclear Information System (INIS)

Safford, R.; de Silva, J.; Lucas, C.

1987-01-01

Poly(A)+ RNA from pregnant rat mammary glands was size-fractionated by sucrose gradient centrifugation, and fractions enriched in medium-chain S-acyl fatty acid synthetase thio ester hydrolase (MCH) were identified by in vitro translation and immunoprecipitation. A cDNA library was constructed, in pBR322, from enriched poly(A)+ RNA and screened with two oligonucleotide probes deduced from rat MCH amino acid sequence data. Cross-hybridizing clones were isolated and found to contain cDNA inserts ranging from ∼ 1100 to 1550 base pairs (bp). A 1550-bp cDNA insert, from clone 43H09, was confirmed to encode MCH by hybrid-select translation/immunoprecipitation studies and by comparison of the amino acid sequence deduced from the DNA sequence of the clone to the amino acid sequence of the MCH peptides. Northern blot analysis revealed the size of the MCH mRNA to be 1500 nucleotides, and it is therefore concluded that the 1550-bp insert (including G x C tails) of clone 43H09 represents a full- or near-full-length copy of the MCH gene. The rat MCH sequence is the first reported sequence of a thioesterase from a mammalian source, but comparison of the deduced amino acid sequences of MCH and the recently published mallard duck medium-chain S-acyl fatty acid synthetase thioesterase reveals significant homology. In particular, a seven amino acid sequence containing the proposed active serine of the duck thioesterase is found to be perfectly conserved in rat MCH.
Molecular cloning of chicken metallothionein. Deduction of the complete amino acid sequence and analysis of expression using cloned cDNA

Energy Technology Data Exchange (ETDEWEB)

Wei, D; Andrews, G K

1988-01-25

A cDNA library was constructed using RNA isolated from the livers of chickens which had been treated with zinc. This library was screened with an RNA probe complementary to mouse metallothionein-I (MT), and eight chicken MT cDNA clones were obtained. All of the cDNA clones contained nucleotide sequences homologous to regions of the longest (375 bp) cDNA clone. The latter contained an open reading frame of 189 bp, and the deduced amino acid sequence indicates a protein of 63 amino acids of which 20 are cysteine residues. Amino acid composition and partial amino acid sequence analyses of purified chicken MT protein agreed with the amino acid composition and sequence deduced from the cloned cDNA. Amino acid sequence comparisons establish that chicken MT shares extensive homology with mammalian MTs. Southern blot analysis of chicken DNA indicates that the chicken MT gene is not a part of a large family of related sequences, but rather is likely to be a unique gene sequence. In the chicken liver, levels of chicken MT mRNA were rapidly induced by metals (Cd²⁺, Zn²⁺, Cu²⁺), glucocorticoids and lipopolysaccharide. MT mRNA was present in low levels in embryonic liver and increased to high levels during the first week after hatching before decreasing again to the basal levels found in adult liver. The results of this study establish that MT is highly conserved between birds and mammals and is regulated in the chicken by agents which also regulate expression of mammalian MT genes. However, in contrast to the mammals, the results suggest the existence of a single isoform of MT in the chicken.
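The relationship between the 189 bp open reading frame and the 63-residue protein is a direct codon count (three nucleotides per amino acid), and the cysteine content follows from the reported residue numbers. The short sketch below checks that arithmetic; the function names are illustrative, and the assumption that the quoted 189 bp excludes the stop codon is inferred from 189 / 3 = 63 rather than stated in the abstract.

```python
def orf_protein_length(orf_bp, includes_stop=False):
    """Number of amino acids encoded by an ORF of the given length in bp."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def residue_percentage(residue_count, protein_length):
    """Percentage of the protein made up by one residue type."""
    return 100.0 * residue_count / protein_length


length = orf_protein_length(189)
print(length)
print(round(residue_percentage(20, length), 1))
```

A cysteine content near one third is typical of metallothioneins, consistent with the homology to mammalian MTs reported above.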
Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

DEFF Research Database (Denmark)

Cowell, G M; Kønigshøfer, E; Danielsen, E M

1988-01-01

The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250...
Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

Science.gov (United States)

Ansari, A A; Shenbagamurthi, P; Marsh, D G

1989-07-05

The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.
Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

Science.gov (United States)

Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

2007-01-01

The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria responsible for extreme acid sensitivity, especially in Escherichia coli and Shigella flexneri. Here we found molecular evidence supporting the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC-coded acid-sensitive antiporter showed many conserved residue patterns at regular intervals in the N-terminal region. It was observed that as the alignment approaches the C-terminus, the number of conserved residues decreases, indicating that the N-terminal region of this protein has a much more active role than the carboxyl terminus. The motif FHLVFFLLLGG is well conserved at the amino terminus across the gadC-coded proteins. The motif is also partially conserved among other antiporters that are not coded by gadC but are involved in the acid sensitivity/resistance mechanism. Phylogenetic cluster analysis supports the relationship of Escherichia coli and Shigella flexneri. The gadC-coded proteins converge as a clade and diverge from other antiporters belonging to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792
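Checking whether a motif such as FHLVFFLLLGG is fully or partially conserved in a protein can be sketched as a sliding-window comparison that tolerates a limited number of mismatches. This is a generic illustration, not the alignment-based procedure used in the study; the function name and both test sequences are hypothetical.

```python
def find_motif(sequence, motif, max_mismatches=0):
    """Return (start_index, mismatch_count) for each window matching the motif.

    Positions are 0-based. A window is reported when it differs from the
    motif at no more than max_mismatches positions.
    """
    hits = []
    width = len(motif)
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        mismatches = sum(1 for a, b in zip(window, motif) if a != b)
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


motif = "FHLVFFLLLGG"
# Hypothetical sequences: one exact copy, one with a single substitution.
print(find_motif("MKKFHLVFFLLLGGAA", motif))
print(find_motif("MKKFHLVYFLLLGGAA", motif, max_mismatches=2))
```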
Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

International Nuclear Information System (INIS)

Feild, M.J.; Armstrong, F.B.

1987-01-01

E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and [³H]-NaBH₄-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealed limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.
Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

Science.gov (United States)

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Science.gov (United States)

2010-07-01

... may not include material other than part of the sequence listing. A fixed-width font should be used... integer expressing the number of bases or amino acid residues M. Type Whether presented sequence molecule is DNA, RNA, or PRT (protein). If a nucleotide sequence contains both DNA and RNA fragments, the type...

Random amino acid mutations and protein misfolding lead to Shannon limit in sequence-structure communication.

Directory of Open Access Journals (Sweden)

Andreas Martin Lisewski

2008-09-01

The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close to native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions) and in structure (structural defects) trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a) sensitive to random errors and (b) restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials.
Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

Science.gov (United States)

Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

1999-04-16

A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Detection and quantification of Plasmodium falciparum in blood samples using quantitative nucleic acid sequence-based amplification

NARCIS (Netherlands)

Schoone, G. J.; Oskam, L.; Kroon, N. C.; Schallig, H. D.; Omar, S. A.

2000-01-01

A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the
Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

Directory of Open Access Journals (Sweden)

Ruan Jishou

2007-04-01

characterized by accuracies below 70%. Finally, the Naïve Bayes method is shown to provide the highest sensitivity for the prediction of flexible regions, while FlexRP and SVM give the highest sensitivity for rigid regions. Conclusion: A new sequence representation that uses k-spaced amino acid pairs is shown to be the most efficient in the prediction of the flexible/rigid regions of protein sequences. The proposed FlexRP method provides the highest prediction accuracy of about 80%. The experimental tests show that the FlexRP and SVM methods achieved high overall accuracy and the highest sensitivity for rigid regions, while the best quality of the predictions for flexible regions is achieved by the Naïve Bayes method.
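A k-spaced amino acid pair representation can be sketched as a simple counting step. The sketch below assumes that a "k-spaced pair" means two residues separated by exactly k intervening residues (so k = 0 gives adjacent dipeptides); the abstract does not spell out the definition, and the function name `k_spaced_pair_counts` and the toy sequence are hypothetical. It is not the FlexRP implementation.

```python
from collections import Counter


def k_spaced_pair_counts(sequence, k):
    """Count residue pairs separated by exactly k intervening residues.

    Assumption: k = 0 corresponds to adjacent residues (dipeptides).
    """
    counts = Counter()
    for i in range(len(sequence) - k - 1):
        pair = sequence[i] + sequence[i + k + 1]
        counts[pair] += 1
    return counts


# Hypothetical toy sequence, used only to show the counting.
toy = "ACDAC"
print(k_spaced_pair_counts(toy, 0))  # adjacent pairs
print(k_spaced_pair_counts(toy, 1))  # pairs with one residue in between
```

Counts for several values of k would typically be concatenated into one feature vector before being passed to a classifier.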
The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

Science.gov (United States)

Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

2008-10-01

Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are coagulase-negative staphylococci that were long considered unable to cause infections. This situation changed recently, and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes the isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of the respective hemolysins from S. cohnii ssp. cohnii (named H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to a putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.
Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

Science.gov (United States)

Reiz, Bela; Li, Liang

2010-09-01

Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.
Nonlinear analysis of sequence symmetry of beta-trefoil family proteins

Energy Technology Data Exchange (ETDEWEB)

Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xu Ruizhen [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: yxiao@mail.hust.edu.cn

2005-07-01

The tertiary structures of proteins of the beta-trefoil family have three-fold quasi-symmetry, while their amino acid sequences appear almost random. In the present paper we show that these amino acid sequences in fact have hidden symmetries and, furthermore, that the degrees of these hidden symmetries are the same as those of their tertiary structures. We present a modified recurrence plot to reveal hidden symmetries in protein sequences. Our results can explain the contradiction in sequence-structure relations of proteins of the beta-trefoil family.
Application of Ammonium Persulfate for Selective Oxidation of Guanines for Nucleic Acid Sequencing

Directory of Open Access Journals (Sweden)

Yafen Wang

2017-07-01

Nucleic acids can be sequenced by a chemical procedure that partially damages the nucleotide positions at their base repetition. Many methods have been reported for the selective recognition of guanine. The accurate identification of guanine in both single and double regions of DNA and RNA remains a challenging task. Herein, we present a new, non-toxic and simple method for the selective recognition of guanine in both DNA and RNA sequences via ammonium persulfate modification. This strategy can be further successfully applied to the detection of 5-methylcytosine by using PCR.
5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

Science.gov (United States)

Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

1989-01-01

The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.
The amino acid sequence of cytochrome c from Cucurbita maxima L. (pumpkin)

Science.gov (United States)

Thompson, E. W.; Richardson, M.; Boulter, D.

1971-01-01

The amino acid sequence of pumpkin cytochrome c was determined on 2μmol of protein. Some evidence was found for the occurrence of two forms of cytochrome c, whose sequences differed in three positions. Pumpkin cytochrome c consists of 111 residues and is homologous with mitochondrial cytochromes c from other plants. Experimental details are given in a supplementary paper that has been deposited as Supplementary Publication SUP 50005 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1971), 121, 7. PMID:5131733
Isolation and sequence of complementary DNA encoding human extracellular superoxide dismutase

International Nuclear Information System (INIS)

Hjalmarsson, K.; Marklund, S.L.; Engstroem, A.; Edlund, T.

1987-01-01

A complementary DNA (cDNA) clone from a human placenta cDNA library encoding extracellular superoxide dismutase has been isolated and the nucleotide sequence determined. The cDNA has a very high G + C content. EC-SOD is synthesized with a putative 18-amino acid signal peptide, preceding the 222 amino acids in the mature enzyme, indicating that the enzyme is a secretory protein. The first 95 amino acids of the mature enzyme show no sequence homology with other sequenced proteins and there is one possible N-glycosylation site (Asn-89). The amino acid sequence from residues 96-193 shows strong homology (∼ 50%) with the final two-thirds of the sequences of all known eukaryotic CuZn SODs, whereas the homology with the P. leiognathi CuZn SOD is clearly lower. The ligands to Cu and Zn, the cysteines forming the intrasubunit disulfide bridge in the CuZn SODs, and the arginine found in all CuZn SODs in the entrance to the active site can all be identified in EC-SOD. A comparison with bovine CuZn SOD, the three-dimensional structure of which is known, reveals that the homologies occur in the active site and the divergencies are in the part constituting the subunit contact area in CuZn SOD. Amino acid sequence 194-222 in the carboxyl-terminal end of EC-SOD is strongly hydrophilic and contains nine amino acids with a positive charge. This sequence probably confers the affinity of EC-SOD for heparin and heparan sulfate. An analysis of the amino acid sequence homologies with CuZn SODs from various species indicates that the EC-SODs may have evolved from the CuZn SODs before the evolution of fungi and plants.
Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

KAUST Repository

Sugumar, Thennarasu

2011-12-12

Full-length cDNA (435 bp) of the interleukin-3 (IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine IL-3. There are 10 amino acid substitutions in buffalo IL-3 compared with bovine IL-3. The amino acid sequence of buffalo IL-3 also showed very high identity with that of other ruminants, indicating functional cross-reactivity. Structural homology modelling of buffalo IL-3 protein with human IL-3 showed the presence of five helical structures.
Effect of amino acid sequence and pH on nanofiber formation of self-assembling peptides EAK16-II and EAK16-IV.

Science.gov (United States)

Hong, Yooseong; Legge, Raymond L; Zhang, S; Chen, P

2003-01-01

Atomic force microscopy (AFM) and axisymmetric drop shape analysis-profile (ADSA-P) were used to investigate the mechanism of self-assembly of peptides. The peptides chosen consisted of 16 alternating hydrophobic and hydrophilic amino acids, where the hydrophilic residues possess alternating negative and positive charges. Two types of peptides, AEAEAKAKAEAEAKAK (EAK16-II) and AEAEAEAEAKAKAKAK (EAK16-IV), were investigated in terms of nanostructure formation through self-assembly. The experimental results, which focused on the effects of the amino acid sequence and pH, show that the nanostructures formed by the peptides are dependent on the amino acid sequence and the pH of the solution. For pH conditions around neutrality, one of the peptides used in this study, EAK16-IV, forms globular assemblies and has lower surface tension at air-water interfaces than the other peptide, EAK16-II, which forms fibrillar assemblies at the same pH. When the pH is lowered below 6.5 or raised above 7.5, there is a transition from globular to fibrillar structures for EAK16-IV, but EAK16-II does not show any structural transition. Surface tension measurements using ADSA-P showed different surface activities of the peptides at air-water interfaces. EAK16-II does not show a significant difference in surface tension for the pH range between 4 and 9. However, EAK16-IV shows a noticeable decrease in surface tension at pH around neutrality, indicating that the formation of globular assemblies is related to the molecular hydrophobicity.
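The difference between the two peptides is easiest to see as a charge pattern along the hydrophilic positions. The sketch below derives that pattern from the two sequences quoted in the abstract, assuming the usual side-chain charges near neutral pH (E and D negative, K and R positive); the helper names `charge_pattern` and `block_lengths` are hypothetical.

```python
from itertools import groupby

# Assumed side-chain charges near neutral pH.
CHARGE = {"E": "-", "D": "-", "K": "+", "R": "+"}


def charge_pattern(peptide):
    """Return the sequence of charges carried by the charged residues."""
    return "".join(CHARGE[r] for r in peptide if r in CHARGE)


def block_lengths(pattern):
    """Return the lengths of consecutive runs of identical charge."""
    return [len(list(run)) for _, run in groupby(pattern)]


eak16_ii = "AEAEAKAKAEAEAKAK"  # EAK16-II, as given in the abstract
eak16_iv = "AEAEAEAEAKAKAKAK"  # EAK16-IV, as given in the abstract

for name, peptide in (("EAK16-II", eak16_ii), ("EAK16-IV", eak16_iv)):
    pattern = charge_pattern(peptide)
    print(name, pattern, block_lengths(pattern))
```

Both peptides have the same composition; only the grouping of like charges differs (runs of two versus runs of four), which is the sequence effect the study examines.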
Nucleic Acid Amplification Testing and Sequencing Combined with Acid-Fast Staining in Needle Biopsy Lung Tissues for the Diagnosis of Smear-Negative Pulmonary Tuberculosis.

Directory of Open Access Journals (Sweden)

Faming Jiang

Smear-negative pulmonary tuberculosis (PTB) is common and difficult to diagnose. In this study, we investigated the diagnostic value of nucleic acid amplification testing and sequencing combined with acid-fast bacteria (AFB) staining of needle biopsy lung tissues for patients with suspected smear-negative PTB. Patients with suspected smear-negative PTB who underwent percutaneous transthoracic needle biopsy between May 1, 2012, and June 30, 2015, were enrolled in this retrospective study. Patients with AFB in sputum smears were excluded. All lung biopsy specimens were fixed in formalin, embedded in paraffin, and subjected to acid-fast staining and tuberculous polymerase chain reaction (TB-PCR). For patients with positive AFB and negative TB-PCR results in lung tissues, probe assays and 16S rRNA sequencing were used for identification of nontuberculous mycobacteria (NTM). The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy of PCR and AFB staining were calculated separately and in combination. Among the 220 eligible patients, 133 were diagnosed with TB (men/women: 76/57; age range: 17-80 years; confirmed TB: 9; probable TB: 124). Forty-eight patients who were diagnosed with other specific diseases were assigned as negative controls, and 39 patients with indeterminate final diagnosis were excluded from statistical analysis. The sensitivity, specificity, PPV, NPV, and accuracy of histological AFB (HAFB) for the diagnosis of smear-negative PTB were 61.7% (82/133), 100% (48/48), 100% (82/82), 48.5% (48/99), and 71.8% (130/181), respectively. The sensitivity, specificity, PPV, and NPV of histological PCR were 89.5% (119/133), 95.8% (46/48), 98.3% (119/121), and 76.7% (46/60), respectively, demonstrating that histological PCR had significantly higher accuracy (91.2% [165/181]) than histological acid-fast staining (71.8% [130/181]), P < 0.001. Parallel testing of histological AFB staining and PCR showed the
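The percentages reported for histological AFB staining and histological PCR follow from the standard definitions applied to a 2x2 table of 133 TB patients and 48 negative controls. The sketch below recomputes them; the cell counts are inferred from the fractions in the abstract (for example, 82 of 133 TB patients positive by AFB implies 51 false negatives), and the function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return standard diagnostic test metrics from 2x2 table counts."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Counts inferred from the fractions quoted in the abstract.
hafb = diagnostic_metrics(tp=82, fn=51, tn=48, fp=0)   # histological AFB
hpcr = diagnostic_metrics(tp=119, fn=14, tn=46, fp=2)  # histological PCR

for name, metrics in (("AFB", hafb), ("PCR", hpcr)):
    summary = ", ".join(f"{k}={100 * v:.1f}%" for k, v in metrics.items())
    print(name, summary)
```

Note that the negative predictive value for AFB staining is true negatives over all negative results, 48/(48 + 51) = 48/99, which reproduces the reported 48.5%.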
A Novel Phytase with Sequence Similarity to Purple Acid Phosphatases Is Expressed in Cotyledons of Germinating Soybean Seedlings

Science.gov (United States)

Hegeman, Carla E.; Grabau, Elizabeth A.

2001-01-01

Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases. PMID:11500558
Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme.

Directory of Open Access Journals (Sweden)

Hashem Koohy

DNase I is an enzyme which cuts duplex DNA at a rate that depends strongly upon its chromatin environment. In combination with high-throughput sequencing (HTS) technology, it can be used to infer genome-wide landscapes of open chromatin regions. Using this technology, systematic identification of hundreds of thousands of DNase I hypersensitive sites (DHS) per cell type has been possible, and this in turn has helped to precisely delineate genomic regulatory compartments. However, to date there has been relatively little investigation into possible biases affecting this data. We report a significant degree of sequence preference spanning sites cut by DNase I in a number of published data sets. The two major protocols in current use each show a different pattern, but for a given protocol the pattern of sequence specificity seems to be quite consistent. The patterns are substantially different from biases seen in other types of HTS data sets, and in some cases the most constrained position lies outside the sequenced fragment, implying that this constraint must relate to the digestion process rather than events occurring during library preparation or sequencing. DNase I is a sequence-specific enzyme, with a specificity that may depend on experimental conditions. This sequence specificity is not taken into account by existing pipelines for identifying open chromatin regions. Care must be taken when interpreting DNase I results, especially when looking at the precise locations of the reads. Future studies may be able to improve the sensitivity and precision of chromatin state measurement by compensating for sequence bias.
Evolution of sequence-defined highly functionalized nucleic acid polymers

Science.gov (United States)

Chen, Zhen; Lichtor, Phillip A.; Berliner, Adrian P.; Chen, Jonathan C.; Liu, David R.

2018-03-01

The evolution of sequence-defined synthetic polymers made of building blocks beyond those compatible with polymerase enzymes or the ribosome has the potential to generate new classes of receptors, catalysts and materials. Here we describe a ligase-mediated DNA-templated polymerization and in vitro selection system to evolve highly functionalized nucleic acid polymers (HFNAPs) made from 32 building blocks that contain eight chemically diverse side chains on a DNA backbone. Through iterated cycles of polymer translation, selection and reverse translation, we discovered HFNAPs that bind proprotein convertase subtilisin/kexin type 9 (PCSK9) and interleukin-6, two protein targets implicated in human diseases. Mutation and reselection of an active PCSK9-binding polymer yielded evolved polymers with high affinity (KD = 3 nM). This evolved polymer potently inhibited the binding between PCSK9 and the low-density lipoprotein receptor. Structure-activity relationship studies revealed that specific side chains at defined positions in the polymers are required for binding to their respective targets. Our findings expand the chemical space of evolvable polymers to include densely functionalized nucleic acids with diverse, researcher-defined chemical repertoires.
An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

Science.gov (United States)

Li, Yushuang; Song, Tian; Yang, Jiasheng; Zhang, Yi; Yang, Jialiang

2016-01-01

In this paper, we propose a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440-dimensional feature vector consisting of a 400-dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20-dimensional content ratio vector, and a 20-dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method to the ND5 dataset consisting of the ND5 protein sequences of 9 species, and to the F10 and G11 datasets representing two of the xylanase-containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW on the ND5 dataset, much higher than those of 5 other popular alignment-free methods. In addition, we successfully separate the xylanase sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acid content ratio vector.
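The 400 + 20 + 20 layout of the feature vector can be sketched as follows. The abstract does not give the exact formulas, so every definition here is an assumption made for illustration: the transition block is taken as the frequency of residue b directly following residue a (normalized per a), the content ratio as count divided by length, and the position ratio as the mean 1-based position of a residue divided by sequence length. The names `encode` and `euclidean` are hypothetical, and this is not the authors' implementation.

```python
import math
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 ordered pairs


def encode(sequence):
    """Encode a protein sequence as a 440-dimensional vector (assumed definitions)."""
    seq = "".join(r for r in sequence.upper() if r in AMINO_ACIDS)
    if not seq:
        raise ValueError("sequence contains no standard amino acids")
    n = len(seq)

    # 400 dims: assumed first-order transition frequencies P(b follows a).
    pair_counts = dict.fromkeys(PAIRS, 0)
    from_counts = dict.fromkeys(AMINO_ACIDS, 0)
    for a, b in zip(seq, seq[1:]):
        pair_counts[a + b] += 1
        from_counts[a] += 1
    transition = [
        pair_counts[p] / from_counts[p[0]] if from_counts[p[0]] else 0.0
        for p in PAIRS
    ]

    # 20 dims: content ratio, count of each residue divided by length.
    content = [seq.count(a) / n for a in AMINO_ACIDS]

    # 20 dims: assumed position ratio, mean 1-based position divided by length.
    position = []
    for a in AMINO_ACIDS:
        idx = [i + 1 for i, r in enumerate(seq) if r == a]
        position.append(sum(idx) / (len(idx) * n) if idx else 0.0)

    return transition + content + position


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


# Hypothetical toy inputs, chosen only to exercise the functions.
v1 = encode("AEAEAKAKAEAEAKAK")
v2 = encode("AEAEAEAEAKAKAKAK")
print(len(v1), euclidean(v1, v2))
```

Because the vectors have a fixed length regardless of sequence length, any pair of proteins can be compared without computing an alignment.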
Cloning and sequence analysis of benzo-a-pyreneinducible ...

African Journals Online (AJOL)

The phylogenetic tree based on the amino acid sequences clearly shows tilapia CYP1A and killifish CYP1A to be more closely related to each other than to the other CYP1A subfamilies. Sequence analysis of 3727 bp of genomic DNA showed that the clone obtained was the structural gene of CYP1A which consists of ...
Sequence-dependent DNA deformability studied using molecular dynamics simulations.

Science.gov (United States)

Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

2007-01-01

Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
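The figure of 136 possible tetrameric sequences follows from treating a double-stranded sequence and its reverse complement as the same duplex: of the 4^4 = 256 tetramers, 16 are their own reverse complement, giving (256 + 16) / 2 = 136 distinct duplexes. The short sketch below checks this by enumeration; the function names are hypothetical.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_duplex_kmers(k):
    """Return one representative per k-mer duplex (k-mer or its reverse complement)."""
    kmers = ("".join(p) for p in product("ACGT", repeat=k))
    return {min(s, reverse_complement(s)) for s in kmers}


print(len(unique_duplex_kmers(4)))  # unique tetramers
print(len(unique_duplex_kmers(2)))  # unique dimeric steps
```

The same count for k = 2 gives the 10 unique dimeric steps usually considered in analyses of base-pair step deformability.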

Tellurium-123m-labeled isosteres of palmitoleic and oleic acids show high myocardial uptake

International Nuclear Information System (INIS)

Knapp, F.F. Jr.; Ambrose, K.R.; Callahan, A.P.; Grigsby, R.A.; Irgolic, K.J.

1979-01-01

These studies were directed at determining if the telluro fatty acids prepared by the isosteric replacement of the Δ9 double bonds of oleic and palmitoleic acids with ¹²³ᵐTe would show heart uptake in rats. The isostere of palmitoleic acid, 9-tellurapentadecanoic acid (II), was prepared by basic hydrolysis of the product formed by the coupling of ¹²³ᵐTe-sodium hexyl tellurol with methyl-8-bromooctadecanoate. Similarly, the isostere of oleic acid, 9-telluraheptadecanoic acid (IV), was prepared by the same route beginning with the reaction of ¹²³ᵐTe-sodium octyl tellurol with methyl-8-bromooctadecanoate. Both ¹²³ᵐTe-(II) and ¹²³ᵐTe-(IV) showed remarkably high heart uptake in rats (2 to 3% dose/gm) ten minutes after intravenous administration, and the heart/blood ratios were high (20-30/1). Finally, the hearts of rats injected with ¹²³ᵐTe-(IV) have been clearly imaged with a rectilinear scanner.
Amino acid sequence of bovine muzzle epithelial desmocollin derived from cloned cDNA: a novel subtype of desmosomal cadherins.

Science.gov (United States)

Koch, P J; Goldschmidt, M D; Walsh, M J; Zimbelmann, R; Schmelz, M; Franke, W W

1991-05-01

Desmosomes are cell-type-specific intercellular junctions found in epithelium, myocardium and certain other tissues. They consist of assemblies of molecules involved in the adhesion of specific cell types and in the anchorage of cell-type-specific cytoskeletal elements, the intermediate-size filaments, to the plasma membrane. To explore the individual desmosomal components and their functions we have isolated DNA clones encoding the desmosomal glycoprotein, desmocollin, using antibodies and a cDNA expression library from bovine muzzle epithelium. The cDNA-deduced amino-acid sequence of desmocollin (presently we cannot decide to which of the two desmocollins, DC I or DC II, this clone relates) defines a polypeptide with a calculated molecular weight of 85,000, with a single candidate sequence of 24 amino acids sufficiently long for a transmembrane arrangement, and an extracellular aminoterminal portion of 561 amino acid residues, compared to a cytoplasmic part of only 176 amino acids. Amino acid sequence comparisons have revealed that desmocollin is highly homologous to members of the cadherin family of cell adhesion molecules, including the previously sequenced desmoglein, another desmosome-specific cadherin. Using riboprobes derived from cDNAs for Northern-blot analyses, we have identified an mRNA of approximately 6 kb in stratified epithelia such as muzzle epithelium and tongue mucosa but not in two epithelial cell culture lines containing desmosomes and desmoplakins. The difference may indicate drastic differences in mRNA concentration or the existence of cell-type-specific desmocollin subforms. The molecular topology of desmocollin(s) is discussed in relation to possible functions of the individual molecular domains.
RNA Sequencing and Coexpression Analysis Reveal Key Genes Involved in α-Linolenic Acid Biosynthesis in Perilla frutescens Seed

Directory of Open Access Journals (Sweden)

Tianyuan Zhang

2017-11-01

Perilla frutescens is used as a traditional food and medicine in East Asia. Its seeds contain high levels of α-linolenic acid (ALA), which is important for health but is scarce in our daily meals. Previous reports on RNA-seq of perilla seed had identified fatty acid (FA) and triacylglycerol (TAG) synthesis genes, but the underlying mechanism of ALA biosynthesis and its regulation still need to be further explored. We therefore conducted Illumina RNA sequencing at seven temporal developmental stages of perilla seeds. Sequencing generated a total of 127 million clean reads, containing 15.88 Gb of valid data. The de novo assembly of sequence reads yielded 64,156 unigenes with an average length of 777 bp. A total of 39,760 unigenes were annotated and 11,693 unigenes were found to be differentially expressed in all samples. According to Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis, 486 unigenes were annotated in the “lipid metabolism” pathway. Of these, 150 unigenes were found to be involved in fatty acid (FA) biosynthesis and triacylglycerol (TAG) assembly in perilla seeds. A coexpression analysis showed that a total of 104 genes were highly coexpressed (r > 0.95). The coexpression network could be divided into two main subnetworks showing overexpression in the medium or earlier and late phases, respectively. To identify the putative regulatory genes, a transcription factor (TF) analysis was performed. This led to the identification of 45 gene families, mainly including the AP2-EREBP, bHLH, MYB, and NAC families. After coexpression analysis of TFs with the highly expressed FAD2 and FAD3 genes, 162 TFs were found to be significantly associated with the two FAD genes (r > 0.95). Those TFs were predicted to be the key regulatory factors in ALA biosynthesis in perilla seed. The qRT-PCR analysis also verified the relevance of expression pattern between the two FAD genes and some of the candidate TFs. Although it has been reported that some TFs
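The coexpression criterion quoted in this abstract (r > 0.95) can be illustrated with a minimal Python sketch. It assumes that r is a Pearson correlation computed over per-stage expression values, which the abstract does not state explicitly; the gene names and the seven-stage profiles below are hypothetical and are not data from the study.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def coexpressed_pairs(profiles, threshold=0.95):
    """Return gene pairs (sorted by name) whose correlation exceeds the threshold."""
    pairs = []
    for (g1, p1), (g2, p2) in combinations(sorted(profiles.items()), 2):
        if pearson(p1, p2) > threshold:
            pairs.append((g1, g2))
    return pairs


# Hypothetical expression values across seven developmental stages.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0],
    "geneB": [2.1, 4.0, 6.2, 8.1, 9.9, 12.0, 14.1],
    "geneC": [7.0, 1.0, 6.0, 2.0, 5.0, 3.0, 4.0],
}
print(coexpressed_pairs(profiles))
```

In this toy input only geneA and geneB rise together across the stages, so they form the single reported pair; a real analysis would also need normalization and multiple-testing control, which this sketch omits.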
Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

DEFF Research Database (Denmark)

Radova, A.; Sebela, M.; Galuszka, P.

2001-01-01

This paper reports the first purification method developed for the isolation of a homogeneous polyamine oxidase (PAO) from etiolated barley seedlings. The crude enzyme preparation was obtained after initial precipitation of the extract with protamine sulphate and ammonium sulphate. The enzyme...... was further confirmed by measuring the fluorescence spectra. Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars; its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...... PAO shows a high degree of similarity to that of maize PAO and to several other flavoprotein oxidases. The polyamines spermine and spermidine were the only two substrates of the enzyme, with Km values of 4 x 10^-5 and 3 x 10^-5 M and pH optima of 5.0 and 6.0, respectively. Barley polyamine oxidase...
Prestin shows divergent evolution between constant frequency echolocating bats.

Science.gov (United States)

Shen, Bin; Avila-Flores, Rafael; Liu, Yang; Rossiter, Stephen J; Zhang, Shuyi

2011-10-01

The gene Prestin encodes a motor protein that is thought to confer the high-frequency sensitivity and selectivity that characterizes the mammalian auditory system. Recent research shows that the Prestin gene has undergone a burst of positive selection on the ancestral branch of the Old World horseshoe and leaf-nosed bats (Rhinolophidae and Hipposideridae, respectively), and also on the branch leading to echolocating cetaceans. Moreover, these two groups share a large number of convergent amino acid sequence replacements. Horseshoe and leaf-nosed bats exhibit narrowband echolocation, in which the emitted calls are based on the second harmonic of a predominantly constant frequency (CF) component, the frequency of which is also over-represented in the cochlea. This highly specialized form of echolocation has also evolved independently in the neotropical Parnell's mustached bat (Pteronotus parnellii). To test whether the convergent evolution of CF echolocation between lineages has arisen from common changes in the Prestin gene, we sequenced the Prestin coding region (~2,212 bp, >99% coverage) in P. parnellii and several related species that use broadband echolocation calls. Our reconstructed Prestin gene tree and amino acid tree showed that P. parnellii did not group together with Old World horseshoe and leaf-nosed bats, but rather clustered within its true sister species. Comparisons of sequences confirmed that P. parnellii shared most amino acid changes with its congeners, and we found no evidence of positive selection in the branch leading to the genus of Pteronotus. Our result suggests that the adaptive changes seen in Prestin in horseshoe and leaf-nosed bats are not necessary for CF echolocation in P. parnellii.
Polyvinyl-alcohol-based magnetic beads for rapid and efficient separation of specific or unspecific nucleic acid sequences

International Nuclear Information System (INIS)

Oster, J.; Parker, Jeffrey; Brassard, Lothar

2001-01-01

The versatile application of polyvinyl-alcohol-based magnetic M-PVA beads is demonstrated in the separation of genomic DNA, sequence specific nucleic acid purification, and binding of bacteria for subsequent DNA extraction and detection. It is shown that nucleic acids can be obtained in high yield and purity using M-PVA beads, making sample preparation efficient, fast and highly adaptable for automation processes
BnNHL18A shows a localization change by stress-inducing chemical treatments

International Nuclear Information System (INIS)

Lee, Suk-Bae; Ham, Byung-Kook; Park, Jeong Mee; Kim, Young Jin; Paek, Kyung-Hee

2006-01-01

Two genes, named BnNHL18A and BnNHL18B, showing sequence homology with Arabidopsis NDR1/HIN1-like (NHL) genes, were isolated from a cDNA library prepared from oilseed rape (Brassica napus) seedlings treated with NaCl. The transcript level of BnNHL18A was increased by sodium chloride, ethephon, hydrogen peroxide, methyl jasmonate, or salicylic acid treatment. The coding regions of BnNHL18A and BnNHL18B contain a sarcolipin (SLN)-like sequence. Analysis of the localization of smGFP fusion proteins showed that BnNHL18A is mainly localized to the endoplasmic reticulum (ER). This result suggests that the SLN-like sequence plays a role in retaining proteins in the ER membrane in plants. In response to NaCl, hydrogen peroxide, ethephon, and salicylic acid treatments, the localization of the BnNHL18A protein changed. Our findings suggest a common function of BnNHL18A in biotic and abiotic stresses, and demonstrate the presence of a shared mechanism of protein translocalization between the responses to plant pathogens and to osmotic stress.
Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

OpenAIRE

Mohn, W W

1995-01-01

Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per...
3D representations of amino acids—applications to protein sequence comparison and classification

Directory of Open Access Journals (Sweden)

Jie Li

2014-08-01

The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the representation of such a protein sequence facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D) space derived from amino acid substitution matrices provides an adequate representation that can be used for direct comparison of protein sequences based on geometry. We measure the performance of such a representation in the context of the protein structural fold prediction problem. We compare the results of classifying different sets of proteins belonging to distinct structural folds against classifications of the same proteins obtained from sequence alone or directly from structural information. We find that sequence alone performs poorly as a structure classifier. We show in contrast that the use of the three-dimensional representation of the sequences significantly improves the classification accuracy. We conclude with a discussion of the current limitations of such a representation and with a description of potential improvements.
The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

Science.gov (United States)

Mir, Rafia; Jallu, Shais; Singh, T P

2015-06-01

Aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through the shikimate pathway, except for mammals, which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps, and chorismate serves as a precursor for the synthesis of a variety of aromatic compounds. These enzymes have been shown to play a vital role in the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full-length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute to designing a specific drug and/or developing broad-spectrum compounds with efficacy against a variety of pathogens.
The use of orthologous sequences to predict the impact of amino acid substitutions on protein function.

Directory of Open Access Journals (Sweden)

Nicholas J Marini

2010-05-01

Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that "resurrects" the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an "ancestral site preservation" measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.
Computational analysis of sequence selection mechanisms.

Science.gov (United States)

Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

2004-04-01

Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
Amino-acid sequences of trypsin inhibitors from watermelon (Citrullus vulgaris) and red bryony (Bryonia dioica) seeds.

Science.gov (United States)

Otlewski, J; Whatley, H; Polanowski, A; Wilusz, T

1987-11-01

The amino-acid sequences of two trypsin inhibitors isolated from red bryony (Bryonia dioica) and watermelon (Citrullus vulgaris) seeds are reported. Both species represent different genera of the Cucurbitaceae family, which have not been previously investigated as a source of proteinase inhibitors. The sequences are unique but are very similar to those of other proteinase inhibitors which have been isolated from squash seeds. Based on structural homology we assume that the Arg5-Ile6 peptide bond represents the reactive site bond of both inhibitors.
Fast computational methods for predicting protein structure from primary amino acid sequence

Science.gov (United States)

Agarwal, Pratul Kumar [Knoxville, TN]

2011-07-19

The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
Sequence dependent aggregation of peptides and fibril formation

Science.gov (United States)

Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

2017-09-01

Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
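Two of the sequence descriptors mentioned in this abstract, the hydrophobic fraction and the presence of an HPH pattern, can be computed directly from an HP string. The following Python sketch assumes sequences are written as strings over the letters H and P; the three example sequences are hypothetical and are not taken from the study, and the sketch does not reproduce the Monte Carlo simulations themselves.

```python
def hydrophobic_fraction(hp_sequence):
    """Fraction of hydrophobic (H) residues in an HP sequence."""
    return hp_sequence.count("H") / len(hp_sequence)


def contains_hph(hp_sequence):
    """True if the alternating HPH pattern occurs anywhere in the sequence."""
    return "HPH" in hp_sequence


# Hypothetical HP sequences: the first two share the same hydrophobic
# fraction but differ in whether they contain the HPH pattern.
for seq in ["HPHPHPHP", "HHHHPPPP", "PPHPHPPP"]:
    print(seq, hydrophobic_fraction(seq), contains_hph(seq))
```

The first two toy sequences illustrate the point made in the abstract that sequences of the same hydrophobic fraction need not share the same pattern, and so need not behave alike.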
Comparative analysis of the prion protein gene sequences in African lion.

Science.gov (United States)

Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

2006-10-01

The prion protein gene of the African lion (Panthera leo) was first cloned and polymorphisms were screened. The results suggest that the prion protein gene of eight African lions is highly homogeneous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed the highest homology to Felis catus (GenBank accession AF003087; 96.7%) and to sheep (GenBank accession M31313.1; 96.2%). With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross-species transmission of prion disease.
The cDNA sequence of a neutral horseradish peroxidase.

Science.gov (United States)

Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

1991-02-16

A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail, and the deduced protein contains 327 amino acids, including a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines, of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. This is three fewer glycosylation sites than in the earlier described HRP-C. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight several thousand lower than that of the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group, and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.
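The Asn-X-Thr/Ser pattern named in this abstract can be searched for in a one-letter protein sequence with a short regular expression. This Python sketch is only an illustration: the function name, the `exclude_proline` option (a commonly used refinement that the abstract does not mention), and the toy sequence are assumptions, and the sequence is not the horseradish peroxidase sequence.

```python
import re


def find_sequons(protein, exclude_proline=False):
    """Return 1-based positions of Asn in Asn-X-Thr/Ser motifs.

    A lookahead is used so that overlapping motifs are all reported.
    """
    middle = "[^P]" if exclude_proline else "."
    pattern = re.compile(rf"(?=N{middle}[ST])")
    return [match.start() + 1 for match in pattern.finditer(protein)]


# Hypothetical toy sequence, for illustration only.
toy = "MANGTKLNPSAANNSS"
print(find_sequons(toy))
print(find_sequons(toy, exclude_proline=True))
```

Such a scan only identifies potential sites, in the same sense as the abstract; whether a site is actually glycosylated cannot be decided from the sequence pattern alone.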
Complete Genome Sequence of the Gamma-Aminobutyric Acid-Producing Strain Streptococcus thermophilus APC151.

Science.gov (United States)

Linares, Daniel M; Arboleya, Silvia; Ross, R Paul; Stanton, Catherine

2017-04-27

Here is presented the whole-genome sequence of Streptococcus thermophilus APC151, isolated from a marine fish. This bacterium produces gamma-aminobutyric acid (GABA) in high yields and is biotechnologically suitable to produce naturally GABA-enriched biofunctional yogurt. Its complete genome comprises 2,097 genes and 1,839,134 nucleotides, with an average G+C content of 39.1%. Copyright © 2017 Linares et al.
Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.

Science.gov (United States)

Hayat, Maqsood; Khan, Asifullah

2011-02-21

Membrane proteins are a vital type of protein that serve as channels, receptors, and energy transducers in a cell. Prediction of membrane protein types is an important research area in bioinformatics. Knowledge of membrane protein types provides valuable information for predicting novel examples of the membrane protein types. However, classification of membrane protein types can be both time consuming and susceptible to errors due to the inherent similarity of membrane protein types. In this paper, a neural-network-based membrane protein type prediction system is proposed. Composite protein sequence representation (CPSR) is used to extract the features of a protein sequence, which include seven feature sets: amino acid composition, sequence length, 2-gram exchange group frequency, hydrophobic group, electronic group, sum of hydrophobicity, and R-group. Principal component analysis is then employed to reduce the dimensionality of the feature vector. The probabilistic neural network (PNN), generalized regression neural network, and support vector machine (SVM) are used as classifiers. A high success rate of 86.01% is obtained using SVM for the jackknife test. In the case of the independent dataset test, PNN yields the highest accuracy of 95.73%. These classifiers exhibit improved performance using other performance measures such as sensitivity, specificity, Matthews correlation coefficient, and F-measure. The experimental results show that the prediction performance of the proposed scheme for classifying membrane protein types is the best reported so far. This performance improvement may largely be credited to the learning capabilities of neural networks and the composite feature extraction strategy, which exploits seven different properties of protein sequences. The proposed Mem-Predictor can be accessed at http://111.68.99.218/Mem-Predictor. Copyright © 2010 Elsevier Ltd. All rights reserved.
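Two of the seven feature sets listed in this abstract, amino acid composition and sequence length, together with one of the named performance measures, the Matthews correlation coefficient, are simple enough to sketch in Python. The function names and the toy input are hypothetical, and this is not the authors' CPSR implementation; the remaining feature sets, the principal component analysis step and the classifiers are not shown.

```python
from math import sqrt

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Relative frequency of each standard amino acid (20 values)."""
    sequence = sequence.upper()
    length = len(sequence)
    return [sequence.count(aa) / length for aa in AMINO_ACIDS]


def feature_vector(sequence):
    """Composition (20 values) followed by sequence length (1 value)."""
    return amino_acid_composition(sequence) + [float(len(sequence))]


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0  # conventional value when any marginal total is zero
    return (tp * tn - fp * fn) / denominator


# Hypothetical toy peptide, for illustration only.
print(feature_vector("AACD"))
print(matthews_corrcoef(tp=5, tn=5, fp=0, fn=0))
```

The 21-value vector here stands in for only a small part of the composite representation described above.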
Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry)

Directory of Open Access Journals (Sweden)

Javier Villacreses

2015-04-01

Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs). ORFs 1 and 2 share 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV, Petuvirus genus). ORF1 encodes a movement protein (MP); ORF2 encodes a reverse transcriptase (RT) and a ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggest that AcV1 could be a new member of the Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant.

Tannic acid modified silver nanoparticles show antiviral activity in herpes simplex virus type 2 infection.

Directory of Open Access Journals (Sweden)

Piotr Orlowski

The interaction between silver nanoparticles and herpesviruses is attracting great interest due to their antiviral activity and their potential use as microbicides for oral and anogenital herpes. In this work, we demonstrate that tannic acid modified silver nanoparticles sized 13 nm, 33 nm and 46 nm are capable of reducing HSV-2 infectivity both in vitro and in vivo. The antiviral activity of tannic acid modified silver nanoparticles was size-related, required direct interaction and blocked virus attachment, penetration and further spread. All tested tannic acid modified silver nanoparticles reduced both infection and inflammatory reaction in the mouse model of HSV-2 infection when used at infection or for a post-infection treatment. Smaller-sized nanoparticles induced production of cytokines and chemokines important for anti-viral response. The corresponding control buffers with tannic acid showed inferior antiviral effects in vitro and were ineffective in blocking in vivo infection. Our results show that tannic acid modified silver nanoparticles are good candidates for microbicides used in treatment of herpesvirus infections.
Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.

Science.gov (United States)

Tan, Yen Hock; Huang, He; Kihara, Daisuke

2006-08-15

Aligning distantly related protein sequences is a long-standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance has been increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile-profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself, thereby resulting in more accurate profile-profile alignments. Here we have developed novel amino acid similarity matrices from knowledge-based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential-based matrices perform comparably in the family-level alignments, but clearly outperform them in the fold-level alignments. The contact potential-based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential-based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space.
Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

Science.gov (United States)

Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

1993-02-01

A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.
Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

Science.gov (United States)

Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

2011-01-01

Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583
Structure and Sequence Search on Aptamer-Protein Docking

Science.gov (United States)

Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

2015-03-01

Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in living systems, especially through gene regulation. However, short nucleic acid sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer binding, we use GPU-enabled molecular dynamics simulations both to examine the conformational flexibility of the aptamer in the absence of binding to thrombin and to determine our ability to fold an aptamer. This study should help further de novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
Sequence of a cDNA encoding turtle high mobility group 1 protein.

Science.gov (United States)

Zheng, Jifang; Hu, Bi; Wu, Duansheng

2005-07-01

To obtain sequence information about the turtle HMG1 gene, a cDNA encoding the HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, and was cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF encodes 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with the homologous sequences of chicken (96.5%) and mammals (74%) than with the homologous sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of an HMG1 cDNA sequence in any reptile.
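The two kinds of figures quoted in this abstract, an ORF length in base pairs with its residue count and percent amino acid identities, follow from simple arithmetic. The Python sketch below assumes that the 606 bp figure does not count a stop codon and that identity is measured over aligned columns in which neither sequence has a gap; the abstract states neither convention, and the aligned fragments are hypothetical rather than real HMG1 sequences.

```python
def codons_in_orf(length_bp):
    """Number of codons in an ORF whose length is a multiple of three."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return length_bp // 3


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# 606 bp corresponds to 202 codons, matching the 202 residues reported.
print(codons_in_orf(606))

# Hypothetical pre-aligned fragments differing at one of ten positions.
print(percent_identity("MGKGDPKKPR", "MGKGDPNKPR"))
```

Other definitions of identity (for example, dividing by the length of the shorter sequence) give different percentages, so reported values are only comparable when the convention is known.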
Sequence of human protamine 2 cDNA

Energy Technology Data Exchange (ETDEWEB)

Domenjoud, L; Fronia, C; Uhde, F; Engel, W [Universitaet Goettingen (West Germany)]

1988-08-11

The authors report the cloning and sequencing of a cDNA clone for human protamine 2 (hp2), isolated from a human testis cDNA library cloned in the vector λgt11. A 66-mer oligonucleotide that corresponds to an amino acid sequence highly conserved between hp2 and mouse protamine 2 (mp2) served as the hybridization probe. The homology between the amino acid sequence deduced from our cDNA and the published amino acid sequence for hp2 is 100%.
Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

Science.gov (United States)

Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

2007-01-01

Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
cDNA encoding a polypeptide including a hevein sequence

Science.gov (United States)

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
The Saccharomyces cerevisiae RAD18 gene encodes a protein that contains potential zinc finger domains for nucleic acid binding and a putative nucleotide binding sequence

Energy Technology Data Exchange (ETDEWEB)

Jones, J.S.; Prakash, L. (Univ. of Rochester School of Medicine, NY (USA)); Weber, S. (Kodak Research Park, Rochester, NY (USA))

1988-07-25

The RAD18 gene of Saccharomyces cerevisiae is required for postreplication repair of UV damaged DNA. The authors have isolated the RAD18 gene, determined its nucleotide sequence and examined if deletion mutations of this gene show different or more pronounced phenotypic effects than the previously described point mutations. The RAD18 gene open reading frame encodes a protein of 487 amino acids, with a calculated molecular weight of 55,512. The RAD18 protein contains three potential zinc finger domains for nucleic acid binding, and a putative nucleotide binding sequence that is present in many proteins that bind and hydrolyze ATP. The DNA binding and nucleotide binding activities could enable the RAD18 protein to bind damaged sites in the template DNA with high affinity. Alternatively, or in addition, RAD18 protein may be a transcriptional regulator. The RAD18 deletion mutation resembles the previously described point mutations in its effects on viability, DNA repair, UV mutagenesis, and sporulation.
Clostridium sticklandii, a specialist in amino acid degradation: revisiting its metabolism through its genome sequence

Directory of Open Access Journals (Sweden)

Pelletier Eric

2010-10-01

Background: Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results: C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions: Analysis of the C
Nucleic Acid Amplification Testing and Sequencing Combined with Acid-Fast Staining in Needle Biopsy Lung Tissues for the Diagnosis of Smear-Negative Pulmonary Tuberculosis.

Science.gov (United States)

Jiang, Faming; Huang, Weiwei; Wang, Ye; Tian, Panwen; Chen, Xuerong; Liang, Zongan

2016-01-01

Smear-negative pulmonary tuberculosis (PTB) is common and difficult to diagnose. In this study, we investigated the diagnostic value of nucleic acid amplification testing and sequencing combined with acid-fast bacteria (AFB) staining of needle biopsy lung tissues for patients with suspected smear-negative PTB. Patients with suspected smear-negative PTB who underwent percutaneous transthoracic needle biopsy between May 1, 2012, and June 30, 2015, were enrolled in this retrospective study. Patients with AFB in sputum smears were excluded. All lung biopsy specimens were fixed in formalin, embedded in paraffin, and subjected to acid-fast staining and tuberculous polymerase chain reaction (TB-PCR). For patients with positive AFB and negative TB-PCR results in lung tissues, probe assays and 16S rRNA sequencing were used for identification of nontuberculous mycobacteria (NTM). The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy of PCR and AFB staining were calculated separately and in combination. Among the 220 eligible patients, 133 were diagnosed with TB (men/women: 76/57; age range: 17-80 years, confirmed TB: 9, probable TB: 124). Forty-eight patients who were diagnosed with other specific diseases were assigned as negative controls, and 39 patients with indeterminate final diagnosis were excluded from statistical analysis. The sensitivity, specificity, PPV, NPV, and accuracy of histological AFB (HAFB) for the diagnosis of smear-negative PTB were 61.7% (82/133), 100% (48/48), 100% (82/82), 48.5% (48/99), and 71.8% (130/181), respectively. The sensitivity, specificity, PPV, and NPV of histological PCR were 89.5% (119/133), 95.8% (46/48), 98.3% (119/121), and 76.7% (46/60), respectively, demonstrating that histological PCR had significantly higher accuracy (91.2% [165/181]) than histological acid-fast staining (71.8% [130/181]), P pulmonary tuberculosis. For patients with positive histological AFB and
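The percentages in this record follow from a 2x2 confusion table and can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function name `diagnostic_metrics` is hypothetical, and the false-negative counts (51 and 14) are not stated in the abstract but derived here as 133 minus the reported true positives.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return standard diagnostic test metrics as fractions between 0 and 1."""
    return {
        "sensitivity": tp / (tp + fn),   # true positives among diseased
        "specificity": tn / (tn + fp),   # true negatives among controls
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }

# Counts taken from the abstract: 133 TB patients, 48 negative controls.
# fn values are derived (133 - tp), not quoted from the source.
hafb = diagnostic_metrics(tp=82, fp=0, fn=51, tn=48)    # histological AFB
hpcr = diagnostic_metrics(tp=119, fp=2, fn=14, tn=46)   # histological PCR

for name, metrics in (("HAFB", hafb), ("PCR", hpcr)):
    summary = ", ".join(f"{k}={100 * v:.1f}%" for k, v in metrics.items())
    print(f"{name}: {summary}")
```

With these counts the NPV denominator is the number of test-negative patients (48 + 51 = 99 for AFB staining, 46 + 14 = 60 for PCR), not the full cohort of 181.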
Next-generation sequencing library preparation method for identification of RNA viruses on the Ion Torrent Sequencing Platform.

Science.gov (United States)

Chen, Guiqian; Qiu, Yuan; Zhuang, Qingye; Wang, Suchun; Wang, Tong; Chen, Jiming; Wang, Kaicheng

2018-05-09

Next generation sequencing (NGS) is a powerful tool for the characterization, discovery, and molecular identification of RNA viruses. Multiple NGS library preparation methods have been published for strand-specific RNA-seq, but some methods are not suitable for identifying and characterizing RNA viruses. In this study, we report an NGS library preparation method to identify RNA viruses using the Ion Torrent PGM platform. The NGS sequencing adapters were directly inserted into the sequencing library through reverse transcription and polymerase chain reaction, without fragmentation and ligation of nucleic acids. The results show that this method is simple to perform and able to identify multiple species of RNA viruses in clinical samples.
Sequence Design for a Test Tube of Interacting Nucleic Acid Strands.

Science.gov (United States)

Wolfe, Brian R; Pierce, Niles A

2015-10-16

We describe an algorithm for designing the equilibrium base-pairing properties of a test tube of interacting nucleic acid strands. A target test tube is specified as a set of desired "on-target" complexes, each with a target secondary structure and target concentration, and a set of undesired "off-target" complexes, each with vanishing target concentration. Sequence design is performed by optimizing the test tube ensemble defect, corresponding to the concentration of incorrectly paired nucleotides at equilibrium evaluated over the ensemble of the test tube. To reduce the computational cost of accepting or rejecting mutations to a random initial sequence, the structural ensemble of each on-target complex is hierarchically decomposed into a tree of conditional subensembles, yielding a forest of decomposition trees. Candidate sequences are evaluated efficiently at the leaf level of the decomposition forest by estimating the test tube ensemble defect from conditional physical properties calculated over the leaf subensembles. As optimized subsequences are merged toward the root level of the forest, any emergent defects are eliminated via ensemble redecomposition and sequence reoptimization. After successfully merging subsequences to the root level, the exact test tube ensemble defect is calculated for the first time, explicitly checking for the effect of the previously neglected off-target complexes. Any off-target complexes that form at appreciable concentration are hierarchically decomposed, added to the decomposition forest, and actively destabilized during subsequent forest reoptimization. For target test tubes representative of design challenges in the molecular programming and synthetic biology communities, our test tube design algorithm typically succeeds in achieving a normalized test tube ensemble defect ≤1% at a design cost within an order of magnitude of the cost of test tube analysis.
Identification of metal ion binding sites based on amino acid sequences.

Science.gov (United States)

Cao, Xiaoyong; Hu, Xiuzhen; Zhang, Xiaojin; Gao, Sujuan; Ding, Changjiang; Feng, Yonge; Bao, Weihua

2017-01-01

The identification of metal ion binding sites is important for protein function annotation and the design of new drug molecules. This study presents an effective method of analyzing and identifying the binding residues of metal ions based solely on sequence information. Ten metal ions were extracted from the BioLip database: Zn2+, Cu2+, Fe2+, Fe3+, Ca2+, Mg2+, Mn2+, Na+, K+ and Co2+. The analysis showed that Zn2+, Cu2+, Fe2+, Fe3+, and Co2+ were sensitive to the conservation of amino acids at binding sites, and promising results can be achieved using the Position Weight Scoring Matrix algorithm, with an accuracy of over 79.9% and a Matthews correlation coefficient of over 0.6. The binding sites of other metals can also be accurately identified using the Support Vector Machine algorithm with multifeature parameters as input. In addition, we found that Ca2+ was insensitive to hydrophobicity and hydrophilicity information and Mn2+ was insensitive to polarization charge information. An online server was constructed based on the framework of the proposed method and is freely available at http://60.31.198.140:8081/metal/HomePage/HomePage.html.
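For readers unfamiliar with the Matthews correlation coefficient quoted above, a minimal sketch of the standard formula follows. The confusion-matrix counts are hypothetical, chosen only to show how an accuracy near 80% can coincide with an MCC of 0.6; they are not data from the study.

```python
import math

def matthews_corrcoef(tp, fp, fn, tn):
    """Matthews correlation coefficient for a binary confusion matrix."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Conventional fallback when a row or column of the matrix is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom

# Hypothetical counts for binding vs. non-binding residues (illustration only).
tp, fp, fn, tn = 80, 20, 20, 80
accuracy = (tp + tn) / (tp + fp + fn + tn)
mcc = matthews_corrcoef(tp, fp, fn, tn)
print(f"accuracy={accuracy:.3f}, MCC={mcc:.3f}")
```

Unlike accuracy, the MCC stays informative when binding residues are much rarer than non-binding ones, which is the usual situation in binding-site prediction.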
Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

Directory of Open Access Journals (Sweden)

Leitner Dietmar

2005-04-01

Background: A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets, different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore, we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results: One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion: In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.
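A Chou-Fasman-style parameter is a ratio of frequencies: how often a feature occurs in one class relative to how often it occurs overall. The sketch below adapts that idea to cis Xaa-Pro bonds per secondary-structure class. This adaptation, the function name `propensity`, and all counts are assumptions for illustration; the abstract does not give the exact definition or data the authors used.

```python
def propensity(count_in_class, total_in_class, count_overall, total_overall):
    """Chou-Fasman-style propensity: class frequency divided by overall frequency.

    A value above 1 means the feature is enriched in the class,
    a value below 1 means it is depleted.
    """
    if total_in_class == 0 or count_overall == 0 or total_overall == 0:
        raise ValueError("counts must allow both frequencies to be computed")
    class_frequency = count_in_class / total_in_class
    overall_frequency = count_overall / total_overall
    return class_frequency / overall_frequency

# Hypothetical counts: 60 cis bonds among 1000 Xaa-Pro bonds in coil regions,
# and 100 cis bonds among 2000 Xaa-Pro bonds in the whole data set.
p_cis_coil = propensity(60, 1000, 100, 2000)
print(f"cis propensity in coil: {p_cis_coil:.2f}")
```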
Phenolic Acids from Wheat Show Different Absorption Profiles in Plasma: A Model Experiment with Catheterized Pigs

DEFF Research Database (Denmark)

Nørskov, Natalja; Hedemann, Mette Skou; Theil, Peter Kappel

2013-01-01

The concentration and absorption of the nine phenolic acids of wheat were measured in a model experiment with catheterized pigs fed whole grain wheat and wheat aleurone diets. Six pigs in a repeated crossover design were fitted with catheters in the portal vein and mesenteric artery to study...... the absorption of phenolic acids. The difference between the artery and the vein for all phenolic acids was small, indicating that the release of phenolic acids in the large intestine was not sufficient to create a porto-arterial concentration difference. Although, the porto-arterial difference was small...... consumed. Benzoic acid derivatives showed low concentration in the plasma (phenolic acids, likely because it is an intermediate in the phenolic acid metabolism...
Nucleotide sequence of the coat protein gene of the Skierniewice isolate of plum pox virus (PPV)

International Nuclear Information System (INIS)

Wypijewski, K.; Musial, W.; Augustyniak, J.; Malinowski, T.

1994-01-01

The coat protein (CP) gene of the Skierniewice isolate of plum pox virus (PPV-S) has been amplified using the reverse transcription - polymerase chain reaction (RT-PCR), cloned and sequenced. The nucleotide sequence of the gene and the deduced amino-acid sequences of PPV-S CP were compared with those of other PPV strains. The nucleotide sequence showed very high homology to most of the published sequences. The motif: Asp-Ala-Gly (DAG), important for the aphid transmissibility, was present in the amino-acid sequence. Our isolate did not react in ELISA with monoclonal antibodies MAb06 supposed to be specific for PPV-D. (author). 32 refs, 1 fig., 2 tabs
Streptococcus pneumonia YlxR at 1.35 Å shows a putative new fold.

Science.gov (United States)

Osipiuk, J; Górnicki, P; Maj, L; Dementieva, I; Laskowski, R; Joachimiak, A

2001-11-01

The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Å. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Å from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.
Genome sequence of the thermophilic strain Bacillus coagulans 2-6, an efficient producer of high-optical-purity L-lactic acid.

Science.gov (United States)

Su, Fei; Yu, Bo; Sun, Jibin; Ou, Hong-Yu; Zhao, Bo; Wang, Limin; Qin, Jiayang; Tang, Hongzhi; Tao, Fei; Jarek, Michael; Scharfe, Maren; Ma, Cuiqing; Ma, Yanhe; Xu, Ping

2011-09-01

Bacillus coagulans 2-6 is an efficient producer of lactic acid. B. coagulans 2-6 has the smallest genome among the members of the genus Bacillus known to date. The frameshift mutation at the start of the D-lactate dehydrogenase sequence might be responsible for the production of high-optical-purity L-lactic acid.

BLEACHING EUCALYPTUS PULPS WITH SHORT SEQUENCES

Directory of Open Access Journals (Sweden)

Flaviana Reis Milagres

2011-03-01

Eucalyptus spp. kraft pulp, due to its high content of hexenuronic acids, is quite easy to bleach. Therefore, investigations have been made attempting to decrease the number of stages in the bleaching process in order to minimize capital costs. This study focused on the evaluation of short ECF (Elemental Chlorine Free) and TCF (Totally Chlorine Free) sequences for bleaching oxygen delignified Eucalyptus spp. kraft pulp to 90% ISO brightness: PMoDP (molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide), PMoD/P (molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, without washing), PMoD(PO) (molybdenum catalyzed acid peroxide, chlorine dioxide and pressurized peroxide), D(EPO)DP (chlorine dioxide, oxidative extraction with oxygen and peroxide, chlorine dioxide and hydrogen peroxide), PMoQ(PO) (molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide), and XPMoQ(PO) (enzyme, molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide). Uncommon pulp treatments, such as molybdenum catalyzed acid peroxide (PMo) and xylanase (X) bleaching stages, were used. Among the ECF alternatives, the two-stage PMoD/P sequence proved highly cost-effective without affecting pulp quality in relation to the traditional D(EPO)DP sequence and produced better quality effluent in relation to the reference. However, a four-stage sequence, XPMoQ(PO), was required to achieve full brightness using the TCF technology. This sequence was highly cost-effective although it only produced pulp of acceptable quality.
Streptococcus pneumonia YlxR at 1.35 Å shows a putative new fold

OpenAIRE

Osipiuk, Jerzy; Górnicki, Piotr; Maj, Luke; Dementieva, Irina; Laskowski, Roman; Joachimiak, Andrzej

2001-01-01

The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Å. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Å from orthorhombic crystals using synchrotron radiation and the structure was determined ...
Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

DEFF Research Database (Denmark)

Ibaraki, K; Kozak, C A; Wewer, U M

1995-01-01

regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76......(s) of tetranectin. The sequence analysis revealed a difference in both sequence and size of the noncoding regions between mouse and human cDNAs. Northern analysis of the various tissues from mouse, rat, and cow showed the major transcript(s) to be approximately 1 kb, which is similar in size to that observed...
Amino acid sequence and posttranslational modifications of human factor VIIa from plasma and transfected baby hamster kidney cells

International Nuclear Information System (INIS)

Thim, L.; Bjoern, S.; Christensen, M.; Nicolaisen, E.M.; Lund-Hansen, T.; Pedersen, A.H.; Hedner, U.

1988-01-01

Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VIIa, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca2+ and tissue factor. Three types of potential posttranslational modifications exist in the human factor VIIa molecule, namely, 10 γ-carboxylated, N-terminally located glutamic acid residues, 1 β-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VIIa as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VIIa. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VIIa was found to be identical with human factor VIIa. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VIIa. In the recombinant factor VIIa, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VIIa and human plasma factor VIIa. These results show that factor VIIa as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VIIa and that this cell line thus might represent an alternative source for human factor VIIa.
Sequence homology: A poor predictive value for profilins cross-reactivity

Directory of Open Access Journals (Sweden)

Pazouki Nazanin

2005-09-01

Background: Profilins are highly cross-reactive allergens which bind IgE antibodies of almost 20% of plant-allergic patients. This study is aimed at investigating cross-reactivity of melon profilin with other plant profilins and the role of the linear and conformational epitopes in human IgE cross-reactivity. Methods: Seventeen patients with melon allergy were selected based on clinical history and a positive skin prick test to melon extract. Melon profilin has been cloned and expressed in E. coli. The IgE binding and cross-reactivity of the recombinant profilin were measured by ELISA and inhibition ELISA. The amino acid sequence of melon profilin was compared with other profilin sequences. A combination of chemical cleavage and immunoblotting techniques was used to define the role of conformational and linear epitopes in IgE binding. Comparative modeling was used to construct three-dimensional models of profilins and to assess the theoretical impact of amino acid differences on conformational structure. Results: Profilin was identified as a major IgE-binding component of melon. Alignment of amino acid sequences of melon profilin with other profilins showed the most identity with watermelon profilin. This melon profilin showed substantial cross-reactivity with the tomato, peach, grape and Cynodon dactylon (Bermuda grass) pollen profilins. Cantaloupe, watermelon, banana and Poa pratensis (Kentucky blue grass) displayed no notable inhibition. Our experiments also indicated that human IgE reacts only with complete melon profilin. Immunoblotting analysis with rabbit polyclonal antibody shows the reaction of the antibody to the fragmented and complete melon profilin. Although the well-known linear epitopes of profilins were identical in melon and watermelon, comparison of three-dimensional models of watermelon and melon profilins indicated amino acid differences influence the electric potential and accessibility of the solvent-accessible surface of
The complete nucleotide sequence of the barley yellow dwarf GPV isolate from China shows that it is a new member of the genus Polerovirus.

Science.gov (United States)

Zhang, Wenwei; Cheng, Zhuomin; Xu, Lei; Wu, Maosen; Waterhouse, Peter; Zhou, Guanghe; Li, Shifang

2009-01-01

The complete nucleotide sequence of the ssRNA genome of a Chinese GPV isolate of barley yellow dwarf virus (BYDV) was determined. It comprised 5673 nucleotides, and the deduced genome organization resembled that of members of the genus Polerovirus. It was most closely related to cereal yellow dwarf virus-RPV (77% nt identity over the entire genome; coat protein amino acid identity 79%). The GPV isolate also differs in vector specificity from other BYDV strains. Biological properties, phylogenetic analyses and detailed sequence comparisons suggest that GPV should be considered a member of a new species within the genus, and the name Wheat yellow dwarf virus-GPV is proposed.
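Identity figures such as the 77% nucleotide and 79% coat protein values above are computed from aligned sequences. The following is a minimal sketch of one common convention, counting matches over columns where neither sequence has a gap; other conventions use a different denominator, and the abstract does not say which one the authors applied. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns that contain no gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared

# Toy pre-aligned sequences (illustration only, not viral data).
identity = percent_identity("ACGT-ACGTA", "ACGTTACCTA")
print(f"{identity:.1f}% identity")
```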
Computer-aided visualization and analysis system for sequence evaluation

Energy Technology Data Exchange (ETDEWEB)

Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

2004-05-11

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

2000-07-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 12 figs.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 11 figures.
First draft genome sequencing of indole acetic acid producing and plant growth promoting fungus Preussia sp. BSL10.

Science.gov (United States)

Khan, Abdul Latif; Asaf, Sajjad; Khan, Abdur Rahim; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

2016-05-10

Preussia sp. BSL10, family Sporormiaceae, actively produced a phytohormone (indole-3-acetic acid) and extracellular enzymes (phosphatases and glucosidases). The fungus also promoted the growth of the arid-land tree Boswellia sacra. Given these properties, we sequenced its draft genome for the first time. The Illumina-based sequence analysis reveals an approximate genome size of 31.4 Mbp for Preussia sp. BSL10. Based on ab initio gene prediction, a total of 32,312 coding sequences were annotated, consisting of 11,967 coding genes, pseudogenes, and 221 tRNA genes. Furthermore, 321 carbohydrate-active enzymes were predicted and classified into many functional families. Copyright © 2016 Elsevier B.V. All rights reserved.
Acid mine drainage neutralization in a pilot sequencing batch reactor using limestone from a paper and pulp industry

CSIR Research Space (South Africa)

Vadapalli, VRK

2015-10-01

This study investigated the implications of using two grades of limestone from a paper and pulp industry for neutralization of acid mine drainage (AMD) in a pilot sequencing batch reactor (SBR). In this regard, two grades of calcium carbonate were...
Novel algorithms for protein sequence analysis

NARCIS (Netherlands)

Ye, Kai

2008-01-01

Each protein is characterized by its unique sequential order of amino acids, the so-called protein sequence. Biology's paradigm is that this order of amino acids determines the protein's architecture and function. In this thesis, we introduce novel algorithms to analyze protein sequences. Chapter 1
Influence of the Amino Acid Sequence on Protein-Mineral Interactions in Soil

Science.gov (United States)

Chacon, S. S.; Reardon, P. N.; Purvine, S.; Lipton, M. S.; Washton, N.; Kleber, M.

2017-12-01

The intimate associations between protein and mineral surfaces have profound impacts on nutrient cycling in soil. Proteins are an important source of organic C and N, and a subset of proteins, extracellular enzymes (EE), can catalyze the depolymerization of soil organic matter (SOM). Our goal was to determine how variation in the amino acid sequence could influence a protein's susceptibility to become chemically altered by mineral surfaces, in order to infer the fate of adsorbed EE function in soil. We hypothesized that (1) addition of charged amino acids would enhance adsorption onto oppositely charged mineral surfaces, (2) addition of aromatic amino acids would increase adsorption onto zero-charged surfaces, and (3) increased adsorption of modified proteins would enhance their susceptibility to alteration by redox-active minerals. To test these hypotheses, we generated three engineered proxies of a model protein, Gb1 (IEP 4.0, 6.2 kDa), by inserting either negatively charged, positively charged or aromatic amino acids in the second loop. These modified proteins were allowed to interact with functionally different mineral surfaces (goethite, montmorillonite, kaolinite and birnessite) at pH 5 and 7. We used LC-MS/MS and solution-state Heteronuclear Single Quantum Coherence Spectroscopy NMR to observe modifications of the engineered proteins as a consequence of mineral interactions. Preliminary results indicate that addition of any amino acids to a protein increases its susceptibility to fragmentation and oxidation by redox-active mineral surfaces, and alters adsorption to the other mineral surfaces. This suggests that not all mineral surfaces in soil may act as sorbents for EEs and that chemical modification of their structure should also be considered as an explanation for decreases in EE activity. Fragmentation of proteins by minerals can bypass the need to produce proteases, but microbial acquisition of other nutrients that require enzymes such as cellulases, ligninases or phosphatases
Acid Evolution of Escherichia coli K-12 Eliminates Amino Acid Decarboxylases and Reregulates Catabolism.

Science.gov (United States)

He, Amanda; Penix, Stephanie R; Basting, Preston J; Griffith, Jessie M; Creamer, Kaitlin E; Camperchioli, Dominic; Clark, Michelle W; Gonzales, Alexandra S; Chávez Erazo, Jorge Sebastian; George, Nadja S; Bhagwat, Arvind A; Slonczewski, Joan L

2017-06-15

Acid-adapted strains of Escherichia coli K-12 W3110 were obtained by serial culture in medium buffered at pH 4.6 (M. M. Harden, A. He, K. Creamer, M. W. Clark, I. Hamdallah, K. A. Martinez, R. L. Kresslein, S. P. Bush, and J. L. Slonczewski, Appl Environ Microbiol 81:1932-1941, 2015, https://doi.org/10.1128/AEM.03494-14). Revised genomic analysis of these strains revealed insertion sequence (IS)-driven insertions and deletions that knocked out regulators CadC (acid induction of lysine decarboxylase), GadX (acid induction of glutamate decarboxylase), and FNR (anaerobic regulator). Each acid-evolved strain showed loss of one or more amino acid decarboxylase systems, which normally help neutralize external acid (pH 5 to 6) and increase survival in extreme acid (pH 2). Strains from populations B11, H9, and F11 had an IS5 insertion or IS-mediated deletion in cadC, while population B11 had a point mutation affecting the arginine activator adiY. The cadC and adiY mutants failed to neutralize acid in the presence of exogenous lysine or arginine. In strain B11-1, reversion of an rpoC (RNA polymerase) mutation partly restored arginine-dependent neutralization. All eight strains showed deletion or downregulation of the Gad acid fitness island. Strains with the Gad deletion lost the ability to produce GABA (gamma-aminobutyric acid) and failed to survive extreme acid. Transcriptome sequencing (RNA-seq) of strain B11-1 showed upregulated genes for catabolism of diverse substrates but downregulated acid stress genes (the biofilm regulator ariR, yhiM, and Gad). Other strains showed downregulation of H2 consumption mediated by hydrogenases (hya and hyb) which release acid. Strains F9-2 and F9-3 had a deletion of fnr and showed downregulation of FNR-dependent genes (dmsABC, frdABCD, hybABO, nikABCDE, and nrfAC). Overall, strains that had evolved in buffered acid showed loss or downregulation of systems that neutralize unbuffered acid and showed altered regulation of
Deep sequencing shows microRNA involvement in bovine mammary gland adaptation to diets supplemented with linseed oil or safflower oil.

Science.gov (United States)

Li, Ran; Beaudoin, Frédéric; Ammah, Adolf A; Bissonnette, Nathalie; Benchaar, Chaouki; Zhao, Xin; Lei, Chuzhao; Ibeagha-Awemu, Eveline M

2015-10-30

Bovine milk fat composition is responsive to dietary manipulation, providing an avenue to modify the content of fatty acids and especially some specific unsaturated fatty acid (USFA) isomers of benefit to human health. MicroRNAs (miRNAs) regulate gene expression but their specific roles in bovine mammary gland lipogenesis are unclear. The objective of this study was to determine the expression pattern of miRNAs following mammary gland adaptation to dietary supplementation with 5 % linseed or safflower oil using next generation RNA-sequencing. Twenty-four Canadian Holstein dairy cows (twelve per treatment) in mid lactation were fed a control diet (total mixed ration of corn:grass silages) for 28 days followed by a treatment period (control diet supplemented with 5 % linseed or safflower oil) of 28 days. Milk samples were collected weekly for fat and individual fatty acid determination. RNA from mammary gland biopsies harvested on day -14 (control period) and on days +7 and +28 (treatment period) from six randomly selected cows per treatment was subjected to small RNA sequencing. Milk fat percentage decreased significantly (P safflower oil treatments, respectively. Seven miRNAs including six up-regulated (bta-miR-199c, miR-199a-3p, miR-98, miR-378, miR-148b and miR-21-5p) and one down-regulated (bta-miR-200a) were found to be regulated (P < 0.05) by both treatments, and thus considered core differentially expressed (DE) miRNAs. The gene targets of core DE miRNAs have functions related to gene expression and general cellular metabolism (P < 0.05) and are enriched in four pathways of lipid metabolism (3-phosphoinositide biosynthesis, 3-phosphoinositide degradation, D-myo-inositol-5-phosphate metabolism and the superpathway of inositol phosphate compounds). Our results suggest that DE miRNAs in this study might be important regulators of bovine mammary lipogenesis and metabolism. The novel miRNAs identified in this study will further enrich the bovine miRNome repertoire
Cloning and sequencing of the bovine gastrin gene

DEFF Research Database (Denmark)

Lund, T; Rehfeld, J F; Olsen, Jørgen

1989-01-01

In order to deduce the primary structure of bovine preprogastrin we therefore sequenced a gastrin DNA clone isolated from a bovine liver cosmid library. Bovine preprogastrin comprises 104 amino acids and consists of a signal peptide, a 37 amino acid spacer-sequence, the gastrin-34 sequence followed...
Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

Science.gov (United States)

Benyo, B; Biro, J C; Benyo, Z

2004-01-01

The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2nd) base in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
A New Approach to Sequence Analysis Exemplified by Identification of cis-Elements in Abscisic Acid Inducible Promoters

DEFF Research Database (Denmark)

Busk, Peter Kamp; Hallin, Peter Fischer; Salomon, Jesper

-regulatory elements. We have developed a method for identifying short, conserved motifs in biological sequences such as proteins, DNA and RNA. This method was used for analysis of approximately 2000 Arabidopsis thaliana promoters that have been shown by DNA array analysis to be induced by abscisic acid....... These promoters were compared to 28000 promoters that are not induced by abscisic acid. The analysis identified previously described ABA-inducible promoter elements such as ABRE, CE3 and CRT1 but also new cis-elements were found. Furthermore, the list of DNA elements could be used to predict ABA...

cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

2000-07-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
Characterization, Genome Sequence, and Analysis of Escherichia Phage CICC 80001, a Bacteriophage Infecting an Efficient L-Aspartic Acid Producing Escherichia coli.

Science.gov (United States)

Xu, Youqiang; Ma, Yuyue; Yao, Su; Jiang, Zengyan; Pei, Jiangsen; Cheng, Chi

2016-03-01

Escherichia phage CICC 80001 was isolated from the bacteriophage-contaminated medium of an Escherichia coli strain, HY-05C (CICC 11022S), which could produce L-aspartic acid. The phage had a head diameter of 45-50 nm and a tail of about 10 nm. The one-step growth curve showed a latent period of 10 min and a rise period of about 20 min. The average burst size was about 198 phage particles per infected cell. Tests were conducted on the plaques, multiplicity of infection, and host range. The genome of CICC 80001 was sequenced, with a length of 38,810 bp, and annotated. The key proteins leading to host-cell lysis were phylogenetically analyzed. One protein belonged to the class II holins, and the other two belonged to the endopeptidase family and the N-acetylmuramoyl-L-alanine amidase family, respectively. The genome showed a sequence identity of 82.7% with that of Enterobacteria phage T7, and carried ten unique open reading frames. A bacteriophage-resistant E. coli strain, designated CICC 11021S, was bred, and its L-aspartase activity was 84.4% of that of CICC 11022S.
Irritable bowel syndrome-diarrhea: characterization of genotype by exome sequencing, and phenotypes of bile acid synthesis and colonic transit

Science.gov (United States)

Klee, Eric W.; Shin, Andrea; Carlson, Paula; Li, Ying; Grover, Madhusudan; Zinsmeister, Alan R.

2013-01-01

The study objectives were: to mine the complete exome to identify putative rare single nucleotide variants (SNVs) associated with irritable bowel syndrome (IBS)-diarrhea (IBS-D) phenotype, to assess genes that regulate bile acids in IBS-D, and to explore univariate associations of SNVs with symptom phenotype and quantitative traits in an independent IBS cohort. Using principal components analysis, we identified two groups of IBS-D (n = 16) with increased fecal bile acids: rapid colonic transit or high bile acids synthesis. DNA was sequenced in depth, analyzing SNVs in bile acid genes (ASBT, FXR, OSTα/β, FGF19, FGFR4, KLB, SHP, CYP7A1, LRH-1, and FABP6). Exome findings were compared with those of 50 similar ethnicity controls. We assessed univariate associations of each SNV with quantitative traits and a principal components analysis and associations between SNVs in KLB and FGFR4 and symptom phenotype in 405 IBS, 228 controls and colonic transit in 70 IBS-D, 71 IBS-constipation. Mining the complete exome did not reveal significant associations with IBS-D over controls. There were 54 SNVs in 10 of 11 bile acid-regulating genes, with no SNVs in FGF19; 15 nonsynonymous SNVs were identified in similar proportions of IBS-D and controls. Variations in KLB (rs1015450, downstream) and FGFR4 [rs434434 (intronic), rs1966265, and rs351855 (nonsynonymous)] were associated with colonic transit (rs1966265; P = 0.043), fecal bile acids (rs1015450; P = 0.064), and principal components analysis groups (all 3 FGFR4 SNVs; P transit (P = 0.066). Thus exome sequencing identified additional variants in KLB and FGFR4 associated with bile acids or colonic transit in IBS-D. PMID:24200957
Haloarcula hispanica CRISPR authenticates PAM of a target sequence to prime discriminative adaptation.

Science.gov (United States)

Li, Ming; Wang, Rui; Xiang, Hua

2014-06-01

The prokaryotic immune system CRISPR/Cas (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR-associated genes) adapts to foreign invaders by acquiring their short deoxyribonucleic acid (DNA) fragments as spacers, which guide subsequent interference to foreign nucleic acids based on sequence matching. The adaptation mechanism avoiding acquiring 'self' DNA fragments is poorly understood. In Haloarcula hispanica, we previously showed that CRISPR adaptation requires being primed by a pre-existing spacer partially matching the invader DNA. Here, we further demonstrate that flanking a fully-matched target sequence, a functional PAM (protospacer adjacent motif) is still required to prime adaptation. Interestingly, interference utilizes only four PAM sequences, whereas adaptation-priming tolerates as many as 23 PAM sequences. This relaxed PAM selectivity explains how adaptation-priming maximizes its tolerance of PAM mutations (that escape interference) while avoiding mis-targeting the spacer DNA within CRISPR locus. We propose that the primed adaptation, which hitches and cooperates with the interference pathway, distinguishes target from non-target by CRISPR ribonucleic acid guidance and PAM recognition. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

Science.gov (United States)

Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

1995-04-01

The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Insights into the sequence parameters for halophilic adaptation.

Science.gov (United States)

Nath, Abhigyan

2016-03-01

The sequence parameters for halophilic adaptation are still not fully understood. To understand the molecular basis of protein hypersaline adaptation, a detailed analysis was carried out to investigate the likely association of protein sequence attributes with halophilic adaptation. A two-stage strategy was implemented. In the first stage, a supervised machine learning classifier was built, giving an overall accuracy of 86 % on stratified tenfold cross validation and 90 % on a blind testing set, which are better than the previously reported results. The second stage consists of statistical analysis of sequence features and possible extraction of halophilic molecular signatures. The results of this study showed that halophilic proteins are characterized by lower average charge, lower K content, and lower S content. A statistically significant preference/avoidance list of sequence parameters is also reported, giving insights into the molecular basis of halophilic adaptation. D, Q, E, H, P, T, V are significantly preferred while N, C, I, K, M, F, S are significantly avoided. Among amino acid physicochemical groups, small, polar, charged, acidic and hydrophilic groups are preferred over other groups. The halophilic proteins also showed a preference for higher average flexibility and higher average polarity, and avoidance of higher average positive charge, average bulkiness and average hydrophobicity. Some interesting trends observed in dipeptide counts are also reported. Further, a systematic statistical comparison is undertaken to gain insights into the sequence feature distribution in different residue structural states. The current analysis may make the mechanism of halophilic adaptation clearer, which can in turn be used for the rational design of halophilic proteins.
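The preference/avoidance list in the abstract above can be illustrated with a minimal Python sketch that computes the fraction of a protein sequence falling in each residue set. The function name `residue_fractions` and the example sequence are hypothetical; this is not the classifier or statistical procedure used in the study, only a composition count under the assumption that the input is a plain one-letter amino acid string.

```python
from collections import Counter

# Residue sets taken from the abstract; everything else here is illustrative.
PREFERRED = set("DQEHPTV")  # reported as significantly preferred
AVOIDED = set("NCIKMFS")    # reported as significantly avoided


def residue_fractions(protein):
    """Return (preferred_fraction, avoided_fraction) for a one-letter sequence."""
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein.upper())
    total = sum(counts.values())
    preferred = sum(counts[r] for r in PREFERRED) / total
    avoided = sum(counts[r] for r in AVOIDED) / total
    return preferred, avoided


if __name__ == "__main__":
    # Hypothetical toy sequence: four preferred residues (D, D, E, E), two avoided (K, K).
    print(residue_fractions("DDEEKK"))
```

Such fractions would be only one of many possible input features; the abstract also mentions charge, flexibility, polarity and dipeptide counts, which this sketch does not cover.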
Strains of Lactococcus lactis with a partial pyrimidine requirement show sensitivity toward aspartic acid

DEFF Research Database (Denmark)

Wadskov-Hansen, Steen Lyders Lerche; Martinussen, Jan

2009-01-01

The growth rate of the widely used laboratory strain Lactococcus lactis subsp. cremoris LM0230 was reduced if aspartic acid was present in the growth medium. The strain LM0230 is a plasmid- and phage-cured derivative of L. lactis subsp. cremoris C2, the ancestor of the original dairy isolate L...... with the wild-type strain, and this varied with the concentration of aspartic acid. The observed effect of aspartate could be explained by the accumulation of the toxic pyrimidine de novo pathway intermediate, carbamoyl aspartate. Assays of the pyrimidine biosynthetic enzymes of L. lactis LM0230 showed...... that the partial pyrimidine requirement can be explained by a low specific activity of the pyrimidine biosynthetic enzymes. In conclusion, L. lactis LM0230 during the process of plasmid- and prophage-curing has acquired a partial pyrimidine requirement resulting in sensitivity toward aspartic acid....
Cloning and sequencing of the gene for human β-casein

International Nuclear Information System (INIS)

Loennerdal, B.; Bergstroem, S.; Andersson, Y.; Hialmarsson, K.; Sundgyist, A.; Hernell, O.

1990-01-01

Human β-casein is a major protein in human milk. This protein is part of the casein micelle and has been suggested to have several physiological functions in the newborn. Since there is limited information on β-casein and the factors that affect its concentration in human milk, the authors have isolated and sequenced the gene for this protein. A human mammary gland cDNA library (Clontech) in λgt11 was screened by plaque hybridization using a 42-mer synthetic 32P-labelled oligonucleotide. Positive clones were identified and isolated, DNA was prepared and the gene isolated by cleavage with EcoRI. Following subcloning (pUC18), restriction mapping and Southern blotting, DNA for sequencing was prepared. The gene was sequenced by the dideoxy method. Human β-casein has 212 amino acids and the amino acid sequence deduced from the nucleotide sequence is 91% identical to the published sequence for human β-casein show a high degree of conservation at the leader peptide and the highly phosphorylated sequences, but also deletions and divergence at several positions. These results provide insight into the structure of the human β-casein gene and will facilitate studies on factors affecting its expression
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

Science.gov (United States)

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if they have >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or 15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
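Two of the flagging rules listed in the abstract above (ambiguous-base counts and stop codons) can be sketched as follows. This is a hypothetical Python illustration, not SQUAT's code (SQUAT itself runs in R); the function name `flag_sequence` is invented, and the sketch assumes the input is already trimmed so that it starts in reading frame and that every character other than A, C, G or T counts as ambiguous.

```python
# Hypothetical sketch of two threshold checks; not taken from SQUAT itself.
AMBIGUOUS_BASE_LIMIT = {"PR": 6, "RT": 18}  # ">six PR or >18 RT ambiguous bases"
STOP_CODONS = {"TAA", "TAG", "TGA"}


def flag_sequence(nt_seq, region):
    """Return the reasons a sequence would be flagged (empty list if none)."""
    seq = nt_seq.upper()
    reasons = []

    # Assumption: any symbol outside A/C/G/T is treated as an ambiguous base.
    ambiguous = sum(1 for base in seq if base not in "ACGT")
    if ambiguous > AMBIGUOUS_BASE_LIMIT[region]:
        reasons.append(f"{ambiguous} ambiguous bases")

    # Assumption: the sequence starts in frame; a trailing partial codon is ignored.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    stops = sum(1 for codon in codons if codon in STOP_CODONS)
    if stops > 0:  # ">zero stop codons"
        reasons.append(f"{stops} stop codons")
    return reasons


if __name__ == "__main__":
    print(flag_sequence("ATGTAAGCC", "PR"))
```

Keeping the limits in a dictionary mirrors the abstract's note that thresholds are user modifiable; the remaining rules (insertions, deletions, consecutive mutations, genetic distance) would need an alignment against a reference and are not attempted here.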
A branch-heterogeneous model of protein evolution for efficient inference of ancestral sequences.

Science.gov (United States)

Groussin, M; Boussau, B; Gouy, M

2013-07-01

Most models of nucleotide or amino acid substitution used in phylogenetic studies assume that the evolutionary process has been homogeneous across lineages and that composition of nucleotides or amino acids has remained the same throughout the tree. These oversimplified assumptions are refuted by the observation that compositional variability characterizes extant biological sequences. Branch-heterogeneous models of protein evolution that account for compositional variability have been developed, but are not yet in common use because of the large number of parameters required, leading to high computational costs and potential overparameterization. Here, we present a new branch-nonhomogeneous and nonstationary model of protein evolution that captures more accurately the high complexity of sequence evolution. This model, henceforth called Correspondence and likelihood analysis (COaLA), makes use of a correspondence analysis to reduce the number of parameters to be optimized through maximum likelihood, focusing on most of the compositional variation observed in the data. The model was thoroughly tested on both simulated and biological data sets to show its high performance in terms of data fitting and CPU time. COaLA efficiently estimates ancestral amino acid frequencies and sequences, making it relevant for studies aiming at reconstructing and resurrecting ancestral amino acid sequences. Finally, we applied COaLA on a concatenate of universal amino acid sequences to confirm previous results obtained with a nonhomogeneous Bayesian model regarding the early pattern of adaptation to optimal growth temperature, supporting the mesophilic nature of the Last Universal Common Ancestor.
[Cloning and bioinformatics analysis of abscisic acid 8'-hydroxylase from Pseudostellariae Radix].

Science.gov (United States)

Li, Jun; Long, Deng-Kai; Zhou, Tao; Ding, Ling; Zheng, Wei; Jiang, Wei-Ke

2016-07-01

Abscisic acid 8'-hydroxylase is one of the key enzymes in the metabolism of abscisic acid (ABA). Seven members of the abscisic acid 8'-hydroxylase family were identified from Pseudostellaria heterophylla transcriptome sequencing results by sequence homology. The expression profiles of these genes were analyzed using transcriptome data. The coding sequence of ABA8ox1 was cloned and analyzed by bioinformatics. The full-length cDNA of ABA8ox1 was 1401 bp, with 480 encoded amino acids. The predicted isoelectric point (pI) and relative molecular mass (MW) were 8.55 and 53 kDa, respectively. Transmembrane structure analysis showed that there were 21 amino acids inside and 445 amino acids outside the membrane. High transcript levels were detected in root bark and fibrous roots. Multiple alignment and phylogenetic analysis both showed that ABA8ox1 has high similarity to the CYP707As from other plants, especially AtCYP707A1 and AtCYP707A3 in Arabidopsis thaliana. These results lay a foundation for studying the molecular mechanism of tuberous root expansion and the response to adversity stress. Copyright© by the Chinese Pharmaceutical Association.
An acid phosphatase in the plasma membranes of human astrocytoma showing marked specificity toward phosphotyrosine protein.

Science.gov (United States)

Leis, J F; Kaplan, N O

1982-11-01

The plasma membrane from the human tumor astrocytoma contains an active acid phosphatase activity based on hydrolysis of p-nitrophenyl phosphate. Other acid phosphatase substrates--beta-glycerophosphate, O-phosphorylcholine, and 5'-AMP--are not hydrolyzed significantly. The phosphatase activity is tartrate insensitive and is stimulated by Triton X-100 and EDTA. Of the three known phosphoamino acids, only free O-phosphotyrosine is hydrolyzed by the membrane phosphatase activity. Other acid phosphatases tested from potato, wheat germ, milk, and bovine prostate did not show this degree of specificity. The plasma membrane activity also dephosphorylated phosphotyrosine histone at a much greater rate than did the other acid phosphatases. pH profiles for free O-phosphotyrosine and phosphotyrosine histone showed a shift toward physiological pH, indicating possible physiological significance. Phosphotyrosine histone dephosphorylation activity was nearly 10 times greater than that seen for phosphoserine histone dephosphorylation, and Km values were much lower for phosphotyrosine histone dephosphorylation (0.5 microM vs. 10 microM). Fluoride and zinc significantly inhibited phosphoserine histone dephosphorylation. Vanadate, on the other hand, was a potent inhibitor of phosphotyrosine histone dephosphorylation (50% inhibition at 0.5 microM) but not of phosphoserine histone. ATP stimulated phosphotyrosine histone dephosphorylation (160-250%) but inhibited phosphoserine histone dephosphorylation (95%). These results suggest the existence of a highly specific phosphotyrosine protein phosphatase activity associated with the plasma membrane of human astrocytoma.
Sequence variations in the FAD2 gene in seeded pumpkins.

Science.gov (United States)

Ge, Y; Chang, Y; Xu, W L; Cui, C S; Qu, S P

2015-12-21

Seeded pumpkins are important economic crops; the seeds contain various unsaturated fatty acids, such as oleic acid and linoleic acid, which are crucial for human and animal nutrition. The fatty acid desaturase-2 (FAD2) gene encodes delta-12 desaturase, which converts oleic acid to linoleic acid. However, little is known about sequence variations in FAD2 in seeded pumpkins. Twenty-seven FAD2 clones from 27 accessions of Cucurbita moschata, Cucurbita maxima, Cucurbita pepo, and Cucurbita ficifolia were obtained (1152 bp in total; a single gene without introns). More than 90% nucleotide identity was detected among the 27 FAD2 clones. Nucleotide substitution, rather than nucleotide insertion and deletion, led to sequence polymorphism in the 27 FAD2 clones. Furthermore, the 27 selected FAD2 clones all encoded the FAD2 enzyme (delta-12 desaturase), with amino acid sequence identities from 91.7 to 100% over 384 amino acids. The same main functional domain, between amino acids 47 and 329, was identified. The four species clustered separately based on differences in the sequences, as identified using the unweighted pair group method with arithmetic mean. Geographic origin and species were found to be closely related to sequence variation in FAD2.
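The identity percentages quoted in the abstract above are pairwise comparisons of aligned sequences. A minimal Python sketch of that calculation follows; the function name `percent_identity` is hypothetical, the inputs are assumed to be pre-aligned strings of equal length, and the study's actual alignment software and gap handling are not stated in the abstract.

```python
def percent_identity(seq_a, seq_b):
    """Percent of aligned positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("empty alignment")
    matches = sum(1 for x, y in zip(seq_a, seq_b) if x == y)
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Toy example: three of four positions match.
    print(percent_identity("ATGC", "ATGA"))
```

For scale, 32 mismatches across a 384-residue alignment would give roughly 91.7% identity; that figure is plain arithmetic offered for orientation, not a count reported by the authors.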
Complementary DNA and derived amino acid sequence of the α subunit of human complement protein C8: evidence for the existence of a separate α subunit messenger RNA

International Nuclear Information System (INIS)

Rao, A.G.; Howard, O.M.Z.; Ng, S.C.; Whitehead, A.S.; Colten, H.R.; Sodetz, J.M.

1987-01-01

The entire amino acid sequence of the α subunit (Mr 64,000) of the eighth component of complement (C8) was determined by characterizing cDNA clones isolated from a human liver cDNA library. Two clones with overlapping inserts of net length 2.44 kilobases (kb) were isolated and found to contain the entire α coding region [1659 base pairs (bp)]. The 5' end consists of an untranslated region and a leader sequence of 30 amino acids. This sequence contains an apparent initiation Met, signal peptide, and propeptide which ends with an arginine-rich sequence that is characteristic of proteolytic processing sites found in the pro form of protein precursors. The 3' untranslated region contains two polyadenylation signals and a poly(A) sequence. RNA blot analysis of total cellular RNA from the human hepatoma cell line HepG2 revealed a message size of ∼2.5 kb. Features of the 5' and 3' sequences and the message size suggest that a separate mRNA codes for α and argue against the occurrence of a single-chain precursor form of the disulfide-linked α-γ subunit found in mature C8. Analysis of the derived amino acid sequence revealed several membrane surface seeking domains and a possible transmembrane domain. Analysis of the carbohydrate composition indicates 1 or 2 asparagine-linked but no O-linked oligosaccharide chains, a result consistent with predictions from the amino acid sequence. Most significantly, it exhibits a striking overall homology to human C9, with values of 24% on the basis of identity and 46% when conserved substitutions are allowed. As described in an accompanying report, this homology also extends to the β subunit of C8.
Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

Science.gov (United States)

Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

2003-01-01

A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171
Complete genome sequence of the actinobacterium Amycolatopsis japonica MG417-CF17T (=DSM 44213T) producing (S,S)-N,N′-ethylenediaminedisuccinic acid

DEFF Research Database (Denmark)

Stegmann, Evi; Albersmeier, Andreas; Spohn, Marius

2014-01-01

We report the complete genome sequence of Amycolatopsis japonica MG417-CF17T (=DSM 44213T) which was identified as the producer of (S,S)-N,N′-ethylenediaminedisuccinic acid during a screening for phospholipase C inhibitors. The genome of A. japonica MG417-CF17T consists of two replicons: the chro...
Complete amino acid sequences of the ribosomal proteins L25, L29 and L31 from the archaebacterium Halobacterium marismortui.

Science.gov (United States)

Hatakeyama, T; Kimura, M

1988-03-15

Ribosomal proteins were extracted from 50S ribosomal subunits of the archaebacterium Halobacterium marismortui by decreasing the concentration of Mg2+ and K+, and the proteins were separated and purified by ion-exchange column chromatography on DEAE-cellulose. Ten proteins were purified to homogeneity and three of these proteins were subjected to sequence analysis. The complete amino acid sequences of the ribosomal proteins L25, L29 and L31 were established by analyses of the peptides obtained by enzymatic digestion with trypsin, Staphylococcus aureus protease, chymotrypsin and lysylendopeptidase. Proteins L25, L29 and L31 consist of 84, 115 and 95 amino acid residues with the molecular masses of 9472 Da, 12293 Da and 10418 Da respectively. A comparison of their sequences with those of other large-ribosomal-subunit proteins from other organisms revealed that protein L25 from H. marismortui is homologous to protein L23 from Escherichia coli (34.6%), Bacillus stearothermophilus (41.8%), and tobacco chloroplasts (16.3%) as well as to protein L25 from yeast (38.0%). Proteins L29 and L31 do not appear to be homologous to any other ribosomal proteins whose structures are so far known.
A protein with amino acid sequence homology to bovine insulin is present in the legume Vigna unguiculata (cowpea)

Directory of Open Access Journals (Sweden)

Venâncio T.M.

2003-01-01

Since the discovery of bovine insulin in plants, much effort has been devoted to the characterization of these proteins and elucidation of their functions. We report here the isolation of a protein with a similar molecular mass and the same amino acid sequence as bovine insulin from developing fruits of cowpea (Vigna unguiculata) genotype Epace 10. Insulin was measured by ELISA using an anti-human insulin antibody and was detected both in empty pods and seed coats but not in the embryo. The highest concentrations of the protein (about 0.5 ng/µg of protein) were detected in seed coats at 16 and 18 days after pollination, and the values were 1.6 to 4.0 times higher than those found for isolated pods tested on any day. N-terminal amino acid sequencing of insulin was performed on the protein purified by C4-HPLC. The significance of the presence of insulin in these plant tissues is not fully understood but we speculate that it may be involved in the transport of carbohydrate to the fruit.
Filovirus Glycoprotein Sequence, Structure and Virulence

OpenAIRE

Phillips, J. C.

2014-01-01

Leading Ebola subtypes exhibit a wide mortality range, here explained at the molecular level by using fractal hydropathic scaling of amino acid sequences based on protein self-organized criticality. Specific hydrophobic features in the hydrophilic mucin-like domain suffice to account for the wide mortality range. Significance statement: Ebola virus is spreading rapidly in Africa. The connection between protein amino acid sequence and mortality is identified here.
Analysis and comparison of fragrant gene sequence in some rice cultivars

Directory of Open Access Journals (Sweden)

Karami Noushafarin

2016-01-01

The fragrance trait in rice (Oryza sativa L.) is known to be largely controlled by the fgr gene on chromosome 8, and an 8 bp deletion together with three single nucleotide polymorphisms (SNPs) in exon 7 has been shown to affect this trait. In this study, sequence alignment of fgr exon 7 for 11 fragrant and non-fragrant cultivars revealed that 5 aromatic cultivars carried the 3 SNPs and the 8 bp deletion in exon 7, which causes premature termination at a TAA stop codon. In contrast, 5 of the non-aromatic cultivars showed a sequence identical to the published sequence of Nipponbare, a non-fragrant Japonica variety. An exception was Bejar, which carried the 8 bp deletion and the 3 SNPs but was non-aromatic. Sequencing can determine the nucleotide sequence of a gene and give useful information about gene function. In silico prediction showed that the protein sequences encoded by the fgr gene differed between the Khazar and Domsiah genotypes. The non-fragrant Khazar genotype encodes a complete betaine aldehyde dehydrogenase (BADH2) enzyme of 503 amino acids, whereas the fragrant Domsiah genotype encodes a non-functional BADH2 enzyme of 251 amino acids, which results in accumulation of 2-acetyl-1-pyrroline (2AP) and produces aroma in fragrant genotypes.

Sequence analysis and overexpression of a pectin lyase gene (pel1) from Aspergillus oryzae KBN616.

Science.gov (United States)

Kitamoto, N; Yoshino-Yasuda, S; Ohmiya, K; Tsukagoshi, N

2001-01-01

A gene (pel1) encoding pectin lyase (Pel1) was isolated from a shoyu koji mold, Aspergillus oryzae KBN616, and characterized. The structural gene comprised 1,196 bp with a single intron. The ORF encoded 381 amino acids with a signal peptide of 20 amino acids. The deduced amino acid sequence showed high similarity to those of Aspergillus niger pectin lyases and Glomerella cingulata PnlA. The pel1 gene was successfully overexpressed under the promoter of the A. oryzae TEF1 gene. The molecular mass of the recombinant pectin lyase substantially coincided with that calculated based on nucleotide sequence.
GuiTope: an application for mapping random-sequence peptides to protein sequences.

Science.gov (United States)

Halperin, Rebecca F; Stafford, Phillip; Emery, Jack S; Navalkar, Krupa Arun; Johnston, Stephen Albert

2012-01-03

Random-sequence peptide libraries are a commonly used tool to identify novel ligands for binding antibodies, other proteins, and small molecules. It is often of interest to compare the selected peptide sequences to the natural protein binding partners to infer the exact binding site or the importance of particular residues. The ability to search a set of sequences for similarity to a set of peptides may sometimes enable the prediction of an antibody epitope or a novel binding partner. We have developed a software application designed specifically for this task. GuiTope provides a graphical user interface for aligning peptide sequences to protein sequences. All alignment parameters are accessible to the user including the ability to specify the amino acid frequency in the peptide library; these frequencies often differ significantly from those assumed by popular alignment programs. It also includes a novel feature to align di-peptide inversions, which we have found improves the accuracy of antibody epitope prediction from peptide microarray data and shows utility in analyzing phage display datasets. Finally, GuiTope can randomly select peptides from a given library to estimate a null distribution of scores and calculate statistical significance. GuiTope provides a convenient method for comparing selected peptide sequences to protein sequences, including flexible alignment parameters, novel alignment features, ability to search a database, and statistical significance of results. The software is available as an executable (for PC) at http://www.immunosignature.com/software and ongoing updates and source code will be available at sourceforge.net.
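The null-distribution step described in this abstract can be sketched in Python. This is only an illustration under stated assumptions, not GuiTope's implementation: the score here is a plain count of identical residues in the best ungapped placement (GuiTope itself uses configurable alignment parameters and di-peptide inversions), and the function names `best_ungapped_score` and `empirical_p_value` are hypothetical.

```python
import random


def best_ungapped_score(peptide: str, protein: str) -> int:
    """Highest number of identical residues over all ungapped placements
    of the peptide along the protein (0 if the protein is shorter)."""
    n = len(peptide)
    best = 0
    for start in range(len(protein) - n + 1):
        window = protein[start:start + n]
        score = sum(1 for p, q in zip(peptide, window) if p == q)
        best = max(best, score)
    return best


def empirical_p_value(observed: int, library: list, protein: str,
                      n_samples: int = 1000, seed: int = 0) -> float:
    """Estimate how often a randomly drawn library peptide scores at least
    as high as the observed score (add-one correction avoids p = 0)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    null_scores = [
        best_ungapped_score(rng.choice(library), protein)
        for _ in range(n_samples)
    ]
    exceed = sum(1 for s in null_scores if s >= observed)
    return (exceed + 1) / (n_samples + 1)
```

Drawing the null peptides from the same library as the selected peptides matters because, as the abstract notes, library amino acid frequencies often differ from those assumed by general-purpose alignment programs.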
GuiTope: an application for mapping random-sequence peptides to protein sequences

Directory of Open Access Journals (Sweden)

Halperin Rebecca F

2012-01-01

Background: Random-sequence peptide libraries are a commonly used tool to identify novel ligands for binding antibodies, other proteins, and small molecules. It is often of interest to compare the selected peptide sequences to the natural protein binding partners to infer the exact binding site or the importance of particular residues. The ability to search a set of sequences for similarity to a set of peptides may sometimes enable the prediction of an antibody epitope or a novel binding partner. We have developed a software application designed specifically for this task. Results: GuiTope provides a graphical user interface for aligning peptide sequences to protein sequences. All alignment parameters are accessible to the user including the ability to specify the amino acid frequency in the peptide library; these frequencies often differ significantly from those assumed by popular alignment programs. It also includes a novel feature to align di-peptide inversions, which we have found improves the accuracy of antibody epitope prediction from peptide microarray data and shows utility in analyzing phage display datasets. Finally, GuiTope can randomly select peptides from a given library to estimate a null distribution of scores and calculate statistical significance. Conclusions: GuiTope provides a convenient method for comparing selected peptide sequences to protein sequences, including flexible alignment parameters, novel alignment features, ability to search a database, and statistical significance of results. The software is available as an executable (for PC) at http://www.immunosignature.com/software and ongoing updates and source code will be available at sourceforge.net.
CodonTest: modeling amino acid substitution preferences in coding sequences.

Directory of Open Access Journals (Sweden)

Wayne Delport

2010-08-01

Codon models of evolution have facilitated the interpretation of selective forces operating on genomes. These models, however, assume a single rate of non-synonymous substitution irrespective of the nature of amino acids being exchanged. Recent developments have shown that models which allow for amino acid pairs to have independent rates of substitution offer improved fit over single rate models. However, these approaches have been limited by the necessity for large alignments in their estimation. An alternative approach is to assume that substitution rates between amino acid pairs can be subdivided into rate classes, dependent on the information content of the alignment. However, given the combinatorially large number of such models, an efficient model search strategy is needed. Here we develop a Genetic Algorithm (GA) method for the estimation of such models. A GA is used to assign amino acid substitution pairs to a series of rate classes, where the number of classes is estimated from the alignment. Other parameters of the phylogenetic Markov model, including substitution rates, character frequencies and branch lengths, are estimated using standard maximum likelihood optimization procedures. We apply the GA to empirical alignments and show improved model fit over existing models of codon evolution. Our results suggest that current models are poor approximations of protein evolution and thus gene and organism specific multi-rate models that incorporate amino acid substitution biases are preferred. We further anticipate that the clustering of amino acid substitution rates into classes will be biologically informative, such that genes with similar functions exhibit similar clustering, and hence this clustering will be useful for the evolutionary fingerprinting of genes.
Sequence determination and analysis of the NSs genes of two tospoviruses.

Science.gov (United States)

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5' UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequences of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of tomato spotted wilt virus (TSWV), while the NSs sequences of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
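Pairwise identity figures of the kind quoted in this abstract can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned to equal length with '-' marking gaps, and it counts identity only over columns where neither sequence has a gap; the abstract does not say which alignment software or identity definition the authors used, and the function name `percent_identity` is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Both inputs must come from the same pairwise alignment, so they have
    equal length and use '-' for gap positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

Other conventions divide by the full alignment length or by the shorter sequence length, which gives different percentages for the same alignment, so the denominator should always be reported alongside the value.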
Ruthenium Hydride/Brønsted Acid-Catalyzed Tandem Isomerization/N-Acyliminium Cyclization Sequence for the Synthesis of Tetrahydro-β-carbolines

DEFF Research Database (Denmark)

Hansen, Casper Lykke; Clausen, Janie Regitse Waël; Ohm, Ragnhild Gaard

2013-01-01

This paper describes an efficient tandem sequence for the synthesis of 1,2,3,4-tetrahydro-β-carbolines (THBCs) relying on a ruthenium hydride/Brønsted acid-catalyzed isomerization of allylic amides to N-acyliminium ion intermediates which are trapped by a tethered indole nucleophile. The methodol...... the Suzuki cross-coupling reaction to the isomerization/N-acyliminium cyclization sequence. Finally, diastereo- and enantioselective versions of the title reaction have been examined using substrate control (with dr >15:1) and asymmetric catalysis (ee up to 57%), respectively...
Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE-binding linear epitopes of allergens

Directory of Open Access Journals (Sweden)

Peijnenburg Ad ACM

2002-12-01

Background: Transgenic proteins expressed by genetically modified food crops are evaluated for their potential allergenic properties prior to marketing, among other means by identification of short identical amino acid sequences that occur both in the transgenic protein and in allergenic proteins. A strategy is proposed in which the positive outcomes of the sequence comparison with a minimal length of six amino acids are further screened for the presence of potential linear IgE-epitopes. This double track approach involves the use of literature data on IgE-epitopes and an antigenicity prediction algorithm. Results: Thirty-three transgenic proteins have been screened for identities of at least six contiguous amino acids shared with allergenic proteins. Twenty-two transgenic proteins showed positive results of six or seven contiguous amino acids in length. Only a limited number of identical stretches shared by transgenic proteins (papaya ringspot virus coat protein, acetolactate synthase GH50, and glyphosate oxidoreductase) and allergenic proteins could be identified as (part of) potential linear epitopes. Conclusion: Many transgenic proteins have identical stretches of six or seven amino acids in common with allergenic proteins. Most identical stretches are likely to be false positives. As shown in this study, identical stretches can be further screened for relevance by comparison with linear IgE-binding epitopes described in the literature. In the absence of literature data on epitopes, computer-based antigenicity prediction helps to select potential antibody binding sites that will need verification of IgE binding by sera binding tests. Finally, the positive outcomes of this approach warrant further clinical testing for potential allergenicity.
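The first screening step described in this abstract, finding stretches of at least six contiguous identical amino acids shared by two proteins, can be sketched in Python as follows. This is a minimal illustration of exact k-mer matching only; the function name `shared_kmers` and the toy sequences in the usage note are hypothetical, and the epitope and antigenicity filtering that the study applies afterwards is not shown.

```python
def shared_kmers(query: str, allergen: str, k: int = 6) -> dict:
    """Map every stretch of k contiguous residues found in both proteins
    to the list of its 0-based start positions in the query protein."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Index all k-residue windows of the allergen for constant-time lookup.
    allergen_kmers = {allergen[i:i + k] for i in range(len(allergen) - k + 1)}
    hits = {}
    for i in range(len(query) - k + 1):
        kmer = query[i:i + k]
        if kmer in allergen_kmers:
            hits.setdefault(kmer, []).append(i)
    return hits
```

For example, `shared_kmers("MKTAYIAKQRQ", "GGTAYIAKLL")` reports the single shared six-residue stretch `TAYIAK` at query position 2, while the same call with `k=7` returns an empty dictionary. As the abstract stresses, most such hits are expected to be false positives and need further screening.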
Draft Genome Sequence of a Clostridium botulinum Isolate from Water Used for Cooling at a Plant Producing Low-Acid Canned Foods.

Science.gov (United States)

Basavanna, Uma; Gonzalez-Escalona, Narjol; Timme, Ruth; Datta, Shomik; Schoen, Brianna; Brown, Eric W; Zink, Donald; Sharma, Shashi K

2013-01-01

Clostridium botulinum is a pathogen of concern for low-acid canned foods. Here we report draft genomes of a neurotoxin-producing C. botulinum strain isolated from water samples used for cooling low-acid canned foods at a canning facility. The genome sequence confirmed that this strain belonged to C. botulinum serotype B1, albeit with major differences, including thousands of unique single nucleotide polymorphisms (SNPs) compared to other genomes of the same serotype.
Draft Genome Sequence of a Clostridium botulinum Isolate from Water Used for Cooling at a Plant Producing Low-Acid Canned Foods

OpenAIRE

Basavanna, Uma; Gonzalez-Escalona, Narjol; Timme, Ruth; Datta, Shomik; Schoen, Brianna; Brown, Eric W.; Zink, Donald; Sharma, Shashi K.

2013-01-01

Clostridium botulinum is a pathogen of concern for low-acid canned foods. Here we report draft genomes of a neurotoxin-producing C. botulinum strain isolated from water samples used for cooling low-acid canned foods at a canning facility. The genome sequence confirmed that this strain belonged to C. botulinum serotype B1, albeit with major differences, including thousands of unique single nucleotide polymorphisms (SNPs) compared to other genomes of the same serotype.
A Single Electrochemical Probe Used for Analysis of Multiple Nucleic Acid Sequences

Science.gov (United States)

Mills, Dawn M.; Calvo-Marzal, Percy; Pinzon, Jeffer M.; Armas, Stephanie; Kolpashchikov, Dmitry M.; Chumbimuni-Torres, Karin Y.

2017-01-01

Electrochemical hybridization sensors have been explored extensively for analysis of specific nucleic acids. However, commercialization of the platform is hindered by the need for attachment of separate oligonucleotide probes complementary to a RNA or DNA target to an electrode’s surface. Here we demonstrate that a single probe can be used to analyze several nucleic acid targets with high selectivity and low cost. The universal electrochemical four-way junction (4J)-forming (UE4J) sensor consists of a universal DNA stem-loop (USL) probe attached to the electrode’s surface and two adaptor strands (m and f) which hybridize to the USL probe and the analyte to form a 4J associate. The m adaptor strand was conjugated with a methylene blue redox marker for signal ON sensing and monitored using square wave voltammetry. We demonstrated that a single sensor can be used for detection of several different DNA/RNA sequences and can be regenerated in 30 seconds by a simple water rinse. The UE4J sensor enables a high selectivity by recognition of a single base substitution, even at room temperature. The UE4J sensor opens a venue for a re-useable universal platform that can be adopted at low cost for the analysis of DNA or RNA targets. PMID:29371782
Amino Acids Sequence Based in Silico Analysis of RuBisCO (Ribulose-1,5 Bisphosphate Carboxylase Oxygenase) Proteins in Some Carthamus L. ssp.

Directory of Open Access Journals (Sweden)

Emre SEVİNDİK

2017-06-01

RuBisCO is an important enzyme for plants to photosynthesize and balance carbon dioxide in the atmosphere. This study aimed to perform sequence, physicochemical, phylogenetic and 3D (three-dimensional) comparative analyses of RuBisCO proteins in the Carthamus ssp. using various bioinformatics tools. The sequence lengths of the RuBisCO proteins were between 166 and 477 amino acids, with an average length of 411.8 amino acids. Their molecular weights (Mw) ranged from 18711.47 to 52843.09 Da; the most acidic and basic protein sequences were detected in C. tinctorius (pI = 5.99) and in C. tenuis (pI = 6.92), respectively. The extinction coefficients of RuBisCO proteins at 280 nm ranged from 17,670 to 69,830 M⁻¹ cm⁻¹, the instability index (II) values ranged from 33.31 to 39.39, and the GRAVY values ranged from -0.313 to -0.250. The most abundant amino acid in the RuBisCO proteins was Gly (9.7%), while the least abundant was Trp (1.6%). The putative phosphorylation sites of RuBisCO proteins were determined by NetPhos 2.0. Phylogenetic analysis revealed that RuBisCO proteins formed two main clades. A RAMPAGE analysis revealed that 96.3%-97.6% of residues were located in the favoured region of RuBisCO proteins. PyMOL was used to predict the three-dimensional (3D) structure of the RuBisCO proteins. The results of the current study provide insights into fundamental characteristics of RuBisCO proteins in Carthamus ssp.
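One of the physicochemical values reported in this abstract, GRAVY (grand average of hydropathy), is simple enough to sketch in Python: it is the mean Kyte-Doolittle hydropathy value over all residues of a sequence. The abstract does not name the tool used to compute it, so this is an illustration of the standard definition rather than the authors' pipeline, and the function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Negative values, such as the -0.313 to -0.250 range reported here, indicate that a protein is on average hydrophilic.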
Exome sequencing and SNP analysis detect novel compound heterozygosity in fatty acid hydroxylase-associated neurodegeneration

Science.gov (United States)

Pierson, Tyler Mark; Simeonov, Dimitre R; Sincan, Murat; Adams, David A; Markello, Thomas; Golas, Gretchen; Fuentes-Fajardo, Karin; Hansen, Nancy F; Cherukuri, Praveen F; Cruz, Pedro; Blackstone, Craig; Tifft, Cynthia; Boerkoel, Cornelius F; Gahl, William A

2012-01-01

Fatty acid hydroxylase-associated neurodegeneration due to fatty acid 2-hydroxylase deficiency presents with a wide range of phenotypes including spastic paraplegia, leukodystrophy, and/or brain iron deposition. All previously described families with this disorder were consanguineous, with homozygous mutations in the probands. We describe a 10-year-old male, from a non-consanguineous family, with progressive spastic paraplegia, dystonia, ataxia, and cognitive decline associated with a sural axonal neuropathy. The use of high-throughput sequencing techniques combined with SNP array analyses revealed a novel paternally derived missense mutation and an overlapping novel maternally derived ∼28-kb genomic deletion in FA2H. This patient provides further insight into the consistent features of this disorder and expands our understanding of its phenotypic presentation. The presence of a sural nerve axonal neuropathy had not been previously associated with this disorder and so may extend the phenotype. PMID:22146942
Design of Tail-Clamp Peptide Nucleic Acid Tethered with Azobenzene Linker for Sequence-Specific Detection of Homopurine DNA

Directory of Open Access Journals (Sweden)

Shinjiro Sawada

2017-10-01

DNA carries genetic information in its sequence of bases. Synthetic oligonucleotides that can sequence-specifically recognize a target gene sequence are a useful tool for regulating gene expression or detecting target genes. Among the many synthetic oligonucleotides, tail-clamp peptide nucleic acid (TC-PNA) offers advantages since it has two homopyrimidine PNA strands connected via a flexible ethylene glycol-type linker that can recognize complementary homopurine sequences via Watson-Crick and Hoogsteen base pairings and form thermally stable PNA/PNA/DNA triplex structures. Here, we synthesized a series of TC-PNAs with azobenzene-containing linkers of different lengths and studied their binding behaviours to homopurine single-stranded DNA. Introduction of azobenzene at the N-terminus amine of PNA increased the thermal stability of PNA-DNA duplexes. Further extension of the homopyrimidine PNA strand at the N-terminus of PNA-AZO further increased the binding stability of the PNA/DNA/PNA triplex to the target homopurine sequence; however, it induced TC-PNA/DNA/TC-PNA complex formation. Among these TC-PNAs, 9W5H-C4-AZO, consisting of nine Watson-Crick bases and five Hoogsteen bases tethered with a beta-alanine conjugated azobenzene linker, gave a stable 1:1 TC-PNA/ssDNA complex and exhibited good mismatch recognition. Our design for TC-PNA-AZO can be utilized for detecting homopurine sequences in various genes.
Genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia

Directory of Open Access Journals (Sweden)

Anastasiia Kovaliova

2017-03-01

Here we report the draft genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia. The draft genome has a size of 4.9 Mb and encodes multiple K+-transporters and proton-consuming decarboxylases. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain DV clusters together with the acid-tolerant Desulfovibrio sp. TomC and Desulfovibrio magneticus. The draft genome sequence and annotation have been deposited at GenBank under the accession number MLBG00000000.
EGNAS: an exhaustive DNA sequence design algorithm

Directory of Open Access Journals (Sweden)

Kick Alfred

2012-06-01

Background: The molecular recognition based on the complementary base pairing of deoxyribonucleic acid (DNA) is the fundamental principle in the fields of genetics, DNA nanotechnology and DNA computing. We present an exhaustive DNA sequence design algorithm that allows the generation of sets containing a maximum number of sequences with defined properties. EGNAS (Exhaustive Generation of Nucleic Acid Sequences) offers the possibility of controlling both interstrand and intrastrand properties. The guanine-cytosine content can be adjusted. Sequences can be forced to start and end with guanine or cytosine. This option reduces the risk of “fraying” of DNA strands. It is possible to limit cross hybridizations of a defined length, and to adjust the uniqueness of sequences. Self-complementarity and hairpin structures of certain length can be avoided. Sequences and subsequences can optionally be forbidden. Furthermore, sequences can be designed to have minimum interactions with predefined strands and neighboring sequences. Results: The algorithm is realized in a C++ program. TAG sequences can be generated and combined with primers for single-base extension reactions, which were described for multiplexed genotyping of single nucleotide polymorphisms. Thereby, possible foldback through intrastrand interaction of TAG-primer pairs can be limited. The design of sequences for specific attachment of molecular constructs to DNA origami is presented. Conclusions: We developed a new software tool called EGNAS for the design of unique nucleic acid sequences. The presented exhaustive algorithm allows the generation of larger sets of sequences than previous software under equal constraints. EGNAS is freely available for noncommercial use at http://www.chm.tu-dresden.de/pc6/EGNAS.
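Three of the per-sequence constraints listed in this abstract (guanine-cytosine content, G/C at both ends, and avoidance of self-complementary stretches) can be illustrated with a brute-force Python sketch. This is not the EGNAS algorithm, which is a C++ program whose internals the abstract does not describe; it simply enumerates every sequence of a given length and keeps those passing the filters, and all function names and parameters are hypothetical.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_self_complement(seq: str, k: int) -> bool:
    """True if some k-base stretch of seq also occurs in its reverse
    complement, meaning the strand could pair with itself or a copy."""
    rc = reverse_complement(seq)
    return any(seq[i:i + k] in rc for i in range(len(seq) - k + 1))


def candidates(length: int, gc_min: float, gc_max: float, k: int,
               gc_ends: bool = True):
    """Yield every sequence of the given length that meets the constraints.

    Enumeration is exhaustive (4**length sequences), so this is only
    practical for short lengths.
    """
    for bases in product("ACGT", repeat=length):
        seq = "".join(bases)
        if gc_ends and (seq[0] not in "GC" or seq[-1] not in "GC"):
            continue
        if not gc_min <= gc_fraction(seq) <= gc_max:
            continue
        if has_self_complement(seq, k):
            continue
        yield seq
```

For instance, `list(candidates(4, 0.5, 0.5, 2))` keeps `GAAG` but rejects `GATC`, which is its own reverse complement. Set-level properties such as limiting cross hybridization between different sequences would need an additional pairwise check that is not shown here.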
Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

OpenAIRE

Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

2016-01-01

Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.
Sequence and transcription analysis of the human cytomegalovirus DNA polymerase gene

International Nuclear Information System (INIS)

Kouzarides, T.; Bankier, A.T.; Satchwell, S.C.; Weston, K.; Tomlinson, P.; Barrell, B.G.

1987-01-01

DNA sequence analysis has revealed that the gene coding for the human cytomegalovirus (HCMV) DNA polymerase is present within the long unique region of the virus genome. Identification is based on extensive amino acid homology between the predicted HCMV open reading frame HFLF2 and the DNA polymerase of herpes simplex virus type 1. The authors present here a 5280 base-pair DNA sequence containing the HCMV pol gene, along with the analysis of transcripts encoded within this region. Since HCMV pol also shows homology to the predicted Epstein-Barr virus pol, they were able to analyze the extent of homology between the DNA polymerases of three distantly related herpes viruses, HCMV, Epstein-Barr virus, and herpes simplex virus. The comparison shows that these DNA polymerases exhibit considerable amino acid homology and highlights a number of highly conserved regions; two such regions show homology to sequences within the adenovirus type 2 DNA polymerase. The HCMV pol gene is flanked by open reading frames with homology to those of other herpes viruses; upstream, there is a reading frame homologous to the glycoprotein B gene of herpes simplex virus type I and Epstein-Barr virus, and downstream there is a reading frame homologous to BFLF2 of Epstein-Barr virus
The structural analysis of protein sequences based on the quasi-amino acids code

International Nuclear Information System (INIS)

Ping, Zhu; Xu-Qing, Tang; Zhen-Yuan, Xu

2009-01-01

Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project came the postgenome era, in which proteomics technology is emerging. This paper studies protein molecules from an algebraic point of view. The algebraic system (Σ, +, *) is introduced, where Σ is the set of 64 codons. According to the characteristics of (Σ, +, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relations among the quasi-amino acids are revealed. The results show that there are very close correlations between the properties of the quasi-amino acids and the codons. These correlations may play an important part in establishing the logical relationship between codons and the quasi-amino acids during the origin of life. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties of the amino acids code are very difficult to observe. The present paper shows that (ZU, ⊕, ) is a field. Furthermore, the operational results show that the codon tga has different properties from the other stop codons. In fact, in the human and ox mitochondrial genetic codes, tga encodes tryptophan rather than acting as a stop codon as in other genetic codes, as reported by Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code; in other words, it solves the problem that 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3). (cross-disciplinary physics and related areas of science and technology)
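As a small illustration of the codon set mentioned in this abstract, the following Python sketch enumerates the 64 codons and contrasts how tga is read in the standard genetic code and in the vertebrate mitochondrial code. The names CODONS, STANDARD_STOPS, VERT_MITO_STOPS and is_stop are illustrative assumptions and not the paper's notation; the algebraic system (Σ, +, *) and the quasi-amino acid table are not modelled here.

```python
from itertools import product

BASES = "tcag"

# All 64 codons, i.e. the set the abstract calls Σ.
CODONS = ["".join(triplet) for triplet in product(BASES, repeat=3)]

# Stop codons of the standard genetic code.
STANDARD_STOPS = {"taa", "tag", "tga"}

# Vertebrate mitochondrial code: tga encodes tryptophan,
# while aga and agg are read as stops.
VERT_MITO_STOPS = {"taa", "tag", "aga", "agg"}


def is_stop(codon, mitochondrial=False):
    """Return True if the codon is a stop codon in the chosen code."""
    stops = VERT_MITO_STOPS if mitochondrial else STANDARD_STOPS
    return codon.lower() in stops


if __name__ == "__main__":
    print(len(CODONS))                         # number of codons
    print(is_stop("tga"))                      # standard code
    print(is_stop("tga", mitochondrial=True))  # vertebrate mitochondria
```

The only difference the sketch encodes is the one the abstract points to: tga terminates translation in the standard code but not in the mitochondrial code of vertebrates such as human and ox.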
A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites.

Directory of Open Access Journals (Sweden)

Tony Håndstad

BACKGROUND: Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS) is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial. RESULTS: Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods. CONCLUSIONS: Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites.
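To make the notion of a "simple motif-scanning method" concrete, here is a minimal Python sketch of scanning a sequence with a log-odds position weight matrix. It is not one of the five methods benchmarked in the study; the function names, the pseudocount, the uniform background and the toy TATA counts are all assumptions chosen for illustration.

```python
import math


def log_odds_pwm(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into log2-odds scores."""
    pwm = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        pwm.append({
            base: math.log2(((column.get(base, 0) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def scan(sequence, pwm, threshold):
    """Return (position, score) for each window scoring >= threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        if any(base not in "ACGT" for base in window):
            continue  # skip windows containing ambiguous bases
        score = sum(pwm[offset][base] for offset, base in enumerate(window))
        if score >= threshold:
            hits.append((start, score))
    return hits


# Toy counts for a hypothetical 4-bp motif (not taken from the study).
toy_counts = [{"T": 8}, {"A": 8}, {"T": 8}, {"A": 8}]
toy_pwm = log_odds_pwm(toy_counts)
toy_hits = scan("GGTATACC", toy_pwm, threshold=4.0)

if __name__ == "__main__":
    print(toy_hits)
```

A conservation-aware predictor would combine a score of this kind with evidence from aligned genomes; that step is deliberately left out of the sketch.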
The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.

Science.gov (United States)

Hammond, R W; Crosslin, J M

1995-04-01

The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.

The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

Directory of Open Access Journals (Sweden)

Roberts Richard J

2008-05-01

Background: Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results: The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360), cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion: We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R.BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.
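The recognition sequence GRCGYC is written with IUPAC ambiguity codes, where R stands for a purine (A or G) and Y for a pyrimidine (C or T). The Python sketch below expands such a site into a regular expression and lists the matches in a DNA string. The helper names and the test sequence are assumptions for illustration; the code says nothing about where the enzyme cuts or how methylation affects recognition.

```python
import re

# Subset of the IUPAC nucleotide codes needed for sites such as GRCGYC.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any base
}


def site_pattern(site):
    """Translate an IUPAC recognition site into a regex string."""
    return "".join(IUPAC[code] for code in site.upper())


def find_sites(dna, site="GRCGYC"):
    """Return (start, matched_sequence) for every occurrence of the site.

    A lookahead is used so that overlapping occurrences are reported too.
    """
    lookahead = re.compile("(?=(" + site_pattern(site) + "))")
    return [(m.start(), m.group(1)) for m in lookahead.finditer(dna.upper())]


if __name__ == "__main__":
    # Hypothetical test sequence containing two GRCGYC sites.
    print(find_sites("TTGACGTCAAGGCGCCTT"))
```

Because R and Y each allow two bases, GRCGYC stands for four concrete hexamers: GACGTC, GACGCC, GGCGTC and GGCGCC.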
Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

Directory of Open Access Journals (Sweden)

Xiaoyu Wang

Cupriavidus spp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.
Characterization of the haloacid dehalogenase from Xanthobacter autotrophicus GJ10 and sequencing of the dhlB gene

DEFF Research Database (Denmark)

van der Ploeg, J; Van Hall, Gerrit; Janssen, D B

1991-01-01

... chromatography. The enzyme was active with 2-halogenated carboxylic acids and converted only the L-isomer of 2-chloropropionic acid with inversion of configuration to produce D-lactate. The activity of the enzyme was not readily influenced by thiol reagents. The gene encoding the haloacid dehalogenase (dhlB) was cloned and could be allocated to a 6.5-kb EcoRI-BglII fragment. Part of this fragment was sequenced, and the dhlB open reading frame was identified by comparison with the N-terminal amino acid sequence of the protein. The gene was found to encode a protein of 27,433 Da that showed considerable homology (60.5 and 61.0% similarity) with the two other haloacid dehalogenases sequenced to date but not with the haloalkane dehalogenase from X. autotrophicus GJ10.
Amino acid sequence surrounding the chondroitin sulfate attachment site of thrombomodulin regulates chondroitin polymerization.

Science.gov (United States)

Izumikawa, Tomomi; Kitagawa, Hiroshi

2015-05-01

Thrombomodulin (TM) is a cell-surface glycoprotein and a critical mediator of endothelial anticoagulant function. TM exists as both a chondroitin sulfate (CS) proteoglycan (PG) form and a non-PG form lacking a CS chain (α-TM); therefore, TM can be described as a part-time PG. Previously, we reported that α-TM bears an immature, truncated linkage tetrasaccharide structure (GlcAβ1-3Galβ1-3Galβ1-4Xyl). However, the biosynthetic mechanism to generate part-time PGs remains unclear. In this study, we used several mutants to demonstrate that the amino acid sequence surrounding the CS attachment site influences the efficiency of chondroitin polymerization. In particular, the presence of acidic residues surrounding the CS attachment site was indispensable for the elongation of CS. In addition, mutants defective in CS elongation did not exhibit anti-coagulant activity, as in the case with α-TM. Together, these data support a model for CS chain assembly in which specific core protein determinants are recognized by a key biosynthetic enzyme involved in chondroitin polymerization. Copyright © 2015 Elsevier Inc. All rights reserved.
Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

Science.gov (United States)

Schirmer, Melanie; Ijaz, Umer Z; D'Amore, Rosalinda; Hall, Neil; Sloan, William T; Quince, Christopher

2015-03-31

With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs, Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore, we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
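The quality-trimming step named in this abstract can be pictured with a short Python sketch of sliding-window trimming at the 3' end of a read. This is a generic illustration under stated assumptions (Phred+33 encoding, a window of 4 bases, a mean-quality cutoff of 20); it does not reproduce Sickle's actual algorithm or defaults, and the function names are hypothetical.

```python
def phred33(quality_string):
    """Decode a Phred+33 quality string into integer scores."""
    return [ord(char) - 33 for char in quality_string]


def trim_3prime(seq, qual, window=4, min_mean_q=20):
    """Cut the read where the first low-quality window starts.

    The read is scanned from the 5' end; at the first window whose mean
    quality falls below min_mean_q, everything from the start of that
    window onward is discarded. Reads shorter than the window are
    returned unchanged.
    """
    scores = phred33(qual)
    if len(seq) != len(scores):
        raise ValueError("sequence and quality string differ in length")
    cut = len(seq)
    for start in range(max(len(scores) - window + 1, 0)):
        mean_q = sum(scores[start:start + window]) / window
        if mean_q < min_mean_q:
            cut = start
            break
    return seq[:cut], qual[:cut]


if __name__ == "__main__":
    # 'I' encodes Q40 and '#' encodes Q2 in Phred+33.
    print(trim_3prime("ACGTACGTAC", "IIIIII####"))
```

In the pipeline described above, trimming of this kind would precede error correction and read overlapping; those later steps are outside the scope of the sketch.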
EGVII endoglucanase and nucleic acids encoding the same

Science.gov (United States)

Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]

2009-05-05

The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
Recovery of phosphorus and volatile fatty acids from wastewater and food waste with an iron-flocculation sequencing batch reactor and acidogenic co-fermentation.

Science.gov (United States)

Li, Ruo-Hong; Li, Xiao-Yan

2017-12-01

A sequencing batch reactor-based system was developed for enhanced phosphorus (P) removal and recovery from municipal wastewater. The system consists of an iron-dosing SBR for P precipitation and a side-stream anaerobic reactor for sludge co-fermentation with food waste. During co-fermentation, sludge and food waste undergo acidogenesis, releasing phosphates under acidic conditions and producing volatile fatty acids (VFAs) into the supernatant. A few types of typical food waste were investigated for their effectiveness in acidogenesis and related enzymatic activities. The results show that approximately 96.4% of total P in wastewater was retained in activated sludge. Food waste with a high starch content favoured acidogenic fermentation. Around 55.7% of P from wastewater was recovered as vivianite, and around 66% of food waste loading was converted into VFAs. The new integration formed an effective system for wastewater treatment, food waste processing and simultaneous recovery of P and VFAs. Copyright © 2017 Elsevier Ltd. All rights reserved.
Amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by streptococcal IgA1 proteases

DEFF Research Database (Denmark)

Batten, MR; Senior, BW; Kilian, Mogens

2003-01-01

The amino acid sequence requirements in the hinge of human immunoglobulin A1 (IgA1) for cleavage by IgA1 proteases of different species of Streptococcus were investigated. Recombinant IgA1 antibodies were generated with point mutations at proline 227 and threonine 228, the residues lying on either side of the peptide bond at which all streptococcal IgA1 proteases cleave wild-type human IgA1. The amino acid substitutions produced no major effect upon the structure of the mutant IgA1 antibodies or their functional ability to bind to Fcalpha receptors. However, the substitutions had a substantial effect upon sensitivity to cleavage with some streptococcal IgA1 proteases, with, in some cases, a single point mutation rendering the antibody resistant to a particular IgA1 protease. This effect was least marked with the IgA1 protease from Streptococcus pneumoniae, which showed no absolute requirement...
Nucleotide sequence of the melA gene, coding for alpha-galactosidase in Escherichia coli K-12.

OpenAIRE

Liljeström, P L; Liljeström, P

1987-01-01

Melibiose uptake and hydrolysis in E. coli are performed by the MelB and MelA proteins, respectively. We report the cloning and sequencing of the melA gene. The nucleotide sequence data showed that melA codes for a 450 amino acid long protein with a molecular weight of 50.6 kDa. The sequence data also supported the assumption that the mel locus forms an operon with melA in proximal position. A comparison of MelA with alpha-galactosidase proteins from yeast and human origin showed that these prot...
Isolation of laccase gene-specific sequences from white rot and brown rot fungi by PCR.

Science.gov (United States)

D'Souza, T M; Boominathan, K; Reddy, C A

1996-01-01

Degenerate primers corresponding to the consensus sequences of the copper-binding regions in the N-terminal domains of known basidiomycete laccases were used to isolate laccase gene-specific sequences from strains representing nine genera of wood rot fungi. All except three gave the expected PCR product of about 200 bp. Computer searches of the databases identified the sequence of each of the PCR products analyzed as a laccase gene sequence, suggesting the specificity of the primers. PCR products of the white rot fungi Ganoderma lucidum, Phlebia brevispora, and Trametes versicolor showed 65 to 74% nucleotide sequence similarity to each other; the similarity in deduced amino acid sequences was 83 to 91%. The PCR products of Lentinula edodes and Lentinus tigrinus, on the other hand, showed relatively low nucleotide and amino acid similarities (58 to 64 and 62 to 81%, respectively); however, these similarities were still much higher than when compared with the corresponding regions in the laccases of the ascomycete fungi Aspergillus nidulans and Neurospora crassa. A few of the white rot fungi, as well as Gloeophyllum trabeum, a brown rot fungus, gave a 144-bp PCR fragment which had a nucleotide sequence similarity of 60 to 71%. Demonstration of laccase activity in G. trabeum and several other brown rot fungi was of particular interest because these organisms were not previously shown to produce laccases. PMID:8837429
Genome sequence of Lactobacillus salivarius SMXD51, a potential probiotic strain isolated from chicken cecum, showing anti-campylobacter activity.

Science.gov (United States)

Kergourlay, Gilles; Messaoudi, Soumaya; Dousset, Xavier; Prévost, Hervé

2012-06-01

We report the draft genome sequence of Lactobacillus salivarius SMXD51, isolated from the cecum of healthy chickens showing an activity against Campylobacter--the food-borne pathogen that is the most common cause of gastroenteritis in the European Union (EU)--and potentially interesting features for a probiotic strain, explaining our interest in it.
Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

Science.gov (United States)

Martin, Andrew C R

2014-01-01

The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.
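The idea of 'dotifying' can be sketched in a few lines. The Python example below assumes that dotifying means replacing every residue identical to the one in the first (reference) sequence of the alignment with a dot; that reading of the feature is an assumption, and the code is an illustration in Python, not JSAV's JavaScript implementation or API.

```python
def dotify(alignment, gap="-"):
    """Replace residues matching the first sequence with dots.

    `alignment` is a list of equal-length aligned sequences. The first
    sequence is kept unchanged as the reference; gap characters are
    never replaced.
    """
    if not alignment:
        return []
    reference = alignment[0]
    if any(len(seq) != len(reference) for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    dotified = [reference]
    for seq in alignment[1:]:
        dotified.append("".join(
            "." if res == ref and res != gap else res
            for res, ref in zip(seq, reference)
        ))
    return dotified


if __name__ == "__main__":
    # Hypothetical aligned fragments used only for demonstration.
    for row in dotify(["EVQLVESG", "EVQLLESG", "QVQLVQSG"]):
        print(row)
```

Displayed this way, only the positions that differ from the reference remain visible as letters, which makes variable columns easy to spot.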
Nucleotide sequence of a cDNA coding for the amino-terminal region of human prepro α1(III) collagen

Energy Technology Data Exchange (ETDEWEB)

Toman, P D; Ricca, G A [Rorer Biotechnology, Inc., Springfield, VA (USA); de Crombrugghe, B [National Institutes of Health, Bethesda, MD (USA)

1988-07-25

Type III collagen is synthesized in a variety of tissues as a precursor macromolecule containing a leader sequence, an N-propeptide, an N-telopeptide, the triple helical region, a C-telopeptide, and a C-propeptide. To further characterize the human type III collagen precursor, a human placental cDNA library was constructed in λgt11 using an oligonucleotide derived from a partial cDNA sequence corresponding to the carboxy-terminal part of the α1(III) collagen. A cDNA was identified which contains the leader sequence, the N-propeptide and N-telopeptide regions. The DNA sequences of these regions are presented here. The triple helical, C-telopeptide and C-propeptide amino acid sequences for human type III collagen have been determined previously. A comparison of the human amino acid sequence with mouse, chicken, and calf sequences shows 81%, 81%, and 92% similarity, respectively. At the DNA level, the sequence similarity between human and mouse or chicken type III collagen sequences in this area is 82% and 77%, respectively.
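Percentages like those quoted in this abstract come from comparing aligned sequences column by column. The Python sketch below computes percent identity over the aligned, non-gap columns of two sequences. It is a simplification: the abstract reports similarity, which may also count conservative substitutions, and the function name and example fragments are assumptions made for illustration rather than data from the paper.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over columns without gaps.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    Columns where either sequence has a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, not taken from the collagen data.
    print(percent_identity("GPPGPAGE", "GPPGPSGE"))
    print(percent_identity("GP-GA", "GPPGS"))
```

Whether gapped columns count in the denominator is a convention that varies between tools, so values from different programs are not always directly comparable.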
Molecular cloning and sequence analysis of growth hormone cDNA of Neotropical freshwater fish Pacu (Piaractus mesopotamicus)

Directory of Open Access Journals (Sweden)

Janeth Silva Pinheiro

2008-01-01

RT-PCR was used for amplifying Piaractus mesopotamicus growth hormone (GH) cDNA obtained from mRNA extracted from pituitary cells. The amplified fragment was cloned and the complete cDNA sequence was determined. The cloned cDNA encompassed a sequence of 543 nucleotides that encoded a polypeptide of 178 amino acids corresponding to mature P. mesopotamicus GH. Comparison with other GH sequences showed a gap of 10 amino acids localized in the N terminus of the putative polypeptide of P. mesopotamicus. This same gap was also observed in other members of the family. Neighbor-joining tree analysis with GH sequences from fishes belonging to different taxonomic groups placed the P. mesopotamicus GH within the Otophysi group. To our knowledge, this is the first GH sequence of a Neotropical characiform fish deposited in GenBank.
Characterization of genomic sequence showing strong association with polyembryony among diverse Citrus species and cultivars, and its synteny with Vitis and Populus.

Science.gov (United States)

Nakano, Michiharu; Shimada, Takehiko; Endo, Tomoko; Fujii, Hiroshi; Nesumi, Hirohisa; Kita, Masayuki; Ebina, Masumi; Shimizu, Tokurou; Omura, Mitsuo

2012-02-01

Polyembryony, in which multiple somatic nucellar cell-derived embryos develop in addition to the zygotic embryo in a seed, is common in the genus Citrus. Previous genetic studies indicated polyembryony is mainly determined by a single locus, but the underlying molecular mechanism is still unclear. As a step towards identification and characterization of the gene or genes responsible for nucellar embryogenesis in Citrus, haplotype-specific physical maps around the polyembryony locus were constructed. By sequencing three BAC clones aligned on the polyembryony haplotype, a single contiguous draft sequence consisting of 380 kb containing 70 predicted open reading frames (ORFs) was reconstructed. Single nucleotide polymorphism genotypes detected in the sequenced genomic region showed strong association with embryo type in Citrus, indicating a common polyembryony locus is shared among widely diverse Citrus cultivars and species. The arrangement of the predicted ORFs in the characterized genomic region showed high collinearity to the genomic sequence of chromosome 4 of Vitis vinifera and linkage group VI of Populus trichocarpa, suggesting that the syntenic relationship among these species is conserved even though V. vinifera and P. trichocarpa are non-apomictic species. This is the first study to characterize in detail the genomic structure of an apomixis locus determining adventitious embryony. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Functional dissection of the alphavirus capsid protease: sequence requirements for activity.

Science.gov (United States)

Thomas, Saijo; Rai, Jagdish; John, Lijo; Günther, Stephan; Drosten, Christian; Pützer, Brigitte M; Schaefer, Stephan

2010-11-18

The alphavirus capsid is multifunctional and plays a key role in the viral life cycle. The nucleocapsid domain is released by the self-cleavage activity of the serine protease domain within the capsid. All alphaviruses analyzed to date show this autocatalytic cleavage. Here we have analyzed the sequence requirements for the cleavage activity of the capsid protease of Chikungunya virus, a member of the genus Alphavirus. Amongst alphaviruses, the C-terminal amino acid tryptophan (W261) is conserved and was found to be important for the cleavage. Mutating tryptophan to alanine (W261A) completely inactivated the protease. Other amino acids near W261 had no effect on the activity of this protease. However, the serine protease inhibitor AEBSF did not inhibit the activity. Through error-prone PCR we found that isoleucine 227 is important for effective activity. The loss of activity was analyzed further by molecular modelling and comparison of WT and mutant structures. It was found that lysine introduced at position 227 is spatially very close to the catalytic triad and may disrupt electrostatic interactions in the catalytic site and thus inactivate the enzyme. We are also examining other sequence requirements for this protease activity. We analyzed various amino acid sequence requirements for the activity of ChikV capsid protease and found that amino acids outside the catalytic triad are important for the activity.
Planarian homeobox genes: cloning, sequence analysis, and expression.

Science.gov (United States)

Garcia-Fernàndez, J; Baguñà, J; Saló, E

1991-01-01

Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to date. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appears slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) of regeneration. PMID:1714599
RNAblueprint: flexible multiple target nucleic acid sequence design.

Science.gov (United States)

Hammer, Stefan; Tschiatschek, Birgit; Flamm, Christoph; Hofacker, Ivo L; Findeiß, Sven

2017-09-15

Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. We developed a C++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows the solution space to be specified and explored in a well-defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaptation of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. RNAblueprint, Python implementations and benchmark datasets are available at GitHub: https://github.com/ViennaRNA. Contact: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
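For the simplest case of a single target structure, uniform sampling of compatible sequences can be sketched directly, because each base pair and each unpaired position can be chosen independently. The Python sketch below illustrates that special case only. It does not use the RNAblueprint library or its API, and it does not implement the graph coloring approach that the abstract describes for multiple structural targets; all function names are hypothetical.

```python
import random

# Canonical and wobble pairs allowed between paired positions.
PAIRS = ["AU", "UA", "GC", "CG", "GU", "UG"]
BASES = "ACGU"


def pair_table(structure):
    """Map each opening position to its closing partner (dot-bracket)."""
    stack = []
    pairs = {}
    for index, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(index)
        elif symbol == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            pairs[stack.pop()] = index
        elif symbol != ".":
            raise ValueError("unexpected symbol: " + symbol)
    if stack:
        raise ValueError("unbalanced structure")
    return pairs


def sample_sequence(structure, rng=None):
    """Draw one sequence uniformly among those compatible with structure."""
    rng = rng or random.Random()
    # Unpaired positions: any of the four bases.
    seq = [rng.choice(BASES) for _ in structure]
    # Paired positions: overwrite with one of the six allowed pairs.
    for i, j in pair_table(structure).items():
        seq[i], seq[j] = rng.choice(PAIRS)
    return "".join(seq)


if __name__ == "__main__":
    print(sample_sequence("((..))", random.Random(0)))
```

With several target structures, positions become linked through chains of pairing constraints, so independent choices no longer suffice; that dependency structure is the part the library's graph-based approach is designed to handle.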
Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

Science.gov (United States)

Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

2016-03-03

Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. Copyright © 2016 Meneghel et al.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

Science.gov (United States)

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have become known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, the inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure of 66.3% and an MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and an MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and an MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of
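The three scores reported in this abstract (accuracy, F-measure and Matthews correlation coefficient) can all be derived from the four cells of a binary confusion matrix. The Python sketch below shows the standard formulas; the function name and the example counts are assumptions made for illustration and are not taken from the study's data.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0

    # MCC ranges from -1 to 1; 0 corresponds to chance-level prediction.
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0

    return accuracy, f_measure, mcc


if __name__ == "__main__":
    # Hypothetical counts: 50 TP, 10 FP, 30 TN, 10 FN.
    print(binary_metrics(50, 10, 30, 10))
```

Unlike accuracy, the MCC uses all four cells symmetrically, which is why it is often reported alongside accuracy when binding and non-binding nucleotides are unevenly represented.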

Axolotl hemoglobin: cDNA-derived amino acid sequences of two alpha globins and a beta globin from an adult Ambystoma mexicanum.

Science.gov (United States)

Shishikura, Fumio; Takeuchi, Hiro-aki; Nagai, Takatoshi

2005-11-01

Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.
GAWK, a novel human pituitary polypeptide: isolation, immunocytochemical localization and complete amino acid sequence.

Science.gov (United States)

Benjannet, S; Leduc, R; Lazure, C; Seidah, N G; Marcinkiewicz, M; Chrétien, M

1985-01-16

During the course of reverse-phase high pressure liquid chromatography (RP-HPLC) purification of a postulated big ACTH (1) from human pituitary gland extracts, a highly purified peptide bearing no resemblance to any known polypeptide was isolated. The complete sequence of this 74 amino acid polypeptide, called GAWK, has been determined. Search on a computer data bank on the possible homology to any known protein or fragment, using a mutation data matrix, failed to reveal any homology greater than 30%. An antibody produced against a synthetic fragment allowed us to detect several immunoreactive forms. The antisera also enabled us to localize the polypeptide, by immunocytochemistry, in the anterior lobe of the pituitary gland.
Chameleon sequences in neurodegenerative diseases.

Science.gov (United States)

Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

2016-03-25

Chameleon sequences can adopt either an alpha helix, a beta sheet, or a coil conformation. Defining chameleon sequences in the PDB (Protein Data Bank) may yield insight into the peptides and proteins responsible for neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on chameleons, where we developed an algorithm to extract peptide segments with identical sequences but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data were classified into "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the largest number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe are the major amino acids in chameleons. We also found that proteins such as insulin-degrading enzyme (IDE) and GTP-binding nuclear protein Ran (RAN) have the largest number of chameleons (640 and 405, respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFPs can serve as key proteins in neurodegeneration, and a study of them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.
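The extraction step described here (identical peptide segments found in different structures) can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the function name `find_chameleons`, the window length `k`, the three-letter state alphabet (H = helix, E = strand, C = coil) and the rule that a window must be uniformly in one state are all assumptions.

```python
from collections import defaultdict

def find_chameleons(records, k=5):
    """Return k-mers observed in more than one secondary-structure state.

    records: iterable of (sequence, structure) string pairs of equal length,
    with structure written in a hypothetical H/E/C alphabet.
    """
    states = defaultdict(set)
    for seq, ss in records:
        if len(seq) != len(ss):
            raise ValueError("sequence and structure lengths differ")
        for i in range(len(seq) - k + 1):
            window_ss = ss[i:i + k]
            # Keep only windows lying entirely within one state.
            if len(set(window_ss)) == 1:
                states[seq[i:i + k]].add(window_ss[0])
    # A chameleon is a peptide seen in at least two different states.
    return {pep: "".join(sorted(s)) for pep, s in states.items() if len(s) > 1}

# Toy records, not PDB data.
toy = [("AAVLGIVKE", "HHHHHHHHH"), ("TTVLGIVQQ", "CCEEEEECC")]
print(find_chameleons(toy))  # {'VLGIV': 'EH'}
```

The returned state string maps directly onto the abstract's categories: "EH" corresponds to a helix/strand alteration, "CH" to helix/coil, and "CE" to strand/coil.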
Chameleon sequences in neurodegenerative diseases

Energy Technology Data Exchange (ETDEWEB)

Bahramali, Golnaz [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Goliaei, Bahram, E-mail: goliaei@ut.ac.ir [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of); Salari, Ali [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of)

2016-03-25

Chameleon sequences can adopt either an alpha helix, a beta sheet, or a coil conformation. Defining chameleon sequences in the PDB (Protein Data Bank) may yield insight into the peptides and proteins responsible for neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on chameleons, where we developed an algorithm to extract peptide segments with identical sequences but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data were classified into “helix to strand (HE)”, “helix to coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the largest number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe are the major amino acids in chameleons. We also found that proteins such as insulin-degrading enzyme (IDE) and GTP-binding nuclear protein Ran (RAN) have the largest number of chameleons (640 and 405, respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFPs can serve as key proteins in neurodegeneration, and a study of them can shed light on curing and preventing neurodegenerative diseases.
Lion (Panthera leo) and cheetah (Acinonyx jubatus) IFN-gamma sequences.

Science.gov (United States)

Maas, Miriam; Van Rhijn, Ildiko; Allsopp, Maria T E P; Rutten, Victor P M G

2010-04-15

Cloning and sequencing of the full length lion and cheetah interferon-gamma (IFN-gamma) transcript will enable the expression of the recombinant cytokine, to be used for production of monoclonal antibodies and to set up lion and cheetah-specific IFN-gamma ELISAs. These are relevant in blood-based diagnosis of bovine tuberculosis, an important threat to lions in the Kruger National Park. Alignment of nucleotide and amino acid sequences of lion and cheetah and that of domestic cats showed homologies of 97-100%. Copyright 2009 Elsevier B.V. All rights reserved.
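Figures such as the 97-100% reported here are typically obtained by counting matching columns in a pairwise alignment. The following is a minimal sketch under that assumption; the function name `percent_identity`, the gap symbol and the choice to count gap-versus-residue columns as mismatches are illustrative conventions, and the sequences are invented.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two rows of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap and y == gap:
            continue  # a column empty in both rows carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared

# Invented fragments, not the lion, cheetah or cat IFN-gamma sequences.
print(round(percent_identity("MKYT-SYIL", "MKYTNSYIL"), 1))  # 88.9
```

Other conventions (for example, excluding all gapped columns from the denominator) give slightly different percentages, so the convention used should be stated with any reported value.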
A novel β-xylosidase, nucleotide sequence encoding it and use thereof.

NARCIS (Netherlands)

Graaff, de L.H.; Peij, van N.N.M.E.; Broeck, van den H.C.; Visser, J.

1996-01-01

A nucleotide sequence is provided which encodes a peptide having beta-xylosidase activity and exhibits at least 30% amino acid identity with the amino acid sequence shown in SEQ ID NO. 1 or hybridises under stringent conditions with a nucleotide sequence shown in SEQ ID NO. 1, or a part thereof having
Replacement of C305 in heart/muscle-type isozyme of human carnitine palmitoyltransferase I with aspartic acid and other amino acids.

Science.gov (United States)

Matsuo, Taisuke; Yamamoto, Atsushi; Yamamoto, Takenori; Otsuki, Kaoru; Yamazaki, Naoshi; Kataoka, Masatoshi; Terada, Hiroshi; Shinohara, Yasuo

2010-04-01

Liver- and heart/muscle-type isozymes of human carnitine palmitoyltransferase I (L- and M-CPTI, respectively) show a certain similarity in their amino acid sequences, and mutation studies on the conserved amino acids between these two isozymes often show essentially the same effects on their enzymatic properties. Earlier mutation studies on C305 in human M-CPTI and its counterpart residue, C304, in human L-CPTI showed distinct effects of the mutations, especially in the aspect of enzyme stability; however, simple comparison of these effects on the conserved Cys residue between L- and M-CPTI was difficult, because these studies were carried out using different expression systems and distinct amino acids as replacements. In the present study, we carried out mutation studies on the C305 in human M-CPTI using COS cells for the expression system. Our results showed that C305 was replaceable with aspartic acid but that substitution with other amino acids caused both loss of function and reduced expression.
Lysine and pipecolic acid and some of their derivatives show anticonvulsant activity, and stimulation of benzodiazepine receptor activity

International Nuclear Information System (INIS)

Chang, Yung-Feng; Gao, Xue-Min

1989-01-01

Benzodiazepines are among the most widely prescribed drugs for the treatment of anxiety, epilepsy and muscle tension. The natural products lysine and pipecolic acid, known to be present in animals, plants and microorganisms, have been shown to be anticonvulsant against pentetrazol (PTZ)-induced seizures in mice. Methyl and ethyl esters of L-lysine and the N-isopropanol derivative of pipecolic acid appear to increase the anticonvulsant potency of the parent compounds, presumably due to their increased hydrophobicity. Lysine and pipecolic acid showed significant stimulation of specific [3H]flunitrazepam (FZ) binding to mouse brain membranes. This stimulation was enhanced by chloride ions and was stereospecific, with the L-isomer having the greater effect. The dose-dependent anticonvulsant activity of lysine and pipecolic acid and their stimulation of [3H]FZ binding appear to be correlated. The antiepileptic activity of lysine, pipecolic acid and their derivatives may therefore be mediated through the γ-aminobutyric acid-benzodiazepine receptor complex
Partial characterization of the lettuce infectious yellows virus genomic RNAs, identification of the coat protein gene and comparison of its amino acid sequence with those of other filamentous RNA plant viruses.

Science.gov (United States)

Klaassen, V A; Boeshore, M; Dolja, V V; Falk, B W

1994-07-01

Purified virions of lettuce infectious yellows virus (LIYV), a tentative member of the closterovirus group, contained two RNAs of approximately 8500 and 7300 nucleotides (RNAs 1 and 2 respectively) and a single coat protein species with M(r) of approximately 28,000. LIYV-infected plants contained multiple dsRNAs. The two largest were the correct size for the replicative forms of LIYV virion RNAs 1 and 2. To assess the relationships between LIYV RNAs 1 and 2, cDNAs corresponding to the virion RNAs were cloned. Northern blot hybridization analysis showed no detectable sequence homology between these RNAs. A partial amino acid sequence obtained from purified LIYV coat protein was found to align in the most upstream of four complete open reading frames (ORFs) identified in a LIYV RNA 2 cDNA clone. The identity of this ORF was confirmed as the LIYV coat protein gene by immunological analysis of the gene product expressed in vitro and in Escherichia coli. Computer analysis of the LIYV coat protein amino acid sequence indicated that it belongs to a large family of proteins forming filamentous capsids of RNA plant viruses. The LIYV coat protein appears to be most closely related to the coat proteins of two closteroviruses, beet yellows virus and citrus tristeza virus.
The nonenzymatic subunit of pseutarin C, a prothrombin activator from eastern brown snake (Pseudonaja textilis) venom, shows structural similarity to mammalian coagulation factor V.

Science.gov (United States)

Rao, Veena S; Swarup, Sanjay; Kini, R Manjunatha

2003-08-15

Pseutarin C is a group C prothrombin activator from the venom of the eastern brown snake Pseudonaja textilis. It is a multi-subunit protein complex consisting of catalytic and nonenzymatic subunits similar to coagulation factor Xa and factor Va, respectively. Here we describe the complete sequence of the nonenzymatic subunit. Based on the partial amino acid sequence of the nonenzymatic subunit, degenerate primers were designed. Using a "walking" strategy based on sequentially designed primers, we determined the complete cDNA sequence of the nonenzymatic subunit. The cDNA encodes a protein of 1461 amino acid residues, which includes a 30-residue signal peptide, a mature protein of 1430 amino acid residues, and a stop codon. cDNA blot analysis showed a single transcript of approximately 4.6 kb. The deduced amino acid sequence shows approximately 50% identity to mammalian factor V and by homology has a similar domain structure consisting of domains A1-A2-B-A3-C1-C2. Interestingly, the B domain of pseutarin C is shorter than that of mammalian factor V (FV). Although most of the proteolytic activation sites are conserved, 2 of 3 proteolytic sites cleaved by activated protein C are mutated, and thus activated protein C is not able to inactivate this procoagulant toxin. The predicted posttranslational modifications, including disulfide bonds, N-glycosylation, phosphorylation, and sulfation, in pseutarin C are significantly different compared with bovine factor V. Thus, our data demonstrate that the nonenzymatic subunit of group C prothrombin activators is structurally similar to mammalian FV.
A Novel Phytase Derived from an Acidic Peat-Soil Microbiome Showing High Stability under Acidic Plus Pepsin Conditions.

Science.gov (United States)

Tan, Hao; Wu, Xiang; Xie, Liyuan; Huang, Zhongqian; Peng, Weihong; Gan, Bingcheng

2016-01-01

Four novel phytases of the histidine acid phosphatase family were identified in two publicly available metagenomic datasets of an acidic peat-soil microbiome in northeastern Bavaria, Germany. These enzymes have low similarity to all the reported phytases. They were overexpressed in Escherichia coli and purified. Catalytic efficacy in simulated gastric fluid was measured and compared among the four candidates. The phytase named rPhyPt4 was selected for its high activity. It is the first phytase identified from unculturable Acidobacteria. The phytase showed a longer half-life than all the gastric-stable phytases that have been reported to date, suggesting a strong resistance to low pH and pepsin. A wide pH profile was observed between pH 1.5 and 5.0. At the optimum pH (2.5) the activity was 2,790 μmol/min/mg at the physiological temperature of 37°C and 3,989 μmol/min/mg at the optimum temperature of 60°C. Due to the competent activity level as well as the high gastric stability, the phytase could be a potential candidate for practical use in livestock and poultry feeding. © 2016 S. Karger AG, Basel.
Complete genome sequence of a new enamovirus from Argentina infecting alfalfa plants showing dwarfism symptoms.

Science.gov (United States)

Bejerman, Nicolás; Giolitti, Fabián; Trucco, Verónica; de Breuil, Soledad; Dietzgen, Ralf G; Lenardon, Sergio

2016-07-01

Alfalfa dwarf disease, probably caused by synergistic interactions of mixed virus infections, is a major and emergent disease that threatens alfalfa production in Argentina. Deep sequencing of diseased alfalfa plant samples from the central region of Argentina resulted in the identification of a new virus genome resembling enamoviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Enamovirus, family Luteoviridae. The virus is tentatively named "alfalfa enamovirus 1" (AEV-1). The availability of the AEV-1 genome sequence will make it possible to assess the genetic variability of this virus and to construct an infectious clone to investigate its role in alfalfa dwarfism disease.
The nucleotide sequences of two leghemoglobin genes from soybean

DEFF Research Database (Denmark)

Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O

1982-01-01

We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences in identical positions. Comparison of the coding sequences with known amino-acid sequences of soybean leghemoglobins suggests that the two genes...
Osteocalcin protein sequences of Neanderthals and modern primates.

Science.gov (United States)

Nielsen-Marsh, Christina M; Richards, Michael P; Hauschka, Peter V; Thomas-Oates, Jane E; Trinkaus, Erik; Pettitt, Paul B; Karavanic, Ivor; Poinar, Hendrik; Collins, Matthew J

2005-03-22

We report here protein sequences of fossil hominids, from two Neanderthals dating to approximately 75,000 years old from Shanidar Cave in Iraq. These sequences, the oldest reported fossil primate protein sequences, are of bone osteocalcin, which was extracted and sequenced by using MALDI-TOF/TOF mass spectrometry. Through a combination of direct sequencing and peptide mass mapping, we determined that Neanderthals have an osteocalcin amino acid sequence that is identical to that of modern humans. We also report complete osteocalcin sequences for chimpanzee (Pan troglodytes) and gorilla (Gorilla gorilla gorilla) and a partial sequence for orangutan (Pongo pygmaeus), all of which are previously unreported. We found that the osteocalcin sequences of Neanderthals, modern human, chimpanzee, and orangutan are unusual among mammals in that the ninth amino acid is proline (Pro-9), whereas most species have hydroxyproline (Hyp-9). Posttranslational hydroxylation of Pro-9 in osteocalcin by prolyl-4-hydroxylase requires adequate concentrations of vitamin C (L-ascorbic acid), molecular O2, Fe2+, and 2-oxoglutarate, and also depends on enzyme recognition of the target proline substrate consensus sequence Leu-Gly-Ala-Pro-9-Ala-Pro-Tyr occurring in most mammals. In five species with Pro-9-Val-10, hydroxylation is blocked, whereas in gorilla there is a mixture of Pro-9 and Hyp-9. We suggest that the absence of hydroxylation of Pro-9 in Pan, Pongo, and Homo may reflect response to a selective pressure related to a decline in vitamin C in the diet during omnivorous dietary adaptation, either independently or through the common ancestor of these species.
Sequence analysis of putative swrW gene required for surfactant ...

African Journals Online (AJOL)

Serratia marcescens produces biosurfactant serrawettin, essential for its population migration behavior. Serrawettin W1 was revealed to be an antibiotic serratamolide that makes it significant for deoxyribonucleic acid (DNA) and protein sequence analysis. Four nucleotide and amino-acid sequences from local strains ...
Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the influenza A virus subtypes responsible for the 20th‐century pandemics

Science.gov (United States)

Pasricha, Gunisha; Mishra, Akhilesh C.; Chakrabarti, Alok K.

2012-01-01

Please cite this paper as: Pasricha et al. (2012) Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the Influenza A virus subtypes responsible for the 20th‐century pandemics. Influenza and Other Respiratory Viruses 7(4), 497–505. Background PB1F2 is the 11th protein of influenza A virus translated from +1 alternate reading frame of PB1 gene. Since the discovery, varying sizes and functions of the PB1F2 protein of influenza A viruses have been reported. Selection of PB1 gene segment in the pandemics, variable size and pleiotropic effect of PB1F2 intrigued us to analyze amino acid sequences of this protein in various influenza A viruses. Methods Amino acid sequences for PB1F2 protein of influenza A H5N1, H1N1, H2N2, and H3N2 subtypes were obtained from Influenza Research Database. Multiple sequence alignments of the PB1F2 protein sequences of the aforementioned subtypes were used to determine the size, variable and conserved domains and to perform mutational analysis. Results Analysis showed that 96·4% of the H5N1 influenza viruses harbored full‐length PB1F2 protein. Except for the 2009 pandemic H1N1 virus, all the subtypes of the 20th‐century pandemic influenza viruses contained full‐length PB1F2 protein. Through the years, PB1F2 protein of the H1N1 and H3N2 viruses has undergone much variation. PB1F2 protein sequences of H5N1 viruses showed both human‐ and avian host‐specific conserved domains. Global database of PB1F2 protein revealed that N66S mutation was present only in 3·8% of the H5N1 strains. We found a novel mutation, N84S in the PB1F2 protein of 9·35% of the highly pathogenic avian influenza H5N1 influenza viruses. Conclusions Varying sizes and mutations of the PB1F2 protein in different influenza A virus subtypes with pandemic potential were obtained. There was genetic divergence of the protein in various hosts which highlighted the host‐specific evolution of the virus
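The mutational analysis summarized here reports how often a substitution such as N66S or N84S occurs across a set of sequences. A minimal sketch of that kind of tally follows; the helper names `parse_mutation` and `residue_frequency`, the handling of truncated sequences and the toy data are assumptions, not the authors' pipeline.

```python
import re

def parse_mutation(label):
    """Split a label like 'N66S' into (reference, position, variant)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label}")
    ref, pos, var = match.groups()
    return ref, int(pos), var

def residue_frequency(sequences, position, residue):
    """Percentage of sequences carrying `residue` at a 1-based position.

    Sequences too short to reach the position (truncated proteins) are
    excluded from the denominator; this is an illustrative choice.
    """
    eligible = [s for s in sequences if len(s) >= position]
    if not eligible:
        return 0.0
    hits = sum(1 for s in eligible if s[position - 1] == residue)
    return 100.0 * hits / len(eligible)

# Toy sequences, not Influenza Research Database records.
toy = ["MNS", "MSS", "MNA", "MN"]
ref, pos, var = parse_mutation("N2S")
print(residue_frequency(toy, pos, var))  # 25.0
```

Because the abstract stresses that PB1F2 varies in length, how truncated sequences enter the denominator can noticeably change the reported percentage.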
Cloning and sequence analysis of cDNA coding for rat nucleolar protein C23

International Nuclear Information System (INIS)

Ghaffari, S.H.; Olson, M.O.J.

1986-01-01

Using synthetic oligonucleotides as primers and probes, the authors have isolated and sequenced cDNA clones encoding protein C23, a putative nucleolus organizer protein. Poly(A+) RNA was isolated from rat Novikoff hepatoma cells and enriched in C23 mRNA by sucrose density gradient ultracentrifugation. Two deoxyoligonucleotides, a 48- and a 27-mer, were synthesized on the basis of amino acid sequence from the C-terminal half of protein C23 and cDNA sequence data from CHO cell protein. The 48-mer was used as a primer for synthesis of cDNA, which was then inserted into plasmid pUC9. Transformed bacterial colonies were screened by hybridization with 32P-labeled 27-mer. Two clones among 5000 gave a strong positive signal. Plasmid DNAs from these clones were purified and characterized by blotting and nucleotide sequence analysis. The length of C23 mRNA was estimated to be 3200 bases in a northern blot analysis. The sequence of a 267 b.p. insert shows high homology with the CHO cDNA, with only 9 nucleotide differences and an identical amino acid sequence. These studies indicate that this region of the protein is highly conserved
Quantitative thermodynamic predication of interactions between nucleic acid and non-nucleic acid species using Microsoft excel.

Science.gov (United States)

Zou, Jiaqi; Li, Na

2013-09-01

Proper design of nucleic acid sequences is crucial for many applications. We have previously established a thermodynamics-based quantitative model to help design aptamer-based nucleic acid probes by predicting equilibrium concentrations of all interacting species. To facilitate customization of this thermodynamic model for different applications, here we present a generic and easy-to-use platform to implement the algorithm of the model with Microsoft® Excel formulas and VBA (Visual Basic for Applications) macros. Two Excel spreadsheets have been developed: one for applications involving only nucleic acid species, the other for applications involving both nucleic acid and non-nucleic acid species. The spreadsheets take the nucleic acid sequences and the initial concentrations of all species as input, guide the user to retrieve the necessary thermodynamic constants, and finally calculate equilibrium concentrations for all species in various bound and unbound conformations. The validity of both spreadsheets has been verified by comparing the modeling results with the experimental results on nucleic acid sequences reported in the literature. The Excel-based platform described here will allow biomedical researchers to rationalize the sequence design of nucleic acid probes using thermodynamics-based modeling even without relevant theoretical and computational skills. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
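The core calculation described here, going from thermodynamic constants and initial concentrations to equilibrium concentrations, can be illustrated for the simplest case of one 1:1 binding reaction A + B ⇌ AB. This sketch is written in Python rather than Excel/VBA and is not the authors' model, which handles many species and conformations; the function names, the kcal/mol units, the 1 M standard state and the example values are assumptions.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def association_constant(delta_g_kcal, temp_k=298.15):
    """Association constant (per molar) from a standard free energy of binding."""
    return math.exp(-delta_g_kcal / (R_KCAL * temp_k))

def equilibrium_1to1(a0, b0, k_assoc):
    """Equilibrium concentrations (molar) for A + B <-> AB.

    Mass balance plus mass action gives a quadratic in [AB]; the smaller
    root is the physically meaningful one.
    """
    kd = 1.0 / k_assoc
    s = a0 + b0 + kd
    ab = (s - math.sqrt(s * s - 4.0 * a0 * b0)) / 2.0
    return {"A": a0 - ab, "B": b0 - ab, "AB": ab}

# Invented example: 1 uM of each species, Kd = 1 uM.
print(equilibrium_1to1(1e-6, 1e-6, 1e6))
```

With several coupled equilibria the closed-form quadratic no longer applies and the system is usually solved numerically, which is presumably the part the spreadsheets automate.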
Cloning, sequence and expression of the pel gene from an Amycolata sp.

Science.gov (United States)

Brühlmann, F; Keen, N T

1997-11-20

The pel gene from an Amycolata sp. encoding a pectate lyase (EC 4.2.2.2) was isolated by activity screening a genomic DNA library in Streptomyces lividans TK24. Subsequent subcloning and sequencing of a 2.3 kb BamHI-BglII fragment revealed an open reading frame of 930 nt corresponding to a protein of 29,660 Da. The overall G + C content for the coding region was 65%, with a strong G + C preference in the third (wobble) codon position (93%). A putative ribosome-binding site 5'-GGGAG-3' preceded the translational start codon by 7 base pairs. The Amycolata pectate lyase contains a signal peptide of 26 amino acids that is cleaved after the sequence Ala-Thr-Ala. The size of the deduced protein as well as its N-terminal amino-acid sequence match the wild-type pectate lyase from the Amycolata sp. Expression of the pel gene in S. lividans TK24 resulted in high pectate lyase activity in the culture supernatant, concomitant with the appearance of a dominant protein band on a sodium dodecyl sulfate-polyacrylamide gel at 30 kDa. No pectate lyase activity was detected in E. coli BL21 with the pel gene under the strong T7 promoter. The deduced amino-acid sequence showed 40% identity with PelE from Erwinia chrysanthemi and the pectate lyase from Glomerella cingulata. The Amycolata pectate lyase clearly belongs to the pectate lyase superfamily, sharing all functional amino acids, and likely has a structural topology similar to that of the Pels from Erwinia chrysanthemi and Bacillus subtilis.
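The two composition figures quoted above, overall G + C content and G + C content at the third (wobble) codon position, are simple counts over the coding sequence. The sketch below shows both; the function names and the short example sequence are illustrative, and the input is assumed to be an in-frame coding sequence.

```python
def gc_content(seq):
    """Percent G + C in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)

def gc3_content(cds):
    """Percent G + C at the third position of each codon.

    Assumes `cds` starts in frame at the first base of the start codon.
    """
    return gc_content(cds.upper()[2::3])

# Invented 4-codon sequence, not the Amycolata pel gene.
toy = "ATGGCCGCGTAA"
print(round(gc_content(toy), 1), gc3_content(toy))  # 58.3 75.0
```

A third-position value far above the overall value, as reported for this gene, is a common signature of coding regions in high-G+C organisms.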

Genomic sequencing in clinical trials

OpenAIRE

Mestan, Karen K; Ilkhanoff, Leonard; Mouli, Samdeep; Lin, Simon

2011-01-01

Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to fin...
Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

NARCIS (Netherlands)

Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

2013-01-01

Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which
Mass spectrometric amino acid sequencing of a mixture of seed storage proteins (napin) from Brassica napus, products of a multigene family.

OpenAIRE

Gehrig, P M; Krzyzaniak, A; Barciszewski, J; Biemann, K

1996-01-01

The amino acid sequences of a number of closely related proteins ("napin") isolated from Brassica napus were determined by mass spectrometry without prior separation into individual components. Some of these proteins correspond to those previously deduced (napA, BngNAP1, and gNa), chiefly from DNA sequences. Others were found to differ to a varying extent (BngNAP1', BngNAP1A, BngNAP1B, BngNAP1C, gNa', and gNaA). The short chains of gNa and gNa' and of BngNAP1 and BngNAP1' differ by the replac...
Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

Directory of Open Access Journals (Sweden)

James B Howard

Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf, and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a selenocysteine that replaces one cysteinyl ligand of the 8Fe:7S P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification
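Classifying alignment columns by their level of variance, as this abstract describes, can be sketched as below. This is an illustration only: the function name `classify_columns`, the label names, the treatment of gapped columns and the reading of "single variant" as a column with exactly two distinct residues are all assumptions, and the toy alignment is invented.

```python
def classify_columns(alignment, gap="-"):
    """Label each column of a multiple sequence alignment.

    alignment: list of equal-length strings, one per aligned sequence.
    Returns one label per column: 'invariant', 'single variant',
    'variable' or 'gapped'.
    """
    if len({len(row) for row in alignment}) != 1:
        raise ValueError("all aligned sequences must have equal length")
    labels = []
    for column in zip(*alignment):
        residues = set(column) - {gap}
        if gap in column:
            labels.append("gapped")          # insertion/deletion position
        elif len(residues) == 1:
            labels.append("invariant")       # same residue in every sequence
        elif len(residues) == 2:
            labels.append("single variant")  # one alternative residue seen
        else:
            labels.append("variable")
    return labels

# Invented 4-sequence alignment, not nitrogenase data.
toy = ["MKCA", "MRCG", "MKC-", "MKCT"]
print(classify_columns(toy))
```

Running the same function on subsets of rows (for example, one group at a time) would give the group-specific invariant residues the abstract mentions.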
Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

DEFF Research Database (Denmark)

Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk

2013-01-01

Next Generation Sequencing (NGS) is becoming more widely adopted in viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Ion Torrent PGM platforms... to the consensus sequence. Additionally, we got an average sequence depth for the genome of 4000 for the Ion Torrent PGM and 400 for the FLX platform, making the mapping suitable for single nucleotide variant (SNV) detection. The analysis revealed a single non-silent SNV, A10665G, leading to the amino acid change D...
Peptide Nucleic Acids Having Amino Acid Side Chains

DEFF Research Database (Denmark)

1998-01-01

A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than the corresponding DNA or RNA strands, and exhibit increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from a group consisting...
T2* mapping from multi-echo Dixon sequence on gadoxetic acid-enhanced magnetic resonance imaging for the hepatic fat quantification: Can it be used for hepatic function assessment?

Energy Technology Data Exchange (ETDEWEB)

Yoo, Hyun Suk; Lee, Jeong Min; Yoon, Jeong Hee; Kang, Hyo Jin; Lee, Sang Min; Yang, Hyun Kyung; Han, Joon Koo [Dept. of Radiology, Seoul National University Hospital, Seoul (Korea, Republic of)

2017-08-01

To evaluate the diagnostic value of T2* mapping using 3D multi-echo Dixon gradient echo acquisition on gadoxetic acid-enhanced liver magnetic resonance imaging (MRI) as a tool to evaluate hepatic function. This retrospective study was approved by the IRB and the requirement of informed consent was waived. 242 patients who underwent liver MRIs, including 3D multi-echo Dixon fast gradient-recalled echo (GRE) sequence at 3T, before and after administration of gadoxetic acid, were included. Based on clinico-laboratory manifestation, the patients were classified as having normal liver function (NLF, n = 50), mild liver damage (MLD, n = 143), or severe liver damage (SLD, n = 30). The 3D multi-echo Dixon GRE sequence was obtained before, and 10 minutes after, gadoxetic acid administration. Pre- and post-contrast T2* values, as well as T2* reduction rates, were measured from T2* maps, and compared among the three groups. There was a significant difference in T2* reduction rates between the NLF and SLD groups (−0.2 ± 4.9% vs. 5.0 ± 6.9%, p = 0.002), and between the MLD and SLD groups (3.2 ± 6.0% vs. 5.0 ± 6.9%, p = 0.003). However, there was no significant difference in either the pre- or post-contrast T2* values among different liver function groups (p = 0.735 and 0.131, respectively). A receiver operating characteristic (ROC) curve analysis showed that the area under the ROC curve for using T2* reduction rates to differentiate the SLD group from the NLF group was 0.74 (95% confidence interval: 0.63–0.83). Incorporation of T2* mapping using 3D multi-echo Dixon GRE sequence in gadoxetic acid-enhanced liver MRI protocol may provide supplemental information for liver function deterioration in patients with SLD.
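As a rough illustration of the two quantities reported here, the sketch below computes a reduction rate and an area under the ROC curve. Two things are assumptions: the reduction rate is taken to be (pre − post) / pre × 100, which the abstract does not spell out, and all numeric values are invented toy data rather than study measurements.

```python
def reduction_rate(pre, post):
    """Percent reduction from pre- to post-contrast value.

    Assumed definition: 100 * (pre - post) / pre. The abstract does not
    state the formula the authors used.
    """
    return 100.0 * (pre - post) / pre


def roc_auc(positives, negatives):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive scores higher
    than a randomly chosen negative, with ties counted as one half.
    """
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Invented reduction rates (%) for two hypothetical groups.
TOY_SLD = [6.0, 2.0, 9.0]
TOY_NLF = [1.0, 3.0, -2.0]
TOY_AUC = roc_auc(TOY_SLD, TOY_NLF)
```

The pairwise form is quadratic in the group sizes, which is acceptable for cohorts of a few hundred patients; a rank-based formulation would give the same value for larger data.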
Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

Science.gov (United States)

Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

2015-11-21

Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties.
Isolation of laccase gene-specific sequences from white rot and brown rot fungi by PCR

Energy Technology Data Exchange (ETDEWEB)

D'Souza, T.M.; Boominathan, K.; Reddy, C.A. [Michigan State Univ., East Lansing, MI (United States)]

1996-10-01

Degenerate primers corresponding to the consensus sequences of the copper-binding regions in the N-terminal domains of known basidiomycete laccases were used to isolate laccase gene-specific sequences from strains representing nine genera of wood rot fungi. All except three gave the expected PCR product of about 200 bp. Computer searches of the databases identified the sequence of each of the PCR products analyzed as a laccase gene sequence, suggesting the specificity of the primers. PCR products of the white rot fungi Ganoderma lucidum, Phlebia brevispora, and Trametes versicolor showed 65 to 74% nucleotide sequence similarity to each other; the similarity in deduced amino acid sequences was 83 to 91%. The PCR products of Lentinula edodes and Lentinus tigrinus, on the other hand, showed relatively low nucleotide and amino acid similarities (58 to 64 and 62 to 81%, respectively); however, these similarities were still much higher than when compared with the corresponding regions in the laccases of the ascomycete fungi Aspergillus nidulans and Neurospora crassa. A few of the white rot fungi, as well as Gloeophyllum trabeum, a brown rot fungus, gave a 144-bp PCR fragment which had a nucleotide sequence similarity of 60 to 71%. Demonstration of laccase activity in G. trabeum and several other brown rot fungi was of particular interest because these organisms were not previously shown to produce laccases. 36 refs., 6 figs., 2 tabs.
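To make the idea of a degenerate primer concrete, here is a minimal sketch that expands a primer written with the standard IUPAC nucleotide ambiguity codes into every concrete sequence it stands for. The primer `CAYTGGCAYGG` is a hypothetical example chosen for illustration; the abstract does not give the primers that were actually used.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences a degenerate primer represents."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_primer(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical primer with two Y (C or T) positions, so four variants.
TOY_PRIMER = "CAYTGGCAYGG"
TOY_VARIANTS = expand_primer(TOY_PRIMER)
```

Checking `degeneracy` before calling `expand_primer` is a sensible habit, since the number of variants grows multiplicatively with every ambiguous position.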
Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

Science.gov (United States)

Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

2014-10-01

Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags, 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salt derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulting fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. The results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.
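The a- and b-type ion series mentioned here can be sketched for an ordinary, underivatized peptide. The code below uses standard monoisotopic residue masses and the usual singly protonated convention (b = sum of residue masses + proton, a = b − CO). It does not model the ABCO or DABCO tags: a tagged peptide carries a fixed charge and an additional tag mass, and those values are not given in the abstract. The peptide `GAK` is a toy example.

```python
# Monoisotopic residue masses in Da (standard values, rounded to 5 decimals).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
PROTON = 1.007276  # mass of a proton, Da
CO = 27.994915     # mass of carbon monoxide, Da


def b_ions(peptide):
    """m/z of singly protonated b-ions b1..b(n-1) of an unmodified peptide."""
    masses, total = [], 0.0
    for residue in peptide[:-1]:
        total += MONO[residue]
        masses.append(total + PROTON)
    return masses


def a_ions(peptide):
    """m/z of a-ions, each one CO lighter than the matching b-ion."""
    return [mz - CO for mz in b_ions(peptide)]


TOY_B = b_ions("GAK")
TOY_A = a_ions("GAK")
```

Reading a sequence from a spectrum amounts to the reverse lookup: the difference between consecutive b-ions (or consecutive a-ions) equals one residue mass.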
Identifying a base in a nucleic acid

Science.gov (United States)

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2005-02-08

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

Directory of Open Access Journals (Sweden)

Budzanivska I. G.

2014-03-01

Aim. Identification of the widespread Ukrainian isolate(s) of PVY (Potato virus Y) in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of our own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene) has a higher percentage of homology with the recombinant isolates (strains) of this pathogen (approx. 98.8–99.8% homology for both nucleotide and translated amino acid sequences of the CP gene). The Ukrainian isolate of PVY is positioned in the separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate which has been attributed to the strain group O, subclade N:O.
Variation of amino acid sequences of serum amyloid a (SAA) and immunohistochemical analysis of amyloid a (AA) in Japanese domestic cats.

Science.gov (United States)

Tei, Meina; Uchida, Kazuyuki; Chambers, James K; Watanabe, Ken-Ichi; Tamamoto, Takashi; Ohno, Koichi; Nakayama, Hiroyuki

2018-02-02

Amyloid A (AA) amyloidosis, a fatal systemic amyloid disease, occurs secondary to chronic inflammatory conditions in humans. Although persistently elevated serum amyloid A (SAA) levels are required for its pathogenesis, not all individuals with chronic inflammation necessarily develop AA amyloidosis. Furthermore, many diseases in cats are associated with the elevated production of SAA, whereas only a small number actually develop AA amyloidosis. We hypothesized that a genetic mutation in the SAA gene may strongly contribute to the pathogenesis of feline AA amyloidosis. In the present study, genomic DNA from four Japanese domestic cats (JDCs) with AA amyloidosis and from five without amyloidosis was analyzed using polymerase chain reaction (PCR) amplification and direct sequencing. We identified the novel variation combination of 45R-51A in the deduced amino acid sequences of four JDCs with amyloidosis and five without. However, there was no relationship between amino acid variations and the distribution of AA amyloid deposits, indicating that differences in SAA sequences do not contribute to the pathogenesis of AA amyloidosis. Immunohistochemical analysis using antisera against the three different parts of the feline SAA protein (i.e., the N-terminal, central, and C-terminal regions) revealed that feline AA contained the C-terminus, unlike human AA. These results indicate that the cleavage and degradation of the C-terminus are not essential for amyloid fibril formation in JDCs.
Purification and partial amino-acid sequence of gibberellin 20-oxidase from Cucurbita maxima L. endosperm.

Science.gov (United States)

Lange, T

1994-01-01

Gibberellin (GA) 20-oxidase was purified to apparent homogeneity from Cucurbita maxima endosperm by fractionated ammonium-sulphate precipitation, gel-filtration chromatography and anion-exchange and hydrophobic-interaction high-performance liquid chromatography (HPLC). Average purification after the last step was 55-fold with 3.9% of the activity recovered. The purest single fraction was enriched 101-fold with 0.2% overall recovery. Apparent relative molecular mass of the enzyme was 45 kDa, as determined by gel-filtration HPLC and sodium dodecyl sulphate-polyacrylamide gel electrophoresis, indicating that GA 20-oxidase is probably a monomeric enzyme. The purified enzyme degraded on two-dimensional gel electrophoresis, giving two protein spots: a major one corresponding to a molecular mass of 30 kDa and a minor one at 45 kDa. The isoelectric point for both was 5.4. The amino-acid sequences of the amino-terminus of the purified enzyme and of two peptides from a tryptic digest were determined. The purified enzyme catalysed the sequential conversion of [14C]GA12 to [14C]GA15, [14C]GA24 and [14C]GA25, showing that carbon atom 20 was oxidised to the corresponding alcohol, aldehyde and carboxylic acid in three consecutive reactions. [14C]Gibberellin A53 was similarly converted to [14C]GA44, [14C]GA19, [14C]GA17 and small amounts of a fourth product, which was preliminarily identified as [14C]GA20, a C19-gibberellin. All GAs except [14C]GA20 were identified by combined gas chromatography-mass spectrometry. The cofactor requirements in the absence of dithiothreitol were essentially as in its presence (Lange et al., Planta 195, 98-107, 1994), except that ascorbate was essential for enzyme activity and the optimal concentration of catalase was lower.
Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence

NARCIS (Netherlands)

Al-Shahib, A.; Breitling, R.; Gilbert, D.

2005-01-01

When the standard approach to predict protein function by sequence homology fails, alternative methods can be used that require only the amino acid sequence for predicting function. One such approach uses machine learning to predict protein function directly from amino acid sequence
Sequences of 12 monoclonal anti-dinitrophenyl spin-label antibodies for NMR studies

International Nuclear Information System (INIS)

Leahy, D.J.; Rule, G.S.; Whittaker, M.M.; McConnell, H.M.

1988-01-01

Eleven monoclonal antibodies specific for a spin-labeled dinitrophenyl hapten (DNP-SL) have been produced for use in NMR studies. They have been named AN01 and AN03-AN12. The stability constants for the association of these antibodies with DNP-SL and related haptens were measured by fluorescence quenching. cDNA clones coding for the heavy and light chains of each antibody and of an additional anti-DNP-SL monoclonal antibody, AN02, have been isolated. The nucleic acid sequence of the 5' end of each clone has been determined, and the amino acid sequence of the variable regions of each antibody has been deduced from the cDNA sequence. The sequences are relatively heterogeneous, but both the heavy and the light chains of AN01 and AN03 are derived from the same variable-region gene families as those of the AN02 antibody. AN07 has a heavy chain that is related to that of AN02, and AN09 has a related light chain. AN05 and AN06 are unrelated to AN02 but share virtually identical heavy and light chains. Preliminary NMR difference spectra comparing related antibodies show that sequence-specific assignment of resonances is possible. Such spectra also provide a measure of structural relatedness
Cloning and characterization of cDNAs encoding the complete sequence of decay-accelerating factor of human complement

International Nuclear Information System (INIS)

Medof, M.E.; Lublin, D.M.; Holers, V.M.; Ayers, D.J.; Getty, R.R.; Leykam, J.F.; Atkinson, J.P.; Tykocinski, M.L.

1987-01-01

cDNAs encoding the complement decay-accelerating factor (DAF) were isolated from HeLa and differentiated HL-60 λgt cDNA libraries by screening with a codon preference oligonucleotide corresponding to DAF NH2-terminal amino acids 3-14. The composite cDNA sequence showed a 347-amino acid protein preceded by an NH2-terminal leader peptide sequence. The translated sequence beginning at the DAF NH2 terminus encodes four contiguous ≅61-amino acid long repetitive units of internal homology. The repetitive regions contain four conserved cysteines, one proline, one glycine, one glycine/alanine, four leucines/isoleucines/valines, one serine, three tyrosines/phenylalanines, and one tryptophan and show striking homology to similar regions previously identified in factor B, C2, C4 binding protein, factor H, C1r, factor XIII, interleukin 2 receptor, and serum β2-glycoprotein I. The consensus repeats are attached to a 70-amino acid long segment rich in serine and threonine (potential O-glycosylation sites), which is in turn followed by a stretch of hydrophobic amino acids. RNA blot analysis of HeLa and HL-60 RNA revealed three DAF mRNA species of 3.1, 2.7, and 2.0 kilobases. The results indicate that portions of the DAF gene may have evolved from a DNA element common to the above proteins, that DAF cDNA predicts a COOH-terminal anchoring polypeptide, and that distinct species of DAF message are elaborated in cells
Sequence comparison and phylogenetic analysis of core gene of ...

African Journals Online (AJOL)

2010-07-19

Jul 19, 2010 ... and antisense primers, a single band of 573 base pairs .... Amino acid sequence alignment of Cluster I and Cluster II of phylogenetic tree. First ten sequences ... sequence weighting, position-specific gap penalties and weight.
Chimera: construction of chimeric sequences for phylogenetic analysis

NARCIS (Netherlands)

Leunissen, J.A.M.

2003-01-01

Chimera allows the construction of chimeric protein or nucleic acid sequence files by concatenating sequences from two or more sequence files in PHYLIP formats. It allows the user to interactively select genes and species from the input files. The concatenated result is stored to one single output
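The concatenation step described above can be sketched in a few lines. This is not the Chimera program itself: the parser below handles only a relaxed sequential PHYLIP layout (a header line with the sequence count and length, then one `name sequence` pair per line), and the function names and toy alignments are hypothetical.

```python
def parse_phylip(text):
    """Parse a relaxed sequential PHYLIP string into {name: sequence}.

    Assumes one record per line with whitespace between name and sequence;
    strict 10-column names and interleaved files are not handled.
    """
    lines = [line for line in text.strip().splitlines() if line.strip()]
    n_seqs, n_sites = (int(field) for field in lines[0].split())
    records = {}
    for line in lines[1:1 + n_seqs]:
        name, seq = line.split(None, 1)
        seq = seq.replace(" ", "")
        if len(seq) != n_sites:
            raise ValueError(f"{name}: expected {n_sites} sites, got {len(seq)}")
        records[name] = seq
    return records


def concatenate(alignments, species):
    """Join the sequences of each selected species across all alignments."""
    return {sp: "".join(aln[sp] for aln in alignments) for sp in species}


# Two invented single-gene alignments for two species.
GENE_1 = "2 4\nhuman ACGT\nmouse ACGA\n"
GENE_2 = "2 3\nhuman GGC\nmouse GGT\n"
TOY_CHIMERA = concatenate(
    [parse_phylip(GENE_1), parse_phylip(GENE_2)], ["human", "mouse"]
)
```

A species missing from any input alignment raises `KeyError` here; padding such entries with gap characters would be an alternative design, at the cost of hiding incomplete data.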
Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

Science.gov (United States)

Zimmermann, Karel; Gibrat, Jean-François

2010-01-04

Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.
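The statement that matrix entries correspond to dot products of vectors can be illustrated on a toy case. For a symmetric positive semidefinite matrix the singular value decomposition coincides with the eigendecomposition M = V Λ Vᵀ, and the rows of V Λ^(1/2) are vectors whose pairwise dot products reproduce M. The sketch below assumes exactly that situation: the 3×3 matrix is invented and positive definite, and the small Jacobi solver is a stand-in for a library SVD routine. Real substitution matrices can have negative eigenvalues, and how the authors treat them is not described in the abstract.

```python
import math


def jacobi_eigh(matrix, sweeps=50, tol=1e-22):
    """Eigenvalues and eigenvectors (as columns) of a small symmetric matrix."""
    n = len(matrix)
    a = [row[:] for row in matrix]
    v = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes the (p, q) entry.
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
                for k in range(n):  # accumulate eigenvectors
                    vkp, vkq = v[k][p], v[k][q]
                    v[k][p], v[k][q] = c * vkp - s * vkq, s * vkp + c * vkq
    return [a[i][i] for i in range(n)], v


def embed(matrix):
    """Row i of the result is a vector x_i with x_i . x_j == matrix[i][j]."""
    eigenvalues, vectors = jacobi_eigh(matrix)
    if min(eigenvalues) < -1e-9:
        raise ValueError("negative eigenvalue: no exact Euclidean embedding")
    n = len(matrix)
    return [[vectors[i][k] * math.sqrt(max(eigenvalues[k], 0.0)) for k in range(n)]
            for i in range(n)]


def dot(u, w):
    return sum(x * y for x, y in zip(u, w))


# Invented symmetric positive definite "substitution matrix" for three residues.
TOY_MATRIX = [[4.0, 1.0, -1.0], [1.0, 3.0, 0.0], [-1.0, 0.0, 2.0]]
TOY_VECTORS = embed(TOY_MATRIX)
```

Keeping only the components with the largest eigenvalues would give a lower-dimensional approximate encoding, which is the usual reason for using a decomposition of this kind.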
NEAT-FLEX: Predicting the conformational flexibility of amino acids using neuroevolution of augmenting topologies.

Science.gov (United States)

Grisci, Bruno; Dorn, Márcio

2017-06-01

The development of computational methods to accurately model three-dimensional protein structures from sequences of amino acid residues is becoming increasingly important to the structural biology field. This paper addresses the challenge of predicting the tertiary structure of a given amino acid sequence, which has been reported to belong to the NP-Complete class of problems. We present a new method, namely NEAT-FLEX, based on NeuroEvolution of Augmenting Topologies (NEAT) to extract structural features from (ABS) proteins that are determined experimentally. The proposed method manipulates structural information from the Protein Data Bank (PDB) and predicts the conformational flexibility (FLEX) of residues of a target amino acid sequence. This information may be used in three-dimensional structure prediction approaches as a way to reduce the conformational search space. The proposed method was tested with 24 different amino acid sequences. Evolving neural networks were compared against a traditional error back-propagation algorithm; results show that the proposed method is a powerful way to extract and represent structural information from protein molecules that are determined experimentally.
Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

KAUST Repository

Chen, Peng; Li, Jinyan; Limsoon, Wong; Kuwahara, Hiroyuki; Huang, Jianhua Z.; Gao, Xin

2013-01-01

Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.
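The scheme outlined in this abstract, one nearest-neighbour classifier per feature combined by voting, can be sketched as follows. This is an illustration of the general idea only: it uses a plain k-nearest-neighbour rule rather than the IBk implementation, omits the authors' encoding schema and classifier selection step, and all feature values and labels are invented.

```python
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify a scalar query from (value, label) pairs by majority of k nearest."""
    nearest = sorted(train, key=lambda item: abs(item[0] - query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def ensemble_predict(samples, labels, query, k=3):
    """Train one single-feature k-NN classifier per feature and take a majority vote."""
    votes = []
    for f in range(len(query)):
        train = [(sample[f], label) for sample, label in zip(samples, labels)]
        votes.append(knn_predict(train, query[f], k))
    return Counter(votes).most_common(1)[0][0]


# Invented training residues described by three hypothetical physicochemical features.
TOY_SAMPLES = [
    (1.0, 0.2, 5.0), (1.2, 0.1, 4.8), (0.9, 0.3, 5.2),
    (3.0, 0.9, 1.0), (3.1, 0.8, 1.3), (2.9, 1.0, 0.9),
]
TOY_LABELS = ["hot", "hot", "hot", "non", "non", "non"]
TOY_CALL = ensemble_predict(TOY_SAMPLES, TOY_LABELS, (1.1, 0.25, 4.9))
```

Using an odd number of voters and an odd k avoids ties in this two-class toy setting; a weighted vote would be the natural extension if some features are more reliable than others.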
A Glutamic Acid-Producing Lactic Acid Bacteria Isolated from Malaysian Fermented Foods

Science.gov (United States)

Zareian, Mohsen; Ebrahimpour, Afshin; Bakar, Fatimah Abu; Mohamed, Abdul Karim Sabo; Forghani, Bita; Ab-Kadir, Mohd Safuan B.; Saari, Nazamid

2012-01-01

L-Glutamic acid is the principal excitatory neurotransmitter in the brain and an important intermediate in metabolism. In the present study, lactic acid bacteria (218) were isolated from six different fermented foods as potent sources of glutamic acid producers. The presumptive bacteria were tested for their ability to synthesize glutamic acid. Out of the 35 strains showing this capability, strain MNZ was determined as the highest glutamic-acid producer. Identification tests including 16S rRNA gene sequencing and sugar assimilation ability identified the strain MNZ as Lactobacillus plantarum. The characteristics of this microorganism related to its glutamic acid-producing ability, growth rate, glucose consumption and pH profile were studied. Results revealed that glutamic acid was formed inside the cell and excreted into the extracellular medium. Glutamic acid production was found to be growth-associated and glucose significantly enhanced glutamic acid production (1.032 mmol/L) compared to other carbon sources. A concentration of 0.7% ammonium nitrate as a nitrogen source effectively enhanced glutamic acid production. To the best of our knowledge this is the first report of glutamic acid production by lactic acid bacteria. The results of this study can be further applied for developing functional foods enriched in glutamic acid and subsequently γ-amino butyric acid (GABA) as a bioactive compound. PMID:22754309
Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

Directory of Open Access Journals (Sweden)

Zijian Li

2014-02-01

Amino acid substitutions in influenza A virus are the main reasons for both antigenic shift and virulence change, which result from non-synonymous mutations in the viral genome. Nucleocapsid protein (NP), one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digestion of nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1), which was isolated and purified by SDS poly-acrylamide gel electrophoresis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues, R452K, T423A and N430T, in two tryptic peptides. The results provided experimental evidence that amino acid substitutions resulting from non-synonymous gene mutations could be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.
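A substitution is visible to mass spectrometry because it shifts the peptide mass by the difference between the two residue masses. The sketch below parses the substitution notation used in this abstract and computes that shift from standard monoisotopic residue masses. It is an illustration of the principle, not the authors' workflow; the function name is hypothetical.

```python
import re

# Monoisotopic residue masses in Da (standard values, rounded to 5 decimals).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def substitution_shift(code):
    """Return (position, mass shift in Da) for a code such as 'R452K'."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"unrecognized substitution code: {code!r}")
    old, position, new = match.group(1), int(match.group(2)), match.group(3)
    return position, MONO[new] - MONO[old]


# The three substitutions named in the abstract.
SHIFTS = {code: substitution_shift(code) for code in ("R452K", "T423A", "N430T")}
```

Note that R452K also changes a tryptic cleavage site into another one (trypsin cleaves after both Arg and Lys), so the peptide boundaries stay the same and only the mass changes.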
Identification of multiple mRNA and DNA sequences from small tissue samples isolated by laser-assisted microdissection.

Science.gov (United States)

Bernsen, M R; Dijkman, H B; de Vries, E; Figdor, C G; Ruiter, D J; Adema, G J; van Muijen, G N

1998-10-01

Molecular analysis of small tissue samples has become increasingly important in biomedical studies. Using a laser dissection microscope and modified nucleic acid isolation protocols, we demonstrate that multiple mRNA as well as DNA sequences can be identified from a single-cell sample. In addition, we show that the specificity of procurement of tissue samples is not compromised by smear contamination resulting from scraping of the microtome knife during sectioning of lesions. The procedures described herein thus allow for efficient RT-PCR or PCR analysis of multiple nucleic acid sequences from small tissue samples obtained by laser-assisted microdissection.
Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

Directory of Open Access Journals (Sweden)

Nagib Ahsan

2012-07-01

The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.
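Scanning mutagenesis of flanking residues amounts to enumerating single substitutions around a fixed site. The sketch below generates such a variant library for a peptide; the peptide `GHSMSDPG`, the chosen site index, and the flank width are hypothetical values for illustration and are not the constructs used in the study.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def scan_variants(peptide, site, flank=3):
    """Enumerate single substitutions in the residues flanking a fixed site.

    `site` is the 0-based index of the residue that is left untouched
    (for example the phosphorylated serine). Returns (label, sequence)
    pairs with 1-based labels such as 'M4L'.
    """
    variants = []
    start = max(0, site - flank)
    stop = min(len(peptide), site + flank + 1)
    for i in range(start, stop):
        if i == site:
            continue
        for aa in AMINO_ACIDS:
            if aa != peptide[i]:
                label = f"{peptide[i]}{i + 1}{aa}"
                variants.append((label, peptide[:i] + aa + peptide[i + 1:]))
    return variants


# Hypothetical peptide; index 2 is treated as the fixed phosphorylation site.
TOY_VARIANTS = scan_variants("GHSMSDPG", site=2, flank=2)
```

With a flank of two residues on each side this yields 4 positions × 19 alternatives, so the library size grows linearly with the flank width rather than combinatorially.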
In Silico Characterization of Pectate Lyase Protein Sequences from Different Source Organisms

Directory of Open Access Journals (Sweden)

Amit Kumar Dubey

2010-01-01

A total of 121 protein sequences of pectate lyases were subjected to homology search, multiple sequence alignment, phylogenetic tree construction, and motif analysis. The phylogenetic tree constructed revealed different clusters based on different source organisms representing bacterial, fungal, plant, and nematode pectate lyases. The multiple accessions of bacterial, fungal, nematode, and plant pectate lyase protein sequences were placed closely revealing a sequence level similarity. The multiple sequence alignment of these pectate lyase protein sequences from different source organisms showed conserved regions at different stretches with maximum homology from amino acid residues 439–467, 715–816, and 829–910 which could be used for designing degenerate primers or probes specific for pectate lyases. The motif analysis revealed a conserved Pec_Lyase_C domain uniformly observed in all pectate lyases irrespective of variable sources suggesting its possible role in structural and enzymatic functions.
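The idea of reading conserved stretches off a multiple sequence alignment can be sketched as follows. This is only an illustration under stated assumptions: the function name `conserved_runs`, the `min_len` threshold, and the toy alignment are hypothetical, and "conserved" is simplified here to strict identity in every row, which is stricter than the "maximum homology" criterion the abstract reports.

```python
def conserved_runs(alignment: list[str], min_len: int = 3) -> list[tuple[int, int]]:
    """Return 1-based (start, end) column ranges where all rows share one residue.

    Assumption: 'conserved' means identical, non-gap residues in every row.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned rows must have the same length")

    runs = []
    start = None  # 0-based index where the current conserved run began
    for col in range(length):
        residues = {row[col] for row in alignment}
        conserved = len(residues) == 1 and "-" not in residues
        if conserved and start is None:
            start = col
        elif not conserved and start is not None:
            if col - start >= min_len:
                runs.append((start + 1, col))  # convert to 1-based, inclusive
            start = None
    if start is not None and length - start >= min_len:
        runs.append((start + 1, length))
    return runs


# Toy alignment (hypothetical, not pectate lyase data).
toy_alignment = ["MKTAYLGD", "MKTCYLGD", "MKTAYLGE"]
print(conserved_runs(toy_alignment))
```

Ranges found this way are the kind of stretch one might inspect when designing degenerate primers, although a real analysis would normally use a similarity score rather than strict identity.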
The catalytic chain of human complement subcomponent C1r. Purification and N-terminal amino acid sequences of the major cyanogen bromide-cleavage fragments.

Science.gov (United States)

Arlaud, G J; Gagnon, J; Porter, R R

1982-01-01

1. The a- and b-chains of reduced and alkylated human complement subcomponent C1r were separated by high-pressure gel-permeation chromatography and isolated in good yield and in pure form. 2. CNBr cleavage of C1r b-chain yielded eight major peptides, which were purified by gel filtration and high-pressure reversed-phase chromatography. As determined from the sum of their amino acid compositions, these peptides accounted for a minimum molecular weight of 28 000, close to the value 29 100 calculated from the whole b-chain. 3. N-Terminal sequence determinations of C1r b-chain and its CNBr-cleavage peptides allowed the identification of about two-thirds of the amino acids of C1r b-chain. From our results, and on the basis of homology with other serine proteinases, an alignment of the eight CNBr-cleavage peptides from C1r b-chain is proposed. 4. The residues forming the 'charge-relay' system of the active site of serine proteinases (His-57, Asp-102 and Ser-195 in the chymotrypsinogen numbering) are found in the corresponding regions of C1r b-chain, and the amino acid sequence around these residues has been determined. 5. The N-terminal sequence of C1r b-chain has been extended to residue 60 and reveals that C1r b-chain lacks the 'histidine loop', a disulphide bond that is present in all other known serine proteinases.
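Cyanogen bromide cleaves a polypeptide on the C-terminal side of methionine residues, which is why CNBr digestion yields a defined set of peptides. A minimal sketch of that rule is given below; the function name `cnbr_fragments` and the toy sequence are hypothetical (this is not the C1r b-chain sequence), and incomplete cleavage is ignored.

```python
def cnbr_fragments(sequence: str) -> list[str]:
    """Split a one-letter protein sequence after every methionine (M).

    Simplifying assumption: cleavage is complete at every Met.
    """
    fragments = []
    current = []
    for residue in sequence:
        current.append(residue)
        if residue == "M":
            fragments.append("".join(current))
            current = []
    if current:  # C-terminal fragment that does not end in Met
        fragments.append("".join(current))
    return fragments


# Toy sequence for illustration only.
toy_sequence = "ACDMEFGHMKLMNPQ"
print(cnbr_fragments(toy_sequence))
```

Under this idealised rule, a chain with n internal methionines gives n + 1 fragments, and concatenating the fragments in order restores the original chain.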
[Comparative genomics and evolutionary analysis of CRISPR loci in acetic acid bacteria].

Science.gov (United States)

Xia, Kai; Liang, Xin-le; Li, Yu-dong

2015-12-01

The clustered regularly interspaced short palindromic repeat (CRISPR) system is a widespread adaptive immunity system, present in most archaea and many bacteria, that acts against foreign DNA such as phages, viruses and plasmids. In general, a CRISPR system consists of direct repeat, leader, spacer and CRISPR-associated sequences. Acetic acid bacteria (AAB) play an important role in industrial fermentation of vinegar and bioelectrochemistry. To investigate the polymorphism and evolution pattern of CRISPR loci in acetic acid bacteria, bioinformatic analyses were performed on 48 species from three main genera (Acetobacter, Gluconacetobacter and Gluconobacter) with whole genome sequences available from the NCBI database. The results showed that the CRISPR system existed in 32 of the 48 strains studied. Most of the CRISPR-Cas systems in AAB belonged to the type I CRISPR-Cas system (subtypes E and C), but the type II CRISPR-Cas system, which contains the cas9 gene, was found only in the genera Acetobacter and Gluconacetobacter. The repeat sequences of some CRISPR loci were highly conserved among species from different genera, and the leader sequences of some CRISPR loci possessed a conserved motif, which was associated with regulated promoters. Moreover, phylogenetic analysis of cas1 genes demonstrated that they were suitable for classification of species. The conservation of cas1 genes was associated with that of repeat sequences among different strains, suggesting they were subjected to similar functional constraints. Moreover, the number of spacers was positively correlated with the number of prophages and insertion sequences, indicating that the acetic acid bacteria were continually invaded by new foreign DNA. The comparative analysis of CRISPR loci in acetic acid bacteria provided the basis for investigating the molecular mechanism of different acetic acid tolerance and genome stability in acetic acid bacteria.
Next generation sequencing (NGS) technologies and applications

Energy Technology Data Exchange (ETDEWEB)

Vuyisich, Momchilo [Los Alamos National Laboratory]

2012-09-11

NGS technology overview: (1) NGS library preparation - Nucleic acids extraction, Sample quality control, RNA conversion to cDNA, Addition of sequencing adapters, Quality control of library; (2) Sequencing - Clonal amplification of library fragments (except PacBio), Sequencing by synthesis, Data output (reads and quality); and (3) Data analysis - Read mapping, Genome assembly, Gene expression, Operon structure, sRNA discovery, and Epigenetic analyses.
In Silico Phylogenetic Analysis and Molecular Modelling Study of 2-Haloalkanoic Acid Dehalogenase Enzymes from Bacterial and Fungal Origin

Directory of Open Access Journals (Sweden)

Raghunath Satpathy

2016-01-01

2-Haloalkanoic acid dehalogenase enzymes, which are widely distributed in fungi and bacteria, have a broad range of applications, from bioremediation to chemical synthesis of useful compounds. In the present study, a total of 81 full-length protein sequences of 2-haloalkanoic acid dehalogenase from bacteria and fungi were retrieved from the NCBI database. Sequence analyses such as multiple sequence alignment (MSA), conserved motif identification, computation of amino acid composition, and phylogenetic tree construction were performed on these primary sequences. From the MSA analysis, it was observed that the sequences share conserved lysine (K) and aspartate (D) residues. Also, the phylogenetic tree indicated a subcluster comprising both fungal and bacterial species. Because no experimental 3D structure of a fungal 2-haloalkanoic acid dehalogenase is available in the PDB, a molecular modelling study was performed for both the fungal and bacterial enzymes present in the subcluster. Further structural analysis revealed a common evolutionary topology shared between the fungal and bacterial enzymes. Studies on the buried amino acids showed highly conserved Leu and Ser in the core, despite variation in their amino acid percentage. Additionally, a surface-exposed tryptophan was conserved in all of the selected models.
Isolation of a novel abscisic acid stress ripening ( OsASR ) gene ...

African Journals Online (AJOL)

Isolation of a novel abscisic acid stress ripening ( OsASR ) gene from rice and analysis of the response of this gene to abiotic stresses. ... The cDNA with the whole open reading frame (ORF) was amplified by PCR and cloned. Sequence analysis showed that the cDNA encodes a protein of 284 amino acid residues with ...
Molecular cloning, nucleotide sequence, and expression of the gene encoding human eosinophil differentiation factor (interleukin 5)

International Nuclear Information System (INIS)

Campbell, H.D.; Tucker, W.Q.J.; Hort, Y.; Martinson, M.E.; Mayo, G.; Clutterbuck, E.J.; Sanderson, C.J.; Young, I.G.

1987-01-01

The human eosinophil differentiation factor (EDF) gene was cloned from a genomic library in λ phage EMBL3A by using a murine EDF cDNA clone as a probe. The DNA sequence of a 3.2-kilobase BamHI fragment spanning the gene was determined. The gene contains three introns. The predicted amino acid sequence of 134 amino acids is identical with that recently reported for human interleukin 5 but shows no significant homology with other known hemopoietic growth regulators. The amino acid sequence shows strong homology (∼ 70% identity) with that of murine EDF. Recombinant human EDF, expressed from the human EDF gene after transfection into monkey COS cells, stimulated the production of eosinophils and eosinophil colonies from normal human bone marrow but had no effect on the production of neutrophils or mononuclear cells (monocytes and lymphoid cells). The apparent specificity of human EDF for the eosinophil lineage in myeloid hemopoiesis contrasts with the properties of human interleukin 3 and granulocyte/macrophage and granulocyte colony-stimulating factors but is directly analogous to the biological properties of murine EDF. Human EDF therefore represents a distinct hemopoietic growth factor that could play a central role in the regulation of eosinophilia
Adhesive proteins of stalked and acorn barnacles display homology with low sequence similarities.

Directory of Open Access Journals (Sweden)

Jaimie-Leigh Jonker

Barnacle adhesion underwater is an important phenomenon to understand for the prevention of biofouling and potential biotechnological innovations, yet so far, identifying what makes barnacle glue proteins 'sticky' has proved elusive. Examination of a broad range of species within the barnacles may be instructive to identify conserved adhesive domains. We add to extensive information from the acorn barnacles (order Sessilia) by providing the first protein analysis of a stalked barnacle adhesive, Lepas anatifera (order Lepadiformes). It was possible to separate the L. anatifera adhesive into at least 10 protein bands using SDS-PAGE. Intense bands were present at approximately 30, 70, 90 and 110 kilodaltons (kDa). Mass spectrometry for protein identification was followed by de novo sequencing which detected 52 peptides of 7-16 amino acids in length. None of the peptides matched published or unpublished transcriptome sequences, but some amino acid sequence similarity was apparent between L. anatifera and closely-related Dosima fascicularis. Antibodies against two acorn barnacle proteins (ab-cp-52k and ab-cp-68k) showed cross-reactivity in the adhesive glands of L. anatifera. We also analysed the similarity of adhesive proteins across several barnacle taxa, including Pollicipes pollicipes (a stalked barnacle in the order Scalpelliformes). Sequence alignment of published expressed sequence tags clearly indicated that P. pollicipes possesses homologues for the 19 kDa and 100 kDa proteins in acorn barnacles. Homology aside, sequence similarity in amino acid and gene sequences tended to decline as taxonomic distance increased, with minimum similarities of 18-26%, depending on the gene. The results indicate that some adhesive proteins (e.g. 100 kDa) are more conserved within barnacles than others (20 kDa).
Characterization of promoter sequence of toll-like receptor genes in Vechur cattle

Directory of Open Access Journals (Sweden)

R. Lakshmi

2016-06-01

Aim: To analyze the promoter sequence of toll-like receptor (TLR) genes in Vechur cattle, an indigenous breed of Kerala, against the sequence of Bos taurus and assess the differences that could be attributed to innate immune responses against bovine mastitis. Materials and Methods: Blood samples were collected from the jugular vein of Vechur cattle maintained at the Vechur cattle conservation center of Kerala Veterinary and Animal Sciences University, using an acid-citrate-dextrose anticoagulant. The genomic DNA was extracted, and polymerase chain reaction was carried out to amplify the promoter region of TLRs. The amplified product of the TLR2, 4, and 9 promoter regions was sequenced by the Sanger enzymatic DNA sequencing technique. Results: The sequence of the promoter region of TLR2 of Vechur cattle showed 98% similarity with the B. taurus sequence present in GenBank and revealed variants for four sequence motifs. The sequence of the promoter region of TLR4 of Vechur cattle showed 99% similarity with the B. taurus sequence but did not reveal significant variants in motif regions. However, two heterozygous loci were observed in the chromatogram. The promoter sequence of the TLR9 gene also showed 99% similarity to the B. taurus sequence and revealed variants for four sequence motifs. Conclusion: The results of this study indicate significant variation in the promoters of the TLR2 and TLR9 genes in the Vechur cattle breed, which may potentially influence the innate immune response against mastitis.
BGL6 beta-glucosidase and nucleic acids encoding the same

Science.gov (United States)

Dunn-Coleman, Nigel [Los Gatos, CA]; Ward, Michael [San Francisco, CA]

2009-09-01

The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
Identities among actin-encoding cDNAs of the Nile tilapia (Oreochromis niloticus) and other eukaryote species revealed by nucleotide and amino acid sequence analyses

Directory of Open Access Journals (Sweden)

Andréia B. Poletto

2008-01-01

Actin-encoding cDNAs of Nile tilapia (Oreochromis niloticus) were isolated by RT-PCR using total RNA samples of different tissues and further characterized by nucleotide sequencing and in silico amino acid (aa) sequence analysis. Comparisons among the actin gene sequences of O. niloticus and those of other species showed that the isolated genes have high similarity to other fish and other vertebrate actin genes. The highest nucleotide resemblance was observed between O. niloticus and O. mossambicus α-actin and β-actin genes. Analysis of the predicted aa sequences revealed two distinct types of cytoplasmic actins, one cardiac muscle actin type and one skeletal muscle actin type that were expressed in different tissues of Nile tilapia. The evolutionary relationships between the Nile tilapia actin genes and those of diverse other organisms are discussed.

Mixed cultures of Kimchi lactic acid bacteria show increased cell ...

African Journals Online (AJOL)

ufuoma

production and amino acid release among the tested bacteria. W. koreensis 521 ... production of fermented food products, such as yogurt, cheese, sauerkraut and ... habits, stress and excessive dieting (Kapka-Skrzypczak et al., 2012). Mixed ...
Deep sequencing shows low-level oncogenic hepatitis B virus variants persists post-liver transplant despite potent anti-HBV prophylaxis.

Science.gov (United States)

Lau, K C K; Osiowy, C; Giles, E; Lusina, B; van Marle, G; Burak, K W; Coffin, C S

2018-01-06

Recent studies suggest that withdrawal of hepatitis B immune globulin (HBIG) and nucleos(t)ide analogues (NA) prophylaxis may be considered in HBV surface antigen (HBsAg)-negative liver transplant (LT) recipients with a low risk of disease recurrence. However, the frequency of occult HBV infection (OBI) and HBV variants after LT in the current era of potent NA therapy is unknown. Twelve LT recipients on prophylaxis were tested in matched plasma and peripheral blood mononuclear cells (PBMCs) for HBV quasispecies by in-house nested PCR and next-generation sequencing of amplicons. HBV covalently closed circular DNA (cccDNA) was detected in Hirt DNA isolated from PBMCs with cccDNA-specific primers and confirmed by nucleic acid hybridization and Sanger sequencing. HBV mRNA in PBMC was detected with reverse-transcriptase nested PCR. In LT recipients on immunosuppressive therapy (10/12 male; median age 57.5 [IQR: 39.8-66.5]; median follow-up post-LT 60 months; 6 pre-LT hepatocellular carcinoma [HCC]), 9 were HBsAg-. HBV DNA was detected in all plasma and PBMC tested; cccDNA and/or mRNA was detected in the PBMC of 10/12 patients. Significant HBV quasispecies diversity (ie 143-2212 nonredundant HBV species) was noted in both sites, and single nucleotide polymorphisms associated with cirrhosis and HCC were detected at varying frequencies. In conclusion, OBI and HBV variants associated with severe liver disease persist in LT recipients on prophylaxis. Although HBV control and cccDNA transcriptional silencing may occur despite immunosuppression, complete virological eradication does not occur in LT recipients with a history of HBV-related end-stage liver disease. © 2018 John Wiley & Sons Ltd.
MIPS: a database for genomes and protein sequences.

Science.gov (United States)

Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidopsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
cDNA sequences of two apolipoproteins from lamprey

International Nuclear Information System (INIS)

Pontes, M.; Xu, X.; Graham, D.; Riley, M.; Doolittle, R.F.

1987-01-01

The messages for two small but abundant apolipoproteins found in lamprey blood plasma were cloned with the aid of oligonucleotide probes based on amino-terminal sequences. In both cases, numerous clones were identified in a lamprey liver cDNA library, consistent with the great abundance of these proteins in lamprey blood. One of the cDNAs (LAL1) has a coding region of 105 amino acids that corresponds to a 21-residue signal peptide, a putative 8-residue propeptide, and the 76-residue mature protein found in blood. The other cDNA (LAL2) codes for a total of 191 residues, the first 23 of which constitute a signal peptide. The two proteins, which occur in the high-density lipoprotein fraction of ultracentrifuged plasma, have amino acid compositions similar to those of apolipoproteins found in mammalian blood; computer analysis indicates that the sequences are largely helix-permissive. When the sequences were searched against an amino acid sequence data base, rat apolipoprotein IV was the best matching candidate in both cases. Although a reasonable alignment can be made with that sequence and LAL1, definitive assignment of the two lamprey proteins to typical mammalian classes cannot be made at this point
Multimodal sequence learning.

Science.gov (United States)

Kemény, Ferenc; Meier, Beat

2016-02-01

While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.
Multiplex, rapid and sensitive isothermal detection of nucleic-acid sequence by endonuclease restriction-mediated real-time multiple cross displacement amplification

Directory of Open Access Journals (Sweden)

Yi Wang

2016-05-01

We have devised a novel isothermal amplification technology, termed endonuclease restriction-mediated real-time multiple cross displacement amplification (ET-MCDA), which facilitated multiplex, rapid, specific and sensitive detection of nucleic-acid sequences at a constant temperature. The ET-MCDA integrated multiple cross displacement amplification strategy, restriction endonuclease cleavage and real-time fluorescence detection technique. In the ET-MCDA system, the functional cross primer E-CP1 or E-CP2 was constructed by adding a short sequence at the 5’ end of CP1 or CP2, respectively, and the new E-CP1 or E-CP2 primer was labelled at the 5’ end with a fluorophore and in the middle with a dark quencher. The restriction endonuclease Nb.BsrDI specifically recognized the short sequence and digested the newly synthesized double-stranded terminal sequences (5’ end short sequences and their complementary sequences), which released the quenching, resulting in a gain of fluorescence signal. Thus, the ET-MCDA allowed real-time detection of single or multiple targets in only a single reaction, and the positive results were observed in as little as 12 minutes, detecting down to 3.125 fg of genomic DNA per tube. Moreover, the analytical specificity and the practical application of the ET-MCDA were also successfully evaluated in this study. Here we provided the details on the novel ET-MCDA technique and expounded the basic ET-MCDA amplification mechanism.
Multiplex, Rapid, and Sensitive Isothermal Detection of Nucleic-Acid Sequence by Endonuclease Restriction-Mediated Real-Time Multiple Cross Displacement Amplification.

Science.gov (United States)

Wang, Yi; Wang, Yan; Zhang, Lu; Liu, Dongxin; Luo, Lijuan; Li, Hua; Cao, Xiaolong; Liu, Kai; Xu, Jianguo; Ye, Changyun

2016-01-01

We have devised a novel isothermal amplification technology, termed endonuclease restriction-mediated real-time multiple cross displacement amplification (ET-MCDA), which facilitated multiplex, rapid, specific and sensitive detection of nucleic-acid sequences at a constant temperature. The ET-MCDA integrated multiple cross displacement amplification strategy, restriction endonuclease cleavage and real-time fluorescence detection technique. In the ET-MCDA system, the functional cross primer E-CP1 or E-CP2 was constructed by adding a short sequence at the 5' end of CP1 or CP2, respectively, and the new E-CP1 or E-CP2 primer was labeled at the 5' end with a fluorophore and in the middle with a dark quencher. The restriction endonuclease Nb.BsrDI specifically recognized the short sequence and digested the newly synthesized double-stranded terminal sequences (5' end short sequences and their complementary sequences), which released the quenching, resulting in a gain of fluorescence signal. Thus, the ET-MCDA allowed real-time detection of single or multiple targets in only a single reaction, and the positive results were observed in as little as 12 min, detecting down to 3.125 fg of genomic DNA per tube. Moreover, the analytical specificity and the practical application of the ET-MCDA were also successfully evaluated in this study. Here, we provided the details on the novel ET-MCDA technique and expounded the basic ET-MCDA amplification mechanism.
Molecular Cloning and Sequencing of Alkalophilic Cellulosimicrobium cellulans CKMX1 Xylanase Gene Isolated from Mushroom Compost and Characterization of the Gene Product

Directory of Open Access Journals (Sweden)

Abhishek Walia

2015-12-01

A xylanolytic bacterium was isolated from mushroom compost by using an enrichment technique. Results from metabolic fingerprinting, whole-cell fatty acid methyl ester analysis and 16S rDNA sequencing suggested the bacterium to be Cellulosimicrobium cellulans CKMX1. Due to the xylanolytic activity of this bacterium, isolation and characterization of the xylanase gene were attempted. A distinct fragment of about 1671 bp was successfully amplified using PCR and cloned into Escherichia coli DH5α. A BLAST search confirmed that the DNA sequence from the amplified fragment was endo-1,4-beta-xylanase, a member of glycoside hydrolase family 11. It showed 98% homology with the Cellulosimicrobium sp. xylanase gene (Accession no. FJ859907.1) reported from the gut of Eisenia fetida in Korea. In silico physico-chemical characterization of the amino acid sequence of the xylanase showed an open reading frame encoding a 556 amino acid sequence with a molecular weight of 58 kDa; a theoretical isoelectric point (pI) of 4.46 was computed using Expasy's ProtParam server. Secondary and homology-based 3D structures of the xylanase were analysed using SOPMA and Swiss-Prot software.
RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

Science.gov (United States)

Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

1990-07-01

The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.
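The lengths quoted in this abstract are mutually consistent, which a few lines of arithmetic can confirm. In the sketch below the constant and function names are hypothetical, and the reading that the 3555-nucleotide open reading frame includes one stop codon is an assumption inferred from the numbers rather than something the abstract states.

```python
ORF_NUCLEOTIDES = 3555      # from the abstract
POLYPROTEIN_CODONS = 1184   # from the abstract
CLEAVAGE_AFTER_RESIDUE = 680  # Arg/Gly site lies between residues 680 and 681
COAT_PROTEIN_LENGTH = 504   # from the abstract


def coding_codons(orf_nt: int, includes_stop: bool = True) -> int:
    """Number of amino-acid-coding codons in an ORF of orf_nt nucleotides."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    total = orf_nt // 3
    return total - 1 if includes_stop else total


def c_terminal_product_length(polyprotein_length: int, cleavage_after: int) -> int:
    """Length of the product released C-terminal to a single cleavage site."""
    return polyprotein_length - cleavage_after


print(coding_codons(ORF_NUCLEOTIDES) == POLYPROTEIN_CODONS)
print(c_terminal_product_length(POLYPROTEIN_CODONS, CLEAVAGE_AFTER_RESIDUE)
      == COAT_PROTEIN_LENGTH)
```

Both checks hold: 3555 / 3 = 1185 codons, or 1184 once a stop codon is set aside, and 1184 − 680 = 504 residues for the coat protein at the C-terminal end of the polyprotein.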
Carbon isotope composition of intermediates of the starch-malate sequence and level of the crassulacean acid metabolism in leaves of Kalanchoe blossfeldiana Tom Thumb.

Science.gov (United States)

Deleens, E; Garnier-Dardart, J; Queiroz, O

1979-09-01

Isotope analyses were performed on biochemical fractions isolated from leaves of Kalanchoe blossfeldiana Tom Thumb. during aging under long days or short days. Irrespective of the age or photoperiodic conditions, the intermediates of the starch-malate sequence (starch, phosphorylated compounds and organic acids) have a level of (13)C higher than that of soluble sugars, cellulose and hemicellulose. In short days, the activity of the crassulacean acid metabolism pathway is predominant as compared to that of C3 pathway: leaves accumulate organic acids, rich in (13)C. In long days, the activity of the crassulacean acid metabolism pathway increases as the leaves age, remaining, however, relatively low as compared to that of C3 pathway: leaves accumulate soluble sugars, poor in (13)C. After photoperiodic change (long days→short days), isotopic modifications of starch and organic acids suggest evidence for a lag phase in the establishment of the crassulacean acid metabolism pathway specific to short days. The relative proportions of carbon from a C3-origin (RuBPC activity as strong discriminating step, isotope discrimination in vivo=20‰) or C4-origin (PEPC activity as weak discriminating step, isotope discrimination in vivo=4‰) present in the biochemical fractions were calculated from their δ(13)C values. Under long days, 30 to 70% versus 80 to 100% under short days, of the carbon of the intermediates linked to the starch-malate sequence, or CAM pathway (starch, phosphorylated compounds and organic acids), have a C4-origin. Products connected to the C3 pathway (free sugars, cellulose, hemicellulose) have 0 to 50% of their carbon, arising from reuptake of the C4 from malate, under long days versus 30 to 70% under short days.
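The proportion of C4-origin carbon in a fraction can be estimated from its δ(13)C value with a two-end-member linear mixing model. The sketch below is not the authors' calculation: the function names, the source CO2 value of −8‰, and the approximation that an end member's δ(13)C equals the source value minus the discrimination are all assumptions made for illustration. Only the discriminations of 20‰ (C3 origin) and 4‰ (C4 origin) come from the abstract.

```python
def end_member_delta(delta_source: float, discrimination: float) -> float:
    """Approximate delta13C (per mil) of carbon fixed with a given discrimination.

    Assumption: delta_product ~ delta_source - discrimination (small-value form).
    """
    return delta_source - discrimination


def c4_origin_fraction(delta_sample: float,
                       delta_source: float = -8.0,   # assumed source CO2 value
                       disc_c3: float = 20.0,        # from the abstract
                       disc_c4: float = 4.0) -> float:  # from the abstract
    """Fraction (0..1) of C4-origin carbon under linear two-end-member mixing."""
    delta_c3 = end_member_delta(delta_source, disc_c3)
    delta_c4 = end_member_delta(delta_source, disc_c4)
    fraction = (delta_sample - delta_c3) / (delta_c4 - delta_c3)
    # Clamp so that measurement noise outside the end members stays in range.
    return min(1.0, max(0.0, fraction))


# Hypothetical sample value, halfway between the two assumed end members.
print(c4_origin_fraction(-20.0))
```

With the assumed source value the end members are −28‰ (C3 origin) and −12‰ (C4 origin), so a fraction measured at −20‰ would be assigned half of its carbon to each origin.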
A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.

Science.gov (United States)

Wickland, Daniel P; Battu, Gopal; Hudson, Karen A; Diers, Brian W; Hudson, Matthew E

2017-12-28

Genotyping-by-sequencing (GBS), a method to identify genetic variants and quickly genotype samples, reduces genome complexity by using restriction enzymes to divide the genome into fragments whose ends are sequenced on short-read sequencing platforms. While cost-effective, this method produces extensive missing data and requires complex bioinformatics analysis. GBS is most commonly used on crop plant genomes, and because crop plants have highly variable ploidy and repeat content, the performance of GBS analysis software can vary by target organism. Here we focus our analysis on soybean, a polyploid crop with a highly duplicated genome, relatively little public GBS data and few dedicated tools. We compared the performance of five GBS pipelines using low-coverage Illumina sequence data from three soybean populations. To address issues identified with existing methods, we developed GB-eaSy, a GBS bioinformatics workflow that incorporates widely used genomics tools, parallelization and automation to increase the accuracy and accessibility of GBS data analysis. Compared to other GBS pipelines, GB-eaSy rapidly and accurately identified the greatest number of SNPs, with SNP calls closely concordant with whole-genome sequencing of selected lines. Across all five GBS analysis platforms, SNP calls showed unexpectedly low convergence but generally high accuracy, indicating that the workflows arrived at largely complementary sets of valid SNP calls on the low-coverage data analyzed. We show that GB-eaSy is approximately as good as, or better than, other leading software solutions in the accuracy, yield and missing data fraction of variant calling, as tested on low-coverage genomic data from soybean. It also performs well relative to other solutions in terms of the run time and disk space required. In addition, GB-eaSy is built from existing open-source, modular software packages that are regularly updated and commonly used, making it straightforward to install and maintain
Complete genome sequence of the first human parechovirus type 3 isolated in Taiwan

Directory of Open Access Journals (Sweden)

Jenn-Tzong Chang

2017-11-01

The first human parechovirus 3 (HPeV3) VGHKS-2007 in Taiwan was identified from a clinical specimen from a male infant. The entire genome of the HPeV3 isolate was sequenced and compared to known HPeV3 sequences. Genome alignment data showed that HPeV3 VGHKS-2007 shares the highest nucleotide identity, 99%, with the Japanese strain of HPeV3 1361K-162589-Yamagata-2008. All HPeV3 isolates possess at least 97% amino acid identity. The analysis of the genome sequence of HPeV3 VGHKS-2007 will facilitate future investigations of the epidemiology and pathogenicity of HPeV3 infection.
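Identity figures such as the 99% nucleotide identity quoted here are computed from a pairwise alignment. A minimal sketch of one common convention follows; the function name and the toy sequences are hypothetical, and the choice to skip gapped columns is an assumption, since tools differ in how they count gaps.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the ungapped columns of two aligned sequences.

    Assumption: columns with a gap ('-') in either sequence are not counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Toy aligned sequences (not HPeV3 data): 9 ungapped columns, 8 of them identical.
print(round(percent_identity("ACGTACGT-A", "ACGTTCGTGA"), 1))
```

The same function applies unchanged to aligned amino acid sequences, which is how a figure like "at least 97% amino acid identity" would be obtained under this convention.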
Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the influenza A virus subtypes responsible for the 20th-century pandemics.

Science.gov (United States)

Pasricha, Gunisha; Mishra, Akhilesh C; Chakrabarti, Alok K

2013-07-01

PB1F2 is the 11th protein of influenza A virus translated from +1 alternate reading frame of PB1 gene. Since the discovery, varying sizes and functions of the PB1F2 protein of influenza A viruses have been reported. Selection of PB1 gene segment in the pandemics, variable size and pleiotropic effect of PB1F2 intrigued us to analyze amino acid sequences of this protein in various influenza A viruses. Amino acid sequences for PB1F2 protein of influenza A H5N1, H1N1, H2N2, and H3N2 subtypes were obtained from Influenza Research Database. Multiple sequence alignments of the PB1F2 protein sequences of the aforementioned subtypes were used to determine the size, variable and conserved domains and to perform mutational analysis. Analysis showed that 96·4% of the H5N1 influenza viruses harbored full-length PB1F2 protein. Except for the 2009 pandemic H1N1 virus, all the subtypes of the 20th-century pandemic influenza viruses contained full-length PB1F2 protein. Through the years, PB1F2 protein of the H1N1 and H3N2 viruses has undergone much variation. PB1F2 protein sequences of H5N1 viruses showed both human- and avian host-specific conserved domains. Global database of PB1F2 protein revealed that N66S mutation was present only in 3·8% of the H5N1 strains. We found a novel mutation, N84S in the PB1F2 protein of 9·35% of the highly pathogenic avian influenza H5N1 influenza viruses. Varying sizes and mutations of the PB1F2 protein in different influenza A virus subtypes with pandemic potential were obtained. There was genetic divergence of the protein in various hosts which highlighted the host-specific evolution of the virus. However, studies are required to correlate this sequence variability with the virulence and pathogenicity. © 2012 John Wiley & Sons Ltd.
New Grocott Stain without Using Chromic Acid

International Nuclear Information System (INIS)

Shiogama, Kazuya; Kitazawa, Kayo; Mizutani, Yasuyoshi; Onouchi, Takanori; Inada, Ken-ichi; Tsutsumi, Yutaka

2015-01-01

We established a new “ecological” Grocott stain for demonstrating fungi, based upon a 4R principle of refusal, reduction, reuse, and recycle of waste management. Conventional Grocott stain employs environmentally harsh 5% chromic acid for oxidization. Initially, we succeeded in reducing the concentration of chromic acid from 5% to 1% by incubating the solution at 60°C and using five-fold diluted chromic acid solution, at which point it was reusable. Eventually, we reached the refusal level, where 1% periodic acid oxidization was efficient enough when combined with preheating of sections in the electric jar, microwave oven, or pressure pan. For convenience, we recommend pressure pan heating in tap water for 10 min. Stainability of fungi in candidiasis and aspergillosis was comparable with conventional Grocott stain, while Mucor hyphae showed enhanced staining. The modified sequence was further applicable to detecting a variety of mycotic pathogens in paraffin sections. Our environmentally-friendly Grocott stain also has the advantage of avoiding risk of human exposure to hexavalent chromium solution in the histopathology laboratory. The simple stain sequence can be easily applied worldwide
Information decomposition method to analyze symbolical sequences

International Nuclear Information System (INIS)

Korotkov, E.V.; Korotkova, M.A.; Kudryashov, N.A.

2003-01-01

The information decomposition (ID) method to analyze symbolical sequences is presented. This method allows us to reveal a latent periodicity of any symbolical sequence. The ID method is shown to have advantages in comparison with application of the Fourier transformation, the wavelet transform and the dynamic programming method to look for latent periodicity. Examples of the latent periods for poetic texts, DNA sequences and amino acids are presented. Possible origin of a latent periodicity for different symbolical sequences is discussed
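
The abstract does not state the formula behind the ID method, so the following is only a minimal sketch of one common way to look for latent periodicity: for each trial period, measure the mutual information between a symbol and its phase (position modulo the period). The function names and the choice of natural logarithms are assumptions made for illustration, not the authors' formulation, and significance testing is omitted.

```python
import math
from collections import Counter


def periodicity_information(seq, period):
    """Mutual information between symbol and phase, scaled by len(seq) (nats).

    Illustrative assumption: phase = position modulo `period`.
    """
    n = len(seq)
    cell = Counter((i % period, s) for i, s in enumerate(seq))  # joint counts
    phase = Counter(i % period for i in range(n))               # row totals
    symbol = Counter(seq)                                       # column totals

    def xlogx(count):
        return count * math.log(count)

    return (sum(xlogx(c) for c in cell.values())
            - sum(xlogx(c) for c in phase.values())
            - sum(xlogx(c) for c in symbol.values())
            + xlogx(n))


def information_spectrum(seq, max_period):
    """Information value for every trial period from 2 to max_period."""
    return {p: periodicity_information(seq, p) for p in range(2, max_period + 1)}


if __name__ == "__main__":
    # Toy input with an obvious period of 3 (not data from the paper).
    for period, value in information_spectrum("ACG" * 20, 8).items():
        print(period, round(value, 3))
```

In such a spectrum, the true period and its multiples stand out, while unrelated trial periods stay near zero.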
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2

International Nuclear Information System (INIS)

Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.; Cohen, G.H.

1984-01-01

Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors carried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared our sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells
Mildly abnormal general movement quality in infants is associated with higher Mead acid and lower arachidonic acid and shows a U-shaped relation with the DHA/AA ratio.

Science.gov (United States)

van Goor, S A; Schaafsma, A; Erwich, J J H M; Dijck-Brouwer, D A J; Muskiet, F A J

2010-01-01

We showed that docosahexaenoic acid (DHA) supplementation during pregnancy and lactation was associated with more mildly abnormal (MA) general movements (GMs) in the infants. Since this finding was unexpected and inter-individual DHA intakes are highly variable, we explored the relationship between GM quality and erythrocyte DHA, arachidonic acid (AA), DHA/AA and Mead acid in 57 infants of this trial. MA GMs were inversely related to AA, associated with Mead acid, and associated with DHA/AA in a U-shaped manner. These relationships may indicate dependence of newborn AA status on synthesis from linoleic acid. This becomes restricted during the intrauterine period by abundant de novo synthesis of oleic and Mead acids from glucose, consistent with reduced insulin sensitivity during the third trimester. The descending part of the U-shaped relation between MA GMs and DHA/AA probably indicates DHA shortage next to AA shortage. The ascending part may reflect a different developmental trajectory that is not necessarily unfavorable. Copyright 2009 Elsevier Ltd. All rights reserved.
Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

Science.gov (United States)

Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

2010-10-01

Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for the sequence variations. Fourteen of them had the full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V, V2292I ... Copyright © 2010 Elsevier B.V. All rights reserved.
Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids.

Science.gov (United States)

Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

2015-08-13

Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of three different stages of fruit ripening (pre-climacteric, climacteric and post-climacteric) were analyzed. The analysis of the RNA-seq atlas presented here reveals strong differences in gene expression patterns between different organs, especially between root and flower, but also reveals similarities among the gene expression patterns in other organs, such as stem, leaves and aerial buds (vegetative organs) or seed and fruit (storage organs). Important regulators, functional categories, and differentially expressed genes involved in avocado fruit ripening were identified. Additionally, to demonstrate the utility of the avocado gene expression atlas, we investigated the expression patterns of genes implicated in fatty acid metabolism and fruit ripening. A description of transcriptomic changes occurring during fruit ripening was obtained in Mexican avocado, contributing to a dynamic view of the expression patterns of genes involved in fatty acid biosynthesis and the fruit ripening process.
Model of the synthesis of trisporic acid in Mucorales showing bistability.

Science.gov (United States)

Werner, S; Schroeter, A; Schimek, C; Vlaic, S; Wöstemeyer, J; Schuster, S

2012-12-01

An important substance in the signalling between individuals of Mucor-like fungi is trisporic acid (TA). This compound, together with some of its precursors, serves as a pheromone in mating between (+)- and (-)-mating types. Moreover, intermediates of the TA pathway are exchanged between the two mating partners. Based on differential equations, mathematical models of the synthesis pathways of TA in the two mating types of an idealised Mucor-fungus are here presented. These models include the positive feedback of TA on its own synthesis. The authors compare three sub-models in view of bistability, robustness and the reversibility of transitions. The proposed modelling study showed that, in a system where intermediates are exchanged, a reversible transition between the two stable steady states occurs, whereas an exchange of the end product leads to an irreversible transition. The reversible transition is physiologically favoured, because the high-production state of TA must come to an end eventually. Moreover, the exchange of intermediates and TA is compared with the 3-way handshake widely used by computers linked in a network.
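
The abstract describes differential-equation models in which TA feeds back positively on its own synthesis and which can show bistability. The sketch below is not the authors' model: it is a generic one-variable equation with Hill-type positive feedback, integrated with the forward Euler method, and every parameter value is an illustrative assumption chosen only so that two stable steady states exist.

```python
def rate(x, basal=0.05, vmax=1.0, k=0.5, hill=4, decay=1.0):
    """dx/dt for a product x that stimulates its own synthesis.

    All parameter values are hypothetical, chosen to give bistability.
    """
    feedback = vmax * x**hill / (k**hill + x**hill)  # positive feedback term
    return basal + feedback - decay * x              # synthesis minus loss


def steady_state(x0, dt=0.01, steps=5000):
    """Integrate from x0 with forward Euler and return the final value."""
    x = x0
    for _ in range(steps):
        x += dt * rate(x)
    return x


low = steady_state(0.1)   # starts below the unstable threshold
high = steady_state(0.8)  # starts above the unstable threshold
print(f"start 0.1 -> {low:.3f}; start 0.8 -> {high:.3f}")
```

With these assumed parameters, the two starting concentrations settle at different levels, which is the defining feature of a bistable system: the final state depends on the history of the system, not only on the parameters.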

High-Throughput Next-Generation Sequencing of Polioviruses

Science.gov (United States)

Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.

2016-01-01

The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929
Genome Sequence of Lactobacillus plantarum Strain UCMA 3037

OpenAIRE

Naz, Saima; Tareb, Raouf; Bernardeau, Marion; Vaisse, Melissa; Lucchetti-Miganeh, Celine; Rechenmann, Mathias; Vernoux, Jean-Paul

2013-01-01

Nucleic acid of the strain Lactobacillus plantarum UCMA 3037, isolated from raw milk camembert cheese in our laboratory, was sequenced. We present its draft genome sequence with the aim of studying its functional properties and relationship to the cheese ecosystem.
Genome Sequence of Lactobacillus plantarum Strain UCMA 3037.

Science.gov (United States)

Naz, Saima; Tareb, Raouf; Bernardeau, Marion; Vaisse, Melissa; Lucchetti-Miganeh, Celine; Rechenmann, Mathias; Vernoux, Jean-Paul

2013-05-23

Nucleic acid of the strain Lactobacillus plantarum UCMA 3037, isolated from raw milk camembert cheese in our laboratory, was sequenced. We present its draft genome sequence with the aim of studying its functional properties and relationship to the cheese ecosystem.
Single-cell sequencing unveils the lifestyle and CRISPR-based population history of Hydrotalea sp. in acid mine drainage.

Science.gov (United States)

Medeiros, J D; Leite, L R; Pylro, V S; Oliveira, F S; Almeida, V M; Fernandes, G R; Salim, A C M; Araújo, F M G; Volpini, A C; Oliveira, G; Cuadros-Orellana, S

2017-10-01

Acid mine drainage (AMD) is characterized by an acid and metal-rich run-off that originates from mining systems. Despite having been studied for many decades, much remains unknown about the microbial community dynamics in AMD sites, especially during their early development, when the acidity is moderate. Here, we describe draft genome assemblies from single cells retrieved from an early-stage AMD sample. These cells belong to the genus Hydrotalea and are closely related to Hydrotalea flava. The phylogeny and average nucleotide identity analysis suggest that all single amplified genomes (SAGs) form two clades that may represent different strains. These cells have the genomic potential for denitrification, copper and other metal resistance. Two coexisting CRISPR-Cas loci were recovered across SAGs, and we observed heterogeneity in the population with regard to the spacer sequences, together with the loss of trailer-end spacers. Our results suggest that the genomes of Hydrotalea sp. strains studied here are adjusting to a quickly changing selective pressure at the microhabitat scale, and an important form of this selective pressure is infection by foreign DNA. © 2017 John Wiley & Sons Ltd.
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

Science.gov (United States)

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acids (YFKAGXYXQ) in the carboxy-terminal region, previously reported by Malissard and colleagues, was observed only in AlyVG1 and AlyVG2.
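
The ORF lengths and residue counts quoted above can be cross-checked with simple codon arithmetic. The sketch below uses only the numbers given in the abstract; the helper name `codons` is hypothetical. That each reported length is exactly three times the residue count suggests the stop codon is not counted in the stated lengths, although the abstract does not say so.

```python
# ORF lengths (bp) and residue counts as reported in the abstract.
orfs_bp = {"alyVG1": 1056, "alyVG2": 993, "alyVG3": 705}
reported_residues = {"alyVG1": 352, "alyVG2": 331, "alyVG3": 235}


def codons(length_bp):
    """Number of complete codons in an ORF of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return length_bp // 3


for gene, length in orfs_bp.items():
    status = "consistent" if codons(length) == reported_residues[gene] else "mismatch"
    print(gene, length, "bp ->", codons(length), "codons:", status)
```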
Biological characterization and complete nucleotide sequence of a Tunisian isolate of Moroccan watermelon mosaic virus.

Science.gov (United States)

Yakoubi, S; Desbiez, C; Fakhfakh, H; Wipf-Scheibel, C; Marrakchi, M; Lecoq, H

2008-01-01

During a survey conducted in October 2005, cucurbit leaf samples showing virus-like symptoms were collected from the major cucurbit-growing areas in Tunisia. DAS-ELISA showed the presence of Moroccan watermelon mosaic virus (MWMV, Potyvirus), detected for the first time in Tunisia, in samples from the region of Cap Bon (Northern Tunisia). MWMV isolate TN05-76 (MWMV-Tn) was characterized biologically and its full-length genome sequence was established. MWMV-Tn was found to have biological properties similar to those reported for the MWMV type strain from Morocco. Phylogenetic analysis including the comparison of complete amino-acid sequences of 42 potyviruses confirmed that MWMV-Tn is related (65% amino-acid sequence identity) to Papaya ringspot virus (PRSV) isolates but is a member of a distinct virus species. Sequence analysis on parts of the CP gene of MWMV isolates from different geographical origins revealed some geographic structure of MWMV variability, with three different clusters: one cluster including isolates from the Mediterranean region, a second including isolates from western and central Africa, and a third one including isolates from the southern part of Africa. A significant correlation was observed between geographic and genetic distances between isolates. Isolates from countries in the Mediterranean region where MWMV has recently emerged (France, Spain, Portugal) have highly conserved sequences, suggesting that they may have a common and recent origin. MWMV from Sudan, a highly divergent variant, may be considered an evolutionary intermediate between MWMV and PRSV.
Image correlation method for DNA sequence alignment.

Science.gov (United States)

Curilem Saldías, Millaray; Villarroel Sassarini, Felipe; Muñoz Poblete, Carlos; Vargas Vásquez, Asticio; Maureira Butler, Iván

2012-01-01

The complexity of searches and the volume of genomic data make sequence alignment one of the most active research areas in bioinformatics. New alignment approaches have incorporated digital signal processing techniques. Among these, correlation methods are highly sensitive. This paper proposes a novel sequence alignment method based on 2-dimensional images, where each nucleic acid base is represented as a fixed gray intensity pixel. Query and known database sequences are coded to their pixel representation, and sequence alignment is handled as an object-recognition-in-a-scene problem. Query and database become object and scene, respectively. An image correlation process is carried out in order to search for the best match between them. Given that this procedure can be implemented in an optical correlator, the correlation could eventually be accomplished at light speed. This paper shows an initial research stage where results were "digitally" obtained by simulating an optical correlation of DNA sequences represented as images. A total of 303 queries (variable lengths from 50 to 4500 base pairs) and 100 scenes represented by 100 x 100 images each (in total, a one-million-base-pair database) were considered for the image correlation analysis. The results showed that correlations reached very high sensitivity (99.01%) and specificity (98.99%), and outperformed BLAST when mutation numbers increased. However, digital correlation processes were a hundred times slower than BLAST. We are currently starting an initiative to evaluate the correlation speed of a real experimental optical correlator. By doing this, we expect to fully exploit the light properties of optical correlation. As the optical correlator works jointly with the computer, digital algorithms should also be optimized. The results presented in this paper are encouraging and support the study of image correlation methods on sequence alignment.
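
The idea of coding bases as gray levels and locating a query by correlation can be illustrated with a much-simplified digital sketch. It works on a one-dimensional strip rather than the 100 x 100 images used in the paper, and the gray levels, function names and toy sequences are all assumptions made for illustration; the paper's actual encoding and correlator design are not given in the abstract.

```python
import math

# Assumed gray levels; the abstract only says each base gets a fixed intensity.
GRAY = {"A": 0.0, "C": 85.0, "G": 170.0, "T": 255.0}


def encode(seq):
    """Turn a DNA string into a list of pixel intensities."""
    return [GRAY[base] for base in seq.upper()]


def normalized_correlation(a, b):
    """Zero-mean normalized correlation of two equal-length pixel lists."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]
    num = sum(x * y for x, y in zip(da, db))
    den = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    return num / den if den else 0.0  # flat windows carry no signal


def best_match(query, database):
    """Slide the query (object) along the database (scene); return (offset, score).

    Assumes the query is no longer than the database.
    """
    q, d = encode(query), encode(database)
    scores = [normalized_correlation(q, d[i:i + len(q)])
              for i in range(len(d) - len(q) + 1)]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


if __name__ == "__main__":
    # Toy sequences, not data from the paper.
    print(best_match("CGTAAGC", "TTGACCGTAAGCTTGCA"))
```

An exact copy of the query gives a correlation peak of 1, and mismatches lower the peak gradually rather than abolishing it. That gradual decline is consistent with the tolerance to mutations reported in the abstract.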
Chirality- and sequence-selective successive self-sorting via specific homo- and complementary-duplex formations

Science.gov (United States)

Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji

2015-01-01

Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291
Identification of single amino acid substitutions (SAAS) in neuraminidase from influenza a virus (H1N1) via mass spectrometry analysis coupled with de novo peptide sequencing.

Science.gov (United States)

Peng, Qisheng; Wang, Zijian; Wu, Donglin; Li, Xiaoou; Liu, Xiaofeng; Sun, Wanchun; Liu, Ning

2016-08-01

Amino acid substitutions in the neuraminidase of the influenza virus are the main cause of the emergence of resistance to zanamivir or oseltamivir during seasonal influenza treatment; they are the result of non-synonymous mutations in the viral genome that can be successfully detected by polymerase chain reaction (PCR)-based approaches. There is always an urgent need to detect variation in amino acid sequences directly at the protein level. Mass spectrometry coupled with de novo sequencing has been explored as an alternative and straightforward strategy for detecting amino acid substitutions; this approach is the primary focus of the present study. Influenza virus (A/Puerto Rico/8/1934 H1N1) propagated in embryonated chicken eggs was purified by ultracentrifugation, followed by PNGase F treatment. The deglycosylated virion was lysed and separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The gel band corresponding to neuraminidase was picked up and subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three amino acid substitutions: R346K, S349N, and S370I/L, in the neuraminidase from the influenza virus (A/Puerto Rico/8/1934 H1N1), which were located in three mutated peptides of the neuraminidase: YGNGVWIGK, TKNHSSR, and PNGWTETDI/LK, respectively. We found that the amino acid substitutions in the proteins of RNA viruses (including influenza A virus) resulting from non-synonymous gene mutations can indeed be directly analyzed via mass spectrometry, and that manual interpretation of the MS/MS data may be beneficial. Copyright © 2016 John Wiley & Sons, Ltd.
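
Substitution labels such as R346K follow a simple pattern: reference residue, position, observed residue. The sketch below shows how such labels could be produced by comparing a de novo sequenced peptide with the matching stretch of a reference protein. The function name and the toy sequences are hypothetical. Because leucine and isoleucine have the same mass, an observed I or L is reported as I/L, mirroring the S370I/L notation in the abstract.

```python
def substitutions(reference, observed, start=1):
    """List substitutions between two equal-length, pre-aligned sequences.

    `start` is the protein position of the first residue (hypothetical helper).
    """
    if len(reference) != len(observed):
        raise ValueError("sequences must be aligned to the same length")
    found = []
    for offset, (ref, obs) in enumerate(zip(reference, observed)):
        if ref == obs:
            continue
        if obs in "IL":
            if ref in "IL":
                continue  # I and L are isobaric: no detectable change
            obs = "I/L"   # mass alone cannot tell which one is present
        found.append(f"{ref}{start + offset}{obs}")
    return found


if __name__ == "__main__":
    # Toy peptide pair, not sequences from the paper.
    print(substitutions("ARSGS", "AKSGL", start=10))
```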
Bm86 midgut protein sequence variation in South Texas cattle fever ticks

Directory of Open Access Journals (Sweden)

Kammlah Diane M

2010-11-01

Background: Cattle fever ticks, Rhipicephalus (Boophilus) microplus and R. (B.) annulatus, vector bovine and equine babesiosis, and have significantly expanded beyond the permanent quarantine zone established in South Texas. Currently, there are no vaccines approved for use within the United States for controlling these vectors. Vaccines developed in Australia and Cuba based on the midgut antigen Bm86 have variable efficacy against cattle fever ticks. A possible explanation for this variation in vaccine efficacy is amino acid sequence divergence between the recombinant Bm86 vaccine component and native Bm86 expressed in ticks from different geographical regions of the world. Results: There was 91.8% amino acid sequence identity in Bm86 among R. microplus and R. annulatus sequenced from South Texas infestations. When South Texas isolates were compared to the Australian Yeerongpilly and Cuban Camcord vaccine strains, there was 89.8% and 90.0% identity, respectively. Most of the sequence divergence was focused in one region of the protein, amino acids 206-298. Hydrophilicity profiles revealed that two short regions of Bm86 (amino acids 206-210 and 560-570) appear to be more hydrophilic in South Texas isolates compared to vaccine strains. Only one amino acid difference was found between South Texas and vaccine strains within two previously described B-cell epitopes. A total of 4 amino acid differences were observed within three peptides previously shown to induce protective immune responses in cattle. Conclusions: Sequence differences between South Texas isolates and Yeerongpilly and Camcord strains are spread throughout the entire Bm86 sequence, suggesting that geographic variation does exist. Differences within previously described B-cell epitopes between South Texas isolates and vaccine strains are minimal; however, short regions of hydrophilic amino acids found unique to South Texas isolates suggest that additional unique surface exposed
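
Percent identity figures like those quoted above are computed from a pairwise alignment. The abstract does not say which convention was used, so the sketch below assumes one common choice: count matching residues over the aligned columns in which neither sequence has a gap. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over gap-free columns of a pairwise alignment.

    Assumption: columns with a gap in either sequence are ignored; other
    conventions (e.g. dividing by alignment length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        matches += (a == b)
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy alignment, not Bm86 data.
    print(percent_identity("ACDEFGHIKL", "ACDQFGHIKL"))
```

Applying the same function to a slice of the alignment, such as the columns covering a single region, would give a regional identity that can be compared with the whole-protein value.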
Biological sequence analysis: probabilistic models of proteins and nucleic acids

National Research Council Canada - National Science Library

Durbin, Richard

1998-01-01

... analysis methods are now based on principles of probabilistic modelling. Examples of such methods include the use of probabilistically derived score matrices to determine the significance of sequence alignments, the use of hidden Markov models as the basis for profile searches to identify distant members of sequence families, and the inference...
Genomic localization, sequence analysis, and transcription of the putative human cytomegalovirus DNA polymerase gene

International Nuclear Information System (INIS)

Heilbronn, T.; Jahn, G.; Buerkle, A.; Freese, U.K.; Fleckenstein, B.; Zur Hausen, H.

1987-01-01

The human cytomegalovirus (HCMV)-induced DNA polymerase has been well characterized biochemically and functionally, but its genomic location has not yet been assigned. To identify the coding sequence, cross-hybridization with the herpes simplex virus type 1 (HSV-1) polymerase gene was used, as suggested by the close similarity of the herpes group virus-induced DNA polymerases to the HCMV DNA polymerase. A cosmid and plasmid library of the entire HCMV genome was screened with the BamHI Q fragment of HSV-1 at different stringency conditions. One PstI-HincII restriction fragment of 850 base pairs mapping within the EcoRI M fragment of HCMV cross-hybridized at T_m - 25 °C. Sequence analysis revealed one open reading frame spanning the entire sequence. The amino acid sequence showed a highly conserved domain of 133 amino acids shared with the HSV and putative Epstein-Barr virus polymerase sequences. This domain maps within the C-terminal part of the HSV polymerase gene, which has been suggested to contain part of the catalytic center of the enzyme. Transcription analysis revealed one 5.4-kilobase early transcript in the sense orientation with respect to the open reading frame identified. This transcript appears to code for the 140-kilodalton HCMV polymerase protein
The myoglobin of Emperor penguin (Aptenodytes forsteri): amino acid sequence and functional adaptation to extreme conditions.

Science.gov (United States)

Tamburrini, M; Romano, M; Giardina, B; di Prisco, G

1999-02-01

In the framework of a study on molecular adaptations of the oxygen-transport and storage systems to extreme conditions in Antarctic marine organisms, we have investigated the structure/function relationship in Emperor penguin (Aptenodytes forsteri) myoglobin, in search of correlation with the bird life style. In contrast with previous reports, the revised amino acid sequence contains one additional residue and 15 differences. The oxygen-binding parameters seem well adapted to the diving behaviour of the penguin and to the environmental conditions of the Antarctic habitat. Addition of lactate has no major effect on myoglobin oxygenation over a large temperature range. Therefore, metabolic acidosis does not impair myoglobin function under conditions of prolonged physical effort, such as diving.
Non-natural and photo-reactive amino acids as biochemical probes of immune function.

Directory of Open Access Journals (Sweden)

Marta Gómez-Nuñez

Wilms tumor protein (WT1) is a transcription factor selectively overexpressed in leukemias and cancers; clinical trials are underway that use altered WT1 peptide sequences as vaccines. Here we report a strategy to study peptide-MHC interactions by incorporating non-natural and photo-reactive amino acids into the sequence of WT1 peptides. Thirteen WT1 peptide sequences were synthesized with chemically modified amino acids (via fluorination and photo-reactive group additions) at MHC and T cell receptor binding positions. Certain new non-natural peptide analogs could stabilize MHC class I molecules better than the native sequences and were also able to elicit specific T-cell responses and sometimes cytotoxicity to leukemia cells. Two photo-reactive peptides, also modified with a biotin handle for pull-down studies, formed covalent interactions with MHC molecules on live cells and provided kinetic data showing the rapid clearance of the peptide-MHC complex. Despite the "infinite affinity" provided by the covalent peptide bonding to the MHC, immunogenicity was not enhanced by these peptides because the peptide presentation on the surface was dominated by catabolism of the complex and only a small percentage of peptide molecules covalently bound to the MHC molecules. This study shows that non-natural amino acids can be successfully incorporated into T cell epitopes to provide novel immunological, biochemical and kinetic information.
Comparison of G protein sequences of South African street rabies viruses showing distinct progression of the disease in a mouse model of experimental rabies.

Science.gov (United States)

Seo, Wonhyo; Servat, Alexandre; Cliquet, Florence; Akinbowale, Jenkins; Prehaud, Christophe; Lafon, Monique; Sabeta, Claude

Rabies is a fatal zoonotic disease and infections generally lead to a fatal encephalomyelitis in both humans and animals. In South Africa, domestic (dogs) and the wildlife (yellow mongoose) host species maintain the canid and mongoose rabies variants respectively. In this study, pathogenicity differences of South African canid and mongoose rabies viruses were investigated in a murine model, by assessing the progression of clinical signs and survivorship. Comparison of glycoprotein gene sequences revealed amino acid differences that may underpin the observed pathogenicity differences. Cumulatively, our results suggest that the canid rabies virus may be more neurovirulent in mice than the mongoose rabies variant. Copyright © 2017 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
The Transcriptional Heat Shock Response of Salmonella Typhimurium Shows Hysteresis and Heated Cells Show Increased Resistance to Heat and Acid Stress

DEFF Research Database (Denmark)

Pin, C.; Hansen, Trine; Munoz-Cuevas, M.

2012-01-01

We investigated if the transcriptional response of Salmonella Typhimurium to temperature and acid variations was hysteretic, i.e. whether the transcriptional regulation caused by environmental stimuli showed memory and remained after the stimuli ceased. The transcriptional activity of non......, implying that down-regulation was significantly less synchronized than up-regulation. The hysteretic transcriptional response to heat shock was accompanied by higher resistance to inactivation at 50 °C as well as cross-resistance to inactivation at pH 3; however, growth rates and lag times at 43 °C and at p......H 4.5 were not affected. The exposure to pH 5 only caused up-regulation of 12 genes and this response was neither hysteretic nor accompanied by increased resistance to inactivation conditions. Cellular memory at the transcriptional level may represent a mechanism of adaptation to the environment...
Impact of Acid Cleaning on the Performance of PVDF UF Membranes in Seawater Reverse Osmosis Pretreatment

KAUST Repository

Alsogair, Safiya

2016-05-05

Low-pressure membrane systems such as microfiltration (MF) and ultrafiltration (UF) have been presented as a viable option for pretreatment in potable water applications. UF membranes are periodically backwashed with ultra-filtered water to remove deposited matter from the membrane and restore it. Several factors can decrease permeability and selectivity, and numerous procedures are available to counter them. Membrane cleaning is the most important step required to maintain the characteristics of the membrane. This research investigated the effects of acid cleaning during chemically enhanced backwashing (CEB) on the performance of UF membranes in seawater reverse osmosis (SWRO) pretreatment. The questions posed were: Does the acid addition (before or after the alkali CEB) influence the overall CEB cleaning effectiveness on the Dow UF membrane? Does the CEB order of alkali (NaOCl) and acid (H2SO4) affect the overall CEB cleaning effectiveness? If yes, which order is better or worse? What is the optimal acid CEB frequency that will ensure the most reliable performance of the UF? To answer these questions, a series of sequences was carried out with different types of chemical treatment: only NaOCl, daily NaOCl plus weekly acid, daily NaOCl plus daily acid, and weekly acid plus daily NaOCl. The consequences of acid cleaning were investigated through operational data such as transmembrane pressure, resistance and permeability, supported by analytical experiments (organic, inorganic and microbial characterization). Microorganisms were removed almost completely by hydraulic cleaning, and the addition of acid made no difference. The operational data, together with the organic and inorganic characterization, led to the elimination of the first sequence because of the accumulation of fouling over time, which causes cleaning to increase downtime
Nucleic acid drugs: a novel approach

African Journals Online (AJOL)

Administrator

The nucleic acid base sequence encoding proteins plays a crucial role in gene expression. Genes are responsible for the synthesis of proteins, and these proteins are responsible for biological processes and also for dreadful diseases. Once the nucleic acid sequence is altered, we would be ...
CodonLogo: a sequence logo-based viewer for codon patterns.

Science.gov (United States)

Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

2012-07-15

Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
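The stack and symbol heights described above can be illustrated with a minimal Python sketch. It assumes equal-length, in-frame nucleotide sequences and computes the plain information content log2(64) minus the column entropy for each codon column; any small-sample correction that WebLogo3 or CodonLogo may apply is not reproduced here, and the function names are hypothetical, not part of either tool.

```python
import math
from collections import Counter


def codon_columns(aligned_seqs):
    """Split equal-length, in-frame nucleotide sequences into codon columns."""
    length = len(aligned_seqs[0])
    if length % 3 != 0 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must share one length divisible by 3")
    return [[s[i:i + 3] for s in aligned_seqs] for i in range(0, length, 3)]


def column_heights(column, alphabet_size=64):
    """Return (stack height in bits, {codon: symbol height}) for one column.

    Stack height = log2(alphabet_size) - Shannon entropy of the column.
    Symbol height = codon frequency * stack height.
    """
    counts = Counter(column)
    total = sum(counts.values())
    freqs = {codon: n / total for codon, n in counts.items()}
    entropy = -sum(p * math.log2(p) for p in freqs.values())
    info = math.log2(alphabet_size) - entropy
    return info, {codon: p * info for codon, p in freqs.items()}


if __name__ == "__main__":
    alignment = ["ATGAAA", "ATGAAG"]  # toy alignment, two codon columns
    for col in codon_columns(alignment):
        print(column_heights(col))
```

Treating each codon as one symbol of a 64-letter alphabet is what lets a fully conserved codon column reach 6 bits, whereas a nucleotide logo caps each position at 2 bits.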
Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

Science.gov (United States)

Deroost, Natacha; Coomans, Daphné

2018-02-01

We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.

Method of Identifying a Base in a Nucleic Acid

Science.gov (United States)

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

1999-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Automatic discovery of cross-family sequence features associated with protein function

Directory of Open Access Journals (Sweden)

Krings Andrea

2006-01-01

Background: Methods for predicting protein function directly from amino acid sequences are useful tools in the study of uncharacterised protein families and in comparative genomics. Until now, this problem has been approached using machine learning techniques that attempt to predict membership, or otherwise, to predefined functional categories or subcellular locations. A potential drawback of this approach is that the human-designated functional classes may not accurately reflect the underlying biology, and consequently important sequence-to-function relationships may be missed. Results: We show that a self-supervised data mining approach is able to find relationships between sequence features and functional annotations. No preconceived ideas about functional categories are required, and the training data is simply a set of protein sequences and their UniProt/Swiss-Prot annotations. The main technical aspect of the approach is the co-evolution of amino acid-based regular expressions and keyword-based logical expressions with genetic programming. Our experiments on a strictly non-redundant set of eukaryotic proteins reveal that the strongest and most easily detected sequence-to-function relationships are concerned with targeting to various cellular compartments, which is an area already well studied both experimentally and computationally. Of more interest are a number of broad functional roles which can also be correlated with sequence features. These include inhibition, biosynthesis, transcription and defence against bacteria. Despite substantial overlaps between these functions and their corresponding cellular compartments, we find clear differences in the sequence motifs used to predict some of these functions. For example, the presence of polyglutamine repeats appears to be linked more strongly to the "transcription" function than to the general "nuclear" function/location. Conclusion: We have developed a novel and useful approach for
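As a rough illustration of an amino acid-based regular expression, the Python sketch below scans a protein sequence for polyglutamine runs. The pattern and the minimum run length of five are assumptions chosen for the example; they are not the expressions evolved in the study, and `find_polyq` is a hypothetical helper name.

```python
import re

# Hypothetical motif: a run of at least five consecutive glutamines (Q).
# The threshold is an illustrative assumption, not a value from the study.
POLYQ = re.compile(r"Q{5,}")


def find_polyq(sequence):
    """Return (start index, run length) for each polyglutamine run found."""
    return [(m.start(), m.end() - m.start())
            for m in POLYQ.finditer(sequence.upper())]


if __name__ == "__main__":
    toy_protein = "MAQQQQQQKLQQ"  # made-up sequence for demonstration
    print(find_polyq(toy_protein))
```

A rule of this kind could then be paired with an annotation keyword such as "transcription" to test how often the motif and the keyword co-occur in a set of annotated proteins.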
Molecular Cloning and Sequence Analysis of a Phenylalanine Ammonia-Lyase Gene from Dendrobium

Science.gov (United States)

Cai, Yongping; Lin, Yi

2013-01-01

In this study, a phenylalanine ammonia-lyase (PAL) gene was cloned from Dendrobium candidum using homology cloning and RACE. The full-length PAL cDNA of D. candidum (designated Dc-PAL1, GenBank No. JQ765748) is 2,458 bp long and contains a complete open reading frame (ORF) of 2,142 bp, which encodes 713 amino acid residues. The deduced amino acid sequence of DcPAL1 has more than 80% sequence identity with the PAL proteins of other plants, as indicated by multiple alignments. The dominant sites and catalytic active sites, which are similar to those present in the PAL proteins of Arabidopsis thaliana and Nicotiana tabacum, are also found in DcPAL1. Phylogenetic tree analysis revealed that DcPAL is more closely related to PALs from Orchidaceae plants than to those of other plants. The differential expression patterns of PAL in protocorm-like body, leaf, stem, and root suggest that the PAL gene performs multiple physiological functions in Dendrobium candidum. PMID:23638048
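The relationship between the reported ORF length and protein length can be checked with a short Python sketch. It assumes that the 2,142 bp figure includes the terminating stop codon, which is consistent with the 713 residues reported; the function name is hypothetical.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumes the ORF length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 2,142 bp / 3 = 714 codons; 714 - 1 stop codon = 713 residues
    print(orf_protein_length(2142))
```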
Identification of microRNAs actively involved in fatty acid biosynthesis in developing Brassica napus seeds using high-throughput sequencing

Directory of Open Access Journals (Sweden)

Jia Wang

2016-10-01

Seed development has a critical role during the spermatophyte life cycle. In Brassica napus, a major oil crop, fatty acids are synthesized and stored in specific tissues during embryogenesis, and understanding the molecular mechanism underlying fatty acid biosynthesis during seed development is an important research goal. In this study, we constructed three small RNA libraries from early seeds at 14, 21 and 28 days after flowering (DAF) and used high-throughput sequencing to examine microRNA (miRNA) expression. A total of 85 known miRNAs from 30 families and 1,160 novel miRNAs were identified, of which 24, including 5 known and 19 novel miRNAs, were found to be involved in fatty acid biosynthesis. bna-miR156b, bna-miR156c, bna-miR156g, novel_mir_1706, novel_mir_1407, novel_mir_173, and novel_mir_104 were significantly down-regulated at 21 DAF and 28 DAF, whereas bna-miR159, novel_mir_1081, novel_mir_19 and novel_mir_555 were significantly up-regulated. In addition, we found that some miRNAs regulate functional genes that are directly involved in fatty acid biosynthesis and that other miRNAs regulate the process of fatty acid biosynthesis by acting on a large number of transcription factors. The miRNAs and their corresponding predicted targets were partially validated by quantitative RT-PCR. Our data suggest that diverse and complex miRNAs are involved in the seed development process and that miRNAs play important roles in fatty acid biosynthesis during seed development.
The ABC transporter Rv1272c of Mycobacterium tuberculosis enhances the import of long-chain fatty acids in Escherichia coli.

Science.gov (United States)

Martin, Audrey; Daniel, Jaiyanth

2018-02-05

Mycobacterium tuberculosis (Mtb), which causes tuberculosis, is capable of accumulating triacylglycerol (TAG) by utilizing fatty acids from host cells. ATP-binding cassette (ABC) transporters are involved in transport processes in all organisms. Among the classical ABC transporters in Mtb none have been implicated in fatty acid import. Since the transport of fatty acids from the host cell is important for dormancy-associated TAG synthesis in the pathogen, mycobacterial ABC transporter(s) could potentially be involved in this process. Based on sequence identities with a bacterial ABC transporter that mediates fatty acid import for TAG synthesis, we identified Rv1272c, a hitherto uncharacterized ABC-transporter in Mtb that also shows sequence identities with a plant ABC transporter involved in fatty acid transport. We expressed Rv1272c in E. coli and show that it enhances the import of radiolabeled fatty acids. We also show that Rv1272c causes a significant increase in the metabolic incorporation of radiolabeled long-chain fatty acids into cardiolipin, a tetra-acylated phospholipid, and phosphatidylglycerol in E. coli. This is the first report on the function of Rv1272c showing that it displays a long-chain fatty acid transport function. Copyright © 2018 Elsevier Inc. All rights reserved.
Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

KAUST Repository

Zhang, Zhang

2010-11-08

Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution, and a large number of publicly available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge. Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences. Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms. Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.
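The two compositional parameters named in the abstract, GC content and purine content at each of the three codon positions, can be measured from a coding sequence as in the Python sketch below. This only computes the observed inputs; it does not implement the two models themselves, whose equations are not given here, and the function name is hypothetical.

```python
def positional_content(cds):
    """Return [(GC fraction, purine fraction)] for codon positions 1, 2, 3.

    GC content counts G and C; purine content counts A and G.
    Assumes an in-frame coding sequence made up of A, C, G and T only.
    """
    cds = cds.upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a non-zero multiple of 3")
    result = []
    for pos in range(3):
        bases = cds[pos::3]  # every third base, starting at this codon position
        gc = sum(b in "GC" for b in bases) / len(bases)
        purine = sum(b in "AG" for b in bases) / len(bases)
        result.append((gc, purine))
    return result


if __name__ == "__main__":
    print(positional_content("ATGGCC"))  # toy sequence: codons ATG and GCC
```

Keeping the three positions separate matters because, as the abstract notes, mutation and selection act differently at each codon position.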
Deduced amino acid sequence of the small hydrophobic protein of US avian pneumovirus has greater identity with that of human metapneumovirus than those of non-US avian pneumoviruses.

Science.gov (United States)

Yunus, Abdul S; Govindarajan, Dhanasekaran; Huang, Zhuhui; Samal, Siba K

2003-05-01

We report here the nucleotide and deduced amino acid (aa) sequences of the small hydrophobic (SH) gene of the avian pneumovirus strain Colorado (APV/CO). The SH gene of APV/CO is 628 nucleotides in length from gene-start to gene-end. The longest ORF of the SH gene encoded a protein of 177 aa in length. Comparison of the deduced aa sequence of the SH protein of APV/CO with the corresponding published sequences of other members of the genus Metapneumovirus showed 28% identity with the newly discovered human metapneumovirus (hMPV), but no discernible identity with APV subgroup A or B. Collectively, these data support the hypothesis that: (i) APV/CO is distinct from European APV subgroups and belongs to the novel subgroup APV/C (APV/US); (ii) APV/CO is more closely related to hMPV, a mammalian metapneumovirus, than to either APV subgroup A or B. The SH gene of APV/CO was cloned using a genomic walk strategy which initiated cDNA synthesis from genomic RNA that traversed the genes in the order 3'-M-F-M2-SH-G-5', thus confirming that the gene order of APV/CO conforms to that of the genus Metapneumovirus. We also provide the sequences of transcription signals and the M-F, F-M2, M2-SH and SH-G intergenic regions of APV/CO.
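Percent identity figures such as the 28% quoted above depend on how aligned positions are counted. The Python sketch below shows one common convention, assumed here for illustration: identical residues divided by the number of aligned columns in which neither sequence has a gap. The abstract does not state which convention or alignment program was used, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b)
             if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # Toy pairwise alignment: 4 matches over 5 gap-free columns
    print(percent_identity("MKT-LV", "MRTALV"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which generally gives different values for the same alignment.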
A novel halophilic lipase, LipBL, showing high efficiency in the production of eicosapentaenoic acid (EPA).

Directory of Open Access Journals (Sweden)

Dolores Pérez

BACKGROUND: Among extremophiles, halophiles are defined as microorganisms adapted to live and thrive in diverse extreme saline environments. These extremophilic microorganisms constitute the source of a number of hydrolases with great biotechnological applications. The interest in using extremozymes from halophiles in industrial applications lies in their resistance to organic solvents and extreme temperatures. Marinobacter lipolyticus SM19 is a moderately halophilic bacterium, isolated previously from a saline habitat in southern Spain, showing lipolytic activity. METHODS AND FINDINGS: A lipolytic enzyme from the halophilic bacterium Marinobacter lipolyticus SM19 was isolated. This enzyme, designated LipBL, was expressed in Escherichia coli. LipBL is a protein of 404 amino acids with a molecular mass of 45.3 kDa and high identity to class C β-lactamases. LipBL was purified and biochemically characterized. The temperature for its maximal activity was 80°C and the pH optimum determined at 25°C was 7.0, showing optimal activity without sodium chloride, while maintaining 20% activity in a wide range of NaCl concentrations. This enzyme exhibited high activity against short-medium length acyl chain substrates, although it also hydrolyzes olive oil and fish oil. The fish oil hydrolysis using LipBL results in an enrichment of free eicosapentaenoic acid (EPA), but not docosahexaenoic acid (DHA), relative to its levels present in fish oil. To improve its stability and enable its use in industrial processes, LipBL was immobilized on different supports. The CNBr-activated Sepharose immobilized derivatives were highly selective towards the release of EPA versus DHA. The enzyme is also active towards different chiral and prochiral esters. Exposure of LipBL to buffer-solvent mixtures showed that the enzyme had remarkable activity and stability in all organic solvents tested. CONCLUSIONS: In this study we isolated, purified, biochemically characterized and immobilized a
Peptide Nucleic Acids

DEFF Research Database (Denmark)

2004-01-01

A novel class of compounds known as peptide nucleic acids bind complementary DNA and RNA strands, and generally do so more strongly than the corresponding DNA or RNA strands while exhibiting increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from...
Purification and characterization of gamma poly glutamic acid from newly Bacillus licheniformis NRC20.

Science.gov (United States)

Tork, Sanaa E; Aly, Magda M; Alakilli, Saleha Y; Al-Seeni, Madeha N

2015-03-01

γ-Poly glutamic acid (γ-PGA) has received considerable attention for pharmaceutical and biomedical applications. γ-PGA from the new isolate Bacillus licheniformis NRC20 was purified and characterized using diffusion distance agar plate, mass spectrometry and thin layer chromatography. All analyses indicated that γ-PGA is a homopolymer composed of glutamic acid. Its molecular weight was determined to be 1266 kDa. It was composed of L- and D-glutamic acid residues. An amplicon of 3050 representing the γ-PGA-coding genes was obtained, sequenced and submitted to the GenBank database. Its amino acid sequence showed high similarity with that obtained from B. licheniformis strains. The bacterium NRC20 was independent of L-glutamic acid, but polymer production was enhanced when it was cultivated in medium containing L-glutamic acid as the sole nitrogen source. Finally, we conclude that γ-PGA production from B. licheniformis NRC20 has many promising applications in medicine, industry and nanotechnology. Copyright © 2014 Elsevier B.V. All rights reserved.
The Biomolecule Sequencer Project: Nanopore Sequencing as a Dual-Use Tool for Crew Health and Astrobiology Investigations

Science.gov (United States)

John, K. K.; Botkin, D. S.; Burton, A. S.; Castro-Wallace, S. L.; Chaput, J. D.; Dworkin, J. P.; Lehman, N.; Lupisella, M. L.; Mason, C. E.; Smith, D. J.;

2016-01-01

Human missions to Mars will fundamentally transform how the planet is explored, enabling new scientific discoveries through more sophisticated sample acquisition and processing than can currently be implemented in robotic exploration. The presence of humans also poses new challenges, including ensuring astronaut safety and health and monitoring contamination. Because the capability to transfer materials to Earth will be extremely limited, there is a strong need for in situ diagnostic capabilities. Nucleotide sequencing is a particularly powerful tool because it can be used to: (1) mitigate microbial risks to crew by allowing identification of microbes in water, in air, and on surfaces; (2) identify optimal treatment strategies for infections that arise in crew members; and (3) track how crew members, microbes, and mission-relevant organisms (e.g., farmed plants) respond to conditions on Mars through transcriptomic and genomic changes. Sequencing would also offer benefits for science investigations occurring on the surface of Mars by permitting identification of Earth-derived contamination in samples. If Mars contains indigenous life, and that life is based on nucleic acids or other closely related molecules, sequencing would serve as a critical tool for the characterization of those molecules. Therefore, spaceflight-compatible nucleic acid sequencing would be an important capability for both crew health and astrobiology exploration. Advances in sequencing technology on Earth have been driven largely by needs for higher throughput and read accuracy. Although some reduction in size has been achieved, nearly all commercially available sequencers are not compatible with spaceflight due to size, power, and operational requirements. Exceptions are nanopore-based sequencers that measure changes in current caused by DNA passing through pores; these devices are inherently much smaller and require significantly less power than sequencers using other detection methods

Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme

Science.gov (United States)

Gallage, Nethaji J.; Hansen, Esben H.; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg

2014-01-01

Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco. PMID:24941968
The nucleotide sequence of a Polish isolate of Tomato torrado virus.

Science.gov (United States)

Budziszewska, Marta; Obrepalska-Steplowska, Aleksandra; Wieczorek, Przemysław; Pospieszny, Henryk

2008-12-01

A new virus was isolated from greenhouse tomato plants showing symptoms of leaf and apex necrosis in Wielkopolska province in Poland in 2003. The observed symptoms and the virus morphology resembled those of viruses previously reported in Spain, called Tomato torrado virus (ToTV), and in Mexico, called Tomato marchitez virus (ToMarV). The complete genome of the Polish isolate Wal'03 was determined by RT-PCR amplification using oligonucleotide primers developed against the ToTV sequences deposited in GenBank, followed by cloning, sequencing, and comparison with the sequence of the type isolate. Phylogenetic analyses, performed on the basis of fragments of polyprotein sequences, established the relationship of the Polish isolate Wal'03 with Spanish ToTV and Mexican ToMarV, as well as with other viruses from the Sequivirus, Sadwavirus, and Cheravirus genera, reported to be the most similar to the new tomato viruses. The Wal'03 genome strands have the same organization and very high homology with the ToTV type isolate, showing only some nucleotide and deduced amino acid changes, in contrast to ToMarV, which was significantly different. The phylogenetic tree clustered the aforementioned viruses in the same group, indicating that they have a common origin.
Spreadsheet macros for coloring sequence alignments.

Science.gov (United States)

Haygood, M G

1993-12-01

This article describes a set of Microsoft Excel macros designed to color amino acid and nucleotide sequence alignments for review and preparation of visual aids. The colored alignments can then be modified to emphasize features of interest. Procedures for importing and coloring sequences are described. The macro file adds a new menu to the menu bar containing sequence-related commands to enable users unfamiliar with Excel to use the macros more readily. The macros were designed for use with Macintosh computers but will also run with the DOS version of Excel.
Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS): from manual to automatic application of a 'de novo sequencing' approach.

Science.gov (United States)

Pascale, Raffaella; Grossi, Gerarda; Cruciani, Gabriele; Mecca, Giansalvatore; Santoro, Donatello; Sarli Calace, Renzo; Falabella, Patrizia; Bianco, Giuliana

The Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS) software package has been developed at the University of Basilicata in Potenza (Italy). It is designed to facilitate the determination of the amino acid sequence of a peptide, as well as the unequivocal identification of proteins in a high-throughput manner, with considerable savings in time, economic resources and expertise. The software package is a valid tool for automating a de novo sequencing approach, overcoming its main limits, and a versatile platform useful in the proteomic field for the unequivocal identification of proteins starting from tandem mass spectrometry data. The strength of this software is that it is a user-friendly and non-statistical approach, so protein identification can be considered unambiguous.
Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

Science.gov (United States)

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not present on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

Science.gov (United States)

Liang, Shaobo; McDonald, Armando G; Coats, Erik R

2015-11-01

Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using a set of 0.8 L SBRs fed gelatinized PPW at a solids content of 30 to 50 g L(-1) and a solids retention time (SRT) of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT.
In silico Analysis of osr40c1 Promoter Sequence Isolated from Indica Variety Pokkali

OpenAIRE

W.S.I. de Silva; M.M.N. Perera; K.L.N.S. Perera; A.M. Wickramasuriya; G.A.U. Jayasekera

2017-01-01

The promoter region of a drought and abscisic acid (ABA) inducible gene, osr40c1, was isolated from a salt-tolerant indica rice variety Pokkali, which is 670 bp upstream of the putative translation start codon. In silico promoter analysis of the resulting sequence showed that at least 15 types of putative motifs were distributed within the sequence, including two types of common promoter elements, TATA and CAAT boxes. Additionally, several putative cis-acting regulatory elements which may be involv...
Molecular cloning and expression analysis of jasmonic acid dependent but salicylic acid independent LeWRKY1.

Science.gov (United States)

Lu, M; Wang, L F; Du, X H; Yu, Y K; Pan, J B; Nan, Z J; Han, J; Wang, W X; Zhang, Q Z; Sun, Q P

2015-11-30

Various plant genes can be activated or inhibited by phytohormones under conditions of biotic and abiotic stress, especially in response to jasmonic acid (JA) and salicylic acid (SA). Interactions between JA and SA may be synergistic or antagonistic, depending on the stress condition. In this study, we cloned a full-length cDNA (LeWRKY1, GenBank accession No. FJ654265) from Lycopersicon esculentum by rapid amplification of cDNA ends. Sequence analysis showed that this gene is a group II WRKY transcription factor. Analysis of LeWRKY1 mRNA expression in various tissues by qRT-PCR showed that the highest and lowest expression occurred in the leaves and stems, respectively. In addition, LeWRKY1 expression was induced by JA and Botrytis cinerea Pers., but not by SA.

Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule

OpenAIRE

Stabler, Richard A.; Negus, David; Pain, Arnab; Taylor, Peter W.

2013-01-01

A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.
Protein-Protein Interactions Prediction Using a Novel Local Conjoint Triad Descriptor of Amino Acid Sequences

Directory of Open Access Journals (Sweden)

Jun Wang

2017-11-01

Protein-protein interactions (PPIs) play crucial roles in almost all cellular processes. Although a large number of PPIs have been verified by high-throughput techniques in the past decades, currently known PPI pairs are still far from complete. Furthermore, the wet-lab experimental techniques for detecting PPIs are time-consuming and expensive. Hence, it is urgent and essential to develop automatic computational methods to efficiently and accurately predict PPIs. In this paper, a sequence-based approach called DNN-LCTD is developed by combining deep neural networks (DNNs) and a novel local conjoint triad description (LCTD) feature representation. LCTD incorporates the advantages of local description and conjoint triad; thus, it is able to account for the interactions between residues in both continuous and discontinuous regions of amino acid sequences. DNNs can not only learn suitable features from the data by themselves, but also learn and discover hierarchical representations of data. When applied to the PPI data of Saccharomyces cerevisiae, DNN-LCTD achieves superior performance, with an accuracy of 93.12%, a precision of 93.75%, a sensitivity of 93.83% and an area under the receiver operating characteristic curve (AUC) of 97.92%, and it needs only 718 s. These results indicate that DNN-LCTD is very promising for predicting PPIs. DNN-LCTD can be a useful supplementary tool for future proteomics studies.
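The conjoint triad idea mentioned in the abstract above can be illustrated with a short Python sketch. This is not the authors' LCTD descriptor or their DNN; it shows only the plain conjoint triad encoding on which such descriptors build. The seven residue classes in `CLASSES` follow a grouping commonly used for this encoding and are an assumption here, since the abstract does not list them; the function name `conjoint_triad` is hypothetical.

```python
from itertools import product

# Seven residue classes commonly used for conjoint triad encoding.
# Assumption: the abstract does not state which grouping DNN-LCTD uses.
CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
RESIDUE_TO_CLASS = {aa: i for i, group in enumerate(CLASSES) for aa in group}

# Fixed position of each of the 7 * 7 * 7 = 343 possible class triads.
TRIAD_INDEX = {t: i for i, t in enumerate(product(range(7), repeat=3))}


def conjoint_triad(sequence):
    """Return 343 normalized frequencies of consecutive class triads."""
    # Residues outside the 20 standard amino acids (e.g. X) are dropped,
    # which makes their neighbours adjacent; a simplification for the sketch.
    classes = [RESIDUE_TO_CLASS[aa] for aa in sequence.upper()
               if aa in RESIDUE_TO_CLASS]
    counts = [0.0] * len(TRIAD_INDEX)
    for i in range(len(classes) - 2):
        counts[TRIAD_INDEX[tuple(classes[i:i + 3])]] += 1.0
    total = sum(counts)
    return [c / total for c in counts] if total else counts


if __name__ == "__main__":
    features = conjoint_triad("MKVLAAGIVGRK")
    print(len(features), sum(1 for f in features if f > 0))
```

A "local" variant would compute such vectors over several sub-regions of the sequence and concatenate them; how DNN-LCTD chooses those regions is not described in the abstract.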
The complete genome sequence of a south Indian isolate of Rice tungro spherical virus reveals evidence of genetic recombination between distinct isolates.

Science.gov (United States)

Sailaja, B; Anjum, Najreen; Patil, Yogesh K; Agarwal, Surekha; Malathi, P; Krishnaveni, D; Balachandran, S M; Viraktamath, B C; Mangrauthia, Satendra K

2013-12-01

In this study, the complete genome of a south Indian isolate of Rice tungro spherical virus (RTSV) from Andhra Pradesh (AP) was sequenced, and the predicted amino acid sequence was analysed. The RTSV RNA genome consists of 12,171 nt without the poly(A) tail, encoding a putative typical polyprotein of 3,470 amino acids. Furthermore, cleavage sites and sequence motifs of the polyprotein were predicted. Multiple alignment with other RTSV isolates showed a nucleotide sequence identity of 95% to east Indian isolates and 90% to Philippines isolates. A phylogenetic tree based on the complete genome sequence showed that Indian isolates clustered together, while the Vt6 and PhilA isolates of the Philippines formed two separate clusters. Twelve recombination events were detected in the RNA genome of RTSV using the Recombination Detection Program version 3. Recombination analysis suggested a significant role for the 5' end and central region of the genome in virus evolution. Further, the AP and Odisha isolates appeared to be important RTSV isolates involved in the diversification of this virus in India through recombination. The addition of the complete genome of the first south Indian isolate provided an opportunity to establish the molecular evolution of RTSV through recombination analysis and phylogenetic relationships.
Comparative sequence, structure and redox analyses of Klebsiella pneumoniae DsbA show that anti-virulence target DsbA enzymes fall into distinct classes.

Directory of Open Access Journals (Sweden)

Fabian Kurth

Bacterial DsbA enzymes catalyze oxidative folding of virulence factors, and have been identified as targets for antivirulence drugs. However, DsbA enzymes characterized to date exhibit a wide spectrum of redox properties and divergent structural features compared to the prototypical DsbA enzyme of Escherichia coli (EcDsbA). Nonetheless, sequence analysis shows that DsbAs are more highly conserved than their known substrate virulence factors, highlighting the potential to inhibit virulence across a range of organisms by targeting DsbA. For example, Salmonella enterica Typhimurium DsbA (SeDsbA, 86% sequence identity to EcDsbA) shares almost identical structural, surface and redox properties. Using comparative sequence and structure analysis we predicted that five other bacterial DsbAs would share these properties. To confirm this, we characterized Klebsiella pneumoniae DsbA (KpDsbA, 81% identity to EcDsbA). As expected, the redox properties, structure and surface features (from crystal and NMR data) of KpDsbA were almost identical to those of EcDsbA and SeDsbA. Moreover, KpDsbA and EcDsbA bind peptides derived from their respective DsbBs with almost equal affinity, supporting the notion that compounds designed to inhibit EcDsbA will also inhibit KpDsbA. Taken together, our data show that DsbAs fall into different classes; that DsbAs within a class may be predicted by sequence analysis of binding loops; that DsbAs within a class are able to complement one another in vivo and that compounds designed to inhibit EcDsbA are likely to inhibit DsbAs within the same class.
CLONING AND SEQUENCING OF PGIP FROM ‘JIN SERIES’ ALMOND (PRUNUS DULCIS)

Directory of Open Access Journals (Sweden)

Yuhu Han

2015-12-01

Specific primers synthesized according to conserved regions of the polygalacturonase inhibiting protein (PGIP) gene were used to amplify Prunus dulcis genomic DNA by polymerase chain reaction (PCR). Six bands (pgip1, pgip2, pgip3, pgip4, pgip5 and pgip6) of genes were obtained and cloned into PBS-T vector. According to the length of the bands, 717 bp, 864 bp and 796 bp were A1 (pgip1, pgip2, pgip3), A2 (pgip4) and A4 (pgip5, pgip6), respectively. DNA sequences showed that the fragments taken together were the gene encoding PGIP. A2 and A3 contained two exons interrupted by one intron, which has a GT-AG sequence. The DNA and amino acid sequences were highly homologous to those from Prunus persica, Prunus salicina, Prunus americana and Prunus mume. A conserved leucine-rich fragment exists in the derived protein sequence.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

Science.gov (United States)

Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

2002-11-01

The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Overexpression of a Protein Phosphatase 2C from Beech Seeds in Arabidopsis Shows Phenotypes Related to Abscisic Acid Responses and Gibberellin Biosynthesis

Science.gov (United States)

Reyes, David; Rodríguez, Dolores; González-García, Mary Paz; Lorenzo, Oscar; Nicolás, Gregorio; García-Martínez, José Luis; Nicolás, Carlos

2006-01-01

A functional abscisic acid (ABA)-induced protein phosphatase type 2C (PP2C) was previously isolated from beech (Fagus sylvatica) seeds (FsPP2C2). Because transgenic work is not possible in beech, in this study we overexpressed this gene in Arabidopsis (Arabidopsis thaliana) to provide genetic evidence on FsPP2C2 function in seed dormancy and other plant responses. In contrast with other PP2Cs described so far, constitutive expression of FsPP2C2 in Arabidopsis, under the cauliflower mosaic virus 35S promoter, produced enhanced sensitivity to ABA and abiotic stress in seeds and vegetative tissues, dwarf phenotype, and delayed flowering, and all these effects were reversed by gibberellic acid application. The levels of active gibberellins (GAs) were reduced in 35S:FsPP2C2 plants, although transcript levels of AtGA20ox1 and AtGA3ox1 increased, probably as a result of negative feedback regulation, whereas the expression of GASA1 was induced by GAs. Additionally, FsPP2C2-overexpressing plants showed a strong induction of the Responsive to ABA 18 (RAB18) gene. Interestingly, FsPP2C2 contains two nuclear targeting sequences, and transient expression assays revealed that ABA directed this protein to the nucleus. Whereas other plant PP2Cs have been shown to act as negative regulators, our results support the hypothesis that FsPP2C2 is a positive regulator of ABA. Moreover, our results indicate the existence of potential cross-talk between ABA signaling and GA biosynthesis. PMID:16815952
Probe kit for identifying a base in a nucleic acid

Science.gov (United States)

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Molecular cloning and nucleotide sequence of cDNA for human liver arginase

International Nuclear Information System (INIS)

Haraguchi, Y.; Takiguchi, M.; Amaya, Y.; Kawamoto, S.; Matsuda, I.; Mori, M.

1987-01-01

Arginase (EC 3.5.3.1) catalyzes the last step of the urea cycle in the liver of ureotelic animals. Inherited deficiency of the enzyme results in argininemia, an autosomal recessive disorder characterized by hyperammonemia. To facilitate investigation of the enzyme and gene structures and to elucidate the nature of the mutation in argininemia, the authors isolated cDNA clones for human liver arginase. Oligo(dT)-primed and random primer human liver cDNA libraries in λgt11 were screened using isolated rat arginase cDNA as a probe. Two of the positive clones, designated λhARG6 and λhARG109, contained an overlapping cDNA sequence with an open reading frame encoding a polypeptide of 322 amino acid residues (predicted M_r, 34,732), a 5'-untranslated sequence of 56 base pairs, a 3'-untranslated sequence of 423 base pairs, and a poly(A) segment. Arginase activity was detected in Escherichia coli cells transformed with the plasmid carrying the λhARG6 cDNA insert. RNA gel blot analysis of human liver RNA showed a single mRNA of 1.6 kilobases. The predicted amino acid sequence of human liver arginase is 87% and 41% identical with those of the rat liver and yeast enzymes, respectively. There are several highly conserved segments among the human, rat, and yeast enzymes.
Sequence Algebra, Sequence Decision Diagrams and Dynamic Fault Trees

International Nuclear Information System (INIS)

Rauzy, Antoine B.

2011-01-01

Considerable attention has been focused on Dynamic Fault Trees in the past few years. By adding new gates to static (regular) Fault Trees, Dynamic Fault Trees aim to take into account dependencies among events. Merle et al. recently proposed an algebraic framework to give a formal interpretation to these gates. In this article, we extend Merle et al.'s work by adopting a slightly different perspective. We introduce Sequence Algebras that can be seen as Algebras of Basic Events, representing failures of non-repairable components. We show how to interpret Dynamic Fault Trees within this framework. Finally, we propose a new data structure to encode sets of sequences of Basic Events: Sequence Decision Diagrams. Sequence Decision Diagrams are very much inspired by Minato's Zero-Suppressed Binary Decision Diagrams. We show that all operations of Sequence Algebras can be performed on this data structure.
Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST

Directory of Open Access Journals (Sweden)

Nalin CW Goonesekere

2009-06-01

Nalin CW Goonesekere, Department of Chemistry and Biochemistry, University of Northern Iowa, Cedar Falls, IA, USA. Abstract: The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between protein sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-BLAST, the structure-based substitution matrices enhance the efficacy of detecting remote homologs. Keywords: computational biology, protein homology, amino acid substitution matrix, protein structure
Thermodynamics of sequence-specific binding of PNA to DNA

DEFF Research Database (Denmark)

Ratilainen, T; Holmén, A; Tuite, E

2000-01-01

For further characterization of the hybridization properties of peptide nucleic acids (PNAs), the thermodynamics of hybridization of mixed sequence PNA-DNA duplexes have been studied. We have characterized the binding of PNA to DNA in terms of binding affinity (perfectly matched duplexes) and seq...
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

Science.gov (United States)

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumoniae and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
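The relation between the reported ORF length (567 bp) and protein length (188 amino acids) can be checked with simple arithmetic. The Python sketch below assumes the ORF length includes the stop codon, which is what makes the two reported numbers agree. The function name is hypothetical, and the figure of about 110 Da per residue is a common rule of thumb, not a value taken from the abstract.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("an ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    residues = orf_protein_length(567)
    print(residues)  # 567 / 3 = 189 codons, minus the stop codon = 188

    # Rough mass estimate using ~110 Da per residue (rule of thumb only).
    print(round(residues * 110 / 1000, 1), "kDa (approximate)")
```

The rough estimate of about 20.7 kDa is close to the calculated 20.4 kDa quoted in the abstract; an exact value depends on the actual residue composition.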
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

Science.gov (United States)

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted to GenBank under accession number KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumoniae and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have an 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST.

Science.gov (United States)

Goonesekere, Nalin Cw

2009-01-01

The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between protein sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-BLAST, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
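How a substitution matrix is derived from aligned residues can be sketched with the general log-odds form, score = log2(observed pair frequency / frequency expected by chance). The Python sketch below is a toy illustration of that general idea only; the abstract does not describe the authors' exact procedure for the structure-based matrices, and the function name and toy input are hypothetical.

```python
import math
from collections import Counter


def log_odds_scores(aligned_pairs):
    """Log-odds (base 2) scores from aligned residue pairs.

    aligned_pairs: iterable of (a, b) residues taken from alignment columns.
    Returns a dict keyed by the alphabetically sorted residue pair.
    """
    pair_counts = Counter()
    residue_counts = Counter()
    for a, b in aligned_pairs:
        pair_counts[tuple(sorted((a, b)))] += 1
        residue_counts[a] += 1
        residue_counts[b] += 1
    n_pairs = sum(pair_counts.values())
    n_residues = sum(residue_counts.values())

    scores = {}
    for (a, b), n in pair_counts.items():
        observed = n / n_pairs
        p_a = residue_counts[a] / n_residues
        p_b = residue_counts[b] / n_residues
        # Unordered pairs of different residues can arise in two ways.
        expected = p_a * p_b if a == b else 2 * p_a * p_b
        scores[(a, b)] = math.log2(observed / expected)
    return scores


if __name__ == "__main__":
    toy_pairs = [("A", "A"), ("A", "A"), ("A", "G"), ("G", "G")]
    for pair, score in sorted(log_odds_scores(toy_pairs).items()):
        print(pair, round(score, 2))
```

Pairs seen more often than chance get positive scores and pairs seen less often get negative ones. In a structure-derived matrix the aligned pairs would come from structural alignments rather than from sequence-only alignments.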
Subset of Kappa and Lambda Germline Sequences Result in Light Chains with a Higher Molecular Mass Phenotype.

Science.gov (United States)

Barnidge, David R; Lundström, Susanna L; Zhang, Bo; Dasari, Surendra; Murray, David L; Zubarev, Roman A

2015-12-04

In our previous work, we showed that electrospray ionization of intact polyclonal kappa and lambda light chains isolated from normal serum generates two distinct, Gaussian-shaped, molecular mass distributions representing the light-chain repertoire. During the analysis of a large (>100) patient sample set, we noticed a low-intensity molecular mass distribution with a mean of approximately 24 250 Da, roughly 800 Da higher than the typical kappa molecular-mass distribution mean of 23 450 Da. We also observed distinct clones in this region that did not appear to contain any typical post-translational modifications that would account for such a large mass shift. To determine the origin of the high molecular mass clones, we performed de novo bottom-up mass spectrometry on a purified IgM monoclonal light chain that had a calculated molecular mass of 24 275.03 Da. The entire sequence of the monoclonal light chain was determined using multienzyme digestion and de novo sequence-alignment software and was found to belong to the germline allele IGKV2-30. The alignment of kappa germline sequences revealed ten IGKV2 and one IGKV4 sequences that contained additional amino acids in their CDR1 region, creating the high-molecular-mass phenotype. We also performed an alignment of lambda germline sequences, which showed additional amino acids in the CDR2 region and the FR3 region of functional germline sequences that result in a high-molecular-mass phenotype. The work presented here illustrates the ability of mass spectrometry to provide information on the diversity of light-chain molecular mass phenotypes in circulation, which reflects the germline sequences selected by the immunoglobulin-secreting B-cell population.
Direct quantification of human cytomegalovirus immediate-early and late mRNA levels in blood of lung transplant recipients by competitive nucleic acid sequence-based amplification

NARCIS (Netherlands)

Greijer, AE; Verschuuren, EAM; Harmsen, MC; Dekkers, CAJ; Adriaanse, HMA; The, TH; Middeldorp, JM

The dynamics of active human cytomegalovirus (HCMV) infection was monitored by competitive nucleic acid sequence-based amplification (NASBA) assays for quantification of IE1 (UL123) and pp67 (UL65) mRNA expression levels in the blood of patients after lung transplantation. RNA was isolated from 339
Microbial diversity of acidic hot spring (Kawah Hujan B) in geothermal field of Kamojang area, West Java-Indonesia.

Science.gov (United States)

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia were examined using culture-dependent and culture-independent strategies. Chemical analysis of the hot spring water showed a characteristic of acidic-sulfate geothermal activity, with high sulfate concentrations and low pH values (pH 1.8 to 1.9). The microbial community present in the spring was characterized by 16S rRNA gene analysis combined with denaturing gradient gel electrophoresis (DGGE). The majority of the sequences recovered by the culture-independent method were closely related to the Crenarchaeota and Proteobacteria phyla. However, detailed comparison among the members of Crenarchaeota showed some sequence variation compared with the published data, especially in the hypervariable and variable regions. In addition, the sequences did not belong to a particular genus. Meanwhile, the 16S rDNA sequences from culture-dependent samples were mostly close to Firmicutes and gamma Proteobacteria.
RNA Sequencing Identifies Upregulated Kyphoscoliosis Peptidase and Phosphatidic Acid Signaling Pathways in Muscle Hypertrophy Generated by Transgenic Expression of Myostatin Propeptide

Directory of Open Access Journals (Sweden)

Yuanxin Miao

2015-04-01

Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.
RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

Science.gov (United States)

Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

2015-04-09

Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

Differentiation of sheep pox and goat poxviruses by sequence analysis and PCR-RFLP of P32 gene.

Science.gov (United States)

Hosamani, Madhusudan; Mondal, Bimalendu; Tembhurne, Prabhakar A; Bandyopadhyay, Santanu Kumar; Singh, Raj Kumar; Rasool, Thaha Jamal

2004-08-01

Sheep pox and goat pox are highly contagious viral diseases of small ruminants. These diseases were earlier thought to be caused by a single species of virus, as they are serologically indistinguishable. P32, one of the major immunogenic genes of Capripoxvirus, was isolated and sequenced from two Indian isolates of goat poxvirus (GPV) and a vaccine strain of sheep poxvirus (SPV). The sequences were compared with other P32 sequences of capripoxviruses available in the database. Sequence analysis revealed that sheep pox and goat poxviruses share 97.5 and 94.7% homology at the nucleotide and amino acid levels, respectively. A major difference between them is the presence of an additional aspartic acid at the 55th position of P32 of sheep poxvirus that is absent in both goat poxvirus and lumpy skin disease virus (LSDV). Further, six unique neutral nucleotide substitutions were observed at positions 77, 275, 403, 552, 867 and 964 in the sequence of goat poxvirus, which can be taken as GPV signature residues. Similar unique nucleotide signatures could also be identified in SPV and LSDV sequences. Phylogenetic analysis showed that members of the genus Capripoxvirus could be delineated into three distinct clusters of GPV, SPV and LSDV based on the P32 genomic sequence. Using this information, a PCR-RFLP method has been developed for unequivocal genomic differentiation of SPV and GPV.
Reaction mixtures formed by nitrite and selected sulfa-drugs showed mutagenicity in acidic medium

Directory of Open Access Journals (Sweden)

Claudia Trossero

2009-01-01

Nitrite, which is present in preserved meat and can be produced in the oral cavity by reduction of nitrate taken from vegetables, could react in the stomach with nitrosatable drugs, giving genotoxic-carcinogenic N-nitroso compounds (NOC). The mutagenicity of reaction mixtures formed by sodium nitrite and selected sulfa-drugs (sulfathiazole, HST; phthalylsulfathiazole, PhST; complex Co(II)-sulfathiazole, Co(II)-ST) in acidic medium was evaluated using the Salmonella typhimurium reverse mutation assay (Ames test), with TA98 and TA100 strains. The reactions were carried out at room temperature, with a mole ratio [nitrite]/[sulfa-drug] > 1. The three reaction mixtures showed mutagenic effects in the considered range.
Enhanced anti-HIV-1 activity of G-quadruplexes comprising locked nucleic acids and intercalating nucleic acids

DEFF Research Database (Denmark)

Pedersen, Erik Bjerregaard; Nielsen, Jakob Toudahl; Nielsen, Claus

2011-01-01

Two G-quadruplex forming sequences, 5'-TGGGAG and the 17-mer sequence T30177, which exhibit anti-HIV-1 activity on cell lines, were modified using either locked nucleic acids (LNA) or via insertions of (R)-1-O-(pyren-1-ylmethyl)glycerol (intercalating nucleic acid, INA) or (R)-1-O-[4-(1......-pyrenylethynyl)phenylmethyl]glycerol (twisted intercalating nucleic acid, TINA). Incorporation of LNA or INA/TINA monomers provides as much as an 8-fold improvement of anti-HIV-1 activity. We demonstrate for the first time a detailed analysis of the effect of the incorporation of INA/TINA monomers in quadruplex forming...
Murine mammary tumor virus pol-related sequences in human DNA: characterization and sequence comparison with the complete murine mammary tumor virus pol gene

International Nuclear Information System (INIS)

Deen, K.C.; Sweet, R.W.

1986-01-01

Sequences in the human genome with homology to the murine mammary tumor virus (MMTV) pol gene were isolated from a human phage library. Ten clones with extensive pol homology were shown to define five separate loci. These loci share common sequences immediately adjacent to the pol-like segments and, in addition, contain a related repeat element which bounds this region. This organization is suggestive of a proviral structure. The authors estimate that the human genome contains 30 to 40 copies of these pol-related sequences. The pol region of one of the cloned segments (HM16) and the complete MMTV pol gene were sequenced and compared. The nucleotide homology between these pol sequences is 52% and is concentrated in the terminal regions. The MMTV pol gene contains a single long open reading frame encoding 899 amino acids and is demarcated from the partially overlapping putative gag gene by termination codons and a shift in translational reading frame. The pol sequence of HM16 is multiply terminated but does contain open reading frames which encode 370, 105, and 112 amino acid residues in separate reading frames. The authors deduced a composite pol protein sequence for HM16 by aligning it to the MMTV pol gene and then compared these sequences with other retroviral pol protein sequences. Conserved sequences occur in both the amino and carboxyl regions, which lie within the polymerase and endonuclease domains of pol, respectively.
Procedures of amino acid sequencing of peptides in natural proteins collection of knowledge and intelligence for construction of reliable chemical inference system

OpenAIRE

Kudo, Yoshihiro; Kanaya, Shigehiko

1994-01-01

In order to establish a reliable chemical inference system for amino acid sequencing of natural peptides, as many kinds of relevant knowledge and intelligence as possible are collected. Topics include didemnins, dolastatin 3, TL-119 and/or A-3302-B, mycosubtilin, patellamide A, duramycin (and cinnamycin), bottromycin A2, A19009, galantin I, vancomycin, stenothricin, calf spleen profilin, neocarzinostatin, pancreatic spasmolytic polypeptide, cerebratulus toxin B-IV, RNase U2, ferredoxin ...
Biodegradation of clofibric acid and identification of its metabolites

International Nuclear Information System (INIS)

Salgado, R.; Oehmen, A.; Carvalho, G.; Noronha, J.P.; Reis, M.A.M.

2012-01-01

Graphical abstract: Metabolites produced during clofibric acid biodegradation. Highlights: Clofibric acid is biodegradable; mainly heterotrophic bacteria degraded the clofibric acid; metabolites of clofibric acid biodegradation were identified; the metabolic pathway of clofibric acid biodegradation is proposed. Abstract: Clofibric acid (CLF) is the pharmaceutically active metabolite of the lipid regulators clofibrate, etofibrate and etofyllinclofibrate, and it is considered both environmentally persistent and refractory. This work studied the biotransformation of CLF in aerobic sequencing batch reactors (SBRs) with mixed microbial cultures, monitoring the efficiency of biotransformation of CLF and the production of metabolites. The maximum removal achieved was 51% biodegradation (initial CLF concentration = 2 mg L⁻¹), where adsorption and abiotic removal mechanisms were shown to be negligible, showing that CLF is indeed biodegradable. Tests showed that the observed CLF biodegradation was mainly carried out by heterotrophic bacteria. Three main metabolites were identified, including α-hydroxyisobutyric acid, lactic acid and 4-chlorophenol. The latter is known to exhibit higher toxicity than the parent compound, but it did not accumulate in the SBRs. α-Hydroxyisobutyric acid and lactic acid accumulated for a period, where nitrite accumulation may have been responsible for inhibiting their degradation. A metabolic pathway for the biodegradation of CLF is proposed in this study.
Comparison of complete genome sequences of dog rabies viruses isolated from China and Mexico reveals key amino acid changes that may be associated with virus replication and virulence.

Science.gov (United States)

Yu, Fulai; Zhang, Guoqing; Zhong, Xiangfu; Han, Na; Song, Yunfeng; Zhao, Ling; Cui, Min; Rayner, Simon; Fu, Zhen F

2014-07-01

Rabies is a global problem, but its impact and prevalence vary across different regions. In some areas, such as parts of Africa and Asia, the virus is prevalent in the domestic dog population, leading to epidemic waves and large numbers of human fatalities. In other regions, such as the Americas, the virus predominates in wildlife and bat populations, with sporadic spillover into domestic animals. In this work, we attempted to investigate whether these distinct environments led to selective pressures that result in measurable changes within the genome at the amino acid level. To this end, we collected and sequenced the full genome of two isolates from divergent environments. The first isolate (DRV-AH08) was from China, where the virus is present in the dog population and the country is experiencing a serious epidemic. The second isolate (DRV-Mexico) was taken from Mexico, where the virus is present in both wildlife and domestic dog populations, but at low levels as a consequence of an effective vaccination program. We then combined and compared these with other full genome sequences to identify distinct amino acid changes that might be associated with environment. Phylogenetic analysis identified strain DRV-AH08 as belonging to the China-I lineage, which has emerged to become the dominant lineage in the current epidemic. The Mexico strain was placed in the D11 Mexico lineage, associated with the West USA-Mexico border clade. Amino acid sequence analysis identified only 17 amino acid differences in the N, G and L proteins. These differences may be associated with virus replication and virulence, for example, the short incubation period observed in the current epidemic in China.
Nucleotide sequence of tomato ringspot virus RNA-2.

Science.gov (United States)

Rott, M E; Tremaine, J H; Rochon, D M

1991-07-01

The sequence of tomato ringspot virus (TomRSV) RNA-2 has been determined. It is 7273 nucleotides in length excluding the 3' poly(A) tail and contains a single long open reading frame (ORF) of 5646 nucleotides in the positive sense beginning at position 78 and terminating at position 5723. A second in-frame AUG at position 441 is in a more favourable context for initiation of translation and may act as a site for initiation of translation. The TomRSV RNA-2 3' noncoding region is 1550 nucleotides in length. The coat protein is located in the C-terminal region of the large polypeptide and shows significant but limited amino acid sequence similarity to the putative coat proteins of the nepoviruses tomato black ring (TBRV), Hungarian grapevine chrome mosaic (GCMV) and grapevine fanleaf (GFLV). Comparisons of the coding and non-coding regions of TomRSV RNA-2 and the RNA components of TBRV, GCMV, GFLV and the comovirus cowpea mosaic virus revealed significant similarity for over 300 amino acids between the coding region immediately to the N-terminal side of the putative coat proteins of TomRSV and GFLV; very little similarity could be detected among the non-coding regions of TomRSV and any of these viruses.
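The coordinates quoted in this abstract can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the stated ORF length includes the stop codon (the abstract does not say); the variable and function names are illustrative only.

```python
# Figures quoted in the abstract (assumed 1-based, inclusive coordinates).
GENOME_LENGTH = 7273  # nucleotides, excluding the 3' poly(A) tail
ORF_START = 78
ORF_END = 5723
SECOND_AUG = 441


def span_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


orf_length = span_length(ORF_START, ORF_END)
three_prime_noncoding = GENOME_LENGTH - ORF_END
five_prime_leader = ORF_START - 1
# An AUG is in frame with the ORF if its offset from the start is a multiple of 3.
second_aug_in_frame = (SECOND_AUG - ORF_START) % 3 == 0

print("ORF length:", orf_length)
print("ORF is a whole number of codons:", orf_length % 3 == 0)
print("3' noncoding length:", three_prime_noncoding)
print("5' leader length:", five_prime_leader)
print("AUG at 441 in frame:", second_aug_in_frame)
```

Under these assumptions the stated ORF length, the 3' noncoding length and the in-frame status of the second AUG all agree with the quoted positions.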
Mammalian prions: tolerance to sequence changes-how far?

Science.gov (United States)

Salamat, Muhammad Khalid; Munoz-Montesino, Carola; Moudjou, Mohammed; Rezaei, Human; Laude, Hubert; Béringue, Vincent; Dron, Michel

2013-01-01

Upon prion infection, abnormal prion protein (PrP(Sc)) self-perpetuates by conformational conversion of α-helix-rich PrP(C) into a β-sheet-enriched form, leading to formation and deposition of PrP(Sc) aggregates in affected brains. However, the process remains poorly understood at the molecular level and the regions of PrP critical for conversion are still debated. Minimal amino acid substitutions can impair prion replication at many places in PrP. Conversely, we recently showed that bona fide prions could be generated after introduction of eight and up to 16 additional amino acids in the H2-H3 inter-helix loop of PrP. Prion replication also accommodated the insertions of an octapeptide at different places in the last turns of H2. This reverse genetic approach reveals an unexpected tolerance of prions to substantial sequence changes in the protease-resistant part which is associated with infectivity. It also demonstrates that conversion does not require the presence of a specific sequence in the middle of the H2-H3 area. We discuss the implications of our findings according to different structural models proposed for PrP(Sc) and question the postulated existence of an N- or C-terminal prion domain in the protease-resistant region.
Whole-Genome Sequence Analysis of Bombella intestini LMG 28161T, a Novel Acetic Acid Bacterium Isolated from the Crop of a Red-Tailed Bumble Bee, Bombus lapidarius.

Directory of Open Access Journals (Sweden)

Leilei Li

The whole-genome sequence of Bombella intestini LMG 28161T, an endosymbiotic acetic acid bacterium (AAB) occurring in bumble bees, was determined to investigate the molecular mechanisms underlying its metabolic capabilities. The draft genome sequence of B. intestini LMG 28161T was 2.02 Mb. Metabolic carbohydrate pathways were in agreement with the metabolite analyses of fermentation experiments and revealed its oxidative capacity towards sucrose, D-glucose, D-fructose and D-mannitol, but not ethanol and glycerol. The results of the fermentation experiments also demonstrated that the lack of effective aeration in small-scale carbohydrate consumption experiments may be responsible for the lack of reproducibility of such results in taxonomic studies of AAB. Finally, compared to the genome sequences of its nearest phylogenetic neighbor and of three other insect-associated AAB strains, the B. intestini LMG 28161T genome lost 69 orthologs and included 89 unique genes. Although many of the latter were hypothetical, they also included several type IV secretion system proteins, amino acid transporter/permeases and membrane proteins which might play a role in the interaction with the bumble bee host.
Mapping a nucleolar targeting sequence of an RNA binding nucleolar protein, Nop25

International Nuclear Information System (INIS)

Fujiwara, Takashi; Suzuki, Shunji; Kanno, Motoko; Sugiyama, Hironobu; Takahashi, Hisaaki; Tanaka, Junya

2006-01-01

Nop25 is a putative RNA binding nucleolar protein associated with rRNA transcription. The present study was undertaken to determine the mechanism of Nop25 localization in the nucleolus. Deletion experiments of Nop25 amino acid sequence showed Nop25 to contain a nuclear targeting sequence in the N-terminal and a nucleolar targeting sequence in the C-terminal. By expressing derivative peptides from the C-terminal as GFP-fusion proteins in the cells, a lysine and arginine residue-enriched peptide (KRKHPRRAQDSTKKPPSATRTSKTQRRRR) allowed a GFP-fusion protein to be transported and fully retained in the nucleolus. When the peptide was fused with cMyc epitope and expressed in the cells, a cMyc epitope was then detected in the nucleolus. Nop25 did not localize in the nucleolus by deletion of the peptide from Nop25. Furthermore, deletion of a subdomain (KRKHPRRAQ) in the peptide or amino acid substitution of lysine and arginine residues in the subdomain resulted in the loss of Nop25 nucleolar localization. These results suggest that the lysine and arginine residue-enriched peptide is the most prominent nucleolar targeting sequence of Nop25 and that the long stretch of basic residues might play an important role in the nucleolar localization of Nop25. Although Nop25 contained putative SUMOylation, phosphorylation and glycosylation sites, the amino acid substitution in these sites had no effect on the nucleolar localization, thus suggesting that these post-translational modifications did not contribute to the localization of Nop25 in the nucleolus. The treatment of the cells, which expressed a GFP-fusion protein with a nucleolar targeting sequence of Nop25, with RNase A resulted in a complete dislocation of the protein from the nucleolus. These data suggested that the nucleolar targeting sequence might therefore play an important role in the binding of Nop25 to RNA molecules and that the RNA binding of Nop25 might be essential for the nucleolar localization of Nop25
Molecular cloning and sequence analysis of a phenylalanine ammonia-lyase gene from dendrobium.

Directory of Open Access Journals (Sweden)

Qing Jin

In this study, a phenylalanine ammonia-lyase (PAL) gene was cloned from Dendrobium candidum using homology cloning and RACE. The full-length PAL cDNA of D. candidum (designated Dc-PAL1, GenBank No. JQ765748) has 2,458 bp and contains a complete open reading frame (ORF) of 2,142 bp, which encodes 713 amino acid residues. The amino acid sequence of DcPAL1 has more than 80% sequence identity with the PALs of other plants, as indicated by multiple alignments. The dominant sites and catalytic active sites, which are similar to those found in PAL proteins of Arabidopsis thaliana and Nicotiana tabacum, are also found in DcPAL1. Phylogenetic tree analysis revealed that DcPAL is more closely related to PALs from Orchidaceae plants than to those of other plants. The differential expression patterns of PAL in protocorm-like body, leaf, stem, and root suggest that the PAL gene performs multiple physiological functions in Dendrobium candidum.
Human cyclophilin B: A second cyclophilin gene encodes a peptidyl-prolyl isomerase with a signal sequence

International Nuclear Information System (INIS)

Price, E.R.; Zydowsky, L.D.; Jin, Mingjie; Baker, C.H.; McKeon, F.D.; Walsh, C.T.

1991-01-01

The authors report the cloning and characterization of a cDNA encoding a second human cyclosporin A-binding protein (hCyPB). Homology analyses reveal that hCyPB is a member of the cyclophilin B (CyPB) family, which includes yeast CyPB, Drosophila nina A, and rat cyclophilin-like protein. This family is distinguished from the cyclophilin A (CyPA) family by the presence of endoplasmic reticulum (ER)-directed signal sequences. hCyPB has a hydrophobic leader sequence not found in hCyPA, and its first 25 amino acids are removed upon expression in Escherichia coli. Moreover, they show that hCyPB is a peptidyl-prolyl cis-trans isomerase which can be inhibited by cyclosporin A. These observations suggest that other members of the CyPB family will have similar enzymatic properties. Sequence comparisons of the CyPB proteins show a central, 165-amino acid peptidyl-prolyl isomerase and cyclosporin A-binding domain, flanked by variable N-terminal and C-terminal domains. These two variable regions may impart compartmental specificity and regulation to this family of cyclophilin proteins containing the conserved core domain. Northern blot analyses show that hCyPB mRNA is expressed in the Jurkat T-cell line, consistent with its possible target role in cyclosporin A-mediated immunosuppression.
Synaptotagmin gene content of the sequenced genomes

Directory of Open Access Journals (Sweden)

Craxton Molly

2004-07-01

Background: Synaptotagmins exist as a large gene family in mammals. There is much interest in the function of certain family members which act crucially in the regulated synaptic vesicle exocytosis required for efficient neurotransmission. Knowledge of the functions of other family members is relatively poor and the presence of Synaptotagmin genes in plants indicates a role for the family as a whole which is wider than neurotransmission. Identification of the Synaptotagmin genes within completely sequenced genomes can provide the entire Synaptotagmin gene complement of each sequenced organism. Defining the detailed structures of all the Synaptotagmin genes and their encoded products can provide a useful resource for functional studies and a deeper understanding of the evolution of the gene family. The current rapid increase in the number of sequenced genomes from different branches of the tree of life, together with the public deposition of evolutionarily diverse transcript sequences, make such studies worthwhile. Results: I have compiled a detailed list of the Synaptotagmin genes of Caenorhabditis, Anopheles, Drosophila, Ciona, Danio, Fugu, Mus, Homo, Arabidopsis and Oryza by examining genomic and transcript sequences from public sequence databases together with some transcript sequences obtained by cDNA library screening and RT-PCR. I have compared all of the genes and investigated the relationship between plant Synaptotagmins and their non-Synaptotagmin counterparts. Conclusions: I have identified and compared 98 Synaptotagmin genes from 10 sequenced genomes. Detailed comparison of transcript sequences reveals abundant and complex variation in Synaptotagmin gene expression and indicates the presence of Synaptotagmin genes in all animals and land plants. Amino acid sequence comparisons indicate patterns of conservation and diversity in function. Phylogenetic analysis shows the origin of Synaptotagmins in multicellular eukaryotes and their
Resolution of a protein sequence ambiguity by X-ray crystallographic and mass spectrometric methods

International Nuclear Information System (INIS)

Keefe, L.J.; Lattman, E.E.; Wolkow, C.; Woods, A.; Chevrier, M.; Cotter, R.J.

1992-01-01

Ambiguities in amino acid sequences are a potential problem in X-ray crystallographic studies of proteins. Amino acid side chains often cannot be reliably identified from the electron density. Many protein crystal structures that are now being solved are simple variants of a known wild-type structure. Thus, cloning artifacts or other untoward events can readily lead to cases in which the proposed sequence is not correct. An example is presented showing that mass spectrometry provides an excellent tool for analyzing suspected errors. The X-ray crystal structure of an insertion mutant of Staphylococcal nuclease has been solved to 1.67 Å resolution and refined to a crystallographic R value of 0.170. A single residue has been inserted in the C-terminal α helix. The inserted amino acid was believed to be an alanine residue, but the final electron density maps strongly indicated that a glycine had been inserted instead. To confirm the observations from the X-ray data, matrix-assisted laser desorption mass spectrometry was employed to verify the glycine insertion. This mass spectrometric technique has sufficient mass accuracy to detect the methyl group that distinguishes glycine from alanine and can be extended to the more common situation in which crystallographic measurements suggest a problem with the sequence, but cannot pinpoint its location or nature. (orig.)
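The size of the mass difference that has to be resolved can be estimated from standard residue masses. This is a rough sketch, not the authors' procedure: the residue masses are standard monoisotopic values, while `ASSUMED_PROTEIN_MASS` is a hypothetical placeholder chosen only to illustrate the relative accuracy needed, since the abstract does not state the mass of the mutant protein.

```python
# Standard monoisotopic residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.02146,  # glycine
    "A": 71.03711,  # alanine
}

# Hypothetical protein mass in daltons; an assumption for illustration only.
ASSUMED_PROTEIN_MASS = 17000.0


def insertion_mass_difference(expected, observed):
    """Mass difference (Da) between inserting `expected` and inserting `observed`."""
    return RESIDUE_MASS[expected] - RESIDUE_MASS[observed]


delta = insertion_mass_difference("A", "G")
relative = delta / ASSUMED_PROTEIN_MASS

print(f"Ala minus Gly residue mass: {delta:.5f} Da")
print(f"Relative to the assumed protein mass: {relative:.2e}")
```

The difference corresponds to one CH2 unit, because the alanine side chain is a methyl group where glycine has a hydrogen. Distinguishing the two insertions therefore requires a mass measurement accurate to well under 14 Da on the intact protein.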
Resolution of a protein sequence ambiguity by X-ray crystallographic and mass spectrometric methods

Energy Technology Data Exchange (ETDEWEB)

Keefe, L.J.; Lattman, E.E. (Dept. of Biophysics and Biophysical Chemistry, Johns Hopkins Univ. School of Medicine, Baltimore, MD (United States)); Wolkow, C.; Woods, A.; Chevrier, M.; Cotter, R.J. (Middle Atlantic Mass Spectrometry Lab., Johns Hopkins Univ. School of Medicine, Baltimore, MD (United States))

1992-04-01

Ambiguities in amino acid sequences are a potential problem in X-ray crystallographic studies of proteins. Amino acid side chains often cannot be reliably identified from the electron density. Many protein crystal structures that are now being solved are simple variants of a known wild-type structure. Thus, cloning artifacts or other untoward events can readily lead to cases in which the proposed sequence is not correct. An example is presented showing that mass spectrometry provides an excellent tool for analyzing suspected errors. The X-ray crystal structure of an insertion mutant of Staphylococcal nuclease has been solved to 1.67 Å resolution and refined to a crystallographic R value of 0.170. A single residue has been inserted in the C-terminal α helix. The inserted amino acid was believed to be an alanine residue, but the final electron density maps strongly indicated that a glycine had been inserted instead. To confirm the observations from the X-ray data, matrix-assisted laser desorption mass spectrometry was employed to verify the glycine insertion. This mass spectrometric technique has sufficient mass accuracy to detect the methyl group that distinguishes glycine from alanine and can be extended to the more common situation in which crystallographic measurements suggest a problem with the sequence, but cannot pinpoint its location or nature. (orig.).
A method for selecting cis-acting regulatory sequences that respond to small molecule effectors

Directory of Open Access Journals (Sweden)

Allas Ülar

2010-08-01

Background: Several cis-acting regulatory sequences functioning at the level of mRNA or nascent peptide and specifically influencing transcription or translation have been described. These regulatory elements often respond to specific chemicals. Results: We have developed a method that allows us to select cis-acting regulatory sequences that respond to diverse chemicals. The method is based on the β-lactamase gene containing a random sequence inserted into the beginning of the ORF. Several rounds of selection are used to isolate sequences that suppress β-lactamase expression in response to the compound under study. We have isolated sequences that respond to erythromycin, troleandomycin, chloramphenicol, meta-toluate and homoserine lactone. By introducing synonymous and non-synonymous mutations we have shown that, at least in the case of erythromycin, the sequences act at the peptide level. We have also tested the cross-activities of the constructs and found that in most cases the sequences respond most strongly to the compound on which they were isolated. Conclusions: Several selected peptides showed ligand-specific changes in amino acid frequencies, but no consensus motif could be identified. This is consistent with previous observations on natural cis-acting peptides, showing that it is often impossible to demonstrate a consensus. Applying the currently developed method on a larger scale, by selecting and comparing an extended set of sequences, might allow the sequence rules underlying the activity of cis-acting regulatory peptides to be identified.
Molecular cloning and sequence of cDNA encoding the plasma membrane proton pump (H+-ATPase) of Arabidopsis thaliana

International Nuclear Information System (INIS)

Harper, J.F.; Surowy, T.K.; Sussman, M.R.

1989-01-01

In plants, the transport of solutes across the plasma membrane is driven by a proton pump (H+-ATPase) that produces an electric potential and pH gradient. The authors isolated and sequenced a full-length cDNA clone that encodes this enzyme in Arabidopsis thaliana. The protein predicted from its nucleotide sequence contains 959 amino acids and has a molecular mass of 104,207 Da. The plant protein shows structural features common to a family of cation-translocating ATPases found in the plasma membrane of prokaryotic and eukaryotic cells, with the greatest overall identity in amino acid sequence (36%) to the H+-ATPase observed in the plasma membrane of fungi. The structure predicted from a hydropathy plot contains at least eight transmembrane segments, with most of the protein (73%) extending into the cytoplasm and only 5% of the residues exposed on the external surface. Unique features of the plant enzyme include diverged sequences at the amino and carboxyl termini as well as greater hydrophilic character in three extracellular loops.
Identification of Meconopsis species by a DNA barcode sequence ...

African Journals Online (AJOL)

Deoxyribonucleic acid (DNA) barcoding is a novel technology that uses a standard DNA sequence to facilitate species identification. Species identification is necessary for the authentication of traditional plant based medicines. Although a consensus has not been agreed regarding which DNA sequences can be used as ...
Cloning, sequencing, and expression of cDNA for human β-glucuronidase

International Nuclear Information System (INIS)

Oshima, A.; Kyle, J.W.; Miller, R.D.

1987-01-01

The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH2-terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.
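One way to read the 153-base-pair deletion is to check whether it preserves the reading frame, which would be consistent with both clones yielding an immunoprecipitable protein. The sketch below does only that arithmetic; the function name is hypothetical and nothing here is taken from the authors' analysis.

```python
DELETION_BP = 153       # size of the deleted segment quoted in the abstract
SIGNAL_PEPTIDE_AA = 22  # inferred cleaved signal sequence length


def deletion_effect(deleted_bp):
    """Report whether a deletion preserves the reading frame and how many codons it spans."""
    codons, remainder = divmod(deleted_bp, 3)
    return {
        "in_frame": remainder == 0,
        "codons_removed": codons if remainder == 0 else None,
    }


effect = deletion_effect(DELETION_BP)
first_mature_residue = SIGNAL_PEPTIDE_AA + 1

print(effect)
print("Mature protein begins at residue", first_mature_residue)
```

Because 153 is a multiple of three, the shorter mRNA would be expected to encode a protein lacking a stretch of 51 codons without a downstream frameshift.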

Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-D-Glutamic Acid Anthrax Capsule

KAUST Repository

Stabler, R. A.

2013-01-24

A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.
Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule.

Science.gov (United States)

Stabler, Richard A; Negus, David; Pain, Arnab; Taylor, Peter W

2013-01-01

A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.
Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-D-Glutamic Acid Anthrax Capsule

KAUST Repository

Stabler, R. A.; Negus, D.; Pain, Arnab; Taylor, P. W.

2013-01-01

A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.
Complete primary structure of a Lolium perenne (perennial rye grass) pollen allergen, Lol p III: comparison with known Lol p I and II sequences.

Science.gov (United States)

Ansari, A A; Shenbagamurthi, P; Marsh, D G

1989-10-17

The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p III, determined by the automated Edman degradation of the protein and its selected fragments, is reported in this paper. Cleavage by enzymatic and chemical techniques established unambiguously the sequence for this 97-residue protein (Mr = 10,909), which lacks cysteine and shows no evidence of glycosylation. The sequence of Lol p III is very similar to that of another L. perenne allergen, Lol p II, which was sequenced recently; of the 97 positions in the two proteins, 57 are occupied by identical amino acids (59% identity). In addition, both allergens share a similar structure with an antibody-binding fragment of a third L. perenne allergen, Lol p I. Since human antibody responsiveness to all three of these allergens is associated with HLA-DR3, and since the structure common to the three molecules shows high degrees of amphipathicity in Lol p II and III, we speculate that this common segment in the three molecules might contain or contribute to the respective Ia/T-cell sites.
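The identity figure quoted above follows directly from the counts in the abstract. The sketch below shows the calculation for a pair of aligned sequences; the two short peptide strings are made-up toy inputs rather than Lol p II or Lol p III sequence, and counting a gap column as a mismatch is an assumption of this sketch.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns with identical residues (gaps count as mismatches)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Counts quoted in the abstract: 57 identical positions out of 97.
reported_identity = 100.0 * 57 / 97
print(f"Identity from the reported counts: {reported_identity:.1f}%")

# Toy aligned peptides (hypothetical), differing at one of six positions.
toy_identity = percent_identity("ACDEFG", "ACDQFG")
print(f"Toy example identity: {toy_identity:.1f}%")
```

Rounded to a whole number, the reported counts give the 59% stated in the abstract.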
Complete nucleotide sequence of Alfalfa mosaic virus isolated from alfalfa (Medicago sativa L.) in Argentina.

Science.gov (United States)

Trucco, Verónica; de Breuil, Soledad; Bejerman, Nicolás; Lenardon, Sergio; Giolitti, Fabián

2014-06-01

The complete nucleotide sequence of an Alfalfa mosaic virus (AMV) isolate infecting alfalfa (Medicago sativa L.) in Argentina, AMV-Arg, was determined. The virus genome has the typical organization described for AMV, and comprises 3,643, 2,593, and 2,038 nucleotides for RNA1, 2 and 3, respectively. The whole genome sequence and each encoding region were compared with those of four other isolates that have been completely sequenced from China, Italy, Spain and USA. The nucleotide identity percentages ranged from 95.9 to 99.1 % for the three RNAs and from 93.7 to 99 % for the protein 1 (P1), protein 2 (P2), movement protein and coat protein (CP) encoding regions, whereas the amino acid identity percentages of these proteins ranged from 93.4 to 99.5 %, the lowest value corresponding to P2. CP sequences of AMV-Arg were compared with those of 25 other available isolates, and a phylogenetic analysis based on the CP gene was carried out. The highest percentage of nucleotide sequence identity of the CP gene was 98.3 % with a Chinese isolate and 98.6 % at the amino acid level with four isolates, two from Italy, one from Brazil and the remaining one from China. The phylogenetic analysis showed that AMV-Arg is closely related to subgroup I of AMV isolates. To our knowledge, this is the first report of a complete nucleotide sequence of AMV from South America and the first worldwide report of a complete nucleotide sequence of AMV isolated from alfalfa as natural host.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.

Science.gov (United States)

Wyszyńska-Koko, J; Kurył, J

2004-01-01

The MYF6 gene codes for a bHLH transcription factor belonging to the MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embryogenesis and continues at a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows high conservation among species, with 99 and 97% identity when comparing pig with cow and human, respectively, and 93% when comparing pig with mouse and rat. A single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be a T → C transition recognized by the MspI restriction enzyme.
FASH: A web application for nucleotides sequence search

Directory of Open Access Journals (Sweden)

Chew Paul

2008-05-01

Abstract: FASH (Fourier Alignment Sequence Heuristics) is a web application, based on the Fast Fourier Transform, for finding remote homologs within a long nucleic acid sequence. Given a query sequence and a long text-sequence (e.g., the human genome), FASH detects subsequences within the text that are remotely similar to the query. FASH offers an alternative approach to Blast/Fasta for querying long RNA/DNA sequences. FASH differs from these other approaches in that it does not depend on the existence of contiguous seed-sequences in its initial detection phase. The FASH web server is user friendly and very easy to operate. Availability: FASH can be accessed at https://fash.bgu.ac.il:8443/fash/default.jsp (secured website).
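The abstract above does not describe FASH's algorithm beyond saying it is based on the Fast Fourier Transform and needs no contiguous seeds. As a hedged illustration of that general idea only, the sketch below counts per-shift nucleotide matches between a query and a text by FFT cross-correlation of per-letter indicator vectors. The names `fft` and `match_counts` and the toy sequences are assumptions made for illustration; this is not FASH's implementation.

```python
import cmath


def fft(values, invert=False):
    """Recursive radix-2 Cooley-Tukey FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return list(values)
    even = fft(values[0::2], invert)
    odd = fft(values[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def match_counts(text, query, alphabet="ACGT"):
    """Return counts where counts[s] = number of i with query[i] == text[s + i]."""
    n_shifts = len(text) - len(query) + 1
    size = 1
    while size < len(text) + len(query):  # room for the full linear convolution
        size *= 2
    totals = [0.0] * size
    for letter in alphabet:
        t = [1.0 if c == letter else 0.0 for c in text]
        t += [0.0] * (size - len(text))
        # Reversing the query turns convolution into cross-correlation.
        q = [1.0 if c == letter else 0.0 for c in reversed(query)]
        q += [0.0] * (size - len(query))
        product = [a * b for a, b in zip(fft(t), fft(q))]
        conv = fft(product, invert=True)
        for i in range(size):
            totals[i] += conv[i].real / size  # 1/size normalizes the inverse FFT
    offset = len(query) - 1  # index of shift 0 in the convolution output
    return [round(totals[s + offset]) for s in range(n_shifts)]


# Toy sequences (made up for illustration).
print(match_counts("ACGTACGTTACG", "ACGT"))  # [4, 0, 0, 0, 4, 1, 0, 0, 0]
```

Because every shift is scored at once, no exact seed word has to be shared between query and text, which is the property the abstract contrasts with Blast/Fasta.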
Functional identification and regulatory analysis of Δ6-fatty acid desaturase from the oleaginous fungus Mucor sp. EIM-10.

Science.gov (United States)

Jiang, Xianzhang; Liu, Hongjiao; Niu, Yongchao; Qi, Feng; Zhang, Mingliang; Huang, Jianzhong

2017-03-01

To enlarge the diversity of the desaturases associated with PUFA biosynthesis and to better understand the transcriptional regulation of desaturases, a Δ6-desaturase gene (Md6) from Mucor sp. and its 5'-upstream sequence was functionally identified in Saccharomyces cerevisiae. Expression of the Δ6-fatty acid desaturase (Md6) in S. cerevisiae showed that Md6 could convert linolenic acid to γ-linolenic acid. Computational analysis of the promoter of Md6 suggested it contains several eukaryotic fundamental transcription regulatory elements. In vivo functional analysis of the promoter showed the 5'-upstream sequence of Md6 could initiate expression of GFP and Md6 itself in S. cerevisiae. A series deletion analysis of the promoter suggested that the sequence between -919 and -784 bp (relative to the start site), named eMd6, is the key factor for high activity of Δ6-desaturase. The activity of Δ6-desaturase was increased by 2.8-fold and 2.5-fold when the eMd6 sequence was placed upstream of -434 in forward or reverse orientation, respectively. To the best of our knowledge, the native promoter of Md6 from Mucor is the strongest promoter for Δ6-desaturase reported so far, and the sequence between -919 and -784 bp is an enhancer for Δ6-desaturase activity.
Biodegradation of clofibric acid and identification of its metabolites.

Science.gov (United States)

Salgado, R; Oehmen, A; Carvalho, G; Noronha, J P; Reis, M A M

2012-11-30

Clofibric acid (CLF) is the pharmaceutically active metabolite of lipid regulators clofibrate, etofibrate and etofyllinclofibrate, and it is considered both environmentally persistent and refractory. This work studied the biotransformation of CLF in aerobic sequencing batch reactors (SBRs) with mixed microbial cultures, monitoring the efficiency of biotransformation of CLF and the production of metabolites. The maximum removal achieved was 51% biodegradation (initial CLF concentration = 2 mg L⁻¹), where adsorption and abiotic removal mechanisms were shown to be negligible, showing that CLF is indeed biodegradable. Tests showed that the observed CLF biodegradation was mainly carried out by heterotrophic bacteria. Three main metabolites were identified, including α-hydroxyisobutyric acid, lactic acid and 4-chlorophenol. The latter is known to exhibit higher toxicity than the parent compound, but it did not accumulate in the SBRs. α-Hydroxyisobutyric acid and lactic acid accumulated for a period, where nitrite accumulation may have been responsible for inhibiting their degradation. A metabolic pathway for the biodegradation of CLF is proposed in this study.
Open questions in origin of life : Experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

NARCIS (Netherlands)

Adamala, K.; Anella, F.M.; Wieczorek, R.; Stano, P.; Chiarabelli, C.; Luisi, P.L.

2014-01-01

In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the
CISAPS: Complex Informational Spectrum for the Analysis of Protein Sequences

Directory of Open Access Journals (Sweden)

Charalambos Chrysostomou

2015-01-01

Complex informational spectrum analysis for protein sequences (CISAPS) and its web-based server are developed and presented. As recent studies show, using only the absolute spectrum in the informational spectrum analysis of protein sequences has proven insufficient. Therefore, CISAPS is developed to consider and provide results in three forms: the absolute, real, and imaginary spectrum. Biologically related features, shown here in a case study on the analysis of influenza A subtypes, can also appear individually in either the real or the imaginary spectrum. As the results show, protein classes can present similarities or differences according to the features extracted from the CISAPS web server. These associations are likely to be related to the protein feature that the specific amino acid index represents. In addition, various technical issues such as zero-padding and windowing that may affect the analysis are also addressed. CISAPS uses an expanded list of 611 unique amino acid indices, each representing a different property, to perform the analysis. This web-based server enables researchers with little knowledge of signal processing methods to apply complex informational spectrum analysis in their work.
Amyloid fibril formation from sequences of a natural beta-structured fibrous protein, the adenovirus fiber.

Science.gov (United States)

Papanikolopoulou, Katerina; Schoehn, Guy; Forge, Vincent; Forsyth, V Trevor; Riekel, Christian; Hernandez, Jean-François; Ruigrok, Rob W H; Mitraki, Anna

2005-01-28

Amyloid fibrils are fibrous beta-structures that derive from abnormal folding and assembly of peptides and proteins. Despite a wealth of structural studies on amyloids, the nature of the amyloid structure remains elusive; possible connections to natural, beta-structured fibrous motifs have been suggested. In this work we focus on understanding amyloid structure and formation from sequences of a natural, beta-structured fibrous protein. We show that short peptides (25 to 6 amino acids) corresponding to repetitive sequences from the adenovirus fiber shaft have an intrinsic capacity to form amyloid fibrils as judged by electron microscopy, Congo Red binding, infrared spectroscopy, and x-ray fiber diffraction. In the presence of the globular C-terminal domain of the protein that acts as a trimerization motif, the shaft sequences adopt a triple-stranded, beta-fibrous motif. We discuss the possible structure and arrangement of these sequences within the amyloid fibril, as compared with the one adopted within the native structure. A 6-amino acid peptide, corresponding to the last beta-strand of the shaft, was found to be sufficient to form amyloid fibrils. Structural analysis of these amyloid fibrils suggests that perpendicular stacking of beta-strand repeat units is an underlying common feature of amyloid formation.
Identification of Biomolecular Building Blocks by Recognition Tunneling: Stride towards Nanopore Sequencing of Biomolecules

Science.gov (United States)

Sen, Suman

DNA, RNA and protein are three pivotal biomolecules in humans and other organisms, playing decisive roles in functionality, appearance, disease development and other physiological phenomena. Hence, sequencing of these biomolecules is of prime interest to the scientific community. Single-molecule identification of their building blocks can be done by a technique called Recognition Tunneling (RT), based on the Scanning Tunneling Microscope (STM). A single layer of specially designed recognition molecules is attached to the STM electrodes, which trap the targeted molecules (DNA nucleoside monophosphates, RNA nucleoside monophosphates or amino acids) inside the STM nanogap. Depending on their different binding interactions with the recognition molecules, the analyte molecules generate stochastic signal trains carrying their "electronic fingerprints". Signal features are used to detect the molecules using a machine learning algorithm, and different molecules can be identified with significantly high accuracy. This, in turn, paves the way for a rapid, economical nanopore sequencing platform, overcoming the drawbacks of Next Generation Sequencing (NGS) techniques. To read DNA nucleotides with high accuracy in an STM tunnel junction, a series of nitrogen-based heterocycles were designed and examined to check their capabilities to interact with naturally occurring DNA nucleotides by hydrogen bonding in the tunnel junction. These recognition molecules are Benzimidazole, Imidazole, Triazole and Pyrrole. Benzimidazole proved to be the best among them, showing DNA nucleotide classification accuracy close to 99%. Also, the Imidazole reader can read an abasic monophosphate (AP), a product of depurination or depyrimidination that occurs 10,000 times per human cell per day. In another study, I have investigated a new universal reader, 1-(2-mercaptoethyl)pyrene (Pyrene reader), based on stacking interactions, which should be more specific to the canonical DNA nucleosides. In addition
Biophysical and structural considerations for protein sequence evolution

Directory of Open Access Journals (Sweden)

Grahnen Johan A

2011-12-01

Abstract. Background: Protein sequence evolution is constrained by the biophysics of folding and function, causing interdependence between interacting sites in the sequence. However, current site-independent models of sequence evolution do not take this into account. Recent attempts to integrate the influence of structure and biophysics into phylogenetic models via statistical/informational approaches have not resulted in expected improvements in model performance. This suggests that further innovations are needed for progress in this field. Results: Here we develop a coarse-grained physics-based model of protein folding and binding function, and compare it to a popular informational model. We find that both models violate the assumption of the native sequence being close to a thermodynamic optimum, causing directional selection away from the native state. Sampling and simulation show that the physics-based model is more specific for fold-defining interactions that vary less among residue type. The informational model diffuses further in sequence space with fewer barriers and tends to provide less support for an invariant sites model, although amino acid substitutions are generally conservative. Both approaches produce sequences with natural features like dN/dS Conclusions: Simple coarse-grained models of protein folding can describe some natural features of evolving proteins but are currently not accurate enough to use in evolutionary inference. This is partly due to improper packing of the hydrophobic core. We suggest possible improvements on the representation of structure, folding energy, and binding function, as regards both native and non-native conformations, and describe a large number of possible applications for such a model.
Characterization of the HLA-DRβ1 third hypervariable region amino acid sequence according to charge and parental inheritance in systemic sclerosis.

Science.gov (United States)

Gentil, Coline A; Gammill, Hilary S; Luu, Christine T; Mayes, Maureen D; Furst, Dan E; Nelson, J Lee

2017-03-07

Specific HLA class II alleles are associated with systemic sclerosis (SSc) risk, clinical characteristics, and autoantibodies. HLA nomenclature initially developed with antibodies as typing reagents defining DRB1 allele groups. However, alleles from different DRB1 allele groups encode the same third hypervariable region (3rd HVR) sequence, the primary T-cell recognition site, and 3rd HVR charge differences can affect interactions with T cells. We considered 3rd HVR sequences (amino acids 67-74) irrespective of the allele group and analyzed parental inheritance considered according to the 3rd HVR charge, comparing SSc patients with controls. In total, 306 families (121 SSc and 185 controls) were HLA genotyped and parental HLA-haplotype origin was determined. Analysis was conducted according to DRβ1 3rd HVR sequence, charge, and parental inheritance. The distribution of 3rd HVR sequences differed in SSc patients versus controls (p = 0.007), primarily due to an increase of specific DRB1*11 alleles, in accord with previous observations. The 3rd HVR sequences were next analyzed according to charge and parental inheritance. Paternal transmission of DRB1 alleles encoding a +2 charge 3rd HVR was significantly reduced in SSc patients compared with maternal transmission (p = 0.0003, corrected for analysis of four charge categories p = 0.001). To a lesser extent, paternal transmission was increased when charge was 0 (p = 0.021, corrected for multiple comparisons p = 0.084). In contrast, paternal versus maternal inheritance was similar in controls. SSc patients differed from controls when DRB1 alleles were categorized according to 3rd HVR sequences. Skewed parental inheritance was observed in SSc patients but not in controls when the DRβ1 3rd HVR was considered according to charge. These observations suggest that epigenetic modulation of HLA merits investigation in SSc.
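The abstract above groups DRβ1 alleles by the net charge of the third hypervariable region (amino acids 67-74). A minimal sketch of that kind of bookkeeping follows. The charge convention (K and R count +1, D and E count -1, everything else including H counts 0) is an assumption, since the abstract does not state the authors' rule; the function names and the octamers are hypothetical toy inputs, not statements about any particular allele.

```python
# Assumed convention at neutral pH: K, R = +1; D, E = -1; all other residues = 0.
RESIDUE_CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def hvr3(mature_chain: str) -> str:
    """Residues 67-74 (1-based, inclusive) of a mature chain sequence."""
    if len(mature_chain) < 74:
        raise ValueError("sequence is shorter than 74 residues")
    return mature_chain[66:74]


def net_charge(segment: str) -> int:
    """Sum of per-residue charges under the assumed convention."""
    return sum(RESIDUE_CHARGE.get(residue, 0) for residue in segment.upper())


# Toy octamers (hypothetical), chosen to land in different charge categories.
for segment in ["LLEQKRKA", "LLEQKRAA", "FLEDRRAA", "ILEDERAA"]:
    print(segment, net_charge(segment))
# LLEQKRKA 2
# LLEQKRAA 1
# FLEDRRAA 0
# ILEDERAA -2
```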
Human thyroid peroxidase: complete cDNA and protein sequence, chromosome mapping, and identification of two alternately spliced mRNAs

International Nuclear Information System (INIS)

Kimura, S.; Kotani, T.; McBride, O.W.; Umeki, K.; Hirai, K.; Nakayama, T.; Ohtaki, S.

1987-01-01

Two forms of human thyroid peroxidase cDNAs were isolated from a λgt11 cDNA library, prepared from Graves disease thyroid tissue mRNA, by use of oligonucleotides. The longest complete cDNA, designated phTPO-1, has 3048 nucleotides and an open reading frame consisting of 933 amino acids, which would encode a protein with a molecular weight of 103,026. Five potential asparagine-linked glycosylation sites are found in the deduced amino acid sequence. The second peroxidase cDNA, designated phTPO-2, is almost identical to phTPO-1 beginning 605 base pairs downstream except that it contains a 1-base-pair difference and lacks 171 base pairs in the middle of the sequence. This results in a loss of 57 amino acids corresponding to a molecular weight of 6282. Interestingly, this 171-nucleotide sequence has GT and AG at its 5' and 3' boundaries, respectively, that are in good agreement with donor and acceptor splice site consensus sequences. Using specific oligonucleotide probes for the mRNAs derived from the cDNA sequences hTPO-1 and hTPO-2, the authors show that both are expressed in all thyroid tissues examined and the relative level of two mRNAs is different in each sample. The results suggest that two thyroid peroxidase proteins might be generated through alternate splicing of the same gene. By using somatic cell hybrid lines, the thyroid peroxidase gene was mapped to the short arm of human chromosome 2
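The numbers in the abstract above are internally consistent, which a short arithmetic check makes explicit: a 171-base-pair deletion is a whole number of codons, so it removes 57 residues without shifting the reading frame. This is an illustrative sketch only; the helper name `residues_lost` is hypothetical, and the per-residue mass is simply the quoted 6282 divided by 57.

```python
CODON_LENGTH = 3  # nucleotides per amino acid


def residues_lost(deleted_bp: int) -> int:
    """Amino acids removed by an in-frame deletion of `deleted_bp` base pairs."""
    if deleted_bp % CODON_LENGTH != 0:
        raise ValueError("not a multiple of 3: the reading frame would shift")
    return deleted_bp // CODON_LENGTH


lost = residues_lost(171)
print(lost)  # 57

# Average mass per lost residue, from the 6282 quoted in the abstract.
print(round(6282 / lost, 1))  # 110.2

# Minimum coding length for a 933-residue ORF plus one stop codon;
# it fits inside the 3048-nucleotide cDNA.
print((933 + 1) * CODON_LENGTH)  # 2802
```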
A newly constructed primer pair for the PCR amplification, cloning and sequencing of the flagellin (flaA) gene from isolates of urease-negative Campylobacter lari.

Science.gov (United States)

Sekizuka, Tsuyoshi; Yokoi, Taeko; Murayama, Ohoshi; Millar, B Cherie; Moore, Johne; Matsuda, Motoo

2005-08-01

A newly constructed primer pair (lari-Af/lari-Ar) designed to generate a product of the flagellin (flaA) gene for urease-negative Campylobacter lari produced a PCR amplicon of about 1700 bp for 16 isolates from 7 seagulls, 5 humans, 3 food animals and one mussel in Japan and Northern Ireland. Nucleotide sequencing and alignments of the flaA amplicons from these isolates demonstrated that the deduced amino acid sequences of the possible open reading frame were 564-572 amino acid residues in length with calculated molecular weights of 58,804 to 59,463. The deduced amino acid sequence similarity analysis strongly suggested that the ORF of the flaA from the 16 isolates showed 70-75% sequence similarities to those of Campylobacter jejuni isolates. The approximate Mr of the flagellin purified from some of the isolates of urease-negative C. lari was estimated to range from 59.6 to 61.8 kDa. Thus, flagellin from the isolates of urease-negative C. lari was shown for the first time to have a molecular size similar to those of C. jejuni and Campylobacter coli isolates, but to be different from the shorter flaA and smaller flagellin of urease-positive thermophilic Campylobacter (UPTC) isolates. Flagellins from C. lari spp., consisting of the two representative taxa of urease-negative C. lari and UPTC, thus show genotypic and phenotypic diversity.
Differential Gene Expression of Longan Under Simulated Acid Rain Stress.

Science.gov (United States)

Zheng, Shan; Pan, Tengfei; Ma, Cuilan; Qiu, Dongliang

2017-05-01

Differential gene expression profile was studied in Dimocarpus longan Lour. in response to treatments of simulated acid rain with pH 2.5, 3.5, and a control (pH 5.6) using differential display reverse transcription polymerase chain reaction (DDRT-PCR). Results showed that mRNA differential display conditions were optimized to find an expressed sequence tag (EST) related with acid rain stress. The potential encoding products had 80% similarity with a transcription initiation factor IIF of Gossypium raimondii and 81% similarity with a protein product of Theobroma cacao. This fragment is the transcription factor activated by second messenger substances in longan leaves after signal perception of acid rain.
Optimization of a sequence of reactors

DEFF Research Database (Denmark)

Vidal, Rene Victor Valqui

1991-01-01

Concerns the optimal production of sulphuric acid in a sequence of reactors. Using a suitable approximation to the objective function, this problem can easily be solved using the maximum principle. A numerical example documents the applicability of the suggested approach...
Prospects for Fungal Bioremediation of Acidic Radioactive Waste Sites: Characterization and Genome Sequence of Rhodotorula taiwanensis MD1149.

Science.gov (United States)

Tkavc, Rok; Matrosova, Vera Y; Grichenko, Olga E; Gostinčar, Cene; Volpe, Robert P; Klimenkova, Polina; Gaidamakova, Elena K; Zhou, Carol E; Stewart, Benjamin J; Lyman, Mathew G; Malfatti, Stephanie A; Rubinfeld, Bonnee; Courtot, Melanie; Singh, Jatinder; Dalgard, Clifton L; Hamilton, Theron; Frey, Kenneth G; Gunde-Cimerman, Nina; Dugan, Lawrence; Daly, Michael J

2017-01-01

Highly concentrated radionuclide waste produced during the Cold War era is stored at US Department of Energy (DOE) production sites. This radioactive waste was often highly acidic and mixed with heavy metals, and has been leaking into the environment since the 1950s. Because of the danger and expense of cleanup of such radioactive sites by physicochemical processes, in situ bioremediation methods are being developed for cleanup of contaminated ground and groundwater. To date, the most developed microbial treatment proposed for high-level radioactive sites employs the radiation-resistant bacterium Deinococcus radiodurans. However, the use of Deinococcus spp. and other bacteria is limited by their sensitivity to low pH. We report the characterization of 27 diverse environmental yeasts for their resistance to ionizing radiation (chronic and acute), heavy metals, pH minima, temperature maxima and optima, and their ability to form biofilms. Remarkably, many yeasts are extremely resistant to ionizing radiation and heavy metals. They also excrete carboxylic acids and are exceptionally tolerant to low pH. A special focus is placed on Rhodotorula taiwanensis MD1149, which was the most resistant to acid and gamma radiation. MD1149 is capable of growing under 66 Gy/h at pH 2.3 and in the presence of high concentrations of mercury and chromium compounds, and forming biofilms under high-level chronic radiation and low pH. We present the whole genome sequence and annotation of R. taiwanensis strain MD1149, with a comparison to other Rhodotorula species. This survey elevates yeasts to the frontier of biology's most radiation-resistant representatives, presenting a strong rationale for a role of fungi in bioremediation of acidic radioactive waste sites.

Biodegradation of clofibric acid and identification of its metabolites

Energy Technology Data Exchange (ETDEWEB)

Salgado, R. [REQUIMTE/CQFB, Chemistry Department, FCT, Universidade Nova de Lisboa, 2829-516 Caparica (Portugal); ESTS-IPS, Escola Superior de Tecnologia de Setubal do Instituto Politecnico de Setubal, Rua Vale de Chaves, Campus do IPS, Estefanilha, 2910-761 Setubal (Portugal); Oehmen, A. [REQUIMTE/CQFB, Chemistry Department, FCT, Universidade Nova de Lisboa, 2829-516 Caparica (Portugal); Carvalho, G. [REQUIMTE/CQFB, Chemistry Department, FCT, Universidade Nova de Lisboa, 2829-516 Caparica (Portugal); Instituto de Biologia Experimental e Tecnologica (IBET), Av. da Republica (EAN), 2784-505 Oeiras (Portugal); Noronha, J.P. [REQUIMTE/CQFB, Chemistry Department, FCT, Universidade Nova de Lisboa, 2829-516 Caparica (Portugal); Reis, M.A.M., E-mail: amr@fct.unl.pt [REQUIMTE/CQFB, Chemistry Department, FCT, Universidade Nova de Lisboa, 2829-516 Caparica (Portugal)

2012-11-30

Graphical abstract: Metabolites produced during clofibric acid biodegradation. Highlights: ► Clofibric acid is biodegradable. ► Mainly heterotrophic bacteria degraded the clofibric acid. ► Metabolites of clofibric acid biodegradation were identified. ► The metabolic pathway of clofibric acid biodegradation is proposed. Abstract: Clofibric acid (CLF) is the pharmaceutically active metabolite of lipid regulators clofibrate, etofibrate and etofyllinclofibrate, and it is considered both environmentally persistent and refractory. This work studied the biotransformation of CLF in aerobic sequencing batch reactors (SBRs) with mixed microbial cultures, monitoring the efficiency of biotransformation of CLF and the production of metabolites. The maximum removal achieved was 51% biodegradation (initial CLF concentration = 2 mg L⁻¹), where adsorption and abiotic removal mechanisms were shown to be negligible, showing that CLF is indeed biodegradable. Tests showed that the observed CLF biodegradation was mainly carried out by heterotrophic bacteria. Three main metabolites were identified, including α-hydroxyisobutyric acid, lactic acid and 4-chlorophenol. The latter is known to exhibit higher toxicity than the parent compound, but it did not accumulate in the SBRs. α-Hydroxyisobutyric acid and lactic acid accumulated for a period, where nitrite accumulation may have been responsible for inhibiting their degradation. A metabolic pathway for the biodegradation of CLF is proposed in this study.
Direct, rapid RNA sequence analysis

International Nuclear Information System (INIS)

Peattie, D.A.

1987-01-01

The original methods of RNA sequence analysis were based on enzymatic production and chromatographic separation of overlapping oligonucleotide fragments from within an RNA molecule followed by identification of the mononucleotides comprising the oligomer. Over the past decade the field of nucleic acid sequencing has changed dramatically, however, and RNA molecules now can be sequenced in a variety of more streamlined fashions. Most of the more recent advances in RNA sequencing have involved one-dimensional electrophoretic separation of 32P-end-labeled oligoribonucleotides on polyacrylamide gels. In this chapter the author discusses two of these methods for determining the nucleotide sequences of RNA molecules rapidly: the chemical method and the enzymatic method. Both methods are direct and degradative, i.e., they rely on fragmatic and chemical approaches should be utilized. The single-strand-specific ribonucleases (A, T1, T2, and S1) provide an efficient means to locate double-helical regions rapidly, and the chemical reactions provide a means to determine the RNA sequence within these regions. In addition, the chemical reactions allow one to assign interactions to specific atoms and to distinguish secondary interactions from tertiary ones. If the RNA molecule is small enough to be sequenced directly by the enzymatic or chemical method, the probing reactions can be done easily at the same time as sequencing reactions
Structural analysis of complementary DNA and amino acid sequences of human and rat androgen receptors

International Nuclear Information System (INIS)

Chang, C.; Kokontis, J.; Liao, S.

1988-01-01

Structural analysis of cDNAs for human and rat androgen receptors (ARs) indicates that the amino-terminal regions of ARs are rich in oligo- and poly(amino acid) motifs as in some homeotic genes. The human AR has a long stretch of repeated glycines, whereas rat AR has a long stretch of glutamines. There is a considerable sequence similarity among ARs and the receptors for glucocorticoids, progestins, and mineralocorticoids within the steroid-binding domains. The cysteine-rich DNA-binding domains are well conserved. Translation of mRNA transcribed from AR cDNAs yielded 94- and 76-kDa proteins and smaller forms that bind to DNA and have high affinity toward androgens. These rat or human ARs were recognized by human autoantibodies to natural ARs. Molecular hybridization studies, using AR cDNAs as probes, indicated that the ventral prostate and other male accessory organs are rich in AR mRNA and that the production of AR mRNA in the target organs may be autoregulated by androgens
Genetic variation assessment of acid lime accessions collected from south of Iran using SSR and ISSR molecular markers.

Science.gov (United States)

Sharafi, Ata Allah; Abkenar, Asad Asadi; Sharafi, Ali; Masaeli, Mohammad

2016-01-01

Iran has a long history of acid lime cultivation and propagation. In this study, genetic variation in 28 acid lime accessions from five regions in the south of Iran, and their relatedness to 19 other citrus cultivars, were analyzed using Simple Sequence Repeat (SSR) and Inter-Simple Sequence Repeat (ISSR) molecular markers. Nine SSR primers and nine ISSR primers were used for allele scoring. In total, 49 SSR and 131 ISSR polymorphic alleles were detected. Cluster analysis of SSR and ISSR data showed that most of the acid lime accessions (19 genotypes) have a hybrid origin and are genetically distant from nucellar Mexican lime (9 genotypes). As nucellar Mexican lime is susceptible to phytoplasma, these acid lime genotypes can be used to evaluate their tolerance against biotic constraints like lime "witches' broom disease".
Draft genome sequence of the silver pomfret fish, Pampus argenteus.

Science.gov (United States)

AlMomin, Sabah; Kumar, Vinod; Al-Amad, Sami; Al-Hussaini, Mohsen; Dashti, Talal; Al-Enezi, Khaznah; Akbar, Abrar

2016-01-01

Silver pomfret, Pampus argenteus, is a fish species from coastal waters. Despite its high commercial value, this edible fish has not been sequenced. Hence, its genetic and genomic studies have been limited. We report the first draft genome sequence of the silver pomfret obtained using a Next Generation Sequencing (NGS) technology. We assembled 38.7 Gb of nucleotides into scaffolds of 350 Mb with N50 of about 1.5 kb, using high quality paired end reads. These scaffolds represent 63.7% of the estimated silver pomfret genome length. The newly sequenced and assembled genome has 11.06% repetitive DNA regions, and this percentage is comparable to that of the tilapia genome. The genome analysis predicted 16 322 genes. About 91% of these genes showed homology with known proteins. Many gene clusters were annotated to protein and fatty-acid metabolism pathways that may be important in the context of the meat texture and immune system developmental processes. The reference genome can pave the way for the identification of many other genomic features that could improve breeding and population-management strategies, and it can also help characterize the genetic diversity of P. argenteus.
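The abstract above reports an N50 of about 1.5 kb. For readers unfamiliar with the statistic, here is a minimal sketch of the usual definition: the largest length L such that scaffolds of length at least L together cover at least half of the assembly. The function name `n50` and the toy scaffold lengths are assumptions for illustration, not data from the pomfret assembly. The last line derives the genome size implied by the two figures the abstract does give (350 Mb of scaffolds representing 63.7% of the estimated genome).

```python
def n50(lengths):
    """Largest length L such that sequences >= L cover at least half the total."""
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy scaffold lengths in bp (hypothetical, not from the pomfret assembly).
toy_scaffolds = [8000, 4000, 3000, 2000, 1500, 1000, 500]
print(n50(toy_scaffolds))  # 4000

# Estimated genome length implied by the abstract's figures, in Mb.
print(round(350 / 0.637))  # 549
```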
Increased mRNA expression of a laminin-binding protein in human colon carcinoma: Complete sequence of a full-length cDNA encoding the protein

International Nuclear Information System (INIS)

Yow, Hsiukang; Wong, Jau Min; Chen, Hai Shiene; Lee, C.; Steele, G.D. Jr.; Chen, Lanbo

1988-01-01

Reliable markers to distinguish human colon carcinoma from normal colonic epithelium are needed particularly for poorly differentiated tumors where no useful marker is currently available. To search for markers the authors constructed cDNA libraries from human colon carcinoma cell lines and screened for clones that hybridize to a greater degree with mRNAs of colon carcinomas than with their normal counterparts. Here they report one such cDNA clone that hybridizes with a 1.2-kilobase (kb) mRNA, the level of which is ∼9-fold greater in colon carcinoma than in adjacent normal colonic epithelium. Blot hybridization of total RNA from a variety of human colon carcinoma cell lines shows that the level of this 1.2-kb mRNA in poorly differentiated colon carcinomas is as high as or higher than that in well-differentiated carcinomas. Molecular cloning and complete sequencing of cDNA corresponding to the full-length open reading frame of this 1.2-kb mRNA unexpectedly show it to contain all the partial cDNA sequence encoding 135 amino acid residues previously reported for a human laminin receptor. The deduced amino acid sequence suggests that this putative laminin-binding protein from human colon carcinomas consists of 295 amino acid residues with interesting features. There is an unusual C-terminal 70-amino acid segment, which is trypsin-resistant and highly negatively charged
Mildly abnormal general movement quality in infants is associated with higher Mead acid and lower arachidonic acid and shows a U-shaped relation with the DHA/AA ratio

NARCIS (Netherlands)

van Goor, S. A.; Schaafsma, A.; Erwich, J. J. H. M.; Dijck-Brouwer, D. A. J.; Muskiet, F. A. J.

We showed that docosahexaenoic acid (DHA) supplementation during pregnancy and lactation was associated with more mildly abnormal (MA) general movements (GMs) in the infants. Since this finding was unexpected and inter-individual DHA intakes are highly variable, we explored the relationship between
Characterization of relative abundance of lactic acid bacteria species in French organic sourdough by cultural, qPCR and MiSeq high-throughput sequencing methods.

Science.gov (United States)

Michel, Elisa; Monfort, Clarisse; Deffrasnes, Marion; Guezenec, Stéphane; Lhomme, Emilie; Barret, Matthieu; Sicard, Delphine; Dousset, Xavier; Onno, Bernard

2016-12-19

In order to contribute to the description of sourdough LAB composition, MiSeq sequencing and qPCR methods were performed in association with cultural methods. A panel of 16 French organic bakers and farmer-bakers was selected for this work. The lactic acid bacteria (LAB) diversity of their organic sourdoughs was investigated quantitatively and qualitatively combining (i) Lactobacillus sanfranciscensis-specific qPCR, (ii) global sequencing with MiSeq Illumina technology and (iii) molecular isolates identification. In addition, LAB and yeast enumeration, pH, Total Titratable Acidity, organic acids and bread specific volume were analyzed. Microbial and physico-chemical data were statistically treated by Principal Component Analysis (PCA) and Hierarchical Ascendant Classification (HAC). Total yeast counts were 6 log10 to 7.6 log10 CFU/g while LAB counts varied from 7.2 log10 to 9.6 log10 CFU/g. Values obtained by L. sanfranciscensis-specific qPCR were estimated between 7.2 and 10.3 log10 CFU/g, except for one sample at 4.4 log10 CFU/g. HAC and PCA clustered the sixteen sourdoughs into three classes described by their variables but without links to bakers' practices. L. sanfranciscensis was the dominant species in 13 of the 16 sourdoughs analyzed by Next Generation Sequencing (NGS); by the culture-dependent method, this species was dominant in only 10 samples. Based on isolates identification, LAB diversity was higher for 7 sourdoughs with the recovery of L. curvatus, L. brevis, L. heilongjiangensis, L. xiangfangensis, L. koreensis, L. pontis, Weissella sp. and Pediococcus pentosaceus, as the most representative species. L. koreensis, L. heilongjiangensis and L. xiangfangensis were identified in traditional Asian food and here for the first time as dominant in organic sourdough. This study highlighted that L. sanfranciscensis was not the major species in 6/16 sourdough samples and that a relatively high LAB diversity can be observed in French organic
Cloning and sequencing of the casein kinase 2 alpha subunit from Zea mays

DEFF Research Database (Denmark)

Dobrowolska, G; Boldyreff, B; Issinger, O G

1991-01-01

The nucleotide sequence of the cDNA coding for the alpha subunit of casein kinase 2 of Zea mays has been determined. The cDNA clone contains an open reading frame of 996 nucleotides encoding a polypeptide comprising 332 amino acids. The primary amino acid sequence exhibits 75% identity to the alpha...... subunit and 71% identity to the alpha' subunit of human casein kinase 2....
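The figures in this abstract can be checked with two small calculations: an open reading frame of 996 nucleotides corresponds to 996 / 3 = 332 codons, and a percent identity is the share of aligned positions carrying the same residue. The sketch below is illustrative only. The function names and the toy aligned sequences are hypothetical, and identity is computed over gap-free columns, which is one of several conventions; the abstract does not say which denominator the authors used.

```python
def codon_count(orf_nucleotides):
    """Number of codons in an open reading frame of the given length."""
    if orf_nucleotides % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_nucleotides // 3


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free alignment columns with identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if x != gap and y != gap]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


print(codon_count(996))  # 332, matching the abstract

# Hypothetical toy alignment, not the maize or human CK2 sequences.
print(round(percent_identity("MKVT-AL", "MRVTGAL"), 1))  # 83.3 (5 of 6 columns)
```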
Amino Acid Transporters and Release of Hydrophobic Amino Acids in the Heterocyst-Forming Cyanobacterium Anabaena sp. Strain PCC 7120

Directory of Open Access Journals (Sweden)

Rafael Pernil

2015-04-01

Anabaena sp. strain PCC 7120 is a filamentous cyanobacterium that can use inorganic compounds such as nitrate or ammonium as nitrogen sources. In the absence of combined nitrogen, it can fix N2 in differentiated cells called heterocysts. Anabaena also shows substantial activities of amino acid uptake, and three ABC-type transporters for amino acids have been previously characterized. Seven new loci encoding predicted amino acid transporters were identified in the Anabaena genomic sequence and inactivated. Two of them were involved in amino acid uptake. Locus alr2535-alr2541 encodes the elements of a hydrophobic amino acid ABC-type transporter that is mainly involved in the uptake of glycine. ORF all0342 encodes a putative transporter from the dicarboxylate/amino acid:cation symporter (DAACS) family whose inactivation resulted in an increased uptake of a broad range of amino acids. An assay to study amino acid release from Anabaena filaments to the external medium was set up. Net release of the alanine analogue α-aminoisobutyric acid (AIB) was observed when transport system N-I (a hydrophobic amino acid ABC-type transporter) was engaged in the uptake of a specific substrate. The rate of AIB release was directly proportional to the intracellular AIB concentration, suggesting leakage from the cells by diffusion.
The computational linguistics of biological sequences

Energy Technology Data Exchange (ETDEWEB)

Searls, D. [Univ. of Pennsylvania, Philadelphia, PA (United States)

1995-12-31

This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. Protein sequences are analogous in many respects, particularly their folding behavior. Proteins have a much richer variety of interactions, but in theory the same linguistic principles could come to bear in describing dependencies between distant residues that arise by virtue of three-dimensional structure. This tutorial will concentrate on nucleic acid sequences.
Genomic organization, sequence characterization and expression analysis of Tenebrio molitor apolipophorin-III in response to an intracellular pathogen, Listeria monocytogenes.

Science.gov (United States)

Noh, Ju Young; Patnaik, Bharat Bhusan; Tindwa, Hamisi; Seo, Gi Won; Kim, Dong Hyun; Patnaik, Hongray Howrelia; Jo, Yong Hun; Lee, Yong Seok; Lee, Bok Luel; Kim, Nam Jung; Han, Yeon Soo

2014-01-25

Apolipophorin III (apoLp-III) is a well-known hemolymph protein having a functional role in lipid transport and immune response of insects. We cloned full-length cDNA encoding putative apoLp-III from larvae of the coleopteran beetle, Tenebrio molitor (TmapoLp-III), by identification of clones corresponding to the partial sequence of TmapoLp-III, subsequently followed with full length sequencing by a clone-by-clone primer walking method. The complete cDNA consists of 890 nucleotides, including an ORF encoding 196 amino acid residues. Excluding a putative signal peptide of the first 20 amino acid residues, the 176-residue mature apoLp-III has a calculated molecular mass of 19,146Da. Genomic sequence analysis with respect to its cDNA showed that TmapoLp-III was organized into four exons interrupted by three introns. Several immune-related transcription factor binding sites were discovered in the putative 5'-flanking region. BLAST and phylogenetic analyses reveal that TmapoLp-III has high sequence identity (88%) with Tribolium castaneum apoLp-III but shares little sequence homologies (molitor. Copyright © 2013 Elsevier B.V. All rights reserved.
Genome sequence of Aspergillus luchuensis NBRC 4314

Science.gov (United States)

Yamada, Osamu; Machida, Masayuki; Hosoyama, Akira; Goto, Masatoshi; Takahashi, Toru; Futagami, Taiki; Yamagata, Youhei; Takeuchi, Michio; Kobayashi, Tetsuo; Koike, Hideaki; Abe, Keietsu; Asai, Kiyoshi; Arita, Masanori; Fujita, Nobuyuki; Fukuda, Kazuro; Higa, Ken-ichi; Horikawa, Hiroshi; Ishikawa, Takeaki; Jinno, Koji; Kato, Yumiko; Kirimura, Kohtaro; Mizutani, Osamu; Nakasone, Kaoru; Sano, Motoaki; Shiraishi, Yohei; Tsukahara, Masatoshi; Gomi, Katsuya

2016-01-01

Awamori is a traditional distilled beverage made from steamed Thai-Indica rice in Okinawa, Japan. For brewing the liquor, two microbes, local kuro (black) koji mold Aspergillus luchuensis and awamori yeast Saccharomyces cerevisiae are involved. Whereas yeasts are used for ethanol fermentation throughout the world, a characteristic of Japanese fermentation industries is the use of Aspergillus molds as a source of enzymes for the maceration and saccharification of raw materials. Here we report the draft genome of a kuro (black) koji mold, A. luchuensis NBRC 4314 (RIB 2604). The total length of nonredundant sequences was nearly 34.7 Mb, comprising approximately 2,300 contigs with 16 telomere-like sequences. In total, 11,691 genes were predicted to encode proteins. Most of the housekeeping genes, such as transcription factors and N- and O-glycosylation system, were conserved with respect to Aspergillus niger and Aspergillus oryzae. An alternative oxidase and acid-stable α-amylase regarding citric acid production and fermentation at a low pH as well as a unique glutamic peptidase were also found in the genome. Furthermore, key biosynthetic gene clusters of ochratoxin A and fumonisin B were absent when compared with A. niger genome, showing the safety of A. luchuensis for food and beverage production. This genome information will facilitate not only comparative genomics with industrial kuro-koji molds, but also molecular breeding of the molds for the improvement of awamori fermentation. PMID:27651094
The complete genome sequence of a new polerovirus in strawberry plants from eastern Canada showing strawberry decline symptoms.

Science.gov (United States)

Xiang, Yu; Bernardy, Mike; Bhagwat, Basdeo; Wiersma, Paul A; DeYoung, Robyn; Bouthillier, Michel

2015-02-01

Strawberry decline disease, probably caused by synergistic reactions of mixed virus infections, threatens the North American strawberry industry. Deep sequencing of strawberry plant samples from eastern Canada resulted in the identification of a new virus genome resembling poleroviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Polerovirus, family Luteoviridae. The virus is tentatively named "strawberry polerovirus 1" (SPV1).
Allergens in Hymenoptera venom. XXV: The amino acid sequences of antigen 5 molecules and the structural basis of antigenic cross-reactivity.

Science.gov (United States)

Hoffman, D R

1993-11-01

The complete amino acid sequences have been determined by solid-phase protein sequencing for eight different vespid venom antigen 5 molecules. These include five species of yellow jackets, Vespula squamosa, V. flavopilosa, V. germanica, V. pensylvanica and V. vidua, representing all three species groups; two variants from the European hornet, Vespa crabro; and a species of paper wasp, Polistes fuscatus, from a second subgenus. The new sequences were compared with the seven previously published sequences from yellow jackets, hornets, and wasps, and to that of Solenopsis invicta 3 allergen from imported fire ant venom. These comparisons provided structural evidence to support the observed high degree of cross-reactivity among the antigens of the common group of yellow jackets and among those of the two common North American subgenera of paper wasps studied. The antigen 5 of V. squamosa and of V. vidua were significantly different from those of the vulgaris group. Common features that could generate immunologic cross-reactivity were seen among the antigen 5 molecules of hornets of both genera and among those of yellow jackets, hornets, and paper wasps. The imported fire ant allergen has only minimal conserved areas in common with the vespid allergens, which explains the lack of observed IgE cross-reactivity. These results provide the structural basis for the cross-reactivity patterns observed in clinical practice and suggest that the commercial extracts of yellow jacket and paper wasp could be prepared with fewer carefully selected species.
Data for amino acid alignment of Japanese stingray melanocortin receptors with other gnathostome melanocortin receptor sequences, and the ligand selectivity of Japanese stingray melanocortin receptors

Directory of Open Access Journals (Sweden)

Akiyoshi Takahashi

2016-06-01

This article contains structure and pharmacological characteristics of melanocortin receptors (MCRs) related to research published in “Characterization of melanocortin receptors from stingray Dasyatis akajei, a cartilaginous fish” (Takahashi et al., 2016) [1]. The amino acid sequences of the stingray, D. akajei, MC1R, MC2R, MC3R, MC4R, and MC5R were aligned with the corresponding melanocortin receptor sequences from the elephant shark, Callorhinchus milii, the dogfish, Squalus acanthias, the goldfish, Carassius auratus, and the mouse, Mus musculus. These alignments provide the basis for phylogenetic analysis of these gnathostome melanocortin receptor sequences. In addition, the Japanese stingray melanocortin receptors were separately expressed in Chinese Hamster Ovary cells, and stimulated with stingray ACTH, α-MSH, β-MSH, γ-MSH, δ-MSH, and β-endorphin. The dose response curves reveal the order of ligand selectivity for each stingray MCR.
Exploring the potential of second-generation sequencing in diverse biological contexts

DEFF Research Database (Denmark)

Fordyce, Sarah Louise

Second generation sequencing (SGS) has revolutionized the study of DNA, allowing massive parallel sequencing of nucleic acids with unprecedented depths of coverage. The research undertaken in this thesis occurred in parallel with the increased accessibility of SGS platforms for routine genetic...
Variation of clinical expression in patients with Stargardt dystrophy and sequence variations in the ABCR gene.

Science.gov (United States)

Fishman, G A; Stone, E M; Grover, S; Derlacki, D J; Haines, H L; Hockey, R R

1999-04-01

To report the spectrum of ophthalmic findings in patients with Stargardt dystrophy or fundus flavimaculatus who have a specific sequence variation in the ABCR gene. Twenty-nine patients with Stargardt dystrophy or fundus flavimaculatus from different pedigrees were identified with possible disease-causing sequence variations in the ABCR gene from a group of 66 patients who were screened for sequence variations in this gene. Patients underwent a routine ocular examination, including slitlamp biomicroscopy and a dilated fundus examination. Fluorescein angiography was performed on 22 patients, and electroretinographic measurements were obtained on 24 of 29 patients. Kinetic visual fields were measured with a Goldmann perimeter in 26 patients. Single-strand conformation polymorphism analysis and DNA sequencing were used to identify variations in coding sequences of the ABCR gene. Three clinical phenotypes were observed among these 29 patients. In phenotype I, 9 of 12 patients had a sequence change in exon 42 of the ABCR gene in which the amino acid glutamic acid was substituted for glycine (Gly1961Glu). In only 4 of these 9 patients was a second possible disease-causing mutation found on the other ABCR allele. In addition to an atrophic-appearing macular lesion, phenotype I was characterized by localized perifoveal yellowish white flecks, the absence of a dark choroid, and normal electroretinographic amplitudes. Phenotype II consisted of 10 patients who showed a dark choroid and more diffuse yellowish white flecks in the fundus. None exhibited the Gly1961Glu change. Phenotype III consisted of 7 patients who showed extensive atrophic-appearing changes of the retinal pigment epithelium. Electroretinographic cone and rod amplitudes were reduced. One patient showed the Gly1961Glu change. A wide variation in clinical phenotype can occur in patients with sequence changes in the ABCR gene. In individual patients, a certain phenotype seems to be associated with the presence of
The sequence of camelpox virus shows it is most closely related to variola virus, the cause of smallpox.

Science.gov (United States)

Gubser, Caroline; Smith, Geoffrey L

2002-04-01

Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.
Quantitative assessment of hepatic function: modified look-locker inversion recovery (MOLLI) sequence for T1 mapping on Gd-EOB-DTPA-enhanced liver MR imaging

Energy Technology Data Exchange (ETDEWEB)

Yoon, Jeong Hee [Seoul National University Hospital, Department of Radiology, Seoul (Korea, Republic of); Lee, Jeong Min; Han, Joon Koo; Choi, Byung Ihn [Seoul National University Hospital, Department of Radiology, Seoul (Korea, Republic of); Seoul National University College of Medicine, Institute of Radiation Medicine, Jongno-gu, Seoul (Korea, Republic of); Paek, Munyoung [Siemens Healthcare, Seoul (Korea, Republic of)

2016-06-15

To determine whether multislice T1 mapping of the liver using a modified look-locker inversion recovery (MOLLI) sequence on gadoxetic acid-enhanced magnetic resonance imaging (MRI) can be used as a quantitative tool to estimate liver function and predict the presence of oesophageal or gastric varices. Phantoms filled with gadoxetic acid were scanned three times using MOLLI sequence to test repeatability. Patients with chronic liver disease or liver cirrhosis who underwent gadoxetic acid-enhanced liver MRI including MOLLI sequence at 3 T were included (n = 343). Pre- and postcontrast T1 relaxation times of the liver (T1liver), changes between pre- and postcontrast T1liver (ΔT1liver), and adjusted postcontrast T1liver (postcontrast T1liver-T1spleen/T1spleen) were compared among Child-Pugh classes. In 62 patients who underwent endoscopy, all T1 parameters and spleen sizes were correlated with varices. Phantom study showed excellent repeatability of MOLLI sequence. As Child-Pugh scores increased, pre- and postcontrast T1liver were significantly prolonged (P < 0.001), and ΔT1liver and adjusted postcontrast T1liver decreased (P < 0.001). Adjusted postcontrast T1liver and spleen size were independently associated with varices (R² = 0.29, P < 0.001). T1 mapping of the liver using MOLLI sequence on gadoxetic acid-enhanced MRI demonstrated potential in quantitatively estimating liver function, and adjusted postcontrast T1liver was significantly associated with varices. (orig.)

A sequence in subdomain 2 of DBL1α of Plasmodium falciparum erythrocyte membrane protein 1 induces strain transcending antibodies.

Directory of Open Access Journals (Sweden)

Karin Blomqvist

Immunity to severe malaria is the first level of immunity acquired to Plasmodium falciparum. Antibodies to the variant antigen PfEMP1 (P. falciparum erythrocyte membrane protein 1) present at the surface of the parasitized red blood cell (pRBC) confer protection by blocking microvascular sequestration. Here we have generated antibodies to peptide sequences of subdomain 2 of PfEMP1-DBL1α previously identified to be associated with severe or mild malaria. A set of sera generated to the amino acid sequence KLQTLTLHQVREYWWALNRKEVWKA, containing the motif ALNRKE, stained the live pRBC. 50% of parasites tested (7/14) were positive both in flow cytometry and immunofluorescence assays with live pRBCs including both laboratory strains and in vitro adapted clinical isolates. Antibodies that reacted selectively with the sequence REYWWALNRKEVWKA in a 15-mer peptide array of DBL1α-domains were also found to react with the pRBC surface. By utilizing a peptide array to map the binding properties of the elicited anti-DBL1α antibodies, the amino acids WxxNRx were found essential for antibody binding. Complementary experiments using 135 degenerate RDSM peptide sequences obtained from 93 Ugandan patient-isolates showed that antibody binding occurred when the amino acids WxLNRKE/D were present in the peptide. The data suggest that the ALNRKE sequence motif, associated with severe malaria, induces strain-transcending antibodies that react with the pRBC surface.
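The binding motifs named in this abstract (WxxNRx and WxLNRKE/D) can be read as simple residue patterns. The sketch below assumes that "x" stands for any single residue and that "E/D" means either glutamate or aspartate at the last position; that reading, the regular expressions, and the helper name are illustrative assumptions rather than the authors' method. The only sequence used is the 25-residue immunogen quoted in the abstract.

```python
import re

# Assumed translation of the abstract's motif notation into regex syntax.
CORE_MOTIF = re.compile(r"W..NR.")          # WxxNRx
EXTENDED_MOTIF = re.compile(r"W.LNRK[ED]")  # WxLNRKE/D

# Immunogen sequence as given in the abstract.
IMMUNOGEN = "KLQTLTLHQVREYWWALNRKEVWKA"


def find_motif(pattern, peptide):
    """Return (0-based start, matched residues) for the first hit, or None."""
    match = pattern.search(peptide)
    return (match.start(), match.group()) if match else None


print(find_motif(CORE_MOTIF, IMMUNOGEN))      # (14, 'WALNRK')
print(find_motif(EXTENDED_MOTIF, IMMUNOGEN))  # (14, 'WALNRKE')
print("ALNRKE" in IMMUNOGEN)                  # True
```

The same patterns could be applied to each peptide of an array to flag candidates, although a pattern match alone says nothing about whether an antibody actually binds.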
An acid phosphatase in the plasma membranes of human astrocytoma showing marked specificity toward phosphotyrosine protein.

OpenAIRE

Leis, J F; Kaplan, N O

1982-01-01

The plasma membrane from the human tumor astrocytoma contains an active acid phosphatase activity based on hydrolysis of p-nitrophenyl phosphate. Other acid phosphatase substrates--beta-glycerophosphate, O-phosphorylcholine, and 5'-AMP--are not hydrolyzed significantly. The phosphatase activity is tartrate insensitive and is stimulated by Triton X-100 and EDTA. Of the three known phosphoamino acids, only free O-phosphotyrosine is hydrolyzed by the membrane phosphatase activity. Other acid pho...
Complete cDNA sequence coding for human docking protein

Energy Technology Data Exchange (ETDEWEB)

Hortsch, M; Labeit, S; Meyer, D I

1988-01-11

Docking protein (DP, or SRP receptor) is a rough endoplasmic reticulum (ER)-associated protein essential for the targeting and translocation of nascent polypeptides across this membrane. It specifically interacts with a cytoplasmic ribonucleoprotein complex, the signal recognition particle (SRP). The nucleotide sequence of cDNA encoding the entire human DP and its deduced amino acid sequence are given.
The application of strand invasion phenomenon, directed by peptide nucleic acid (PNA) and single-stranded DNA binding protein (SSB) for the recognition of specific sequences of human endogenous retroviral HERV-W family.

Science.gov (United States)

Machnik, Grzegorz; Bułdak, Łukasz; Ruczyński, Jarosław; Gąsior, Tomasz; Huzarska, Małgorzata; Belowski, Dariusz; Alenowicz, Magdalena; Mucha, Piotr; Rekowski, Piotr; Okopień, Bogusław

2017-05-01

The HERV-W family of human endogenous retroviruses represents a group of numerous sequences that show close similarity in genetic composition. It has been documented that some HERV-W-derived expression products are thought to play a significant role in human pathology, such as multiple sclerosis or schizophrenia. Other members of the family are necessary to orchestrate physiological processes (eg, ERVWE1 coding syncytin-1 that is engaged in syncytiotrophoblast formation). Therefore, an assay that would allow the recognition of particular forms of HERV-W members is highly desirable. A peptide nucleic acid (PNA)-mediated technique for the discrimination between multiple sclerosis-associated retrovirus and ERVWE1 sequence has been developed. The assay uses a PNA probe that, being fully complementary to the ERVWE1 but not to multiple sclerosis-associated retrovirus (MSRV) template, shows high selective potential. Single-stranded DNA binding protein facilitates the PNA-mediated, sequence-specific formation of a strand invasion complex and, consequently, local DNA unwinding. The target DNA may then be excluded from further analysis in any downstream process such as single-stranded DNA-specific exonuclease action. Finally, the reaction conditions have been optimized, and several PNA probes that are targeted toward distinct loci along whole HERV-W env sequences have been evaluated. We believe that the PNA/single-stranded DNA binding protein-based application has the potential to selectively discriminate particular HERV-W molecules as they are at least suspected to play a pathogenic role in a broad range of medical conditions, from psycho-neurologic disorders (multiple sclerosis and schizophrenia) and cancers (breast cancer) to those of an autoimmune background (psoriasis and lupus erythematosus). Copyright © 2016 John Wiley & Sons, Ltd.
Comparative analysis of microbial community of novel lactic acid fermentation inoculated with different undefined mixed cultures.

Science.gov (United States)

Liang, Shaobo; Gliniewicz, Karol; Mendes-Soares, Helena; Settles, Matthew L; Forney, Larry J; Coats, Erik R; McDonald, Armando G

2015-03-01

Three undefined mixed cultures (activated sludge) from different municipal wastewater treatment plants were used as seeds in a novel lactic acid fermentation process fed with potato peel waste (PPW). Anaerobic sequencing batch fermenters were run under identical conditions to produce predominantly lactic acid. Illumina sequencing was used to examine the 16S rRNA genes of bacteria in the three seeds and fermenters. Results showed that the structure of microbial communities of three seeds were different. All three fermentation products had unique community structures that were dominated (>96%) by species of the genus Lactobacillus, while members of this genus constituted undefined mixed cultures were robust and resilient, which provided engineering prospects for the microbial utilization of carbohydrate wastes to produce lactic acid. Copyright © 2014 Elsevier Ltd. All rights reserved.
Carrot Juice Fermentations as Man-Made Microbial Ecosystems Dominated by Lactic Acid Bacteria.

Science.gov (United States)

Wuyts, Sander; Van Beeck, Wannes; Oerlemans, Eline F M; Wittouck, Stijn; Claes, Ingmar J J; De Boeck, Ilke; Weckx, Stefan; Lievens, Bart; De Vuyst, Luc; Lebeer, Sarah

2018-06-15

Spontaneous vegetable fermentations, with their rich flavors and postulated health benefits, are regaining popularity. However, their microbiology is still poorly understood, therefore raising concerns about food safety. In addition, such spontaneous fermentations form interesting cases of man-made microbial ecosystems. Here, samples from 38 carrot juice fermentations were collected through a citizen science initiative, in addition to three laboratory fermentations. Culturing showed that Enterobacteriaceae were outcompeted by lactic acid bacteria (LAB) between 3 and 13 days of fermentation. Metabolite-target analysis showed that lactic acid and mannitol were highly produced, as well as the biogenic amine cadaverine. High-throughput 16S rRNA gene sequencing revealed that mainly species of Leuconostoc and Lactobacillus (as identified by 8 and 20 amplicon sequence variants [ASVs], respectively) mediated the fermentations in subsequent order. The analyses at the DNA level still detected a high number of Enterobacteriaceae , but their relative abundance was low when RNA-based sequencing was performed to detect presumptive metabolically active bacterial cells. In addition, this method greatly reduced host read contamination. Phylogenetic placement indicated a high LAB diversity, with ASVs from nine different phylogenetic groups of the Lactobacillus genus complex. However, fermentation experiments with isolates showed that only strains belonging to the most prevalent phylogenetic groups preserved the fermentation dynamics. The carrot juice fermentation thus forms a robust man-made microbial ecosystem suitable for studies on LAB diversity and niche specificity. IMPORTANCE The usage of fermented food products by professional chefs is steadily growing worldwide. Meanwhile, this interest has also increased at the household level. However, many of these artisanal food products remain understudied. Here, an extensive microbial analysis was performed of spontaneous fermented
A quantitative measure of chirality inside nucleic acid databank.

Science.gov (United States)

Pietropaolo, Adriana; Parrinello, Michele

2011-08-01

We show the capability of a chirality index (Pietropaolo et al., Proteins 2008;70:667-677) to investigate nucleic acid structures because of its high sensitivity to helical conformations. By analyzing selected structures of DNA and RNA, we have found that sequences rich in cytosine and guanine have a tendency to left-handed chirality, in contrast to regions rich in adenine or thymine which show strong negative, right-handed, chirality values. We also analyze RNA structures, where specific loops and hairpin motifs are characterized by a well-defined chirality value. We find that in nucleosome the chirality is exalted, whereas in ribosome it is reduced. Our results illustrate the sensitivity of this descriptor for nucleic acid conformations. Copyright © 2011 Wiley-Liss, Inc.
Rapid Multiplex Small DNA Sequencing on the MinION Nanopore Sequencing Platform

Directory of Open Access Journals (Sweden)

Shan Wei

2018-05-01

Real-time sequencing of short DNA reads has a wide variety of clinical and research applications including screening for mutations, target sequences and aneuploidy. We recently demonstrated that MinION, a nanopore-based DNA sequencing device the size of a USB drive, could be used for short-read DNA sequencing. In this study, an ultra-rapid multiplex library preparation and sequencing method for the MinION is presented and applied to accurately test normal diploid and aneuploidy samples’ genomic DNA in under three hours, including library preparation and sequencing. This novel method shows great promise as a clinical diagnostic test for applications requiring rapid short-read DNA sequencing.
Prospects for Fungal Bioremediation of Acidic Radioactive Waste Sites: Characterization and Genome Sequence of Rhodotorula taiwanensis MD1149

Directory of Open Access Journals (Sweden)

Rok Tkavc

2018-01-01

Highly concentrated radionuclide waste produced during the Cold War era is stored at US Department of Energy (DOE) production sites. This radioactive waste was often highly acidic and mixed with heavy metals, and has been leaking into the environment since the 1950s. Because of the danger and expense of cleanup of such radioactive sites by physicochemical processes, in situ bioremediation methods are being developed for cleanup of contaminated ground and groundwater. To date, the most developed microbial treatment proposed for high-level radioactive sites employs the radiation-resistant bacterium Deinococcus radiodurans. However, the use of Deinococcus spp. and other bacteria is limited by their sensitivity to low pH. We report the characterization of 27 diverse environmental yeasts for their resistance to ionizing radiation (chronic and acute), heavy metals, pH minima, temperature maxima and optima, and their ability to form biofilms. Remarkably, many yeasts are extremely resistant to ionizing radiation and heavy metals. They also excrete carboxylic acids and are exceptionally tolerant to low pH. A special focus is placed on Rhodotorula taiwanensis MD1149, which was the most resistant to acid and gamma radiation. MD1149 is capable of growing under 66 Gy/h at pH 2.3 and in the presence of high concentrations of mercury and chromium compounds, and forming biofilms under high-level chronic radiation and low pH. We present the whole genome sequence and annotation of R. taiwanensis strain MD1149, with a comparison to other Rhodotorula species. This survey elevates yeasts to the frontier of biology's most radiation-resistant representatives, presenting a strong rationale for a role of fungi in bioremediation of acidic radioactive waste sites.
Reference-quality genome sequence of Aegilops tauschii, the source of wheat D genome, shows that recombination shapes genome structure and evolution

Science.gov (United States)

Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat and an important genetic resource for wheat. A reference-quality sequence for the Ae. tauschii genome was produced with a combination of ordered-clone sequencing, whole-genome shotgun sequencing, and BioNano optical geno...
The evolutionary sequence: origin and emergences

Science.gov (United States)

Fox, S. W.

1986-01-01

The evolutionary sequence is being reexamined experimentally from a "Big Bang" origin to the protocell and from the emergence of protocell and variety of species to Darwin's mental power (mind) and society (The Descent of Man). A most fundamentally revisionary consequence of experiments is an emphasis on endogenous ordering. This principle, seen vividly in ordered copolymerization of amino acids, has had new impact on the theory of Darwinian evolution and has been found to apply to the entire sequence. Herein, I will discuss some problems of dealing with teaching controversial subjects.
The evolutionary sequence: origin and emergences.

Science.gov (United States)

Fox, S W

1986-03-01

The evolutionary sequence is being reexamined experimentally from a "Big Bang" origin to the protocell and from the emergence of protocell and variety of species to Darwin's mental power (mind) and society (The Descent of Man). A most fundamentally revisionary consequence of experiments is an emphasis on endogenous ordering. This principle, seen vividly in ordered copolymerization of amino acids, has had new impact on the theory of Darwinian evolution and has been found to apply to the entire sequence. Herein, I will discuss some problems of dealing with teaching controversial subjects.
Analysis of correlations between sites in models of protein sequences

International Nuclear Information System (INIS)

Giraud, B.G.; Lapedes, A.; Liu, L.C.

1998-01-01

A criterion based on conditional probabilities, related to the concept of algorithmic distance, is used to detect correlated mutations at noncontiguous sites on sequences. We apply this criterion to the problem of analyzing correlations between sites in protein sequences; however, the analysis applies generally to networks of interacting sites with discrete states at each site. Elementary models, where explicit results can be derived easily, are introduced. The number of states per site considered ranges from 2, illustrating the relation to familiar classical spin systems, to 20 states, suitable for representing amino acids. Numerical simulations show that the criterion remains valid even when the genetic history of the data samples (e.g., protein sequences), as represented by a phylogenetic tree, introduces nonindependence between samples. Statistical fluctuations due to finite sampling are also investigated and do not invalidate the criterion. A subsidiary result is found: The more homogeneous a population, the more easily its average properties can drift from the properties of its ancestor. copyright 1998 The American Physical Society
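To make the idea of detecting correlated sites from conditional probabilities concrete, here is a minimal Python sketch. It is not the criterion derived in the paper: the statistic (the largest gap between P(b at site j | a at site i) and the unconditional P(b at site j)), the function name `conditional_shift`, and the four toy sequences are all assumptions chosen for illustration.

```python
from collections import Counter

def conditional_shift(seqs, i, j):
    """Largest |P(b at j | a at i) - P(b at j)| over all observed states a, b.

    Illustrative statistic only (an assumption, not the paper's criterion).
    A value near 0 suggests the two sites vary independently in this sample.
    """
    n = len(seqs)
    count_i = Counter(s[i] for s in seqs)            # state counts at site i
    count_j = Counter(s[j] for s in seqs)            # state counts at site j
    count_ij = Counter((s[i], s[j]) for s in seqs)   # joint state counts
    best = 0.0
    for a, n_a in count_i.items():
        for b, n_b in count_j.items():
            p_b_given_a = count_ij[(a, b)] / n_a     # Counter returns 0 if unseen
            p_b = n_b / n
            best = max(best, abs(p_b_given_a - p_b))
    return best

# Hypothetical toy alignment: sites 0 and 1 always change together,
# while site 2 varies independently of site 0.
TOY_ALIGNMENT = ["AKL", "AKV", "GEL", "GEV"]
coupled = conditional_shift(TOY_ALIGNMENT, 0, 1)
independent = conditional_shift(TOY_ALIGNMENT, 0, 2)
```

With real data, finite sampling and shared phylogenetic history would both inflate such a statistic, which is the kind of effect the abstract says the authors examine.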
The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

International Nuclear Information System (INIS)

Nylund, Stian; Karlsen, Marius; Nylund, Are

2008-01-01

The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae
Epidemiology of transmissible diseases: Array hybridization and next generation sequencing as universal nucleic acid-mediated typing tools.

Science.gov (United States)

Michael Dunne, W; Pouseele, Hannes; Monecke, Stefan; Ehricht, Ralf; van Belkum, Alex

2017-09-21

The magnitude of interest in the epidemiology of transmissible human diseases is reflected in the vast number of tools and methods developed recently with the express purpose of characterizing and tracking evolutionary changes that occur in agents of these diseases over time. Within the past decade a new suite of such tools has become available with the emergence of the so-called "omics" technologies. Among these, two are exponents of the ongoing genomic revolution. First, high-density nucleic acid probe arrays have been proposed and developed using various chemical and physical approaches. Via hybridization-mediated detection of entire genes or genetic polymorphisms in such genes and intergenic regions, these so-called "DNA chips" have been successfully applied for distinguishing very closely related microbial species and strains. Second and even more phenomenal, next generation sequencing (NGS) has facilitated the assessment of the complete nucleotide sequence of entire microbial genomes. This technology currently provides the most detailed level of bacterial genotyping and hence allows for the resolution of microbial spread and short-term evolution in minute detail. Here we review the very recent history of these two technologies, sketch their usefulness in the elucidation of the spread and epidemiology of mostly hospital-acquired infections and discuss future developments. Copyright © 2017 Elsevier B.V. All rights reserved.
Gene sequencing, cloning, and expression of the recombinant L-Asparaginase of Pseudomonas aeruginosa SN4 strain in Escherichia coli

Directory of Open Access Journals (Sweden)

Arastoo Badoei-dalfard

2016-03-01

Introduction: L-asparaginase is in high demand in medical applications and in the food-processing industry, and the demand for this therapeutic enzyme is growing severalfold every year. Materials and methods: In this study, an L-asparaginase gene from Pseudomonas aeruginosa strain SN4 was sequenced and cloned in E. coli. Primers were designed based on the L-asparaginase gene from P. aeruginosa DSM 50071, which shows high similarity to strain SN4 according to its 16S rRNA sequence. The L-asparaginase gene was digested with the restriction enzymes NdeI and XhoI and then ligated into the pET21a plasmid. The ligated sample was transformed into competent E. coli (DE3 pLysS DH5a) cells by the CaCl2 method. The transformed E. coli cells were grown on LB agar plates containing 100 µg/ml ampicillin and IPTG (1 mM). Results: Recombinant L-asparaginase from E. coli BL21 was induced after 9 h of incubation and showed high L-asparaginase activity of about 93.4 IU/ml. Sequencing and alignment of the recombinant L-asparaginase showed that the presumed amino acid sequence, composed of 350 amino acid residues, has about 99% similarity with P. aeruginosa L-asparaginases. The results also indicated that SN4 L-asparaginase has catalytic residues and conserved regions similar to those of other L-asparaginases. Discussion and conclusion: This is the first report on the cloning and expression of P. aeruginosa L-asparaginase in Escherichia coli. These results indicate a potent source of L-asparaginase for consideration in in vitro and in vivo anticancer studies.
Structural and sequence features of two residue turns in beta-hairpins.

Science.gov (United States)

Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu

2014-09-01

Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites.

Science.gov (United States)

Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

2012-10-01

To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.
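Percentages such as those reported above are typically derived from aligned sequences. The following Python sketch shows one simple way to compute percent identity; it assumes the two sequences are already aligned to equal length and that gap positions (`-`) are skipped, which may differ from how the study computed similarity. The function name `percent_identity` and the synthetic sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of compared positions at which two aligned sequences agree.

    Assumes a and b are pre-aligned and of equal length; positions where
    either sequence has a gap character '-' are excluded from the count.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared

# Synthetic 339-nt example (not real Demodex data): one substitution.
reference = "ACGT" * 84 + "ACG"     # 339 nucleotides
variant = "T" + reference[1:]       # differs only at the first position
identity = round(percent_identity(reference, variant), 1)
```

As a matter of arithmetic, a single mismatch across 339 compared positions gives 338/339, or about 99.7%, so differences of a few tenths of a percent at this fragment length correspond to only one to three nucleotide changes.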
Comparative genomics of citric-acid producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

Energy Technology Data Exchange (ETDEWEB)

Andersen, Mikael R.; Salazar, Margarita; Schaap, Peter; van de Vondervoort, Peter; Culley, David E.; Thykaer, Jette; Frisvad, Jens C.; Nielsen, Kristian F.; Albang, Richard; Albermann, Kaj; Berka, Randy; Braus, Gerhard; Braus-Stromeyer, Susanna A.; Corrochano, Luis; Dai, Ziyu; van Dijck, Piet; Hofmann, Gerald; Lasure, Linda L.; Magnuson, Jon K.; Menke, Hildegard; Meijer, Martin; Meijer, Susan; Nielsen, Jakob B.; Nielsen, Michael L.; van Ooyen, Albert; Pel, Herman J.; Poulsen, Lars; Samson, Rob; Stam, Hein; Tsang, Adrian; van den Brink, Johannes M.; Atkins, Alex; Aerts, Andrea; Shapiro, Harris; Pangilinan, Jasmyn; Salamov, Asaf; Lou, Yigong; Lindquist, Erika; Lucas, Susan; Grimwood, Jane; Grigoriev, Igor V.; Kubicek, Christian P.; Martinez, Diego; van Peij, Noel; Roubos, Johannes A.; Nielsen, Jens B.; Baker, Scott E.

2011-06-01

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compels additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases and protein transporters.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Antifungal activity of secondary plant metabolites from potatoes (Solanum tuberosum L.): Glycoalkaloids and phenolic acids show synergistic effects.

Science.gov (United States)

Sánchez-Maldonado, A F; Schieber, A; Gänzle, M G

2016-04-01

To study the antifungal effects of the potato secondary metabolites α-solanine, α-chaconine, solanidine and caffeic acid, alone or combined. Resistance to glycoalkaloids varied among the fungal species tested, as derived from minimum inhibitory concentrations assays. Synergistic antifungal activity between glycoalkaloids and phenolic compounds was found. Changes in the fluidity of fungal membranes caused by potato secondary plant metabolites were determined by calculation of the generalized polarization values. The results partially explained the synergistic effect between caffeic acid and α-chaconine and supported findings on membrane disruption mechanisms from previous studies on artificial membranes. LC/MS analysis was used to determine variability and relative amounts of sterols in the different fungal species. Results suggested that the sterol pattern of fungi is related to their resistance to potato glycoalkaloids and to their taxonomy. Fungal resistance to α-chaconine and possibly other glycoalkaloids is species dependent. α-Chaconine and caffeic acid show synergistic antifungal activity. The taxonomic classification and the sterol pattern play a role in fungal resistance to glycoalkaloids. Results improve the understanding of the antifungal mode of action of potato secondary metabolites, which is essential for their potential utilization as antifungal agents in nonfood systems. © 2016 The Society for Applied Microbiology.
Isolation and N-terminal sequencing of a novel cadmium-binding protein from Boletus edulis

Science.gov (United States)

Collin-Hansen, C.; Andersen, R. A.; Steinnes, E.

2003-05-01

A Cd-binding protein was isolated from the popular edible mushroom Boletus edulis, which is a hyperaccumulator of both Cd and Hg. Wild-growing samples of B. edulis were collected from soils rich in Cd. Cd radiotracer was added to the crude protein preparation obtained from ethanol precipitation of heat-treated cytosol. Proteins were then further separated in two consecutive steps: gel filtration and anion exchange chromatography. In both steps the Cd radiotracer profile showed only one distinct peak, which corresponded well with the profiles of endogenous Cd obtained by atomic absorption spectrophotometry (AAS). Concentrations of the essential elements Cu and Zn were low in the protein fractions high in Cd. N-terminal sequencing performed on the Cd-binding protein fractions revealed a protein with a novel amino acid sequence, which contained aromatic amino acids as well as proline. Both the N-terminal sequencing and spectrofluorimetric analysis with EDTA and ABD-F (4-aminosulfonyl-7-fluoro-2,1,3-benzoxadiazole) failed to detect cysteine in the Cd-binding fractions. From these findings we conclude that the novel protein does not belong to the metallothionein family. The results suggest a role for the protein in Cd transport and storage, and they are of importance in view of toxicology and food chemistry, but also for environmental protection.
Sequence alignment reveals possible MAPK docking motifs on HIV proteins.

Directory of Open Access Journals (Sweden)

Perry Evans

Over the course of HIV infection, virus replication is facilitated by the phosphorylation of HIV proteins by human ERK1 and ERK2 mitogen-activated protein kinases (MAPKs). MAPKs are known to phosphorylate their substrates by first binding with them at a docking site. Docking site interactions could be viable drug targets because the sequences guiding them are more specific than phosphorylation consensus sites. In this study we use multiple bioinformatics tools to discover candidate MAPK docking site motifs on HIV proteins known to be phosphorylated by MAPKs, and we discuss the possibility of targeting docking sites with drugs. Using sequence alignments of HIV proteins of different subtypes, we show that MAPK docking patterns previously described for human proteins appear on the HIV matrix, Tat, and Vif proteins in a strain-dependent manner, but are absent from HIV Rev and appear on all HIV Nef strains. We revise the regular expressions of previously annotated MAPK docking patterns in order to provide a subtype-independent motif that annotates all HIV proteins. One revision is based on a documented human variant of one of the substrate docking motifs, and the other reduces the number of required basic amino acids in the standard docking motifs from two to one. The proposed patterns are shown to be consistent with in silico docking between ERK1 and the HIV matrix protein. The motif usage on HIV proteins is sufficiently different from human proteins in amino acid sequence similarity to allow for HIV-specific targeting using small-molecule drugs.
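As a rough illustration of how a regular-expression scan for docking motifs can work, the Python sketch below searches a protein sequence with two simplified patterns. Both patterns, the function name `find_docking_sites`, and the toy sequence are hypothetical and are not the patterns revised in the study; the relaxed variant only mirrors the general idea of lowering the number of required basic residues from two to one.

```python
import re

# Hypothetical, simplified docking-site patterns (assumptions for illustration).
# Strict: two basic residues, a 2-6 residue spacer, then hydrophobic-X-hydrophobic.
STRICT = re.compile(r"[KR]{2}.{2,6}[LIV].[LIV]")
# Relaxed: a single basic residue is sufficient.
RELAXED = re.compile(r"[KR].{2,6}[LIV].[LIV]")

def find_docking_sites(sequence, pattern):
    """Return (start_index, matched_text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in pattern.finditer(sequence)]

# Toy sequence, not an HIV protein.
TOY_SEQUENCE = "AAKGGGLALAAAARRGGGLAVAA"
strict_hits = find_docking_sites(TOY_SEQUENCE, STRICT)
relaxed_hits = find_docking_sites(TOY_SEQUENCE, RELAXED)
```

On the toy sequence the relaxed pattern reports an additional candidate site that the strict pattern misses, which illustrates why loosening a motif raises coverage at the likely cost of more false positives.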
Mutational properties of amino acid residues: implications for evolvability of phosphorylatable residues

DEFF Research Database (Denmark)

Creixell, Pau; Schoof, Erwin M.; Tan, Chris Soon Heng

2012-01-01

in terms of their mutational activity. Moreover, we highlight the importance of the genetic code and physico-chemical properties of the amino acid residues as likely causes of these inequalities and uncover serine as a mutational hot spot. Finally, we explore the consequences that these different......; it is typically assumed that all amino acid residues are equally likely to mutate or to result from a mutation. Here, by reconstructing ancestral sequences and computing mutational probabilities for all the amino acid residues, we refute this assumption and show extensive inequalities between different residues...... mutational properties have on phosphorylation site evolution, showing that a higher degree of evolvability exists for phosphorylated threonine and, to a lesser extent, serine in comparison with tyrosine residues. As exemplified by the suppression of serine's mutational activity in phosphorylation sites, our...
[Cloning and sequencing of the papA gene from uropathogenic Escherichia coli 4030 strain].

Science.gov (United States)

Wu, Qinggang; Zhang, Jingping; Zhao, Chuncheng; Zhu, Jianguo

2008-09-01

The papA gene from uropathogenic Escherichia coli strain 4030 (UPEC4030) was cloned and sequenced to investigate how its sequence differs from those of related genes and to determine whether it represents a new genotype. Cloning and sequencing methods were used to analyze the papA sequence of UPEC4030 in comparison with related sequences. Sequence analysis of papA revealed a 722 bp gene encoding a 192-amino-acid polypeptide. The overall homology between the papA gene of UPEC4030 and those of the standard strains of ten F types was 36.11%-77.95% at the nucleotide level and 22.20%-78.34% at the deduced amino acid level. The homology between the sequences of the reverse primers and the corresponding sequence of UPEC4030 papA was 10%-66.67%. The results confirmed that UPEC4030 contains a novel papA variant. UPEC4030 could contain an unknown papA variant or a novel genotype. The related pathogenic mechanism and epidemiology need further study.
Fatty acid elongase 1 (FAE1) promoter as a candidate for genetic ...

African Journals Online (AJOL)

As an important cis-regulatory element, a promoter plays a key role in plant gene expression and regulation, and has been widely used in plant genetic engineering. The fatty acid elongase 1 (FAE1) promoter was isolated from Arabidopsis thaliana. Sequence analysis showed that the FAE1 promoter contains two Skn-1 ...
Molecular Profiling of Microbial Communities from Contaminated Sources: Use of Subtractive Cloning Methods and rDNA Spacer Sequences; FINAL

International Nuclear Information System (INIS)

Robb, Frank T.

2001-01-01

The major objective of this research was to provide appropriate sequences and assemble a DNA array of oligonucleotides to be used for rapid profiling of microbial populations from polluted areas and other areas of interest. The sequences to be assigned to the DNA array were chosen from cloned genomic DNA taken from groundwater sites having well characterized pollutant histories at Hanford Nuclear Plant and Lawrence Livermore Site 300. Glass-slide arrays were made and tested, and a new multiplexed, bead-based method was developed that uses nucleic acid hybridization on the surface of microscopic polystyrene spheres to identify specific sequences in heterogeneous mixtures of DNA sequences. The test data revealed considerable strain variation between sample sites, showing a striking distribution of sequences. The data also suggest that diversity varies greatly with bioremediation, and that there are many bacterial intergenic spacer region sequences that can indicate its effects. The bead method exhibited superior sequence discrimination and has features for easier and more accurate measurement
Nematicidal Activity of Kojic Acid Produced by Aspergillus oryzae against Meloidogyne incognita.

Science.gov (United States)

Kim, Tae Yoon; Jang, Ja Yeong; Jeon, Sun Jeong; Lee, Hye Won; Bae, Chang-Hwan; Yeo, Joo Hong; Lee, Hyang Burm; Kim, In Seon; Park, Hae Woong; Kim, Jin-Cheol

2016-08-28

The fungal strain EML-DML3PNa1 isolated from leaf of white dogwood (Cornus alba L.) showed strong nematicidal activity with juvenile mortality of 87.6% at a concentration of 20% fermentation broth filtrate at 3 days after treatment. The active fungal strain was identified as Aspergillus oryzae, which belongs to section Flavi, based on the morphological characteristics and sequence analysis of the ITS rDNA, calmodulin (CaM), and β-tubulin (BenA) genes. The strain reduced the pH value to 5.62 after 7 days of incubation. Organic acid analysis revealed the presence of citric acid (515.0 mg/kg), malic acid (506.6 mg/kg), and fumaric acid (21.7 mg/kg). The three organic acids showed moderate nematicidal activities, but the mixture of citric acid, malic acid, and fumaric acid did not exhibit the full nematicidal activity of the culture filtrate of EML-DML3PNa1. Bioassay-guided fractionation coupled with ¹H- and ¹³C-NMR and EI-MS analyses led to identification of kojic acid as the major nematicidal metabolite. Kojic acid exhibited dose-dependent mortality and inhibited the hatchability of M. incognita, showing EC50 values of 195.2 µg/ml and 238.3 µg/ml, respectively, at 72 h postexposure. These results suggest that A. oryzae EML-DML3PNa1 and kojic acid have potential as a biological control agent against M. incognita.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

Science.gov (United States)

Sakoda, H; Imanaka, T

1992-02-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH.
Sequence motifs in MADS transcription factors responsible for specificity and diversification of protein-protein interaction.

Directory of Open Access Journals (Sweden)

Aalt D J van Dijk

Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and
The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

Science.gov (United States)

Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

1992-02-01

The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (53-61% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein, myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs
Induction of Heavy-Metal-Transporting CPX-Type ATPases during Acid Adaptation in Lactobacillus bulgaricus▿

Science.gov (United States)

Penaud, S.; Fernandez, A.; Boudebbouze, S.; Ehrlich, S. D.; Maguin, E.; van de Guchte, M.

2006-01-01

Lactobacillus bulgaricus is a lactic acid bacterium (LAB) that, through the production of lactic acid, gradually acidifies its environment during growth. In the course of this process, L. bulgaricus acquires an improved tolerance to acidity. A survey of the recently established genome sequence shows that this bacterium possesses few of the pH control functions that have been described in other LAB and raises the question of what other mechanisms could be involved in its adaptation to the decreasing environmental pH. In some bacteria other than LAB, ion transport systems have been implicated in acid adaptation. We therefore studied the expression of this type of transport system during acid adaptation in L. bulgaricus by reverse transcription and real-time quantitative PCR and mapped transcription start sites. Intriguingly, the most significantly induced were three ATPases carrying the CPX signature of heavy-metal transporters. Protein homology and the presence of a conserved sequence motif in the promoter regions of the genes encoding these proteins strongly suggest that they are involved in copper homeostasis. Induction of this system is thought to assist in avoiding indirect damage that could result from medium acidification. PMID:16997986
Neutron scattering shows a droplet of oleic acid at the center of the BAMLET complex.

Science.gov (United States)

Rath, Emma M; Duff, Anthony P; Gilbert, Elliot P; Doherty, Greg; Knott, Robert B; Church, W Bret

2017-07-01

The anti-cancer complex, Bovine Alpha-lactalbumin Made LEthal to Tumors (BAMLET), has intriguing broad-spectrum anti-cancer activity. Although aspects of BAMLET's anti-cancer mechanism are still not known, it is understood that it involves the oleic acid or oleate component of BAMLET being preferentially released into cancer cell membranes leading to increased membrane permeability and lysis. The structure of the protein component of BAMLET has previously been elucidated by small angle X-ray scattering (SAXS) to be partially unfolded and dramatically enlarged. However, the structure of the oleic acid component of BAMLET and its disposition with respect to the protein component was not revealed as oleic acid has the same X-ray scattering length density (SLD) as water. Employing the difference in the neutron SLDs of hydrogen and deuterium, we carried out solvent contrast variation small angle neutron scattering (SANS) experiments of hydrogenated BAMLET in deuterated water buffers, to reveal the size, shape, and disposition of the oleic acid component of BAMLET. Our resulting analysis and models generated from SANS and SAXS data indicate that oleic acid forms a spherical droplet of oil incompletely encapsulated by the partially unfolded protein component. This model provides insight into the anti-cancer mechanism of this cache of lipid. The model also reveals a protein component "tail" not associated with the oleic acid component that is able to interact with the tail of other BAMLET molecules, providing a plausible explanation of how BAMLET readily forms aggregates. Proteins 2017; 85:1371-1378. © 2017 Wiley Periodicals, Inc.
Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins

DEFF Research Database (Denmark)

Madsen, Peder; Rasmussen, H H; Leffers, H

1992-01-01

termed PA-FABP (psoriasis-associated fatty acid-binding protein). The deduced sequence predicted a protein with molecular weight of 15,164 daltons and a calculated pI of 6.96, values that are close to those recorded in the keratinocyte 2D gel protein database. The protein comigrated with PA-FABP...... as determined by 2D gel analysis of [35S]-methionine-labeled proteins expressed by transformed human amnion (AMA) cells transfected with clone 1592 using the vaccinia virus expression system and reacted with a rabbit polyclonal antibody raised against 2D gel purified PA-FABP. Structural analysis of the amino...... acid sequence revealed 48%, 52%, and 56% identity to known low-molecular-weight fatty acid-binding proteins belonging to the FABP family. Northern blot analysis showed that PA-FABP mRNA is indeed highly up-regulated in psoriatic keratinocytes. The transcript is present in human cell lines of epithelial...
Citrate synthase gene sequence: a new tool for phylogenetic analysis and identification of Ehrlichia.

Science.gov (United States)

Inokuma, H; Brouqui, P; Drancourt, M; Raoult, D

2001-09-01

The sequences of the citrate synthase gene (gltA) of 13 ehrlichial species (Ehrlichia chaffeensis, Ehrlichia canis, Ehrlichia muris, an Ehrlichia species recently detected from Ixodes ovatus, Cowdria ruminantium, Ehrlichia phagocytophila, Ehrlichia equi, the human granulocytic ehrlichiosis [HGE] agent, Anaplasma marginale, Anaplasma centrale, Ehrlichia sennetsu, Ehrlichia risticii, and Neorickettsia helminthoeca) have been determined by degenerate PCR and the Genome Walker method. The ehrlichial gltA genes are 1,197 bp (E. sennetsu and E. risticii) to 1,254 bp (A. marginale and A. centrale) long, and GC contents of the gene vary from 30.5% (Ehrlichia sp. detected from I. ovatus) to 51.0% (A. centrale). The percent identities of the gltA nucleotide sequences among ehrlichial species were 49.7% (E. risticii versus A. centrale) to 99.8% (HGE agent versus E. equi). The percent identities of deduced amino acid sequences were 44.4% (E. sennetsu versus E. muris) to 99.5% (HGE agent versus E. equi), whereas the homology range of 16S rRNA genes was 83.5% (E. risticii versus the Ehrlichia sp. detected from I. ovatus) to 99.9% (HGE agent, E. equi, and E. phagocytophila). The architecture of the phylogenetic trees constructed by gltA nucleotide sequences or amino acid sequences was similar to that derived from the 16S rRNA gene sequences but showed more-significant bootstrap values. Based upon the alignment analysis of the ehrlichial gltA sequences, two sets of primers were designed to amplify tick-borne Ehrlichia and Neorickettsia genogroup Ehrlichia (N. helminthoeca, E. sennetsu, and E. risticii), respectively. Tick-borne Ehrlichia species were specifically identified by restriction fragment length polymorphism (RFLP) patterns of AcsI and XhoI with the exception of E. muris and the very closely related ehrlichia derived from I. ovatus for which sequence analysis of the PCR product is needed. Similarly, Neorickettsia genogroup Ehrlichia species were specifically identified by
Single-molecule protein sequencing through fingerprinting: computational assessment

Science.gov (United States)

Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin

2015-10-01

Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.
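The fingerprinting idea in the abstract above can be illustrated with a minimal sketch: each protein sequence is reduced to the ordered string of two labeled residue types, and the reduced strings are checked for collisions within a database. The choice of cysteine (C) and lysine (K) as the labeled residues, the function names, and the toy sequences are assumptions made for illustration only; the abstract does not specify them.

```python
from collections import defaultdict


def fingerprint(sequence, labeled=("C", "K")):
    """Keep only the labeled residue types, preserving their order."""
    return "".join(res for res in sequence if res in labeled)


def ambiguous_fingerprints(database, labeled=("C", "K")):
    """Group protein names by fingerprint; return only groups that collide."""
    groups = defaultdict(list)
    for name, seq in database.items():
        groups[fingerprint(seq, labeled)].append(name)
    return {fp: names for fp, names in groups.items() if len(names) > 1}


# Hypothetical toy sequences, not real proteins.
toy_db = {
    "protA": "MKTCAYIAKQR",
    "protB": "MCKLVCSAKE",
    "protC": "GKSCPLAKTT",
}

if __name__ == "__main__":
    for name, seq in toy_db.items():
        print(name, fingerprint(seq))
    print("collisions:", ambiguous_fingerprints(toy_db))
```

In this toy database two of the three entries share a fingerprint, which shows why such an assessment has to be run against a whole proteome before concluding that two residue types suffice.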
Single-molecule protein sequencing through fingerprinting: computational assessment

International Nuclear Information System (INIS)

Yao, Yao; Docter, Margreet; Van Ginkel, Jetty; Joo, Chirlmin; De Ridder, Dick

2015-01-01

Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences. (paper)
Regulatory sequence of cupin family gene

Science.gov (United States)

Hood, Elizabeth; Teoh, Thomas

2017-07-25

This invention is in the field of plant biology and agriculture and relates to novel seed specific promoter regions. The present invention further provides methods of producing proteins and other products of interest and methods of controlling expression of nucleic acid sequences of interest using the seed specific promoter regions.
Cloning and sequence of the human adrenodoxin reductase gene

International Nuclear Information System (INIS)

Lin, Dong; Shi, Y.; Miller, W.L.

1990-01-01

Adrenodoxin reductase is a flavoprotein mediating electron transport to all mitochondrial forms of cytochrome P450. The authors cloned the human adrenodoxin reductase gene and characterized it by restriction endonuclease mapping and DNA sequencing. The entire gene is approximately 12 kilobases long and consists of 12 exons. The first exon encodes the first 26 of the 32 amino acids of the signal peptide, and the second exon encodes the remainder of the signal peptide and the apparent FAD binding site. The remaining 10 exons are clustered in a region of only 4.3 kilobases, separated from the first two exons by a large intron of about 5.6 kilobases. Two forms of human adrenodoxin reductase mRNA, differing by the presence or absence of 18 bases in the middle of the sequence, arise from alternate splicing at the 5' end of exon 7. This alternately spliced region is directly adjacent to the NADPH binding site, which is entirely contained in exon 6. The immediate 5' flanking region lacks TATA and CAAT boxes; however, this region is rich in G+C and contains six copies of the sequence GGGCGGG, resembling promoter sequences of housekeeping genes. RNase protection experiments show that transcription is initiated from multiple sites in the 5' flanking region, located about 21-91 base pairs upstream from the AUG translational initiation codon.
Recent Developments in Peptide-Based Nucleic Acid Delivery

Directory of Open Access Journals (Sweden)

Tobias Restle

2008-07-01

Despite the fact that non-viral nucleic acid delivery systems are generally considered to be less efficient than viral vectors, they have gained much interest in recent years due to their superior safety profile compared to their viral counterpart. Among these synthetic vectors are cationic polymers, branched dendrimers, cationic liposomes and cell-penetrating peptides (CPPs). The latter represent an assortment of fairly unrelated sequences essentially characterised by a high content of basic amino acids and a length of 10-30 residues. CPPs are capable of mediating the cellular uptake of hydrophilic macromolecules like peptides and nucleic acids (e.g. siRNAs, aptamers and antisense oligonucleotides), which are internalised by cells at a very low rate when applied alone. Up to now, numerous sequences have been reported to show cell-penetrating properties and many of them have been used to successfully transport a variety of different cargos into mammalian cells. In recent years, it has become apparent that endocytosis is a major route of internalisation even though the mechanisms underlying the cellular translocation of CPPs are poorly understood and still subject to controversial discussions. In this review, we will summarise the latest developments in peptide-based cellular delivery of nucleic acid cargos. We will discuss different mechanisms of entry, the intracellular fate of the cargo, correlation studies of uptake versus biological activity of the cargo as well as technical problems and pitfalls.

Cloning and cDNA sequence of the dihydrolipoamide dehydrogenase component of human α-ketoacid dehydrogenase complexes

International Nuclear Information System (INIS)

Pons, G.; Raefsky-Estrin, C.; Carothers, D.J.; Pepin, R.A.; Javed, A.A.; Jesse, B.W.; Ganapathi, M.K.; Samols, D.; Patel, M.S.

1988-01-01

cDNA clones comprising the entire coding region for human dihydrolipoamide dehydrogenase have been isolated from a human liver cDNA library. The cDNA sequence of the largest clone consisted of 2082 base pairs and contained a 1527-base open reading frame that encodes a precursor dihydrolipoamide dehydrogenase of 509 amino acid residues. The first 35 amino acid residues of the open reading frame probably correspond to a typical mitochondrial import leader sequence. The predicted amino acid sequence of the mature protein, starting at residue number 36 of the open reading frame, is almost identical (>98% homology) with the known partial amino acid sequence of the pig heart dihydrolipoamide dehydrogenase. The cDNA clone also contains a 3' untranslated region of 505 bases with an unusual polyadenylylation signal (TATAAA) and a short poly(A) tract. By blot-hybridization analysis with the cDNA as probe, two mRNAs, 2.2 and 2.4 kilobases in size, have been detected in human tissues and fibroblasts, whereas only one mRNA (2.4 kilobases) was detected in rat tissues.
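The lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below is only a consistency check under stated assumptions: the function and key names are hypothetical, and the abstract does not say whether the stop codon is counted within the 505-base 3' untranslated region, so the "remaining" figure covers the 5' untranslated region plus any bases not otherwise accounted for.

```python
def cdna_length_budget(total_bp, orf_bp, utr3_bp, leader_residues):
    """Relate ORF length to residue counts and the unassigned cDNA bases."""
    residues, remainder = divmod(orf_bp, 3)
    if remainder:
        raise ValueError("ORF length is not a multiple of three")
    return {
        "precursor_residues": residues,
        "mature_residues": residues - leader_residues,
        "remaining_bp": total_bp - orf_bp - utr3_bp,
    }


if __name__ == "__main__":
    # Figures taken from the abstract: 2082-bp clone, 1527-base ORF,
    # 505-base 3' untranslated region, 35-residue leader sequence.
    print(cdna_length_budget(2082, 1527, 505, 35))
```

The 1527-base reading frame corresponds to exactly 509 codons, matching the stated precursor length, and removing the 35-residue leader leaves 474 residues for the mature protein.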
Complete amino acid sequence of the human alpha 5 (IV) collagen chain and identification of a single-base mutation in exon 23 converting glycine 521 in the collagenous domain to cysteine in an Alport syndrome patient

DEFF Research Database (Denmark)

Zhou, J; Hertz, Jens Michael; Leinonen, A

1992-01-01

We have generated and characterized cDNA clones providing the complete amino acid sequence of the human type IV collagen chain whose gene has been shown to be mutated in X chromosome-linked Alport syndrome. The entire translation product has 1,685 amino acid residues. There is a 26-residue signal...
Nucleotide sequence of a cDNA coding for the barley seed protein CMa: an inhibitor of insect α-amylase

DEFF Research Database (Denmark)

Rasmussen, Søren Kjærsgård; Johansson, A.

1992-01-01

The primary structure of the insect alpha-amylase inhibitor CMa of barley seeds was deduced from a full-length cDNA clone pc43F6. Analysis of RNA from barley endosperm shows high levels 15 and 20 days after flowering. The cDNA predicts an amino acid sequence of 119 residues preceded by a signal...... peptide of 25 amino acids. Ala and Leu account for 55% of the signal peptide. CMa is 60-85% identical with alpha-amylase inhibitors of wheat, but shows less than 50% identity to trypsin inhibitors of barley and wheat. The 10 Cys residues are located in identical positions compared to the cereal inhibitor...
Slag Treatment Followed by Acid Leaching as a Route to Solar-Grade Silicon

NARCIS (Netherlands)

Meteleva-Fischer, Y.V.; Yang, Y.; Boom, R.; Kraaijveld, B.; Kuntzel, H.

2012-01-01

Refining of metallurgical-grade silicon was studied using a process sequence of slag treatment, controlled cooling, and acid leaching. A slag of the Na2O-CaO-SiO2 system was used. The microstructure of grain boundaries in the treated silicon showed enhanced segregation of impurities, and the
Site-directed mutagenesis and molecular modelling studies show the role of Asp82 and cysteines in rat acylase 1, a member of the M20 family

International Nuclear Information System (INIS)

Herga, Sameh; Brutus, Alexandre; Vitale, Rosa Maria; Miche, Helene; Perrier, Josette; Puigserver, Antoine; Scaloni, Andrea; Giardina, Thierry

2005-01-01

Acylase 1 from rat kidney catalyzes the hydrolysis of acyl-amino acids. Sequence alignment has shown that this enzyme belongs to the metalloprotein family M20. Site-directed mutagenesis experiments led to the identification of one functionally important amino acid residue located near one of the zinc coordinating residues, which play a critical role in the enzymatic activity. The D82N- and D82E-substituted forms showed no significant activity and very low activity, respectively, along with a loss of zinc coordination. Molecular modelling investigations indicated a putative role of D82 in ensuring a proper protonation of catalytic histidine. In addition, none of the five cysteine residues present in the rat kidney acylase 1 sequence seemed involved in the catalytic process: the loss of activity induced by the C294A substitution was probably due to a conformational change in the 3D structure.
Panel-based whole exome sequencing identifies novel mutations in microphthalmia and anophthalmia patients showing complex Mendelian inheritance patterns.

Science.gov (United States)

Riera, Marina; Wert, Ana; Nieto, Isabel; Pomares, Esther

2017-11-01

Microphthalmia and anophthalmia (MA) are congenital eye abnormalities that show an extremely high clinical and genetic complexity. In this study, we evaluated the implementation of whole exome sequencing (WES) for the genetic analysis of MA patients. This approach was used to investigate three unrelated families in which previous single-gene analyses failed to identify the molecular cause. A total of 47 genes previously associated with nonsyndromic MA were included in our panel. WES was performed in one affected patient from each family using the AmpliSeq™ Exome technology and the Ion Proton™ platform. A novel heterozygous OTX2 missense mutation was identified in a patient showing bilateral anophthalmia who inherited the variant from a parent who was a carrier, but showed no sign of the condition. We also describe a new PAX6 missense variant in an autosomal-dominant pedigree affected by mild bilateral microphthalmia showing high intrafamilial variability, with germline mosaicism determined to be the most plausible molecular cause of the disease. Finally, a heterozygous missense mutation in RBP4 was found to be responsible in an isolated case of bilateral complex microphthalmia. This study highlights that panel-based WES is a reliable and effective strategy for the genetic diagnosis of MA. Furthermore, using this technique, the mutational spectrum of these diseases was broadened, with novel variants identified in each of the OTX2, PAX6, and RBP4 genes. Moreover, we report new cases of reduced penetrance, mosaicism, and variable phenotypic expressivity associated with MA, further demonstrating the heterogeneity of such disorders. © 2017 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.
What makes ribosome-mediated transcriptional attenuation sensitive to amino acid limitation?

Directory of Open Access Journals (Sweden)

Johan Elf

2005-06-01

Ribosome-mediated transcriptional attenuation mechanisms are commonly used to control amino acid biosynthetic operons in bacteria. The mRNA leader of such an operon contains an open reading frame with "regulatory" codons, cognate to the amino acid that is synthesized by the enzymes encoded by the operon. When the amino acid is in short supply, translation of the regulatory codons is slow, which allows transcription to continue into the structural genes of the operon. When amino acid supply is in excess, translation of regulatory codons is rapid, which leads to termination of transcription. We use a discrete master equation approach to formulate a probabilistic model for the positioning of the RNA polymerase and the ribosome in the attenuator leader sequence. The model describes how the current rate of amino acid supply compared to the demand in protein synthesis (signal) determines the expression of the amino acid biosynthetic operon (response). The focus of our analysis is on the sensitivity of operon expression to a change in the amino acid supply. We show that attenuation of transcription can be hyper-sensitive for two main reasons. The first is that its response depends on the outcome of a race between two multi-step mechanisms with synchronized starts: transcription of the leader of the operon, and translation of its regulatory codons. The relative change in the probability that transcription is aborted (attenuated) can therefore be much larger than the relative change in the time it takes for the ribosome to read a regulatory codon. The second is that the general usage frequencies of codons of the type used in attenuation control are small. A small percentage decrease in the rate of supply of the controlled amino acid can therefore lead to a much larger percentage decrease in the rate of reading a regulatory codon. We show that high sensitivity further requires a particular choice of regulatory codon among several synonymous codons for the
Bias in phylogenetic reconstruction of vertebrate rhodopsin sequences.

Science.gov (United States)

Chang, B S; Campbell, D L

2000-08-01

Two spurious nodes were found in phylogenetic analyses of vertebrate rhodopsin sequences in comparison with well-established vertebrate relationships. These spurious reconstructions were well supported in bootstrap analyses and occurred independently of the method of phylogenetic analysis used (parsimony, distance, or likelihood). Use of this data set of vertebrate rhodopsin sequences allowed us to exploit established vertebrate relationships, as well as the considerable amount known about the molecular evolution of this gene, in order to identify important factors contributing to the spurious reconstructions. Simulation studies using parametric bootstrapping indicate that it is unlikely that the spurious nodes in the parsimony analyses are due to long branches or other topological effects. Rather, they appear to be due to base compositional bias at third positions, codon bias, and convergent evolution at nucleotide positions encoding the hydrophobic residues isoleucine, leucine, and valine. LogDet distance methods, as well as maximum-likelihood methods which allow for nonstationary changes in base composition, reduce but do not entirely eliminate support for the spurious resolutions. Inclusion of five additional rhodopsin sequences in the phylogenetic analyses largely corrected one of the spurious reconstructions while leaving the other unaffected. The additional sequences not only were more proximal to the corrected node, but were also found to have intermediate levels of base composition and codon bias as compared with neighboring sequences on the tree. This study shows that the spurious reconstructions can be corrected either by excluding third positions, as well as those encoding the amino acids Ile, Val, and Leu (which may not be ideal, as these sites can contain useful phylogenetic signal for other parts of the tree), or by the addition of sequences that reduce problems associated with convergent evolution.
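Base compositional bias at third codon positions, one of the factors named above, can be quantified with a short calculation. The following is a minimal sketch, assuming an in-frame coding sequence written with unambiguous A/C/G/T letters; the function name and the toy sequence are hypothetical and are not taken from the study.

```python
from collections import Counter


def third_position_composition(coding_seq):
    """Return the fraction of A, C, G and T at third codon positions."""
    seq = coding_seq.upper()
    usable = len(seq) - len(seq) % 3      # ignore a trailing partial codon
    thirds = seq[2:usable:3]              # every third base, starting at index 2
    if not thirds:
        raise ValueError("sequence contains no complete codon")
    counts = Counter(thirds)
    return {base: counts.get(base, 0) / len(thirds) for base in "ACGT"}


if __name__ == "__main__":
    # Toy in-frame sequence (codons ATG GCC AAG TTC), for illustration only.
    print(third_position_composition("ATGGCCAAGTTC"))
```

Comparing such profiles between taxa gives a quick indication of whether two sequences share a compositional skew that could pull them together on a tree regardless of ancestry.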
Differentiation of highly virulent strains of Streptococcus suis serotype 2 according to glutamate dehydrogenase electrophoretic and sequence type.

Science.gov (United States)

Kutz, Russell; Okwumabua, Ogi

2008-10-01

The glutamate dehydrogenase (GDH) enzymes of 19 Streptococcus suis serotype 2 strains, consisting of 18 swine isolates and 1 human clinical isolate from a geographically varied collection, were analyzed by activity staining on a nondenaturing gel. All seven (100%) of the highly virulent strains tested produced an electrophoretic type (ET) distinct from those of moderately virulent and nonvirulent strains. By PCR and nucleotide sequence determination, the gdh genes of the 19 strains and of 2 highly virulent strains involved in recent Chinese outbreaks yielded a 1,820-bp fragment containing an open reading frame of 1,344 nucleotides, which encodes a protein of 448 amino acid residues with a calculated molecular mass of approximately 49 kDa. The nucleotide sequences contained base pair differences, but most were silent. Cluster analysis of the deduced amino acid sequences separated the isolates into three groups. Group I (ETI) consisted of the seven highly virulent isolates and the two Chinese outbreak strains, containing Ala(299)-to-Ser, Glu(305)-to-Lys, and Glu(330)-to-Lys amino acid substitutions compared with groups II and III (ETII). Groups II and III consisted of moderately virulent and nonvirulent strains, which are separated from each other by Tyr(72)-to-Asp and Thr(296)-to-Ala substitutions. Gene exchange studies resulted in the change of ETI to ETII and vice versa. A spectrophotometric activity assay for GDH did not show significant differences between the groups. These results suggest that the GDH ETs and sequence types may serve as useful markers in predicting the pathogenic behavior of strains of this serotype and that the molecular basis for the observed differences in the ETs was amino acid substitutions and not deletion, insertion, or processing uniqueness.
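The reported protein size can be sanity-checked from the reading-frame length. The sketch below is an approximation under an explicit assumption: it uses an average residue mass of about 110 Da, a common rule of thumb that is not taken from the abstract, and the function names are hypothetical. An exact mass would require the actual sequence.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb value, an assumption


def residues_from_orf(orf_nucleotides):
    """Number of codons in an ORF whose length is a multiple of three."""
    if orf_nucleotides % 3:
        raise ValueError("ORF length is not a multiple of three")
    return orf_nucleotides // 3


def approx_mass_kda(n_residues, residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Rough molecular mass in kDa from the residue count."""
    return n_residues * residue_mass_da / 1000.0


if __name__ == "__main__":
    n = residues_from_orf(1344)           # ORF length from the abstract
    print(n, "residues, about", approx_mass_kda(n), "kDa")
```

With 1,344 nucleotides this gives 448 residues and roughly 49 kDa, consistent with the figures quoted in the abstract.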
spa Typing and Multilocus Sequence Typing Show Comparable Performance in a Macroepidemiologic Study of Staphylococcus aureus in the United States.

Science.gov (United States)

O'Hara, F Patrick; Suaya, Jose A; Ray, G Thomas; Baxter, Roger; Brown, Megan L; Mera, Robertino M; Close, Nicole M; Thomas, Elizabeth; Amrine-Madsen, Heather

2016-01-01

A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants.
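The abstract refers to indices of diversity without naming them. One index widely used to compare typing methods is Simpson's index of diversity, the probability that two isolates drawn at random without replacement belong to different types. The sketch below assumes that index; whether the study used exactly this form is not stated in the abstract, and the function name and type labels are hypothetical.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity: 1 - sum(n_j(n_j-1)) / (N(N-1))."""
    counts = list(Counter(type_labels).values())
    n_total = sum(counts)
    if n_total < 2:
        raise ValueError("need at least two isolates")
    same_type_pairs = sum(n * (n - 1) for n in counts)
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


if __name__ == "__main__":
    # Hypothetical typing results for four isolates.
    print(simpson_diversity(["typeA", "typeA", "typeB", "typeB"]))
```

A value near 1 means the method splits the collection into many small types, while a value near 0 means most isolates fall into a single type, so two methods can be compared on the same isolate set.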
Aligning protein sequence and analysing substitution pattern using ...

Indian Academy of Sciences (India)

Prakash

Aligning protein sequences using a score matrix has become a routine but valuable method in modern biological ..... the amino acids according to their substitution behaviour ...... which may cause great change (e.g. prolonging the helix) in.
Modeling bias and variation in the stochastic processes of small RNA sequencing.

Science.gov (United States)

Argyropoulos, Christos; Etheridge, Alton; Sakhanenko, Nikita; Galas, David

2017-06-20

The use of RNA-seq as the preferred method for the discovery and validation of small RNA biomarkers has been hindered by high quantitative variability and biased sequence counts. In this paper we develop a statistical model for sequence counts that accounts for ligase bias and stochastic variation in sequence counts. This model implies a linear quadratic relation between the mean and variance of sequence counts. Using a large number of sequencing datasets, we demonstrate how one can use the generalized additive models for location, scale and shape (GAMLSS) distributional regression framework to calculate and apply empirical correction factors for ligase bias. Bias correction could remove more than 40% of the bias for miRNAs. Empirical bias correction factors appear to be nearly constant over at least one and up to four orders of magnitude of total RNA input and independent of sample composition. Using synthetic mixes of known composition, we show that the GAMLSS approach can analyze differential expression with greater accuracy, higher sensitivity and specificity than six existing algorithms (DESeq2, edgeR, EBSeq, limma, DSS, voom) for the analysis of small RNA-seq data. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
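The linear-quadratic mean-variance relation mentioned above can be written as Var = a·μ + b·μ². The sketch below assumes that generic form, which includes the negative binomial case a = 1; the coefficient values and function names are hypothetical and are not the fitted values from the paper.

```python
def count_variance(mean, linear=1.0, quadratic=0.1):
    """Variance under an assumed linear-quadratic relation a*mu + b*mu**2."""
    return linear * mean + quadratic * mean ** 2


def squared_cv(mean, linear=1.0, quadratic=0.1):
    """Squared coefficient of variation, equal to a/mu + b."""
    return count_variance(mean, linear, quadratic) / mean ** 2


if __name__ == "__main__":
    for mu in (1, 10, 100, 1000, 10000):
        print(mu, count_variance(mu), round(squared_cv(mu), 4))
```

Under this form the squared coefficient of variation approaches the quadratic coefficient as counts grow, so relative variability stops shrinking with sequencing depth once the quadratic term dominates.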
Site-specific antibodies distinguish single amino acid substitutions in position 57 in HLA-DQ beta-chain alleles associated with insulin-dependent diabetes

DEFF Research Database (Denmark)

Atar, D; Dyrberg, T; Michelsen, Birgitte

1989-01-01

The HLA-DQ beta-chain gene shows a close association with susceptibility or resistance to autoimmune insulin-dependent diabetes mellitus (IDDM) and it has been suggested that the amino acid in position 57 may be of pathogenetic importance. To study the expression of the IDDM associated HLA-DQ beta......-chain alleles, we immunized rabbits with 12 to 13 amino acid long peptides representing HLA-DQw7 and -DQw8 allelic sequences, differing only by one amino acid in position 57 being aspartic acid (Asp) and alanine (Ala), respectively. Immunoblot analysis of lymphoblastoid cells showed that several antisera...
Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

Science.gov (United States)

2012-01-01

Background: The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results: We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence
Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.

Science.gov (United States)

Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P

1997-11-01

A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.
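The pairwise identity figures quoted above (79.3% to 99.5% at the nucleotide level) can be illustrated with a minimal sketch. The abstract does not say how gaps were treated or which software was used, so the gap handling below is an assumption, and the two short sequences are invented toy strings, not HGV data.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must already be aligned to the same length. Columns where
    either sequence has a gap ('-') are skipped; this is an assumed
    convention, not one stated in the abstract.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy 20-nucleotide sequences (hypothetical) that differ at two positions.
toy_a = "ATGGCTAGCTAGGCTAACGT"
toy_b = "ATGGCTAGTTAGGCTAACGA"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Applying such a function to every pair of isolates would give the distribution of identities from which a range like the one reported is read off.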
Sequence Classification - TMBETA-GENOME | LSDB Archive [Life Science Database Archive metadata]

Lifescience Database Archive (English)

Full Text Available ansmembrane helical proteins by applying statistical and machine learning methods to each amino acid sequenc.... Amino Acid Result of predicting β-barrel membrane protein with a statistical method using amino acid compo...sition. ( TMBETADISC-COMP ) Dipeptide Result of predicting β-barrel membrane protein with a statistic...ting β-barrel membrane protein with a statistical method using motifs. ( TMBETADISC-MOTIF ) SVM Result of pr
Analysis of S gene mutation of the hepatitis B virus in adult liver transplant recipients showing resistance to hepatitis B immunoglobulin therapy.

Science.gov (United States)

Park, G-C; Hwang, S; Ahn, C-S; Kim, K-H; Moon, D-B; Ha, T-Y; Song, G-W; Jung, D-H; Shin, Y W; Kim, S-H; Chang, K-H; Namgoong, J-M; Park, C-S; Park, H-W; Park, Y-H; Kang, S-H; Jung, B-H; Lee, S-G

2013-10-01

A considerable proportion of liver transplant recipients who receive hepatitis B immunoglobulin (HBIG) monotherapy for hepatitis B virus (HBV) prophylaxis develop HBIG resistance. In this study, we investigated the mutation patterns in the major hydrophilic region (MHR) of amino acid sequences 100 to 160. Using the gene sequence analyzer for amino acid sequences 0 to 226 in the S/pre-S region, we analyzed blood samples of 15 patients showing HBIG resistance after high-dose HBIG prophylaxis. Various mutations in the MHR were observed in 14/15 samples: Gly145Arg mutation in 8/13 Adr subtype and 1/2 Ayw subtype samples (60%). The next most common mutation was Gly165Trp in 8/13 Adr subtype but neither of 2 Ayw subtype samples (53.3%). Concurrent antiviral resistance was noted in 5 patients: lamivudine (n = 5) or entecavir (n = 3), but not adefovir, suggesting the occurrence of simultaneous antiviral cross-resistance. Two patients underwent retransplantation due to the progression of HBV infection despite vigorous antiviral therapy. At diagnosis of HBV recurrence, the mean HBV DNA load was 6.5 × 10^6 copies/mL; 4 patients showed paradoxical coexistence of anti-HBs and HBsAg. Currently, 2 subjects show low-level HBV DNA replication in peripheral blood, although the other 12 had no DNA replication after prolonged antiviral therapy. This study suggested that various mutations in the "a" determinant were associated with HBIG resistance. Since treatment failure to rescue antiviral therapy was often associated with delayed detection of HBV recurrence rather than concurrent antiviral resistance, frequent HBV surveillance using more sensitive screening tests, such as HBeAg and HBV DNA polymerase chain reaction assay, seems to be mandatory. Copyright © 2013 Elsevier Inc. All rights reserved.
Plasma bile acids show a positive correlation with body mass index and are negatively associated with cognitive restraint of eating in obese patients

Directory of Open Access Journals (Sweden)

Philip Prinz

2015-06-01

Full Text Available Bile acids may be involved in the regulation of food intake and energy metabolism. The aim of the study was to investigate the association of plasma bile acids with body mass index (BMI) and the possible involvement of circulating bile acids in the modulation of physical activity and eating behavior. Blood was obtained in a group of hospitalized patients with normal weight (BMI 18.5-25 kg/m2, underweight (anorexia nervosa, BMI 50 kg/m2, n=14-15/group and plasma bile acid concentrations assessed. Physical activity and plasma bile acids were measured in a group of patients with anorexia nervosa (BMI 14.6±0.3 kg/m2, n=43). Lastly, in a population of obese patients (BMI 48.5±0.9 kg/m2, n=85), psychometric parameters related to disordered eating and plasma bile acids were assessed. Plasma bile acids showed a positive correlation with BMI (r=0.26, p=0.03) in the population of patients with a broad range of BMI (9-85 kg/m2, n=74). No associations were observed between plasma bile acids and different parameters of physical activity in anorexic patients (p>0.05). Plasma bile acids were negatively correlated with cognitive restraint of eating (r=-0.30, p=0.008), while no associations were observed with other psychometric eating behavior-related parameters (p>0.05) in obese patients. In conclusion, these data may point towards a role of bile acids in the regulation of body weight. Since plasma bile acids are negatively correlated with the cognitive restraint of eating in obese patients, this may represent a compensatory adaptation to prevent further overeating.
Cloning and sequence analysis of serine proteinase of Gloydius ussuriensis venom gland

International Nuclear Information System (INIS)

Sun Dejun; Liu Shanshan; Yang Chunwei; Zhao Yizhuo; Chang Shufang; Yan Weiqun

2005-01-01

Objective: To construct a cDNA library using mRNA from the Gloydius ussuriensis (G. ussuriensis) venom gland, and to clone and analyze the serine proteinase gene from the cDNA library. Methods: Total RNA was isolated from the venom gland of G. ussuriensis, and mRNA was purified using an mRNA isolation kit. The full-length cDNA was synthesized by means of the SMART cDNA synthesis strategy and amplified by long-distance PCR; the cDNA was then cloned into the vector pBluescript-SK. The recombinant cDNA was transformed into E. coli DH5α. The cDNA of the serine proteinase gene in the venom gland of G. ussuriensis was detected and amplified using in situ hybridization. The cDNA fragment was inserted into the pGEMT vector and cloned, and its nucleotide sequence was determined. Results: The capacity of the venom gland cDNA library was above 2.3 × 10^6. The open reading frame of the cloned cDNA was composed of 702 nucleotides and coded for a protein pre-zymogen of 234 amino acids. It contained 12 cysteine residues. The sequence analysis indicated that the deduced amino acid sequence of the cDNA fragment shared high identity with the thrombin-like enzyme genes of other snakes in GenBank. The query sequence exhibited strong amino acid sequence homology of 85% to the serine protease of T. gramineus, thrombin-like serine proteinase I of D. acutus and serine protease catroxase II of C. atrox. Based on the amino acid sequences of other thrombin-like enzymes, the catalytic residues and disulfide bridges of this thrombin-like enzyme were deduced as follows: catalytic residues, His41, Asp86, Ser180; and six disulfide bridges, Cys7-Cys139, Cys26-Cys42, Cys74-Cys232, Cys118-Cys186, Cys150-Cys165, Cys176-Cys201. Conclusion: The capacity of the venom gland cDNA library is above 2.3 × 10^6, exceeding the 10^5 capacity level. The constructed cDNA library of the G. ussuriensis venom gland would be a helpful platform for detecting new target genes and for further gene manipulation. The cloned serine
Pinus pinaster seedlings and their fungal symbionts show high plasticity in phosphorus acquisition in acidic soils.

Science.gov (United States)

Ali, M A; Louche, J; Legname, E; Duchemin, M; Plassard, C

2009-12-01

Young seedlings of maritime pine (Pinus pinaster Soland in Aït.) were grown in rhizoboxes using intact spodosol soil samples from the southwest of France, in Landes of Gascogne, presenting a large variation of phosphorus (P) availability. Soils were collected from a 93-year-old unfertilized stand and a 13-year-old P. pinaster stand with regular annual fertilization of either only P or P and nitrogen (N). After 6 months of culture in controlled conditions, different morphotypes of ectomycorrhiza (ECM) were used for the measurements of acid phosphatase activity and molecular identification of fungal species using amplification of the ITS region. Total biomass, N and P contents were measured in roots and shoots of plants. Bicarbonate- and NaOH-available inorganic P (Pi), organic P (Po) and ergosterol concentrations were measured in bulk and rhizosphere soil. The results showed that bulk soil from the 93-year-old forest stand presented the highest Po levels, but relatively higher bicarbonate-extractable Pi levels compared to 13-year-old unfertilized stand. Fertilizers significantly increased the concentrations of inorganic P fractions in bulk soil. Ergosterol contents in rhizosphere soil were increased by fertilizer application. The dominant fungal species was Rhizopogon luteolus forming 66.6% of analysed ECM tips. Acid phosphatase activity was highly variable and varied inversely with bicarbonate-extractable Pi levels in the rhizosphere soil. Total P or total N in plants was linearly correlated with total plant biomass, but the slope was steep only between total P and biomass in fertilized soil samples. In spite of high phosphatase activity in ECM tips, P availability remained a limiting nutrient in soil samples from unfertilized stands. Nevertheless young P. pinaster seedlings showed a high plasticity for biomass production at low P availability in soils.

[Application of Fourier transform infrared spectra coupled with two-dimensional correlation analysis to study the evolution of humic acids during composting].

Science.gov (United States)

Bu, Gui-jun; Yu, Jing; Di, Hui-hui; Luo, Shi-jia; Zhou, Da-zhai; Xiao, Qiang

2015-02-01

The composition and structure of humic acids formed during composting have an important influence on the quality and maturity of compost. In order to explore their composition and evolution mechanism, municipal solid wastes were collected and composted, and humic and fulvic acids were extracted from the composted municipal solid wastes. Fourier transform infrared spectra and two-dimensional correlation analysis were then applied to study the composition and transformation of humic and fulvic acids during composting. The Fourier transform infrared spectra showed that the composition of humic acids was complex, with several absorbance peaks observed at 2917-2924, 2844-2852, 2549, 1662, 1622, 1566, 1454, 1398, 1351, 990-1063, 839 and 711 cm^-1. Compared to humic acids, the composition of fulvic acids was simple, and only three peaks were detected at 1725, 1637 and 990 cm^-1. The appearance of these peaks showed that both humic and fulvic acids comprised benzene structures originating from lignin, as well as polysaccharide. In addition, humic acids comprised a large number of aliphatic and protein components which were hardly detected in fulvic acids. Aliphatic, polysaccharide, protein and lignin components were all degraded during composting; however, the order of degradation differed between humic and fulvic acids. The two-dimensional correlation analysis showed that organic compounds in humic acids were degraded in the following sequence: aliphatic > protein > polysaccharide and lignin, while that in fulvic acids was as follows: protein > polysaccharide and aliphatic. A large number of carboxyl groups, alcohols and ethers were formed during the degradation process, and the carboxyl groups were transformed into carbonates. It can be concluded that Fourier transform infrared spectra coupled with two-dimensional correlation analysis can not only analyze the functional group composition of humic substances, but also effectively characterize the degradation sequence of these
Prevalence of Plasmodium spp. in malaria asymptomatic African migrants assessed by nucleic acid sequence based amplification

Directory of Open Access Journals (Sweden)

Schallig Henk DFH

2009-01-01

Full Text Available Abstract Background Malaria is one of the most important infectious diseases in the world. Although most cases are found in the tropical regions of Africa, Asia, and Central and South America, Europe has seen a significant increase in the number of cases imported into non-endemic countries, in particular due to the higher mobility in today's society. Methods The prevalence of a possible asymptomatic infection with Plasmodium species was assessed using Nucleic Acid Sequence Based Amplification (NASBA) assays on clinical samples collected from 195 study cases who had no clinical signs related to malaria and had come from sub-Saharan African regions to Southern Italy. In addition, baseline demographic, clinical and socio-economic information was collected from study participants, who also underwent a full clinical examination. Results Sixty-two study subjects (31.8%) were found positive for Plasmodium using a pan-Plasmodium-specific NASBA, which can detect all four Plasmodium species causing human disease, based on the small subunit 18S rRNA gene (18S NASBA). Twenty-four samples (38%) of the 62 18S NASBA positive study cases were found positive with a Pfs25 mRNA NASBA, which is specific for the detection of gametocytes of Plasmodium falciparum. A statistically significant association was observed between 18S NASBA positivity and splenomegaly, hepatomegaly, leukopaenia and country of origin. Conclusion This study showed that a substantial proportion of people originating from malaria endemic countries harbor malaria parasites in their blood. If transmission conditions are available, they could potentially be a reservoir. Therefore, health authorities should pay special attention to the health of this potential risk group and aim to improve their health conditions.
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites*

Science.gov (United States)

Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

2012-01-01

To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi’an, China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced and found to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi’an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%–99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, in accordance with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites. PMID:23024043
Detection of M-Sequences from Spike Sequence in Neuronal Networks

Directory of Open Access Journals (Sweden)

Yoshi Nishitani

2012-01-01

Full Text Available In circuit theory, it is well known that a linear feedback shift register (LFSR) circuit generates pseudorandom bit sequences (PRBS), including an M-sequence with the maximum period length. In this study, we tried to detect M-sequences, known as pseudorandom sequences generated by the LFSR circuit, from time series patterns of stimulated action potentials. Stimulated action potentials were recorded from dissociated cultures of hippocampal neurons grown on a multielectrode array. We could find several M-sequences from a 3-stage LFSR circuit (M3). These results show the possibility of assembling LFSR circuits or their equivalents in a neuronal network. However, since the M3 pattern was composed of only four spike intervals, the possibility of an accidental detection was not zero. We therefore detected M-sequences from random spike sequences which were not generated from an LFSR circuit and compared the result with the number of M-sequences from the originally observed raster data. As a result, a significant difference was confirmed: a greater number of “0–1” reversed 3-stage M-sequences occurred than would have been detected by accident. This result suggests that some LFSR equivalent circuits are assembled in neuronal networks.
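As a rough illustration of what a 3-stage M-sequence looks like, the sketch below simulates a small Fibonacci-style LFSR. The tap positions, the seed and the function names are assumptions chosen for the example; the abstract does not specify the register configuration the authors searched for. A maximal-length n-stage register repeats every 2^n - 1 steps, so a 3-stage one has period 7.

```python
def lfsr_bits(taps, seed, count):
    """Return `count` output bits of a Fibonacci LFSR.

    taps -- 1-based stage numbers XORed together to form the feedback bit
    seed -- initial register contents, stage 1 first; must not be all zeros
    """
    state = list(seed)
    if not any(state):
        raise ValueError("an all-zero seed locks the register at zero")
    bits = []
    for _ in range(count):
        bits.append(state[-1])           # the output is the last stage
        feedback = 0
        for stage in taps:
            feedback ^= state[stage - 1]
        state = [feedback] + state[:-1]  # shift every stage by one
    return bits


def period(bits):
    """Smallest p with bits[i] == bits[i + p] for every available i."""
    for p in range(1, len(bits)):
        if all(bits[i] == bits[i + p] for i in range(len(bits) - p)):
            return p
    return len(bits)


# Hypothetical 3-stage register with feedback taken from stages 3 and 2.
m3 = lfsr_bits(taps=(3, 2), seed=(1, 0, 0), count=14)
print(m3[:7])      # one full period of the sequence
print(period(m3))  # 2**3 - 1 = 7 for a maximal-length 3-stage register
```

Because one period is only seven bits long, short stretches of unrelated data can match it by chance, which is why the study compares its detections against random spike sequences.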
Cloning and sequencing of growth hormone gene of Iranian Lori Bakhtiari sheep

Directory of Open Access Journals (Sweden)

M Dayani-Nia

2010-05-01

Full Text Available Growth hormone (GH) is a peptide hormone that stimulates growth and cell reproduction in humans and animals. It is a 191-amino acid, single chain polypeptide hormone which is synthesized, stored, and secreted by the somatotroph cells within the lateral wings of the anterior pituitary gland. The goal of this research was to clone and sequence the growth hormone of the Lori Bakhtiari sheep breed in Iran. For this purpose, RNA was extracted from the pituitary gland of freshly slaughtered sheep and growth hormone cDNA was produced. The T/A cloning technique was used to clone the growth hormone cDNA, and the synthesized construct was then transferred into E. coli as the host. Once the correct recombinants had been confirmed by colony PCR or restriction enzyme digestion, sequencing was done. The sequencing results showed that the length of the sheep growth hormone cDNA was 690 bp. Comparison of the growth hormone sequence in the synthesized construct with those recorded in GenBank (NCBI BLAST) indicated high degrees of similarity between Iranian native sheep and other sheep breeds of the world.
Nucleotide sequences of two cellulase genes from alkalophilic Bacillus sp. strain N-4 and their strong homology.

OpenAIRE

Fukumori, F; Sashihara, N; Kudo, T; Horikoshi, K

1986-01-01

Two genes for cellulases of alkalophilic Bacillus sp. strain N-4 (ATCC 21833) have been sequenced. From the DNA sequences the cellulases encoded in the plasmids pNK1 and pNK2 consist of 488 and 409 amino acids, respectively. The DNA and protein sequences of the pNK1-encoded cellulase are related to those of the pNK2-encoded cellulase. The pNK2-encoded cellulase lacks the direct repeat sequence of a stretch of 60 amino acids near the C-terminal end of the pNK1-encoded cellulase. The duplicatio...
Human tissue factor: cDNA sequence and chromosome localization of the gene

International Nuclear Information System (INIS)

Scarpati, E.M.; Wen, D.; Broze, G.J. Jr.; Miletich, J.P.; Flandermeyer, R.R.; Siegel, N.R.; Sadler, J.E.

1987-01-01

A human placenta cDNA library in λgt11 was screened for the expression of tissue factor antigens with rabbit polyclonal anti-human tissue factor immunoglobulin G. Among 4 million recombinant clones screened, one positive, λHTF8, expressed a protein that shared epitopes with authentic human brain tissue factor. The 1.1-kilobase cDNA insert of λHTF8 encoded a peptide that contained the amino-terminal protein sequence of human brain tissue factor. Northern blotting identified a major mRNA species of 2.2 kilobases and a minor species of ∼ 3.2 kilobases in poly(A)+ RNA of placenta. Only 2.2-kilobase mRNA was detected in human brain and in the human monocytic U937 cell line. In U937 cells, the quantity of tissue factor mRNA was increased several fold by exposure of the cells to phorbol 12-myristate 13-acetate. Additional cDNA clones were selected by hybridization with the cDNA insert of λHTF8. These overlapping isolates span 2177 base pairs of the tissue factor cDNA sequence that includes a 5'-noncoding region of 75 base pairs, an open reading frame of 885 base pairs, a stop codon, a 3'-noncoding region of 1141 base pairs, and a poly(A) tail. The open reading frame encodes a 33-kilodalton protein of 295 amino acids. The predicted sequence includes a signal peptide of 32 or 34 amino acids, a probable extracellular factor VII binding domain of 217 or 219 amino acids, a transmembrane segment of 23 amino acids, and a cytoplasmic tail of 21 amino acids. There are three potential glycosylation sites with the sequence Asn-X-Thr/Ser. The 3'-noncoding region contains an inverted Alu family repetitive sequence. The tissue factor gene was localized to chromosome 1 by hybridization of the cDNA insert of λHTF8 to flow-sorted human chromosomes
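The lengths quoted in this record can be cross-checked with simple arithmetic, sketched below. Two points are assumptions rather than statements from the abstract: that the longer signal peptide pairs with the shorter extracellular domain (the pairing that makes the totals agree), and that the base pairs left over after the listed regions belong to the poly(A) tail. The variable names are invented for the example.

```python
CODON_LENGTH = 3  # nucleotides per codon

# Figures taken from the abstract.
cdna_bp = 2177
five_prime_bp = 75
orf_bp = 885
stop_codon_bp = CODON_LENGTH
three_prime_bp = 1141

# An 885 bp open reading frame holds 885 / 3 codons, one per amino acid.
protein_aa = orf_bp // CODON_LENGTH

# Assumed pairing of the alternative lengths:
# (signal peptide, extracellular domain, transmembrane segment, cytoplasmic tail)
domain_layouts = {
    "32-residue signal peptide": (32, 219, 23, 21),
    "34-residue signal peptide": (34, 217, 23, 21),
}

# Whatever is not accounted for is assumed here to be the poly(A) tail.
unassigned_bp = cdna_bp - (five_prime_bp + orf_bp + stop_codon_bp + three_prime_bp)

print(f"ORF encodes {protein_aa} amino acids")
for label, lengths in domain_layouts.items():
    print(f"{label}: domains sum to {sum(lengths)} residues")
print(f"{unassigned_bp} bp remain for the poly(A) tail under this assumption")
```

Under these assumptions both domain layouts add up to the full 295-residue protein, so the alternative signal peptide lengths only move the boundary between the signal peptide and the extracellular domain.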
Campylobacter jejuni sequence types show remarkable spatial and temporal stability in Blackbirds

Directory of Open Access Journals (Sweden)

Petra Griekspoor

2015-12-01

Full Text Available Background: The zoonotic bacterium Campylobacter jejuni has a broad host range but is especially associated with birds, both domestic and wild. Earlier studies have indicated thrushes of the genus Turdus in Europe to be frequently colonized with C. jejuni, and predominately with host-associated specific genotypes. The European Blackbird Turdus merula has a large distribution in Europe, including some oceanic islands, and was also introduced to Australia by European immigrants in the 1850s. Methods: The host specificity and temporal stability of European Blackbird C. jejuni was investigated with multilocus sequence typing in a set of isolates collected from Sweden, Australia, and The Azores. Results: Remarkably, we found that the Swedish, Australian, and Azorean isolates were genetically highly similar, despite extensive spatial and temporal isolation. This indicates adaptation, exquisite specificity, and stability in time for European Blackbirds, which is in sharp contrast with the high levels of recombination and mutation found in poultry-related C. jejuni genotypes. Conclusion: The maintenance of host-specific signals in spatially and temporally separated C. jejuni populations suggests the existence of strong purifying selection for this bacterium in European Blackbirds.
Repdigits in k-Lucas sequences

Indian Academy of Sciences (India)

57(2) 2000 243-254) proved that 11 is the largest number with only one distinct digit (the so-called repdigit) in the sequence (L_n^(2))_n. In this paper, we address a similar problem in the family of k-Lucas sequences. We also show that the k-Lucas sequences have similar properties to those of k-Fibonacci sequences ...
SPiCE : A web-based tool for sequence-based protein classification and exploration

NARCIS (Netherlands)

Van den Berg, B.A.; Reinders, M.J.; Roubos, J.A.; De Ridder, D.

2014-01-01

Background Amino acid sequences and features extracted from such sequences have been used to predict many protein properties, such as subcellular localization or solubility, using classifier algorithms. Although software tools are available for both feature extraction and classifier construction,
Primary structure of human pancreatic protease E determined by sequence analysis of the cloned mRNA

International Nuclear Information System (INIS)

Shen, W.; Fletcher, T.S.; Largman, C.

1987-01-01

Although protease E was isolated from human pancreas over 10 years ago, its amino acid sequence and relationship to the elastases have not been established. The authors report the isolation of a cDNA clone for human pancreatic protease E and determination of the nucleic acid sequence coding for the protein. The deduced amino acid sequence contains all of the features common to serine proteases. The substrate binding region is highly homologous to those of porcine and rat elastases 1, explaining the similar specificity for alanine reported for protease E and these elastases. However, the amino acid sequence outside the substrate binding region is less than 50% conserved, and there is a striking difference in the overall net charge for protease E (6-) and elastases 1 (8+). These findings confirm that protease E is a new member of the serine protease family. They have attempted to identify amino acid residues important for the interaction between elastases and elastin by examining the amino acid sequence differences between elastases and protease E. In addition to the large number of surface charge changes which are outside the substrate binding region, there are several changes which might be crucial for elastolysis: Leu-73/Arg-73; Arg-217A/Ala-217A; Arg-65A/Gln-65A; and the presence of two new cysteine residues (Cys-98 and Cys-99B) which computer modeling studies predict could form a new disulfide bond, not previously observed for serine proteases. They also present evidence which suggests that human pancreas does not synthesize a basic, alanine-specific elastase similar to porcine elastase 1
Application of Next-generation Sequencing in Clinical Molecular Diagnostics

Directory of Open Access Journals (Sweden)

Morteza Seifi

2017-05-01

Full Text Available ABSTRACT Next-generation sequencing (NGS) is a catch-all term used to describe several different modern sequencing technologies that allow nucleic acids to be sequenced much more rapidly and cheaply than with the formerly used Sanger sequencing, and that as such have revolutionized the study of molecular biology and genomics with excellent resolution and accuracy. Over the past years, many companies and academic institutions have continued to make technological advances that expand NGS applications from research to the clinic. In this review, the performance and technical features of current NGS platforms are described. Furthermore, advances in applying NGS technologies to clinical molecular diagnostics are emphasized. General advantages and disadvantages of each sequencing system are summarized and compared to guide the selection of NGS platforms for specific research aims.
Retention of nucleic acids in ion-pair reversed-phase high-performance liquid chromatography depends not only on base composition but also on base sequence.

Science.gov (United States)

Qiao, Jun-Qin; Liang, Chao; Wei, Lan-Chun; Cao, Zhao-Ming; Lian, Hong-Zhen

2016-12-01

Studies of nucleic acid retention in ion-pair reversed-phase high-performance liquid chromatography have mainly focused on size dependence; other factors influencing retention behavior have not been comprehensively clarified to date. In the present work, the retention behaviors of oligonucleotides and double-stranded DNAs were investigated on a silica-based C18 stationary phase by ion-pair reversed-phase high-performance liquid chromatography. It was found that the retention of oligonucleotides was influenced by base composition and base sequence as well as size, and that oligonucleotides prone to self-dimerization have weaker retention than those not prone to self-dimerization but with the same base composition. However, homo-oligonucleotides are suitable for size-dependent separation as a special case of oligonucleotides. For double-stranded DNAs, the retention is also influenced by base composition and base sequence, as well as size. This may be attributed to the interaction of exposed bases in major or minor grooves with the hydrophobic alkyl chains of the stationary phase. In addition, no specific influence of guanine and cytosine content on the retention of double-stranded DNAs was confirmed. Notably, the space effect resulting from the stereostructure of nucleic acids also influences the retention behavior in ion-pair reversed-phase high-performance liquid chromatography. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Comparison of cDNA-derived protein sequences of the human fibronectin and vitronectin receptor α-subunits and platelet glycoprotein IIb

International Nuclear Information System (INIS)

Fitzgerald, L.A.; Poncz, M.; Steiner, B.; Rall, S.C. Jr.; Bennett, J.S.; Phillips, D.R.

1987-01-01

The fibronectin receptor (FnR), the vitronectin receptor (VnR), and the platelet membrane glycoprotein (GP) IIb-IIIa complex are members of a family of cell adhesion receptors, which consist of noncovalently associated α- and β-subunits. The present study was designed to compare the cDNA-derived protein sequences of the α-subunits of human FnR, VnR, and platelet GP IIb. cDNA clones for the α-subunit of the FnR (FnR/sub α/) were obtained from a human umbilical vein endothelial (HUVE) cell library by using an oligonucleotide probe designed from a peptide sequence of platelet GP IIb. cDNA clones for platelet GP IIb were isolated from a cDNA expression library of human erythroleukemia cells by using antibodies. cDNA clones of the VnR α-subunit (VnR/sub α/) were obtained from the HUVE cell library by using an oligonucleotide probe from the partial cDNA sequence for the VnR/sub α/. Translation of these sequences showed that the FnR/sub α/, the VnR/sub α/, and GP IIb are composed of disulfide-linked large (858-871 amino acids) and small (137-158 amino acids) chains that are posttranslationally processed from a single mRNA. A single hydrophobic segment located near the carboxyl terminus of each small chain appears to be a transmembrane domain. The large chains appear to be entirely extracellular, and each contains four repeated putative Ca2+-binding domains of about 30 amino acids that have sequence similarities to other Ca2+-binding proteins. The identity among the protein sequences of the three receptor α-subunits ranges from 36.1% to 44.5%, with the Ca2+-binding domains having the greatest homology. These proteins apparently evolved by a process of gene duplication
Purification, properties, and N-terminal amino acid sequence of homogeneous Escherichia coli 2-amino-3-ketobutyrate CoA ligase, a pyridoxal phosphate-dependent enzyme.

Science.gov (United States)

Mukherjee, J J; Dekker, E E

1987-10-25

Starting with 100 g (wet weight) of a mutant of Escherichia coli K-12 forced to grow on L-threonine as sole carbon source, we developed a 6-step procedure that provides 30-40 mg of homogeneous 2-amino-3-ketobutyrate CoA ligase (also called aminoacetone synthetase or synthase). This ligase, which catalyzes the cleavage/condensation reaction between 2-amino-3-ketobutyrate (the presumed product of the L-threonine dehydrogenase-catalyzed reaction) and glycine + acetyl-CoA, has an apparent molecular weight approximately equal to 85,000 and consists of two identical (or nearly identical) subunits with Mr = 42,000. Computer analysis of amino acid composition data, which gives the best fit nearest integer ratio for each residue, indicates a total of 387 amino acids/subunit with a calculated Mr = 42,093. Stepwise Edman degradation provided the N-terminal sequence of the first 21 amino acids. It is a pyridoxal phosphate-dependent enzyme since (a) several carbonyl reagents caused greater than 90% loss of activity, (b) dialysis against buffer containing hydroxylamine resulted in 89% loss of activity coincident with an 86% decrease in absorptivity at 428 nm, (c) incubation of the apoenzyme with 20 microM pyridoxal phosphate showed a parallel recovery (greater than 90%) of activity and 428-nm absorptivity, and (d) reduction of the holoenzyme with NaBH4 resulted in complete inactivation, disappearance of a new absorption maximum at 333 nm. Strict specificity for glycine is shown but acetyl-CoA (100%), n-propionyl-CoA (127%), or n-butyryl-CoA (16%) is utilized in the condensation reaction. Apparent Km values for acetyl-CoA, n-propionyl-CoA, and glycine are 59 microM, 80 microM, and 12 mM, respectively; the pH optimum = 7.5. Added divalent metal ions or sulfhydryl compounds inhibited catalysis of the condensation reaction.
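To give a feel for what the apparent Km values mean, the sketch below evaluates the single-substrate Michaelis-Menten expression v/Vmax = [S] / (Km + [S]). This is an assumption made for illustration: the ligase uses two substrates, and the abstract reports only apparent Km values without stating the kinetic model or the concentration of the fixed substrate. The function name and the example concentrations are hypothetical.

```python
def fractional_rate(substrate_um: float, km_um: float) -> float:
    """Return v / Vmax for simple Michaelis-Menten kinetics (both in µM)."""
    if substrate_um < 0:
        raise ValueError("substrate concentration cannot be negative")
    if km_um <= 0:
        raise ValueError("Km must be positive")
    return substrate_um / (km_um + substrate_um)


# Apparent Km values from the abstract, converted to µM (12 mM = 12,000 µM).
APPARENT_KM_UM = {
    "acetyl-CoA": 59.0,
    "n-propionyl-CoA": 80.0,
    "glycine": 12_000.0,
}

for substrate, km in APPARENT_KM_UM.items():
    at_km = fractional_rate(km, km)            # half-maximal by definition
    at_ten_km = fractional_rate(10 * km, km)   # close to saturation
    print(f"{substrate}: v/Vmax = {at_km:.2f} at Km, {at_ten_km:.2f} at 10 x Km")
```

Read this way, the 12 mM apparent Km for glycine implies that much higher glycine concentrations are needed to approach the maximal rate than for either acyl-CoA substrate.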
Sequence-selective targeting of duplex DNA by peptide nucleic acids

DEFF Research Database (Denmark)

Nielsen, Peter E

2010-01-01

Sequence-selective gene targeting constitutes an attractive drug-discovery approach for genetic therapy, with the aim of reducing or enhancing the activity of specific genes at the transcriptional level, or as part of a methodology for targeted gene repair. The pseudopeptide DNA mimic peptide...
Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

KAUST Repository

Sugumar, Thennarasu; Harishankar, M.; Dhinakar Raj, G.

2011-01-01

Full-length cDNA (435 bp) of the interleukin-3(IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

OpenAIRE

Sakoda, H; Imanaka, T

1992-01-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those cata...
Femtomolar Ln(III) affinity in peptide-based ligands containing unnatural chelating amino acids.

Science.gov (United States)

Niedźwiecka, Agnieszka; Cisnetti, Federico; Lebrun, Colette; Delangle, Pascale

2012-05-07

The incorporation of unnatural chelating amino acids in short peptide sequences leads to lanthanide-binding peptides with a higher stability than sequences built exclusively from natural residues. In particular, the hexadentate peptide P(22), which incorporates two unnatural amino acids Ada(2) with aminodiacetate chelating arms, showed picomolar affinity for Tb(3+). To design peptides with higher denticity, expected to show higher affinity for Ln(3+), we synthesized the novel unnatural amino acid Ed3a(2) which carries an ethylenediamine triacetate side-chain and affords a pentadentate coordination site. The synthesis of the derivative Fmoc-Ed3a(2)(tBu)(3)-OH, with appropriate protecting groups for direct use in the solid phase peptide synthesis (Fmoc strategy), is described. The two high denticity peptides P(HD2) (Ac-Trp-Ed3a(2)-Pro-Gly-Ada(2)-Gly-NH(2)) and P(HD5) (Ac-Trp-Ada(2)-Pro-Gly-Ed3a(2)-Gly-NH(2)) led to octadentate Tb(3+) complexes with femtomolar stability in water. The position of the high denticity amino acid Ed3a(2) in the hexapeptide sequence appears to be critical for the control of the metal complex speciation. Whereas P(HD5) promotes the formation of polymetallic species in excess of Ln(3+), P(HD2) forms exclusively the mononuclear complex. The octadentate coordination of Tb(3+) by both P(HD) leads to total dehydration of the metal ion in the mononuclear complexes with long luminescence lifetimes (>2 ms). Hence, we demonstrated that unnatural amino acids carrying polyaminocarboxylate side-chains are interesting building blocks to design high affinity Ln-binding peptides. In particular the novel peptide P(HD2) forms a unique octadentate Tb(3+) complex with femtomolar stability in water and an improvement of the luminescence properties with respect to the trisaquo TbP(22) complex by a factor of 4.
Isolation and amino acid sequence of a dehydratase acting on d-erythro-3-hydroxyaspartate from Pseudomonas sp. N99, and its application in the production of optically active 3-hydroxyaspartate.

Science.gov (United States)

Nagano, Hiroyuki; Shibano, Kana; Matsumoto, Yu; Yokota, Atsushi; Wada, Masaru

2017-06-01

An enzyme catalyzing the ammonia-lyase reaction for the conversion of d-erythro-3-hydroxyaspartate to oxaloacetate was purified from the cell-free extract of a soil-isolated bacterium, Pseudomonas sp. N99. The enzyme exhibited ammonia-lyase activity toward l-threo-3-hydroxyaspartate and d-erythro-3-hydroxyaspartate, but not toward other 3-hydroxyaspartate isomers. The deduced amino acid sequence of the enzyme, which belongs to the serine/threonine dehydratase family, shows similarity to the sequence of l-threo-3-hydroxyaspartate ammonia-lyase (EC 4.3.1.16) from Pseudomonas sp. T62 (74%) and Saccharomyces cerevisiae (64%) and serine racemase from Schizosaccharomyces pombe (65%). These results suggest that the enzyme is similar to l-threo-3-hydroxyaspartate ammonia-lyase from Pseudomonas sp. T62, which does not act on d-erythro-3-hydroxyaspartate. We then used the recombinant enzyme expressed in Escherichia coli to produce optically pure l-erythro-3-hydroxyaspartate and d-threo-3-hydroxyaspartate from the corresponding dl-racemic mixtures. The enzymatic resolution reported here is one of the simplest methods, and the first enzymatic method, for obtaining optically pure l-erythro-3-hydroxyaspartate.
