WorldWideScience

Sample records for acid sequence deduced

  1. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250...

  2. Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein.

    Science.gov (United States)

    Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J

    2015-01-01

    Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, its yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work of expressed sequence tags (ESTs) generation from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, to understand more about it this study was undertaken. The objective of this study was to elucidate P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of deduced protein. positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. Deduced PvAD cDNA and protein sequence was analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using bl2seq program, and tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. It's open reading frame encodes for a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of s-nitrosoglutathione reductase from an Arabidopsis species. The 1294 bp long PvAD cDNA encodes for 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted deduced PvAD protein's 3D structure reflects the analogy with the crystal structure of Arabidopsis thaliana s-nitrosoglutathione reductase. Further study is required

  3. Epitopes of human testis-specific lactate dehydrogenase deduced from a cDNA sequence

    International Nuclear Information System (INIS)

    Millan, J.L.; Driscoll, C.E.; LeVan, K.M.; Goldberg, E.

    1987-01-01

    The sequence and structure of human testis-specific L-lactate dehydrogenase [LDHC 4 , LDHX; (L)-lactate:NAD + oxidoreductase, EC 1.1.1.27] has been derived from analysis of a complementary DNA (cDNA) clone comprising the complete protein coding region of the enzyme. From the deduced amino acid sequence, human LDHC 4 is as different from rodent LDHC 4 (73% homology) as it is from human LDHA 4 (76% homology) and porcine LDHB 4 (68% homology). Subunit homologies are consistent with the conclusion that the LDHC gene arose by at least two independent duplication events. Furthermore, the lower degree of homology between mouse and human LDHC 4 and the appearance of this isozyme late in evolution suggests a higher rate of mutation in the mammalian LDHC genes than in the LDHA and -B genes. Comparison of exposed amino acid residues of discrete anti-genic determinants of mouse and human LDHC 4 reveals significant differences. Knowledge of the human LDHC 4 sequence will help design human-specific peptides useful in the development of a contraceptive vaccine

  4. Nucleotide sequence of a cDNA for branched chain acyltransferase with analysis of the deduced protein structure

    International Nuclear Information System (INIS)

    Hummel, K.B.; Litwer, S.; Bradford, A.P.; Aitken, A.; Danner, D.J.; Yeaman, S.J.

    1988-01-01

    Nucleotide sequence was determined for a 1.6-kilobase human cDNA putative for the branched chain acyltransferase protein of the branched chain α-ketoacid dehydrogenase complex. Translation of the sequence reveals an open reading frame encoding a 315-amino acid protein of molecular weight 35,759 followed by 560 bases of 3'-untranslated sequence. Three repeats of the polyadenylation signal hexamer ATTAAA are present prior to the polyadenylate tail. Within the open reading frame is a 10-amino acid fragment which matches exactly the amino acid sequence around the lipoate-lysine residue in bovine kidney branched chain acyltransferase, thus confirming the identity of the cDNA. Analysis of the deduced protein structure for the human branched chain acyltransferase revealed an organization into domains similar to that reported for the acyltransferase proteins of the pyruvate and α-ketoglutarate dehydrogenase complexes. This similarity in organization suggests that a more detailed analysis of the proteins will be required to explain the individual substrate and multienzyme complex specificity shown by these acyltransferases

  5. Deduced amino acid sequence of the small hydrophobic protein of US avian pneumovirus has greater identity with that of human metapneumovirus than those of non-US avian pneumoviruses.

    Science.gov (United States)

    Yunus, Abdul S; Govindarajan, Dhanasekaran; Huang, Zhuhui; Samal, Siba K

    2003-05-01

    We report here the nucleotide and deduced amino acid (aa) sequences of the small hydrophobic (SH) gene of the avian pneumovirus strain Colorado (APV/CO). The SH gene of APV/CO is 628 nucleotides in length from gene-start to gene-end. The longest ORF of the SH gene encoded a protein of 177 aas in length. Comparison of the deduced aa sequence of the SH protein of APV/CO with the corresponding published sequences of other members of genera metapneumovirus showed 28% identity with the newly discovered human metapneumovirus (hMPV), but no discernable identity with the APV subgroup A or B. Collectively, this data supports the hypothesis that: (i) APV/CO is distinct from European APV subgroups and belongs to the novel subgroup APV/C (APV/US); (ii) APV/CO is more closely related to hMPV, a mammalian metapneumovirus, than to either APV subgroup A or B. The SH gene of APV/CO was cloned using a genomic walk strategy which initiated cDNA synthesis from genomic RNA that traversed the genes in the order 3'-M-F-M2-SH-G-5', thus confirming that gene-order of APV/CO conforms in the genus Metapneumovirus. We also provide the sequences of transcription-signals and the M-F, F-M2, M2-SH and SH-G intergenic regions of APV/CO.

  6. The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

    OpenAIRE

    Haggarty, N W; Dunbar, B; Fothergill, L A

    1983-01-01

    The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important...

  7. The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

    Science.gov (United States)

    Haggarty, N W; Dunbar, B; Fothergill, L A

    1983-01-01

    The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356

  8. Molecular cloning and sequence analysis of complementary DNA encoding rat mammary gland medium-chain S-acyl fatty acid synthetase thio ester hydrolase

    International Nuclear Information System (INIS)

    Safford, R.; de Silva, J.; Lucas, C.

    1987-01-01

    Poly(A) + RNA from pregnant rat mammary glands was size-fractionated by sucrose gradient centrifugation, and fractions enriched in medium-chain S-acyl fatty acid synthetase thio ester hydrolase (MCH) were identified by in vitro translation and immunoprecipitation. A cDNA library was constructed, in pBR322, from enriched poly(A) + RNA and screened with two oligonucleotide probes deduced from rat MCH amino acid sequence data. Cross-hybridizing clones were isolated and found to contain cDNA inserts ranging from ∼ 1100 to 1550 base pairs (bp). A 1550-bp cDNA insert, from clone 43H09, was confirmed to encode MCH by hybrid-select translation/immunoprecipitation studies and by comparison of the amino acid sequence deduced from the DNA sequence of the clone to the amino acid sequence of the MCH peptides. Northern blot analysis revealed the size of the MCH mRNA to be 1500 nucleotides, and it is therefore concluded that the 1550-bp insert (including G x C tails) of clone 43H09 represents a full- or near-full-length copy of the MCH gene. The rat MCH sequence is the first reported sequence of a thioesterase from a mammalian source, but comparison of the deduced amino acid sequences of MCH and the recently published mallard duck medium-chain S-acyl fatty acid synthetase thioesterase reveals significant homology. In particular, a seven amino acid sequence containing the proposed active serine of the duck thioesterase is found to be perfectly conserved in rat MCH

  9. Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

    Science.gov (United States)

    Pietrowski, D; Förster, M

    2000-01-01

    The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).

  10. Molecular cloning of chicken metallothionein. Deduction of the complete amino acid sequence and analysis of expression using cloned cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Wei, D; Andrews, G K

    1988-01-25

    A cDNA library was constructed using RNA isolated from the livers of chickens which had been treated with zinc. This library was screened with a RNA probe complementary to mouse metallothionein-I (MT), and eight chicken MT cDNA clones were obtained. All of the cDNA clones contained nucleotide sequences homologous to regions of the longest (375 bp) cDNA clone. The latter contained an open reading frame of 189 bp, and the deduced amino acid sequence indicates a protein of 63 amino acids of which 20 are cysteine residues. Amino acid composition and partial amino acid sequence analyses of purified chicken MT protein agreed with the amino acid composition and sequence deduced from the cloned cDNA. Amino acid sequence comparison establish that chicken MT shares extensive homology with mammalian MTs. Southern blot analysis of chicken DNA indicates that the chicken MT gene is not a part of a large family of related sequences, but rather is likely to be a unique gene sequence. In the chicken liver, levels of chicken MT mRNA were rapidly induced by metals (Cd/sup 2 +/, Zn/sup 2 +/, Cu/sup 2 +/), glucocorticoids and lipopolysaccharide. MT mRNA was present in low levels in embryonic liver and increased to high levels during the first week after hatching before decreasing again to the basal levels found in adult liver. The results of this study establish that MT is highly conserved between birds and mammals and is regulated in the chicken by agents which also regulate expression of mammalian MT genes. However, in contrast to the mammals, the results suggest the existence of a single isoform of MT in the chicken.

  11. Cloning and sequencing of the bovine gastrin gene

    DEFF Research Database (Denmark)

    Lund, T; Rehfeld, J F; Olsen, Jørgen

    1989-01-01

    In order to deduce the primary structure of bovine preprogastrin we therefore sequenced a gastrin DNA clone isolated from a bovine liver cosmid library. Bovine preprogastrin comprises 104 amino acids and consists of a signal peptide, a 37 amino acid spacer-sequence, the gastrin-34 sequence followed...

  12. Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

    Science.gov (United States)

    Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

    1993-02-01

    A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.

  13. Sequence of human protamine 2 cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Domenjoud, L; Fronia, C; Uhde, F; Engel, W [Universitaet Goettingen (West Germany)

    1988-08-11

    The authors report the cloning and sequencing of a cDNA clone for human protamine 2 (hp2), isolated from a human testis cDNA library cloned in the vector {lambda}-gt11. A 66mer oligonucleotide, that corresponds to an amino acid sequence which is highly conserved between hp2 and mouse protamine 2 (mp2) served as hybridization probe. The homology between the amino acid sequence deduced from our cDNA and the published amino acid sequence for hp2 is 100%.

  14. Deduced sequences of the membrane fusion and attachment proteins of canine distemper viruses isolated from dogs and wild animals in Korea.

    Science.gov (United States)

    Bae, Chae-Wun; Lee, Joong-Bok; Park, Seung-Yong; Song, Chang-Seon; Lee, Nak-Hyung; Seo, Kun-Ho; Kang, Young-Sun; Park, Choi-Kyu; Choi, In-Soo

    2013-08-01

    Canine distemper virus (CDV) causes highly contagious respiratory, gastrointestinal, and neurological diseases in wild and domestic animal species. Despite a broad vaccination campaign, the disease is still a serious problem worldwide. In this study, six field CDV strains were isolated from three dogs, two raccoon dogs, and one badger in Korea. The full sequence of the genes encoding fusion (F) and hemagglutinin (H) proteins were compared with those of other CDVs including field and vaccine strains. The phylogenetic analysis for the F and H genes indicated that the two CDV strains isolated from dogs were most closely related to Chinese strains in the Asia-1 genotype. Another four strains were closely related to Japanese strains in the Asia-2 genotype. The six currently isolated strains shared 90.2-92.1% and 88.2-91.8% identities with eight commercial vaccine strains in their nucleotide and amino acid sequences of the F protein, respectively. They also showed 90.1-91.4% and 87.8-90.7% identities with the same vaccine strains in their nucleotide and deduced amino acid sequences of the H protein, respectively. Different N-linked glycosylation sites were identified in the F and H genes of the six isolates from the prototype vaccine strain Onderstepoort. Collectively, these results demonstrate that at least two different CDV genotypes currently exist in Korea. The considerable genetic differences between the vaccine strains and wild-type isolates would be a major factor of the incomplete protection of dogs from CDV infections.

  15. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    International Nuclear Information System (INIS)

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-01-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO 4 /PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene

  16. Mass spectrometric amino acid sequencing of a mixture of seed storage proteins (napin) from Brassica napus, products of a multigene family.

    OpenAIRE

    Gehrig, P M; Krzyzaniak, A; Barciszewski, J; Biemann, K

    1996-01-01

    The amino acid sequences of a number of closely related proteins ("napin") isolated from Brassica napus were determined by mass spectrometry without prior separation into individual components. Some of these proteins correspond to those previously deduced (napA, BngNAP1, and gNa), chiefly from DNA sequences. Others were found to differ to a varying extent (BngNAP1', BngNAP1A, BngNAP1B, BngNAP1C, gNa', and gNaA). The short chains of gNa and gNa' and of BngNAP1 and BngNAP1' differ by the replac...

  17. Complete cDNA sequence coding for human docking protein

    Energy Technology Data Exchange (ETDEWEB)

    Hortsch, M; Labeit, S; Meyer, D I

    1988-01-11

    Docking protein (DP, or SRP receptor) is a rough endoplasmic reticulum (ER)-associated protein essential for the targeting and translocation of nascent polypeptides across this membrane. It specifically interacts with a cytoplasmic ribonucleoprotein complex, the signal recognition particle (SRP). The nucleotide sequence of cDNA encoding the entire human DP and its deduced amino acid sequence are given.

  18. The cDNA sequence of a neutral horseradish peroxidase.

    Science.gov (United States)

    Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

    1991-02-16

    A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids which includes a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. Compared to the earlier described HRP-C this is three glycosylation sites less. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight of several thousands less than the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.

  19. Amino acid sequence of bovine muzzle epithelial desmocollin derived from cloned cDNA: a novel subtype of desmosomal cadherins.

    Science.gov (United States)

    Koch, P J; Goldschmidt, M D; Walsh, M J; Zimbelmann, R; Schmelz, M; Franke, W W

    1991-05-01

    Desmosomes are cell-type-specific intercellular junctions found in epithelium, myocardium and certain other tissues. They consist of assemblies of molecules involved in the adhesion of specific cell types and in the anchorage of cell-type-specific cytoskeletal elements, the intermediate-size filaments, to the plasma membrane. To explore the individual desmosomal components and their functions we have isolated DNA clones encoding the desmosomal glycoprotein, desmocollin, using antibodies and a cDNA expression library from bovine muzzle epithelium. The cDNA-deduced amino-acid sequence of desmocollin (presently we cannot decide to which of the two desmocollins, DC I or DC II, this clone relates) defines a polypeptide with a calculated molecular weight of 85,000, with a single candidate sequence of 24 amino acids sufficiently long for a transmembrane arrangement, and an extracellular aminoterminal portion of 561 amino acid residues, compared to a cytoplasmic part of only 176 amino acids. Amino acid sequence comparisons have revealed that desmocollin is highly homologous to members of the cadherin family of cell adhesion molecules, including the previously sequenced desmoglein, another desmosome-specific cadherin. Using riboprobes derived from cDNAs for Northern-blot analyses, we have identified an mRNA of approximately 6 kb in stratified epithelia such as muzzle epithelium and tongue mucosa but not in two epithelial cell culture lines containing desmosomes and desmoplakins. The difference may indicate drastic differences in mRNA concentration or the existence of cell-type-specific desmocollin subforms. The molecular topology of desmocollin(s) is discussed in relation to possible functions of the individual molecular domains.

  20. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    Science.gov (United States)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  1. Detection of nucleic acid sequences by invader-directed cleavage

    Science.gov (United States)

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  2. Purification and amino acid sequence of a bacteriocins produced by Lactobacillus salivarius K7 isolated from chicken intestine

    Directory of Open Access Journals (Sweden)

    Kenji Sonomoto

    2006-03-01

    Full Text Available A bacteriocin-producing strain, Lactobacillus K7, was isolated from a chicken intestine. The inhibitory activity was determined by spot-on-lawn technique. Identification of the strain was performed by morphological, biochemical (API 50 CH kit and molecular genetic (16S rDNA basis. Bacteriocin purification processes were carried out by amberlite adsorption, cation exchange and reverse-phase high perform- ance liquid chromatography. N-terminal amino acid sequences were performed by Edman degradation. Molecular mass was determined by electrospray-ionization (ESI mass spectrometry (MS. Lactobacillus K7 showed inhibitory activity against Lactobacillus sakei subsp. sakei JCM 1157T, Leuconostoc mesenteroides subsp. mesenteroides JCM 6124T and Bacillus coagulans JCM 2257T. This strain was identified as Lb. salivarius. The antimicrobial substance was destroyed by proteolytic enzymes, indicating its proteinaceous structure designated as a bacteriocin type. The purification of bacteriocin by amberlite adsorption, cation exchange, and reverse-phase chromatography resulted in only one single active peak, which was designated FK22. Molecular weight of this fraction was 4331.70 Da. By amino acid sequence, this peptide was homology to Abp 118 beta produced by Lb. salivarius UCC118. In addition, Lb. salivarius UCC118 produced 2-peptide bacteriocin, which was Abp 118 alpha and beta. Based on the partial amino acid sequences of Abp 118 beta, specific primers were designed from nucleotide sequences according to data from GenBank. The result showed that the deduced peptide was high homology to 2-peptide bacteriocin, Abp 118 alpha and beta.

  3. Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

    OpenAIRE

    Sakoda, H; Imanaka, T

    1992-01-01

    Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those cata...

  4. Catalytically important amino-acid residues of abalone alginate lyase HdAly assessed by site-directed mutagenesis

    OpenAIRE

    Yamamoto, Sayo; Sahara, Takehiko; Sato, Daisuke; Kawasaki, Kosei; Ohgiya, Satoru; Inoue, Akira; Ojima, Takao

    2008-01-01

    Alginate lyase is an enzyme that degrades alginate chains via β-elimination and has been used for the production of alginate oligosaccharides and protoplasts from brown algae. Previously, we deduced the amino-acid sequence of an abalone alginate lyase, HdAly, from its cDNA sequence and, through multiple amino-acid sequence alignment, found that several basic amino-acid residues were highly conserved among the polysaccharide-lyase family 14 (PL-14) enzymes including HdAly. In the present study...

  5. Nucleotide sequence of the coat protein gene of the Skierniewice isolate of plum pox virus (PPV)

    International Nuclear Information System (INIS)

    Wypijewski, K.; Musial, W.; Augustyniak, J.; Malinowski, T.

    1994-01-01

    The coat protein (CP) gene of the Skierniewice isolate of plum pox virus (PPV-S) has been amplified using the reverse transcription - polymerase chain reaction (RT-PCR), cloned and sequenced. The nucleotide sequence of the gene and the deduced amino-acid sequences of PPV-S CP were compared with those of other PPV strains. The nucleotide sequence showed very high homology to most of the published sequences. The motif: Asp-Ala-Gly (DAG), important for the aphid transmissibility, was present in the amino-acid sequence. Our isolate did not react in ELISA with monoclonal antibodies MAb06 supposed to be specific for PPV-D. (author). 32 refs, 1 fig., 2 tabs

  6. SAAS: Short Amino Acid Sequence - A Promising Protein Secondary Structure Prediction Method of Single Sequence

    Directory of Open Access Journals (Sweden)

    Zhou Yuan Wu

    2013-07-01

    Full Text Available In statistical methods of predicting protein secondary structure, many researchers focus on single amino acid frequencies in α-helices, β-sheets, and so on, or the impact near amino acids on an amino acid forming a secondary structure. But the paper considers a short sequence of amino acids (3, 4, 5 or 6 amino acids as integer, and statistics short sequence's probability forming secondary structure. Also, many researchers select low homologous sequences as statistical database. But this paper select whole PDB database. In this paper we propose a strategy to predict protein secondary structure using simple statistical method. Numerical computation shows that, short amino acids sequence as integer to statistics, which can easy see trend of short sequence forming secondary structure, and it will work well to select large statistical database (whole PDB database without considering homologous, and Q3 accuracy is ca. 74% using this paper proposed simple statistical method, but accuracy of others statistical methods is less than 70%.

  7. RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

    Science.gov (United States)

    Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

    1990-07-01

    The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.

  8. cDNA, deduced polypeptide structure and chromosomal assignment of human pulmonary surfactant proteolipid, SPL(pVal)

    International Nuclear Information System (INIS)

    Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.C.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.; Whitsett, J.A.

    1988-01-01

    In hyaline membrane disease of premature infants, lack of surfactant leads to pulmonary atelectasis and respiratory distress. Hydrophobic surfactant proteins of M/sub r/ = 5000-14,000 have been isolated from mammalian surfactants which enhance the rate of spreading and the surface tension lowering properties of phospholipids during dynamic compression. The authors have characterized the amino-terminal amino acid sequence of pulmonary proteolipids from ether/ethanol extracts of bovine, canine, and human surfactant. Two distinct peptides were identified and termed SPL(pVal) and SPL(Phe). An oligonucleotide probe based on the valine-rich amino-terminal amino acid sequence of SPL(pVal) was utilized to isolate cDNA and genomic DNA encoding the human protein, termed surfactant proteolipid SPL(pVal) on the basis of its unique polyvaline domain. The primary structure of a precursor protein of 20,870 daltons, containing the SPL(pVal) peptide, was deduced from the nucleotide sequence of the cDNAs. Hybrid-arrested translation and immunoprecipitation of labeled translation products of human mRNA demonstrated a precursor protein, the active hydrophobic peptide being produced by proteolytic processing. Two classes of cDNAs encoding SPL(pVal) were identified. Human SPL(pVal) mRNA was more abundant in the adult than in fetal lung. The SPL(pVal) gene locus was assigned to chromosome 8

  9. Cloning, sequence and expression of the pel gene from an Amycolata sp.

    Science.gov (United States)

    Brühlmann, F; Keen, N T

    1997-11-20

    The pel gene from an Amycolata sp. encoding a pectate lyase (EC 4.2.2.2) was isolated by activity screening a genomic DNA library in Streptomyces lividans TK24. Subsequent subcloning and sequencing of a 2.3 kb BamHI BglII fragment revealed an open reading frame of 930 nt corresponding to a protein of 29,660 Da. The overall G + C content for the coding region was 65%, with a strong G + C preference in the third (wobble) codon position (93%). A putative ribosome-binding site 5'-GGGAG-3' preceded the translational start codon by 7 base pairs. The Amycolata pectate lyase contains a signal peptide of 26 amino acids, that is cleaved after the sequence Ala-Thr-Ala. The size of the deduced protein as well as its N-terminal amino-acid sequence match the wild-type pectate lyase from the Amycolata sp. Expression of the pel gene in S. lividans TK24 resulted in high pectate lyase activity in the culture supernatant, concomitant with the appearance of a dominant protein band on a sodium dodecyl polyacrylamide gel at 30 kDa. No pectate lyase activity was detected in E. coli BL21 with the pel gene under the strong T7 promotor. The deduced amino-acid sequence showed 40% identity with PelE from Erwinia chrysanthemi and the pectate lyase from Glomerella cingulata. The Amycolata pectate lyase clearly belongs to the pectate lyase superfamily, sharing all functional amino acids and likely has a similar structural topology as Pels from Erwinia chrysanthemi and Bacillus subtilis.

  10. Molecular cloning and expression of the hyu genes from Microbacterium liquefaciens AJ 3912, responsible for the conversion of 5-substituted hydantoins to alpha-amino acids, in Escherichia coli.

    Science.gov (United States)

    Suzuki, Shun'ichi; Takenaka, Yasuhiro; Onishi, Norimasa; Yokozeki, Kenzo

    2005-08-01

    A DNA fragment from Microbacterium liquefaciens AJ 3912, containing the genes responsible for the conversion of 5-substituted-hydantoins to alpha-amino acids, was cloned in Escherichia coli and sequenced. Seven open reading frames (hyuP, hyuA, hyuH, hyuC, ORF1, ORF2, and ORF3) were identified on the 7.5 kb fragment. The deduced amino acid sequence encoded by the hyuA gene included the N-terminal amino acid sequence of the hydantoin racemase from M. liquefaciens AJ 3912. The hyuA, hyuH, and hyuC genes were heterologously expressed in E. coli; their presence corresponded with the detection of hydantoin racemase, hydantoinase, and N-carbamoyl alpha-amino acid amido hydrolase enzymatic activities respectively. The deduced amino acid sequences of hyuP were similar to those of the allantoin (5-ureido-hydantoin) permease from Saccharomyces cerevisiae, suggesting that hyuP protein might function as a hydantoin transporter.

  11. Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

    Science.gov (United States)

    Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

    2007-01-01

    Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.

  12. Sequence analysis and overexpression of a pectin lyase gene (pel1) from Aspergillus oryzae KBN616.

    Science.gov (United States)

    Kitamoto, N; Yoshino-Yasuda, S; Ohmiya, K; Tsukagoshi, N

    2001-01-01

    A gene (pel1) encoding pectin lyase (Pel1) was isolated from a shoyu koji mold, Aspergillus oryzae KBN616, and characterized. The structural gene comprised 1,196 bp with a single intron. The ORF encoded 381 amino acids with a signal peptide of 20 amino acids. The deduced amino acid sequence showed high similarity to those of Aspergillus niger pectin lyases and Glomerella cingulata PnlA. The pel1 gene was successfully overexpressed under the promoter of the A. oryzae TEF1 gene. The molecular mass of the recombinant pectin lyase substantially coincided with that calculated based on nucleotide sequence.

  13. Complete coding sequence of the human raf oncogene and the corresponding structure of the c-raf-1 gene

    Energy Technology Data Exchange (ETDEWEB)

    Bonner, T I; Oppermann, H; Seeburg, P; Kerby, S B; Gunnell, M A; Young, A C; Rapp, U R

    1986-01-24

    The complete 648 amino acid sequence of the human raf oncogene was deduced from the 2977 nucleotide sequence of a fetal liver cDNA. The cDNA has been used to obtain clones which extend the human c-raf-1 locus by an additional 18.9 kb at the 5' end and contain all the remaining coding exons.

  14. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    1993-02-16

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a pu GOVERNMENT RIGHTS This application was funded under Department of Energy Contract DE-AC02-76ER01338. The U.S. Government has certain rights under this application and any patent issuing thereon.

  15. Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

    DEFF Research Database (Denmark)

    Ibaraki, K; Kozak, C A; Wewer, U M

    1995-01-01

    regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76......(s) of tetranectin. The sequence analysis revealed a difference in both sequence and size of the noncoding regions between mouse and human cDNAs. Northern analysis of the various tissues from mouse, rat, and cow showed the major transcript(s) to be approximately 1 kb, which is similar in size to that observed...

  16. Hybridization and sequencing of nucleic acids using base pair mismatches

    Science.gov (United States)

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  17. Comparison of Human and Guinea Pig Acetylcholinesterase Sequences and Rates of Oxime-Assisted Reactivation

    Science.gov (United States)

    2010-01-01

    of appropriate animal model systems. For OP poisoning, the guinea pig (Cavia porcellus) is a commonly used animal model because guinea pigs more...endogenous bioscavenger in vivo. Although guinea pigs historically have been used to test OP poisoning therapies, it has been found recently that guinea pig AChE...transcribed mRNA encoding guinea pig AChE, amplified the resulting cDNA, and sequenced this product. The nucleotide and deduced amino acid sequences of

  18. Variation of amino acid sequences of serum amyloid a (SAA) and immunohistochemical analysis of amyloid a (AA) in Japanese domestic cats.

    Science.gov (United States)

    Tei, Meina; Uchida, Kazuyuki; Chambers, James K; Watanabe, Ken-Ichi; Tamamoto, Takashi; Ohno, Koichi; Nakayama, Hiroyuki

    2018-02-02

    Amyloid A (AA) amyloidosis, a fatal systemic amyloid disease, occurs secondary to chronic inflammatory conditions in humans. Although persistently elevated serum amyloid A (SAA) levels are required for its pathogenesis, not all individuals with chronic inflammation necessarily develop AA amyloidosis. Furthermore, many diseases in cats are associated with the elevated production of SAA, whereas only a small number actually develop AA amyloidosis. We hypothesized that a genetic mutation in the SAA gene may strongly contribute to the pathogenesis of feline AA amyloidosis. In the present study, genomic DNA from four Japanese domestic cats (JDCs) with AA amyloidosis and from five without amyloidosis was analyzed using polymerase chain reaction (PCR) amplification and direct sequencing. We identified the novel variation combination of 45R-51A in the deduced amino acid sequences of four JDCs with amyloidosis and five without. However, there was no relationship between amino acid variations and the distribution of AA amyloid deposits, indicating that differences in SAA sequences do not contribute to the pathogenesis of AA amyloidosis. Immunohistochemical analysis using antisera against the three different parts of the feline SAA protein-i.e., the N-terminal, central, and C-terminal regions-revealed that feline AA contained the C-terminus, unlike human AA. These results indicate that the cleavage and degradation of the C-terminus are not essential for amyloid fibril formation in JDCs.

  19. CDNA encoding a polypeptide including a hevein sequence

    Science.gov (United States)

    Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

    1995-03-21

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  20. Optimization of short amino acid sequences classifier

    Science.gov (United States)

    Barcz, Aleksy; Szymański, Zbigniew

    This article describes processing methods used for short amino acid sequences classification. The data processed are 9-symbols string representations of amino acid sequences, divided into 49 data sets - each one containing samples labeled as reacting or not with given enzyme. The goal of the classification is to determine for a single enzyme, whether an amino acid sequence would react with it or not. Each data set is processed separately. Feature selection is performed to reduce the number of dimensions for each data set. The method used for feature selection consists of two phases. During the first phase, significant positions are selected using Classification and Regression Trees. Afterwards, symbols appearing at the selected positions are substituted with numeric values of amino acid properties taken from the AAindex database. In the second phase the new set of features is reduced using a correlation-based ranking formula and Gram-Schmidt orthogonalization. Finally, the preprocessed data is used for training LS-SVM classifiers. SPDE, an evolutionary algorithm, is used to obtain optimal hyperparameters for the LS-SVM classifier, such as error penalty parameter C and kernel-specific hyperparameters. A simple score penalty is used to adapt the SPDE algorithm to the task of selecting classifiers with best performance measures values.

  1. Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

    Science.gov (United States)

    Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

    2003-01-01

    A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171

  2. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  3. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    1999-05-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 12 figs.

  4. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    1999-05-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  5. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    1995-03-21

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 11 figures.

  6. Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

    Science.gov (United States)

    Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

    1995-01-01

    Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant. PMID:7896694

  7. Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

    Science.gov (United States)

    Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

    1995-04-01

    Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant.

  8. MEANS AND METHODS FOR CLONING NUCLEIC ACID SEQUENCES

    NARCIS (Netherlands)

    Geertsma, Eric Robin; Poolman, Berend

    2008-01-01

    The invention provides means and methods for efficiently cloning nucleic acid sequences of interest in micro-organisms that are less amenable to conventional nucleic acid manipulations, as compared to, for instance, E.coli. The present invention enables high-throughput cloning (and, preferably,

  9. cDNA encoding a polypeptide including a hev ein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  10. Cloning and sequence analysis of serine proteinase of Gloydius ussuriensis venom gland

    International Nuclear Information System (INIS)

    Sun Dejun; Liu Shanshan; Yang Chunwei; Zhao Yizhuo; Chang Shufang; Yan Weiqun

    2005-01-01

    Objective: To construct a cDNA library by using mRNA from Gloydius ussuriensis (G. Ussuriensis) venom gland, to clone and analyze serine proteinase gene from the cDNA library. Methods: Total RNA was isolated from venom gland of G. ussuriensis, mRNA was purified by using mRNA isolation Kit. The whole length cDNA was synthesized by means of smart cDNA synthesis strategy, and amplified by long distance PCR procedure, lately cDAN was cloned into vector pBluescrip-sk. The recombinant cDNA was transformed into E. coli DH5α. The cDNA of serine proteinase gene in the venom gland of G. ussuriensis was detected and amplified using the in situ hybridization. The cDNA fragment was inserted into pGEMT vector, cloned and its nucleotide sequence was determined. Results: The capacity of cDNA library of venom gland was above 2.3 x 10 6 . Its open reading frame was composed of 702 nucleotides and coded a protein pre-zymogen of 234 amino acids. It contained 12 cysteine residues. The sequence analysis indicated that the deduced amino acid sequence of the cDNA fragment shared high identity with the thrombin-like enzyme genes of other snakes in the GenBank. the query sequence exhibited strong amino acid sequence homology of 85% to the serine proteas of T. gramineus, thrombin-like serine proteinase I of D. acutus and serine protease catroxase II of C. atrox respectively. Based on the amino acid sequences of other thrombin-like enzymes, the catalytic residues and disulfide bridges of this thrombin-like enzyme were deduced as follows: catalytic residues, His 41 , Asp 86 , Ser 180 ; and six disulfide bridges Cys 7 -Cys 139 , Cys 26 -Cys 42 , Cys 74 -Cys 232 , Cys 118 -Cys 186 , Cys 150 -Cys 165 , Cys 176 -Cys 201 . Conclusion: The capacity of cDNA library of venom gland is above 2.3 x 10 6 , overtop the level of 10 5 capicity. The constructed cDNA library of G. ussuriensis venom gland would be helpful platform to detect new target genes and further gene manipulate. The cloned serine

  11. [Cloning and sequencing of the papA gene from uropathogenic Escherichia coli 4030 strain].

    Science.gov (United States)

    Wu, Qinggang; Zhang, Jingping; Zhao, Chuncheng; Zhu, Jianguo

    2008-09-01

    Cloning and sequencing of the papA gene from uropathogenic Escherichia coli 4030 strain to investigate the differences of the sequences of the papA of UPEC4030 strain and the ones of related genes, in order to make whether or not it was a new genotype. Cloning and sequencing methods were used to analyze the sequence of the papA of UPEC4030 strain in comparison with related sequences. The sequence analysis of papA revealed a 722 bp gene and encode 192 amino acid polypeptide. The overall homology of the papA genes between UPEC4030 and the standard strains of ten F types were 36.11%-77.95% and 22.20%-78.34% at nucleotide and deduced amino acid levels. The homology between the sequence of the reverse primers and the corresponding sequence of UPEC4030 papA was 10%-66.67%. The results confirmed that UPEC4030 strain contained a novel papA variant. UPEC4030 strain could contain an unknown papA variant or the novel genotype. The pathogenic mechanism and epidemiology related need to be further studied.

  12. Murine protein H is comprised of 20 repeating units, 61 amino acids in length

    DEFF Research Database (Denmark)

    Kristensen, Torsten; Tack, B F

    1986-01-01

    A cDNA library constructed from size-selected (greater than 28 S) poly(A)+ RNA isolated from the livers of C57B10. WR mice was screened by using a 249-base-pair (bp) cDNA fragment encoding 83 amino acid residues of human protein H as a probe. Of 120,000 transformants screened, 30 hybridized......, 448 bp of 3'-untranslated sequence, and a polyadenylylated tail of undetermined length. Murine pre-protein H was deduced to consist of an 18-amino acid signal peptide and 1216 residues of H-protein sequence. Murine H was composed of 20 repetitive units, each about 61 amino acid residues in length...

  13. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    DEFF Research Database (Denmark)

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active...... related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein...... sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally...

  14. A newly constructed primer pair for the PCR amplification, cloning and sequencing of the flagellin (flaA) gene from isolatesof urease-negative Campylobacter lari.

    Science.gov (United States)

    Sekizuka, Tsuyoshi; Yokoi, Taeko; Murayama, Ohoshi; Millar, B Cherie; Moore, Johne; Matsuda, Motoo

    2005-08-01

    A newly constructed primer pair (lari-Af/lari-Ar) designed to generate a product of the flagellin (flaA) gene for urease-negative Campylobacter lari produced a PCR amplicon of about 1700 bp for 16 isolates from 7 seagulls, 5 humans, 3 food animals and one mussel in Japan and Northern Ireland. Nucleotide sequencing and alignments of the flaA amplicons from these isolates demonstrated that the deduced amino acid sequences of the possible open reading frame were 564-572 amino acid residues in length with calculated molecular weights of 58,804 to 59,463. The deduced amino acid sequence similarity analysis strongly suggested that the ORF of the flaA from the 16 isolates showed 70-75% sequence similarities to those of Campylobacter jejuni isolates. The approximate Mr of the flagellin purified from some of the isolates of urease-negative C. lari was estimated to range from 59.6 to 61.8 kDa. Thus, flagellin from the isolates of urease-negative C. lari was shown for the first time to have a molecular size similar to those of C. jejuni and Campylobacter coli isolates, but to be different from the shorter flaA and smaller flagellin of urease-positive thermophilic Campylobacter (UPTC) isolates. Flagellins from C. lari spp., consisting of the two representative taxa of urease-negative C. lari and UPTC, thus show genotypic and phenotypic diversity.

  15. Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.

    Science.gov (United States)

    Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S

    1985-07-01

    The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.

  16. Murine mammary tumor virus pol-related sequences in human DNA: characterization and sequence comparison with the complete murine mammary tumor virus pol gene

    International Nuclear Information System (INIS)

    Deen, K.C.; Sweet, R.W.

    1986-01-01

    Sequences in the human genome with homology to the murine mammary tumor virus (MMTV) pol gene were isolated from a human phage library. Ten clones with extensive pol homology were shown to define five separate loci. These loci share common sequences immediately adjacent to the pol-like segments and, in addition, contain a related repeat element which bounds this region. This organization is suggestive of a proviral structure. The authors estimate that the human genome contains 30 to 40 copies of these pol-related sequences. The pol region of one of the cloned segments (HM16) and the complete MMTV pol gene were sequenced and compared. The nucleotide homology between these pol sequences is 52% and is concentrated in the terminal regions. The MMTV pol gene contains a single long open reading frame encoding 899 amino acids and is demarcated from the partially overlapping putative gag gene by termination codons and a shift in translational reading frame. The pol sequence of HM16 is multiply terminated but does contain open reading frames which encode 370, 105, and 112 amino acids residues in separate reading frames. The authors deduced a composite pol protein sequence for HM16 by aligning it to the MMTV pol gene and then compared these sequences with other retroviral pol protein sequences. Conserved sequences occur in both the amino and carboxyl regions which lie within the polymerase and endonuclease domains of pol, respectively

  17. Evolutionary history of Calosomina ground beetles (Coleoptera, Carabidae, Carabinae) of the world as deduced from sequence comparisons of the mitochondrial ND 5 gene.

    Science.gov (United States)

    Su, Zhi-Hui; Imura, Yûki; Osawa, Syozo

    2005-11-07

    We deduced the phylogenetic relationships of 54 individuals representing 27 species of the Calosomina (Coleoptera, Carabidae) from various regions of the world from the mitochondrial NADH dehydrogenase subunit 5 (ND 5) gene sequences. The results suggest that these Calosomina radiated into 17 lineages within a short time about 30 million years ago (Mya). Most of the lineages are composed of a single genus containing only one or a few species. In some cases, several species classified into the same genus (e.g., Calosoma maximowiczi, Calos. inquisitor and Calos. frigidum) appear separately in independent lineages, while in others a series of species classified into different genera fall into one lineage (e.g., Chrysostigma calidum, Blaptosoma chihuahua, Microcallisthenes wilkesi and Callisthenes spp.). Based on this molecular phylogeny and morphological data, the probable evolutionary history and mode of morphological differentiation of the Calosomina are discussed.

  18. Recent advances in nanopore-based nucleic acid analysis and sequencing

    International Nuclear Information System (INIS)

    Shi, Jidong; Fang, Ying; Hou, Junfeng

    2016-01-01

    Nanopore-based sequencing platforms are transforming the field of genomic science. This review (containing 116 references) highlights some recent progress on nanopore-based nucleic acid analysis and sequencing. These studies are classified into three categories, biological, solid-state, and hybrid nanopores, according to their nanoporous materials. We begin with a brief description of the translocation-based detection mechanism of nanopores. Next, specific examples are given in nanopore-based nucleic acid analysis and sequencing, with an emphasis on identifying strategies that can improve the resolution of nanopores. This review concludes with a discussion of future research directions that will advance the practical applications of nanopore technology. (author)

  19. Structural and Functional Insights from the Metagenome of an Acidic Hot Spring Microbial Planktonic Community in the Colombian Andes

    NARCIS (Netherlands)

    Jiménez Avella, Diego; Dini Andreote, Fernando; Chaves, Diego; Montaña, José Salvador; Osorio-Forero, Cesar; Junca, Howard; Zambrano, María Mercedes; Baena, Sandra

    2012-01-01

    A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level) acidic hot spring El Coquito (EC). A classification of unassembled metagenomic reads

  20. Deducing magnetic resonance neuroimages based on knowledge from samples.

    Science.gov (United States)

    Jiang, Yuwei; Liu, Feng; Fan, Mingxia; Li, Xuzhou; Zhao, Zhiyong; Zeng, Zhaoling; Wang, Yi; Xu, Dongrong

    2017-12-01

    Because individual variance always exists, using the same set of predetermined parameters for magnetic resonance imaging (MRI) may not be exactly suitable for each participant. We propose a knowledge-based method that can repair MRI data of undesired contrast as if a new scan were acquired using imaging parameters that had been individually optimized. The method employed a strategy called analogical reasoning to deduce voxel-wise relaxation properties using morphological and biological similarity. The proposed framework involves steps of intensity normalization, tissue segmentation, relaxation time deducing, and image deducing. This approach has been preliminarily validated using conventional MRI data at 3T from several examples, including 5 normal and 9 clinical datasets. It can effectively improve the contrast of real MRI data by deducing imaging data using optimized imaging parameters based on deduced relaxation properties. The statistics of deduced images shows a high correlation with real data that were actually collected using the same set of imaging parameters. The proposed method of deducing MRI data using knowledge of relaxation times alternatively provides a way of repairing MRI data of less optimal contrast. The method is also capable of optimizing an MRI protocol for individual participants, thereby realizing personalized MR imaging. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. WEB-server for search of a periodicity in amino acid and nucleotide sequences

    Science.gov (United States)

    E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

    2017-12-01

    A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.

  2. Representation of protein-sequence information by amino acid subalphabets

    DEFF Research Database (Denmark)

    Andersen, C.A.F.; Brunak, Søren

    2004-01-01

    -sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...

  3. Soil amino acid composition across a boreal forest successional sequence

    Science.gov (United States)

    Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone

    2009-01-01

    Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...

  4. Molecular identification based on coat protein sequences of the Barley yellow dwarf virus from Brazil

    Directory of Open Access Journals (Sweden)

    Talita Bernardon Mar

    2013-12-01

    Full Text Available Yellow dwarf disease, one of the most important diseases of cereal crops worldwide, is caused by virus species belonging to the Luteoviridae family. Forty-two virus isolates obtained from oat (Avena sativa L., wheat (Triticum aestivum L., barley (Hordeum vulgare L., corn (Zea mays L., and ryegrass (Lolium multiflorum Lam. collected between 2007 and 2008 from winter cereal crop regions in southern Brazil were screened by polymerase chain reaction (PCR with primers designed on ORF 3 (coat protein - CP for the presence of Barley yellow dwarf virus and Cereal yellow dwarf virus (B/CYDV. PCR products of expected size (~357 bp for subgroup II and (~831 bp for subgroup I were obtained for three and 39 samples, respectively. These products were cloned and sequenced. The subgroup II 3' partial CP amino acid deduced sequences were identified as BYDV-RMV (92 - 93 % of identity with "Illinois" Z14123 isolate. The complete CP amino acid deduced sequences of subgroup I isolates were confirmed as BYDV-PAV (94 - 99 % of identity and established a very homogeneous group (identity higher than 99 %. These results support the prevalence of BYDV-PAV in southern Brazil as previously diagnosed by Enzyme-Linked Immunosorbent Assay (ELISA and suggest that this population is very homogeneous. To our knowledge, this is the first report of BYDV-RMV in Brazil and the first genetic diversity study on B/CYDV in South America.

  5. Human acid β-glucosidase: isolation and amino acid sequence of a peptide containing the catalytic site

    International Nuclear Information System (INIS)

    Dinur, T.; Osiecki, K.M.; Legler, G.; Gatt, S.; Desnick, R.J.; Grabowski, G.A.

    1986-01-01

    Human acid β-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase, EC 3.2.1.45) cleaves the glucosidic bonds of glucosylceramide and synthetic β-glucosides. The deficient activity of this hydrolase is the enzymatic defect in the subtypes and variants of Gaucher disease, the most prevalent lysosomal storage disease. To isolate and characterize the catalytic site of the normal enzyme, brominated 3 H-labeled conduritol B epoxide ( 3 H-Br-CBE), which inhibits the enzyme by binding covalently to this site, was used as an affinity label. Under optimal conditions 1 mol of 3 H-Br-CBE bound to 1 mol of pure enzyme protein, indicating the presence of a single catalytic site per enzyme subunit. After V 8 protease digestion of the 3 H-Br-CBE-labeled homogeneous enzyme, three radiolabeled peptides, designated peptide A, B, or C, were resolved by reverse-phase HPLC. The partial amino acid sequence (37 residues) of peptide A (M/sub r/, 5000) was determined. The sequence of this peptide, which contained the catalytic site, had exact homology to the sequence near the carboxyl terminus of the protein, as predicted from the nucleotide sequence of the full-length cDNA encoding acid β-glucosidase

  6. Lipoxygenase in Caragana jubata responds to low temperature, abscisic acid, methyl jasmonate and salicylic acid.

    Science.gov (United States)

    Bhardwaj, Pardeep Kumar; Kaur, Jagdeep; Sobti, Ranbir Chander; Ahuja, Paramvir Singh; Kumar, Sanjay

    2011-09-01

    Lipoxygenase (LOX) catalyses oxygenation of free polyunsaturated fatty acids into oxylipins, and is a critical enzyme of the jasmonate signaling pathway. LOX has been shown to be associated with biotic and abiotic stress responses in diverse plant species, though limited data is available with respect to low temperature and the associated cues. Using rapid amplification of cDNA ends, a full-length cDNA (CjLOX) encoding lipoxygenase was cloned from apical buds of Caragana jubata, a temperate plant species that grows under extreme cold. The cDNA obtained was 2952bp long consisting of an open reading frame of 2610bp encoding 869 amino acids protein. Multiple alignment of the deduced amino acid sequence with those of other plants demonstrated putative LH2/ PLAT domain, lipoxygenase iron binding catalytic domain and lipoxygenase_2 signature sequences. CjLOX exhibited up- and down-regulation of gene expression pattern in response to low temperature (LT), abscisic acid (ABA), methyl jasmonate (MJ) and salicylic acid (SA). Among all the treatments, a strong up-regulation was observed in response to MJ. Data suggests an important role of jasmonate signaling pathway in response to LT in C. jubata. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. Revised Mimivirus major capsid protein sequence reveals intron-containing gene structure and extra domain

    Directory of Open Access Journals (Sweden)

    Suzan-Monti Marie

    2009-05-01

    Full Text Available Abstract Background Acanthamoebae polyphaga Mimivirus (APM is the largest known dsDNA virus. The viral particle has a nearly icosahedral structure with an internal capsid shell surrounded with a dense layer of fibrils. A Capsid protein sequence, D13L, was deduced from the APM L425 coding gene and was shown to be the most abundant protein found within the viral particle. However this protein remained poorly characterised until now. A revised protein sequence deposited in a database suggested an additional N-terminal stretch of 142 amino acids missing from the original deduced sequence. This result led us to investigate the L425 gene structure and the biochemical properties of the complete APM major Capsid protein. Results This study describes the full length 3430 bp Capsid coding gene and characterises the 593 amino acids long corresponding Capsid protein 1. The recombinant full length protein allowed the production of a specific monoclonal antibody able to detect the Capsid protein 1 within the viral particle. This protein appeared to be post-translationnally modified by glycosylation and phosphorylation. We proposed a secondary structure prediction of APM Capsid protein 1 compared to the Capsid protein structure of Paramecium Bursaria Chlorella Virus 1, another member of the Nucleo-Cytoplasmic Large DNA virus family. Conclusion The characterisation of the full length L425 Capsid coding gene of Acanthamoebae polyphaga Mimivirus provides new insights into the structure of the main Capsid protein. The production of a full length recombinant protein will be useful for further structural studies.

  8. Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.

    Science.gov (United States)

    Wyszyńska-Koko, J; Kurył, J

    2004-01-01

    MYF6 gene codes for the bHLH transcription factor belonging to MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embriogenesis and continues on a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows a high conservation among species of 99 and 97% identity when comparing pig with cow and human, respectively, and of 93% when comparing pig with mouse and rat. The single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be T --> C transition recognized by a MspI restriction enzyme.

  9. Sequences of 12 monoclonal anti-dinitrophenyl spin-label antibodies for NMR studies

    International Nuclear Information System (INIS)

    Leahy, D.J.; Rule, G.S.; Whittaker, M.M.; McConnell, H.M.

    1988-01-01

    Eleven monoclonal antibodies specific for a spin-labeled dinitrophenyl hapten (DNP-SL) have been produces for use in NMR studies. They have been named AN01 and ANO3-AN12. The stability constants for the association of these antibodies with DNP-SL and related haptens were measured by fluorescence quenching. cDNA clones coding for the heavy and light chains of each antibody and of an additional anti-DNP-SL monoclonal antibody, ANO2, have been isolated. The nucleic acid sequence of the 5' end of each clone has been determined, and the amino acid sequence of the variable regions of each antibody has been deduced from the cDNA sequence. The sequences are relatively heterogeneous, but both the heavy and the light chains of ANO1 and ANO3 are derived from the same variable-region gene families as those of the ANO2 antibody. ANO7 has a heavy chain that is related to that of ANO2, and ANO9 has a related light chain. ANO5 and ANO6 are unrelated to ANO2 but share virtually identical heavy and light chains. Preliminary NMR difference spectra comparing related antibodies show that sequence-specific assignment of resonances is possible. Such spectra also provide a measure of structural relatedness

  10. Planarian homeobox genes: cloning, sequence analysis, and expression.

    Science.gov (United States)

    Garcia-Fernàndez, J; Baguñà, J; Saló, E

    1991-01-01

    Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to data. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appear slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) or regeneration. Images PMID:1714599

  11. Amino acid sequences and structures of chicken and turkey beta 2-microglobulin

    DEFF Research Database (Denmark)

    Welinder, K G; Jespersen, H M; Walther-Rasmussen, J

    1991-01-01

    The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11...

  12. Cloning and inactivation of a branched-chain-amino-acid aminotransferase gene from Staphylococcus carnosus and characterization of the enzyme

    DEFF Research Database (Denmark)

    Madsen, Søren M; Beck, Hans Christian; Ravn, Peter

    2002-01-01

    . The first step in the catabolism is most likely a transamination reaction catalyzed by BCAA aminotransferases (IlvE proteins). In this study, we cloned the ilvE gene from S. carnosus by using degenerate oligonucleotides and PCR. We found that the deduced amino acid sequence was 80% identical...... were essential for optimal cell growth....

  13. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase

    International Nuclear Information System (INIS)

    Finocchiaro, G.; Taroni, F.; Martin, A.L.; Colombo, I.; Tarelli, G.T.; DiDonato, S.; Rocchi, M.

    1991-01-01

    The authors have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH 2 -terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH 2 -terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hanster somatic cell hybrids

  14. cDNA and deduced primary structure of basic phospholipase A2 with neurotoxic activity from the venom secretion of the Crotalus durissus collilineatus rattlesnake

    Directory of Open Access Journals (Sweden)

    F.H.R. Fagundes

    2010-03-01

    Full Text Available To illustrate the construction of precursor complementary DNAs, we isolated mRNAs from whole venom samples. After reverse transcription polymerase chain reaction (RT-PCR, we amplified the cDNA coding for a neurotoxic protein, phospholipase A2 D49 (PLA2 D49, from the venom of Crotalus durissus collilineatus (Cdc PLA2. The cDNA encoding Cdc PLA2 from whole venom was sequenced. The deduced amino acid sequence of this cDNA has high overall sequence identity with the group II PLA2 protein family. Cdc PLA2 has 14 cysteine residues capable of forming seven disulfide bonds that characterize this group of PLA2 enzymes. Cdc PLA2 was isolated using conventional Sephadex G75 column chromatography and reverse-phase high performance liquid chromatography (RP-HPLC. The molecular mass was estimated using matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF mass spectrometry. We tested the neuromuscular blocking activities on chick biventer cervicis neuromuscular tissue. Phylogenetic analysis of Cdc PLA2 showed the existence of two lines of N6-PLA2, denominated F24 and S24. Apparently, the sequences of the New World’s N6-F24-PLA2 are similar to those of the agkistrodotoxin from the Asian genus Gloydius. The sequences of N6-S24-PLA2 are similar to the sequence of trimucrotoxin from the genus Protobothrops, found in the Old World.

  15. Complete genome sequences of three tomato spotted wilt virus isolates from tomato and pepper plants in Korea and their phylogenetic relationship to other TSWV isolates.

    Science.gov (United States)

    Lee, Jong-Seung; Cho, Won Kyong; Kim, Mi-Kyeong; Kwak, Hae-Ryun; Choi, Hong-Soo; Kim, Kook-Hyung

    2011-04-01

    Tomato spotted wilt virus (TSWV) infects numerous host plants and has three genome segments, called L, M and S. Here, we report the complete genome sequences of three Korean TSWV isolates (TSWV-1 to -3) infecting tomato and pepper plants. Although the nucleotide sequence of TSWV-1 genome isolated from tomato is very different from those of TSWV-2 and TSWV-3 isolated from pepper, the deduced amino acid sequences of the five TSWV genes are highly conserved among all three TSWV isolates. In phylogenetic analysis, deduced RdRp protein sequences of TSWV-2 and TSWV-3 were clustered together with two previously reported isolates from Japan and Korea, while TSWV-1 grouped together with a Hawaiian isolate. A phylogenetic tree based on N protein sequences, however, revealed four distinct groups of TSWV isolates, and all three Korean isolates belonged to group II, together with many other isolates, mostly from Europe and Asia. Interestingly, most American isolates grouped together as group I. Together, these results suggested that these newly identified TSWV isolates might have originated from an Asian ancestor and undergone divergence upon infecting different host plants.

  16. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    International Nuclear Information System (INIS)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-01-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary λgt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysognes demonstrated specific antibody binding to an SP4B/β-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lystates with SP-I antiserum yielded parallel displacement curves of 125 I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene

  17. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-07-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary lambdagt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysognes demonstrated specific antibody binding to an SP4B/..beta..-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lystates with SP-I antiserum yielded parallel displacement curves of /sup 125/I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene.

  18. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    Science.gov (United States)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  19. Primary structure of human pancreatic protease E determined by sequence analysis of the cloned mRNA

    International Nuclear Information System (INIS)

    Shen, W.; Fletcher, T.S.; Largman, C.

    1987-01-01

    Although protease E was isolated from human pancreas over 10 years ago, its amino acid sequence and relationship to the elastases have not been established. The authors report the isolation of a cDNA clone for human pancreatic protease E and determination of the nucleic acid sequence coding for the protein. The deduced amino acid sequence contains all of the features common to serine proteases. The substrate binding region is highly homologous to those of porcine and rat elastases 1, explaining the similar specificity for alanine reported for protease E and these elastases. However, the amino acid sequence outside the substrate binding region is less than 50% conserved, and there is a striking difference in the overall net charge for protease E (6-) and elastases 1 (8+). These findings confirm that protease E is a new member of the serine protease family. They have attempted to identify amino acid residues important for the interaction between elastases and elastin by examining the amino acid sequence differences between elastases and protease E. In addition to the large number of surface charge changes which are outside the substrate binding region, there are several changes which might be crucial for elastolysis: Leu-73/Arg-73; Arg-217A/Ala-217A; Arg-65A/Gln-65A; and the presence of two new cysteine residues (Cys-98 and Cys-99B) which computer modeling studies predict could form a new disulfide bond, not previously observed for serine proteases. They also present evidence which suggests that human pancreas does not synthesize a basic, alanine-specific elastase similar to porcine elastase 1

  20. Human α2-HS-glycoprotein: the A and B chains with a connecting sequence are encoded by a single mRNA transcript

    International Nuclear Information System (INIS)

    Lee, C.C.; Bowman, B.H.; Yang, F.

    1987-01-01

    The α 2 -HS-glycoprotein (AHSG) is a plasma protein reported to play roles in bone mineralization and in the immune response. It is composed of two subunits, the A and B chains. Recombinant plasmids containing human cDNA AHSG have been isolated by screening an adult human liver library with a mixed oligonucleotide probe. The cDNA clones containing AHSG inserts span approximately 1.5 kilobase pairs and include the entire AHSG coding sequence, demonstrating that the A and B chains are encoded by a single mRNA transcript. The cDNA sequence predicts an 18-amino-acid signal peptide, followed by the A-chain sequence of AHSG. A heretofore unseen connecting sequence of 40 amino acids was deduced between the A- and B-chain sequences. The connecting sequence demonstrates the unique amino acid doublets and collagen triplets found in the A and B chains; it is not homologous with other reported amino acid sequences. The connecting sequence may be cleaved in a posttranslational step by limited proteolysis before mature AHSG is released into the circulation or may vary in its presence because of alternative processing. The AHSG cDNA was utilized for mapping the AHSG gene to the 3q21→qter region of human chromosome 3. The availability of the AHSG cDNA clone will facilitate the analysis of its genetic control and gene expression during development and bone formation

  1. Isolation of laccase gene-specific sequences from white rot and brown rot fungi by PCR.

    Science.gov (United States)

    D'Souza, T M; Boominathan, K; Reddy, C A

    1996-01-01

    Degenerate primers corresponding to the consensus sequences of the copper-binding regions in the N-terminal domains of known basidiomycete laccases were used to isolate laccase gene-specific sequences from strains representing nine genera of wood rot fungi. All except three gave the expected PCR product of about 200 bp. Computer searches of the databases identified the sequence of each of the PCR products analyzed as a laccase gene sequence, suggesting the specificity of the primers. PCR products of the white rot fungi Ganoderma lucidum, Phlebia brevispora, and Trametes versicolor showed 65 to 74% nucleotide sequence similarity to each other; the similarity in deduced amino acid sequences was 83 to 91%. The PCR products of Lentinula edodes and Lentinus tigrinus, on the other hand, showed relatively low nucleotide and amino acid similarities (58 to 64 and 62 to 81%, respectively); however, these similarities were still much higher than when compared with the corresponding regions in the laccases of the ascomycete fungi Aspergillus nidulans and Neurospora crassa. A few of the white rot fungi, as well as Gloeophyllum trabeum, a brown rot fungus, gave a 144-bp PCR fragment which had a nucleotide sequence similarity of 60 to 71%. Demonstration of laccase activity in G. trabeum and several other brown rot fungi was of particular interest because these organisms were not previously shown to produce laccases. PMID:8837429

  2. Complete nucleotide sequences of avian metapneumovirus subtype B genome.

    Science.gov (United States)

    Sugiyama, Miki; Ito, Hiroshi; Hata, Yusuke; Ono, Eriko; Ito, Toshihiro

    2010-12-01

    Complete nucleotide sequences were determined for subtype B avian metapneumovirus (aMPV), the attenuated vaccine strain VCO3/50 and its parental pathogenic strain VCO3/60616. The genomes of both strains comprised 13,508 nucleotides (nt), with a 42-nt leader at the 3'-end and a 46-nt trailer at the 5'-end. The genome contains eight genes in the order 3'-N-P-M-F-M2-SH-G-L-5', which is the same order shown in the other metapneumoviruses. The genes are flanked on either side by conserved transcriptional start and stop signals and have intergenic sequences varying in length from 1 to 88 nt. Comparison of nt and predicted amino acid (aa) sequences of VCO3/60616 with those of other metapneumoviruses revealed higher homology with aMPV subtype A virus than with other metapneumoviruses. A total of 18 nt and 10 deduced aa differences were seen between the strains, and one or a combination of several differences could be associated with attenuation of VCO3/50.

  3. Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

    Science.gov (United States)

    Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

    1991-09-15

    We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.

  4. Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins

    DEFF Research Database (Denmark)

    Madsen, Peder; Rasmussen, H H; Leffers, H

    1992-01-01

    termed PA-FABP (psoriasis-associated fatty acid-binding protein). The deduced sequence predicted a protein with molecular weight of 15,164 daltons and a calculated pI of 6.96, values that are close to those recorded in the keratinocyte 2D gel protein database. The protein comigrated with PA-FABP...... as determined by 2D gel analysis of [35S]-methionine-labeled proteins expressed by transformed human amnion (AMA) cells transfected with clone 1592 using the vaccinia virus expression system and reacted with a rabbit polyclonal antibody raised against 2D gel purified PA-FABP. Structural analysis of the amino...... acid sequence revealed 48%, 52%, and 56% identity to known low-molecular-weight fatty acid-binding proteins belonging to the FABP family. Northern blot analysis showed that PA-FABP mRNA is indeed highly up-regulated in psoriatic keratinocytes. The transcript is present in human cell lines of epithelial...

  5. A Δ-9 Fatty Acid Desaturase Gene in the Microalga Myrmecia incisa Reisigl: Cloning and Functional Analysis

    Directory of Open Access Journals (Sweden)

    Wen-Bin Xue

    2016-07-01

    Full Text Available The green alga Myrmecia incisa is one of the richest natural sources of arachidonic acid (ArA. To better understand the regulation of ArA biosynthesis in M. incisa, a novel gene putatively encoding the Δ9 fatty acid desaturase (FAD was cloned and characterized for the first time. Rapid-amplification of cDNA ends (RACE was employed to yield a full length cDNA designated as MiΔ9FAD, which is 2442 bp long in sequence. Comparing cDNA open reading frame (ORF sequence to genomic sequence indicated that there are 8 introns interrupting the coding region. The deduced MiΔ9FAD protein is composed of 432 amino acids. It is soluble and localized in the chloroplast, as evidenced by the absence of transmembrane domains as well as the presence of a 61-amino acid chloroplast transit peptide. Multiple sequence alignment of amino acids revealed two conserved histidine-rich motifs, typical for Δ9 acyl-acyl carrier protein (ACP desaturases. To determine the function of MiΔ9FAD, the gene was heterologously expressed in a Saccharomyces cerevisiae mutant strain with impaired desaturase activity. Results of GC-MS analysis indicated that MiΔ9FAD was able to restore the synthesis of monounsaturated fatty acids, generating palmitoleic acid and oleic acid through the addition of a double bond in the Δ9 position of palmitic acid and stearic acid, respectively.

  6. Physics-based Inverse Problem to Deduce Marine Atmospheric Boundary Layer Parameters

    Science.gov (United States)

    2017-03-07

    knowledge and capabilities in the use and development of inverse problem techniques to deduce atmospheric parameters. WORK COMPLETED The research completed...please find the Final Technical Report with SF 298 for Dr. Erin E. Hackett’s ONR grant entitled Physics -based Inverse Problem to Deduce Marine...From- To) 07/03/2017 Final Technica l Dec 2012- Dec 2016 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Physics -based Inverse Problem to Deduce Marine

  7. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States); Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section; Wain-Hobson, S. [Pasteur Inst., Paris (France)] [ed.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  8. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...... proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...

  9. The amino acid sequence of snapping turtle (Chelydra serpentina) ribonuclease

    NARCIS (Netherlands)

    Beintema, Jacob; Broos, Jaap; Meulenberg, Janneke; Schüller, Cornelis

    1985-01-01

    Snapping turtle (Chelydra serpentina) ribonuclease was isolated from pancreatic tissue. Turtle ribonuclease binds much more weakly to the affinity chromatography matrix used than mammalian ribonucleases. The amino acid sequence was determined from overlapping peptides obtained from three different

  10. Nucleotide sequence of a cDNA coding for the barley seed protein CMa: an inhibitor of insect α-amylase

    DEFF Research Database (Denmark)

    Rasmussen, Søren Kjærsgård; Johansson, A.

    1992-01-01

    The primary structure of the insect alpha-amylase inhibitor CMa of barley seeds was deduced from a full-length cDNA clone pc43F6. Analysis of RNA from barley endosperm shows high levels 15 and 20 days after flowering. The cDNA predicts an amino acid sequence of 119 residues preceded by a signal...... peptide of 25 amino acids. Ala and Leu account for 55% of the signal peptide. CMa is 60-85% identical with alpha-amylase inhibitors of wheat, but shows less than 50% identity to trypsin inhibitors of barley and wheat. The 10 Cys residues are located in identical positions compared to the cereal inhibitor...

  11. Correlation between fibroin amino acid sequence and physical silk properties.

    Science.gov (United States)

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.

  12. The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

    International Nuclear Information System (INIS)

    Nylund, Stian; Karlsen, Marius; Nylund, Are

    2008-01-01

    The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Science.gov (United States)

    2010-07-01

    ... mature protein, with the number 1. When presented, the amino acids preceding the mature protein, e.g... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter... data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  14. Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses

    Science.gov (United States)

    Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.

    2004-01-01

    The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3??? N-U-P-M-F-HN-L 5???, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown), encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.

  15. Purification of MUC1 from Bovine Milk-Fat Globules and Characterization of a Corresponding Full-Length cDNA Clone

    DEFF Research Database (Denmark)

    Pallesen, Lone Tjener; Andersen, Mikkel Holmen; Nielsen, Rune

    2001-01-01

    acid sequences obtained by peptide mapping. The complete amino acid sequence of MUC1 was determined by cloning and sequencing the corresponding bovine mammary gland cDNA, which was shown to encode a protein of 580 amino acid residues comprising a cleavable signal peptide of 22 residues. The deduced...

  16. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    Science.gov (United States)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  17. Permutation Entropy for Random Binary Sequences

    Directory of Open Access Journals (Sweden)

    Lingfeng Liu

    2015-12-01

    Full Text Available In this paper, we generalize the permutation entropy (PE measure to binary sequences, which is based on Shannon’s entropy, and theoretically analyze this measure for random binary sequences. We deduce the theoretical value of PE for random binary sequences, which can be used to measure the randomness of binary sequences. We also reveal the relationship between this PE measure with other randomness measures, such as Shannon’s entropy and Lempel–Ziv complexity. The results show that PE is consistent with these two measures. Furthermore, we use PE as one of the randomness measures to evaluate the randomness of chaotic binary sequences.

  18. The nucleotide sequence of a Polish isolate of Tomato torrado virus.

    Science.gov (United States)

    Budziszewska, Marta; Obrepalska-Steplowska, Aleksandra; Wieczorek, Przemysław; Pospieszny, Henryk

    2008-12-01

    A new virus was isolated from greenhouse tomato plants showing symptoms of leaf and apex necrosis in Wielkopolska province in Poland in 2003. The observed symptoms and the virus morphology resembled viruses previously reported in Spain called Tomato torrado virus (ToTV) and that in Mexico called Tomato marchitez virus (ToMarV). The complete genome of a Polish isolate Wal'03 was determined using RT-PCR amplification using oligonucleotide primers developed against the ToTV sequences deposited in Genbank, followed by cloning, sequencing, and comparison with the sequence of the type isolate. Phylogenetic analyses, performed on the basis of fragments of polyproteins sequences, established the relationship of Polish isolate Wal'03 with Spanish ToTV and Mexican ToMarV, as well as with other viruses from Sequivirus, Sadwavirus, and Cheravirus genera, reported to be the most similar to the new tomato viruses. Wal'03 genome strands has the same organization and very high homology with the ToTV type isolate, showing only some nucleotide and deduced amino acid changes, in contrast to ToMarV, which was significantly different. The phylogenetic tree clustered aforementioned viruses to the same group, indicating that they have a common origin.

  19. Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

    Science.gov (United States)

    Sugimura; Sawabe; Ezura

    2000-01-01

    The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.

  20. Molecular characterisation and nucleotide sequence analysis of canine parvovirus strains in vaccines in India

    Directory of Open Access Journals (Sweden)

    Sukdeb Nandi

    2010-03-01

    Full Text Available Canine parvovirus 2 (CPV‑2 is one of the most important viruses that causes haemorrhagic gastroenteritis and myocarditis of dogs worldwide. The picture has been complicated further due to the emergence of new mutants of CPV, namely: CPV‑2a, CPV‑2b and CPV‑2c. In this study, the molecular characterisation of strains present in the CPV vaccines available on the Indian market was performed using polymerase chain reaction and DNA sequencing. The VP1/VP2 genes of two vaccine strains and a field strain (Bhopal were sequenced and the nucleotide and the deduced amino acid sequences were compared. The results indicated that the isolate belonged to CPV type 2b and the strains in the vaccines belonged to type CPV‑2. From the study, it is inferred that the CPV strain used in commercially available vaccine preparation differed from the strains present in CPV infection in dogs in India

  1. Molecular characterisation and nucleotide sequence analysis of canine parvovirus strains in vaccines in India.

    Science.gov (United States)

    Nandi, Sukdeb; Anbazhagan, Rajendra; Kumar, Manoj

    2010-01-01

    Canine parvovirus 2 (CPV-2) is one of the most important viruses that causes haemorrhagic gastroenteritis and myocarditis of dogs worldwide. The picture has been complicated further due to the emergence of new mutants of CPV, namely: CPV-2a, CPV-2b and CPV-2c. In this study, the molecular characterisation of strains present in the CPV vaccines available on the Indian market was performed using polymerase chain reaction and DNA sequencing. The VP1/VP2 genes of two vaccine strains and a field strain (Bhopal) were sequenced and the nucleotide and the deduced amino acid sequences were compared. The results indicated that the isolate belonged to CPV type 2b and the strains in the vaccines belonged to type CPV-2. From the study, it is inferred that the CPV strain used in commercially available vaccine preparation differed from the strains present in CPV infection in dogs in India.

  2. Nucleotide sequence of the human N-myc gene

    International Nuclear Information System (INIS)

    Stanton, L.W.; Schwab, M.; Bishop, J.M.

    1986-01-01

    Human neuroblastomas frequently display amplification and augmented expression of a gene known as N-myc because of its similarity to the protooncogene c-myc. It has therefore been proposed that N-myc is itself a protooncogene, and subsequent tests have shown that N-myc and c-myc have similar biological activities in cell culture. The authors have now detailed the kinship between N-myc and c-myc by determining the nucleotide sequence of human N-myc and deducing the amino acid sequence of the protein encoded by the gene. The topography of N-myc is strikingly similar to that of c-myc: both genes contain three exons of similar lengths; the coding elements of both genes are located in the second and third exons; and both genes have unusually long 5' untranslated regions in their mRNAs, with features that raise the possibility that expression of the genes may be subject to similar controls of translation. The resemblance between the proteins encoded by N-myc and c-myc sustains previous suspicions that the genes encode related functions

  3. Secondary structure classification of amino-acid sequences using state-space modeling

    OpenAIRE

    Brunnert, Marcus; Krahnke, Tillmann; Urfer, Wolfgang

    2001-01-01

    The secondary structure classification of amino acid sequences can be carried out by a statistical analysis of sequence and structure data using state-space models. Aiming at this classification, a modified filter algorithm programmed in S is applied to data of three proteins. The application leads to correct classifications of two proteins even when using relatively simple estimation methods for the parameters of the state-space models. Furthermore, it has been shown that the assumed initial...

  4. [Complete genome sequencing of polymalic acid-producing strain Aureobasidium pullulans CCTCC M2012223].

    Science.gov (United States)

    Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang

    2017-01-04

    To explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide genetic background for metabolic engineering. Complete genome of A. pullulans CCTCC M2012223 was sequenced by Illumina HiSeq high throughput sequencing platform. Then, fragment assembly, gene prediction, functional annotation, and GO/COG cluster were analyzed in comparison with those of other five A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30756831 bp with an average GC content of 47.49%, and 9452 genes were successfully predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the biggest genome assembly size. Protein sequences involved in the pullulan and polymalic acid pathway were highly conservative in all of six A. pullulans varieties. Although both A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum have a close affinity, some point mutation and inserts were occurred in protein sequences involved in melanin biosynthesis. Genome information of A. pullulans CCTCC M2012223 was annotated and genes involved in melanin, pullulan and polymalic acid pathway were compared, which would provide a theoretical basis for genetic modification of metabolic pathway in A. pullulans.

  5. Amino-acid sequence of two trypsin isoinhibitors, ITD I and ITD III from squash seeds (Cucurbita maxima).

    Science.gov (United States)

    Wilusz, T; Wieczorek, M; Polanowski, A; Denton, A; Cook, J; Laskowski, M

    1983-01-01

    The amino-acid sequences of two trypsin isoinhibitors, ITD I and ITD III, from squash seeds (Cucurbita maxima) were determined. Both isoinhibitors contain 29 amino-acid residues, including 6 half cystine residues. They differ only by one amino acid. Lysine in position 9 of ITD III is substituted by glutamic acid in ITD I. Arginine in position 5 is present at the reactive site of both isoinhibitors. The previously published sequence of ITD III has been shown to be incorrect.

  6. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    Science.gov (United States)

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

  7. Isolation of laccase gene-specific sequences from white rot and brown rot fungi by PCR

    Energy Technology Data Exchange (ETDEWEB)

    D`Souza, T.M.; Boominathan, K.; Reddy, C.A. [Michigan State Univ., East Lansing, MI (United States)

    1996-10-01

    Degenerate primers corresponding to the consensus sequences of the copper-binding regions in the N-terminal domains of known basidiomycete laccases were used to isolate laccase gene-specific sequences from strains representing nine genera of wood rot fungi. All except three gave the expected PCR product of about 200 bp. Computer searches of the databases identified the sequences of each of the PCR product of about 200 bp. Computer searches of the databases identified the sequence of each of the PCR products analyzed as a laccase gene sequence, suggesting the specificity of the primers. PCR products of the white rot fungi Ganoderma lucidum, Phlebia brevispora, and Trametes versicolor showed 65 to 74% nucleotide sequence similarity to each other; the similarity in deduced amino acid sequences was 83 to 91%. The PCR products of Lentinula edodes and Lentinus tigrinus, on the other hand, showed relatively low nucleotide and amino acid similarities (58 to 64 and 62 to 81%, respectively); however, these similarities were still much higher than when compared with the corresponding regions in the laccases of the ascomycete fungi Aspergillus nidulans and Neurospora crassa. A few of the white rot fungi, as well as Gloeophyllum trabeum, a brown rot fungus, gave a 144-bp PCR fragment which had a nucleotide sequence similarity of 60 to 71%. Demonstration of laccase activity in G. trabeum and several other brown rot fungi was of particular interest because these organisms were not previously shown to produce laccases. 36 refs., 6 figs., 2 tabs.

  8. Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

    Directory of Open Access Journals (Sweden)

    Michael Andrew Meyer

    2013-07-01

    Full Text Available The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4 to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3. For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  9. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    International Nuclear Information System (INIS)

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L.

    1988-01-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity

  10. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Directory of Open Access Journals (Sweden)

    Xiaoxia Yang

    Full Text Available Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  11. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Science.gov (United States)

    Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

    2015-01-01

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  12. Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores.

    Science.gov (United States)

    Bastien, Olivier; Maréchal, Eric

    2008-08-07

    Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2) following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory) is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure). Homologous sequences were considered as systems 1) having a high redundancy of information reflected by the magnitude of their alignment scores, 2) which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a constant rate, corresponding to the

  13. Identification of a 4-Deoxy-l-erythro-5-hexoseulose Uronic Acid Reductase, FlRed, in an Alginolytic Bacterium Flavobacterium sp. Strain UMI-01

    Directory of Open Access Journals (Sweden)

    Akira Inoue

    2015-01-01

    Full Text Available In alginate-assimilating bacteria, alginate is depolymerized to unsaturated monosaccharide by the actions of endolytic and exolytic alginate lyases (EC 4.2.2.3 and EC 4.2.2.11. The monosaccharide is non-enzymatically converted to 4-deoxy-l-ery thro-5-hexoseulose uronic acid (DEH, then reduced to 2-keto-3-deoxy-d-gluconate (KDG by a specific reductase, and metabolized through the Entner–Doudoroff pathway. Recently, the NADPH-dependent reductase A1-R that belongs to short-chain dehydrogenases/reductases (SDR superfamily was identified as the DEH-reductase in Sphingomonas sp. A1. We have subsequently noticed that an SDR-like enzyme gene, flred, occurred in the genome of an alginolytic bacterium Flavobacterium sp. strain UMI-01. In the present study, we report on the deduced amino-acid sequence of flred and DEH-reducing activity of recombinant FlRed. The deduced amino-acid sequence of flred comprised 254 residues and showed 34% amino-acid identities to that of A1-R from Sphingomonas sp. A1 and 80%–88% to those of SDR-like enzymes from several alginolytic bacteria. Common sequence motifs of SDR-superfamily enzymes, e.g., the catalytic tetrad Asn-Lys-Tyr-Ser and the cofactor-binding sequence Thr-Gly-x-x-x-Gly-x-Gly in Rossmann fold, were completely conserved in FlRed. On the other hand, an Arg residue that determined the NADPH-specificity of Sphingomonas A1-R was replaced by Glu in FlRed. Thus, we investigated cofactor-preference of FlRed using a recombinant enzyme. As a result, the recombinant FlRed (recFlRed was found to show high specificity to NADH. recFlRed exhibited practically no activity toward variety of aldehyde, ketone, keto ester, keto acid and aldose substrates except for DEH. On the basis of these results, we conclude that FlRed is the NADH-dependent DEH-specific SDR of Flavobacterium sp. strain UMI-01.

  14. The complete nucleotide sequence of the barley yellow dwarf GPV isolate from China shows that it is a new member of the genus Polerovirus.

    Science.gov (United States)

    Zhang, Wenwei; Cheng, Zhuomin; Xu, Lei; Wu, Maosen; Waterhouse, Peter; Zhou, Guanghe; Li, Shifang

    2009-01-01

    The complete nucleotide sequence of the ssRNA genome of a Chinese GPV isolate of barley yellow dwarf virus (BYDV) was determined. It comprised 5673 nucleotides, and the deduced genome organization resembled that of members of the genus Polerovirus. It was most closely related to cereal yellow dwarf virus-RPV (77% nt identity over the entire genome; coat protein amino acid identity 79%). The GPV isolate also differs in vector specificity from other BYDV strains. Biological properties, phylogenetic analyses and detailed sequence comparisons suggest that GPV should be considered a member of a new species within the genus, and the name Wheat yellow dwarf virus-GPV is proposed.

  15. Human thyroid peroxidase: complete cDNA and protein sequence, chromosome mapping, and identification of two alternately spliced mRNAs

    International Nuclear Information System (INIS)

    Kimura, S.; Kotani, T.; McBride, O.W.; Umeki, K.; Hirai, K.; Nakayama, T.; Ohtaki, S.

    1987-01-01

    Two forms of human thyroid peroxidase cDNAs were isolated from a λgt11 cDNA library, prepared from Graves disease thyroid tissue mRNA, by use of oligonucleotides. The longest complete cDNA, designated phTPO-1, has 3048 nucleotides and an open reading frame consisting of 933 amino acids, which would encode a protein with a molecular weight of 103,026. Five potential asparagine-linked glycosylation sites are found in the deduced amino acid sequence. The second peroxidase cDNA, designated phTPO-2, is almost identical to phTPO-1 beginning 605 base pairs downstream except that it contains 1-base-pair difference and lacks 171 base pairs in the middle of the sequence. This results in a loss of 57 amino acids corresponding to a molecular weight of 6282. Interestingly, this 171-nucleotide sequence has GT and AG at its 5' and 3' boundaries, respectively, that are in good agreement with donor and acceptor splice site consensus sequences. Using specific oligonucleotide probes for the mRNAs derived from the cDNA sequences hTOP-1 and hTOP-2, the authors show that both are expressed in all thyroid tissues examined and the relative level of two mRNAs is different in each sample. The results suggest that two thyroid peroxidase proteins might be generated through alternate splicing of the same gene. By using somatic cell hybrid lines, the thyroid peroxidase gene was mapped to the short arm of human chromosome 2

  16. ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

    Science.gov (United States)

    Meiler, Arno; Klinger, Claudia; Kaufmann, Michael

    2012-09-08

    The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  17. Isolation and amino acid sequence of a dehydratase acting on d-erythro-3-hydroxyaspartate from Pseudomonas sp. N99, and its application in the production of optically active 3-hydroxyaspartate.

    Science.gov (United States)

    Nagano, Hiroyuki; Shibano, Kana; Matsumoto, Yu; Yokota, Atsushi; Wada, Masaru

    2017-06-01

    An enzyme catalyzing the ammonia-lyase reaction for the conversion of d-erythro-3-hydroxyaspartate to oxaloacetate was purified from the cell-free extract of a soil-isolated bacterium Pseudomonas sp. N99. The enzyme exhibited ammonia-lyase activity toward l-threo-3-hydroxyaspartate and d-erythro-3-hydroxyaspartate, but not toward other 3-hydroxyaspartate isomers. The deduced amino acid sequence of the enzyme, which belongs to the serine/threonine dehydratase family, shows similarity to the sequence of l-threo-3-hydroxyaspartate ammonia-lyase (EC 4.3.1.16) from Pseudomonas sp. T62 (74%) and Saccharomyces cerevisiae (64%) and serine racemase from Schizosaccharomyces pombe (65%). These results suggest that the enzyme is similar to l-threo-3-hydroxyaspartate ammonia-lyase from Pseudomonas sp. T62, which does not act on d-erythro-3-hydroxyaspartate. We also then used the recombinant enzyme expressed in Escherichia coli to produce optically pure l-erythro-3-hydroxyaspartate and d-threo-3-hydroxyaspartate from the corresponding dl-racemic mixtures. The enzymatic resolution reported here is one of the simplest and the first enzymatic method that can be used for obtaining optically pure l-erythro-3-hydroxyaspartate.

  18. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States); Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  19. Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites.

    Science.gov (United States)

    Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

    2012-10-01

    To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.

  20. Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

    Directory of Open Access Journals (Sweden)

    Samat Kozhakhmetov

    2014-01-01

    Full Text Available Introduction: Lactobacilli are a bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microoganism, generally regarded as safe (GRAS. It is also a homofermentative L-(+-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of the phenotypic and genotypic diversity. However, the studies of the genus were conducted mostly with the unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of Center for life sciences. After screaning isolates for probiotic properties, a highly active Lactobacillus spp strain was chosen. Genomic DNA was extracted according to the manufacturing protocol (Wizard® Genomic DNA Purification Kit. The Lactobacillus rhamnosus strain was identified as the highly active Lactobacillus strain accoridng to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX platforms. The initial draft assembly was prepared from 14 large contigs (20 all contigs by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT. Conclusion: A full genome-sequencing of selected strains of lactic acid bacteria was made during the study.

  1. Isolation and amino acid sequence of corticotropin-releasing factor from pig hypothalami.

    OpenAIRE

    Patthy, M; Horvath, J; Mason-Garcia, M; Szoke, B; Schlesinger, D H; Schally, A V

    1985-01-01

    A polypeptide was isolated from acid extracts of porcine hypothalami on the basis of its high ability to stimulate the release of corticotropin from superfused rat pituitary cells. After an initial separation by gel filtration on Sephadex G-25, further purification was carried out by reversed-phase HPLC. The isolated material was homogeneous chromatographically and by N-terminal sequencing. Based on automated gas-phase sequencing of the intact and CNBr-cleaved peptide and on carboxypeptidase ...

  2. Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus

    DEFF Research Database (Denmark)

    Hansen, T S; Andreasen, P H; Dreisig, H

    1991-01-01

    We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 ...... by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.......We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63...... nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified...

  3. cDNA cloning and immunological characterization of the rye grass allergen Lol p I.

    Science.gov (United States)

    Perez, M; Ishioka, G Y; Walker, L E; Chesnut, R W

    1990-09-25

    The complete amino acid sequence of two "isoallergenic" forms of Lol p I, the major rye grass (Lolium perenne) pollen allergen, was deduced from cDNA sequence analysis. cDNA clones isolated from a Lolium perenne pollen library contained an open reading frame coding for a 240-amino acid protein. Comparison of the nucleotide and deduced amino acid sequence of two of these clones revealed four changes at the amino acid level and numerous nucleotide differences. Both clones contained one possible asparagine-linked glycosylation site. Northern blot analysis shows one RNA species of 1.2 kilobases. Based on the complete amino acid sequence of Lol p I, overlapping peptides covering the entire molecule were synthesized. Utilizing these peptides we have identified a determinant within the Lol p I molecule that is recognized by human leukocyte antigen class II-restricted T cells obtained from persons allergic to rye grass pollen.

  4. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Amino acid sequences of predicted proteins and their annotation for 95 organis...m species. Data detail Data name Amino acid sequences of predicted proteins and their annotation for 95 orga...nism species. DOI 10.18908/lsdba.nbdc00464-001 Description of data contents Amino acid sequences of predicted proteins...Database Description Download License Update History of This Database Site Policy | Contact Us Amino acid sequences of predicted prot...eins and their annotation for 95 organism species. - Gclust Server | LSDB Archive ...

  5. Cloning and sequence analysis of putative type II fatty acid synthase ...

    Indian Academy of Sciences (India)

    Prakash

    Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L. ... acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, β-ketoacyl-ACP .... Helix II plays a dominant role in the interaction ... main distinguishing features of plant ACPs in plastids and ..... synthase component; J. Biol.

  6. Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

    Science.gov (United States)

    Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

    2015-03-01

    The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.

  7. Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

    Science.gov (United States)

    DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

    2015-01-01

    The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630

  8. Formation of conjugated delta8,delta10-double bonds by delta12-oleic-acid desaturase-related enzymes: biosynthetic origin of calendic acid.

    Science.gov (United States)

    Cahoon, E B; Ripp, K G; Hall, S E; Kinney, A J

    2001-01-26

    Divergent forms of the plant Delta(12)-oleic-acid desaturase (FAD2) have previously been shown to catalyze the formation of acetylenic bonds, epoxy groups, and conjugated Delta(11),Delta(13)-double bonds by modification of an existing Delta(12)-double bond in C(18) fatty acids. Here, we report a class of FAD2-related enzymes that modifies a Delta(9)-double bond to produce the conjugated trans-Delta(8),trans-Delta(10)-double bonds found in calendic acid (18:3Delta(8trans,10trans,12cis)), the major component of the seed oil of Calendula officinalis. Using an expressed sequence tag approach, cDNAs for two closely related FAD2-like enzymes, designated CoFADX-1 and CoFADX-2, were identified from a C. officinalis developing seed cDNA library. The deduced amino acid sequences of these polypeptides share 40-50% identity with those of other FAD2 and FAD2-related enzymes. Expression of either CoFADX-1 or CoFADX-2 in somatic soybean embryos resulted in the production of calendic acid. In embryos expressing CoFADX-2, calendic acid accumulated to as high as 22% (w/w) of the total fatty acids. In addition, expression of CoFADX-1 and CoFADX-2 in Saccharomyces cerevisiae was accompanied by calendic acid accumulation when induced cells were supplied exogenous linoleic acid (18:2Delta(9cis,12cis)). These results are thus consistent with a route of calendic acid synthesis involving modification of the Delta(9)-double bond of linoleic acid. Regiospecificity for Delta(9)-double bonds is unprecedented among FAD2-related enzymes and further expands the functional diversity found in this family of enzymes.

  9. Isolation and complete amino acid sequence of human thymopoietin and splenin

    International Nuclear Information System (INIS)

    Audhya, T.; Schlesinger, D.H.; Goldstein, G.

    1987-01-01

    Human thymopoietin and splenin were isolated from human thymus and spleen, respectively, by monitoring tissue fractionation with a bovine thymopoietin RIA cross-reactive with human thymopoietin and splenin. Bovine thymopoietin and splenin are 49-amino acid polypeptides that differ by only 2 amino acids at positions 34 and 43; the change at position 34 in the active-site region changes the receptor specificities and biological activities. The complete amino acid sequences of purified human thymopoietin and splenin were determined and shown to be 48-amino acid polypeptides differing at four positions. Ten amino acids, constant within each species for thymopoietin and splenin, differ between the human and bovine polypeptides. The pentapeptide active side of thymopoietin (residues 32-36) is constant between the human and bovine thymopoietins, but position 34 in the active site of splenin has changed from glutamic acid in bovine splenin to alanine in human splenin, accounting for the biological activity of the human but not the bovine splenin on the human T-cell line MOLT-4

  10. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    Science.gov (United States)

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-07

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Coding sequence of human rho cDNAs clone 6 and clone 9

    Energy Technology Data Exchange (ETDEWEB)

    Chardin, P; Madaule, P; Tavitian, A

    1988-03-25

    The authors have isolated human cDNAs including the complete coding sequence for two rho proteins corresponding to the incomplete isolates previously described as clone 6 and clone 9. The deduced a.a. sequences, when compared to the a.a. sequence deduced from clone 12 cDNA, show that there are in human at least three highly homologous rho genes. They suggest that clone 12 be named rhoA, clone 6 : rhoB and clone 9 : rhoC. RhoA, B and C proteins display approx. 30% a.a. identity with ras proteins,. mainly clustered in four highly homologous internal regions corresponding to the GTP binding site; however at least one significant difference is found; the 3 rho proteins have an Alanine in position corresponding to ras Glycine 13, suggesting that rho and ras proteins might have slightly different biochemical properties.

  12. An alignment-free method to find similarity among protein sequences via the general form of Chou's pseudo amino acid composition.

    Science.gov (United States)

    Gupta, M K; Niyogi, R; Misra, M

    2013-01-01

    In this paper, we propose a method to create the 60-dimensional feature vector for protein sequences via the general form of pseudo amino acid composition. The construction of the feature vector is based on the contents of amino acids, total distance of each amino acid from the first amino acid in the protein sequence and the distribution of 20 amino acids. The obtained cosine distance metric (also called the similarity matrix) is used to construct the phylogenetic tree by the neighbour joining method. In order to show the applicability of our approach, we tested it on three proteins: 1) ND5 protein sequences from nine species, 2) ND6 protein sequences from eight species, and 3) 50 coronavirus spike proteins. The results are in agreement with known history and the output from the multiple sequence alignment program ClustalW, which is widely used. We have also compared our phylogenetic results with six other recently proposed alignment-free methods. These comparisons show that our proposed method gives a more consistent biological relationship than the others. In addition, the time complexity is linear and space required is less as compared with other alignment-free methods that use graphical representation. It should be noted that the multiple sequence alignment method has exponential time complexity.

  13. Cloning, nucleotide sequence and transcriptional analysis of the uvrA gene from Neisseria gonorrhoeae

    International Nuclear Information System (INIS)

    Black, C.G.; Fyfe, J.A.M.; Davies, J.K.

    1997-01-01

    A recombinant plasmid capable of restoring UV resistance to an Escherichia coli uvrA mutant was isolated from a genomic library of Neisseria gonorrhoeae. Sequence analysis revealed an open reading frame whose deduced amino acid sequence displayed significant similarity to those of the UvrA proteins of other bacterial species. A second open reading frame (ORF259) was identified upstream from, and in the opposite orientation to the gonococcal uvrA gene. Transcriptional fusions between portions of the gonococcal uvrA upstream region and a reporter gene were used to localise promoter activity in both E. coli and N. gonorrhoeae. The transcriptional starting points of uvrA and ORF259 were mapped in E. coli by primer extension analysis, and corresponding σ 70 promoters were identified. The arrangement of the uvrA-ORF259 intergenic region is similar to that of the gonococcal recA-aroD intergenic region. Both contain inverted copies of the 10 bp neisserial DNA uptake sequence situated between divergently transcribed genes. However, there is no evidence that either the uptake sequence or the proximity of the promoters influences expression of these genes. (author)

  14. A second pectin lyase gene (pel2) from Aspergillus oryzae KBN616: its sequence analysis and overexpression, and characterization of the gene products.

    Science.gov (United States)

    Kitamoto, N; Yoshino-Yasuda, S; Ohmiya, K; Tsukagoshi, N

    2001-01-01

    A second pectin lyase gene, designated pel2, was isolated from a shoyu koji mold Aspergillus oryzae KBN616 and characterized. The structural gene comprised 1306 bp with three introns. The ORF encoded 375 amino acids with a signal peptide of 19 amino acids. The deduced amino acid sequence showed high similarity to those of A. oryzae Pel1, Aspergillus niger pectin lyases and Glomerella cingulata Pn1A. The pel2 gene was overexpressed under the control of the promoter of the A. oryzae TEF1 gene for purification and enzymatic characterization of its gene product. The gene product exhibited two molecular masses of 48 and 44 kDa due to different degrees of glycosylation. Both proteins had the same pH optimum of 6.0 and temperature optimum of 50 degrees C.

  15. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66

    Directory of Open Access Journals (Sweden)

    Bin Liu

    2016-06-01

    Full Text Available Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA. Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276, with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.

  16. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    OpenAIRE

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.; Almstrand, Robert

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome.

  17. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-01-01

    operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching

  18. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Directory of Open Access Journals (Sweden)

    Meiler Arno

    2012-09-01

    Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  19. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Science.gov (United States)

    2012-01-01

    Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836

  20. Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores

    Directory of Open Access Journals (Sweden)

    Maréchal Eric

    2008-08-01

    Full Text Available Abstract Background Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2 following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. Results We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure. Homologous sequences were considered as systems 1 having a high redundancy of information reflected by the magnitude of their alignment scores, 2 which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a

  1. SequenceCEROSENE: a computational method and web server to visualize spatial residue neighborhoods at the sequence level.

    Science.gov (United States)

    Heinke, Florian; Bittrich, Sebastian; Kaiser, Florian; Labudde, Dirk

    2016-01-01

    To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represent three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can aid to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial

  2. Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

    Science.gov (United States)

    Sakoda, H; Imanaka, T

    1992-02-01

    Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH.

  3. Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites*

    Science.gov (United States)

    Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

    2012-01-01

    To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi’an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi’an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%–99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites. PMID:23024043

  4. Functional characterization of two microsomal fatty acid desaturases from Jatropha curcas L.

    Science.gov (United States)

    Wu, Pingzhi; Zhang, Sheng; Zhang, Lin; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-10-15

    Linoleic acid (LA, C18:2) and α-linolenic acid (ALA, C18:3) are polyunsaturated fatty acids (PUFAs) and major storage compounds in plant seed oils. Microsomal ω-6 and ω-3 fatty acid (FA) desaturases catalyze the synthesis of seed oil LA and ALA, respectively. Jatropha curcas L. seed oils contain large proportions of LA, but very little ALA. In this study, two microsomal desaturase genes, named JcFAD2 and JcFAD3, were isolated from J. curcas. Both deduced amino acid sequences possessed eight histidines shown to be essential for desaturases activity, and contained motif in the C-terminal for endoplasmic reticulum localization. Heterologous expression in Saccharomyces cerevisiae and Arabidopsis thaliana confirmed that the isolated JcFAD2 and JcFAD3 proteins could catalyze LA and ALA synthesis, respectively. The results indicate that JcFAD2 and JcFAD3 are functional in controlling PUFA contents of seed oils and could be exploited in the genetic engineering of J. curcas, and potentially other plants. Copyright © 2013 Elsevier GmbH. All rights reserved.

  5. Arachidonic Acid Metabolism in the Nervous System; Physiological and Pathological Significance. Annals of the New York Academy of Science. Volume 5

    Science.gov (United States)

    1989-01-01

    488 Primary Structure of Rat Brain Prostaglandin D Synthetase Deduced from the cDNA Sequence. By YOSHIHIRO URADE, AKIHISA NAGATA, YASUHIKO SUZUKI...Synthetase Deduced from the cDNA Sequence YOSHIHIRO URADE,a AKIHISA NAGATA,b YASUHIKO SUZUKI,b YUTAKA FUJII,c AND OSAMU HAYAISHI a aDepartment of Enzymes

  6. Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

    International Nuclear Information System (INIS)

    Feild, M.J.; Armstrong, F.B.

    1987-01-01

    E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and [ 3 H]-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealed limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region

  7. Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

    Science.gov (United States)

    Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

    2014-09-18

    Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.

  8. Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).

    Science.gov (United States)

    Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

    2014-06-01

    The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  9. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Science.gov (United States)

    2010-07-01

    ... may not include material other than part of the sequence listing. A fixed-width font should be used... integer expressing the number of bases or amino acid residues M. Type Whether presented sequence molecule is DNA, RNA, or PRT (protein). If a nucleotide sequence contains both DNA and RNA fragments, the type...

  10. Isolation and expression of the Pneumocystis carinii thymidylate synthase gene

    DEFF Research Database (Denmark)

    Edman, U; Edman, J C; Lundgren, B

    1989-01-01

    The thymidylate synthase (TS) gene from Pneumocystis carinii has been isolated from complementary and genomic DNA libraries and expressed in Escherichia coli. The coding sequence of TS is 891 nucleotides, encoding a 297-amino acid protein of Mr 34,269. The deduced amino acid sequence is similar...

  11. Increased mRNA expression of a laminin-binding protein in human colon carcinoma: Complete sequence of a full-length cDNA encoding the protein

    International Nuclear Information System (INIS)

    Yow, Hsiukang; Wong, Jau Min; Chen, Hai Shiene; Lee, C.; Steele, G.D. Jr.; Chen, Lanbo

    1988-01-01

    Reliable markers to distinguish human colon carcinoma from normal colonic epithelium are needed particularly for poorly differentiated tumors where no useful marker is currently available. To search for markers the authors constructed cDNA libraries from human colon carcinoma cell lines and screened for clones that hybridize to a greater degree with mRNAs of colon carcinomas than with their normal counterparts. Here they report one such cDNA clone that hybridizes with a 1.2-kilobase (kb) mRNA, the level of which is ∼9-fold greater in colon carcinoma than in adjacent normal colonic epithelium. Blot hybridization of total RNA from a variety of human colon carcinoma cell lines shows that the level of this 1.2-kb mRNA in poorly differentiated colon carcinomas is as high as or higher than that in well-differentiated carcinomas. Molecular cloning and complete sequencing of cDNA corresponding to the full-length open reading frame of this 1.2-kb mRNA unexpectedly show it to contain all the partial cDNA sequence encoding 135 amino acid residues previously reported for a human laminin receptor. The deduced amino acid sequence suggests that this putative laminin-binding protein from human colon carcinomas consists of 295 amino acid residues with interesting features. There is an unusual C-terminal 70-amino acid segment, which is trypsin-resistant and highly negatively charged

  12. Random amino acid mutations and protein misfolding lead to Shannon limit in sequence-structure communication.

    Directory of Open Access Journals (Sweden)

    Andreas Martin Lisewski

    2008-09-01

    Full Text Available The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close to native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions and in structure (structural defects trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a sensitive to random errors and (b restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials.

  13. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    International Nuclear Information System (INIS)

    Chang, Soo-Ik; Hammes, G.G.

    1989-01-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution

  14. Amino acid sequence analysis of the annexin super-gene family of proteins.

    Science.gov (United States)

    Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

    1991-06-15

    The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of

  15. Detection and quantification of Plasmodium falciparum in blood samples using quantitative nucleic acid sequence-based amplification

    NARCIS (Netherlands)

    Schoone, G. J.; Oskam, L.; Kroon, N. C.; Schallig, H. D.; Omar, S. A.

    2000-01-01

    A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the

  16. Primary structure of human pancreatic elastase 2 determined by sequence analysis of the cloned mRNA

    International Nuclear Information System (INIS)

    Fletcher, T.S.; Shen, W.F.; Largman, C.

    1987-01-01

    A cDNA encoding elastase 2 has been cloned from a human pancreatic cDNA library. The cDNA contains a translation initiation site and a poly(A) recognition site and encodes a protein of 269 amino acids, including a proposed 16-residue signal peptide. The amino acid sequence of the deduced mature protein contains a 12-residue activation peptide containing a cysteine at residue 1 similar to that of chymotryspin. The proposed active enzyme contains all of the characteristic active-site amino acids, including His-57, Asp-102, and Ser-195. The S1 binding pocket is bounded by Gly-216 and Ser-226, making this pocket intermediate in size between chymotrypsins and elastase 1 or protease E, consistent with the substrate specificity of elastase 2 for long-chain aliphatic or aromatic amino acids. Computer modeling studies using the amino acid sequence of elastase 2 superimposed on the X-ray structure of porcine elastase 1 suggest that a change of Gln-192 in elastase 1 to Asn-192 in elastase 2 may account for the lower catalytic efficiency of the latter enzyme. Several basic residues appear to be near the ends of the extended binding pocket of elastases which might serve to anchor the enzyme to the elastin substrate. These studies indicate that elastases 2 and elastase 1 both contain an Arg-65A as well as a basic dipeptide at 223/224 which is not present in chymotrypsins. In addition, Arg-217A is present in humaan elastase 2 but absent in rat pancreatic protein which has been proposed to be an elastase 2 on the basis of sequence homology, but which was not isolated during screening of rat pancreatic tissue extracts for elastolytic activity

  17. Coupling constants deduced for the resonances in kaon photo-production

    International Nuclear Information System (INIS)

    Cheoun, M. K.; Kim, K. S.; Choi, T. K.

    2004-01-01

    We deduced the coupling constants of nucleon and hyperon resonances, which participate in kaon productions as intermediate states that are formed by electro-magnetic probes and that finally decay into hadronic final states. We used an isobaric model based on an effective Lagrangian approach to describe the processes, in which relevant coupling constants regarding related resonances are effectively determined by fitting available experimental data. Our scheme to deduce the coupling constants was as follows: First, we calculated the lower and the upper limits on the coupling constants by using the experimental decay data available until now and/or theoretical predictions, such as those from quark models and SU(3) symmetry. Second, we exploited those limits as physical constraints on our fitting scheme for the kaon photo-production data. Finally, the deduced values and regions of the coupling constants, which satisfy not only the reaction data but also the decay data, are presented as figures with respect to the strong and the electro-magnetic coupling constants, and their multiplicative values. Our results for the coupling constants give physical values that are more restricted than those allowed by the experimental data nowadays.

  18. Ubiquitous distribution of fluorescent protein in muscles of four ...

    Indian Academy of Sciences (India)

    AKI FUNAHASHI

    The deduced amino acid sequences among the four species and two subspecies exhibited 91.4–100% identity, and ... The comparison of amino acid sequences revealed two common substitutions in A. ..... tion of tissues exhibiting energy metabolism (Zschiesche et al. ... organization of the neurons (Rakic 1971; Feng et al.

  19. Differentiation of highly virulent strains of Streptococcus suis serotype 2 according to glutamate dehydrogenase electrophoretic and sequence type.

    Science.gov (United States)

    Kutz, Russell; Okwumabua, Ogi

    2008-10-01

    The glutamate dehydrogenase (GDH) enzymes of 19 Streptococcus suis serotype 2 strains, consisting of 18 swine isolates and 1 human clinical isolate from a geographically varied collection, were analyzed by activity staining on a nondenaturing gel. All seven (100%) of the highly virulent strains tested produced an electrophoretic type (ET) distinct from those of moderately virulent and nonvirulent strains. By PCR and nucleotide sequence determination, the gdh genes of the 19 strains and of 2 highly virulent strains involved in recent Chinese outbreaks yielded a 1,820-bp fragment containing an open reading frame of 1,344 nucleotides, which encodes a protein of 448 amino acid residues with a calculated molecular mass of approximately 49 kDa. The nucleotide sequences contained base pair differences, but most were silent. Cluster analysis of the deduced amino acid sequences separated the isolates into three groups. Group I (ETI) consisted of the seven highly virulent isolates and the two Chinese outbreak strains, containing Ala(299)-to-Ser, Glu(305)-to-Lys, and Glu(330)-to-Lys amino acid substitutions compared with groups II and III (ETII). Groups II and III consisted of moderately virulent and nonvirulent strains, which are separated from each other by Tyr(72)-to-Asp and Thr(296)-to-Ala substitutions. Gene exchange studies resulted in the change of ETI to ETII and vice versa. A spectrophotometric activity assay for GDH did not show significant differences between the groups. These results suggest that the GDH ETs and sequence types may serve as useful markers in predicting the pathogenic behavior of strains of this serotype and that the molecular basis for the observed differences in the ETs was amino acid substitutions and not deletion, insertion, or processing uniqueness.

  20. Metazoan Remaining Genes for Essential Amino Acid Biosynthesis: Sequence Conservation and Evolutionary Analyses

    Directory of Open Access Journals (Sweden)

    Igor R. Costa

    2014-12-01

    Full Text Available Essential amino acids (EAA consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS and betaine-homocysteine S-methyltransferase (BHMT diverged from the expected Tree of Life (ToL relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.

  1. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    Science.gov (United States)

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  2. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    Science.gov (United States)

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein. 2010 American Society for Mass Spectrometry. Published by Elsevier Inc. All rights reserved.

  3. Application of Ammonium Persulfate for Selective Oxidation of Guanines for Nucleic Acid Sequencing

    Directory of Open Access Journals (Sweden)

    Yafen Wang

    2017-07-01

    Full Text Available Nucleic acids can be sequenced by a chemical procedure that partially damages the nucleotide positions at their base repetition. Many methods have been reported for the selective recognition of guanine. The accurate identification of guanine in both single and double regions of DNA and RNA remains a challenging task. Herein, we present a new, non-toxic and simple method for the selective recognition of guanine in both DNA and RNA sequences via ammonium persulfate modification. This strategy can be further successfully applied to the detection of 5-methylcytosine by using PCR.

  4. The amino acid sequence of cytochrome c from Cucurbita maxima L. (pumpkin)

    Science.gov (United States)

    Thompson, E. W.; Richardson, M.; Boulter, D.

    1971-01-01

    The amino acid sequence of pumpkin cytochrome c was determined on 2μmol of protein. Some evidence was found for the occurrence of two forms of cytochrome c, whose sequences differed in three positions. Pumpkin cytochrome c consists of 111 residues and is homologous with mitochondrial cytochromes c from other plants. Experimental details are given in a supplementary paper that has been deposited as Supplementary Publication SUP 50005 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1971), 121, 7. PMID:5131733

  5. Citrate synthase gene sequence: a new tool for phylogenetic analysis and identification of Ehrlichia.

    Science.gov (United States)

    Inokuma, H; Brouqui, P; Drancourt, M; Raoult, D

    2001-09-01

    The sequence of the citrate synthase gene (gltA) of 13 ehrlichial species (Ehrlichia chaffeensis, Ehrlichia canis, Ehrlichia muris, an Ehrlichia species recently detected from Ixodes ovatus, Cowdria ruminantium, Ehrlichia phagocytophila, Ehrlichia equi, the human granulocytic ehrlichiosis [HGE] agent, Anaplasma marginale, Anaplasma centrale, Ehrlichia sennetsu, Ehrlichia risticii, and Neorickettsia helminthoeca) have been determined by degenerate PCR and the Genome Walker method. The ehrlichial gltA genes are 1,197 bp (E. sennetsu and E. risticii) to 1,254 bp (A. marginale and A. centrale) long, and GC contents of the gene vary from 30.5% (Ehrlichia sp. detected from I. ovatus) to 51.0% (A. centrale). The percent identities of the gltA nucleotide sequences among ehrlichial species were 49.7% (E. risticii versus A. centrale) to 99.8% (HGE agent versus E. equi). The percent identities of deduced amino acid sequences were 44.4% (E. sennetsu versus E. muris) to 99.5% (HGE agent versus E. equi), whereas the homology range of 16S rRNA genes was 83.5% (E. risticii versus the Ehrlichia sp. detected from I. ovatus) to 99.9% (HGE agent, E. equi, and E. phagocytophila). The architecture of the phylogenetic trees constructed by gltA nucleotide sequences or amino acid sequences was similar to that derived from the 16S rRNA gene sequences but showed more-significant bootstrap values. Based upon the alignment analysis of the ehrlichial gltA sequences, two sets of primers were designed to amplify tick-borne Ehrlichia and Neorickettsia genogroup Ehrlichia (N. helminthoeca, E. sennetsu, and E. risticii), respectively. Tick-borne Ehrlichia species were specifically identified by restriction fragment length polymorphism (RFLP) patterns of AcsI and XhoI with the exception of E. muris and the very closely related ehrlichia derived from I. ovatus for which sequence analysis of the PCR product is needed. Similarly, Neorickettsia genogroup Ehrlichia species were specifically identified by

  6. PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.

    Science.gov (United States)

    Kuznetsov, Igor B; McDuffie, Michael

    2015-05-07

    Alignment of amino acid sequences is the main sequence comparison method used in computational molecular biology. The selection of the amino acid substitution matrix best suitable for a given alignment problem is one of the most important decisions the user has to make. In a conventional amino acid substitution matrix all elements are fixed and their values cannot be easily adjusted. Moreover, most existing amino acid substitution matrices account for the average (dis)similarities between amino acid types and do not distinguish the contribution of a specific biochemical property to these (dis)similarities. PR2ALIGN is a stand-alone software program and a web-server that provide the functionality for implementing flexible user-specified alignment scoring functions and aligning pairs of amino acid sequences based on the comparison of the profiles of biochemical properties of these sequences. Unlike the conventional sequence alignment methods that use 20x20 fixed amino acid substitution matrices, PR2ALIGN uses a set of weighted biochemical properties of amino acids to measure the distance between pairs of aligned residues and to find an optimal minimal distance global alignment. The user can provide any number of amino acid properties and specify a weight for each property. The higher the weight for a given property, the more this property affects the final alignment. We show that in many cases the approach implemented in PR2ALIGN produces better quality pair-wise alignments than the conventional matrix-based approach. PR2ALIGN will be helpful for researchers who wish to align amino acid sequences by using flexible user-specified alignment scoring functions based on the biochemical properties of amino acids instead of the amino acid substitution matrix. To the best of the authors' knowledge, there are no existing stand-alone software programs or web-servers analogous to PR2ALIGN. The software is freely available from http://pr2align.rit.albany.edu.

  7. A Novel Phytase with Sequence Similarity to Purple Acid Phosphatases Is Expressed in Cotyledons of Germinating Soybean Seedlings 1

    Science.gov (United States)

    Hegeman, Carla E.; Grabau, Elizabeth A.

    2001-01-01

    Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases. PMID:11500558

  8. The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

    Science.gov (United States)

    Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

    1986-07-01

    Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.

  9. Hydroquinone: O-glucosyltransferase from cultivated Rauvolfia cells: enrichment and partial amino acid sequences.

    Science.gov (United States)

    Arend, J; Warzecha, H; Stöckigt, J

    2000-01-01

    Plant cell suspension cultures of Rauvolfia are able to produce a high amount of arbutin by glucosylation of exogenously added hydroquinone. A four step purification procedure using anion exchange, hydrophobic interaction, hydroxyapatite-chromatography and chromatofocusing delivered in a yield of 0.5%, an approximately 390 fold enrichment of the involved glucosyltransferase. SDS-PAGE showed a M(r) for the enzyme of 52 kDa. Proteolysis of the pure enzyme with endoproteinase LysC revealed six peptide fragments with 9-23 amino acids which were sequenced. Sequence alignment of the six peptides showed high homologies to glycosyltransferases from other higher plants.

  10. Characterization of glycoprotein C of HSZP strain of herpes simplex virus 1

    NARCIS (Netherlands)

    Oravcova, [No Value; Kudelova, M; Mlcuchova, J; Matis, J; Bystricka, M; Westra, DF; Welling-Wester, S; Rajcani, J

    Sequences of UL44 genes of strains HSZP, KOS and 17 of herpes simplex virus 1 (HSV-1) were determined and the amino acid sequences of corresponding glycoproteins (gC) were deduced. In comparison with the 17 strain, the HSZP strain showed specific changes in 3 nucleotides and in 2 amino acids (aa 139

  11. Evolution of sequence-defined highly functionalized nucleic acid polymers

    Science.gov (United States)

    Chen, Zhen; Lichtor, Phillip A.; Berliner, Adrian P.; Chen, Jonathan C.; Liu, David R.

    2018-03-01

    The evolution of sequence-defined synthetic polymers made of building blocks beyond those compatible with polymerase enzymes or the ribosome has the potential to generate new classes of receptors, catalysts and materials. Here we describe a ligase-mediated DNA-templated polymerization and in vitro selection system to evolve highly functionalized nucleic acid polymers (HFNAPs) made from 32 building blocks that contain eight chemically diverse side chains on a DNA backbone. Through iterated cycles of polymer translation, selection and reverse translation, we discovered HFNAPs that bind proprotein convertase subtilisin/kexin type 9 (PCSK9) and interleukin-6, two protein targets implicated in human diseases. Mutation and reselection of an active PCSK9-binding polymer yielded evolved polymers with high affinity (KD = 3 nM). This evolved polymer potently inhibited the binding between PCSK9 and the low-density lipoprotein receptor. Structure-activity relationship studies revealed that specific side chains at defined positions in the polymers are required for binding to their respective targets. Our findings expand the chemical space of evolvable polymers to include densely functionalized nucleic acids with diverse, researcher-defined chemical repertoires.

  12. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

    Science.gov (United States)

    Li, Yushuang; Song, Tian; Yang, Jiasheng; Zhang, Yi; Yang, Jialiang

    2016-01-01

    In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method into the ND5 dataset consisting of the ND5 protein sequences of 9 species, and the F10 and G11 datasets representing two of the xylanases containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of other 5 popular alignment-free methods. In addition, we successfully separate the xylanases sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acids content ratio vector.

  13. Cloning and sequence of cDNA encoding 1-aminocyclo- propane-1-carboxylate oxidase in Vanda flowers

    Directory of Open Access Journals (Sweden)

    Pattana Srifah Huehne

    2013-08-01

    Full Text Available The 1-aminocyclopropane-1-carboxylate oxidase (ACO gene in the final step of ethylene biosynthesis was isolated from ethylene-sensitive Vanda Miss Joaquim flowers. This consists of 1,242 base pairs (bp encoding for 326 amino acid residues. To investigate the specific divergence in orchid ACO sequences, the deduced Vanda ACO was aligned with five other orchid ACOs. The results reveal that the ACO sequences within Doritaenopsis, Phalaenopsis and Vanda show highly conserved and almost 95% identical homology, while the ACOs isolated from Cymbidium, Dendrobium and Cattleya are 8788% identical to Vanda ACO. In addition, the 2-oxoglutarate- Fe(II_oxygenase (Oxy domain of orchid ACOs consists of a higher degree of amino acid conservation than that of the non-haem dioxygenase (DIOX_N domain. The overall homology regions of Vanda ACO are commonly folded into 12 α-helices and 12 β-sheets similar to the three dimensional template-structure of Petunia ACO. This Vanda ACO cloned gene is highly expressed in flower tissue compared with root and leaf tissues. In particular, there is an abundance of ACO transcript accumulation in the column followed by the lip and the perianth of Vanda Miss Joaquim flowers at the fully-open stage.

  14. A recombinant wheat serpin with inhibitory activity

    DEFF Research Database (Denmark)

    Rasmussen, Søren K; Dahl, Søren Weis; Nørgård, Anette

    1996-01-01

    A full-length clone encoding the wheat (Triticum aestivum L.) serpin WSZ1 was isolated from a cDNA library based on mRNA from immature grain. The 398 amino acid sequence deduced from the cDNA was corroborated by sequencing CNBr peptides of WSZ1 purified from resting grain. WSZ1 belongs to the sub......A full-length clone encoding the wheat (Triticum aestivum L.) serpin WSZ1 was isolated from a cDNA library based on mRNA from immature grain. The 398 amino acid sequence deduced from the cDNA was corroborated by sequencing CNBr peptides of WSZ1 purified from resting grain. WSZ1 belongs...... sequencing indicated that only few serpins are encoded by wheat, but at least three distinct genes are expressed in the grain. Cleavage experiments on a chymotrypsin column suggested a Gln-Gln reactive site bond not previously observed in inhibitory serpins....

  15. Primary structure and localization of a conserved immunogenic Plasmodium falciparum glutamate rich protein (GLURP) expressed in both the preerythrocytic and erythrocytic stages of the vertebrate life cycle

    DEFF Research Database (Denmark)

    Borre, M B; Dziegiel, M; Høgh, B

    1991-01-01

    A gene coding for a 220-kDa glutamate rich protein (GLURP), an exoantigen of Plasmodium falciparum, was isolated and its nucleotide sequence was determined. The deduced amino acid sequence contains 2 repeat regions. The sequence of one of these was shown to be conserved among geographically...

  16. Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

    Science.gov (United States)

    Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

    1994-07-08

    The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family

  17. Serine protease isoforms in Gloydius intermedius venom: Full sequences, molecular phylogeny and evolutionary implications.

    Science.gov (United States)

    Yang, Zhang-Min; Yu, Hui; Liu, Zhen-Zhen; Pei, Jian-Zhu; Yang, Yu-E; Yan, Su-Xian; Zhang, Cui; Zhao, Wen-Long; Wang, Zhe-Zhi; Wang, Ying-Ming; Tsai, Inn-Ho

    2017-07-05

    Nine distinct venom serine proteases (vSPs) of Gloydius intermedius were studied by transcriptomic, sub-proteomic and phylogenetic analyses. Their complete amino acid sequences were deduced after Expression Sequence Tag (EST) analyses followed by cDNA cloning and sequencing. These vSPs appear to be paralogs and contain the catalytic triads and 1-4 potential N-glycosylation sites. Their relative expression levels evaluated by qPCR were grossly consistent with their EST hit-numbers. The major vSPs were purified by HPLC and their N-terminal sequences matched well to the deduced sequences, while fragments of the minor vSPs were detected by LC-MS/MS identification. Specific amidolytic activities of the fractions from HPLC and anion exchange separation were assayed using four chromogenic substrates, respectively. Molecular phylogenetic tree based on the sequences of these vSPs and their orthologs revealed six major clusters, one of them covered four lineages of plasminogen activator like vSPs. N-glycosylation patterns and variations for the vSPs are discussed. The high sequence similarities between G. intermedius vSPs and their respective orthologs from American pitvipers suggest that most of the isoforms evolved before Asian pitvipers migrated to the New World. Our results also indicate that the neurotoxic venoms contain more kallikrein-like vSPs and hypotensive components than the hemorrhagic venoms. Full sequences and expression levels of nine paralogous serine proteases (designated as GiSPs) of Gloydius intermedius venom have been studied. A kallikrein-like enzyme is most abundant and four isoforms homologous to venom plasminogen-activators are also expressed in this venom. Taken together, the present and previous data demonstrate that the neurotoxic G. intermedius venoms contain more hypotensive vSPs relative to other hemorrhagic pitviper venoms and the pitviper vSPs are highly versatile and diverse. Their structure-function relationships remain to be explored and

  18. Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

    Science.gov (United States)

    Nishizawa, M; Nishizawa, K

    2000-10-01

    The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.

  19. Discovery of the rare HLA-B*39:77 allele in an unrelated Taiwanese bone marrow stem cell donor using the sequence-based typing method.

    Science.gov (United States)

    Yang, K L; Lee, S K; Lin, P Y

    2013-08-01

    We detected a rare HLA-B locus allele, B*39:77, in a Taiwanese unrelated marrow stem cell donor in our routine HLA sequence-based typing (SBT) exercise for a possible haematopoietic stem cell donation. In exons 2, 3 and 4, the DNA sequence of B*39:77 is identical to the sequence of B*39:01:01:01 except one nucleotide at nucleotide position 733 (G->A) in exon 4. The nucleotide variation caused one amino acid alteration at residue 221 (Gly->Ser). B*39:77 was probably derived from a nucleotide substitution event involving B*39:01:01:01. The probable HLA-A, -B, -C, -DRB1 and -DQB1 haplotype in association with B*39:77 may be deduced as A*02:01-B*39:77-C*07:02-DRB1*08:03-DQB1*06:01. Our discovery of B*39:77 in Taiwanese adds further polymorphism of B*39 variants in Taiwanese population. © 2013 John Wiley & Sons Ltd.

  20. Sequence and Genetic Characterization of etrA, an fnr Analog that Regulates Anaerobic Respiration in Shewanella putrefaciens MR-1

    Science.gov (United States)

    Saffarini, Daad A.; Nelson, Kenneth H.

    1993-01-01

    An electron transport regulatory gene, etrA, has been isolated and characterized from the obligate respiratory bacterium Shewanella putrefaciens MR-l. The deduced amino acid sequence of etrA (EtrA) shows a high degree of identity to both the Fnr of Escherichia coli (73.6%) and the analogous protein (ANR) of Pseudomonas aeruginosa (50.8%). The four active cysteine residues of Fnr are conserved in EtrA, and the amino acid sequence of the DNA-binding domains of the two proteins are identical. Further, S.putrefaciens etrA is able to complement an fnr mutant of E.coli. In contrast to fnr, there is no recognizable Fnr box upstream of the etrA sequence. Gene replacement etr.A mutants of MR-1 were deficient in growth on nitrite, thiosulfate, sulfite, trimethylamine-N-oxide, dimethyl sulfoxide, Fe(III), and fumarate, suggesting that EtrA is involved in the regulation of the corresponding reductase genes. However, the mutants were all positive for reduction of and growth on nitrate and Mn(IV), indicating that EtrA is not involved in the regulation of these two systems. Southern blots of S.putrefaciens DNA with use of etrA as a probe revealed the expected etrA bands and a second set of hybridization signals whose genetic and functional properties remain to be determined.

  1. DNA Nucleotide Sequence Restricted by the RI Endonuclease

    Science.gov (United States)

    Hedgpeth, Joe; Goodman, Howard M.; Boyer, Herbert W.

    1972-01-01

    The sequence of DNA base pairs adjacent to the phosphodiester bonds cleaved by the RI restriction endonuclease in unmodified DNA from coliphage λ has been determined. The 5′-terminal nucleotide labeled with 32P and oligonucleotides up to the heptamer were analyzed from a pancreatic DNase digest. The following sequence of nucleotides adjacent to the RI break made in λ DNA was deduced from these data and from the 3′-dinucleotide sequence and nearest-neighbor analysis obtained from repair synthesis with the DNA polymerase of Rous sarcoma virus [Formula: see text] The RI endonuclease cleavage of the phosphodiester bonds (indicated by arrows) generates 5′-phosphoryls and short cohesive termini of four nucleotides, pApApTpT. The most striking feature of the sequence is its symmetry. PMID:4343974

  2. NHE3 in an ancestral vertebrate: primary sequence, distribution, localization, and function in gills.

    Science.gov (United States)

    Choe, Keith P; Kato, Akira; Hirose, Shigehisa; Plata, Consuelo; Sindic, Aleksandra; Romero, Michael F; Claiborne, J B; Evans, David H

    2005-11-01

    In mammals, the Na+/H+ exchanger 3 (NHE3) is expressed with Na+/K+-ATPase in renal proximal tubules, where it secretes H+ and absorbs Na+ to maintain blood pH and volume. In elasmobranchs (sharks, skates, and stingrays), the gills are the dominant site of pH and osmoregulation. This study was conducted to determine whether epithelial NHE homologs exist in elasmobranchs and, if so, to localize their expression in gills and determine whether their expression is altered by environmental salinity or hypercapnia. Degenerate primers and RT-PCR were used to deduce partial sequences of mammalian NHE2 and NHE3 homologs from the gills of the euryhaline Atlantic stingray (Dasyatis sabina). Real-time PCR was then used to demonstrate that mRNA expression of the NHE3 homolog increased when stingrays were transferred to low salinities but not during hypercapnia. Expression of the NHE2 homolog did not change with either treatment. Rapid amplification of cDNA was then used to deduce the complete sequence of a putative NHE3. The 2,744-base pair cDNA includes a coding region for a 2,511-amino acid protein that is 70% identical to human NHE3 (SLC9A3). Antisera generated against the carboxyl tail of the putative stingray NHE3 labeled the apical membranes of Na+/K+-ATPase-rich epithelial cells, and acclimation to freshwater caused a redistribution of labeling in the gills. This study provides the first NHE3 cloned from an elasmobranch and is the first to demonstrate an increase in gill NHE3 expression during acclimation to low salinities, suggesting that NHE3 can absorb Na+ from ion-poor environments.

  3. Evolutionary Steps in the Emergence of Life Deduced from the Bottom-Up Approach and GADV Hypothesis (Top-Down Approach).

    Science.gov (United States)

    Ikehara, Kenji

    2016-01-26

    It is no doubt quite difficult to solve the riddle of the origin of life. So, firstly, I would like to point out the kinds of obstacles there are in solving this riddle and how we should tackle these difficult problems, reviewing the studies that have been conducted so far. After that, I will propose that the consecutive evolutionary steps in a timeline can be rationally deduced by using a common event as a juncture, which is obtained by two counter-directional approaches: one is the bottom-up approach through which many researchers have studied the origin of life, and the other is the top-down approach, through which I established the [GADV]-protein world hypothesis or GADV hypothesis on the origin of life starting from a study on the formation of entirely new genes in extant microorganisms. Last, I will describe the probable evolutionary process from the formation of Earth to the emergence of life, which was deduced by using a common event-the establishment of the first genetic code encoding [GADV]-amino acids-as a juncture for the results obtained from the two approaches.

  4. A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

    Science.gov (United States)

    Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

    2008-12-01

    A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerated primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BbPA (Genbank Accession No. AY423440, named as podC). The full length cDNA is 1167bp long and contains a 1077bp open reading frame (ORF) encoding a 358 deduced amino acid peroxidase polypeptide. The putative peroxidase (BnPA) showed a calculated Mr of 34kDa, and isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA namely AtP29a (84%), and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.

  5. Molecular cloning and expression of gene encoding aromatic amino acid decarboxylase in 'Vidal blanc' grape berries.

    Science.gov (United States)

    Pan, Qiu-Hong; Chen, Fang; Zhu, Bao-Qing; Ma, Li-Yan; Li, Li; Li, Jing-Ming

    2012-04-01

    The pleasantly fruity and floral 2-phenylethanol are a dominant aroma compound in post-ripening 'Vidal blanc' grapes. However, to date little has been reported about its synthetic pathway in grapevine. In the present study, a full-length cDNA of VvAADC (encoding aromatic amino acid decarboxylase) was firstly cloned from the berries of 'Vidal blanc', an interspecific hybrid variety of Vitis vinifera × Vitis riparia. This sequence encodes a complete open reading frame of 482 amino acids with a calculated molecular mass of 54 kDa and isoelectric point value (pI) of 5.73. The amino acid sequence deduced shared about 79% identity with that of aromatic L: -amino acid decarboxylases (AADCs) from tomato. Real-time PCR analysis indicated that VvAADC transcript abundance presented a small peak at 110 days after full bloom and then a continuous increase at the berry post-ripening stage, which was consistent with the accumulation of 2-phenylethanol, but did not correspond to the trends of two potential intermediates, phenethylamine and 2-phenylacetaldehyde. Furthermore, phenylalanine still exhibited a continuous increase even in post-ripening period. It is thus suggested that 2-phenylethanol biosynthetic pathway mediated by AADC exists in grape berries, but it has possibly little contribution to a considerable accumulation of 2-phenylethanol in post-ripening 'Vidal blanc' grapes.

  6. Confirmation of a novel siadenovirus species detected in raptors: partial sequence and phylogenetic analysis.

    Science.gov (United States)

    Kovács, Endre R; Benko, Mária

    2009-03-01

    Partial genome characterisation of a novel adenovirus, found recently in organ samples of multiple species of dead birds of prey, was carried out by sequence analysis of PCR-amplified DNA fragments. The virus, named as raptor adenovirus 1 (RAdV-1), has originally been detected by a nested PCR method with consensus primers targeting the adenoviral DNA polymerase gene. Phylogenetic analysis with the deduced amino acid sequence of the small PCR product has implied a new siadenovirus type present in the samples. Since virus isolation attempts remained unsuccessful, further characterisation of this putative novel siadenovirus was carried out with the use of PCR on the infected organ samples. The DNA sequence of the central genome part of RAdV-1, encompassing nine full (pTP, 52K, pIIIa, III, pVII, pX, pVI, hexon, protease) and two partial (DNA polymerase and DBP) genes and exceeding 12 kb pairs in size, was determined. Phylogenetic tree reconstructions, based on several genes, unambiguously confirmed the preliminary classification of RAdV-1 as a new species within the genus Siadenovirus. Further study of RAdV-1 is of interest since it represents a rare adenovirus genus of yet undetermined host origin.

  7. Journal of Genetics | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    The deduced amino acid sequences among the four species and two subspecies exhibited 91.4–100% identity, and belonged to the fatty-acid-binding protein ... of Animal Wealth,Suez Canal University, Ismailia 12211, Egypt; Fish Farming and ...

  8. Mutations in type 3 reovirus that determine binding to sialic acid are contained in the fibrous tail domain of viral attachment protein sigma1.

    Science.gov (United States)

    Chappell, J D; Gunn, V L; Wetzel, J D; Baer, G S; Dermody, T S

    1997-03-01

    The reovirus attachment protein, sigma1, determines numerous aspects of reovirus-induced disease, including viral virulence, pathways of spread, and tropism for certain types of cells in the central nervous system. The sigma1 protein projects from the virion surface and consists of two distinct morphologic domains, a virion-distal globular domain known as the head and an elongated fibrous domain, termed the tail, which is anchored into the virion capsid. To better understand structure-function relationships of sigma1 protein, we conducted experiments to identify sequences in sigma1 important for viral binding to sialic acid, a component of the receptor for type 3 reovirus. Three serotype 3 reovirus strains incapable of binding sialylated receptors were adapted to growth in murine erythroleukemia (MEL) cells, in which sialic acid is essential for reovirus infectivity. MEL-adapted (MA) mutant viruses isolated by serial passage in MEL cells acquired the capacity to bind sialic acid-containing receptors and demonstrated a dependence on sialic acid for infection of MEL cells. Analysis of reassortant viruses isolated from crosses of an MA mutant virus and a reovirus strain that does not bind sialic acid indicated that the sigma1 protein is solely responsible for efficient growth of MA mutant viruses in MEL cells. The deduced sigma1 amino acid sequences of the MA mutant viruses revealed that each strain contains a substitution within a short region of sequence in the sigma1 tail predicted to form beta-sheet. These studies identify specific sequences that determine the capacity of reovirus to bind sialylated receptors and suggest a location for a sialic acid-binding domain. Furthermore, the results support a model in which type 3 sigma1 protein contains discrete receptor binding domains, one in the head and another in the tail that binds sialic acid.

  9. The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

    DEFF Research Database (Denmark)

    Behrendt, N; Rønne, E; Ploug, M

    1990-01-01

    -PA. The purified protein shows a single 55-60 kDa band after sodium dodecyl sulfate-polyacrylamide gel electrophoresis and silver staining. It is a heavily glycosylated protein, the deglycosylated polypeptide chain comprising only 35 kDa. The glycosylated protein contains N-acetyl-D-glucosamine and sialic acid......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported...

  10. Sequence Learning with Stochastic Feedback in a Cross-Cultural Sample of Boys in the Autistic Spectrum

    Science.gov (United States)

    Hentschel, Maren; Lange-Kuttner, Christiane; Averbeck, Bruno B.

    2016-01-01

    The study investigated sequence learning from stochastic feedback in boys with Autistic Spectrum Disorder (ASD) and typically developed (TD) boys. We asked boys with ASD from Nigeria and the UK as well as age- and gender-matched controls (also males only) to deduce a sequence of four left and right button presses, LLRR, RRLL, LRLR, RLRL, LRRL and…

  11. Molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer myostatin gene

    Directory of Open Access Journals (Sweden)

    Smith-Keune Carolyn

    2008-02-01

    Full Text Available Abstract Background Myostatin (MSTN is a member of the transforming growth factor-β superfamily that negatively regulates growth of skeletal muscle tissue. The gene encoding for the MSTN peptide is a consolidate candidate for the enhancement of productivity in terrestrial livestock. This gene potentially represents an important target for growth improvement of cultured finfish. Results Here we report molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer MSTN-1 gene. The barramundi MSTN-1 was encoded by three exons 379, 371 and 381 bp in length and translated into a 376-amino acid peptide. Intron 1 and 2 were 412 and 819 bp in length and presented typical GT...AG splicing sites. The upstream region contained cis-regulatory elements such as TATA-box and E-boxes. A first assessment of sequence variability suggested that higher mutation rates are found in the 5' flanking region with several SNP's present in this species. A putative micro RNA target site has also been observed in the 3'UTR (untranslated region and is highly conserved across teleost fish. The deduced amino acid sequence was conserved across vertebrates and exhibited characteristic conserved putative functional residues including a cleavage motif of proteolysis (RXXR, nine cysteines and two glycosilation sites. A qualitative analysis of the barramundi MSTN-1 expression pattern revealed that, in adult fish, transcripts are differentially expressed in various tissues other than skeletal muscles including gill, heart, kidney, intestine, liver, spleen, eye, gonad and brain. Conclusion Our findings provide valuable insights such as sequence variation and genomic information which will aid the further investigation of the barramundi MSTN-1 gene in association with growth. The finding for the first time in finfish MSTN of a miRNA target site in the 3'UTR provides an opportunity for the identification of regulatory mutations on the

  12. Characterization of gonadotrophin-releasing hormone precursor cDNA in the Old World mole-rat Cryptomys hottentotus pretoriae: high degree of identity with the New World guinea pig sequence.

    Science.gov (United States)

    Kalamatianos, T; du Toit, L; Hrabovszky, E; Kalló, I; Marsh, P J; Bennett, N C; Coen, C W

    2005-05-01

    Regulation of pituitary gonadotrophins by the decapeptide gonadotrophin-releasing hormone 1 (GnRH1) is crucial for the development and maintenance of reproductive functions. A common amino acid sequence for this decapeptide, designated as 'mammalian' GnRH, has been identified in all mammals thus far investigated with the exception of the guinea pig, in which there are two amino acid substitutions. Among hystricognath rodents, the members of the family Bathyergidae regulate reproduction in response to diverse cues. Thus, highveld mole-rats (Cryptomys hottentotus pretoriae) are social bathyergids in which breeding is restricted to a particular season in the dominant female, but continuously suppressed in subordinate colony members. Elucidation of reproductive control in these animals will be facilitated by characterization of their GnRH1 gene. A partial sequence of GnRH1 precursor cDNA was isolated and characterized. Comparative analysis revealed the highest degree of identity (86%) to guinea pig GnRH1 precursor mRNA. Nevertheless, the deduced amino acid sequence of the mole-rat decapeptide is identical to the 'mammalian' sequence rather than that of guinea pigs. Successful detection of GnRH1-synthesizing neurones using either a guinea pig GnRH1 riboprobe or an antibody against the 'mammalian' decapeptide is consistent with the guinea pig-like sequence for the precursor and the classic 'mammalian' form for the decapeptide. The high degree of identity in the GnRH1 precursor sequence between this Old World mole-rat and the New World guinea pig is consistent with the theory that caviomorphs and phiomorphs originated from a common ancestral line in the Palaeocene to mid Eocene, some 63-45 million years ago.

  13. Comparisons of Satellite-Deduced Overlapping Cloud Properties and CALIPSO CloudSat Data

    Science.gov (United States)

    Chang, Fu-Lung; Minnis, Patrick; Lin, Bing; Sun-Mack, Sunny

    2010-01-01

    Introduction to the overlapped cloud properties derived from polar-orbiting (MODIS) and geostationary (GOES-12, -13, Meteosat-8, -9, etc.) meteorological satellites, which are produced at the NASA Langley Research Center (LaRC) cloud research & development team (NASA lead scientist: Dr. Patrick Minnis). Comparison of the LaRC CERES MODIS Edition-3 overlapped cloud properties to the CALIPSO and the CloudSat active sensing data. High clouds and overlapped clouds occur frequently as deduced by CALIPSO (44 & 25%), CloudSat (25 & 4%), and MODIS (37 & 6%). Large fractions of optically-thin cirrus and overlapped clouds are deduced from CALIPSO, but much smaller fractions are from CloudSat and MODIS. For overlapped clouds, the averaged upper-layer CTHs are about 12.8 (CALIPSO), 10.9 (CloudSat) and 10 km (MODIS), and the averaged lower-layer CTHs are about 3.6 (CALIPSO), 3.2 (CloudSat) and 3.9 km (MODIS). Based on comparisons of upper and lower-layer cloud properties as deduced from the MODIS, CALIPSO and CloudSat data, more enhanced passive satellite methods for retrieving thin cirrus and overlapped cloud properties are needed and are under development.

  14. Characterization of Rous sarcoma virus-related sequences in the Japanese quail.

    Science.gov (United States)

    Chambers, J A; Cywinski, A; Chen, P J; Taylor, J M

    1986-08-01

    We detected sequences related to the avian retrovirus Rous sarcoma virus within the genome of the Japanese quail, a species previously considered to be free of endogenous avian leukosis virus elements. Using low-stringency conditions of hybridization, we screened a quail genomic library for clones containing retrovirus-related information. Of five clones so selected, one, lambda Q48, contained sequence information related to the gag, pol, and env genes of Rous sarcoma virus arranged in a contiguous fashion and spanning a distance of approximately 5.8 kilobases. This organization is consistent with the presence of an endogenous retroviral element within the Japanese quail genome. Use of this element as a high-stringency probe on Southern blots of genomic digests of several quail DNA demonstrated hybridization to a series of high-molecular-weight bands. By slot hybridization to quail DNA with a cloned probe, it was deduced that there were approximately 300 copies per diploid cell. In addition, the quail element also hybridized at low stringency to the DNA of the White Leghorn chicken and at high stringency to the DNAs of several species of jungle fowl and both true and ruffed pheasants. Limited nucleotide sequencing analysis of lambda Q48 revealed homologies of 65, 52, and 46% compared with the sequence of Rous sarcoma virus strain Prague C for the endonuclease domain of pol, the pol-env junction, and the 3'-terminal region of env, respectively. Comparisons at the amino acid level were also significant, thus confirming the retrovirus relatedness of the cloned quail element.

  15. Polyvinyl-alcohol-based magnetic beads for rapid and efficient separation of specific or unspecific nucleic acid sequences

    International Nuclear Information System (INIS)

    Oster, J.; Parker, Jeffrey; Brassard, Lothar

    2001-01-01

    The versatile application of polyvinyl-alcohol-based magnetic M-PVA beads is demonstrated in the separation of genomic DNA, sequence specific nucleic acid purification, and binding of bacteria for subsequent DNA extraction and detection. It is shown that nucleic acids can be obtained in high yield and purity using M-PVA beads, making sample preparation efficient, fast and highly adaptable for automation processes

  16. Identification and chromosomal distribution of copia-like retrotransposon sequences in the coffee (Coffea L. genome

    Directory of Open Access Journals (Sweden)

    Juan-Carlos Herrera

    2013-12-01

    Full Text Available The presence of copia-like transposable elements in seven coffee (Coffea sp. species, including the cultivated Coffea arabica, was investigated. The highly conserved domains of the reverse transcriptase (RT present in the copia retrotransposons were amplified by PCR using degenerated primers. Fragments of roughly 300 bp were obtained and the nucleotide sequence was determined for 36 clones, 19 of which showed good quality. The deduced amino acid sequences were compared by multiple alignment analysis. The data suggested two distinct coffee RT groups, designated as CRTG1 and CRTG2. The sequence identities among the groups ranged from 52 to 60% for CRTG1 and 74 to 85% for CRTG2. The multiple alignment analysis revealed that some of the clones in CRTG1 were closely related to the representative elements present in other plant species such as Brassica napus, Populus ciliata and Picea abis. Furthermore, the chromosomal localization of the RT domains in C. arabica and their putative ancestors was investigated by fluorescence in situ hybridization (FISH analysis. FISH signals were observed throughout the chromosomes following a similar dispersed pattern with some localized regions exhibiting higher concentrations of those elements, providing new evidence of their relative conservation and stability in the coffee genome

  17. Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila.

    Science.gov (United States)

    Liu, X; Gorovsky, M A

    1996-01-01

    A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889

  18. Evolution of green plants as deduced from 5S rRNA sequences.

    Science.gov (United States)

    Hori, H; Lim, B L; Osawa, S

    1985-02-01

    We have constructed a phylogenic tree for green plants by comparing 5S rRNA sequences. The tree suggests that the emergence of most of the uni- and multicellular green algae such as Chlamydomonas, Spirogyra, Ulva, and Chlorella occurred in the early stage of green plant evolution. The branching point of Nitella is a little earlier than that of land plants and much later than that of the above green algae, supporting the view that Nitella-like green algae may be the direct precursor to land plants. The Bryophyta and the Pteridophyta separated from each other after emergence of the Spermatophyta. The result is consistent with the view that the Bryophyta evolved from ferns by degeneration. In the Pteridophyta, Psilotum (whisk fern) separated first, and a little later Lycopodium (club moss) separated from the ancestor common to Equisetum (horsetail) and Dryopteris (fern). This order is in accordance with the classical view. During the Spermatophyta evolution, the gymnosperms (Cycas, Ginkgo, and Metasequoia have been studied here) and the angiosperms (flowering plants) separated, and this was followed by the separation of Metasequoia and Cycas (cycad)/Ginkgo (maidenhair tree) on one branch and various flowering plants on the other.

  19. Detection of a Usp-like gene in Calotropis procera plant from the de novo assembled genome contigs of the high-throughput sequencing dataset

    KAUST Repository

    Shokry, Ahmed M.

    2014-02-01

    The wild plant species Calotropis procera (C. procera) has many potential applications and beneficial uses in medicine, industry and ornamental field. It also represents an excellent source of genes for drought and salt tolerance. Genes encoding proteins that contain the conserved universal stress protein (USP) domain are known to provide organisms like bacteria, archaea, fungi, protozoa and plants with the ability to respond to a plethora of environmental stresses. However, information on the possible occurrence of Usp in C. procera is not available. In this study, we uncovered and characterized a one-class A Usp-like (UspA-like, NCBI accession No. KC954274) gene in this medicinal plant from the de novo assembled genome contigs of the high-throughput sequencing dataset. A number of GenBank accessions for Usp sequences were blasted with the recovered de novo assembled contigs. Homology modelling of the deduced amino acids (NCBI accession No. AGT02387) was further carried out using Swiss-Model, accessible via the EXPASY. Superimposition of C. procera USPA-like full sequence model on Thermus thermophilus USP UniProt protein (PDB accession No. Q5SJV7) was constructed using RasMol and Deep-View programs. The functional domains of the novel USPA-like amino acids sequence were identified from the NCBI conserved domain database (CDD) that provide insights into sequence structure/function relationships, as well as domain models imported from a number of external source databases (Pfam, SMART, COG, PRK, TIGRFAM). © 2014 Académie des sciences.

  20. Purification and sequencing of radish seed calmodulin antagonists phosphorylated by calcium-dependent protein kinase.

    Science.gov (United States)

    Polya, G M; Chandra, S; Condron, R

    1993-02-01

    A family of radish (Raphanus sativus) calmodulin antagonists (RCAs) was purified from seeds by extraction, centrifugation, batch-wise elution from carboxymethyl-cellulose, and high performance liquid chromatography (HPLC) on an SP5PW cation-exchange column. This RCA fraction was further resolved into three calmodulin antagonist polypeptides (RCA1, RCA2, and RCA3) by denaturation in the presence of guanidinium HCl and mercaptoethanol and subsequent reverse-phase HPLC on a C8 column eluted with an acetonitrile gradient in the presence of 0.1% trifluoroacetic acid. The RCA preparation, RCA1, RCA2, RCA3, and other radish seed proteins are phosphorylated by wheat embryo Ca(2+)-dependent protein kinase (CDPK). The RCA preparation contains other CDPK substrates in addition to RCA1, RCA2, and RCA3. The RCA preparation, RCA1, RCA2, and RCA3 inhibit chicken gizzard calmodulin-dependent myosin light chain kinase assayed with a myosin-light chain-based synthetic peptide substrate (fifty percent inhibitory concentrations of RCA2 and RCA3 are about 7 and 2 microM, respectively). N-terminal sequencing by sequential Edman degradation of RCA1, RCA2, and RCA3 revealed sequences having a high homology with the small subunit of the storage protein napin from Brassica napus and with related proteins. The deduced amino acid sequences of RCA1, RCA2, RCA3, and RCA3' (a subform of RCA3) have agreement with average molecular masses from electrospray mass spectrometry of 4537, 4543, 4532, and 4560 kD, respectively. The only sites for serine phosphorylation are near or at the C termini and hence adjacent to the sites of proteolytic precursor cleavage.

  1. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    OpenAIRE

    Mohn, W W

    1995-01-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per...

  2. fCCAC: functional canonical correlation analysis to evaluate covariance between nucleic acid sequencing datasets.

    Science.gov (United States)

    Madrigal, Pedro

    2017-03-01

    Computational evaluation of variability across DNA or RNA sequencing datasets is a crucial step in genomic science, as it allows both to evaluate reproducibility of biological or technical replicates, and to compare different datasets to identify their potential correlations. Here we present fCCAC, an application of functional canonical correlation analysis to assess covariance of nucleic acid sequencing datasets such as chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We show how this method differs from other measures of correlation, and exemplify how it can reveal shared covariance between histone modifications and DNA binding proteins, such as the relationship between the H3K4me3 chromatin mark and its epigenetic writers and readers. An R/Bioconductor package is available at http://bioconductor.org/packages/fCCAC/ . pmb59@cam.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  3. Prediction of beta-turns from amino acid sequences using the residue-coupled model.

    Science.gov (United States)

    Guruprasad, K; Shukla, S

    2003-04-01

    We evaluated the prediction of beta-turns from amino acid sequences using the residue-coupled model with an enlarged representative protein data set selected from the Protein Data Bank. Our results show that the probability values derived from a data set comprising 425 protein chains yielded an overall beta-turn prediction accuracy 68.74%, compared with 94.7% reported earlier on a data set of 30 proteins using the same method. However, we noted that the overall beta-turn prediction accuracy using probability values derived from the 30-protein data set reduces to 40.74% when tested on the data set comprising 425 protein chains. In contrast, using probability values derived from the 425 data set used in this analysis, the overall beta-turn prediction accuracy yielded consistent results when tested on either the 30-protein data set (64.62%) used earlier or a more recent representative data set comprising 619 protein chains (64.66%) or on a jackknife data set comprising 476 representative protein chains (63.38%). We therefore recommend the use of probability values derived from the 425 representative protein chains data set reported here, which gives more realistic and consistent predictions of beta-turns from amino acid sequences.

  4. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    Science.gov (United States)

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  5. Amino acid sequence and biological characterization of BlatPLA₂, a non-toxic acidic phospholipase A₂ from the venom of the arboreal snake Bothriechis lateralis from Costa Rica.

    Science.gov (United States)

    Van der Laat, Marco; Fernández, Julián; Durban, Jordi; Villalobos, Eva; Camacho, Erika; Calvete, Juan J; Lomonte, Bruno

    2013-10-01

    Bothriechis is considered a monophyletic, basal genus of arboreal Neotropical pitvipers distributed across Middle America. The four species found in Costa Rica (B. lateralis, B. schlegeli, B. nigroviridis, B. supraciliaris) differ in their venom proteomic profiles, suggesting that different Bothriechis taxa have evolved diverse trophic strategies. In this study, we isolated a phospholipase A₂ (PLA₂) from B. lateralis venom, aiming at increasing our knowledge on the structural and functional characteristics of group II acidic PLA₂s, whose toxic actions are generally more restricted than those displayed by basic PLA₂s. The new acidic enzyme, BlatPLA₂, occurs as a monomer of 13,917 Da, in contrast to many basic group II PLA₂s which associate into dimers and often display myotoxicity and/or neurotoxicity. Its amino acid sequence of 122 residues predicts an isoelectric point of 4.7, and displays significant differences with previously characterized acidic PLA₂s, with which it shows a maximum sequence identity of 78%. BlatPLA₂ is catalytically active but appears to be devoid of major toxic activities, lacking intravenous or intracerebroventricular lethality, myotoxicity, in vitro anticoagulant activity, and platelet aggregation or inhibition effects. Phylogenetic relationships with similar group II enzymes suggest that BlatPLA₂ may represent a basal sequence to other acidic PLA₂s. Due to the metabolic cost of venom protein synthesis, the presence of a relatively abundant (9%) but non-toxic component is somewhat puzzling. Nevertheless, we hypothesize that BlatPLA₂ could have a role in the pre-digestion of prey, possibly having retained characteristics of ancestral PLA₂s without evolving towards potent toxicity. Copyright © 2013 Elsevier Ltd. All rights reserved.

  6. Amino-acid sequences of trypsin inhibitors from watermelon (Citrullus vulgaris) and red bryony (Bryonia dioica) seeds.

    Science.gov (United States)

    Otlewski, J; Whatley, H; Polanowski, A; Wilusz, T

    1987-11-01

    The amino-acid sequences of two trypsin inhibitors isolated from red bryony (Bryonia dioica) and watermelon (Citrullus vulgaris) seeds are reported. Both species represent different genera of the Cucurbitaceae family, which have not been previously investigated as a source of proteinase inhibitors. The sequences are unique but are very similar to those of other proteinase inhibitors which have been isolated from squash seeds. Based on structural homology we assume that the Arg5-Ile6 peptide bond represents the reactive site bond of both inhibitors.

  7. Method of estimating horizontal vectors of ionospheric electric field deduced from HF Doppler data

    International Nuclear Information System (INIS)

    Tsutsui, M.; Ogawa, T.; Kamide, Y.; Kroehl, H.W.; Hausman, B.A.

    1988-01-01

    An HF Doppler method for estimating the time variations of the horizontal electric field in the ionosphere is presented which takes into account, for long-lasting variations in the electric field, the effect of electron decay due to attachment and/or recombination processes. The method is applied to an isolated substorm event, using equivalent ionospheric current systems deduced from worldwide magnetometer data in the estimations. The present results are found to agree with data deduced from current systems and high latitude electrojet activity. 18 references

  8. Fast computational methods for predicting protein structure from primary amino acid sequence

    Science.gov (United States)

    Agarwal, Pratul Kumar [Knoxville, TN

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  9. Identification, Characterization and Full-Length Sequence Analysis of a Novel Polerovirus Associated with Wheat Leaf Yellowing Disease.

    Science.gov (United States)

    Zhang, Peipei; Liu, Yan; Liu, Wenwen; Cao, Mengji; Massart, Sebastien; Wang, Xifeng

    2017-01-01

    To identify the pathogens responsible for leaf yellowing symptoms on wheat samples collected from Jinan, China, we tested for the presence of three known barley/wheat yellow dwarf viruses (BYDV-GAV, -PAV, WYDV-GPV) (most likely pathogens) using RT-PCR. A sample that tested negative for the three viruses was selected for small RNA sequencing. Twenty-five million sequences were generated, among which 5% were of viral origin. A novel polerovirus was discovered and temporarily named wheat leaf yellowing-associated virus (WLYaV). The full genome of WLYaV corresponds to 5,772 nucleotides (nt), with six AUG-initiated open reading frames, one non-AUG-initiated open reading frame, and three untranslated regions, showing typical features of the family Luteoviridae . Sequence comparison and phylogenetic analyses suggested that WLYaV had the closest relationship with sugarcane yellow leaf virus (ScYLV), but the identities of full genomic nucleotides and deduced amino acid sequence of coat protein (CP) were 64.9 and 86.2%, respectively, below the species demarcation thresholds (90%) in the family Luteoviridae . Furthermore, agroinoculation of Nicotiana benthamiana leaves with a cDNA clone of WLYaV caused yellowing symptoms on the plant. Our study adds a new polerovirus that is associated with wheat leaf yellowing disease, which would help to identify and control pathogens of wheat.

  10. Identification, Characterization and Full-Length Sequence Analysis of a Novel Polerovirus Associated with Wheat Leaf Yellowing Disease

    Directory of Open Access Journals (Sweden)

    Peipei Zhang

    2017-09-01

    Full Text Available To identify the pathogens responsible for leaf yellowing symptoms on wheat samples collected from Jinan, China, we tested for the presence of three known barley/wheat yellow dwarf viruses (BYDV-GAV, -PAV, WYDV-GPV (most likely pathogens using RT-PCR. A sample that tested negative for the three viruses was selected for small RNA sequencing. Twenty-five million sequences were generated, among which 5% were of viral origin. A novel polerovirus was discovered and temporarily named wheat leaf yellowing-associated virus (WLYaV. The full genome of WLYaV corresponds to 5,772 nucleotides (nt, with six AUG-initiated open reading frames, one non-AUG-initiated open reading frame, and three untranslated regions, showing typical features of the family Luteoviridae. Sequence comparison and phylogenetic analyses suggested that WLYaV had the closest relationship with sugarcane yellow leaf virus (ScYLV, but the identities of full genomic nucleotides and deduced amino acid sequence of coat protein (CP were 64.9 and 86.2%, respectively, below the species demarcation thresholds (90% in the family Luteoviridae. Furthermore, agroinoculation of Nicotiana benthamiana leaves with a cDNA clone of WLYaV caused yellowing symptoms on the plant. Our study adds a new polerovirus that is associated with wheat leaf yellowing disease, which would help to identify and control pathogens of wheat.

  11. Cloning and sequence of the gene encoding a cefotaxime-hydrolyzing class A beta-lactamase isolated from Escherichia coli.

    Science.gov (United States)

    Ishii, Y; Ohno, A; Taguchi, H; Imajo, S; Ishiguro, M; Matsuzawa, H

    1995-01-01

    Escherichia coli TUH12191, which is resistant to piperacillin, cefazolin, cefotiam, ceftizoxime, cefuzonam, and aztreonam but is susceptible to cefoxitin, latamoxef, flomoxef, and imipenem, was isolated from the urine of a patient treated with beta-lactam antibiotics. The beta-lactamase (Toho-1) purified from the bacteria had a pI of 7.8, had a molecular weight of about 29,000, and hydrolyzed beta-lactam antibiotics such as penicillin G, ampicillin, oxacillin, carbenicillin, piperacillin, cephalothin, cefoxitin, cefotaxime, ceftazidime, and aztreonam. Toho-1 was markedly inhibited by beta-lactamase inhibitors such as clavulanic acid and tazobactam. Resistance to beta-lactams, streptomycin, spectinomycin, sulfamethoxazole, and trimethoprim was transferred by conjugational transfer from E. coli TUH12191 to E. coli ML4903, and the transferred plasmid was about 58 kbp, belonging to incompatibility group M. The cefotaxime resistance gene for Toho-1 was subcloned from the 58-kbp plasmid by transformation of E. coli MV1184. The sequence of the gene for Toho-1 was determined, and the open reading frame of the gene consisted of 873 or 876 bases (initial sequence, ATGATG). The nucleotide sequence of the gene (DDBJ accession number D37830) was found to be about 73% homologous to the sequence of the gene encoding a class A beta-lactamase produced by Klebsiella oxytoca E23004. According to the amino acid sequence deduced from the DNA sequence, the precursor consisted of 290 or 291 amino acid residues, which contained amino acid motifs common to class A beta-lactamases (70SXXK, 130SDN, and 234KTG). Toho-1 was about 83% homologous to the beta-lactamase mediated by the chromosome of K. oxytoca D488 and the beta-lactamase mediated by the plasmid of E. coli MEN-1. Therefore, the newly isolated beta-lactamase Toho-1 produced by E. coli TUH12191 is similar to beta-lactamases produced by K. oxytoca D488, K. oxytoca E23004, and E. coli MEN-1 rather than to mutants of TEM or SHV enzymes

  12. Predicting protein amidation sites by orchestrating amino acid sequence features

    Science.gov (United States)

    Zhao, Shuqiu; Yu, Hua; Gong, Xiujun

    2017-08-01

    Amidation is the fourth major category of post-translational modifications, which plays an important role in physiological and pathological processes. Identifying amidation sites can help us understanding the amidation and recognizing the original reason of many kinds of diseases. But the traditional experimental methods for predicting amidation sites are often time-consuming and expensive. In this study, we propose a computational method for predicting amidation sites by orchestrating amino acid sequence features. Three kinds of feature extraction methods are used to build a feature vector enabling to capture not only the physicochemical properties but also position related information of the amino acids. An extremely randomized trees algorithm is applied to choose the optimal features to remove redundancy and dependence among components of the feature vector by a supervised fashion. Finally the support vector machine classifier is used to label the amidation sites. When tested on an independent data set, it shows that the proposed method performs better than all the previous ones with the prediction accuracy of 0.962 at the Matthew's correlation coefficient of 0.89 and area under curve of 0.964.

  13. Differences in acid tolerance between Bifidobacterium breve BB8 and its acid-resistant derivative B. breve BB8dpH, revealed by RNA-sequencing and physiological analysis.

    Science.gov (United States)

    Yang, Xu; Hang, Xiaomin; Tan, Jing; Yang, Hong

    2015-06-01

    Bifidobacteria are common inhabitants of the human gastrointestinal tract, and their application has increased dramatically in recent years due to their health-promoting effects. The ability of bifidobacteria to tolerate acidic environments is particularly important for their function as probiotics because they encounter such environments in food products and during passage through the gastrointestinal tract. In this study, we generated a derivative, Bifidobacterium breve BB8dpH, which displayed a stable, acid-resistant phenotype. To investigate the possible reasons for the higher acid tolerance of B. breve BB8dpH, as compared with its parental strain B. breve BB8, a combined transcriptome and physiological approach was used to characterize differences between the two strains. An analysis of the transcriptome by RNA-sequencing indicated that the expression of 121 genes was increased by more than 2-fold, while the expression of 146 genes was reduced more than 2-fold, in B. breve BB8dpH. Validation of the RNA-sequencing data using real-time quantitative PCR analysis demonstrated that the RNA-sequencing results were highly reliable. The comparison analysis, based on differentially expressed genes, suggested that the acid tolerance of B. breve BB8dpH was enhanced by regulating the expression of genes involved in carbohydrate transport and metabolism, energy production, synthesis of cell envelope components (peptidoglycan and exopolysaccharide), synthesis and transport of glutamate and glutamine, and histidine synthesis. Furthermore, an analysis of physiological data showed that B. breve BB8dpH displayed higher production of exopolysaccharide and lower H(+)-ATPase activity than B. breve BB8. The results presented here will improve our understanding of acid tolerance in bifidobacteria, and they will lead to the development of new strategies to enhance the acid tolerance of bifidobacterial strains. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. scsB, a cDNA encoding the hydrogenosomal beta subunit of succinyl-CoA synthetase from the anaerobic fungus Neocallimastix frontalis

    NARCIS (Netherlands)

    Brondijk, THC; Durand, R; vanderGiezen, M; Gottschal, JC; Prins, RA; Fevre, M

    1996-01-01

    A clone containing a Neocallimastix frontalis cDNA assumed to encode the beta subunit of succinyl-CoA synthetase (SCSB) was identified by sequence homology with prokaryotic and eukaryotic counterparts. An open reading frame of 1311 bp was found. The deduced 437 amino acid sequence showed a high

  15. The UDP glucuronosyltransferase gene superfamily: suggested nomenclature based on evolutionary divergence

    NARCIS (Netherlands)

    Burchell, B.; Nebert, D. W.; Nelson, D. R.; Bock, K. W.; Iyanagi, T.; Jansen, P. L.; Lancet, D.; Mulder, G. J.; Chowdhury, J. R.; Siest, G.

    1991-01-01

    A nomenclature system for the UDP glucuronosyltransferase superfamily is proposed, based on divergent evolution of the genes. A total of 26 distinct cDNAs in five mammalian species have been sequenced to date. Comparison of the deduced amino acid sequences leads to the definition of two families and

  16. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  17. Complete Genome Sequence of the Gamma-Aminobutyric Acid-Producing Strain Streptococcus thermophilus APC151.

    Science.gov (United States)

    Linares, Daniel M; Arboleya, Silvia; Ross, R Paul; Stanton, Catherine

    2017-04-27

    Here is presented the whole-genome sequence of Streptococcus thermophilus APC151, isolated from a marine fish. This bacterium produces gamma-aminobutyric acid (GABA) in high yields and is biotechnologically suitable to produce naturally GABA-enriched biofunctional yogurt. Its complete genome comprises 2,097 genes and 1,839,134 nucleotides, with an average G+C content of 39.1%. Copyright © 2017 Linares et al.

  18. Sequence of the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase from Nicotiana plumbaginifolia and phylogenetic origin of the gene family.

    Science.gov (United States)

    Habenicht, A; Quesada, A; Cerff, R

    1997-10-01

    A cDNA-library has been constructed from Nicotiana plumbaginifolia seedlings, and the non-phosphorylating glyceraldehyde-3-phosphate dehydrogenase (GapN, EC 1.2.1.9) was isolated by plaque hybridization using the cDNA from pea as a heterologous probe. The cDNA comprises the entire GapN coding region. A putative polyadenylation signal is identified. Phylogenetic analysis based on the deduced amino acid sequences revealed that the GapN gene family represents a separate ancient branch within the aldehyde dehydrogenase superfamily. It can be shown that the GapN gene family and other distinct branches of the superfamily have its phylogenetic origin before the separation of primary life-forms. This further demonstrates that already very early in evolution, a broad diversification of the aldehyde dehydrogenases led to the formation of the superfamily.

  19. Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

    International Nuclear Information System (INIS)

    Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

    1987-01-01

    Apolipoprotein(a) [apo(a)] is a glycoprotein with M/sub r/ ∼ 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain

  20. MipLAAO, a new L-amino acid oxidase from the redtail coral snake Micrurus mipartitus

    Directory of Open Access Journals (Sweden)

    Paola Rey-Suárez

    2018-06-01

    Full Text Available L-amino acid oxidases (LAAOs are ubiquitous enzymes in nature. Bioactivities described for these enzymes include apoptosis induction, edema formation, induction or inhibition of platelet aggregation, as well as antiviral, antiparasite, and antibacterial actions. With over 80 species, Micrurus snakes are the representatives of the Elapidae family in the New World. Although LAAOs in Micrurus venoms have been predicted by venom gland transcriptomic studies and detected in proteomic studies, no enzymes of this kind have been previously purified from their venoms. Earlier proteomic studies revealed that the venom of M. mipartitus from Colombia contains ∼4% of LAAO. This enzyme, here named MipLAAO, was isolated and biochemically and functionally characterized. The enzyme is found in monomeric form, with an isotope-averaged molecular mass of 59,100.6 Da, as determined by MALDI-TOF. Its oxidase activity shows substrate preference for hydrophobic amino acids, being optimal at pH 8.0. By nucleotide sequencing of venom gland cDNA of mRNA transcripts obtained from a single snake, six isoforms of MipLAAO with minor variations among them were retrieved. The deduced sequences present a mature chain of 483 amino acids, with a predicted pI of 8.9, and theoretical masses between 55,010.9 and 55,121.0 Da. The difference with experimentally observed mass is likely due to glycosylation, in agreement with the finding of three putative N-glycosylation sites in its amino acid sequence. A phylogenetic analysis of MmipLAAO placed this new enzyme within the clade of homologous proteins from elapid snakes, characterized by the conserved Serine at position 223, in contrast to LAAOs from viperids. MmipLAAO showed a potent bactericidal effect on S. aureus (MIC: 2 µg/mL, but not on E. coli. The former activity could be of interest to future studies assessing its potential as antimicrobial agent.

  1. Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.

    Science.gov (United States)

    Tan, Yen Hock; Huang, He; Kihara, Daisuke

    2006-08-15

    Aligning distantly related protein sequences is a long-standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance is increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile-profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself; thereby resulting in more accurate profile-profile alignments. Here we have developed novel amino acid similarity matrices from knowledge-based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential-based matrices perform comparably in the family level alignments, but clearly outperform in the fold level alignments. The contact potential-based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential-based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space.

  2. Deducing Energy Consumer Behavior from Smart Meter Data

    DEFF Research Database (Denmark)

    Ebeid, Emad Samuel Malki; Heick, Rune; Jacobsen, Rune Hylsberg

    2017-01-01

    The ongoing upgrade of electricity meters to smart ones has opened a new market of intelligent services to analyze the recorded meter data. This paper introduces an open architecture and a unified framework for deducing user behavior from its smart main electricity meter data and presenting...... the results in a natural language. The framework allows a fast exploration and integration of a variety of machine learning algorithms combined with data recovery mechanisms for improving the recognition’s accuracy. Consequently, the framework generates natural language reports of the user’s behavior from...

  3. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Science.gov (United States)

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  4. Molecular cloning of a cDNA encoding human calumenin, expression in Escherichia coli and analysis of its Ca2+-binding activity

    DEFF Research Database (Denmark)

    Vorum, H; Liu, X; Madsen, Peder

    1998-01-01

    By microsequencing and cDNA cloning we have identified the transformation-sensitive protein No. IEF SSP 9302 as the human homologue of calumenin. The nucleotide sequence predicts a 315 amino acid protein with high identity to murine and rat calumenin. The deduced protein contains a 19 amino acid N...

  5. Identification of two novel genes encoding 97- to 99-kilodalton outer membrane proteins of Chlamydia pneumoniae.Infect Immun. 1999 Jan;67(1):375-83

    DEFF Research Database (Denmark)

    Knudsen, K; Madsen, AS; Mygind, P

    1999-01-01

    Two genes encoding 97- to 99-kDa Chlamydia pneumoniae VR1310 outer membrane proteins (Omp4 and Omp5) with mutual similarity were cloned and sequenced. The proteins were shown to be constituents of the C. pneumoniae outer membrane complex, and the deduced amino acid sequences were similar to those...

  6. Altering the expression of two chitin synthase genes differentially affects the growth and morphology of Aspergillus oryzae

    DEFF Research Database (Denmark)

    Müller, Christian; Hjort, C.M.; Hansen, K.

    2002-01-01

    In Aspergillus oryzae, one full-length chitin synthase (chsB) and fragments of two other chitin synthases (csmA and chsC) were identified. The deduced amino acid sequence of chsB was similar (87% identity) to chsB from Aspergillus nidulans, which encodes a class III chitin synthase. The sequence...

  7. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    Directory of Open Access Journals (Sweden)

    Ruan Jishou

    2007-04-01

    Full Text Available Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP; the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are

  8. Human catechol-O-methyltransferase: Cloning and expression of the membrane-associated form

    International Nuclear Information System (INIS)

    Bertocci, B.; Miggiano, V.; Da Prada, M.; Dembic, Z.; Lahm, H.W.; Malherbe, P.

    1991-01-01

    A cDNA clone for human catechol-O-methyltransferase was isolated from a human hepatoma cell line (Hep G2) cDNA library by hybridization screening with a porcine cDNA probe. The cDNA clone was sequenced and found to have an insert of 1226 nucleotides. The deduced primary structure of hCOMT is composed of 271 amino acid residues with the predicted molecular mass of 30 kDa. At its N terminus it has a hydrophobic segment of 21 amino acid residues that may be responsible for insertion of hCOMT into the endoplasmic reticulum membrane. The primary structure of hCOMT exhibits high homology to the porcine partial cDNA sequence (93%). The deduced amino acid sequence contains two tryptic peptide sequences (T-22, T-33) found in porcine liver catechol-O-methyltransferase (CEMT). The coding region of hCOMT cDNA was placed under the control of the cytomegalovirus promoter to transfect human kidney 293 cells. The recombinant hCOMT was shown by immunoblot analysis to be mainly associated with the membrane fraction. RNA blot analysis revealed one COMT mRNA transcript of 1.4 kilobases in Hep G2 poly(A) + RNA

  9. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    Directory of Open Access Journals (Sweden)

    Pelletier Eric

    2010-10-01

    Full Text Available Abstract Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C

  10. Nucleic Acid Amplification Testing and Sequencing Combined with Acid-Fast Staining in Needle Biopsy Lung Tissues for the Diagnosis of Smear-Negative Pulmonary Tuberculosis.

    Science.gov (United States)

    Jiang, Faming; Huang, Weiwei; Wang, Ye; Tian, Panwen; Chen, Xuerong; Liang, Zongan

    2016-01-01

    Smear-negative pulmonary tuberculosis (PTB) is common and difficult to diagnose. In this study, we investigated the diagnostic value of nucleic acid amplification testing and sequencing combined with acid-fast bacteria (AFB) staining of needle biopsy lung tissues for patients with suspected smear-negative PTB. Patients with suspected smear-negative PTB who underwent percutaneous transthoracic needle biopsy between May 1, 2012, and June 30, 2015, were enrolled in this retrospective study. Patients with AFB in sputum smears were excluded. All lung biopsy specimens were fixed in formalin, embedded in paraffin, and subjected to acid-fast staining and tuberculous polymerase chain reaction (TB-PCR). For patients with positive AFB and negative TB-PCR results in lung tissues, probe assays and 16S rRNA sequencing were used for identification of nontuberculous mycobacteria (NTM). The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic accuracy of PCR and AFB staining were calculated separately and in combination. Among the 220 eligible patients, 133 were diagnosed with TB (men/women: 76/57; age range: 17-80 years, confirmed TB: 9, probable TB: 124). Forty-eight patients who were diagnosed with other specific diseases were assigned as negative controls, and 39 patients with indeterminate final diagnosis were excluded from statistical analysis. The sensitivity, specificity, PPV, NPV, and accuracy of histological AFB (HAFB) for the diagnosis of smear-negative were 61.7% (82/133), 100% (48/48), 100% (82/82), 48.5% (48/181), and 71.8% (130/181), respectively. The sensitivity, specificity, PPV, and NPV of histological PCR were 89.5% (119/133), 95.8% (46/48), 98.3% (119/121), and 76.7% (46/60), respectively, demonstrating that histological PCR had significantly higher accuracy (91.2% [165/181]) than histological acid-fast staining (71.8% [130/181]), P pulmonary tuberculosis. For patients with positive histological AFB and

  11. Sequence Design for a Test Tube of Interacting Nucleic Acid Strands.

    Science.gov (United States)

    Wolfe, Brian R; Pierce, Niles A

    2015-10-16

    We describe an algorithm for designing the equilibrium base-pairing properties of a test tube of interacting nucleic acid strands. A target test tube is specified as a set of desired "on-target" complexes, each with a target secondary structure and target concentration, and a set of undesired "off-target" complexes, each with vanishing target concentration. Sequence design is performed by optimizing the test tube ensemble defect, corresponding to the concentration of incorrectly paired nucleotides at equilibrium evaluated over the ensemble of the test tube. To reduce the computational cost of accepting or rejecting mutations to a random initial sequence, the structural ensemble of each on-target complex is hierarchically decomposed into a tree of conditional subensembles, yielding a forest of decomposition trees. Candidate sequences are evaluated efficiently at the leaf level of the decomposition forest by estimating the test tube ensemble defect from conditional physical properties calculated over the leaf subensembles. As optimized subsequences are merged toward the root level of the forest, any emergent defects are eliminated via ensemble redecomposition and sequence reoptimization. After successfully merging subsequences to the root level, the exact test tube ensemble defect is calculated for the first time, explicitly checking for the effect of the previously neglected off-target complexes. Any off-target complexes that form at appreciable concentration are hierarchically decomposed, added to the decomposition forest, and actively destabilized during subsequent forest reoptimization. For target test tubes representative of design challenges in the molecular programming and synthetic biology communities, our test tube design algorithm typically succeeds in achieving a normalized test tube ensemble defect ≤1% at a design cost within an order of magnitude of the cost of test tube analysis.

  12. Isolation, cDNA cloning and gene expression of an antibacterial protein from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros.

    Science.gov (United States)

    Yang, J; Yamamoto, M; Ishibashi, J; Taniai, K; Yamakawa, M

    1998-08-01

    An antibacterial protein, designated rhinocerosin, was purified to homogeneity from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros immunized with Escherichia coli. Based on the amino acid sequence of the N-terminal region, a degenerate primer was synthesized and reverse-transcriptase PCR was performed to clone rhinocerosin cDNA. As a result, a 279-bp fragment was obtained. The complete nucleotide sequence was determined by sequencing the extended rhinocerosin cDNA clone by 5' rapid amplification of cDNA ends. The deduced amino acid sequence of the mature portion of rhinocerosin was composed of 72 amino acids without cystein residues and was shown to be rich in glycine (11.1%) and proline (11.1%) residues. Comparison of the deduced amino acid sequence of rhinocerosin with those of other antibacterial proteins indicated that it has 77.8% and 44.6% identity with holotricin 2 and coleoptrecin, respectively. Rhinocerosin had strong antibacterial activity against E. coli, Streptococcus pyogenes, Staphylococcus aureus but not against Pseudomonas aeruginosa. Results of reverse-transcriptase PCR analysis of gene expression in different tissues indicated that the rhinocerosin gene is strongly expressed in the fat body and the Malpighian tubule, and weakly expressed in hemocytes and midgut. In addition, gene expression was inducible by bacteria in the fat body, the Malpighian tubule and hemocyte but constitutive expression was observed in the midgut.

  13. [Complete genome sequencing and analyses of rabies viruses isolated from wild animals (Chinese Ferret-Badger) in Zhejiang province].

    Science.gov (United States)

    Lei, Yong-Liang; Wang, Xiao-Guang; Liu, Fu-Ming; Chen, Xiu-Ying; Ye, Bi-Feng; Mei, Jian-Hua; Lan, Jin-Quan; Tang, Qing

    2009-08-01

    Based on sequencing the full-length genomes of two Chinese Ferret-Badger, we analyzed the properties of rabies viruses genetic variation in molecular level to get information on prevalence and variation of rabies viruses in Zhejiang, and to enrich the genome database of rabies viruses street strains isolated from Chinese wildlife. Overlapped fragments were amplified by RT-PCR and full-length genomes were assembled to analyze the nucleotide and deduced protein similarities and phylogenetic analyses of the N genes from Chinese Ferret-Badger, sika deer, vole, dog. Vaccine strains were then determined. The two full-length genomes were completely sequenced to find out that they had the same genetic structure with 11 923 nts including 58 nts-Leader, 1353 nts-NP, 894 nts-PP, 609 nts-MP, 1575 nts-GP, 6386 nts-LP, and 2, 5, 5 nts- intergenic regions (IGRs), 423 nts-Pseudogene-like sequence (Psi), 70 nts-Trailer. The two full-length genomes were in accordance with the properties of Rhabdoviridae Lyssa virus by blast and multi-sequence alignment. The nucleotide and amino acid sequences among Chinese strains had the highest similarity, especially among animals of the same species. Of the two full-length genomes, the similarity in amino acid level was dramatically higher than that in nucleotide level, so that the nucleotide mutations happened in these two genomes were most probably as synonymous mutations. Compared to the referenced rabies viruses, the lengths of the five protein coding regions did not show any changes or recombination, but only with a few-point mutations. It was evident that the five proteins appeared to be stable. The variation sites and types of the two ferret badgers genomes were similar to the referenced vaccine or street strains. The two strains were genotype 1 according to the multi-sequence and phylogenetic analyses, which possessing the distinct geographyphic characteristics of China. All the evidence suggested a cue that these two ferret badgers

  14. Phylogenetic relationships between Sarcocystis species from reindeer and other Sarcocystidae deduced from ssu rRNA gene sequences

    DEFF Research Database (Denmark)

    Dahlgren, S.S.; Oliveira, Rodrigo Gouveia; Gjerde, B.

    2008-01-01

    any effect on previously inferred phylogenetic relationships within the Sarcocystidae. The complete small subunit (ssu) rRNA gene sequences of all six Sarcocystis species from reindeer were used in the phylogenetic analyses along with ssu rRNA gene sequences of 85 other members of the Coccidea. Trees...... the six species in phylogenetic analyses of the Sarcocystidae, and also to investigate the phylogenetic relationships between the species from reindeer and those from other hosts. The study also aimed at revealing whether the inclusion of six Sarcocystis species from the same intermediate host would have....... tarandivulpes, formed a sister group to other Sarcocystis species with a canine definitive host. The position of S. hardangeri on the tree suggested that it uses another type of definitive host than the other Sarcocystis species in this clade. Considering the geographical distribution and infection intensity...

  15. DNA sequence analysis, expression, distribution, and physiological role of the Xaa-prolyldipeptidyl aminopeptidase gene from Lactobacillus helveticus CNRZ32.

    Science.gov (United States)

    Yüksel, G U; Steele, J L

    1996-02-01

    Lactobacillus helveticus CNRZ32 possesses an Xaa-prolyldipeptidyl aminopeptidase (PepX), which releases amino-terminal dipeptides from peptides containing proline residues in the penultimate position. The PepX gene, designated pepX, from Lb. helveticus CNRZ32 was sequenced. Analysis of the sequence identified a putative 2379-bp pepX open-reading frame, which encodes a polypeptide of 793 amino acid residues with a deduced molecular mass of 88,111 Da. The gene shows significant sequence identity with sequenced pepX genes from lactic acid bacteria. The product of the gene contains a motif that is almost identical with the active-site motif of the serine-dependent PepX from lactococci. The introduction of pepX into Lactococcus lactis LM0230 on either pGK12 (a low-copy-number plasmid vector) or pIL253 (a high-copy-number plasmid vector) did not result in a significant increase in PepX activity, while the introduction of pepX into CNRZ32 on pGK12 resulted in a four-fold increase in PepX activity. Southern hybridization experiments revealed that the pepX gene from CNRZ32 is well conserved in lactobacilli, pediococci and streptococci. The physiological role of PepX during growth in lactobacillus MRS (a rich medium containing protein hydrolysates along with other ingredients) and milk was examined by comparing growth of CNRZ32 and a CNRZ32 PepX-negative derivative. No difference in growth rate or acid production was observed between CNRZ32 and its PepX-negative derivative in MRS. However, the CNRZ32 PepX-negative derivative grew in milk at a reduced specific growth rate when compared to wild-type CNRZ32. Introduction of the cloned PepX determinant into the CNRZ32 PepX-negative derivative resulted in a construct with a specific growth rate similar to that of wild-type CNRZ32.

  16. Isolation and amino acid sequence of a short-chain neurotoxin from an Australian elapid snake, Pseudechis australis.

    OpenAIRE

    Takasaki, C; Tamiya, N

    1985-01-01

    A short-chain neurotoxin Pseudechis australis a (toxin Pa a) was isolated from the venom of an Australian elapid snake Pseudechis australis (king brown snake) by sequential chromatography on CM-cellulose, Sephadex G-50 and CM-cellulose columns. Toxin Pa a has an LD50 (intravenous) value of 76 micrograms/kg body wt. in mice and consists of 62 amino acid residues. The amino acid sequence of Pa a shows considerable homology with those of short-chain neurotoxins of elapid snakes, especially of tr...

  17. Genome sequence of the thermophilic strain Bacillus coagulans 2-6, an efficient producer of high-optical-purity L-lactic acid.

    Science.gov (United States)

    Su, Fei; Yu, Bo; Sun, Jibin; Ou, Hong-Yu; Zhao, Bo; Wang, Limin; Qin, Jiayang; Tang, Hongzhi; Tao, Fei; Jarek, Michael; Scharfe, Maren; Ma, Cuiqing; Ma, Yanhe; Xu, Ping

    2011-09-01

    Bacillus coagulans 2-6 is an efficient producer of lactic acid. The genome of B. coagulans 2-6 has the smallest genome among the members of the genus Bacillus known to date. The frameshift mutation at the start of the d-lactate dehydrogenase sequence might be responsible for the production of high-optical-purity l-lactic acid.

  18. BLEACHING EUCALYPTUS PULPS WITH SHORT SEQUENCES

    Directory of Open Access Journals (Sweden)

    Flaviana Reis Milagres

    2011-03-01

    Full Text Available Eucalyptus spp kraft pulp, due to its high content of hexenuronic acids, is quite easy to bleach. Therefore, investigations have been made attempting to decrease the number of stages in the bleaching process in order to minimize capital costs. This study focused on the evaluation of short ECF (Elemental Chlorine Free and TCF (Totally Chlorine Free sequences for bleaching oxygen delignified Eucalyptus spp kraft pulp to 90% ISO brightness: PMoDP (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, PMoD/P (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, without washing PMoD(PO (Molybdenum catalyzed acid peroxide, chlorine dioxide and pressurized peroxide, D(EPODP (chlorine dioxide, extraction oxidative with oxygen and peroxide, chlorine dioxide and hydrogen peroxide, PMoQ(PO (Molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide, and XPMoQ(PO (Enzyme, molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide. Uncommon pulp treatments, such as molybdenum catalyzed acid peroxide (PMo and xylanase (X bleaching stages, were used. Among the ECF alternatives, the two-stage PMoD/P sequence proved highly cost-effective without affecting pulp quality in relation to the traditional D(EPODP sequence and produced better quality effluent in relation to the reference. However, a four stage sequence, XPMoQ(PO, was required to achieve full brightness using the TCF technology. This sequence was highly cost-effective although it only produced pulp of acceptable quality.

  19. [Sequencing and analysis of complete genome of rabies viruses isolated from Chinese Ferret-Badger and dog in Zhejiang province].

    Science.gov (United States)

    Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing

    2010-01-01

    Based on sequencing the full-length genomes of four Chinese Ferret-Badger and dog, we analyze the properties of rabies viruses genetic variation in molecular level, get the information about rabies viruses prevalence and variation in Zhejiang, and enrich the genome database of rabies viruses street strains isolated from China. Rabies viruses in suckling mice were isolated, overlapped fragments were amplified by RT-PCR and full-length genomes were assembled to analyze the nucleotide and deduced protein similarities and phylogenetic analyses from Chinese Ferret-Badger, dog, sika deer, vole, used vaccine strain were determined. The four full-length genomes were sequenced completely and had the same genetic structure with the length of 11, 923 nts or 11, 925 nts including 58 nts-Leader, 1353 nts-NP, 894 nts-PP, 609 nts-MP, 1575 nts-GP, 6386 nts-LP, and 2, 5, 5 nts- intergenic regions(IGRs), 423 nts-Pseudogene-like sequence (psi), 70 nts-Trailer. The four full-length genomes were in accordance with the properties of Rhabdoviridae Lyssa virus by BLAST and multi-sequence alignment. The nucleotide and amino acid sequences among Chinese strains had the highest similarity, especially among animals of the same species. Of the four full-length genomes, the similarity in amino acid level was dramatically higher than that in nucleotide level, so the nucleotide mutations happened in these four genomes were most synonymous mutations. Compared with the reference rabies viruses, the lengths of the five protein coding regions had no change, no recombination, only with a few point mutations. It was evident that the five proteins appeared to be stable. The variation sites and types of the four genomes were similar to the reference vaccine or street strains. And the four strains were genotype 1 according to the multi-sequence and phylogenetic analyses, which possessed the distinct district characteristics of China. Therefore, these four rabies viruses are likely to be street viruses

  20. Characterization of a chitinolytic enzyme from Serratia sp. KCK isolated from kimchi juice.

    Science.gov (United States)

    Kim, Hyun-Soo; Timmis, Kenneth N; Golyshin, Peter N

    2007-07-01

    The novel chitinolytic bacterium Serratia sp. KCK, which was isolated from kimchi juice, produced chitinase A. The gene coding for the chitinolytic enzyme was cloned on the basis of sequencing of internal peptides, homology search, and design of degenerated primers. The cloned open reading frame of chiA encodes for deduced polypeptide of 563 amino acid residues with a calculated molecular mass of 61 kDa and appears to correspond to a molecular mass of about 57 kDa, which excluded the signal sequence. The deduced amino acid sequence showed high similarity to those of bacterial chitinases classified as family 18 of glycosyl hydrolases. The chitinase A is an exochitinase and exhibits a greater pH range (5.0-10.0), thermostability with a temperature optimum of 40 degrees C, and substrate range other than Serratia chitinases thus far described. These results suggested that Serratia sp. KCK chitinase A can be used for biotechnological applications with good potential.

  1. Cloning and characterization of the ddc homolog encoding L-2,4-diaminobutyrate decarboxylase in Enterobacter aerogenes.

    Science.gov (United States)

    Yamamoto, S; Mutoh, N; Tsuzuki, D; Ikai, H; Nakao, H; Shinoda, S; Narimatsu, S; Miyoshi, S I

    2000-05-01

    L-2,4-diaminobutyrate decarboxylase (DABA DC) catalyzes the formation of 1,3-diaminopropane (DAP) from DABA. In the present study, the ddc gene encoding DABA DC from Enterobacter aerogenes ATCC 13048 was cloned and characterized. Determination of the nucleotide sequence revealed an open reading frame of 1470 bp encoding a 53659-Da protein of 490 amino acids, whose deduced NH2-terminal sequence was identical to that of purified DABA DC from E. aerogenes. The deduced amino acid sequence was highly similar to those of Acinetobacter baumannii and Haemophilus influenzae DABA DCs encoded by the ddc genes. The lysine-307 of the E. aerogenes DABA DC was identified as the pyridoxal 5'-phosphate binding residue by site-directed mutagenesis. Furthermore, PCR analysis revealed the distribution of E. aerogenes ddc homologs in some other species of Enterobacteriaceae. Such a relatively wide occurrence of the ddc homologs implies biological significance of DABA DC and its product DAP.

  2. Nucleic Acid Amplification Testing and Sequencing Combined with Acid-Fast Staining in Needle Biopsy Lung Tissues for the Diagnosis of Smear-Negative Pulmonary Tuberculosis.

    Directory of Open Access Journals (Sweden)

    Faming Jiang

    Full Text Available Smear-negative pulmonary tuberculosis (PTB is common and difficult to diagnose. In this study, we investigated the diagnostic value of nucleic acid amplification testing and sequencing combined with acid-fast bacteria (AFB staining of needle biopsy lung tissues for patients with suspected smear-negative PTB.Patients with suspected smear-negative PTB who underwent percutaneous transthoracic needle biopsy between May 1, 2012, and June 30, 2015, were enrolled in this retrospective study. Patients with AFB in sputum smears were excluded. All lung biopsy specimens were fixed in formalin, embedded in paraffin, and subjected to acid-fast staining and tuberculous polymerase chain reaction (TB-PCR. For patients with positive AFB and negative TB-PCR results in lung tissues, probe assays and 16S rRNA sequencing were used for identification of nontuberculous mycobacteria (NTM. The sensitivity, specificity, positive predictive value (PPV, negative predictive value (NPV, and diagnostic accuracy of PCR and AFB staining were calculated separately and in combination.Among the 220 eligible patients, 133 were diagnosed with TB (men/women: 76/57; age range: 17-80 years, confirmed TB: 9, probable TB: 124. Forty-eight patients who were diagnosed with other specific diseases were assigned as negative controls, and 39 patients with indeterminate final diagnosis were excluded from statistical analysis. The sensitivity, specificity, PPV, NPV, and accuracy of histological AFB (HAFB for the diagnosis of smear-negative were 61.7% (82/133, 100% (48/48, 100% (82/82, 48.5% (48/181, and 71.8% (130/181, respectively. The sensitivity, specificity, PPV, and NPV of histological PCR were 89.5% (119/133, 95.8% (46/48, 98.3% (119/121, and 76.7% (46/60, respectively, demonstrating that histological PCR had significantly higher accuracy (91.2% [165/181] than histological acid-fast staining (71.8% [130/181], P < 0.001. Parallel testing of histological AFB staining and PCR showed the

  3. Purification and primary structure determination of human lysosomal dipeptidase.

    Science.gov (United States)

    Dolenc, Iztok; Mihelic, Marko

    2003-02-01

    The lysosomal metallopeptidase is an enzyme that acts preferentially on dipeptides with unsubstituted N- and C-termini. Its activity is highest in slightly acidic pH. Here we describe the isolation and characterization of lysosomal dipeptidase from human kidney. The isolated enzyme has the amino-terminal sequence DVAKAIINLAVY and is a homodimer with a molecular mass of 100 kDa. So far no amino acid sequence has been determined for this metallopeptidase. The complete primary structure as deduced from the nucleotide sequence revealed that the isolated dipeptidase is similar to blood plasma glutamate carboxypeptidase.

  4. Nonlinear analysis of sequence repeats of multi-domain proteins

    Energy Technology Data Exchange (ETDEWEB)

    Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

    2007-11-15

    Many multi-domain proteins have repetitive three-dimensional structures but nearly-random amino acid sequences. In the present paper, by using a modified recurrence plot proposed by us previously, we show that these amino acid sequences have hidden repetitions in fact. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.

  5. Computer-aided visualization and analysis system for sequence evaluation

    Energy Technology Data Exchange (ETDEWEB)

    Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

    2004-05-11

    A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.

  6. Cloning and nucleotide sequence analysis of pepV, a carnosinase gene from Lactobacillus delbrueckii subsp. lactis DSM 7290, and partial characterization of the enzyme.

    Science.gov (United States)

    Vongerichten, K F; Klein, J R; Matern, H; Plapp, R

    1994-10-01

    Cell extracts of Lactobacillus delbrueckii subsp. lactis DSM 7290 were found to exhibit unique peptolytic ability against unusual beta-alanyl-dipeptides. In order to clone the gene encoding this activity, designated pepV, a gene library of strain DSM 7290 genomic DNA, prepared in the low-copy-number plasmid pLG339, was screened for heterologous expression in Escherichia coli. Recombinant clones harbouring pepV were identified by their ability to allow the utilization of carnosine (beta-alanyl-histidine) as a source of histidine by the E. coli mutant strain UK197 (pepD, hisG). Complementation was observed in a colony harbouring a recombinant plasmid (pKV101), carrying pepV. A 2.4 kb fragment containing pepV was subcloned and its nucleotide sequence revealed an open reading frame (ORF) of 1413 nucleotides, corresponding to a protein with predicted molecular mass of 51998 Da. A single transcription initiation site 71 bp upstream of the ATG translational start codon was identified by primer extension. No significant homology was detected between pepV or its deduced amino acid sequence with any entry in the databases. The only similarity was found in a region conserved in the ArgE/DapE/CPG2/YscS family of proteins. This observation, and protease inhibitor studies, indicated that pepV is of the metalloprotease type. A second ORF present in the sequenced fragment showed extensive homology to a variety of amino acid permeases from E. coli and Saccharomyces cerevisiae.

  7. Isolation and sequencing of the cryIC-like delta endotoxin gene from ...

    African Journals Online (AJOL)

    use

    2012-02-16

    Feb 16, 2012 ... rooting from transfer of shoots on basal-MS medium. Potato plantlets adapted to ... amino acid residues and contained a potential repeat motif. The deduced ... containing MS media supplemented with 3 or 4% sucrose and MS + 1.5 mg/l .... investment in time before the expressed proteins can be analysed.

  8. Temporal variability of TEC deduced from groundbased measurements

    International Nuclear Information System (INIS)

    Mosert, M.; Ezquer, R.G.; Jadur, C.; Radicella, S.M.

    2001-01-01

    This paper presents a study of the behaviour of the integrated total electron content (ITEC) deduced from electron density profiles of two Argentine stations: Tucuman (26.9 S; 294.6 E) and San Juan (31.5 S; 290.4 E). The ITEC values have been obtained by the technique proposed by Reinisch and Huang (2000). The database includes electron density profiles derived from ionograms recorded at 4 typical hours of the day (00.00, 06.00, 12.00 and 18.00 LT) during different seasonal and solar activity conditions. An analysis of the day to day variability of ITEC has also been done. (author)

  9. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

    Directory of Open Access Journals (Sweden)

    Stephan Pabinger

    Full Text Available Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM. Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage

  10. Deducing T, C, and P invariance for strong interactions in topological particle theory

    International Nuclear Information System (INIS)

    Jones, C.E.

    1985-01-01

    It is shown here how the separate discrete invariances [time reversal (T), charge conjugation (C), and parity (P)] in strong interactions can be deduced as consequences of other S-matrix requirements in topological particle theory

  11. First draft genome sequencing of indole acetic acid producing and plant growth promoting fungus Preussia sp. BSL10.

    Science.gov (United States)

    Khan, Abdul Latif; Asaf, Sajjad; Khan, Abdur Rahim; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2016-05-10

    Preussia sp. BSL10, family Sporormiaceae, was actively producing phytohormone (indole-3-acetic acid) and extra-cellular enzymes (phosphatases and glucosidases). The fungus was also promoting the growth of arid-land tree-Boswellia sacra. Looking at such prospects of this fungus, we sequenced its draft genome for the first time. The Illumina based sequence analysis reveals an approximate genome size of 31.4Mbp for Preussia sp. BSL10. Based on ab initio gene prediction, total 32,312 coding sequences were annotated consisting of 11,967 coding genes, pseudogenes, and 221 tRNA genes. Furthermore, 321 carbohydrate-active enzymes were predicted and classified into many functional families. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

    OpenAIRE

    Kuhn, H; Fietzek, P P; Lampen, J O

    1982-01-01

    The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.

  13. Acid mine drainage neutralization in a pilot sequencing batch reactor using limestone from a paper and pulp industry

    CSIR Research Space (South Africa)

    Vadapalli, VRK

    2015-10-01

    Full Text Available This study investigated the implications of using two grades of limestone from a paper and pulp industry for neutralization of acid mine drainage (AMD) in a pilot sequencing batch reactor (SBR). In this regard, two grades of calcium carbonate were...

  14. Novel algorithms for protein sequence analysis

    NARCIS (Netherlands)

    Ye, Kai

    2008-01-01

    Each protein is characterized by its unique sequential order of amino acids, the so-called protein sequence. Biology”s paradigm is that this order of amino acids determines the protein”s architecture and function. In this thesis, we introduce novel algorithms to analyze protein sequences. Chapter 1

  15. Influence of the Amino Acid Sequence on Protein-Mineral Interactions in Soil

    Science.gov (United States)

    Chacon, S. S.; Reardon, P. N.; Purvine, S.; Lipton, M. S.; Washton, N.; Kleber, M.

    2017-12-01

    The intimate associations between protein and mineral surfaces have profound impacts on nutrient cycling in soil. Proteins are an important source of organic C and N, and a subset of proteins, extracellular enzymes (EE), can catalyze the depolymerization of soil organic matter (SOM). Our goal was to determine how variation in the amino acid sequence could influence a protein's susceptibility to become chemically altered by mineral surfaces to infer the fate of adsorbed EE function in soil. We hypothesized that (1) addition of charged amino acids would enhance the adsorption onto oppositely charged mineral surfaces (2) addition of aromatic amino acids would increase adsorption onto zero charged surfaces (3) Increase adsorption of modified proteins would enhance their susceptibility to alterations by redox active minerals. To test these hypotheses, we generated three engineered proxies of a model protein Gb1 (IEP 4.0, 6.2 kDA) by inserting either negatively charged, positively charged or aromatic amino acids in the second loop. These modified proteins were allowed to interact with functionally different mineral surfaces (goethite, montmorillonite, kaolinite and birnessite) at pH 5 and 7. We used LC-MS/MS and solution-state Heteronuclear Single Quantum Coherence Spectroscopy NMR to observe modifications on engineered proteins as a consequence to mineral interactions. Preliminary results indicate that addition of any amino acids to a protein increase its susceptibility to fragmentation and oxidation by redox active mineral surfaces, and alter adsorption to the other mineral surfaces. This suggest that not all mineral surfaces in soil may act as sorbents for EEs and chemical modification of their structure should also be considered as an explanation for decrease in EE activity. Fragmentation of proteins by minerals can bypass the need to produce proteases, but microbial acquisition of other nutrients that require enzymes such as cellulases, ligninases or phosphatases

  16. Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

    Science.gov (United States)

    Kimura, M; Kimura, J; Hatakeyama, T

    1988-11-21

    The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).

  17. Deducing the temporal order of cofactor function in ligand-regulated gene transcription: theory and experimental verification.

    Science.gov (United States)

    Dougherty, Edward J; Guo, Chunhua; Simons, S Stoney; Chow, Carson C

    2012-01-01

    Cofactors are intimately involved in steroid-regulated gene expression. Two critical questions are (1) the steps at which cofactors exert their biological activities and (2) the nature of that activity. Here we show that a new mathematical theory of steroid hormone action can be used to deduce the kinetic properties and reaction sequence position for the functioning of any two cofactors relative to a concentration limiting step (CLS) and to each other. The predictions of the theory, which can be applied using graphical methods similar to those of enzyme kinetics, are validated by obtaining internally consistent data for pair-wise analyses of three cofactors (TIF2, sSMRT, and NCoR) in U2OS cells. The analysis of TIF2 and sSMRT actions on GR-induction of an endogenous gene gave results identical to those with an exogenous reporter. Thus new tools to determine previously unobtainable information about the nature and position of cofactor action in any process displaying first-order Hill plot kinetics are now available.

  18. Sequence Analysis of the Cryptic Plasmid pMG101 from Rhodopseudomonas palustris and Construction of Stable Cloning Vectors

    Science.gov (United States)

    Inui, Masayuki; Roh, Jung Hyeob; Zahn, Kenneth; Yukawa, Hideaki

    2000-01-01

    A 15-kb cryptic plasmid was obtained from a natural isolate of Rhodopseudomonas palustris. The plasmid, designated pMG101, was able to replicate in R. palustris and in closely related strains of Bradyrhizobium japonicum and phototrophic Bradyrhizobium species. However, it was unable to replicate in the purple nonsulfur bacterium Rhodobacter sphaeroides and in Rhizobium species. The replication region of pMG101 was localized to a 3.0-kb SalI-XhoI fragment, and this fragment was stably maintained in R. palustris for over 100 generations in the absence of selection. The complete nucleotide sequence of this fragment revealed two open reading frames (ORFs), ORF1 and ORF2. The deduced amino acid sequence of ORF1 is similar to sequences of Par proteins, which mediate plasmid stability from certain plasmids, while ORF2 was identified as a putative rep gene, coding for an initiator of plasmid replication, based on homology with the Rep proteins of several other plasmids. The function of these sequences was studied by deletion mapping and gene disruptions of ORF1 and ORF2. pMG101-based Escherichia coli-R. palustris shuttle cloning vectors pMG103 and pMG105 were constructed and were stably maintained in R. palustris growing under nonselective conditions. The ability of plasmid pMG101 to replicate in R. palustris and its close phylogenetic relatives should enable broad application of these vectors within this group of α-proteobacteria. PMID:10618203

  19. Quantum-Sequencing: Fast electronic single DNA molecule sequencing

    Science.gov (United States)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.

  20. Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

    Science.gov (United States)

    Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

    2007-01-01

    The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria, responsible for extreme acid sensitivity especially in Escherichia coli and Shigella flexneri. Here we found the molecular evidence that proved the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC coded acid sensitive antiporter showed many conserved residue patterns at regular intervals at the N-terminal region. It was observed that as the alignment approaches towards the C-terminal, the number of conserved residues decreases, indicating that the N-terminal region of this protein has much active role when compared to the carboxyl terminal. The motif, FHLVFFLLLGG, is well conserved within the entire gadC coded protein at the amino terminal. The motif is also partially conserved among other antiporters (which are not coded by gadC) but involved in acid sensitive/resistance mechanism. Phylogenetic cluster analysis proves the relationship of Escherichia coli and Shigella flexneri. The gadC coded proteins are converged as a clade and diverged from other antiporters belongs to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792

  1. Geometrical primitives reconstruction from image sequence in an interactive context

    International Nuclear Information System (INIS)

    Monchal, L.; Aubry, P.

    1995-01-01

    We propose a method to recover 3D geometrical shape from image sequence, in a context of man machine co-operation. The human operator has to point out the edges of an object in the first image and choose a corresponding geometrical model. The algorithm tracks each relevant 2D segments describing surface discontinuities or limbs, in the images. Then, knowing motion of the camera between images, the positioning and the size of the virtual object are deduced by minimising a function. The function describes how well the virtual objects is linked to the extracted segments of the sequence, its geometrical model and pieces of information given by the operator. (author). 13 refs., 7 figs., 8 tabs

  2. Cloning, Phylogenetic Analysis and 3D Modeling of a Putative Lysosomal Acid Lipase from the Camel, Camelus dromedarius

    Directory of Open Access Journals (Sweden)

    Farid Shokry Ataya

    2012-08-01

    Full Text Available Acid lipase belongs to a family of enzymes that is mainly present in lysosomes of different organs and the stomach. It is characterized by its capacity to withstand acidic conditions while maintaining high lipolytic activity. We cloned for the first time the full coding sequence of camel’s lysosomal acid lipase, cLIPA using RT-PCR technique (Genbank accession numbers JF803951 and AEG75815, for the nucleotide and aminoacid sequences respectively. The cDNA sequencing revealed an open reading frame of 1,197 nucleotides that encodes a protein of 399 aminoacids which was similar to that from other related mammalian species. Bioinformatic analysis was used to determine the aminoacid sequence, 3D structure and phylogeny of cLIPA. Bioinformatics analysis suggested the molecular weight of the translated protein to be 45.57 kDa, which could be decreased to 43.16 kDa after the removal of a signal peptide comprising the first 21 aminoacids. The deduced cLIPA sequences exhibited high identity with Equus caballus (86%, Numascus leucogenys (85%, Homo sapiens (84%, Sus scrofa (84%, Bos taurus (82% and Ovis aries (81%. cLIPA shows high aminoacid sequence identity with human and dog-gastric lipases (58%, and 59% respectively which makes it relevant to build a 3D structure model for cLIPA. The comparison confirms the presence of the catalytic triad and the oxyanion hole in cLIPA. Phylogenetic analysis revealed that camel cLIPA is grouped with monkey, human, pig, cow and goat. The level of expression of cLIPA in five camel tissues was examined using Real Time-PCR. The highest level of cLIPA transcript was found in the camel testis (162%, followed by spleen (129%, liver (100%, kidney (20.5% and lung (17.4%.

  3. Ferritin from the Pacific abalone Haliotis discus hannai: Analysis of cDNA sequence, expression, and activity.

    Science.gov (United States)

    Qiu, Reng; Kan, Yunchao; Li, Dandan

    2016-02-01

    Ferritin plays an important role in iron homeostasis due to its ability to bind and sequester large amounts of iron. In this study, the gene encoding a ferritin (HdhFer2) was cloned from Pacific abalone (Haliotis discus hannai). The full-length cDNA of HdhFer2 contains a 5'-UTR of 121 bp, an ORF of 516 bp, and a 3'-UTR of 252 bp with a polyadenylation signal sequence of AATAAA and a poly(A) tail. It also contains a 31 bp iron-responsive element (IRE) in the 5'-UTR position, which is conserved in many ferritins. HdhFer2 consists of 171 amino acid residues with a predicted molecular weight (MW) ∼19.8 kDa and a theoretical isoelectric point (PI) of 4.84. The deduced amino acid sequence of HdhFer2 contains two ferritin iron-binding region signatures (IBRSs). HdhFer2 mRNA was detected in a wide range of tissues and was dominantly expressed in the gill. Infection with the bacterial pathogen Vibrio anguillarum significantly upregulated HdhFer2 expression in a time-dependent manner. Recombinant HdhFer2 (rHdhFer2) purified from Escherichia coli was able to bind ferrous iron in a concentration-dependent manner. In summary, these results suggest that HdhFer2 is a crucial protein in the iron-withholding defense system, and plays an important role in the innate immune response of abalone. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    Science.gov (United States)

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  5. A New Approach to Sequence Analysis Exemplified by Identification of cis-Elements in Abscisic Acid Inducible Promoters

    DEFF Research Database (Denmark)

    Busk, Peter Kamp; Hallin, Peter Fischer; Salomon, Jesper

    -regulatory elements. We have developed a method for identifying short, conserved motifs in biological sequences such as proteins, DNA and RNA5. This method was used for analysis of approximately 2000 Arabidopsis thaliana promoters that have been shown by DNA array analysis to be induced by abscisic acid6....... These promoters were compared to 28000 promoters that are not induced by abscisic acid. The analysis identified previously described ABA-inducible promoter elements such as ABRE, CE3 and CRT1 but also new cis-elements were found. Furthermore, the list of DNA elements could be used to predict ABA...

  6. Cloning and expression of cDNA coding for bouganin.

    Science.gov (United States)

    den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

    2002-03-01

    Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.

  7. A new earthworm cellulase and its possible role in the innate immunity.

    Science.gov (United States)

    Park, In Yong; Cha, Ju Roung; Ok, Suk-Mi; Shin, Chuog; Kim, Jin-Se; Kwak, Hee-Jin; Yu, Yun-Sang; Kim, Yu-Kyung; Medina, Brenda; Cho, Sung-Jin; Park, Soon Cheol

    2017-02-01

    A new endogenous cellulase (Ean-EG) from the earthworm, Eisenia andrei and its expression pattern are demonstrated. Based on a deduced amino acid sequence, the open reading frame (ORF) of Ean-EG consisted of 1368 bps corresponding to a polypeptide of 456 amino acid residues in which is contained the conserved region specific to GHF9 that has the essential amino acid residues for enzyme activity. In multiple alignments and phylogenetic analysis, the deduced amino acid sequence of Ean- EG showed the highest sequence similarity (about 79%) to that of an annelid (Pheretima hilgendorfi) and could be clustered together with other GHF9 cellulases, indicating that Ean-EG could be categorized as a member of the GHF9 to which most animal cellulases belong. The histological expression pattern of Ean-EG mRNA using in situ hybridization revealed that the most distinct expression was observed in epithelial cells with positive hybridization signal in epidermis, chloragogen tissue cells, coelomic cell-aggregate, and even blood vessel, which could strongly support the fact that at least in the earthworm, Eisenia andrei, cellulase function must not be limited to digestive process but be possibly extended to the innate immunity. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

    Science.gov (United States)

    Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

    1999-04-16

    A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.

  9. Allelic diversity of the MHC class II DRB genes in brown bears (Ursus arctos) and a comparison of DRB sequences within the family Ursidae.

    Science.gov (United States)

    Goda, N; Mano, T; Kosintsev, P; Vorobiev, A; Masuda, R

    2010-11-01

    The allelic diversity of the DRB locus in major histocompatibility complex (MHC) genes was analyzed in the brown bear (Ursus arctos) from the Hokkaido Island of Japan, Siberia, and Kodiak of Alaska. Nineteen alleles of the DRB exon 2 were identified from a total of 38 individuals of U. arctos and were highly polymorphic. Comparisons of non-synonymous and synonymous substitutions in the antigen-binding sites of deduced amino acid sequences indicated evidence for balancing selection on the bear DRB locus. The phylogenetic analysis of the DRB alleles among three genera (Ursus, Tremarctos, and Ailuropoda) in the family Ursidae revealed that DRB allelic lineages were not separated according to species. This strongly shows trans-species persistence of DRB alleles within the Ursidae. © 2010 John Wiley & Sons A/S.

  10. Phylogenetic analysis of Hungarian goose parvovirus isolates and vaccine strains.

    Science.gov (United States)

    Tatár-Kis, Tímea; Mató, Tamás; Markos, Béla; Palya, Vilmos

    2004-08-01

    Polymerase chain reaction and sequencing were used to analyse goose parvovirus field isolates and vaccine strains. Two fragments of the genome were amplified. Fragment "A" represents a region of VP3 gene, while fragment "B" represents a region upstream of the VP3 gene, encompassing part of the VP1 gene. In the region of fragment "A" the deduced amino acid sequence of the strains was identical, therefore differentiation among strains could be done only at the nucleotide level, which resulted in the formation of three groups: Hungarian, West-European and Asian strains. In the region of fragment "B", separation of groups could be done by both nucleotide and deduced amino acid sequence level. The nucleotide sequences resulted in the same groups as for fragment "A" but with a different clustering pattern among the Hungarian strains. Within the "Hungarian" group most of the recent field isolates fell into one cluster, very closely related or identical to each other, indicating a very slow evolutionary change. The attenuated strains and field isolates from 1979/80 formed a separate cluster. When vaccine strains and field isolates were compared, two specific amino acid differences were found that can be considered as possible markers for vaccinal strains. Sequence analysis of fragment "B" seems to be a suitable method for differentiation of attenuated vaccine strains from virulent strains. Copyright 2004 Houghton Trust Ltd

  11. Characterization of the N-linked glycosylation site of recombinant pectate lyase

    NARCIS (Netherlands)

    Colangelo, J.; Licon, V.; Benen, J.A.E.; Visser, J.; Bergmann, C.; Orlando, R.

    1999-01-01

    Recombinant pectate lyase from Aspergillus niger was overexpressed in Aspergillus nidulans. The two recombinant proteins produced differed in molecular mass by 1200 Da, which suggested that the larger molecular weight protein was glycosylated. The deduced amino acid sequence was searched for

  12. Irritable bowel syndrome-diarrhea: characterization of genotype by exome sequencing, and phenotypes of bile acid synthesis and colonic transit

    Science.gov (United States)

    Klee, Eric W.; Shin, Andrea; Carlson, Paula; Li, Ying; Grover, Madhusudan; Zinsmeister, Alan R.

    2013-01-01

    The study objectives were: to mine the complete exome to identify putative rare single nucleotide variants (SNVs) associated with irritable bowel syndrome (IBS)-diarrhea (IBS-D) phenotype, to assess genes that regulate bile acids in IBS-D, and to explore univariate associations of SNVs with symptom phenotype and quantitative traits in an independent IBS cohort. Using principal components analysis, we identified two groups of IBS-D (n = 16) with increased fecal bile acids: rapid colonic transit or high bile acids synthesis. DNA was sequenced in depth, analyzing SNVs in bile acid genes (ASBT, FXR, OSTα/β, FGF19, FGFR4, KLB, SHP, CYP7A1, LRH-1, and FABP6). Exome findings were compared with those of 50 similar ethnicity controls. We assessed univariate associations of each SNV with quantitative traits and a principal components analysis and associations between SNVs in KLB and FGFR4 and symptom phenotype in 405 IBS, 228 controls and colonic transit in 70 IBS-D, 71 IBS-constipation. Mining the complete exome did not reveal significant associations with IBS-D over controls. There were 54 SNVs in 10 of 11 bile acid-regulating genes, with no SNVs in FGF19; 15 nonsynonymous SNVs were identified in similar proportions of IBS-D and controls. Variations in KLB (rs1015450, downstream) and FGFR4 [rs434434 (intronic), rs1966265, and rs351855 (nonsynonymous)] were associated with colonic transit (rs1966265; P = 0.043), fecal bile acids (rs1015450; P = 0.064), and principal components analysis groups (all 3 FGFR4 SNVs; P transit (P = 0.066). Thus exome sequencing identified additional variants in KLB and FGFR4 associated with bile acids or colonic transit in IBS-D. PMID:24200957

  13. Amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui.

    Science.gov (United States)

    Hatakeyama, T; Hatakeyama, T

    1990-07-06

    The complete amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui were determined. Protein HL30 was found to be acetylated at its N-terminal amino acid and shows homology to the eukaryotic ribosomal proteins YL34 from yeast and RL31 from rat. Protein HmaL5 was homologous to the protein L5 from Escherichia coli and Bacillus stearothermophilus as well as to YL16 from yeast. HmaL5 shows more similarities to its eukaryotic counterpart than to eubacterial ones.

  14. Heavy-ion optical potential for sub-barrier fusion deduced from a dispersion relation

    International Nuclear Information System (INIS)

    Kim, B.T.; Kim, H.C.; Park, K.E.

    1988-01-01

    The heavy-ion energy-dependent optical potentials for the 16 O+ 208 Pb system are deduced from a dispersion relation. These potentials are used to analyze the elastic scattering, fusion, and spin distributions of compound nuclei for the system in a unified way based on the direct reaction theory. It turns out that the energy dependence of the optical potential is essential in explaining the data at near- and sub-barrier energies. The real part of the energy-dependent optical potential deduced was also used in calculating the elastic and fusion cross sections by the conventional barrier penetration model using an incoming wave boundary condition. The predictions of the elastic scattering, fusion cross sections, and the spin distributions of compound nuclei are not satisfactory compared with those from the direct reaction approach. It seems to originate from the fact that this model neglects absorption around the Coulomb barrier region

  15. A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

    Science.gov (United States)

    Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

    1995-04-01

    The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).

  16. Improved purification, crystallization and primary structure of pyruvate:ferredoxin oxidoreductase from Halobacterium halobium.

    Science.gov (United States)

    Plaga, W; Lottspeich, F; Oesterhelt, D

    1992-04-01

    An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.

  17. Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

    Science.gov (United States)

    Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

    2012-08-01

    Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or 15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.

  18. Functional analysis of fructosyl-amino acid oxidases of Aspergillus oryzae.

    Science.gov (United States)

    Akazawa, Shin-Ichi; Karino, Tetsuya; Yoshida, Nobuyuki; Katsuragi, Tohoru; Tani, Yoshiki

    2004-10-01

    Three active fractions of fructosyl-amino acid oxidase (FAOD-Ao1, -Ao2a, and -Ao2b) were isolated from Aspergillus oryzae strain RIB40. N-terminal and internal amino acid sequences of FAOD-Ao2a corresponded to those of FAOD-Ao2b, suggesting that these two isozymes were derived from the same protein. FAOD-Ao1 and -Ao2 were different in substrate specificity and subunit assembly; FAOD-Ao2 was active toward N(epsilon)-fructosyl N(alpha)-Z-lysine and fructosyl valine (Fru-Val), whereas FAOD-Ao1 was not active toward Fru-Val. The genes encoding the FAOD isozymes (i.e., FAOAo1 and FAOAo2) were cloned by PCR with an FAOD-specific primer set. The deduced amino acid sequences revealed that FAOD-Ao1 was 50% identical to FAOD-Ao2, and each isozyme had a peroxisome-targeting signal-1, indicating their localization in peroxisomes. The genes was expressed in Escherichia coli and rFaoAo2 showed the same characteristics as FAOD-Ao2, whereas rFaoAo1 was not active. FAOAo2 disruptant was obtained by using ptrA as a selective marker. Wild-type strain grew on the medium containing Fru-Val as the sole carbon and nitrogen sources, but strain Delta faoAo2 did not grow. Addition of glucose or (NH(4))(2)SO(4) to the Fru-Val medium did not affect the assimilation of Fru-Val by wild-type, indicating glucose and ammonium repressions did not occur in the expression of the FAOAo2 gene. Furthermore, conidia of the wild-type strain did not germinate on the medium containing Fru-Val and NaNO(2) as the sole carbon and nitrogen sources, respectively, suggesting that Fru-Val may also repress gene expression of nitrite reductase. These results indicated that FAOD is needed for utilization of fructosyl-amino acids as nitrogen sources in A. oryzae.

  19. Isolation and Molecular Characterization of High Molecular Weight Glutenin Subunit Genes 1Bx13 and 1By16 from Hexaploid Wheat

    Institute of Scientific and Technical Information of China (English)

    Bin-Shuang Pang; Xue-Yong Zhang

    2008-01-01

    The high molecular weight glutenin subunit (HMW-GS) pair 1Bx13+1Byt6 are recognized to positively correlate with bread-making quality; however, their molecular data remain unknown. In order to reveal the mechanism by which 1By16 and 1Bx13 creates high quality, their open reading frames (ORFs) were amplified from common wheat Atlas66 and Jimai 20 using primers that were designed based on published sequences of HMW glutenin genes. The ORF of 1By16 was 2220bp, deduced into 738 amino acid residues with seven cysteines including 59 hexapeptides and 22 nanopeptides motifs. The ORF of 1Bx13 was 2385bp, deduced into 795 amino acid residues with four cysteines including 68 hexapeptides, 25 nanopeptides and six tripeptides motifs. We found that 1By16 was the largest y-type HMW glutenin gene described to date in common wheat. The 1By16 had 36 amino acid residues inserted in the central repetitive domain compared with 1By15. Expression in bacteria and western-blot tests confirmed that the sequence cloned was the ORF of HMW-GS 1By16, and that 1Bx13 was one of the largest 1Bx genes that have been described so far in common wheat, exhibiting a hexapeptide (PGQGQQ) insertion in the end of central repetitive domain compared with 1Bx7. A phylogenetic tree based on the deduced full-length amino acid sequence alignment of the published HMW-GS genes showed that the 1By16 was clustered with Glu-IB-2, and that the 1Bx13 was clustered with Glu-1B-1 alleles.

  20. Kinetics of Oxidation of 3-Benzoylpropionic Acid by N-Bromoacetamide in Aqueous Acetic Acid Medium

    Directory of Open Access Journals (Sweden)

    N. A. Mohamed Farook

    2011-01-01

    Full Text Available The kinetics of oxidation of 3-benzoylpropionic acid (KA with N-bromoacetamide (NBA have been studied potentiometrically in 50:50 (v/v aqueous acetic acid medium at 298 K The reaction was first order each with respect to [KA], [NBA] and [H+]. The main product of the oxidation is the corresponding carboxylic acid. The rate decreases with the addition of acetamide, one of the products of the reaction. Variation in ionic strength of the reaction medium has no significant effect on the rate of oxidation. But the rate of the reaction is enhanced by lowering the dielectric constant of the reaction medium. A mechanism consistent with observed results have been proposed and the related rate law was deduced.

  1. Primary and secondary structural analyses of glutathione S-transferase pi from human placenta.

    Science.gov (United States)

    Ahmad, H; Wilson, D E; Fritz, R R; Singh, S V; Medh, R D; Nagle, G T; Awasthi, Y C; Kurosky, A

    1990-05-01

    The primary structure of glutathione S-transferase (GST) pi from a single human placenta was determined. The structure was established by chemical characterization of tryptic and cyanogen bromide peptides as well as automated sequence analysis of the intact enzyme. The structural analysis indicated that the protein is comprised of 209 amino acid residues and gave no evidence of post-translational modifications. The amino acid sequence differed from that of the deduced amino acid sequence determined by nucleotide sequence analysis of a cDNA clone (Kano, T., Sakai, M., and Muramatsu, M., 1987, Cancer Res. 47, 5626-5630) at position 104 which contained both valine and isoleucine whereas the deduced sequence from nucleotide sequence analysis identified only isoleucine at this position. These results demonstrated that in the one individual placenta studied at least two GST pi genes are coexpressed, probably as a result of allelomorphism. Computer assisted consensus sequence evaluation identified a hydrophobic region in GST pi (residues 155-181) that was predicted to be either a buried transmembrane helical region or a signal sequence region. The significance of this hydrophobic region was interpreted in relation to the mode of action of the enzyme especially in regard to the potential involvement of a histidine in the active site mechanism. A comparison of the chemical similarity of five known human GST complete enzyme structures, one of pi, one of mu, two of alpha, and one microsomal, gave evidence that all five enzymes have evolved by a divergent evolutionary process after gene duplication, with the microsomal enzyme representing the most divergent form.

  2. Details of the evolutionary history from invertebrates to vertebrates, as deduced from the sequences of 18S rDNA.

    Science.gov (United States)

    Wada, H; Satoh, N

    1994-01-01

    Almost the entire sequences of 18S rDNA were determined for two chaetognaths, five echinoderms, a hemichordate, and two urochordates (a larvacean and a salp). Phylogenetic comparisons of the sequences, together with those of other deuterostomes (an ascidian, a cephalochordate, and vertebrates) and protostomes (an arthropod and a mollusc), suggest the monophyly of the deuterostomes, with the exception of the chaetognaths. Chaetognaths may not be a group of deuterostomes. The deuterostome group closest to vertebrates was the group of cephalochordates. Ascidians, larvaceans, and salps seem to form a discrete group (urochordates), in which the early divergence of larvaceans is evident. These results support the hypothesis that chordates evolved from free-living ancestors. PMID:8127885

  3. Sequence and expression analyses of porcine ISG15 and ISG43 genes.

    Science.gov (United States)

    Huang, Jiangnan; Zhao, Shuhong; Zhu, Mengjin; Wu, Zhenfang; Yu, Mei

    2009-08-01

    The coding sequences of porcine interferon-stimulated gene 15 (ISG15) and the interferon-stimulated gene (ISG43) were cloned from swine spleen mRNA. The amino acid sequences deduced from porcine ISG15 and ISG43 genes coding sequence shared 24-75% and 29-83% similarity with ISG15s and ISG43s from other vertebrates, respectively. Structural analyses revealed that porcine ISG15 comprises two ubiquitin homologues motifs (UBQ) domain and a conserved C-terminal LRLRGG conjugating motif. Porcine ISG43 contains an ubiquitin-processing proteases-like domain. Phylogenetic analyses showed that porcine ISG15 and ISG43 were mostly related to rat ISG15 and cattle ISG43, respectively. Using quantitative real-time PCR assay, significant increased expression levels of porcine ISG15 and ISG43 genes were detected in porcine kidney endothelial cells (PK15) cells treated with poly I:C. We also observed the enhanced mRNA expression of three members of dsRNA pattern-recognition receptors (PRR), TLR3, DDX58 and IFIH1, which have been reported to act as critical receptors in inducing the mRNA expression of ISG15 and ISG43 genes. However, we did not detect any induced mRNA expression of IFNalpha and IFNbeta, suggesting that transcriptional activations of ISG15 and ISG43 were mediated through IFN-independent signaling pathway in the poly I:C treated PK15 cells. Association analyses in a Landrace pig population revealed that ISG15 c.347T>C (BstUI) polymorphism and the ISG43 c.953T>G (BccI) polymorphism were significantly associated with hematological parameters and immune-related traits.

  4. Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B

    International Nuclear Information System (INIS)

    Brown-Shimer, S.; Johnson, K.A.; Bruskin, A.; Green, N.R.; Hill, D.E.; Lawrence, J.B.; Johnson, C.

    1990-01-01

    The inactivation of growth suppressor genes appears to play a major role in the malignant process. To assess whether protein phosphotyrosyl phosphatases function as growth suppressors, the authors have isolated a cDNA clone encoding human protein phosphotyrosyl phosphatase 1B for structural and functional characterization. The translation product deduced from the 1,305-nucleotide open reading frame predicts a protein containing 435 amino acids and having a molecular mass of 49,966 Da. The amino-terminal 321 amino acids deduced from the cDNA sequence are identical to the empirically determined sequence of protein phosphotyrosyl phosphatase 1B. A genomic clone has been isolated and used in an in situ hybridization to banded metaphase chromosomes to determine that the gene encoding protein phosphotyrosyl phosphatase 1B maps as a single-copy gene to the long arm of chromosome 20 in the region q13.1-q13.2

  5. Complete Genomic Sequence of Border Disease Virus, a Pestivirus from Sheep

    Science.gov (United States)

    Becher, Paul; Orlich, Michaela; Thiel, Heinz-Jürgen

    1998-01-01

    The genus Pestivirus of the family Flaviviridae comprises three established species, namely, bovine viral diarrhea virus (BVDV), classical swine fever virus (CSFV), and border disease virus from sheep (BDV). In this study, we report the first complete nucleotide sequence of BDV, that of strain X818. The genome is 12,333 nucleotides long and contains one long open reading frame encoding 3,895 amino acids. The 5′ noncoding region (NCR) of BDV X818 consists of 372 nucleotides and is thus similar in length to the 5′ NCR reported for other pestiviruses. The 3′ NCR of X818 is 273 nucleotides long and thereby at least 32 nucleotides longer than the 3′ NCR of pestiviruses analyzed thus far. Within the 3′ NCR of BDV X818, the sequence motif TATTTATTTA was identified at four locations. The same repeat was found at two or three locations within the 3′ NCR of different CSFV isolates but was absent in the 3′ NCR of BVDV. Analysis of five additional BDV strains showed that the 3′ NCR sequences are highly conserved within this species. Comparison of the deduced amino acid sequence of X818 with the ones of other pestiviruses allowed the prediction of polyprotein cleavage sites which were conserved with regard to the structural proteins. It has been reported for two BVDV strains that cleavage at the nonstructural (NS) protein sites 3/4A, 4A/4B, 4B/5A, and 5A/5B is mediated by the NS3 serine protease and for each site a conserved leucine was found at the P1 position followed by either serine or alanine at P1′ (N. Tautz, K. Elbers, D. Stoll, G. Meyers, and H.-J. Thiel, J. Virol. 71:5415–5422, 1997; J. Xu, E. Mendez, P. R. Caron, C. Lin, M. A. Murcko, M. S. Collett, and C. M. Rice, J. Virol. 71:5312–5322). Interestingly, P1′ of the predicted NS5A/5B cleavage site of BDV is represented by an asparagine residue. Transient expression studies demonstrated that this unusual NS5A/5B processing site is efficiently cleaved by the NS3 serine protease of BDV. PMID

  6. from Eriocheir sinensis with antimicrobial activity

    African Journals Online (AJOL)

    The cDNA of a new Eriocheir sinensis ALF (designated as EsALF-3) was obtained based on EST analysis. The full-length cDNA was of 956 bp, consisting of an open reading frame (ORF) of 369 bp encoding a polypeptide of 123 amino acids. In the deduced amino acid sequence of EsALF-3, there were two highly ...

  7. Sequence variations in the FAD2 gene in seeded pumpkins.

    Science.gov (United States)

    Ge, Y; Chang, Y; Xu, W L; Cui, C S; Qu, S P

    2015-12-21

    Seeded pumpkins are important economic crops; the seeds contain various unsaturated fatty acids, such as oleic acid and linoleic acid, which are crucial for human and animal nutrition. The fatty acid desaturase-2 (FAD2) gene encodes delta-12 desaturase, which converts oleic acid to linoleic acid. However, little is known about sequence variations in FAD2 in seeded pumpkins. Twenty-seven FAD2 clones from 27 accessions of Cucurbita moschata, Cucurbita maxima, Cucurbita pepo, and Cucurbita ficifolia were obtained (totally 1152 bp; a single gene without introns). More than 90% nucleotide identities were detected among the 27 FAD2 clones. Nucleotide substitution, rather than nucleotide insertion and deletion, led to sequence polymorphism in the 27 FAD2 clones. Furthermore, the 27 FAD2 selected clones all encoded the FAD2 enzyme (delta-12 desaturase) with amino acid sequence identities from 91.7 to 100% for 384 amino acids. The same main-function domain between 47 and 329 amino acids was identified. The four species clustered separately based on differences in the sequences that were identified using the unweighted pair group method with arithmetic mean. Geographic origin and species were found to be closely related to sequence variation in FAD2.

  8. Identification and characterization of novel reptile cathelicidins from elapid snakes.

    Science.gov (United States)

    Zhao, Hui; Gan, Tong-Xiang; Liu, Xiao-Dong; Jin, Yang; Lee, Wen-Hui; Shen, Ji-Hong; Zhang, Yun

    2008-10-01

    Three cDNA sequences coding for elapid cathelicidins were cloned from constructed venom gland cDNA libraries of Naja atra, Bungarus fasciatus and Ophiophagus hannah. The open reading frames of the cloned elapid cathelicidins were all composed of 576bp and coded for 191 amino acid residue protein precursors. Each of the deduced elapid cathelicidin has a 22 amino acid residue signal peptide, a conserved cathelin domain of 135 amino acid residues and a mature antimicrobial peptide of 34 amino acid residues. Unlike the highly divergent cathelicidins in mammals, the nucleotide and deduced protein sequences of the three cloned elapid cathelicidins were remarkably conserved. All the elapid mature cathelicidins were predicted to be cleaved at Valine157 by elastase. OH-CATH, the deduced mature cathelicidin from king cobra, was chemically synthesized and it showed strong antibacterial activity against various bacteria with minimal inhibitory concentration of 1-20microg/ml in the presence of 1% NaCl. Meanwhile, the synthetic peptide showed no haemolytic activity toward human red blood cells even at a high dose of 200microg/ml. Phylogenetic analysis of cathelicidins from vertebrate suggested that elapid and viperid cathelicidins were grouped together in the tree. Snake cathelicidins were evolutionary closely related to the neutrophilic granule proteins (NGPs) from mouse, rat and rabbit. Snake cathelicidins also showed a close relationship with avian fowlicidins (1-3) and chicken myeloid antimicrobial peptide 27. Elapid cathelicidins might be used as models for the development of novel therapeutic drugs.

  9. Complementary DNA and derived amino acid sequence of the α subunit of human complement protein C8: evidence for the existence of a separate α subunit messenger RNA

    International Nuclear Information System (INIS)

    Rao, A.G.; Howard, O.M.Z.; Ng, S.C.; Whitehead, A.S.; Colten, H.R.; Sodetz, J.M.

    1987-01-01

    The entire amino acid sequence of the α subunit (M/sub r/ 64,000) of the eight component of complement (C8) was determined by characterizing cDNA clones isolated from a human liver cDNA library. Two clones with overlapping inserts of net length 2.44 kilobases (kb) were isolated and found to contain the entire α coding region [1659 base pairs (bp)]. The 5' end consists of an untranslated region and a leader sequence of 30 amino acids. This sequence contains an apparent initiation Met, signal peptide, and propeptide which ends with an arginine-rich sequence that is characteristic of proteolytic processing sites found in the pro form of protein precursors. The 3' untranslated region contains two polyadenylation signals and a poly(A)sequence. RNA blot analysis of total cellular RNA from the human hepatoma cell line HepG2 revealed a message size of ∼2.5 kb. Features of the 5' and 3' sequences and the message size suggest that a separate mRNA codes for α and argues against the occurrence of a single-chain precursor form of the disulfide-linked α-λ subunit found in mature C8. Analysis of the derived amino acid sequence revealed several membrane surface seeking domains and a possible transmembrane domain. Analysis of the carbohydrate composition indicates 1 or 2 asparagine-linked but no O-linked oligosaccharide chains, a result consistent with predictions from the amino acid sequence. Most significantly, it exhibits a striking overall homology to human C9, with values of 24% on the basis of identity and 46% when conserved substitutions are allowed. As described in an accompanying report this homology also extends to the β subunit of C8

  10. Complete genome sequence of the actinobacterium Amycolatopsis japonica MG417-CF17T (=DSM 44213T) producing (S,S)-N,N′-ethylenediaminedisuccinic acid

    DEFF Research Database (Denmark)

    Stegmann, Evi; Albersmeier, Andreas; Spohn, Marius

    2014-01-01

    We report the complete genome sequence of Amycolatopsis japonica MG417-CF17T (=DSM 44213T) which was identified as the producer of (S,S)-N,N′-ethylenediaminedisuccinic acid during a screening for phospholipase C inhibitors. The genome of A. japonica MG417-CF17T consists of two replicons: the chro......We report the complete genome sequence of Amycolatopsis japonica MG417-CF17T (=DSM 44213T) which was identified as the producer of (S,S)-N,N′-ethylenediaminedisuccinic acid during a screening for phospholipase C inhibitors. The genome of A. japonica MG417-CF17T consists of two replicons...

  11. Characterization of defensin gene from abalone Haliotis discus hannai and its deduced protein

    Science.gov (United States)

    Hong, Xuguang; Sun, Xiuqin; Zheng, Minggang; Qu, Lingyun; Zan, Jindong; Zhang, Jinxing

    2008-11-01

    Defensin is one of preserved ancient host defensive materials formed in biological evolution. As a regulator and effector molecule, it is very important in animals’ acquired immune system. This paper reports the defensin gene from the mixed liver and kidney cDNA library of abalone Haliotis discus hannai Ino. Sequence analysis shows that the gene sequence of full-length cDNA encodes 42 mature peptides (including six Cys), molecular weight of 4 323 Da, and pI of 8.02. Amino acid sequence homology analysis shows that the peptides are highly similar (70% in common) to other insects defensin. Because of a typical insect-defensin structural character of mature peptide in the secondary structure, the polypeptide named Haliotis discus defensin (hd-def), a novel of antimicrobial peptides, belongs to insects defensin subfamily. The RT-PCR result of Haliotis discus defensin shows that the gene can be expressed only in the hepatopancreas by Gram-negative and positive bacteria stimulation, which is ascribed to inducible expression. Therefore, it is revealed that the Haliotis discus defensin gene expression was related to the antibacterial infection of Haliotis discus hannai Ino.

  12. Identification of a haemolysin-like peptide with antibacterial activity using the draft genome sequence of Staphylococcus epidermidis strain A487.

    Science.gov (United States)

    Al-Mahrous, Mohammed M; Jack, Ralph W; Sandiford, Stephanie K; Tagg, John R; Beatson, Scott A; Upton, Mathew

    2011-08-01

    Our interest in Staphylococcus epidermidis strain A487 was prompted by the unusual nature of its inhibitory activity in screening tests against methicillin-resistant Staphylococcus aureus isolates. The inhibitory activity was detected in deferred antagonism tests only if the agar plate was preheated for at least 35 min at ≥ 55 °C before inoculation of the indicator bacteria, this phenomenon indicating possible involvement of a heat-labile immunity agent or protease. The inhibitor was purified to homogeneity by ammonium sulphate precipitation, followed by cation-exchange and reversed-phase chromatography. Tandem MS revealed a novel peptide of molecular weight 2588.4 Da. The draft genome sequence of strain A487 was determined using 454 GS FLX technology, allowing the identification of the structural gene (hlp) encoding the mature peptide MQFITDLIKKAVDFFKGLFGNK. The deduced amino acid sequence of peptide 487 exhibited 70.8% similarity to that of a putative haemolysin from Staphylococcus cohnii. Analysis of the genome of strain A487 showed several additional inhibitor-encoding genes, including hld, the determinant for staphylococcal δ-lysin. This work indicates that potentially useful inhibitors could be overlooked in agar-based inhibitor screening programmes lacking a heat pretreatment step and also highlights the utility of draft genome sequence examination in antibacterial agent discovery. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  13. TaALMT1 promoter sequence compositions, acid tolerance, and Al tolerance in wheat cultivars and landraces from Sichuan in China.

    Science.gov (United States)

    Han, C; Dai, S F; Liu, D C; Pu, Z J; Wei, Y M; Zheng, Y L; Wen, D J; Zhao, L; Yan, Z H

    2013-11-18

    Previous genetic studies on wheat from various sources have indicated that aluminum (Al) tolerance may have originated independently in USA, Brazil, and China. Here, TaALMT1 promoter sequences of 92 landraces and cultivars from Sichuan, China, were sequenced. Five promoter types (I', II, III, IV, and V) were observed in 39 cultivars, and only three promoter types (I, II, and III) were observed in 53 landraces. Among the wheat collections worldwide, only the Chinese Spring (CS) landrace native to Sichuan, China, carried the TaALMT1 promoter type III. Besides CS, two other Sichuan-bred landraces and six cultivars with TaALMT1 promoter type III were identified in this study. In the phylogenetic tree constructed based on the TaALMT1 promoter sequences, type III formed a separate branch, which was supported by a high bootstrap value. It is likely that TaALMT1 promoter type III originated from Sichuan-bred wheat landraces of China. In addition, the landraces with promoter type I showed the lowest Al tolerance among all landraces and cultivars. Furthermore, the cultivars with promoter type IV showed better Al tolerance than landraces with promoter type II. A comparison of acid tolerance and Al tolerance between cultivars and landraces showed that the landraces had better acid tolerance than the cultivars, whereas the cultivars showed better Al tolerance than the landraces. Moreover, significant difference in Al tolerance was also observed between the cultivars raised by the National Ministry of Agriculture and by Sichuan Province. Among the landraces from different regions, those from the East showed better acid tolerance and Al tolerance than those from the South and West of Sichuan. Additional Al-tolerant and acid-tolerant wheat lines were also identified.

  14. Species of /sup 67/Ga-binding acid mucopolysaccharide in liver

    Energy Technology Data Exchange (ETDEWEB)

    Ando, A.; Ando, I.

    1985-01-01

    It was determined from measuring neutral saccharide in the structure that the principal /sup 67/Ga-binding acid mucopolysaccharide in liver was keratan sulfate and/or keratan polysulfate. On the other hand, it was clarified from the results of mucopolysaccharase treatment that the main /sup 67/Ga-binding acid mucopolysaccharide in liver was neither keratan sulfate, heparan sulfate, heparin, nor chondroitin sulfate A, B and C. Based on the present results, it was deduced that the main /sup 67/Ga-binding acid mucopolysaccharide in liver was keratan polysulfate.

  15. Complete amino acid sequences of the ribosomal proteins L25, L29 and L31 from the archaebacterium Halobacterium marismortui.

    Science.gov (United States)

    Hatakeyama, T; Kimura, M

    1988-03-15

    Ribosomal proteins were extracted from 50S ribosomal subunits of the archaebacterium Halobacterium marismortui by decreasing the concentration of Mg2+ and K+, and the proteins were separated and purified by ion-exchange column chromatography on DEAE-cellulose. Ten proteins were purified to homogeneity and three of these proteins were subjected to sequence analysis. The complete amino acid sequences of the ribosomal proteins L25, L29 and L31 were established by analyses of the peptides obtained by enzymatic digestion with trypsin, Staphylococcus aureus protease, chymotrypsin and lysylendopeptidase. Proteins L25, L29 and L31 consist of 84, 115 and 95 amino acid residues with the molecular masses of 9472 Da, 12293 Da and 10418 Da respectively. A comparison of their sequences with those of other large-ribosomal-subunit proteins from other organisms revealed that protein L25 from H. marismortui is homologous to protein L23 from Escherichia coli (34.6%), Bacillus stearothermophilus (41.8%), and tobacco chloroplasts (16.3%) as well as to protein L25 from yeast (38.0%). Proteins L29 and L31 do not appear to be homologous to any other ribosomal proteins whose structures are so far known.

  16. A protein with amino acid sequence homology to bovine insulin is present in the legume Vigna unguiculata (cowpea

    Directory of Open Access Journals (Sweden)

    Venâncio T.M.

    2003-01-01

    Full Text Available Since the discovery of bovine insulin in plants, much effort has been devoted to the characterization of these proteins and elucidation of their functions. We report here the isolation of a protein with similar molecular mass and same amino acid sequence to bovine insulin from developing fruits of cowpea (Vigna unguiculata genotype Epace 10. Insulin was measured by ELISA using an anti-human insulin antibody and was detected both in empty pods and seed coats but not in the embryo. The highest concentrations (about 0.5 ng/µg of protein of the protein were detected in seed coats at 16 and 18 days after pollination, and the values were 1.6 to 4.0 times higher than those found for isolated pods tested on any day. N-terminal amino acid sequencing of insulin was performed on the protein purified by C4-HPLC. The significance of the presence of insulin in these plant tissues is not fully understood but we speculate that it may be involved in the transport of carbohydrate to the fruit.

  17. Analysis of complete nucleotide sequences of Angolan hepatitis B virus isolates reveals the existence of a separate lineage within genotype E.

    Directory of Open Access Journals (Sweden)

    Barbara V Lago

    Full Text Available Hepatitis B virus genotype E (HBV/E is highly prevalent in Western Africa. In this work, 30 HBV/E isolates from HBsAg positive Angolans (staff and visitors of a private hospital in Luanda were genetically characterized: 16 of them were completely sequenced and the pre-S/S sequences of the remaining 14 were determined. A high proportion (12/30, 40% of subjects tested positive for both HBsAg and anti-HBs markers. Deduced amino acid sequences revealed the existence of specific substitutions and deletions in the B- and T-cell epitopes of the surface antigen (pre-S1- and pre-S2 regions of the virus isolates derived from 8/12 individuals with concurrent HBsAg/anti-HBs. Phylogenetic analysis performed with 231 HBV/E full-length sequences, including 16 from this study, showed that all isolates from Angola, Namibia and the Democratic Republic of Congo (n = 28 clustered in a separate lineage, divergent from the HBV/E isolates from nine other African countries, namely Cameroon, Central African Republic, Côte d'Ivoire, Ghana, Guinea, Madagascar, Niger, Nigeria and Sudan, with a Bayesian posterior probability of 1. Five specific mutations, namely small S protein T57I, polymerase Q177H, G245W and M612L, and X protein V30L, were observed in 79-96% of the isolates of the separate lineage, compared to a frequency of 0-12% among the other HBV/E African isolates.

  18. Filovirus Glycoprotein Sequence, Structure and Virulence

    OpenAIRE

    Phillips, J. C.

    2014-01-01

    Leading Ebola subtypes exhibit a wide mortality range, here explained at the molecular level by using fractal hydropathic scaling of amino acid sequences based on protein self-organized criticality. Specific hydrophobic features in the hydrophilic mucin-like domain suffice to account for the wide mortality range. Significance statement: Ebola virus is spreading rapidly in Africa. The connection between protein amino acid sequence and mortality is identified here.

  19. Characterization of the alkaline/neutral invertase gene in Dendrobium officinale and its relationship with polysaccharide accumulation.

    Science.gov (United States)

    Gao, F; Cao, X F; Si, J P; Chen, Z Y; Duan, C L

    2016-05-06

    Dendrobium officinale is one of the most well-known traditional Chinese medicines, and polysaccharide is its main active ingredient. Many studies have investigated the synthesis and accumulation mechanisms of polysaccharide, but until recently, little was known about the molecular mechanism of how polysaccharide is synthesized because no related genes have been cloned. In this study, we cloned an alkaline/neutral invertase gene from D. officinale (DoNI) by the rapid amplification of cDNA ends (RACE) method. DoNI was 2231 bp long and contained an open reading frame that predicted a 62.8-kDa polypeptide with 554-amino acid residues. An alkaline/neutral invertase conserved domain was predicted from this deduced amino acid sequence, and DoNI had a similar deduced amino acid sequence to Setaria italica and Oryza brachyantha. We also found that DoNI expression in different tissues was closely related to DoNI activity, and more importantly, polysaccharide level. Our results indicate that DoNI is associated with polysaccharide accumulation in D. officinale.

  20. Molecular cloning of a novel GSK3/shaggy-like gene from Triticum ...

    African Journals Online (AJOL)

    The deduced amino acid sequence showed a high homology with shaggy-like kinases from Triticum aestivum, Zea mays, Trifolium repens, Nicotine tabacum, Medicago sativa and Arabidopsis thaliana; therefore, the gene was named TmGSK1 (Triticum monococcum Glycogen Synthase Kinase 1,GenBank Accession No.

  1. Molecular Cloning and Pharmacological Properties of an Acidic PLA2 from Bothrops pauloensis Snake Venom

    Science.gov (United States)

    Ferreira, Francis Barbosa; Gomes, Mário Sérgio Rocha; Naves de Souza, Dayane Lorena; Gimenes, Sarah Natalie Cirilo; Castanheira, Letícia Eulalio; Borges, Márcia Helena; Rodrigues, Renata Santos; Yoneyama, Kelly Aparecida Geraldo; Homsi Brandeburgo, Maria Inês; Rodrigues, Veridiana M.

    2013-01-01

    In this work, we describe the molecular cloning and pharmacological properties of an acidic phospholipase A2 (PLA2) isolated from Bothrops pauloensis snake venom. This enzyme, denominated BpPLA2-TXI, was purified by four chromatographic steps and represents 2.4% of the total snake venom protein content. BpPLA2-TXI is a monomeric protein with a molecular mass of 13.6 kDa, as demonstrated by Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF) analysis and its theoretical isoelectric point was 4.98. BpPLA2-TXI was catalytically active and showed some pharmacological effects such as inhibition of platelet aggregation induced by collagen or ADP and also induced edema and myotoxicity. BpPLA2-TXI displayed low cytotoxicity on TG-180 (CCRF S 180 II) and Ovarian Carcinoma (OVCAR-3), whereas no cytotoxicity was found in regard to MEF (Mouse Embryonic Fibroblast) and Sarcoma 180 (TIB-66). The N-terminal sequence of forty-eight amino acid residues was determined by Edman degradation. In addition, the complete primary structure of 122 amino acids was deduced by cDNA from the total RNA of the venom gland using specific primers, and it was significantly similar to other acidic D49 PLA2s. The phylogenetic analyses showed that BpPLA2-TXI forms a group with other acidic D49 PLA2s from the gender Bothrops, which are characterized by a catalytic activity associated with anti-platelet effects. PMID:24304676

  2. Brain cDNA clone for human cholinesterase

    International Nuclear Information System (INIS)

    McTiernan, C.; Adkins, S.; Chatonnet, A.; Vaughan, T.A.; Bartels, C.F.; Kott, M.; Rosenberry, T.L.; La Du, B.N.; Lockridge, O.

    1987-01-01

    A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase

  3. Extraction of antimony from nitric acid solutions using tributyl phosphate. II. Tributyl phosphate-antimony(V)-nitric acid system

    International Nuclear Information System (INIS)

    Lakaev, V.S.; Smelov, V.S.

    1989-01-01

    The extraction of pentavalent antimony from nitric acid solutions using tributyl phosphate has been investigated. A possible mechanism for the extraction of antimony(V) has been determined and the (pre)concentration constant for the process has been calculated. The composition of the extracted antimony(V) complex has been deduced. A negative effect of temperature on the distribution coefficient for antimony(V) has also been demonstrated

  4. Ruthenium Hydride/Brønsted Acid-Catalyzed Tandem Isomerization/N-Acyliminium Cyclization Sequence for the Synthesis of Tetrahydro-β-carbolines

    DEFF Research Database (Denmark)

    Hansen, Casper Lykke; Clausen, Janie Regitse Waël; Ohm, Ragnhild Gaard

    2013-01-01

    This paper describes an efficient tandem sequence for the synthesis of 1,2,3,4-tetrahydro-β-carbolines (THBCs) relying on a ruthenium hydride/Brønsted acid- catalyzed isomerization of allylic amides to N-acyliminium ion intermediates which are trapped by a tethered indolenucleophile. The methodol...... the Suzuki cross-coupling reaction to the isomerization/N-acyliminium cyclization sequence. Finally, diastereo- and enantioselective versions of the title reaction have been examined using substrate control (with dr >15: 1) and asymmetric catalysis (ee up to 57%), respectively...

  5. Cell density signal protein suitable for treatment of connective tissue injuries and defects

    Science.gov (United States)

    Schwarz, Richard I.

    2002-08-13

    Identification, isolation and partial sequencing of a cell density protein produced by fibroblastic cells. The cell density signal protein comprising a 14 amino acid peptide or a fragment, variant, mutant or analog thereof, the deduced cDNA sequence from the 14 amino acid peptide, a recombinant protein, protein and peptide-specific antibodies, and the use of the peptide and peptide-specific antibodies as therapeutic agents for regulation of cell differentiation and proliferation. A method for treatment and repair of connective tissue and tendon injuries, collagen deficiency, and connective tissue defects.

  6. Isolation and characterization of MUC15, a novel cell membrane-associated mucin

    DEFF Research Database (Denmark)

    Pallesen, Lone Tjener; Berglund, Lars; Rasmussen, Lone Kjær

    2002-01-01

    The present work reports isolation and characterization of a highly glycosylated protein from bovine milk fat globule membranes, known as PAS III. Partial amino-acid sequencing of the purified protein allowed construction of degenerate oligonucleotide primers, enabling isolation of a full-length c......-like protein was named MUC15 by appointment of the HUGO Gene Nomenclature Committee. The deduced amino-acid sequences of human and bovine MUC15 demonstrated structural hallmarks characteristic for other membrane-bound mucins, such as a serine, threonine, and proline-rich extracellular region with several...

  7. Phosphoribosylpyrophosphate synthetase of Escherichia coli. Properties of the purified enzyme and primary structure of the prs gene

    DEFF Research Database (Denmark)

    Hove-Jensen, Bjarne; Harlow, Kenneth W.; King, Cheryl J.

    1986-01-01

    of ADP. The nucleotide sequence of the E. coli prs gene has been determined and the coding segment established. The deduced amino acid sequence of P-Rib-PP synthetase contained 314 amino acid residues and the molecular weight was calculated as 34,060. The initiation site of transcription was determined......Phosphoribosylpyrophosphate (P-Rib-PP) synthetase of Escherichia coli has been purified to near homogeneity from a strain harboring the prs gene, encoding P-Rib-PP synthetase, on a multicopy plasmid. Analysis of the enzyme showed that it required inorganic phosphate for activity and for stability...

  8. Cloning, Sequencing, and Expression of the Pyruvate Carboxylase Gene in Lactococcus lactis subsp. lactis C2†

    OpenAIRE

    Wang, H.; O'Sullivan, D. J.; Baldwin, K. A.; McKay, L. L.

    2000-01-01

    A functional pyc gene was isolated from Lactococcus lactis subsp. lactis C2 and was found to complement a Pyc defect in L. lactis KB4. The deduced lactococcal Pyc protein was highly homologous to Pyc sequences of other bacteria. The pyc gene was also detected in Lactococcus lactis subsp. cremoris and L. lactis subsp. lactis bv. diacetylactis strains.

  9. CONSISTENT USE OF THE KALMAN FILTER IN CHEMICAL TRANSPORT MODELS (CTMS) FOR DEDUCING EMISSIONS

    Science.gov (United States)

    Past research has shown that emissions can be deduced using observed concentrations of a chemical, a Chemical Transport Model (CTM), and the Kalman filter in an inverse modeling application. An expression was derived for the relationship between the "observable" (i.e., the con...

  10. Draft Genome Sequence of a Clostridium botulinum Isolate from Water Used for Cooling at a Plant Producing Low-Acid Canned Foods.

    Science.gov (United States)

    Basavanna, Uma; Gonzalez-Escalona, Narjol; Timme, Ruth; Datta, Shomik; Schoen, Brianna; Brown, Eric W; Zink, Donald; Sharma, Shashi K

    2013-01-01

    Clostridium botulinum is a pathogen of concern for low-acid canned foods. Here we report draft genomes of a neurotoxin-producing C. botulinum strain isolated from water samples used for cooling low-acid canned foods at a canning facility. The genome sequence confirmed that this strain belonged to C. botulinum serotype B1, albeit with major differences, including thousands of unique single nucleotide polymorphisms (SNPs) compared to other genomes of the same serotype.

  11. Draft Genome Sequence of a Clostridium botulinum Isolate from Water Used for Cooling at a Plant Producing Low-Acid Canned Foods

    OpenAIRE

    Basavanna, Uma; Gonzalez-Escalona, Narjol; Timme, Ruth; Datta, Shomik; Schoen, Brianna; Brown, Eric W.; Zink, Donald; Sharma, Shashi K.

    2013-01-01

    Clostridium botulinum is a pathogen of concern for low-acid canned foods. Here we report draft genomes of a neurotoxin-producing C.?botulinum strain isolated from water samples used for cooling low-acid canned foods at a canning facility. The genome sequence confirmed that this strain belonged to C.?botulinum serotype B1, albeit with major differences, including thousands of unique single nucleotide polymorphisms (SNPs) compared to other genomes of the same serotype.

  12. Cloning of the cDNA and gene for a human D2 dopamine receptor

    International Nuclear Information System (INIS)

    Grady, D.K.; Makam, H.; Stofko, R.E.; Bunzow, J.R.; Civelli, O.; Marchionni, M.A.; Alfano, M.; Frothingham, L.; Fischer, J.B.; Burke-Howie, K.J.; Server, A.C.

    1989-01-01

    A clone encoding a human D 2 dopamine receptor was isolated from a pituitary cDNA library and sequenced. The deduced protein sequence is 96% identical with that of the cloned rat receptor with one major difference: the human receptor contains an additional 29 amino acids in its putative third cytoplasmic loop. Southern blotting demonstrated the presence of only one human D 2 receptor gene. Two overlapping phage containing the gene were isolated and characterized. DNA sequence analysis of these clones showed that the coding sequence is interrupted by six introns and that the additional amino acids present in the human pituitary receptor are encoded by a single exon of 87 base pairs. The involvement of this sequence in alternative splicing and its biological significance are discussed

  13. A Single Electrochemical Probe Used for Analysis of Multiple Nucleic Acid Sequences

    Science.gov (United States)

    Mills, Dawn M.; Calvo-Marzal, Percy; Pinzon, Jeffer M.; Armas, Stephanie; Kolpashchikov, Dmitry M.; Chumbimuni-Torres, Karin Y.

    2017-01-01

    Electrochemical hybridization sensors have been explored extensively for analysis of specific nucleic acids. However, commercialization of the platform is hindered by the need for attachment of separate oligonucleotide probes complementary to a RNA or DNA target to an electrode’s surface. Here we demonstrate that a single probe can be used to analyze several nucleic acid targets with high selectivity and low cost. The universal electrochemical four-way junction (4J)-forming (UE4J) sensor consists of a universal DNA stem-loop (USL) probe attached to the electrode’s surface and two adaptor strands (m and f) which hybridize to the USL probe and the analyte to form a 4J associate. The m adaptor strand was conjugated with a methylene blue redox marker for signal ON sensing and monitored using square wave voltammetry. We demonstrated that a single sensor can be used for detection of several different DNA/RNA sequences and can be regenerated in 30 seconds by a simple water rinse. The UE4J sensor enables a high selectivity by recognition of a single base substitution, even at room temperature. The UE4J sensor opens a venue for a re-useable universal platform that can be adopted at low cost for the analysis of DNA or RNA targets. PMID:29371782

  14. Amino Acids Sequence Based in Silico Analysis of RuBisCO (Ribulose-1,5 Bisphosphate Carboxylase Oxygenase Proteins in Some Carthamus L. ssp.

    Directory of Open Access Journals (Sweden)

    Emre SEVİNDİK

    2017-06-01

    Full Text Available RuBisCO is an important enzyme for plants to photosynthesize and balance carbon dioxide in the atmosphere. This study aimed to perform sequence, physicochemical, phylogenetic and 3D (three-dimensional comparative analyses of RuBisCO proteins in the Carthamus ssp. using various bioinformatics tools. The sequence lengths of the RuBisCO proteins were between 166 and 477 amino acids, with an average length of 411.8 amino acids. Their molecular weights (Mw ranged from 18711.47 to 52843.09 Da; the most acidic and basic protein sequences were detected in C. tinctorius (pI = 5.99 and in C. tenuis (pI = 6.92, respectively. The extinction coefficients of RuBisCO proteins at 280 nm ranged from 17,670 to 69,830 M-1 cm-1, the instability index (II values for RuBisCO proteins ranged from 33.31 to 39.39, while the GRAVY values of RuBisCO proteins ranged from -0.313 to -0.250. The most abundant amino acid in the RuBisCO protein was Gly (9.7%, while the least amino acid ratio was Trp (1.6 %. The putative phosphorylation sites of RuBisCO proteins were determined by NetPhos 2.0. Phylogenetic analysis revealed that RuBisCO proteins formed two main clades. A RAMPAGE analysis revealed that 96.3%-97.6% of residues were located in the favoured region of RuBisCO proteins. To predict the three dimensional (3D structure of the RuBisCO proteins PyMOL was used. The results of the current study provide insights into fundamental characteristic of RuBisCO proteins in Carthamus ssp.

  15. Effects of Mutations and Ligands on the Thermostability of the l-Arginine/Agmatine Antiporter AdiC and Deduced Insights into Ligand-Binding of Human l-Type Amino Acid Transporters

    Directory of Open Access Journals (Sweden)

    Hüseyin Ilgü

    2018-03-01

    Full Text Available The l-arginine/agmatine transporter AdiC is a prokaryotic member of the SLC7 family, which enables pathogenic enterobacteria to survive the extremely acidic gastric environment. Wild-type AdiC from Escherichia coli, as well as its previously reported point mutants N22A and S26A, were overexpressed homologously and purified to homogeneity. A size-exclusion chromatography-based thermostability assay was used to determine the melting temperatures (Tms of the purified AdiC variants in the absence and presence of the selected ligands l-arginine (Arg, agmatine, l-arginine methyl ester, and l-arginine amide. The resulting Tms indicated stabilization of AdiC variants upon ligand binding, in which Tms and ligand binding affinities correlated positively. Considering results from this and previous studies, we revisited the role of AdiC residue S26 in Arg binding and proposed interactions of the α-carboxylate group of Arg exclusively with amide groups of the AdiC backbone. In the context of substrate binding in the human SLC7 family member l-type amino acid transporter-1 (LAT1; SLC7A5, an analogous role of S66 in LAT1 to S26 in AdiC is discussed based on homology modeling and amino acid sequence analysis. Finally, we propose a binding mechanism for l-amino acid substrates to LATs from the SLC7 family.

  16. Identification and expression analysis of two pro-inflammatory cytokines, TNF-α and IL-8, in cobia (Rachycentron canadum L.) in response to Streptococcus dysgalactiae infection.

    Science.gov (United States)

    Nguyen, Thuy Thi Thu; Nguyen, Hai Trong; Wang, Pei-Chyi; Chen, Shih-Chu

    2017-08-01

    Tumor necrosis factor-alpha (TNF-α) and interleukin-8 (IL-8/CXCL8) play pivotal roles in mediating inflammatory responses to invading pathogens. In this study, we identified and analyzed expressions of cobia TNF-α and IL-8 during Streptococcus dysgalactiae infection. The cloned cDNA transcript of cobia TNF-α comprised of 1281 base pairs (bp), with a 774 bp open reading frame (ORF) encoding 257 amino acids. The deduced amino acid sequence of cobia TNF-α showed a close relationship (84% similarity) with TNF-α of yellowtail amberjack. The cloned IL-8 cDNA sequence was 828 bp long, including a 300-bp ORF encoding 99 amino acids. The deduced amino acid sequence of cobia IL-8 shared 90% identity with IL-8 of striped trumpeter. Cobia challenged with a virulent S. dysgalactiae strain displayed an early significant up-regulation of TNF-α and IL-8 in head kidney, liver, and spleen. Notably, IL-8 expression level increased dramatically in the liver at the severe stage of infection (72 h). In conclusion, a better understanding of TNF-α and IL-8 allows more detailed investigation of immune responses in cobia and furthers study on controlling the infectious disease caused by S. dysgalactiae. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Cloning and characterisation of a glucoamylase gene (GlaM) from dimorphic zygomycete Mucor circinelloides

    DEFF Research Database (Denmark)

    Houghton-Larsen, J.; Pedersen, Per Amstrup

    2003-01-01

    This article reports a novel strategy for the cloning of glucoamylase genes using conserved sequences and semi-nested PCR and its application in cloning the GlaM glucoamylase gene and cDNA from the dimorphic zygomycete Mucor circinelloides. The deduced 609-amino-acid enzyme (including signal...

  18. A soluble 3-hydroxy-3-methylglutaryl-CoA reductase in the protozoan Trypanosoma cruzi

    DEFF Research Database (Denmark)

    Pena Diaz, Javier; Montalvetti, A; Camacho, A

    1997-01-01

    of the genes described from eukaryotic organisms and the deduced amino acid sequence could be aligned with the C-terminal half of animal and plant reductases exhibiting pronounced similarity to other eukaryotic counterparts. Further examination of the 5' flanking region by cDNA analysis and establishment...

  19. Cloning, chromosomal localization, and functional expression of the alpha 1 subunit of the L-type voltage-dependent calcium channel from normal human heart

    NARCIS (Netherlands)

    Schultz, D; Mikala, G; Yatani, A; Engle, D B; Iles, D E; Segers, B; Sinke, R J; Weghuis, D O; Klöckner, U; Wakamori, M

    1993-01-01

    A unique structural variant of the cardiac L-type voltage-dependent calcium channel alpha 1 subunit cDNA was isolated from libraries derived from normal human heart mRNA. The deduced amino acid sequence shows significant homology to other calcium channel alpha 1 subunits. However, differences from

  20. Discrepancy between molecular structure and ligand selectivity of a testicular follicle-stimulating hormone receptor of the African catfish (Clarias gariepinus)

    NARCIS (Netherlands)

    Bogerd, J.; Blomenröhr, M.; Andersson, E.; van der Putten, H.; Tensen, C.P.; Vischer, H F; Granneman, Joke C M; Janssen-Dommerholt, C; Goos, H.J.; Schulz, Rüdiger W

    A putative FSH receptor (FSH-R) cDNA was cloned from African catfish testis. Alignment of the deduced amino acid sequence with other (putative) glycoprotein hormone receptors and analysis of the African catfish gene indicated that the cloned receptor belonged to the FSH receptor subfamily. Catfish

  1. Phylogeny of fungal hemoglobins and expression analysis of the Aspergillus oryzae flavohemoglobin gene fhbA during hyphal growth

    NARCIS (Netherlands)

    Biesebeke, R. te; Levasseur, A.; Boussier, A.; Record, E.; Hondel, C.A.M.J.J. van den; Punt, P.J.

    2010-01-01

    The fhbA genes encoding putative flavohemoglobins (FHb) from Aspergillus niger and Aspergillus oryzae were isolated. Comparison of the deduced amino acid sequence of the A. niger fhbA gene and other putative filamentous fungal FHb-encoding genes to that of Ralstonia eutropha shows an overall

  2. Exome sequencing and SNP analysis detect novel compound heterozygosity in fatty acid hydroxylase-associated neurodegeneration

    Science.gov (United States)

    Pierson, Tyler Mark; Simeonov, Dimitre R; Sincan, Murat; Adams, David A; Markello, Thomas; Golas, Gretchen; Fuentes-Fajardo, Karin; Hansen, Nancy F; Cherukuri, Praveen F; Cruz, Pedro; Blackstone, Craig; Tifft, Cynthia; Boerkoel, Cornelius F; Gahl, William A

    2012-01-01

    Fatty acid hydroxylase-associated neurodegeneration due to fatty acid 2-hydroxylase deficiency presents with a wide range of phenotypes including spastic paraplegia, leukodystrophy, and/or brain iron deposition. All previously described families with this disorder were consanguineous, with homozygous mutations in the probands. We describe a 10-year-old male, from a non-consanguineous family, with progressive spastic paraplegia, dystonia, ataxia, and cognitive decline associated with a sural axonal neuropathy. The use of high-throughput sequencing techniques combined with SNP array analyses revealed a novel paternally derived missense mutation and an overlapping novel maternally derived ∼28-kb genomic deletion in FA2H. This patient provides further insight into the consistent features of this disorder and expands our understanding of its phenotypic presentation. The presence of a sural nerve axonal neuropathy had not been previously associated with this disorder and so may extend the phenotype. PMID:22146942

  3. Design of Tail-Clamp Peptide Nucleic Acid Tethered with Azobenzene Linker for Sequence-Specific Detection of Homopurine DNA

    Directory of Open Access Journals (Sweden)

    Shinjiro Sawada

    2017-10-01

    Full Text Available DNA carries genetic information in its sequence of bases. Synthetic oligonucleotides that can sequence-specifically recognize a target gene sequence are a useful tool for regulating gene expression or detecting target genes. Among the many synthetic oligonucleotides, tail-clamp peptide nucleic acid (TC-PNA offers advantages since it has two homopyrimidine PNA strands connected via a flexible ethylene glycol-type linker that can recognize complementary homopurine sequences via Watson-Crick and Hoogsteen base pairings and form thermally-stable PNA/PNA/DNA triplex structures. Here, we synthesized a series of TC-PNAs that can possess different lengths of azobenzene-containing linkers and studied their binding behaviours to homopurine single-stranded DNA. Introduction of azobenzene at the N-terminus amine of PNA increased the thermal stability of PNA-DNA duplexes. Further extension of the homopyrimidine PNA strand at the N-terminus of PNA-AZO further increased the binding stability of the PNA/DNA/PNA triplex to the target homopurine sequence; however, it induced TC-PNA/DNA/TC-PNA complex formation. Among these TC-PNAs, 9W5H-C4-AZO consisting of nine Watson-Crick bases and five Hoogsteen bases tethered with a beta-alanine conjugated azobenzene linker gave a stable 1:1 TC-PNA/ssDNA complex and exhibited good mismatch recognition. Our design for TC-PNA-AZO can be utilized for detecting homopurine sequences in various genes.

  4. Cloning of the cDNA for human 12-lipoxygenase

    International Nuclear Information System (INIS)

    Izumi, T.; Hoshiko, S.; Radmark, O.; Samuelsson, B.

    1990-01-01

    A full-length cDNA clone encoding 12-lipoxygenase was isolated from a human platelet cDNA library by using a cDNA for human reticulocyte 15-lipoxygenase as probe for the initial screening. The cDNA had an open reading frame encoding 662 amino acid residues with a calculated molecular weight of 75,590. Three independent clones revealed minor heterogeneities in their DNA sequences. Thus, in three positions of the deduced amino acid sequence, there is a choice between two different amino acids. The deduced sequence from the clone plT3 showed 65% identity with human reticulocyte 15-lipoxygenase and 42% identity with human leukocyte 5-lipoxygenase. The 12-lipoxygenase cDNA recognized a 3.0-kilobase mRNA species in platelets and human erythroleukemia cells (HEL cells). Phorbol 12-tetradecanoyl 13-acetate induced megakaryocytic differentiation of HEL cells and 12-lipoxygenase activity and increased mRNA for 12-lipoxygenase. The identity of the cloned 12-lipoxygenase was assured by expression in a mammalian cell line (COS cells). Human platelet 12-lipoxygenase has been difficult to purify to homogeneity. The cloning of this cDNA will increase the possibilities to elucidate the structure and function of this enzyme

  5. Genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia

    Directory of Open Access Journals (Sweden)

    Anastasiia Kovaliova

    2017-03-01

    Full Text Available Here we report the draft genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia. The draft genome has a size of 4.9 Mb and encodes multiple K+-transporters and proton-consuming decarboxylases. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain DV clusters together with the acid-tolerant Desulfovibrio sp. TomC and Desulfovibrio magneticus. The draft genome sequence and annotation have been deposited at GenBank under the accession number MLBG00000000.

  6. Molecular cloning, sequence analysis and homology modeling of the first caudata amphibian antifreeze-like protein in axolotl (Ambystoma mexicanum).

    Science.gov (United States)

    Zhang, Songyan; Gao, Jiuxiang; Lu, Yiling; Cai, Shasha; Qiao, Xue; Wang, Yipeng; Yu, Haining

    2013-08-01

    Antifreeze proteins (AFPs) refer to a class of polypeptides that are produced by certain vertebrates, plants, fungi, and bacteria and which permit their survival in subzero environments. In this study, we report the molecular cloning, sequence analysis and three-dimensional structure of the axolotl antifreeze-like protein (AFLP) by homology modeling of the first caudate amphibian AFLP. We constructed a full-length spleen cDNA library of axolotl (Ambystoma mexicanum). An EST having highest similarity (∼42%) with freeze-responsive liver protein Li16 from Rana sylvatica was identified, and the full-length cDNA was subsequently obtained by RACE-PCR. The axolotl antifreeze-like protein sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 93 amino acids. The calculated molecular mass and the theoretical isoelectric point (pl) of this mature protein were 10128.6 Da and 8.97, respectively. The molecular characterization of this gene and its deduced protein were further performed by detailed bioinformatics analysis. The three-dimensional structure of current AFLP was predicted by homology modeling, and the conserved residues required for functionality were identified. The homology model constructed could be of use for effective drug design. This is the first report of an antifreeze-like protein identified from a caudate amphibian.

  7. EGNAS: an exhaustive DNA sequence design algorithm

    Directory of Open Access Journals (Sweden)

    Kick Alfred

    2012-06-01

    Full Text Available Abstract Background The molecular recognition based on the complementary base pairing of deoxyribonucleic acid (DNA is the fundamental principle in the fields of genetics, DNA nanotechnology and DNA computing. We present an exhaustive DNA sequence design algorithm that allows to generate sets containing a maximum number of sequences with defined properties. EGNAS (Exhaustive Generation of Nucleic Acid Sequences offers the possibility of controlling both interstrand and intrastrand properties. The guanine-cytosine content can be adjusted. Sequences can be forced to start and end with guanine or cytosine. This option reduces the risk of “fraying” of DNA strands. It is possible to limit cross hybridizations of a defined length, and to adjust the uniqueness of sequences. Self-complementarity and hairpin structures of certain length can be avoided. Sequences and subsequences can optionally be forbidden. Furthermore, sequences can be designed to have minimum interactions with predefined strands and neighboring sequences. Results The algorithm is realized in a C++ program. TAG sequences can be generated and combined with primers for single-base extension reactions, which were described for multiplexed genotyping of single nucleotide polymorphisms. Thereby, possible foldback through intrastrand interaction of TAG-primer pairs can be limited. The design of sequences for specific attachment of molecular constructs to DNA origami is presented. Conclusions We developed a new software tool called EGNAS for the design of unique nucleic acid sequences. The presented exhaustive algorithm allows to generate greater sets of sequences than with previous software and equal constraints. EGNAS is freely available for noncommercial use at http://www.chm.tu-dresden.de/pc6/EGNAS.

  8. Molecular characterization of genes of Pseudomonas sp. strain HR199 involved in bioconversion of vanillin to protocatechuate.

    Science.gov (United States)

    Priefert, H; Rabenhorst, J; Steinbüchel, A

    1997-01-01

    The gene loci vdh, vanA, and vanB, which are involved in the bioconversion of vanillin to protocatechuate by Pseudomonas sp. strain HR199 (DSM 7063), were identified as the structural genes of a novel vanillin dehydrogenase (vdh) and the two subunits of a vanillate demethylase (vanA and vanB), respectively. These genes were localized on an EcoRI fragment (E230), which was cloned from a Pseudomonas sp. strain HR199 genomic library in the cosmid pVK100. The vdh gene was identified on a subfragment (HE35) of E230, and the vanA and vanB genes were localized on a different subfragment (H110) of E230. The nucleotide sequences of fragment HE35 and part of fragment H110 were determined, revealing open reading frames of 1062, 951, and 1446 bp, representing vanA, vanB, and vdh, respectively. The vdh gene was organized in one operon together with a fourth open reading frame (ORF2), of 735 bp, which was located upstream of vdh. The deduced amino acid sequences of vanA and vanB exhibited 78.8 and 62.1% amino acid identity, respectively, to the corresponding gene products from Pseudomonas sp. strain ATCC 19151 (F. Brunel and J. Davison, J. Bacteriol. 170:4924-4930, 1988). The deduced amino acid sequence of the vdh gene exhibited up to 35.3% amino acid identity to aldehyde dehydrogenases from different sources. The deduced amino acid sequence of ORF2 exhibited up to 28.4% amino acid identity to those of enoyl coenzyme A hydratases. Escherichia coli strains harboring fragment E230 cloned in pBluescript SK- converted vanillin to protocatechuate via vanillate, indicating the functional expression of vdh, vanA, and vanB in E. coli. High expression of vdh in E. coli was achieved with HE35 cloned in pBluescript SK-. The resulting recombinant strains converted vanillin to vanillate at a rate of up to 0.3 micromol per min per ml of culture. Transfer of vanA, vanB, and vdh to Alcaligenes eutrophus and to different Pseudomonas strains, which were unable to utilize vanillin or vanillate as

  9. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    OpenAIRE

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Fran?oise; Loux, Valentin; Vidal, Marie; Passot, St?phanie; B?al, Catherine; Layec, S?verine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  10. Prunus necrotic ringspot ilarvirus: nucleotide sequence of RNA3 and the relationship to other ilarviruses based on coat protein comparison.

    Science.gov (United States)

    Guo, D; Maiss, E; Adam, G; Casper, R

    1995-05-01

    The RNA3 of prunus necrotic ringspot ilarvirus (PNRSV) has been cloned and its entire sequence determined. The RNA3 consists of 1943 nucleotides (nt) and possesses two large open reading frames (ORFs) separated by an intergenic region of 74 nt. The 5' proximal ORF is 855 nt in length and codes for a protein of molecular mass 31.4 kDa which has homologies with the putative movement protein of other members of the Bromoviridae. The 3' proximal ORF of 675 nt is the cistron for the coat protein (CP) and has a predicted molecular mass of 24.9 kDa. The sequence of the 3' non-coding region (NCR) of PNRSV RNA3 showed a high degree of similarity with those of tobacco streak virus (TSV), prune dwarf virus (PDV), apple mosaic virus (ApMV) and also alfalfa mosaic virus (AIMV). In addition it contained potential stem-loop structures with interspersed AUGC motifs characteristic for ilar- and alfamoviruses. This conserved primary and secondary structure in all 3' NCRs may be responsible for the interaction with homologous and heterologous CPs and subsequent activation of genome replication. The CP gene of an ApMV isolate (ApMV-G) of 657 nt has also been cloned and sequenced. Although ApMV and PNRSV have a distant serological relationship, the deduced amino acid sequences of their CPs have an identity of only 51.8%. The N termini of PNRSV and ApMV CPs have in common a zinc-finger motif and the potential to form an amphipathic helix.

  11. Fine mapping and identification of a candidate gene for the barley Un8 true loose smut resistance gene.

    Science.gov (United States)

    Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D

    2015-07-01

    The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tilling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.

  12. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    Directory of Open Access Journals (Sweden)

    Xiaoyu Wang

    Full Text Available Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  13. Amino acid sequence surrounding the chondroitin sulfate attachment site of thrombomodulin regulates chondroitin polymerization.

    Science.gov (United States)

    Izumikawa, Tomomi; Kitagawa, Hiroshi

    2015-05-01

    Thrombomodulin (TM) is a cell-surface glycoprotein and a critical mediator of endothelial anticoagulant function. TM exists as both a chondroitin sulfate (CS) proteoglycan (PG) form and a non-PG form lacking a CS chain (α-TM); therefore, TM can be described as a part-time PG. Previously, we reported that α-TM bears an immature, truncated linkage tetrasaccharide structure (GlcAβ1-3Galβ1-3Galβ1-4Xyl). However, the biosynthetic mechanism to generate part-time PGs remains unclear. In this study, we used several mutants to demonstrate that the amino acid sequence surrounding the CS attachment site influences the efficiency of chondroitin polymerization. In particular, the presence of acidic residues surrounding the CS attachment site was indispensable for the elongation of CS. In addition, mutants defective in CS elongation did not exhibit anti-coagulant activity, as in the case with α-TM. Together, these data support a model for CS chain assembly in which specific core protein determinants are recognized by a key biosynthetic enzyme involved in chondroitin polymerization. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. EGVII endoglucanase and nucleic acids encoding the same

    Science.gov (United States)

    Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

    2009-05-05

    The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.

  15. Sequencing and promoter analysis of the nifENXorf3orf5fdxAnifQ operon from Azospirillum brasilense Sp7

    Directory of Open Access Journals (Sweden)

    Potrich D.P.

    2001-01-01

    Full Text Available A 40-kb DNA region containing the major cluster of nif genes has been isolated from the Azospirillum brasilense Sp7 genome. In this region three nif operons have been identified: nifHDKorf1Y, nifENXorf3orf5fdxAnifQ and orf2nifUSVorf4. The operons containing nifENX and nifUSV genes are separated from the structural nifHDKorf1Y operon by about 5 kb and 10 kb, respectively. The present study shows the sequence analysis of the 6045-bp DNA region containing the nifENX genes. The deduced amino acid sequences from the open reading frames were compared to the nif gene products of other diazotrophic bacteria and indicate the presence of seven ORFs, all reading in the same direction as that of the nifHDKorf1Y operon. Consensus sigma54 and NifA-binding sites are present only in the promoter region upstream of the nifE gene. This promoter is activated by NifA protein and is approximately two-times less active than the nifH promoter, as indicated by the ß-galactosidase assays. This result suggests the differential expression of the nif genes and their respective products in Azospirillum.

  16. Effects of Mutations and Ligands on the Thermostability of the l-Arginine/Agmatine Antiporter AdiC and Deduced Insights into Ligand-Binding of Human l-Type Amino Acid Transporters.

    Science.gov (United States)

    Ilgü, Hüseyin; Jeckelmann, Jean-Marc; Colas, Claire; Ucurum, Zöhre; Schlessinger, Avner; Fotiadis, Dimitrios

    2018-03-20

    The l-arginine/agmatine transporter AdiC is a prokaryotic member of the SLC7 family, which enables pathogenic enterobacteria to survive the extremely acidic gastric environment. Wild-type AdiC from Escherichia coli, as well as its previously reported point mutants N22A and S26A, were overexpressed homologously and purified to homogeneity. A size-exclusion chromatography-based thermostability assay was used to determine the melting temperatures ( T m s) of the purified AdiC variants in the absence and presence of the selected ligands l-arginine (Arg), agmatine, l-arginine methyl ester, and l-arginine amide. The resulting T m s indicated stabilization of AdiC variants upon ligand binding, in which T m s and ligand binding affinities correlated positively. Considering results from this and previous studies, we revisited the role of AdiC residue S26 in Arg binding and proposed interactions of the α-carboxylate group of Arg exclusively with amide groups of the AdiC backbone. In the context of substrate binding in the human SLC7 family member l-type amino acid transporter-1 (LAT1; SLC7A5), an analogous role of S66 in LAT1 to S26 in AdiC is discussed based on homology modeling and amino acid sequence analysis. Finally, we propose a binding mechanism for l-amino acid substrates to LATs from the SLC7 family.

  17. Polarography of hexavalent molybdenum in hypophosphorous acid solutions

    International Nuclear Information System (INIS)

    Hassan, A.; El-Shatory, S.A.; Azab, H.A.

    1988-01-01

    The polarographic behaviour and determination of Mo(6) in hypophosphorous acid solutions of concentrations varying from 0,1 to 5,0 moll -1 and T = 25±0,1 0 C have been investigated. It was shown that reduction of MoO 4 2- takes place along a single or two waves depending upon the acid concentration. Microcoulometric experiments have been performed at the limiting region of the different waves obtained at different acid concentrations. A scheme for the mechanism of reduction occuring at the DME has been deduced. A method for analytical determination of Mo(6) on both the micro- and macro-scales in hypophosphorous acid solutions has been reported. Analysis of a binary mixture Mo(6)/Cd(2) and a tertiary mixture Mo(6)/Cd(2)/Zn(2) in moll -1 hypophosphorous acid has been investigated. (Author)

  18. Using msa-2b as a molecular marker for genotyping Mexican isolates of Babesia bovis.

    Science.gov (United States)

    Genis, Alma D; Perez, Jocelin; Mosqueda, Juan J; Alvarez, Antonio; Camacho, Minerva; Muñoz, Maria de Lourdes; Rojas, Carmen; Figueroa, Julio V

    2009-12-01

    Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2) and msa-2b). To determine the sequence variation among B. bovis Mexican isolates using msa-2b as a genetic marker, PCR amplicons corresponding to msa-2b were cloned and plasmids carrying the corresponding inserts were purified and sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 16 geographically different Mexican B. bovis isolates and a reference strain. Clustal-W multiple alignments of the MSA-2b deduced amino acid sequences performed with the 17 B. bovis Mexican isolates, revealed the identification of three genotypes with a distinct set each of amino acid residues present at the variable region: Genotype I represented by the MO7 strain (in vitro culture-derived from the Mexico isolate) as well as RAD, Chiapas-1, Tabasco and Veracruz-3 isolates; Genotype II, represented by the Jalisco, Mexico and Veracruz-2 isolates; and Genotype III comprising the sequences from most of the isolates studied, Tamaulipas-1, Chiapas-2, Guerrero-1, Nayarit, Quintana Roo, Nuevo Leon, Tamaulipas-2, Yucatan and Guerrero-2. Moreover, these three genotypes could be discriminated against each other by using a PCR-RFLP approach. The results suggest that occurrence of indels within the variable region of msa-2b sequences can be useful markers for identifying a particular genotype present in field populations of B. bovis isolated from infected cattle in Mexico.

  19. The DEDUCE Guided Query tool: providing simplified access to clinical data for research and quality improvement.

    Science.gov (United States)

    Horvath, Monica M; Winfield, Stephanie; Evans, Steve; Slopek, Steve; Shang, Howard; Ferranti, Jeffrey

    2011-04-01

    In many healthcare organizations, comparative effectiveness research and quality improvement (QI) investigations are hampered by a lack of access to data created as a byproduct of patient care. Data collection often hinges upon either manual chart review or ad hoc requests to technical experts who support legacy clinical systems. In order to facilitate this needed capacity for data exploration at our institution (Duke University Health System), we have designed and deployed a robust Web application for cohort identification and data extraction--the Duke Enterprise Data Unified Content Explorer (DEDUCE). DEDUCE is envisioned as a simple, web-based environment that allows investigators access to administrative, financial, and clinical information generated during patient care. By using business intelligence tools to create a view into Duke Medicine's enterprise data warehouse, DEDUCE provides a Guided Query functionality using a wizard-like interface that lets users filter through millions of clinical records, explore aggregate reports, and, export extracts. Researchers and QI specialists can obtain detailed patient- and observation-level extracts without needing to understand structured query language or the underlying database model. Developers designing such tools must devote sufficient training and develop application safeguards to ensure that patient-centered clinical researchers understand when observation-level extracts should be used. This may mitigate the risk of data being misunderstood and consequently used in an improper fashion. Copyright © 2010 Elsevier Inc. All rights reserved.

  20. Assessment of volatile compound profiles and the deduced sensory significance of virgin olive oils from the progeny of Picual×Arbequina cultivars.

    Science.gov (United States)

    Pérez, Ana G; de la Rosa, Raúl; Pascual, Mar; Sánchez-Ortiz, Araceli; Romero-Segura, Carmen; León, Lorenzo; Sanz, Carlos

    2016-01-08

    Volatile compounds are responsible for most of the sensory qualities of virgin olive oil and they are synthesized when enzymes and substrates come together as olive fruit is crushed during the industrial process to obtain the oil. Here we have studied the variability among the major volatile compounds in virgin olive oil prepared from the progeny of a cross of Picual and Arbequina olive cultivars (Olea europaea L.). The volatile compounds were isolated by SPME, and analyzed by HRGC-MS and HRGC-FID. Most of the volatile compounds found in the progeny's oil are produced by the enzymes in the so-called lipoxygenase pathway, and they may be clustered into different groups according to their chain length and polyunsaturated fatty acid origin (linoleic and linolenic acids). In addition, a group of compounds derived from amino acid metabolism and two terpenes also contributed significantly to the volatile fraction, some of which had significant odor values in most of the genotypes evaluated. The volatile compound content of the progeny was very varied, widely transgressing the progenitor levels, suggesting that in breeding programs it might be more effective to consider a larger number of individuals within the same cross than using different crosses with fewer individuals. Multivariate analysis allowed genotypes with particularly interesting volatile compositions to be identified and their flavor quality deduced. Copyright © 2015 Elsevier B.V. All rights reserved.

  1. Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

    Science.gov (United States)

    Martin, Andrew C R

    2014-01-01

    The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.

  2. Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins

    DEFF Research Database (Denmark)

    Madsen, Peder; Rasmussen, H H; Leffers, H

    1992-01-01

    termed PA-FABP (psoriasis-associated fatty acid-binding protein). The deduced sequence predicted a protein with molecular weight of 15,164 daltons and a calculated pI of 6.96, values that are close to those recorded in the keratinocyte 2D gel protein database. The protein comigrated with PA......-FABP as determined by 2D gel analysis of [35S]-methionine-labeled proteins expressed by transformed human amnion (AMA) cells transfected with clone 1592 using the vaccinia virus expression system and reacted with a rabbit polyclonal antibody raised against 2D gel purified PA-FABP. Structural analysis of the amino...... with epidermal growth factor (EGF), pituitary extract, and 10% fetal calf serum] revealed a strong up-regulation of PA-FABP, psoriasin, calgranulins A and B, and a few other proteins that are highly expressed in psoriatic skin. The levels of these proteins exceeded by far those observed in non-cultured normal...

  3. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. Copyright © 2016 Meneghel et al.

  4. Molecular Cloning and Pharmacological Properties of an Acidic PLA2 from Bothrops pauloensis Snake Venom

    Directory of Open Access Journals (Sweden)

    Francis Barbosa Ferreira

    2013-12-01

    Full Text Available In this work, we describe the molecular cloning and pharmacological properties of an acidic phospholipase A2 (PLA2 isolated from Bothrops pauloensis snake venom. This enzyme, denominated BpPLA2-TXI, was purified by four chromatographic steps and represents 2.4% of the total snake venom protein content. BpPLA2-TXI is a monomeric protein with a molecular mass of 13.6 kDa, as demonstrated by Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF analysis and its theoretical isoelectric point was 4.98. BpPLA2-TXI was catalytically active and showed some pharmacological effects such as inhibition of platelet aggregation induced by collagen or ADP and also induced edema and myotoxicity. BpPLA2-TXI displayed low cytotoxicity on TG-180 (CCRF S 180 II and Ovarian Carcinoma (OVCAR-3, whereas no cytotoxicity was found in regard to MEF (Mouse Embryonic Fibroblast and Sarcoma 180 (TIB-66. The N-terminal sequence of forty-eight amino acid residues was determined by Edman degradation. In addition, the complete primary structure of 122 amino acids was deduced by cDNA from the total RNA of the venom gland using specific primers, and it was significantly similar to other acidic D49 PLA2s. The phylogenetic analyses showed that BpPLA2-TXI forms a group with other acidic D49 PLA2s from the gender Bothrops, which are characterized by a catalytic activity associated with anti-platelet effects.

  5. Isolation and sequence of complementary DNA encoding human extracellular superoxide dismutase

    International Nuclear Information System (INIS)

    Hjalmarsson, K.; Marklund, S.L.; Engstroem, A.; Edlund, T.

    1987-01-01

    A complementary DNA (cDNA) clone from a human placenta cDNA library encoding extracellular superoxide dismutase has been isolated and the nucleotide sequence determined. The cDNA has a very high G + C content. EC-SOD is synthesized with a putative 18-amino acid signal peptide, preceding the 222 amino acids in the mature enzyme, indicating that the enzyme is a secretory protein. The first 95 amino acids of the mature enzyme show no sequence homology with other sequenced proteins and there is one possible N-glycosylation site (Asn-89). The amino acid sequence from residues 96-193 shows strong homology (∼ 50%) with the final two-thirds of the sequences of all know eukaryotic CuZn SODs, whereas the homology with the P. leiognathi CuZn SOD is clearly lower. The ligands to Cu and Zn, the cysteines forming the intrasubunit disulfide bridge in the CuZn SODs, and the arginine found in all CuZn SODs in the entrance to the active site can all be identified in EC-SOD. A comparison with bovine CuZn SOD, the three-dimensional structure of which is known, reveals that the homologies occur in the active site and the divergencies are in the part constituting the subunit contact area in CuZn SOD. Amino acid sequence 194-222 in the carboxyl-terminal end of EC-SOD is strongly hydrophilic and contains nine amino acids with a positive charge. This sequence probably confers the affinity of EC-SOD for heparin and heparan sulfate. An analysis of the amino acid sequence homologies with CuZn SODs from various species indicates that the EC-SODs may have evolved form the CuZn SODs before the evolution of fungi and plants

  6. Axolotl hemoglobin: cDNA-derived amino acid sequences of two alpha globins and a beta globin from an adult Ambystoma mexicanum.

    Science.gov (United States)

    Shishikura, Fumio; Takeuchi, Hiro-aki; Nagai, Takatoshi

    2005-11-01

    Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.

  7. GAWK, a novel human pituitary polypeptide: isolation, immunocytochemical localization and complete amino acid sequence.

    Science.gov (United States)

    Benjannet, S; Leduc, R; Lazure, C; Seidah, N G; Marcinkiewicz, M; Chrétien, M

    1985-01-16

    During the course of reverse-phase high pressure liquid chromatography (RP-HPLC) purification of a postulated big ACTH (1) from human pituitary gland extracts, a highly purified peptide bearing no resemblance to any known polypeptide was isolated. The complete sequence of this 74 amino acid polypeptide, called GAWK, has been determined. Search on a computer data bank on the possible homology to any known protein or fragment, using a mutation data matrix, failed to reveal any homology greater than 30%. An antibody produced against a synthetic fragment allowed us to detect several immunoreactive forms. The antisera also enabled us to localize the polypeptide, by immunocytochemistry, in the anterior lobe of the pituitary gland.

  8. Synthesis, physico-chemical properties and complexing abilities of new amphiphilic ligands from D-galacturonic acid.

    Science.gov (United States)

    Allam, Anas; Behr, Jean-Bernard; Dupont, Laurent; Nardello-Rataj, Véronique; Plantier-Royon, Richard

    2010-04-19

    This paper describes a convenient and efficient synthesis of new complexing surfactants from d-galacturonic acid and n-octanol as renewable raw materials in a two-step sequence. In the first step, simultaneous O-glycosidation-esterification under Fischer conditions was achieved. The anomeric ratio of the products was studied based on the main experimental parameters and the activation mode (thermal or microwave). In the second step, aminolysis of the n-octyl ester was achieved with various functionalized primary amines under standard thermal or microwave activation. The physico-chemical properties of these new amphiphilic ligands were measured and these compounds were found to exhibit interesting surface properties. Complexing abilities of one uronamide ligand functionalized with a pyridine moiety toward Cu(II) ions was investigated in solution by EPR titrations. A solid compound was also synthesized and characterized, its relative structure was deduced from spectroscopic data. Copyright (c) 2010 Elsevier Ltd. All rights reserved.

  9. A novel aspartic acid protease gene from pineapple fruit (Ananas comosus): cloning, characterization and relation to postharvest chilling stress resistance.

    Science.gov (United States)

    Raimbault, Astrid-Kim; Zuily-Fodil, Yasmine; Soler, Alain; Cruz de Carvalho, Maria H

    2013-11-15

    A full-length cDNA encoding a putative aspartic acid protease (AcAP1) was isolated for the first time from the flesh of pineapple (Ananas comosus) fruit. The deduced sequence of AcAP1 showed all the common features of a typical plant aspartic protease phytepsin precursor. Analysis of AcAP1 gene expression under postharvest chilling treatment in two pineapple varieties differing in their resistance to blackheart development revealed opposite trends. The resistant variety showed an up-regulation of AcAP1 precursor gene expression whereas the susceptible showed a down-regulation in response to postharvest chilling treatment. The same trend was observed regarding specific AP enzyme activity in both varieties. Taken together our results support the involvement of AcAP1 in postharvest chilling stress resistance in pineapple fruits. Copyright © 2013 Elsevier GmbH. All rights reserved.

  10. Chameleon sequences in neurodegenerative diseases.

    Science.gov (United States)

    Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Chameleon sequences in neurodegenerative diseases

    International Nuclear Information System (INIS)

    Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

    2016-01-01

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix to coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.

  12. Chameleon sequences in neurodegenerative diseases

    Energy Technology Data Exchange (ETDEWEB)

    Bahramali, Golnaz [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Goliaei, Bahram, E-mail: goliaei@ut.ac.ir [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of); Salari, Ali [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of)

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix to coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.

  13. A novel Y-xylosidase, nucleotide sequence encoding it and use thereof.

    NARCIS (Netherlands)

    Graaff, de L.H.; Peij, van N.N.M.E.; Broeck, van den H.C.; Visser, J.

    1996-01-01

    A nucleotide sequence is provided which encodes a peptide having beta-xylosidase activity and exhibits at least 30mino acid identity with the amino acid sequence shown in SEQ ID NO. 1 or hybridises under stringent conditions with a nucleotide sequence shown in SEQ ID NO. 1, or a part thereof having

  14. Expression analysis of a ''Cucurbita'' cDNA encoding endonuclease

    International Nuclear Information System (INIS)

    Szopa, J.

    1995-01-01

    The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs

  15. The molecular biology and biochemistry of rice endosperm α-globulin

    International Nuclear Information System (INIS)

    Shorrosh, B.S.

    1989-01-01

    The author's first objective was to isolate a cDNA clone that encodes the rice endosperm α-globulin. Purified antibodies against a rice storage protein, α-globulin, were used to screen a λgt11 cDNA expression library constructed from immature rice seed endosperm. The cDNA insert of clone 4A1 (identified by antibody screening) was used as a probe to identify long cDNA inserts in the library. The deduced amino acid sequence of clone A3-12 cDNA insert (identified by cDNA screening) contained the amino acid sequences of three cyanogen bromide peptides fragment of α-globulin. The calculated molecular weight and amino acid composition of the deduced amino acid sequence were similar to the α-globulin protein. Northern blot analysis indicated that mRNA of one size, approximately 1.0 kb, is expressed. Southern genomic blot analysis revealed one band with EcoRI or Hind III digestion. Cell-free translation and immunoprecipitation showed that the initial translation product is approximately 2,000 daltons larger than the mature protein. The amino acid sequence of α-globulin revealed limited regions of similarities with wheat storage proteins. The author concludes that the cDNA insert in clone A3-12 contained the entire coding region of α-globulin protein and that α-globulin is encoded by a single gene. My second objective was to inhibit the degradation of α-globulin in the salt extract of rice flour. The salt extract of rice flour contained an acid protease whose optimal pH was 3 for 3 H-casein hydrolysis. A polypeptide with molecular weight of 20,000 was immunologically reactive with α-globulin antibodies and is produced by limited proteolysis in the extract. Pepstatin inhibited the proteolysis of 3H-casein and slowed the proteolysis of α-globulin

  16. Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings.

    Science.gov (United States)

    Zhu, Fu-Yuan; Chen, Mo-Xian; Ye, Neng-Hui; Shi, Lu; Ma, Kai-Long; Yang, Jing-Fang; Cao, Yun-Ying; Zhang, Youjun; Yoshida, Takuya; Fernie, Alisdair R; Fan, Guang-Yi; Wen, Bo; Zhou, Ruo; Liu, Tie-Yuan; Fan, Tao; Gao, Bei; Zhang, Di; Hao, Ge-Fei; Xiao, Shi; Liu, Ying-Gao; Zhang, Jianhua

    2017-08-01

    In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  17. The nucleotide sequences of two leghemoglobin genes from soybean

    DEFF Research Database (Denmark)

    Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O

    1982-01-01

    We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences in identical positions. Comparison of the coding sequences with known amino-acid sequences of soybean leghemoglobins suggest that the two genes...

  18. Osteocalcin protein sequences of Neanderthals and modern primates.

    Science.gov (United States)

    Nielsen-Marsh, Christina M; Richards, Michael P; Hauschka, Peter V; Thomas-Oates, Jane E; Trinkaus, Erik; Pettitt, Paul B; Karavanic, Ivor; Poinar, Hendrik; Collins, Matthew J

    2005-03-22

    We report here protein sequences of fossil hominids, from two Neanderthals dating to approximately 75,000 years old from Shanidar Cave in Iraq. These sequences, the oldest reported fossil primate protein sequences, are of bone osteocalcin, which was extracted and sequenced by using MALDI-TOF/TOF mass spectrometry. Through a combination of direct sequencing and peptide mass mapping, we determined that Neanderthals have an osteocalcin amino acid sequence that is identical to that of modern humans. We also report complete osteocalcin sequences for chimpanzee (Pan troglodytes) and gorilla (Gorilla gorilla gorilla) and a partial sequence for orangutan (Pongo pygmaeus), all of which are previously unreported. We found that the osteocalcin sequences of Neanderthals, modern human, chimpanzee, and orangutan are unusual among mammals in that the ninth amino acid is proline (Pro-9), whereas most species have hydroxyproline (Hyp-9). Posttranslational hydroxylation of Pro-9 in osteocalcin by prolyl-4-hydroxylase requires adequate concentrations of vitamin C (l-ascorbic acid), molecular O(2), Fe(2+), and 2-oxoglutarate, and also depends on enzyme recognition of the target proline substrate consensus sequence Leu-Gly-Ala-Pro-9-Ala-Pro-Tyr occurring in most mammals. In five species with Pro-9-Val-10, hydroxylation is blocked, whereas in gorilla there is a mixture of Pro-9 and Hyp-9. We suggest that the absence of hydroxylation of Pro-9 in Pan, Pongo, and Homo may reflect response to a selective pressure related to a decline in vitamin C in the diet during omnivorous dietary adaptation, either independently or through the common ancestor of these species.

  19. Sequence analysis of putative swrW gene required for surfactant ...

    African Journals Online (AJOL)

    Serratia marcescens produces biosurfactant serrawettin, essential for its population migration behavior. Serrawettin W1 was revealed to be an antibiotic serratamolide that makes it significant for deoxyribonucleic acid (DNA) and protein sequence analysis. Four nucleotide and amino-acid sequences from local strains ...

  20. The complete genomic sequence of pepper yellow leaf curl virus (PYLCV and its implications for our understanding of evolution dynamics in the genus polerovirus.

    Directory of Open Access Journals (Sweden)

    Aviv Dombrovsky

    Full Text Available We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV. PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV (both poleroviruses. The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs, which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93% and to ORF5 of CABYV (87%. Both PYLCV and Pepper vein yellowing virus (PeVYV contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP, may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2 were identified between nucleotides 4,962 and 5,061 (ORF 5 and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3 with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s between poleroviruses co-infecting a common host(s, resulting in the emergence of PYLCV, a novel pathogen with a wider host range.

  1. The complete genomic sequence of pepper yellow leaf curl virus (PYLCV) and its implications for our understanding of evolution dynamics in the genus polerovirus.

    Science.gov (United States)

    Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel

    2013-01-01

    We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range.

  2. Quantitative thermodynamic predication of interactions between nucleic acid and non-nucleic acid species using Microsoft excel.

    Science.gov (United States)

    Zou, Jiaqi; Li, Na

    2013-09-01

    Proper design of nucleic acid sequences is crucial for many applications. We have previously established a thermodynamics-based quantitative model to help design aptamer-based nucleic acid probes by predicting equilibrium concentrations of all interacting species. To facilitate customization of this thermodynamic model for different applications, here we present a generic and easy-to-use platform to implement the algorithm of the model with Microsoft(®) Excel formulas and VBA (Visual Basic for Applications) macros. Two Excel spreadsheets have been developed: one for the applications involving only nucleic acid species, the other for the applications involving both nucleic acid and non-nucleic acid species. The spreadsheets take the nucleic acid sequences and the initial concentrations of all species as input, guide the user to retrieve the necessary thermodynamic constants, and finally calculate equilibrium concentrations for all species in various bound and unbound conformations. The validity of both spreadsheets has been verified by comparing the modeling results with the experimental results on nucleic acid sequences reported in the literature. This Excel-based platform described here will allow biomedical researchers to rationalize the sequence design of nucleic acid probes using the thermodynamics-based modeling even without relevant theoretical and computational skills. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  3. Nonlinear analysis of sequence symmetry of beta-trefoil family proteins

    Energy Technology Data Exchange (ETDEWEB)

    Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xu Ruizhen [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: yxiao@mail.hust.edu.cn

    2005-07-01

    The tertiary structures of proteins of beta-trefoil family have three-fold quasi-symmetry while their amino acid sequences appear almost at random. In the present paper we show that these amino acid sequences have hidden symmetries in fact and furthermore the degrees of these hidden symmetries are the same as those of their tertiary structures. We shall present a modified recurrence plot to reveal hidden symmetries in protein sequences. Our results can explain the contradiction in sequence-structure relations of proteins of beta-trefoil family.

  4. Genomic sequencing in clinical trials

    OpenAIRE

    Mestan, Karen K; Ilkhanoff, Leonard; Mouli, Samdeep; Lin, Simon

    2011-01-01

    Abstract Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to fin...

  5. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    NARCIS (Netherlands)

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which

  6. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Directory of Open Access Journals (Sweden)

    James B Howard

    Full Text Available Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification

  7. Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk

    2013-01-01

    to the consensus sequence. Additionally, we got an average sequence depth for the genome of 4000 for the Iontorrent PGM and 400 for the FLX platform making the mapping suitable for single nucleotide variant (SNV) detection. The analysis revealed a single non-silent SNV A10665G leading to the amino acid change D......Next Generation Sequencing (NGS) is becoming more adopted into viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Iontorrent PGM platforms...

  8. Isolation of cDNA clones coding for human tissue factor: primary structure of the protein and cDNA

    International Nuclear Information System (INIS)

    Spicer, E.K.; Horton, R.; Bloem, L.

    1987-01-01

    Tissue factor is a membrane-bound procoagulant protein that activates the extrinsic pathway of blood coagulation in the presence of factor VII and calcium. λ Phage containing the tissue factor gene were isolated from a human placental cDNA library. The amino acid sequence deduced from the nucleotide sequence of the cDNAs indicates that tissue factor is synthesized as a higher molecular weight precursor with a leader sequence of 32 amino acids, while the mature protein is a single polypeptide chain composed of 263 residues. The derived primary structure of tissue factor has been confirmed by comparison to protein and peptide sequence data. The sequence of the mature protein suggests that there are three distinct domains: extracellular, residues 1-219; hydrophobic, residues 220-242; and cytoplasmic, residues 243-263. Three potential N-linked carbohydrate attachment sites occur in the extracellular domain. The amino acid sequence of tissue factor shows no significant homology with the vitamin K-dependent serine proteases, coagulation cofactors, or any other protein in the National Biomedical Research Foundation sequence data bank (Washington, DC)

  9. Peptide Nucleic Acids Having Amino Acid Side Chains

    DEFF Research Database (Denmark)

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than the corresponding DNA or RNA strands, and exhibit increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from a group consisting...

  10. Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

    DEFF Research Database (Denmark)

    Radova, A.; Sebela, M.; Galuszka, P.

    2001-01-01

    This paper reports the first purification method developed for the isolation of an homogeneous polyamine oxidase (PAO) from etiolated barley seedlings. The crude enzyme preparation was obtained after initial precipitation of the extract with protamine sulphate and ammonium sulphate. The enzyme...... was further confirmed by measuring the fluorescence spectra, Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars: its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...... PAO shows a high degree of similarity to that of maize PAO and to several other flavoprotein oxidases. The polyamines spermine and spermidine were the only two substrates of the enzyme with K-m values 4 x 10(-5) and 3 x 10(-5) M and pH optima of 5.0 and 6.0, respectively. Barley polyamine oxidase...

  11. Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

    Science.gov (United States)

    Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

    2015-11-21

    Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties.

  12. Structural and functional insights from the metagenome of an acidic hot spring microbial planktonic community in the Colombian Andes.

    Directory of Open Access Journals (Sweden)

    Diego Javier Jiménez

    Full Text Available A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level acidic hot spring El Coquito (EC. A classification of unassembled metagenomic reads using different databases showed a high proportion of Gammaproteobacteria and Alphaproteobacteria (in total read affiliation, and through taxonomic affiliation of 16S rRNA gene fragments we observed the presence of Proteobacteria, micro-algae chloroplast and Firmicutes. Reads mapped against the genomes Acidiphilium cryptum JF-5, Legionella pneumophila str. Corby and Acidithiobacillus caldus revealed the presence of transposase-like sequences, potentially involved in horizontal gene transfer. Functional annotation and hierarchical comparison with different datasets obtained by pyrosequencing in different ecosystems showed that the microbial community also contained extensive DNA repair systems, possibly to cope with ultraviolet radiation at such high altitudes. Analysis of genes involved in the nitrogen cycle indicated the presence of dissimilatory nitrate reduction to N2 (narGHI, nirS, norBCDQ and nosZ, associated with Proteobacteria-like sequences. Genes involved in the sulfur cycle (cysDN, cysNC and aprA indicated adenylsulfate and sulfite production that were affiliated to several bacterial species. In summary, metagenomic sequence data provided insight regarding the structure and possible functions of this hot spring microbial community, describing some groups potentially involved in the nitrogen and sulfur cycling in this environment.

  13. Structural and Functional Insights from the Metagenome of an Acidic Hot Spring Microbial Planktonic Community in the Colombian Andes

    Science.gov (United States)

    Jiménez, Diego Javier; Andreote, Fernando Dini; Chaves, Diego; Montaña, José Salvador; Osorio-Forero, Cesar; Junca, Howard; Zambrano, María Mercedes; Baena, Sandra

    2012-01-01

    A taxonomic and annotated functional description of microbial life was deduced from 53 Mb of metagenomic sequence retrieved from a planktonic fraction of the Neotropical high Andean (3,973 meters above sea level) acidic hot spring El Coquito (EC). A classification of unassembled metagenomic reads using different databases showed a high proportion of Gammaproteobacteria and Alphaproteobacteria (in total read affiliation), and through taxonomic affiliation of 16S rRNA gene fragments we observed the presence of Proteobacteria, micro-algae chloroplast and Firmicutes. Reads mapped against the genomes Acidiphilium cryptum JF-5, Legionella pneumophila str. Corby and Acidithiobacillus caldus revealed the presence of transposase-like sequences, potentially involved in horizontal gene transfer. Functional annotation and hierarchical comparison with different datasets obtained by pyrosequencing in different ecosystems showed that the microbial community also contained extensive DNA repair systems, possibly to cope with ultraviolet radiation at such high altitudes. Analysis of genes involved in the nitrogen cycle indicated the presence of dissimilatory nitrate reduction to N2 (narGHI, nirS, norBCDQ and nosZ), associated with Proteobacteria-like sequences. Genes involved in the sulfur cycle (cysDN, cysNC and aprA) indicated adenylsulfate and sulfite production that were affiliated to several bacterial species. In summary, metagenomic sequence data provided insight regarding the structure and possible functions of this hot spring microbial community, describing some groups potentially involved in the nitrogen and sulfur cycling in this environment. PMID:23251687

  14. The use of orthologous sequences to predict the impact of amino acid substitutions on protein function.

    Directory of Open Access Journals (Sweden)

    Nicholas J Marini

    2010-05-01

    Full Text Available Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR, in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that "resurrects" the human-macaque ancestor result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an "ancestral site preservation" measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants.

  15. Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

    Science.gov (United States)

    Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

    2017-04-01

    The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes among other several distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLPs) encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP-genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for better understanding of the mechanisms involved in SLPs functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.

  16. RNA Sequencing and Coexpression Analysis Reveal Key Genes Involved in α-Linolenic Acid Biosynthesis in Perilla frutescens Seed

    Directory of Open Access Journals (Sweden)

    Tianyuan Zhang

    2017-11-01

    Full Text Available Perilla frutescen is used as traditional food and medicine in East Asia. Its seeds contain high levels of α-linolenic acid (ALA, which is important for health, but is scarce in our daily meals. Previous reports on RNA-seq of perilla seed had identified fatty acid (FA and triacylglycerol (TAG synthesis genes, but the underlying mechanism of ALA biosynthesis and its regulation still need to be further explored. So we conducted Illumina RNA-sequencing in seven temporal developmental stages of perilla seeds. Sequencing generated a total of 127 million clean reads, containing 15.88 Gb of valid data. The de novo assembly of sequence reads yielded 64,156 unigenes with an average length of 777 bp. A total of 39,760 unigenes were annotated and 11,693 unigenes were found to be differentially expressed in all samples. According to Kyoto Encyclopedia of Genes and Genomes (KEGG pathway analysis, 486 unigenes were annotated in the “lipid metabolism” pathway. Of these, 150 unigenes were found to be involved in fatty acid (FA biosynthesis and triacylglycerol (TAG assembly in perilla seeds. A coexpression analysis showed that a total of 104 genes were highly coexpressed (r > 0.95. The coexpression network could be divided into two main subnetworks showing over expression in the medium or earlier and late phases, respectively. In order to identify the putative regulatory genes, a transcription factor (TF analysis was performed. This led to the identification of 45 gene families, mainly including the AP2-EREBP, bHLH, MYB, and NAC families, etc. After coexpression analysis of TFs with highly expression of FAD2 and FAD3 genes, 162 TFs were found to be significantly associated with two FAD genes (r > 0.95. Those TFs were predicted to be the key regulatory factors in ALA biosynthesis in perilla seed. The qRT-PCR analysis also verified the relevance of expression pattern between two FAD genes and partial candidate TFs. Although it has been reported that some TFs

  17. Identifying a base in a nucleic acid

    Science.gov (United States)

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2005-02-08

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  18. Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Budzanivska I. G.

    2014-03-01

    Full Text Available Aim. Identification of the widespread Ukrainian isolate(s of PVY (Potato virus Y in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene has a higher percentage of homology with the recombinant isolates (strains of this pathogen (approx. 98.8– 99.8 % of homology for both nucleotide and translated amino acid sequences of the CP gene. The Ukrainian isolate of PVY is positioned in the separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate which has been attributed to the strain group O, subclade N:O.

  19. Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.

    Science.gov (United States)

    Hayat, Maqsood; Khan, Asifullah

    2011-02-21

    Membrane proteins are vital type of proteins that serve as channels, receptors, and energy transducers in a cell. Prediction of membrane protein types is an important research area in bioinformatics. Knowledge of membrane protein types provides some valuable information for predicting novel example of the membrane protein types. However, classification of membrane protein types can be both time consuming and susceptible to errors due to the inherent similarity of membrane protein types. In this paper, neural networks based membrane protein type prediction system is proposed. Composite protein sequence representation (CPSR) is used to extract the features of a protein sequence, which includes seven feature sets; amino acid composition, sequence length, 2 gram exchange group frequency, hydrophobic group, electronic group, sum of hydrophobicity, and R-group. Principal component analysis is then employed to reduce the dimensionality of the feature vector. The probabilistic neural network (PNN), generalized regression neural network, and support vector machine (SVM) are used as classifiers. A high success rate of 86.01% is obtained using SVM for the jackknife test. In case of independent dataset test, PNN yields the highest accuracy of 95.73%. These classifiers exhibit improved performance using other performance measures such as sensitivity, specificity, Mathew's correlation coefficient, and F-measure. The experimental results show that the prediction performance of the proposed scheme for classifying membrane protein types is the best reported, so far. This performance improvement may largely be credited to the learning capabilities of neural networks and the composite feature extraction strategy, which exploits seven different properties of protein sequences. The proposed Mem-Predictor can be accessed at http://111.68.99.218/Mem-Predictor. Copyright © 2010 Elsevier Ltd. All rights reserved.

  20. Cloning and characterization of the gene encoding IMP dehydrogenase from Arabidopsis thaliana.

    Science.gov (United States)

    Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

    1996-10-03

    We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Arabidopsis thaliana (At). The transcription unit of the At gene spans approximately 1900 bp and specifies a protein of 503 amino acids with a calculated relative molecular mass (M(r)) of 54,190. The gene is comprised of a minimum of four introns and five exons with all donor and acceptor splice sequences conforming to previously proposed consensus sequences. The deduced IMPDH amino-acid sequence from At shows a remarkable similarity to other eukaryotic IMPDH sequences, with a 48% identity to human Type II enzyme. Allowing for conservative substitutions, the enzyme is 69% similar to human Type II IMPDH. The putative active-site sequence of At IMPDH conforms to the IMP dehydrogenase/guanosine monophosphate reductase motif and contains an essential active-site cysteine residue.

  1. Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence

    NARCIS (Netherlands)

    Al-Shahib, A.; Breitling, R.; Gilbert, D.

    2005-01-01

    Abstract: When the standard approach to predict protein function by sequence homology fails, other alternative methods can be used that require only the amino acid sequence for predicting function. One such approach uses machine learning to predict protein function directly from amino acid sequence

  2. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    STORAGESEVER

    2010-07-19

    Jul 19, 2010 ... and antisense primers, a single band of 573 base pairs .... Amino acid sequence alignment of Cluster I and Cluster II of phylogenetic tree. First ten sequences ... sequence weighting, postion-spiecific gap penalties and weight.

  3. Chimera: construction of chimeric sequences for phylogenetic analysis

    NARCIS (Netherlands)

    Leunissen, J.A.M.

    2003-01-01

    Chimera allows the construction of chimeric protein or nucleic acid sequence files by concatenating sequences from two or more sequence files in PHYLIP formats. It allows the user to interactively select genes and species from the input files. The concatenated result is stored to one single output

  4. Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

    Science.gov (United States)

    Zimmermann, Karel; Gibrat, Jean-François

    2010-01-04

    Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.

  5. Amino acid "little Big Bang": Representing amino acid substitution matrices as dot products of Euclidian vectors

    Directory of Open Access Journals (Sweden)

    Zimmermann Karel

    2010-01-01

    Full Text Available Abstract Background Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. Results We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. Conclusions This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.

  6. Characterization of a Bordetella pertussis Diaminopimelate (DAP) Biosynthesis Locus Identifies dapC, a Novel Gene Coding for an N-Succinyl-l,l-DAP Aminotransferase

    OpenAIRE

    Fuchs, Thilo M.; Schneider, Boris; Krumbach, Karin; Eggeling, Lothar; Gross, Roy

    2000-01-01

    The functional complementation of two Escherichia coli strains defective in the succinylase pathway of meso-diaminopimelate (meso-DAP) biosynthesis with a Bordetella pertussis gene library resulted in the isolation of a putative dap operon containing three open reading frames (ORFs). In line with the successful complementation of the E. coli dapD and dapE mutants, the deduced amino acid sequences of two ORFs revealed significant sequence similarities with the DapD and DapE proteins of E. coli...

  7. Characterization of a Bordetella pertussis diaminopimelate (DAP) biosynthesis locus identifies dapC, a novel gene coding for an N-succinyl-L, L-DAP aminotransferase

    OpenAIRE

    Fuchs, T. M.; Schneider, B.; Krumbach, K.; Eggeling, L.; Gross, S. M.

    2000-01-01

    The functional complementation of two Escherichia coli strains defective in the succinylase pathway of meso-diaminopimelate (meso DAP) biosynthesis with a Bordetella pertussis gene library resulted in the isolation of a putative dap operon containing three open reading frames (ORFs), In line with the successful complementation of the E, coli dapD and dapE mutants, the deduced amino acid sequences of two ORFs revealed significant sequence similarities with the DapD and DapE proteins of E, coli...

  8. Molecular Evolution of the dotA Gene in Legionella pneumophila

    OpenAIRE

    Ko, Kwan Soo; Hong, Seong Karp; Lee, Hae Kyung; Park, Mi-Yeoun; Kook, Yoon-Hoh

    2003-01-01

    The molecular evolution of dotA, which is related to the virulence of Legionella pneumophila, was investigated by comparing the sequences of 15 reference strains (serogroups 1 to 15). It was found that dotA has a complex mosaic structure. The whole dotA gene of Legionella pneumophila subsp. pneumophila serogroups 2, 6, and 12 has been transferred from Legionella pneumophila subsp. fraseri. A discrepancy was found between the trees inferred from the nucleotide and deduced amino acid sequences ...

  9. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones.

    Science.gov (United States)

    Martins, Juliana R; Nunes, Francis M F; Cristino, Alexandre S; Simões, Zilá L P; Bitondi, Márcia M G

    2010-03-26

    Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body and gonads suggest that, in addition to their primary

  10. Clinical sequencing in leukemia with the assistance of artificial intelligence.

    Science.gov (United States)

    Tojo, Arinobu

    2017-01-01

    Next generation sequencing (NGS) of cancer genomes is now becoming a prerequisite for accurate diagnosis and proper treatment in clinical oncology. Because the genomic regions for NGS expand from a certain set of genes to the whole exome or whole genome, the resulting sequence data becomes incredibly enormous and makes it quite laborious to translate the genomic data into medicine, so-called annotation and curation. We organized a clinical sequencing team and established a bidirectional (bed-to-bench and bench-to-bed) system to integrate clinical and genomic data for hematological malignancies. We also started a collaborative research project with IBM Japan to adopt the artificial intelligence Watson for Genomics (WfG) to the pipeline of medical informatics. Genomic DNA was prepared from malignant as well as normal tissues in each patient and subjected to NGS. Sequence data was analyzed using an in-house semi-automated pipeline in combination with WfG, which was used to identify candidate driver mutations and relevant pathways from which applicable drug information was deduced. Currently, we have analyzed more than 150 patients with hematological disorders, including AML and ALL, and obtained many informative findings. In this presentation, I will introduce some of the achievements we have made so far.

  11. Sequencing and expression analysis of CD3γ/δ and CD3ɛ chains in mandarin fish, Siniperca chuatsi

    Science.gov (United States)

    Guo, Zheng; Nie, Pin

    2013-01-01

    The genomic and cDNA sequences of the CD3γ/δ and CD3ɛ homologues in the mandarin fish, Siniperca chuats i, were determined. As in other vertebrate CD3 molecules, the deduced amino acid sequences of mandarin fish CD3γ/δ and CD3ɛ contained conserved residues and motifs, such as cysteine residues and CXXC and immunoreceptor tyrosine-based activation motifs. However, mandarin fish CD3γ/δ and CD3ɛ showed some differences to their mammalian counterparts, specifically the absence of a negatively charged residue in the transmembrane region of CD3γ/δ. Additionally, while an N -glycosylation site was present in CD3ɛ, the site was not observed in CD3γ/δ. The CD3γ/δ and CD3ɛ subunit sequences contain six and five exons, respectively, consistent with homologues from Atlantic salmon, Salmo salar. Phylogenetic analysis also revealed that CD3γ/δ and CD3ɛ in mandarin fish are closely related to their counterparts in Acanthopterygian fish. Real-time PCR showed CD3γ/δ and CD3ɛ were expressed mainly in the thymus and spleen in normal healthy fish and, to a lesser extent, in mucosal-associated lymphoid tissues, such as the intestine and gills. When lymphocytes isolated from head kidney were treated with the mitogens phytohemagglutinin, concanavalin, and polyriboinosinic polyribocytidylic acid, mRNA expression levels of CD3γ/δ and CD3ɛ were significantly elevated within 12 h of treatment. This indicated the presence of T lymphocytes in the head kidney of teleost fish, and also the recognition of mitogens by the lymphocytes. Mandarin fish infected with the bacterial pathogen Flavobacterium columnare also showed an increase in the expression of CD3γ/δ and CD3ɛ mRNA, indicating that CD3γ/δ and CD3ɛ lymphocytes are involved in the immune response of this species.

  12. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    KAUST Repository

    Chen, Peng; Li, Jinyan; Limsoon, Wong; Kuwahara, Hiroyuki; Huang, Jianhua Z.; Gao, Xin

    2013-01-01

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  13. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    KAUST Repository

    Chen, Peng

    2013-07-23

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  14. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences.

    Science.gov (United States)

    Chen, Peng; Li, Jinyan; Wong, Limsoon; Kuwahara, Hiroyuki; Huang, Jianhua Z; Gao, Xin

    2013-08-01

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. Copyright © 2013 Wiley Periodicals, Inc.

  15. Complete genome sequence of Corynebacterium variabile DSM 44702 isolated from the surface of smear-ripened cheeses and insights into cheese ripening and flavor generation

    Directory of Open Access Journals (Sweden)

    Trost Eva

    2011-11-01

    Full Text Available Abstract Background Corynebacterium variabile is part of the complex microflora on the surface of smear-ripened cheeses and contributes to the development of flavor and textural properties during cheese ripening. Still little is known about the metabolic processes and microbial interactions during the production of smear-ripened cheeses. Therefore, the gene repertoire contributing to the lifestyle of the cheese isolate C. variabile DSM 44702 was deduced from the complete genome sequence to get a better understanding of this industrial process. Results The chromosome of C. variabile DSM 44702 is composed of 3, 433, 007 bp and contains 3, 071 protein-coding regions. A comparative analysis of this gene repertoire with that of other corynebacteria detected 1, 534 predicted genes to be specific for the cheese isolate. These genes might contribute to distinct metabolic capabilities of C. variabile, as several of them are associated with metabolic functions in cheese habitats by playing roles in the utilization of alternative carbon and sulphur sources, in amino acid metabolism, and fatty acid degradation. Relevant C. variabile genes confer the capability to catabolize gluconate, lactate, propionate, taurine, and gamma-aminobutyric acid and to utilize external caseins. In addition, C. variabile is equipped with several siderophore biosynthesis gene clusters for iron acquisition and an exceptional repertoire of AraC-regulated iron uptake systems. Moreover, C. variabile can produce acetoin, butanediol, and methanethiol, which are important flavor compounds in smear-ripened cheeses. Conclusions The genome sequence of C. variabile provides detailed insights into the distinct metabolic features of this bacterium, implying a strong adaption to the iron-depleted cheese surface habitat. By combining in silico data obtained from the genome annotation with previous experimental knowledge, occasional observations on genes that are involved in the complex

  16. Sequence characterization and glycosylation sites identification of donkey milk lactoferrin by multiple enzyme digestions and mass spectrometry

    DEFF Research Database (Denmark)

    Gallina, Serafina; Cunsolo, Vincenzo; Saletti, Rosaria

    2016-01-01

    Lactoferrin, a protein showing an array of biochemical properties, including immuno-modulation, iron-binding ability, as well as antioxidant, antibacterial and antiviral activities, but which may also represent a potential milk allergen, was isolated from donkey milk by ion exchange chromatography...... characterization of donkey lactoferrin sequence, that, at least for the covered sequence, differs from the horse genomic deduced sequence (UniProtKB Acc. Nr. O77811) by five point substitutions located at positions 91 (Arg → His), 328 (Thr → Ile/Leu), 466 (Ala → Gly), 642 (Asn → Ser) and 668 (Ser → Ala). Analysis...... of the glycosylated protein showed that glycans in donkey lactoferrin are linked to the protein backbone via an amide bond to asparagine residues located at the positions 137, 281 and 476....

  17. Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

    Directory of Open Access Journals (Sweden)

    Zijian Li

    2014-02-01

    Full Text Available Amino acid substitutions in influenza A virus are the main reasons for both antigenic shift and virulence change, which result from non-synonymous mutations in the viral genome. Nucleocapsid protein (NP, one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digestion of nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1, which was isolated and purified by SDS poly-acrylamide gel electrophoresis. Thus, LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues R452K, T423A and N430T in two tryptic peptides. The obtained results provided experimental evidence that amino acid substitutions resulted from non-synonymous gene mutations could be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.

  18. The ura5 gene of the ascomycete Sordaria macrospora: molecular cloning, characterization and expression in Escherichia coli.

    Science.gov (United States)

    Le Chevanton, L; Leblon, G

    1989-04-15

    We cloned the ura5 gene coding for the orotate phosphoribosyl transferase from the ascomycete Sordaria macrospora by heterologous probing of a Sordaria genomic DNA library with the corresponding Podospora anserina sequence. The Sordaria gene was expressed in an Escherichia coli pyrE mutant strain defective for the same enzyme, and expression was shown to be promoted by plasmid sequences. The nucleotide sequence of the 1246-bp DNA fragment encompassing the region of homology with the Podospora gene has been determined. This sequence contains an open reading frame of 699 nucleotides. The deduced amino acid sequence shows 72% similarity with the corresponding Podospora protein.

  19. Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

    Directory of Open Access Journals (Sweden)

    Nagib eAhsan

    2012-07-01

    Full Text Available The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.

  20. The catalytic chain of human complement subcomponent C1r. Purification and N-terminal amino acid sequences of the major cyanogen bromide-cleavage fragments.

    Science.gov (United States)

    Arlaud, G J; Gagnon, J; Porter, R R

    1982-01-01

    1. The a- and b-chains of reduced and alkylated human complement subcomponent C1r were separated by high-pressure gel-permeation chromatography and isolated in good yield and in pure form. 2. CNBr cleavage of C1r b-chain yielded eight major peptides, which were purified by gel filtration and high-pressure reversed-phase chromatography. As determined from the sum of their amino acid compositions, these peptides accounted for a minimum molecular weight of 28 000, close to the value 29 100 calculated from the whole b-chain. 3. N-Terminal sequence determinations of C1r b-chain and its CNBr-cleavage peptides allowed the identification of about two-thirds of the amino acids of C1r b-chain. From our results, and on the basis of homology with other serine proteinases, an alignment of the eight CNBr-cleavage peptides from C1r b-chain is proposed. 4. The residues forming the 'charge-relay' system of the active site of serine proteinases (His-57, Asp-102 and Ser-195 in the chymotrypsinogen numbering) are found in the corresponding regions of C1r b-chain, and the amino acid sequence around these residues has been determined. 5. The N-terminal sequence of C1r b-chain has been extended to residue 60 and reveals that C1r b-chain lacks the 'histidine loop', a disulphide bond that is present in all other known serine proteinases.

  1. Sequence dependent aggregation of peptides and fibril formation

    Science.gov (United States)

    Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

    2017-09-01

    Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.

  2. Next generation sequencing (NGS)technologies and applications

    Energy Technology Data Exchange (ETDEWEB)

    Vuyisich, Momchilo [Los Alamos National Laboratory

    2012-09-11

    NGS technology overview: (1) NGS library preparation - Nucleic acids extraction, Sample quality control, RNA conversion to cDNA, Addition of sequencing adapters, Quality control of library; (2) Sequencing - Clonal amplification of library fragments, (except PacBio), Sequencing by synthesis, Data output (reads and quality); and (3) Data analysis - Read mapping, Genome assembly, Gene expression, Operon structure, sRNA discovery, and Epigenetic analyses.

  3. BGL6 beta-glucosidase and nucleic acids encoding the same

    Science.gov (United States)

    Dunn-Coleman, Nigel [Los Gatos, CA; Ward, Michael [San Francisco, CA

    2009-09-01

    The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.

  4. Identities among actin-encoding cDNAs of the Nile tilapia (Oreochromis niloticus and other eukaryote species revealed by nucleotide and amino acid sequence analyses

    Directory of Open Access Journals (Sweden)

    Andréia B. Poletto

    2008-01-01

    Full Text Available Actin-encoding cDNAs of Nile tilapia (Oreochromis niloticus were isolated by RT-PCR using total RNA samples of different tissues and further characterized by nucleotide sequencing and in silico amino acid (aa sequence analysis. Comparisons among the actin gene sequences of O. niloticus and those of other species evidenced that the isolated genes present a high similarity to other fish and other vertebrate actin genes. The highest nucleotide resemblance was observed between O. niloticus and O. mossambicus a-actin and b-actin genes. Analysis of the predicted aa sequences revealed two distinct types of cytoplasmic actins, one cardiac muscle actin type and one skeletal muscle actin type that were expressed in different tissues of Nile tilapia. The evolutionary relationships between the Nile tilapia actin genes and diverse other organisms is discussed.

  5. Cloning and analysis of calmodulin gene from Porphyra yezoensis Ueda (Bangiales, Rhodophyta)

    Science.gov (United States)

    Wang, Mengqiang; Mao, Yunxiang; Zhuang, Yunyun; Kong, Fanna; Sui, Zhenghong

    2009-09-01

    In order to understand the mechanisms of signal transduction and anti-desiccation mechanisms of Porphyra yezoensis, cDNA and its genomic sequence of Calmodulin gene (CaM) was cloned by the technique of polymerase chain reaction (PCR) based on the analysis of P. yezoensis ESTs from dbEST database. The result shows that the full-length cDNA of CaM consists of 603 bps including an ORF encoding for 151 amino acids and a terminate codon UGA, while the length of genomic sequence is 1231 bps including 2 exons and 1 intron. The average GC content of the coding region is 58.77%, while the GC content of the third position of this gene is as high as 82.23%. Four Ca2+ binding sites (EF-hand) are found in this gene. The predicted molecular mass of the deduced peptide is 16688.72 Da and the pI is 4.222. By aligning with known CaM genes, the similarity of CaM gene sequence with homologous genes in Chlamydomonas incerta and Chlamydomonas reinhardtii is 72.7% and 72.2% respectively, and the similarity of the deduced amino acid sequence of CaM gene with homologous genes in C. incerta and C. reinhardtii are both 71.5%. This is the first report on CaM from a species of Rhodophyta.

  6. MIPS: a database for genomes and protein sequences.

    Science.gov (United States)

    Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

    2002-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).

  7. cDNA sequences of two apolipoproteins from lamprey

    International Nuclear Information System (INIS)

    Pontes, M.; Xu, X.; Graham, D.; Riley, M.; Doolittle, R.F.

    1987-01-01

    The messages for two small but abundant apolipoproteins found in lamprey blood plasma were cloned with the aid of oligonucleotide probes based on amino-terminal sequences. In both cases, numerous clones were identified in a lamprey liver cDNA library, consistent with the great abundance of these proteins in lamprey blood. One of the cDNAs (LAL1) has a coding region of 105 amino acids that corresponds to a 21-residue signal peptide, a putative 8-residue propeptide, and the 76-residue mature protein found in blood. The other cDNA (LAL2) codes for a total of 191 residues, the first 23 of which constitute a signal peptide. The two proteins, which occur in the high-density lipoprotein fraction of ultracentrifuged plasma, have amino acid compositions similar to those of apolipoproteins found in mammalian blood; computer analysis indicates that the sequences are largely helix-permissive. When the sequences were searched against an amino acid sequence data base, rat apolipoprotein IV was the best matching candidate in both cases. Although a reasonable alignment can be made with that sequence and LAL1, definitive assignment of the two lamprey proteins to typical mammalian classes cannot be made at this point

  8. Multiplex, rapid and sensitive isothermal detection of nucleic-acid sequence by endonuclease restriction-mediated real-time multiple cross displacement amplification

    Directory of Open Access Journals (Sweden)

    Yi eWang

    2016-05-01

    Full Text Available We have devised a novel isothermal amplification technology, termed endonuclease restriction-mediated real-time multiple cross displacement amplification (ET-MCDA, which facilitated multiplex, rapid, specific and sensitive detection of nucleic-acid sequences at a constant temperature. The ET-MCDA integrated multiple cross displacement amplification strategy, restriction endonuclease cleavage and real-time fluorescence detection technique. In the ET-MCDA system, the functional cross primer E-CP1 or E-CP2 was constructed by adding a short sequence at the 5’ end of CP1 or CP2, respectively, and the new E-CP1 or E-CP2 primer was labelled at the 5’ end with a fluorophore and in the middle with a dark quencher. The restriction endonuclease Nb.BsrDI specifically recognized the short sequence and digested the newly synthesized double-stranded terminal sequences (5’ end short sequences and their complementary sequences, which released the quenching, resulting on a gain of fluorescence signal. Thus, the ET-MCDA allowed real-time detection of single or multiple targets in only a single reaction, and the positive results were observed in as short as 12 minutes, detecting down to 3.125 fg of genomic DNA per tube. Moreover, the analytical specificity and the practical application of the ET-MCDA were also successfully evaluated in this study. Here we provided the details on the novel ET-MCDA technique and expounded the basic ET-MCDA amplification mechanism.

  9. Multiplex, Rapid, and Sensitive Isothermal Detection of Nucleic-Acid Sequence by Endonuclease Restriction-Mediated Real-Time Multiple Cross Displacement Amplification.

    Science.gov (United States)

    Wang, Yi; Wang, Yan; Zhang, Lu; Liu, Dongxin; Luo, Lijuan; Li, Hua; Cao, Xiaolong; Liu, Kai; Xu, Jianguo; Ye, Changyun

    2016-01-01

    We have devised a novel isothermal amplification technology, termed endonuclease restriction-mediated real-time multiple cross displacement amplification (ET-MCDA), which facilitated multiplex, rapid, specific and sensitive detection of nucleic-acid sequences at a constant temperature. The ET-MCDA integrated multiple cross displacement amplification strategy, restriction endonuclease cleavage and real-time fluorescence detection technique. In the ET-MCDA system, the functional cross primer E-CP1 or E-CP2 was constructed by adding a short sequence at the 5' end of CP1 or CP2, respectively, and the new E-CP1 or E-CP2 primer was labeled at the 5' end with a fluorophore and in the middle with a dark quencher. The restriction endonuclease Nb.BsrDI specifically recognized the short sequence and digested the newly synthesized double-stranded terminal sequences (5' end short sequences and their complementary sequences), which released the quenching, resulting on a gain of fluorescence signal. Thus, the ET-MCDA allowed real-time detection of single or multiple targets in only a single reaction, and the positive results were observed in as short as 12 min, detecting down to 3.125 fg of genomic DNA per tube. Moreover, the analytical specificity and the practical application of the ET-MCDA were also successfully evaluated in this study. Here, we provided the details on the novel ET-MCDA technique and expounded the basic ET-MCDA amplification mechanism.

  10. Gene cloning and overexpression of two conjugated polyketone reductases, novel aldo-keto reductase family enzymes, of Candida parapsilosis.

    Science.gov (United States)

    Kataoka, M; Delacruz-Hidalgo, A-R G; Akond, M A; Sakuradani, E; Kita, K; Shimizu, S

    2004-04-01

    The genes encoding two conjugated polyketone reductases (CPR-C1, CPR-C2) of Candida parapsilosis IFO 0708 were cloned and sequenced. The genes encoded a total of 304 and 307 amino acid residues for CPR-C1 and CPR-C2, respectively. The deduced amino acid sequences of the two enzymes showed high similarity to each other and to several proteins of the aldo-keto reductase (AKR) superfamily. However, several amino acid residues in putative active sites of AKRs were not conserved in CPR-C1 and CPR-C2. The two CPR genes were overexpressed in Escherichia coli. The E. coli transformant bearing the CPR-C2 gene almost stoichiometrically reduced 30 mg ketopantoyl lactone/ml to D-pantoyl lactone.

  11. Effects of dietary vitamin B6 supplementation on fillet fatty acid composition and fatty acid metabolism of rainbow trout fed vegetable oil based diets.

    Science.gov (United States)

    Senadheera, Shyamalie D; Turchini, Giovanni M; Thanuthong, Thanongsak; Francis, David S

    2012-03-07

    Fish oil replacement in aquaculture feeds results in major modifications to the fatty acid makeup of cultured fish. Therefore, in vivo fatty acid biosynthesis has been a topic of considerable research interest. Evidence suggests that pyridoxine (vitamin B(6)) plays a role in fatty acid metabolism, and in particular, the biosynthesis of LC-PUFA has been demonstrated in mammals. However, there is little information on the effects of dietary pyridoxine availability in fish fed diets lacking LC-PUFA. This study demonstrates a relationship between dietary pyridoxine supplementation and fatty acid metabolism in rainbow trout. In particular, the dietary pyridoxine level was shown to modulate and positively stimulate the activity of the fatty acid elongase and Δ-6 and Δ-5 desaturase enzymes, deduced by the whole-body fatty acid balance method. This activity was insufficient to compensate for a diet lacking in LC-PUFA but does highlight potential strategies to maximize this activity in cultured fish, especially when fish oil is replaced with vegetable oils.

  12. Carbon isotope composition of intermediates of the starch-malate sequence and level of the crassulacean acid metabolism in leaves of Kalanchoe blossfeldiana Tom Thumb.

    Science.gov (United States)

    Deleens, E; Garnier-Dardart, J; Queiroz, O

    1979-09-01

    Isotype analyses were performed on biochemical fractions isolated from leaves of Kalanchoe blossfeldiana Tom Thumb. during aging under long days or short days. Irrespective of the age or photoperiodic conditions, the intermediates of the starch-malate sequence (starch, phosphorylated compounds and organic acids) have a level of (13)C higher than that of soluble sugars, cellulose and hemicellulose. In short days, the activity of the crassulacean acid metabolism pathway is predominant as compared to that of C3 pathway: leaves accumulate organic acids, rich in (13)C. In long days, the activity of the crassulacean acid metabolism pathway increases as the leaves age, remaining, however, relatively low as compared to that of C3 pathway: leaves accumulate soluble sugars, poor in (13)C. After photoperiodic change (long days→short days), isotopic modifications of starch and organic acids suggest evidence for a lag phase in the establishment of the crassulacean acid metabolism pathway specific to short days. The relative proportions of carbon from a C3-origin (RuBPC acitivity as strong discriminating step, isotope discrimination in vivo=20‰) or C4-origin (PEPC activity as weak discriminating step, isotope discrimination in vivo=4‰) present in the biochemical fractions were calculated from their δ(13)C values. Under long days, 30 to 70% versus 80 to 100% under short days, of the carbon of the intermediates linked to the starch-malate sequence, or CAM pathway (starch, phosphorylated compounds and organic acids), have a C4-origin. Products connected to the C3 pathway (free sugars, cellulose, hemicellulose) have 0 to 50% of their carbon, arising from reuptake of the C4 from malate, under long days versus 30 to 70% under short days.

  13. The regulatory network of cluster-root function and development in phosphate-deficient white lupin (Lupinus albus) identified by transcriptome sequencing.

    Science.gov (United States)

    Wang, Zhengrui; Straub, Daniel; Yang, Huaiyu; Kania, Angelika; Shen, Jianbo; Ludewig, Uwe; Neumann, Günter

    2014-07-01

    Lupinus albus serves as model plant for root-induced mobilization of sparingly soluble soil phosphates via the formation of cluster-roots (CRs) that mediate secretion of protons, citrate, phenolics and acid phosphatases (APases). This study employed next-generation sequencing to investigate the molecular mechanisms behind these complex adaptive responses at the transcriptome level. We compared different stages of CR development, including pre-emergent (PE), juvenile (JU) and the mature (MA) stages. The results confirmed that the primary metabolism underwent significant modifications during CR maturation, promoting the biosynthesis of organic acids, as had been deduced from physiological studies. Citrate catabolism was downregulated, associated with citrate accumulation in MA clusters. Upregulation of the phenylpropanoid pathway reflected the accumulation of phenolics. Specific transcript expression of ALMT and MATE transporter genes correlated with the exudation of citrate and flavonoids. The expression of transcripts related to nucleotide degradation and APases in MA clusters coincided with the re-mobilization and hydrolysis of organic phosphate resources. Most interestingly, hormone-related gene expression suggested a central role of ethylene during CR maturation. This was associated with the upregulation of the iron (Fe)-deficiency regulated network that mediates ethylene-induced expression of Fe-deficiency responses in other species. Finally, transcripts related to abscisic acid and jasmonic acid were upregulated in MA clusters, while auxin- and brassinosteroid-related genes and cytokinin receptors were most strongly expressed during CR initiation. Key regulations proposed by the RNA-seq data were confirmed by quantitative real-time polymerase chain reaction (RT-qPCR) and some physiological analyses. A model for the gene network regulating CR development and function is presented. © 2014 Scandinavian Plant Physiology Society.

  14. Characterization of a new (R)-hydroxynitrile lyase from the Japanese apricot Prunus mume and cDNA cloning and secretory expression of one of the isozymes in Pichia pastoris.

    Science.gov (United States)

    Fukuta, Yasuhisa; Nanda, Samik; Kato, Yasuo; Yurimoto, Hiroya; Sakai, Yasuyoshi; Komeda, Hidenobu; Asano, Yasuhisa

    2011-01-01

    PmHNL, a hydroxynitrile lyase from Japanese apricot ume (Prunus mume) seed was purified to homogeneity by ammonium sulfate fractionation and chromatographic steps. The purified enzyme was a monomer with molecular mass of 58 kDa. It was a flavoprotein similar to other hydroxynitrile lyases of the Rosaceae family. It was active over a broad temperature, and pH range. The N-terminal amino acid sequence (20 amino acids) was identical with that of the enzyme from almond (Prunus dulcis). Based on the N-terminal sequence of the purified enzyme and the conserved amino acid sequences of the enzymes from Pr. dulcis, inverse PCR method was used for cloning of a putative PmHNL (PmHNL2) gene from a Pr. mume seedling. Then the cDNA for the enzyme was cloned. The deduced amino acid sequence was found to be highly similar (95%) to that of an enzyme from Pr. serotina, isozyme 2. The recombinant Pichia pastoris transformed with the PmHNL2 gene secreted an active enzyme in glycosylated form.

  15. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

    Directory of Open Access Journals (Sweden)

    Leitner Dietmar

    2005-04-01

    Full Text Available Abstract Background A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.

  16. Information decomposition method to analyze symbolical sequences

    International Nuclear Information System (INIS)

    Korotkov, E.V.; Korotkova, M.A.; Kudryashov, N.A.

    2003-01-01

    The information decomposition (ID) method to analyze symbolical sequences is presented. This method allows us to reveal a latent periodicity of any symbolical sequence. The ID method is shown to have advantages in comparison with application of the Fourier transformation, the wavelet transform and the dynamic programming method to look for latent periodicity. Examples of the latent periods for poetic texts, DNA sequences and amino acids are presented. Possible origin of a latent periodicity for different symbolical sequences is discussed

  17. Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2

    International Nuclear Information System (INIS)

    Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.; Cohen, G.H.

    1984-01-01

    Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors caried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared out sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells

  18. Atmospheric CO2 variations over the last three glacial-interglacial climatic cycles deduced from the Dome Fuji deep ice core, Antarctica using a wet extraction technique

    International Nuclear Information System (INIS)

    Kawamura, Kenji; Nakazawa, Takakiyo; Aoki, Shuji

    2003-01-01

    A deep ice core drilled at Dome Fuji, East Antarctica was analyzed for the CO 2 concentration using a wet extraction method in order to reconstruct its atmospheric variations over the past 320 kyr, which includes three full glacial-interglacial climatic cycles, with a mean time resolution of about 1.1 kyr. The CO 2 concentration values derived for the past 65 kyr are very close to those obtained from other Antarctic ice cores using dry extraction methods, although the wet extraction method is generally thought to be inappropriate for the determination of the CO 2 concentration. The comparison between the CO 2 and Ca 2+ concentrations deduced from the Dome Fuji core suggests that calcium carbonate emitted from lands was mostly neutralized in the atmosphere before reaching the central part of Antarctica, or that only a small part of calcium carbonate was involved in CO 2 production during the wet extraction process. The CO 2 concentration for the past 320 kyr deduced from the Dome Fuji core varies between 190 and 300 ppmv, showing clear glacial-interglacial variations similar to the result of the Vostok ice core. However, for some periods, the concentration values of the Dome Fuji core are higher by up to 20 ppmv than those of the Vostok core. There is no clear indication that such differences are related to variations of chemical components of Ca 2+ , microparticle and acidity of the Dome Fuji core

  19. Molecular characterization of 45 kDa aspartic protease of Trichinella spiralis.

    Science.gov (United States)

    Park, Jong Nam; Park, Sang Kyun; Cho, Min Kyoung; Park, Mi-Kyung; Kang, Shin Ae; Kim, Dong-Hee; Yu, Hak Sun

    2012-12-21

    In a previous study, we identified an aspartic protease gene (Ts-Asp) from the Trichinella spiralis muscle stage larva cDNA library. The gene sequence of Ts-Asp was 1281 bp long and was found to encode a protein consisting of 405 amino acids, with a molecular mass of 45.248 kD and a pI of 5.95. The deduced Ts-Asp has a conserved catalytic motif with catalytic aspartic acid residues in the active site, a common characteristic of aspartic proteases. In addition, the deduced amino acid sequence of Ts-Asp was found to possess significant homology (above 50%) with aspartic proteases from nematode parasites. Results of phylogenetic analysis indicated a close relationship of Ts-Asp with cathepsin D aspartic proteases. For production of recombinant Ts-Asp (rTs-Asp), the pGEX4T expression system was used. Like other proteases, the purified rTs-Asp was able to digest collagen matrix in vitro. Abundant expression of Ts-Asp was observed in muscle stage larva. Ts-Asp was detected in ES proteins, and was able to elicit the production of specific antibodies. It is the first report of molecular characterization of aspartic protease isolated from T. spiralis. Copyright © 2012 Elsevier B.V. All rights reserved.

  20. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids.

    Science.gov (United States)

    Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

    2015-08-13

    Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of three different stages of fruit ripening (pre-climacteric, climacteric and post-climacteric) were also analyzed. The analysis of the RNAseqatlas presented here reveals strong differences in gene expression patterns between different organs, especially between root and flower, but also reveals similarities among the gene expression patterns in other organs, such as stem, leaves and aerial buds (vegetative organs) or seed and fruit (storage organs). Important regulators, functional categories, and differentially expressed genes involved in avocado fruit ripening were identified. Additionally, to demonstrate the utility of the avocado gene expression atlas, we investigated the expression patterns of genes implicated in fatty acid metabolism and fruit ripening. A description of transcriptomic changes occurring during fruit ripening was obtained in Mexican avocado, contributing to a dynamic view of the expression patterns of genes involved in fatty acid biosynthesis and the fruit ripening process.

  1. Systematic main sequence photometry of globular cluster stars for age determination

    International Nuclear Information System (INIS)

    Alcaino, G.; Liller, W.

    1984-01-01

    The individual photometric study of the coeval stars in globular clusters presents one of the best observational tests of the stellar evolution theory. Our own globular cluster system provides fundamental clues to the dynamical and chemical evolutionary history of the galaxy, and the study of their ages give a lower limit to the age of the galaxy as well as to that of the universe. The authors have undertaken a systematic research program, and discuss the ages deduced by fitting main sequence photometry to theoretical isochrones of six galactic globular clusters: M4, M22, M30, NGC 288, NGC 3201 and NGC 6397. (Auth.)

  2. Passive films and corrosion protection due to phosphonic acid inhibitors

    Energy Technology Data Exchange (ETDEWEB)

    Fang, J.L.; Liu, Q. (Nanjing Univ. (China)); Li, Y.; Wang, Z.W. (Nanjing Inst. of Chemical Tech. (China))

    1993-04-01

    For protecting mild steel from corrosion, aminotrimethylidenephosphonic acid (ATMP) was more effective than 1-hydroxyethylidene diphosphonic acid (HEDP), N.N-dimethylidenediphosphonic acid (EEDP), and ethylenediaminetetramethylidenephosphonic acid (EDTMP). A 20-min treatment in 1.0 mol/l of ATMP with a pH 0.23 at 45 C formed an anti-corrosive complex film that was composed of 48.4% O, 28.6% P, 7.0% Fe, 4.3% N, and 11.7% C, based on x-ray photoelectron spectroscopy and Auger electron spectroscopy. From differences in binding energies of Fe, N, and O, in the shift of C-N and P-O vibration, in the reflection FTIR spectra, and in the change of P-OH and Fe-N vibration before and after film formation, it was deduced that N and O in ATMP were coordinated with Fe[sub 2+] in the film.

  3. Engineering fatty acid biosynthesis in microalgae for sustainable biodiesel.

    Science.gov (United States)

    Blatti, Jillian L; Michaud, Jennifer; Burkart, Michael D

    2013-06-01

    Microalgae are a promising feedstock for biodiesel and other liquid fuels due to their fast growth rate, high lipid yields, and ability to grow in a broad range of environments. However, many microalgae achieve maximal lipid yields only under stress conditions hindering growth and providing compositions not ideal for biofuel applications. Metabolic engineering of algal fatty acid biosynthesis promises to create strains capable of economically producing fungible and sustainable biofuels. The algal fatty acid biosynthetic pathway has been deduced by homology to bacterial and plant systems, and much of our understanding is gleaned from basic studies in these systems. However, successful engineering of lipid metabolism in algae will necessitate a thorough characterization of the algal fatty acid synthase (FAS) including protein-protein interactions and regulation. This review describes recent efforts to engineer fatty acid biosynthesis toward optimizing microalgae as a biodiesel feedstock. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Genome Sequence of Lactobacillus plantarum Strain UCMA 3037

    OpenAIRE

    Naz, Saima; Tareb, Raouf; Bernardeau, Marion; Vaisse, Melissa; Lucchetti-Miganeh, Celine; Rechenmann, Mathias; Vernoux, Jean-Paul

    2013-01-01

    Nucleic acid of the strain Lactobacillus plantarum UCMA 3037, isolated from raw milk camembert cheese in our laboratory, was sequenced. We present its draft genome sequence with the aim of studying its functional properties and relationship to the cheese ecosystem.

  5. Genome Sequence of Lactobacillus plantarum Strain UCMA 3037.

    Science.gov (United States)

    Naz, Saima; Tareb, Raouf; Bernardeau, Marion; Vaisse, Melissa; Lucchetti-Miganeh, Celine; Rechenmann, Mathias; Vernoux, Jean-Paul

    2013-05-23

    Nucleic acid of the strain Lactobacillus plantarum UCMA 3037, isolated from raw milk camembert cheese in our laboratory, was sequenced. We present its draft genome sequence with the aim of studying its functional properties and relationship to the cheese ecosystem.

  6. Single-cell sequencing unveils the lifestyle and CRISPR-based population history of Hydrotalea sp. in acid mine drainage.

    Science.gov (United States)

    Medeiros, J D; Leite, L R; Pylro, V S; Oliveira, F S; Almeida, V M; Fernandes, G R; Salim, A C M; Araújo, F M G; Volpini, A C; Oliveira, G; Cuadros-Orellana, S

    2017-10-01

    Acid mine drainage (AMD) is characterized by an acid and metal-rich run-off that originates from mining systems. Despite having been studied for many decades, much remains unknown about the microbial community dynamics in AMD sites, especially during their early development, when the acidity is moderate. Here, we describe draft genome assemblies from single cells retrieved from an early-stage AMD sample. These cells belong to the genus Hydrotalea and are closely related to Hydrotalea flava. The phylogeny and average nucleotide identity analysis suggest that all single amplified genomes (SAGs) form two clades that may represent different strains. These cells have the genomic potential for denitrification, copper and other metal resistance. Two coexisting CRISPR-Cas loci were recovered across SAGs, and we observed heterogeneity in the population with regard to the spacer sequences, together with the loss of trailer-end spacers. Our results suggest that the genomes of Hydrotalea sp. strains studied here are adjusting to a quickly changing selective pressure at the microhabitat scale, and an important form of this selective pressure is infection by foreign DNA. © 2017 John Wiley & Sons Ltd.

  7. Molecular cloning and characterization of an acetylcholinesterase cDNA in the brown planthopper, Nilaparvata lugens.

    Science.gov (United States)

    Yang, Zhifan; Chen, Jun; Chen, Yongqin; Jiang, Sijing

    2010-01-01

    A full cDNA encoding an acetylcholinesterase (AChE, EC 3.1.1.7) was cloned and characterized from the brown planthopper, Nilaparvata lugens Stål (Hemiptera: Delphacidae). The complete cDNA (2467 bp) contains a 1938-bp open reading frame encoding 646 amino acid residues. The amino acid sequence of the AChE deduced from the cDNA consists of 30 residues for a putative signal peptide and 616 residues for the mature protein with a predicted molecular weight of 69,418. The three residues (Ser242, Glu371, and His485) that putatively form the catalytic triad and the six Cys that form intra-subunit disulfide bonds are completely conserved, and 10 out of the 14 aromatic residues lining the active site gorge of the AChE are also conserved. Northern blot analysis of poly(A)+ RNA showed an approximately 2.6-kb transcript, and Southern blot analysis revealed there likely was just a single copy of this gene in N. lugens. The deduced protein sequence is most similar to AChE of Nephotettix cincticeps with 83% amino acid identity. Phylogenetic analysis constructed with 45 AChEs from 30 species showed that the deduced N. lugens AChE formed a cluster with the other 8 insect AChE2s. Additionally, the hypervariable region and amino acids specific to insect AChE2 also existed in the AChE of N. lugens. The results revealed that the AChE cDNA cloned in this work belongs to insect AChE2 subgroup, which is orthologous to Drosophila AChE. Comparison of the AChEs between the susceptible and resistant strains revealed a point mutation, Gly185Ser, is likely responsible for the insensitivity of the AChE to methamidopho in the resistant strain.

  8. Chirality- and sequence-selective successive self-sorting via specific homo- and complementary-duplex formations

    Science.gov (United States)

    Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji

    2015-01-01

    Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291

  9. Molecular cloning of cDNAs of human liver and placenta NADH-cytochrome b5 reductase

    International Nuclear Information System (INIS)

    Yubisui, T.; Naitoh, Y.; Zenno, S.; Tamura, M.; Takeshita, M.; Sakaki, Y.

    1987-01-01

    A cDNA coding for human liver NADH-cytochrome b 5 reductase was cloned from a human liver cDNA library constructed in phage λgt11. The library was screened by using an affinity-purified rabbit antibody against NADH-cytochrome b 5 reductase of human erythrocytes. A cDNA about 1.3 kilobase pairs long was isolated. By using the cDNA as a probe, another cDNA (pb 5 R141) of 1817 base pairs was isolated that hybridized with a synthetic oligonucleotide encoding Pro-Asp-Ile-Lys-Tyr-Pro, derived from the amino acid sequence at the amino-terminal region of the enzyme from human erythrocytes. Furthermore, by using the pb 5 R141 as a probe, cDNA clones having more 5' sequence were isolated from a human placenta cDNA library. The amino acid sequences deduced from the nucleotide sequences of these cDNA clones overlapped each other and consisted of a sequence that completely coincides with that of human erythrocytes and a sequence of 19 amino acid residues extended at the amino-terminal side. The latter sequence closely resembles that of the membrane-binding domain of steer liver microsomal enzyme

  10. Identification of single amino acid substitutions (SAAS) in neuraminidase from influenza a virus (H1N1) via mass spectrometry analysis coupled with de novo peptide sequencing.

    Science.gov (United States)

    Peng, Qisheng; Wang, Zijian; Wu, Donglin; Li, Xiaoou; Liu, Xiaofeng; Sun, Wanchun; Liu, Ning

    2016-08-01

    Amino acid substitutions in the neuraminidase of the influenza virus are the main cause of the emergence of resistance to zanamivir or oseltamivir during seasonal influenza treatment; they are the result of non-synonymous mutations in the viral genome that can be successfully detected by polymer chain reaction (PCR)-based approaches. There is always an urgent need to detect variation in amino acid sequences directly at the protein level. Mass spectrometry coupled with de novo sequencing has been explored as an alternative and straightforward strategy for detecting amino acid substitutions, as well - this approach is the primary focus of the present study. Influenza virus (A/Puerto Rico/8/1934 H1N1) propagated in embryonated chicken eggs was purified by ultracentrifugation, followed by PNGase F treatment. The deglycosylated virion was lysed and separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The gel band corresponding to neuraminidase was picked up and subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three amino acid substitutions: R346K, S349 N, and S370I/L, in the neuraminidase from the influenza virus (A/Puerto Rico/8/1934 H1N1), which were located in three mutated peptides of the neuraminidase: YGNGVWIGK, TKNHSSR, and PNGWTETDI/LK, respectively. We found that the amino acid substitutions in the proteins of RNA viruses (including influenza A virus) resulting from non-synonymous gene mutations can indeed be directly analyzed via mass spectrometry, and that manual interpretation of the MS/MS data may be beneficial. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  11. Bm86 midgut protein sequence variation in South Texas cattle fever ticks

    Directory of Open Access Journals (Sweden)

    Kammlah Diane M

    2010-11-01

    Full Text Available Abstract Background Cattle fever ticks, Rhipicephalus (Boophilus microplus and R. (B. annulatus, vector bovine and equine babesiosis, and have significantly expanded beyond the permanent quarantine zone established in South Texas. Currently, there are no vaccines approved for use within the United States for controlling these vectors. Vaccines developed in Australia and Cuba based on the midgut antigen Bm86 have variable efficacy against cattle fever ticks. A possible explanation for this variation in vaccine efficacy is amino acid sequence divergence between the recombinant Bm86 vaccine component and native Bm86 expressed in ticks from different geographical regions of the world. Results There was 91.8% amino acid sequence identity in Bm86 among R. microplus and R. annulatus sequenced from South Texas infestations. When South Texas isolates were compared to the Australian Yeerongpilly and Cuban Camcord vaccine strains, there was 89.8% and 90.0% identity, respectively. Most of the sequence divergence was focused in one region of the protein, amino acids 206-298. Hydrophilicity profiles revealed that two short regions of Bm86 (amino acids 206-210 and 560-570 appear to be more hydrophilic in South Texas isolates compared to vaccine strains. Only one amino acid difference was found between South Texas and vaccine strains within two previously described B-cell epitopes. A total of 4 amino acid differences were observed within three peptides previously shown to induce protective immune responses in cattle. Conclusions Sequence differences between South Texas isolates and Yeerongpilly and Camcord strains are spread throughout the entire Bm86 sequence, suggesting that geographic variation does exist. Differences within previously described B-cell epitopes between South Texas isolates and vaccine strains are minimal; however, short regions of hydrophilic amino acids found unique to South Texas isolates suggest that additional unique surface exposed

  12. Biological sequence analysis: probabilistic models of proteins and nucleic acids

    National Research Council Canada - National Science Library

    Durbin, Richard

    1998-01-01

    ... analysis methods are now based on principles of probabilistic modelling. Examples of such methods include the use of probabilistically derived score matrices to determine the significance of sequence alignments, the use of hidden Markov models as the basis for profile searches to identify distant members of sequence families, and the inference...

  13. Computational analysis of sequence selection mechanisms.

    Science.gov (United States)

    Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

    2004-04-01

    Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.

  14. The myoglobin of Emperor penguin (Aptenodytes forsteri): amino acid sequence and functional adaptation to extreme conditions.

    Science.gov (United States)

    Tamburrini, M; Romano, M; Giardina, B; di Prisco, G

    1999-02-01

    In the framework of a study on molecular adaptations of the oxygen-transport and storage systems to extreme conditions in Antarctic marine organisms, we have investigated the structure/function relationship in Emperor penguin (Aptenodytes forsteri) myoglobin, in search of correlation with the bird life style. In contrast with previous reports, the revised amino acid sequence contains one additional residue and 15 differences. The oxygen-binding parameters seem well adapted to the diving behaviour of the penguin and to the environmental conditions of the Antarctic habitat. Addition of lactate has no major effect on myoglobin oxygenation over a large temperature range. Therefore, metabolic acidosis does not impair myoglobin function under conditions of prolonged physical effort, such as diving.

  15. Identification of metal ion binding sites based on amino acid sequences.

    Science.gov (United States)

    Cao, Xiaoyong; Hu, Xiuzhen; Zhang, Xiaojin; Gao, Sujuan; Ding, Changjiang; Feng, Yonge; Bao, Weihua

    2017-01-01

    The identification of metal ion binding sites is important for protein function annotation and the design of new drug molecules. This study presents an effective method of analyzing and identifying the binding residues of metal ions based solely on sequence information. Ten metal ions were extracted from the BioLip database: Zn2+, Cu2+, Fe2+, Fe3+, Ca2+, Mg2+, Mn2+, Na+, K+ and Co2+. The analysis showed that Zn2+, Cu2+, Fe2+, Fe3+, and Co2+ were sensitive to the conservation of amino acids at binding sites, and promising results can be achieved using the Position Weight Scoring Matrix algorithm, with an accuracy of over 79.9% and a Matthews correlation coefficient of over 0.6. The binding sites of other metals can also be accurately identified using the Support Vector Machine algorithm with multifeature parameters as input. In addition, we found that Ca2+ was insensitive to hydrophobicity and hydrophilicity information and Mn2+ was insensitive to polarization charge information. An online server was constructed based on the framework of the proposed method and is freely available at http://60.31.198.140:8081/metal/HomePage/HomePage.html.

  16. Profiling of wheat class III peroxidase genes derived from powdery mildew-attacked epidermis reveals distinct sequence-associated expression patterns.

    Science.gov (United States)

    Liu, Guosheng; Sheng, Xiaoyan; Greenshields, David L; Ogieglo, Adam; Kaminskyj, Susan; Selvaraj, Gopalan; Wei, Yangdou

    2005-07-01

    A cDNA library was constructed from leaf epidermis of diploid wheat (Triticum monococcum) infected with the powdery mildew fungus (Blumeria graminis f. sp. tritici) and was screened for genes encoding peroxidases. From 2,500 expressed sequence tags (ESTs), 36 cDNAs representing 10 peroxidase genes (designated TmPRX1 to TmPRX10) were isolated and further characterized. Alignment of the deduced amino acid sequences and phylogenetic clustering with peroxidases from other plant species demonstrated that these peroxidases fall into four distinct groups. Differential expression and tissue-specific localization among the members were observed during the B. graminis f. sp. tritici attack using Northern blots and reverse-transcriptase polymerase chain reaction analyses. Consistent with its abundance in the EST collection, TmPRX1 expression showed the highest induction during pathogen attack and fluctuated in response to the fungal parasitic stages. TmPRX1 to TmPRX6 were expressed predominantly in mesophyll cells, whereas TmPRX7 to TmPRX10, which feature a putative C-terminal propeptide, were detectable mainly in epidermal cells. Using TmPRX8 as a representative, we demonstrated that its C-terminal propeptide was sufficient to target a green fluorescent protein fusion protein to the vacuoles in onion cells. Finally, differential expression profiles of the TmPRXs after abiotic stresses and signal molecule treatments were used to dissect the potential role of these peroxidases in multiple stress and defense pathways.

  17. Nucleic acid drugs: a novel approach

    African Journals Online (AJOL)

    Administrator

    Nucleic acid base sequence of proteins plays a crucial role in the expression of gene. The gene is responsible for the synthesis of proteins and these proteins, which are synthesized, are responsible for the biological process and also for dreadful diseases as well. Once if the nucleic acid sequence is altered, we would be ...

  18. Method of Identifying a Base in a Nucleic Acid

    Science.gov (United States)

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    1999-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  19. Deducing the kinetics of protein synthesis in vivo from the transition rates measured in vitro.

    Directory of Open Access Journals (Sweden)

    Sophia Rudorf

    2014-10-01

    Full Text Available The molecular machinery of life relies on complex multistep processes that involve numerous individual transitions, such as molecular association and dissociation steps, chemical reactions, and mechanical movements. The corresponding transition rates can be typically measured in vitro but not in vivo. Here, we develop a general method to deduce the in-vivo rates from their in-vitro values. The method has two basic components. First, we introduce the kinetic distance, a new concept by which we can quantitatively compare the kinetics of a multistep process in different environments. The kinetic distance depends logarithmically on the transition rates and can be interpreted in terms of the underlying free energy barriers. Second, we minimize the kinetic distance between the in-vitro and the in-vivo process, imposing the constraint that the deduced rates reproduce a known global property such as the overall in-vivo speed. In order to demonstrate the predictive power of our method, we apply it to protein synthesis by ribosomes, a key process of gene expression. We describe the latter process by a codon-specific Markov model with three reaction pathways, corresponding to the initial binding of cognate, near-cognate, and non-cognate tRNA, for which we determine all individual transition rates in vitro. We then predict the in-vivo rates by the constrained minimization procedure and validate these rates by three independent sets of in-vivo data, obtained for codon-dependent translation speeds, codon-specific translation dynamics, and missense error frequencies. In all cases, we find good agreement between theory and experiment without adjusting any fit parameter. The deduced in-vivo rates lead to smaller error frequencies than the known in-vitro rates, primarily by an improved initial selection of tRNA. The method introduced here is relatively simple from a computational point of view and can be applied to any biomolecular process, for which we have

  20. Three grape CBF/DREB1 genes respond to low temperature, drought and abscisic acid.

    Science.gov (United States)

    Xiao, Huogen; Siddiqua, Mahbuba; Braybrook, Siobhan; Nassuth, Annette

    2006-07-01

    The C-repeat (CRT)-binding factor/dehydration-responsive element (DRE) binding protein 1 (CBF/ DREB1) transcription factors control an important pathway for increased freezing and drought tolerance in plants. Three CBF/DREB1-like genes, CBF 1-3, were isolated from both freezing-tolerant wild grape (Vitis riparia) and freezing-sensitive cultivated grape (Vitis vinifera). The deduced proteins in V. riparia are 63-70% identical to each other and 96-98% identical to the corresponding proteins in V. vinifera. All Vitis CBF proteins are 42-51% identical to AtCBF1 and contain CBF-specific amino acid motifs, supporting their identification as CBF proteins. Grape CBF sequences are unique in that they contain 20-29 additional amino acids and three serine stretches. Agro-infiltration experiments revealed that VrCBF1b localizes to the nucleus. VrCBF1a, VrCBF1b and VvCBF1 activated a green fluorescent protein (GFP) or glucuronidase (GUS) reporter gene behind CRT-containing promoters. Expression of the endogenous CBF genes was low at ambient temperature and enhanced upon low temperature (4 degrees C) treatment, first for CBF1, followed by CBF2, and about 2 d later by CBF3. No obvious significant difference was observed between V. riparia and V. vinifera genes. The expression levels of all three CBF genes were higher in young tissues than in older tissues. CBF1, 2 and 3 transcripts also accumulated in response to drought and exogenous abscisic acid (ABA) treatment, indicating that grape contains unique CBF genes.

  1. Amino acid sequence and posttranslational modifications of human factor VIIa from plasma and transfected baby hamster kidney cells

    International Nuclear Information System (INIS)

    Thim, L.; Bjoern, S.; Christensen, M.; Nicolaisen, E.M.; Lund-Hansen, T.; Pedersen, A.H.; Hedner, U.

    1988-01-01

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VII a , participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca 2+ and tissue factor. Three types of potential posttranslational modifications exist in the human factor VII a molecule, namely, 10 γ-carboxylated, N-terminally located glutamic acid residues, 1 β-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VII a as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VII a . By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VII a was found to be identical with human factor VII a . Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VII a . In the recombinant factor VII a , asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VII a and human plasma factor VII a . These results show that factor VII a as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VII a and that this cell line thus might represent an alternative source for human factor VII a

  2. Mathematical model of gluconic acid fermentation by Aspergillus niger

    Energy Technology Data Exchange (ETDEWEB)

    Takamatsu, T.; Shioya, S.; Furuya, T.

    1981-11-01

    A mathematical model for the study of gluconic acid fermentation by Aspergillus niger has been developed. The model has been deduced from the basic biological concept of multicellular filamentous microorganisms, i.e. cell population balance. It can be used to explain the behaviour of both batch and continuous cultures, even when in a lag phase. A new characteristic, involving the existence of dual equilibrium stages during fermentation, has been predicted using this mathematical model. (Refs. 6).

  3. Identification of microRNAs actively involved in fatty acid biosynthesis in developing Brassica napus seeds using high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Jia Wang

    2016-10-01

    Full Text Available Seed development has a critical role during the spermatophyte life cycle. In Brassica napus, a major oil crop, fatty acids are synthesized and stored in specific tissues during embryogenesis, and understanding the molecular mechanism underlying fatty acid biosynthesis during seed development is an important research goal. In this study, we constructed three small RNA libraries from early seeds at 14, 21 and 28 days after flowering (DAF and used high-throughput sequencing to examine microRNA (miRNA expression. A total of 85 known miRNAs from 30 families and 1,160 novel miRNAs were identified, of which 24, including 5 known and 19 novel miRNAs, were found to be involved in fatty acid biosynthesis. bna-miR156b, bna-miR156c, bna-miR156g, novel_mir_1706, novel_mir_1407, novel_mir_173, and novel_mir_104 were significantly down-regulated at 21 DAF and 28 DAF, whereas bna-miR159, novel_mir_1081, novel_mir_19 and novel_mir_555 were significantly up-regulated. In addition, we found that some miRNAs regulate functional genes that are directly involved in fatty acid biosynthesis and that other miRNAs regulate the process of fatty acid biosynthesis by acting on a large number of transcription factors. The miRNAs and their corresponding predicted targets were partially validated by quantitative RT-PCR. Our data suggest that diverse and complex miRNAs are involved in the seed development process and that miRNAs play important roles in fatty acid biosynthesis during seed development.

  4. On the influence of neutral turbulence on ambipolar diffusivities deduced from meteor trail expansion

    Directory of Open Access Journals (Sweden)

    C. M. Hall

    2002-11-01

    Full Text Available By measuring fading times of radar echoes from underdense meteor trails, it is possible to deduce the ambipolar diffusivities of the ions responsible for these radar echoes. It could be anticipated that these diffusivities increase monotonically with height akin to neutral viscosity. In practice, this is not always the case. Here, we investigate the capability of neutral turbulence to affect the meteor trail diffusion rate.Key words. Meteorology and atmospheric dynamics (middle atmosphere dynamics; turbulence

  5. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones

    Directory of Open Access Journals (Sweden)

    Martins Juliana R

    2010-03-01

    Full Text Available Abstract Background Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. Results The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110 diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp, a target of juvenile hormone (JH. The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Conclusions Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body

  6. The four hexamerin genes in the honey bee: structure, molecular evolution and function deduced from expression patterns in queens, workers and drones

    Science.gov (United States)

    2010-01-01

    Background Hexamerins are hemocyanin-derived proteins that have lost the ability to bind copper ions and transport oxygen; instead, they became storage proteins. The current study aimed to broaden our knowledge on the hexamerin genes found in the honey bee genome by exploring their structural characteristics, expression profiles, evolution, and functions in the life cycle of workers, drones and queens. Results The hexamerin genes of the honey bee (hex 70a, hex 70b, hex 70c and hex 110) diverge considerably in structure, so that the overall amino acid identity shared among their deduced protein subunits varies from 30 to 42%. Bioinformatics search for motifs in the respective upstream control regions (UCRs) revealed six overrepresented motifs including a potential binding site for Ultraspiracle (Usp), a target of juvenile hormone (JH). The expression of these genes was induced by topical application of JH on worker larvae. The four genes are highly transcribed by the larval fat body, although with significant differences in transcript levels, but only hex 110 and hex 70a are re-induced in the adult fat body in a caste- and sex-specific fashion, workers showing the highest expression. Transcripts for hex 110, hex 70a and hex70b were detected in developing ovaries and testes, and hex 110 was highly transcribed in the ovaries of egg-laying queens. A phylogenetic analysis revealed that HEX 110 is located at the most basal position among the holometabola hexamerins, and like HEX 70a and HEX 70c, it shares potential orthology relationship with hexamerins from other hymenopteran species. Conclusions Striking differences were found in the structure and developmental expression of the four hexamerin genes in the honey bee. The presence of a potential binding site for Usp in the respective 5' UCRs, and the results of experiments on JH level manipulation in vivo support the hypothesis of regulation by JH. Transcript levels and patterns in the fat body and gonads suggest that

  7. Comparative analysis of the prion protein gene sequences in African lion.

    Science.gov (United States)

    Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

    2006-10-01

    The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.

  8. Structure and Sequence Search on Aptamer-Protein Docking

    Science.gov (United States)

    Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

    2015-03-01

    Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.

  9. Sequence of a cDNA encoding turtle high mobility group 1 protein.

    Science.gov (United States)

    Zheng, Jifang; Hu, Bi; Wu, Duansheng

    2005-07-01

    In order to understand sequence information about turtle HMG1 gene, a cDNA encoding HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, and was cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF codifies 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with homologues sequences of chicken (96.5%) and mammalian (74%) than homologues sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of HMG1 cDNA sequence in any reptilian.

  10. Peptide Nucleic Acids

    DEFF Research Database (Denmark)

    2004-01-01

    A novel class of compounds known as peptide nucleic acids, bind complementary DNA and RNA strands, and generally do so more strongly than the corresponding DNA or RNA strands while exhibiting increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from...

  11. The Biomolecule Sequencer Project: Nanopore Sequencing as a Dual-Use Tool for Crew Health and Astrobiology Investigations

    Science.gov (United States)

    John, K. K.; Botkin, D. S.; Burton, A. S.; Castro-Wallace, S. L.; Chaput, J. D.; Dworkin, J. P.; Lehman, N.; Lupisella, M. L.; Mason, C. E.; Smith, D. J.; hide

    2016-01-01

    Human missions to Mars will fundamentally transform how the planet is explored, enabling new scientific discoveries through more sophisticated sample acquisition and processing than can currently be implemented in robotic exploration. The presence of humans also poses new challenges, including ensuring astronaut safety and health and monitoring contamination. Because the capability to transfer materials to Earth will be extremely limited, there is a strong need for in situ diagnostic capabilities. Nucleotide sequencing is a particularly powerful tool because it can be used to: (1) mitigate microbial risks to crew by allowing identification of microbes in water, in air, and on surfaces; (2) identify optimal treatment strategies for infections that arise in crew members; and (3) track how crew members, microbes, and mission-relevant organisms (e.g., farmed plants) respond to conditions on Mars through transcriptomic and genomic changes. Sequencing would also offer benefits for science investigations occurring on the surface of Mars by permitting identification of Earth-derived contamination in samples. If Mars contains indigenous life, and that life is based on nucleic acids or other closely related molecules, sequencing would serve as a critical tool for the characterization of those molecules. Therefore, spaceflight-compatible nucleic acid sequencing would be an important capability for both crew health and astrobiology exploration. Advances in sequencing technology on Earth have been driven largely by needs for higher throughput and read accuracy. Although some reduction in size has been achieved, nearly all commercially available sequencers are not compatible with spaceflight due to size, power, and operational requirements. Exceptions are nanopore-based sequencers that measure changes in current caused by DNA passing through pores; these devices are inherently much smaller and require significantly less power than sequencers using other detection methods

  12. Spreadsheet macros for coloring sequence alignments.

    Science.gov (United States)

    Haygood, M G

    1993-12-01

    This article describes a set of Microsoft Excel macros designed to color amino acid and nucleotide sequence alignments for review and preparation of visual aids. The colored alignments can then be modified to emphasize features of interest. Procedures for importing and coloring sequences are described. The macro file adds a new menu to the menu bar containing sequence-related commands to enable users unfamiliar with Excel to use the macros more readily. The macros were designed for use with Macintosh computers but will also run with the DOS version of Excel.

  13. Cloning and Characterization of an Endoglucanase Gene from sp. Korean Native Goat 40

    Directory of Open Access Journals (Sweden)

    Sung Chan Kim

    2016-01-01

    Full Text Available A gene from Actinomyces sp. Korean native goat (KNG 40 that encodes an endo-β-1,4-glucanase, EG1, was cloned and expressed in Escherichia coli (E. coli DH5α. Recombinant plasmid DNA from a positive clone with a 3.2 kb insert hydrolyzing carboxyl methyl-cellulose (CMC was designated as pDS3. The entire nucleotide sequence was determined, and an open-reading frame (ORF was deduced. The ORF encodes a polypeptide of 684 amino acids. The recombinant EG1 produced in E. coli DH5α harboring pDS3 was purified in one step using affinity chromatography on crystalline cellulose and characterized. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis/zymogram analysis of the purified enzyme revealed two protein bands of 57.1 and 54.1 kDa. The amino terminal sequences of these two bands matched those of the deduced ones, starting from residue 166 and 208, respectively. Putative signal sequences, a Shine–Dalgarno-type ribosomal binding site, and promoter sequences related to the consensus sequences were deduced. EG1 has a typical tripartite structure of cellulase, a catalytic domain, a serine-rich linker region, and a cellulose-binding domain. The optimal temperature for the activity of the purified enzyme was 55°C, but it retained over 90% of maximum activity in a broad temperature range (40°C to 60°C. The optimal pH for the enzyme activity was 6.0. Kinetic parameters, Km and Vmax of rEG1 were 0.39% CMC and 143 U/mg, respectively.

  14. Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS): from manual to automatic application of a 'de novo sequencing' approach.

    Science.gov (United States)

    Pascale, Raffaella; Grossi, Gerarda; Cruciani, Gabriele; Mecca, Giansalvatore; Santoro, Donatello; Sarli Calace, Renzo; Falabella, Patrizia; Bianco, Giuliana

    Sequence protein identification by a randomized sequence database and transcriptome mass spectrometry software package has been developed at the University of Basilicata in Potenza (Italy) and designed to facilitate the determination of the amino acid sequence of a peptide as well as an unequivocal identification of proteins in a high-throughput manner with enormous advantages of time, economical resource and expertise. The software package is a valid tool for the automation of a de novo sequencing approach, overcoming the main limits and a versatile platform useful in the proteomic field for an unequivocal identification of proteins, starting from tandem mass spectrometry data. The strength of this software is that it is a user-friendly and non-statistical approach, so protein identification can be considered unambiguous.

  15. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    Science.gov (United States)

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using set of 0.8L SBRs using gelatinized PPW at a solids content range from 30 to 50 g L(-1), solids retention time of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. The complete genome sequence of a virus associated with cotton blue disease, cotton leafroll dwarf virus, confirms that it is a new member of the genus Polerovirus.

    Science.gov (United States)

    Distéfano, Ana J; Bonacic Kresic, Ivan; Hopp, H Esteban

    2010-11-01

    Cotton blue disease is the most important virus disease of cotton in the southern part of America. The complete nucleotide sequence of the ssRNA genome of the cotton blue disease-associated virus was determined for the first time. It comprised 5,866 nucleotides, and the deduced genomic organization resembled that of members of the genus Polerovirus. Sequence homology comparison and phylogenetic analysis confirm that this virus (previous proposed name cotton leafroll dwarf virus) is a member of a new species within the genus Polerovirus.

  17. Genomic sequences of murine gamma B- and gamma C-crystallin-encoding genes: promoter analysis and complete evolutionary pattern of mouse, rat and human gamma-crystallins.

    Science.gov (United States)

    Graw, J; Liebstein, A; Pietrowski, D; Schmitt-John, T; Werner, T

    1993-12-22

    The murine genes, gamma B-cry and gamma C-cry, encoding the gamma B- and gamma C-crystallins, were isolated from a genomic DNA library. The complete nucleotide (nt) sequences of both genes were determined from 661 and 711 bp, respectively, upstream from the first exon to the corresponding polyadenylation sites, comprising more than 2650 and 2890 bp, respectively. The new sequences were compared to the partial cDNA sequences available for the murine gamma B-cry and gamma C-cry, as well as to the corresponding genomic sequences from rat and man, at both the nt and predicted amino acid (aa) sequence levels. In the gamma B-cry promoter region, a canonical CCAAT-box, a TATA-box, putative NF-I and C/EBP sites were detected. An R-repeat is inserted 366 bp upstream from the transcription start point. In contrast, the gamma C-cry promoter does not contain a CCAAT-box, but some other putative binding sites for transcription factors (AP-2, UBP-1, LBP-1) were located by computer analysis. The promoter regions of all six gamma-cry from mouse, rat and human, except human psi gamma F-cry, were analyzed for common sequence elements. A complex sequence element of about 70-80 bp was found in the proximal promoter, which contains a gamma-cry-specific and almost invariant sequence (crygpel) of 14 nt, and ends with the also invariant TATA-box. Within the complex sequence element, a minimum of three further features specific for the gamma A-, gamma B- and gamma D/E/F-cry genes can be defined, at least two of which were recently shown to be functional. In addition to these four sequence elements, a subtype-specific structure of inverted repeats with different-sized spacers can be deduced from the multiple sequence alignment. A phylogenetic analysis based on the promoter region, as well as the complete exon 3 of all gamma-cry from mouse, rat and man, suggests separation of only five gamma-cry subtypes (gamma A-, gamma B-, gamma C-, gamma D- and gamma E/F-cry) prior to species separation.

  18. Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

    KAUST Repository

    Sugumar, Thennarasu

    2011-12-12

    Full-length cDNA (435 bp) of the interleukin-3(IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine IL-3. There are 10 amino acid substitutions in buffalo compared with that of bovine. The amino acid sequence of buffalo IL-3 also showed very high identity with that of other ruminants, indicating functional cross-reactivity. Structural homology modelling of buffalo IL-3 protein with human IL-3 showed the presence of five helical structures.

  19. Isolation and structure of a cDNA encoding the B1 (CD20) cell-surface antigen of human B lymphocytes

    International Nuclear Information System (INIS)

    Tender, T.F.; Streuli, M.; Schlossman, S.F.; Saito, H.

    1988-01-01

    The B1 (CD20) molecule is a M/sub r/ 33,000 phosphoprotein on the surface of human B lymphocytes that may serve a central role in the homoral immune response by regulating B-cell proliferation and differentiation. In this report, a cDNA clone that encodes the B1 molecule was isolated and the amino acid sequence of B1 was determined. B-cell-specific cDNA clones were selected from a human tonsillar cDNA library by differential hybridization with labeled cDNA derived from either size-fractionated B-cell mRNA or size-fractionated T-cell mRNA. Of the 261 cDNA clones isolated, 3 cross-hybridizing cDNA clones were chosen as potential candidates for encoding B1 based on their selective hybridization to RNA from B1-positive cell lines. The longest clone, pB1-21, contained a 2.8-kilobase insert with an 891-base-pair open reading frame that encodes a protein of 33 kDa. mRNA synthesized from the pB1-21 cDNA clone in vitro was translated into a protein of the same apparent molecular weight as B1. Limited proteinase digestion of the pB1-21 translation product and B1 generated peptides of the same sizes, indicating that the pB1-21 cDNA encodes the B1 molecule. Gel blot analysis indicated that pB1-21 hybridized with two mRNA species of 2.8 and 3.4 kilobases only in B1-positive cell lines. The amino acid sequence deduced from the pB1-21 nucleotide sequence apparently lacks a signal sequence and contains three extensive hydrophobic regions. The deduced B1 amino acid sequence shows no significant homology with other known patients

  20. Screening and identification of mimotope of gastric cancer associated antigen MGb1-Ag

    Science.gov (United States)

    Han, Zhe-Yi; Wu, Kai-Chun; He, Feng-Tian; Han, Quan-Li; Nie, Yong-Zhan; Han, Ying; Liu, Xiao-Nan; Zheng, Jian-Yong; Xu, Mei-Hong; Lin, Tao; Fan, Dai-Ming

    2003-01-01

    AIM: Using a monoclonal antibody against gastric cancer antigen named MGb1 to screen a phage-displayed random peptide library fused with coat protein pIII in order to get some information on mimotopes. METHODS: Through affinity enrichment and ELISA screening, positive clones of phages were amplified. 10 phage clones were selected after three rounds of biopanning and the ability of specific binding of the positive phage clones to MGb1-Ab were detected by ELISA assay (DNA sequencing was performed and the amino acid sequences were deduced) By blocking test, specificity of the mimic phage epitopes was identified. RESULTS: There were approximately 200 times of enrichment about the titer of bound phages after three rounds of biopanning procedures. DNA of 10 phage clones after the third biopanning was assayed and the result showed that the positive clones had a specific binding activity to MGb1-Ab and a weak ability of binding to control mAb or to mouse IgG. DNA sequencing of 10 phage clones was performed and the amino acid sequences were deduced. According to the homology of the amino acid sequences of the displayed peptides, most of the phage clones had motifs of H(x)Q or L(x)S. And these 10 phage clones could also partly inhibit the binding of MGb1-Ab to gastric cancer cell KATO-III. The percentage of blocking was from (21.0 ± 1.6)% to (39.0 ± 2.7)%. CONCLUSION: Motifs of H(x)Q and L(x)S selected and identified show a high homology in the mimic epitopes of gastric cancer associated antigen. There may be one or more clones which can act as candidates of tumor vaccines. PMID:12970876

  1. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

    Directory of Open Access Journals (Sweden)

    Hayashizaki Yoshihide

    2009-06-01

    Full Text Available Abstract Background Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals. Results As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection. Conclusion We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the

  2. On the influence of neutral turbulence on ambipolar diffusivities deduced from meteor trail expansion

    Directory of Open Access Journals (Sweden)

    C. M. Hall

    Full Text Available By measuring fading times of radar echoes from underdense meteor trails, it is possible to deduce the ambipolar diffusivities of the ions responsible for these radar echoes. It could be anticipated that these diffusivities increase monotonically with height akin to neutral viscosity. In practice, this is not always the case. Here, we investigate the capability of neutral turbulence to affect the meteor trail diffusion rate.

    Key words. Meteorology and atmospheric dynamics (middle atmosphere dynamics; turbulence

  3. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-?-d-Glutamic Acid Anthrax Capsule

    OpenAIRE

    Stabler, Richard A.; Negus, David; Pain, Arnab; Taylor, Peter W.

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-?-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  4. Protein-Protein Interactions Prediction Using a Novel Local Conjoint Triad Descriptor of Amino Acid Sequences

    Directory of Open Access Journals (Sweden)

    Jun Wang

    2017-11-01

    Full Text Available Protein-protein interactions (PPIs play crucial roles in almost all cellular processes. Although a large amount of PPIs have been verified by high-throughput techniques in the past decades, currently known PPIs pairs are still far from complete. Furthermore, the wet-lab experiments based techniques for detecting PPIs are time-consuming and expensive. Hence, it is urgent and essential to develop automatic computational methods to efficiently and accurately predict PPIs. In this paper, a sequence-based approach called DNN-LCTD is developed by combining deep neural networks (DNNs and a novel local conjoint triad description (LCTD feature representation. LCTD incorporates the advantage of local description and conjoint triad, thus, it is capable to account for the interactions between residues in both continuous and discontinuous regions of amino acid sequences. DNNs can not only learn suitable features from the data by themselves, but also learn and discover hierarchical representations of data. When performing on the PPIs data of Saccharomyces cerevisiae, DNN-LCTD achieves superior performance with accuracy as 93.12%, precision as 93.75%, sensitivity as 93.83%, area under the receiver operating characteristic curve (AUC as 97.92%, and it only needs 718 s. These results indicate DNN-LCTD is very promising for predicting PPIs. DNN-LCTD can be a useful supplementary tool for future proteomics study.

  5. 阴道毛滴虫Rab11鸟苷三磷酸酶cDNA克隆和序列分析%Molecular Cloning and Sequence Analysis of Rab11 GTPase in Trichomonas vaginalis

    Institute of Scientific and Technical Information of China (English)

    张仁利; 许铭炎; 许锦阶; 高世同; 黄达娜; 耿艺介; 傅玉才

    2006-01-01

    Objective Rab11 GTPases play an essential role in regulating membrane trafficking pathways in eukaryotic cells. Nonetheless, there has been little work done on characterizing the transport machinery of Trichomonas. The aim of this study is to clone and characterize a Rab11 gene of Trichomonas vaginalis.Methods A cDNA expression library was constructed with T. vaginalis total RNA. A cDNA clone, which showed a high degree of homology with Rab proteins of different species, was isolated and sequenced. Sequence analysis was performed using BLASTP, RPS-BLAST and ClustalW programs. The genomic DNA corresponding to the cDNA sequence was amplified using PCR techniques and following by sequencing. Results cDNA with a length of 710 base pairs and an open reading frame of 636 bp was obtained. The deduced amino acid sequence from the open reading frame was found to possess 211 residuals. Sequence analysis demonstrated that this cDNA clone was homologous to the Rab11 subfamily of different species (60% identity and 79% similarity with Arabidopsis thaliana Rab11c, 58% identity and 78% similarity with human Rab11b), and that the amino acid sequence contains all the well known conserved sequence elements of Rab family. Specific Rab motifs were also detected in the deduced amino acid sequence. Phylogenetic analysis showed that its closest homologues are Rab11 proteins from other species. Sequencing of the PCR product of genomic DNA revealed that the genomic DNA sequence encompassing the putative 5'-ATG and 3'-stop codon is identical to the cDNA sequence.Conclusion A cDNA clone corresponding to the T. vaginalis Rab11 gene was obtained.The function of this gene in regulating membrane trafficking pathways of the parasitic protist is still under investigation.%目的近年研究表明Rab11

  6. Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

    Science.gov (United States)

    Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

    2002-11-01

    The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.

  7. The Saccharomyces cerevisiae RAD18 gene encodes a protein that contains potential zinc finger domains for nucleic acid binding and a putative nucleotide binding sequence

    Energy Technology Data Exchange (ETDEWEB)

    Jones, J.S.; Prakash, L. (Univ. of Rochester School of Medicine, NY (USA)); Weber, S. (Kodak Research Park, Rochester, NY (USA))

    1988-07-25

    The RAD18 gene of Saccharomyces cerevisiae is required for postreplication repair of UV damaged DNA. The authors have isolated the RAD18 gene, determined its nucleotide sequence and examined if deletion mutations of this gene show different or more pronounced phenotypic effects than the previously described point mutations. The RAD18 gene open reading frame encodes a protein of 487 amino acids, with a calculated molecular weight of 55,512. The RAD18 protein contains three potential zinc finger domains for nucleic acid binding, and a putative nucleotide binding sequence that is present in many proteins that bind and hydrolyze ATP. The DNA binding and nucleotide binding activities could enable the RAD18 protein to bind damaged sites in the template DNA with high affinity. Alternatively, or in addition, RAD18 protein may be a transcriptional regulator. The RAD18 deletion mutation resembles the previously described point mutations in its effects on viability, DNA repair, UV mutagenesis, and sporulation.

  8. Antiviral activity of a serine protease from the digestive juice of Bombyx mori larvae against nucleopolyhedrovirus

    International Nuclear Information System (INIS)

    Nakazawa, Hiroshi; Tsuneishi, Eiko; Ponnuvel, Kangayam M.; Furukawa, Seiichi; Asaoka, Ai; Tanaka, Hiromitsu; Ishibashi, Jun; Yamakawa, Minoru

    2004-01-01

    A protein showing strong antiviral activity against Bombyx mori nucleopolyhedrovirus (BmNPV) was purified from the digestive juice of B. mori larvae. The molecular mass of this protein was 24 271 Da. Partial N-terminal amino acid sequence of the protein was determined and cDNA was cloned based on the amino acid sequence. A homology search of the deduced amino acid sequence of the cDNA showed 94% identity with B. mori serine protease so the protein was designated B. mori serine protease-2 (BmSP-2). Analysis of BmSP-2 gene expression showed that this gene is expressed in the midgut but not in other tissues. In addition, BmSP-2 gene was shown to not be expressed in the molting and wandering stages, indicating that the gene is hormonally regulated. Our results suggest that BmSP-2, an insect digestive enzyme, can be a potential antiviral factor against BmNPV at the initial site of viral infection

  9. Probe kit for identifying a base in a nucleic acid

    Science.gov (United States)

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  10. Thermodynamics of sequence-specific binding of PNA to DNA

    DEFF Research Database (Denmark)

    Ratilainen, T; Holmén, A; Tuite, E

    2000-01-01

    For further characterization of the hybridization properties of peptide nucleic acids (PNAs), the thermodynamics of hybridization of mixed sequence PNA-DNA duplexes have been studied. We have characterized the binding of PNA to DNA in terms of binding affinity (perfectly matched duplexes) and seq......For further characterization of the hybridization properties of peptide nucleic acids (PNAs), the thermodynamics of hybridization of mixed sequence PNA-DNA duplexes have been studied. We have characterized the binding of PNA to DNA in terms of binding affinity (perfectly matched duplexes...

  11. Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

    Science.gov (United States)

    Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

    1997-12-01

    A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.

  12. Direct quantification of human cytomegalovirus immediate-early and late mRNA levels in blood of lung transplant recipients by competitive nucleic acid sequence-based amplification

    NARCIS (Netherlands)

    Greijer, AE; Verschuuren, EAM; Harmsen, MC; Dekkers, CAJ; Adriaanse, HMA; The, TH; Middeldorp, JM

    The dynamics of active human cytomegalovirus (HCMV) infection was monitored by competitive nucleic acid sequence-based amplification (NASBA) assays for quantification of IE1 (UL123) and pp67 (UL65) mRNA expression levels In the blood of patients after lung transplantation. RNA was isolated from 339

  13. Lead/acid batteries for photovoltaic applications. Test results and modelling

    Energy Technology Data Exchange (ETDEWEB)

    Copetti, J B [CIEMAT, Inst. de Energias Renovables, Madrid (Spain); Chenlo, F [CIEMAT, Inst. de Energias Renovables, Madrid (Spain)

    1994-01-01

    This work presents the results of experiments carried out on lead/acid batteries during charge and discharge processes at different currents and temperatures, selected to a cover a large range of operating conditions, including those encountered in photovoltaic (PV) system applications. The results allow us to verify the relations among the battery external parameters (voltage, current, state-of-charge and temperature), the behaviour of the internal resistance, and to deduce a model that represents the discharge and charge processes, including the overcharge. Finally, normalized equations with respect to the battery capacity are proposed, which allow us to fix the values of parameters and hence the model is valid for any type and size of lead/acid battery. (orig.)

  14. Enhanced anti-HIV-1 activity of G-quadruplexes comprising locked nucleic acids and intercalating nucleic acids

    DEFF Research Database (Denmark)

    Pedersen, Erik Bjerregaard; Nielsen, Jakob Toudahl; Nielsen, Claus

    2011-01-01

    Two G-quadruplex forming sequences, 50-TGGGAG and the 17-mer sequence T30177, which exhibit anti-HIV-1 activity on cell lines, were modified using either locked nucleic acids (LNA) or via insertions of (R)-1-O-(pyren-1-ylmethyl)glycerol (intercalating nucleic acid, INA) or (R)-1-O-[4-(1......-pyrenylethynyl)phenylmethyl]glycerol (twisted intercalating nucleic acid, TINA). Incorporation of LNA or INA/TINA monomers provide as much as 8-fold improvement of anti-HIV-1 activity. We demonstrate for the first time a detailed analysis of the effect the incorporation of INA/TINA monomers in quadruplex forming...

  15. Procedures of amino acid sequencing of peptides in natural proteins collection of knowledge and intelligence for construction of reliable chemical inference system

    OpenAIRE

    Kudo, Yoshihiro; Kanaya, Shigehiko

    1994-01-01

    In order to establish a reliable chemical inference system on amino acid sequencing of natural peptides, as various kinds of relevant knowledge and intelligence as possible are collected. Topics are on didemnins, dolastatin 3, TL-119 and/or A-3302-B, mycosubtilin, patellamide A, duramycin (and cinnamycin), bottoromycin A 2, A19009, galantin I, vancomycin, stenothricin, calf speleen profilin, neocarzinostatin, pancreatic spasmolytic polypeptide, cerebratulus toxin B-IV, RNAase U 2, ferredoxin ...

  16. Effect of amino acid sequence and pH on nanofiber formation of self-assembling peptides EAK16-II and EAK16-IV.

    Science.gov (United States)

    Hong, Yooseong; Legge, Raymond L; Zhang, S; Chen, P

    2003-01-01

    Atomic force microscopy (AFM) and axisymmetric drop shape analysis-profile (ASDA-P) were used to investigate the mechanism of self-assembly of peptides. The peptides chosen consisted of 16 alternating hydrophobic and hydrophilic amino acids, where the hydrophilic residues possess alternating negative and positive charges. Two types of peptides, AEAEAKAKAEAEAKAK (EAK16-II) and AEAEAEAEAKAKAKAK (EAK16-IV), were investigated in terms of nanostructure formation through self-assembly. The experimental results, which focused on the effects of the amino acid sequence and pH, show that the nanostructures formed by the peptides are dependent on the amino acid sequence and the pH of the solution. For pH conditions around neutrality, one of the peptides used in this study, EAK16-IV, forms globular assemblies and has lower surface tension at air-water interfaces than another peptide, EAK16-II, which forms fibrillar assemblies at the same pH. When the pH is lowered below 6.5 or raised above 7.5, there is a transition from globular to fibrillar structures for EAK16-IV, but EAK16-II does not show any structural transition. Surface tension measurements using ADSA-P showed different surface activities of peptides at air-water interfaces. EAK16-II does not show a significant difference in surface tension for the pH range between 4 and 9. However, EAK16-IV shows a noticeable decrease in surface tension at pH around neutrality, indicating that the formation of globular assemblies is related to the molecular hydrophobicity.

  17. Comparison of complete genome sequences of dog rabies viruses isolated from China and Mexico reveals key amino acid changes that may be associated with virus replication and virulence.

    Science.gov (United States)

    Yu, Fulai; Zhang, Guoqing; Zhong, Xiangfu; Han, Na; Song, Yunfeng; Zhao, Ling; Cui, Min; Rayner, Simon; Fu, Zhen F

    2014-07-01

    Rabies is a global problem, but its impact and prevalence vary across different regions. In some areas, such as parts of Africa and Asia, the virus is prevalent in the domestic dog population, leading to epidemic waves and large numbers of human fatalities. In other regions, such as the Americas, the virus predominates in wildlife and bat populations, with sporadic spillover into domestic animals. In this work, we attempted to investigate whether these distinct environments led to selective pressures that result in measurable changes within the genome at the amino acid level. To this end, we collected and sequenced the full genome of two isolates from divergent environments. The first isolate (DRV-AH08) was from China, where the virus is present in the dog population and the country is experiencing a serious epidemic. The second isolate (DRV-Mexico) was taken from Mexico, where the virus is present in both wildlife and domestic dog populations, but at low levels as a consequence of an effective vaccination program. We then combined and compared these with other full genome sequences to identify distinct amino acid changes that might be associated with environment. Phylogenetic analysis identified strain DRV-AH08 as belonging to the China-I lineage, which has emerged to become the dominant lineage in the current epidemic. The Mexico strain was placed in the D11 Mexico lineage, associated with the West USA-Mexico border clade. Amino acid sequence analysis identified only 17 amino acid differences in the N, G and L proteins. These differences may be associated with virus replication and virulence-for example, the short incubation period observed in the current epidemic in China.

  18. Whole-Genome Sequence Analysis of Bombella intestini LMG 28161T, a Novel Acetic Acid Bacterium Isolated from the Crop of a Red-Tailed Bumble Bee, Bombus lapidarius.

    Directory of Open Access Journals (Sweden)

    Leilei Li

    Full Text Available The whole-genome sequence of Bombella intestini LMG 28161T, an endosymbiotic acetic acid bacterium (AAB occurring in bumble bees, was determined to investigate the molecular mechanisms underlying its metabolic capabilities. The draft genome sequence of B. intestini LMG 28161T was 2.02 Mb. Metabolic carbohydrate pathways were in agreement with the metabolite analyses of fermentation experiments and revealed its oxidative capacity towards sucrose, D-glucose, D-fructose and D-mannitol, but not ethanol and glycerol. The results of the fermentation experiments also demonstrated that the lack of effective aeration in small-scale carbohydrate consumption experiments may be responsible for the lack of reproducibility of such results in taxonomic studies of AAB. Finally, compared to the genome sequences of its nearest phylogenetic neighbor and of three other insect associated AAB strains, the B. intestini LMG 28161T genome lost 69 orthologs and included 89 unique genes. Although many of the latter were hypothetical they also included several type IV secretion system proteins, amino acid transporter/permeases and membrane proteins which might play a role in the interaction with the bumble bee host.

  19. Molecular cloning and expression of bovine kappa-casein in Escherichia coli

    International Nuclear Information System (INIS)

    Kang, Y.C.; Richardson, T.

    1988-01-01

    A cDNA library was constructed using poly(A) + RNA from bovine mammary gland. This cDNA library of 6000 clones was screened employing colony hybridization using 32 P-labelled oligonucleotide probes and restriction endonuclease mapping. The cDNA from the selected plasmid, pKR76, was sequenced using the dideoxy-chain termination method. The cDNA insert of pKR76 carries the full-length sequence, which codes for mature kappa-casein protein. The amino acid sequence deduced from the cDNA sequence fits the published amino acid sequence with three exceptions; the reported pyroglutamic acid at position 1, tyrosine at position 35, and aspartic acid at position 81 are, respectively, a glutamine, a histidine, and an asparagine in the clone containing pKR76. The MspI-, NlaIV-cleaved fragment (630 base pair) from the kappa-casein cDNA insert has been subcloned into expression vectors pUC18 and pKK233-2, which contain a lac promoter and a trc promoter, respectively. Escherichia coli cells carrying the recombinant expression plasmids were shown to produce kappa-casein protein having the expected mobility on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and being recognized by specific antibodies raised against natural bovine kappa-casein

  20. Identification of Meconopsis species by a DNA barcode sequence ...

    African Journals Online (AJOL)

    Deoxyribonucleic acid (DNA) barcoding is a novel technology that uses a standard DNA sequence to facilitate species identification. Species identification is necessary for the authentication of traditional plant based medicines. Although a consensus has not been agreed regarding which DNA sequences can be used as ...

  1. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-01-01

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH 2 -terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length

  2. Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

    International Nuclear Information System (INIS)

    Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki; Ohashi, Yuko; Kano-Murakami, Yuriko; Ozeki, Yoshihiro

    1989-01-01

    A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M r of its subunit was 77,000. The cells converted [ 14 C]-L-phenylalanine into [ 14 C]-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading frame capable of coding for a polypeptide with 707 amino acids (M r 77,137), a 22-bp 5'-noncoding region and a 207-bp 3'-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology

  3. A Novel Bifunctional Amino Acid Racemase With Multiple Substrate Specificity, MalY From Lactobacillus sakei LT-13: Genome-Based Identification and Enzymological Characterization

    Directory of Open Access Journals (Sweden)

    Shiro Kato

    2018-03-01

    Full Text Available The Lactobacillus sakei strain LK-145 isolated from Moto, a starter of sake, produces potentially large amounts of three D-amino acids, D-Ala, D-Glu, and D-Asp, in a medium containing amylase-digested rice as a carbon source. The comparison of metabolic pathways deduced from the complete genome sequence of strain LK-145 to the type culture strain of Lactobacillus sakei strain LT-13 showed that the L- and D-amino acid metabolic pathways are similar between the two strains. However, a marked difference was observed in the putative cysteine/methionine metabolic pathways of strain LK-145 and LT-13. The cystathionine β-lyase homolog gene malY was annotated only in the genome of strain LT-13. Cystathionine β-lyase is an important enzyme in the cysteine/methionine metabolic pathway that catalyzes the conversion of L-cystathionine into L-homocysteine. In addition to malY, most genome-sequenced strains of L. sakei including LT-13 lacked the homologous genes encoding other putative enzymes in this pathway. Accordingly, the cysteine/methionine metabolic pathway likely does not function well in almost all strains of L. sakei. We succeeded in cloning and expressing the malY gene from strain LT-13 (Ls-malY in the cells of Escherichia coli BL21 (DE3 and characterized the enzymological properties of Ls-MalY. Spectral analysis of purified Ls-MalY showed that Ls-MalY contained a pyridoxal 5′-phosphate (PLP as a cofactor, and this observation agreed well with the prediction based on its primary structure. Ls-MalY showed amino acid racemase activity and cystathionine β-lyase activity. Ls-MalY showed amino acid racemase activities in various amino acids, such as Ala, Arg, Asn, Glu, Gln, His, Leu, Lys, Met, Ser, Thr, Trp, and Val. Mutational analysis revealed that the -amino group of Lys233 in the primary structure of Ls-MalY likely bound to PLP, and Lys233 was an essential residue for Ls-MalY to catalyze both the amino acid racemase and β-lyase reactions. In

  4. Comparison of hydrogen isotope exchange reactions between HTO vapor and the sodium salts of o-, m-, and p-aminobenzoic acid

    International Nuclear Information System (INIS)

    Okada, Minoru; Imaizumi, Hiroshi; Itoh, Tomoko

    1991-01-01

    Hydrogen isotope exchange reaction between HTO vapor and one of the sodium salts of o-, m-, and p-aminobenzoic acid (solid) was observed at 50 ∼ 80 degC. The acidity (acidity based on kinetic logic) for the materials at each temperature has been obtained with the A''-McKay plots based on the respective data obtained. The followings have been clarified by comparing these acidities (and the acidities obtained previously). 1) The acidity of aromatic amines can be expressed in terms of the acidity based on kinetic logic. 2) The reactivity of aromatic amine is strongly affected by both I-effect and R-effect. 3) It can be deduced that aromatic amines are more reactive than aliphatic amines. (author)

  5. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly- -D-Glutamic Acid Anthrax Capsule

    KAUST Repository

    Stabler, R. A.

    2013-01-24

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  6. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule.

    Science.gov (United States)

    Stabler, Richard A; Negus, David; Pain, Arnab; Taylor, Peter W

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  7. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly- -D-Glutamic Acid Anthrax Capsule

    KAUST Repository

    Stabler, R. A.; Negus, D.; Pain, Arnab; Taylor, P. W.

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  8. Structure of the horseradish peroxidase isozyme C genes.

    Science.gov (United States)

    Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H

    1988-05-02

    We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HPR C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clone, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C terminal. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded the same size of peptide (308 amino acid residues) if these are processed in the same way, and the amino acid sequence were homologous to each other by 91-94%. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.

  9. Gene Cloning, Expression and Activity Analysis of Manganese Superoxide Dismutase from Two Strains of Gracilaria lemaneiformis (Gracilariaceae, Rhodophyta under Heat Stress

    Directory of Open Access Journals (Sweden)

    Lu Zhang

    2012-04-01

    Full Text Available Manganese superoxide dismutase (Mn-SOD plays a crucial role in antioxidant responses to environmental stress. To determine whether Mn-SOD affects heat resistance of Gracilaria lemaneiformis, we cloned Mn-SOD cDNA sequences of two strains of this red alga, wild type and cultivar 981. Both cDNA sequences contained an ORF of 675 bp encoding 224 amino acid residues. The cDNA sequences and the deduced amino acid sequences of the two strains shared relatively high identity (more than 99%. No intron existed in genomic DNA of Mn-SOD in G. lemaneiformis. Southern blotting indicated that there were multiple copies, possibly four, of Mn-SOD in both strains. Both in the wild type and cultivar 981, SOD mRNA transcription and SOD activity increased under high temperature stress, while cultivar 981 was more heat resistant based on its SOD activity. This research suggests that there may be a direct relationship between SOD activity and the heat resistance of G. lemaneiformis.

  10. [Taxonomic status of the Tyulek virus (TLKV) (Orthomyxoviridae, Quaranjavirus, Quaranfil group) isolated from the ticks Argas vulgaris Filippova, 1961 (Argasidae) from the birds burrow nest biotopes in the Kyrgyzstan].

    Science.gov (United States)

    L'vov, D K; Al'khovskiĭ, S V; Shchelkanov, M Iu; Shchetinin, A M; Deriabin, P G; Aristova, V A; Gitel'man, A K; Samokhvalov, E I; Botikov, A G

    2014-01-01

    The Tyulek virus (TLKV) was isolated from the ticks Argas vulgaris Filippova, 1961 (Argasidae), collected from the burrow biotopes in multispecies birds colony in the Aksu river floodplain near Tyulek village (northern part of Chu Valley, Kyrgyzstan). Recently, the TLKV was assigned to the Quaranfil group (including the Quaranfil virus (QRFV), Johnston Atoll virus (JAV), Lake Chad virus) that is a novel genus of the Quaranjavirus in the Orthomyxoviridae family. In his work, the complete genome (ID GenBank KJ438647-8) sequence of the TLKV was determined using next-generation sequencing (Illumina platform). Comparison of deduced amino acid sequences shows closed relationship of the TLKV with QRFV and JAV (86% and 84% identity for PB1 and about 70% for PB2 and PA, respectively). The identity level of the TLKV and QRFV in outer glycoprotein GP is 72% and 80% for nucleotide and amino acid sequences, respectively. The phylogenetic analysis showed that the TLKV belongs to the genus of the Quaranjavirus in the family Orthomyxoviridae.

  11. Isolation and characterization of the gene encoding the starch debranching enzyme limit dextrinase from germinating barley

    DEFF Research Database (Denmark)

    Kristensen, Michael; Lok, Finn; Planchot, Véronique

    1999-01-01

    with a value of 105 kDa estimated by SDS;;PAGE, The coding sequence is interrupted by 26 introns varying in length from 93 bp to 825 bp. The 27 exons vary in length from 53 bp to 197 bp. Southern blot analysis shows that the limit dextrinase gene is present as a single copy in the barley genome. Gene......The gene encoding the starch debranching enzyme limit dextrinase, LD, from barley (Hordeum vulgare), was isolated from a genomic phage library using a barley cDNA clone as probe. The gene encodes a protein of 904 amino acid residues with a calculated molecular mass of 98.6 kDa. This is in agreement...... expression is high during germination and the steady state transcription level reaches a maximum at day 5 of germination. The deduced amino acid sequence corresponds to the protein sequence of limit dextrinase purified from germinating malt, as determined by automated N-terminal sequencing of tryptic...

  12. Ursodeoxycholic Acid Suppresses Lipogenesis in Mouse Liver: Possible Role of the Decrease in β-Muricholic Acid, a Farnesoid X Receptor Antagonist.

    Science.gov (United States)

    Fujita, Kyosuke; Iguchi, Yusuke; Une, Mizuho; Watanabe, Shiro

    2017-04-01

    The farnesoid X receptor (FXR) is a major nuclear receptor of bile acids; its activation suppresses sterol regulatory element-binding protein 1c (SREBP1c)-mediated lipogenesis and decreases the lipid contents in the liver. There are many reports showing that the administration of ursodeoxycholic acid (UDCA) suppresses lipogenesis and reduces the lipid contents in the liver of experimental animals. Since UDCA is not recognized as an FXR agonist, these effects of UDCA cannot be readily explained by its direct activation of FXR. We observed that the dietary administration of UDCA in mice decreased the expression levels of SREBP1c and its target lipogenic genes. Alpha- and β-muricholic acids (MCA) and cholic acid (CA) were the major bile acids in the mouse liver but their contents decreased upon UDCA administration. The hepatic contents of chenodeoxycholic acid and deoxycholic acid (DCA) were relatively low but were not changed by UDCA. UDCA did not show FXR agonistic or antagonistic potency in in vitro FXR transactivation assay. Taking these together, we deduced that the above-mentioned change in hepatic bile acid composition induced upon UDCA administration might cause the relative increase in the FXR activity in the liver, mainly by the reduction in the content of β-MCA, a farnesoid X receptor antagonist, which suggests a mechanism by which UDCA suppresses lipogenesis and decreases the lipid contents in the mouse liver.

  13. Cloning, sequence analysis, expression of Cyathus bulleri laccase in Pichia pastoris and characterization of recombinant laccase.

    Science.gov (United States)

    Garg, Neha; Bieler, Nora; Kenzom, Tenzin; Chhabra, Meenu; Ansorge-Schumacher, Marion; Mishra, Saroj

    2012-10-23

    Laccases are blue multi-copper oxidases and catalyze the oxidation of phenolic and non-phenolic compounds. There is considerable interest in using these enzymes for dye degradation as well as for synthesis of aromatic compounds. Laccases are produced at relatively low levels and, sometimes, as isozymes in the native fungi. The investigation of properties of individual enzymes therefore becomes difficult. The goal of this study was to over-produce a previously reported laccase from Cyathus bulleri using the well-established expression system of Pichia pastoris and examine and compare the properties of the recombinant enzyme with that of the native laccase. In this study, complete cDNA encoding laccase (Lac) from white rot fungus Cyathus bulleri was amplified by RACE-PCR, cloned and expressed in the culture supernatant of Pichia pastoris under the control of the alcohol oxidase (AOX)1 promoter. The coding region consisted of 1,542 bp and encodes a protein of 513 amino acids with a signal peptide of 16 amino acids. The deduced amino acid sequence of the matured protein displayed high homology with laccases from Trametes versicolor and Coprinus cinereus. The sequence analysis indicated the presence of Glu 460 and Ser 113 and LEL tripeptide at the position known to influence redox potential of laccases placing this enzyme as a high redox enzyme. Addition of copper sulfate to the production medium enhanced the level of laccase by about 12-fold to a final activity of 7200 U L-1. The recombinant laccase (rLac) was purified by ~4-fold to a specific activity of ~85 U mg(-1) protein. A detailed study of thermostability, chloride and solvent tolerance of the rLac indicated improvement in the first two properties when compared to the native laccase (nLac). Altered glycosylation pattern, identified by peptide mass finger printing, was proposed to contribute to altered properties of the rLac. Laccase of C. bulleri was successfully produced extra-cellularly to a high level of 7200

  14. Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

    Science.gov (United States)

    Benyo, B; Biro, J C; Benyo, Z

    2004-01-01

    The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.

  15. Cloning and characterization of a cell cycle-regulated gene encoding topoisomerase I from Nicotiana tabacum that is inducible by light, low temperature and abscisic acid.

    Science.gov (United States)

    Mudgil, Y; Singh, B N; Upadhyaya, K C; Sopory, S K; Reddy, M K

    2002-05-01

    We have cloned a full-length 2874-bp cDNA coding for tobacco topoisomerase I, with an ORF of 2559 bp encoding a protein of 852 amino acids with a calculated molecular mass of 95 kDa and an estimated pI of 9.51. The deduced amino acid sequence shows homology to other eukaryotic topoisomerases I. Tobacco topoisomerase I was over-expressed in Escherichia coli, and the purified recombinant protein was found to relax both positively and negatively super-coiled DNA in the absence of the divalent cation Mg(2+)and ATP. These characteristic features indicate that the tobacco enzyme is a type I topoisomerase. The recombinant protein could be phosphorylated at (a) threonine residue(s) by protein kinase C. However, phosphorylation did not cause any change in its enzymatic activity. The genomic organization of the topoisomerase I gene revealed the presence of 8 exons and 7 introns in the region corresponding to the ORF and one intron in the 3' UTR region. Transcript analysis using RT-PCR showed basal constitutive expression in all organs examined, and the gene was expressed at all stages of the cell cycle--but the level of expression increased during the G1-S phase. The transcript level also increased following exposure to light, low-temperature stress and abscisic acid, a stress hormone.

  16. A branch-heterogeneous model of protein evolution for efficient inference of ancestral sequences.

    Science.gov (United States)

    Groussin, M; Boussau, B; Gouy, M

    2013-07-01

    Most models of nucleotide or amino acid substitution used in phylogenetic studies assume that the evolutionary process has been homogeneous across lineages and that composition of nucleotides or amino acids has remained the same throughout the tree. These oversimplified assumptions are refuted by the observation that compositional variability characterizes extant biological sequences. Branch-heterogeneous models of protein evolution that account for compositional variability have been developed, but are not yet in common use because of the large number of parameters required, leading to high computational costs and potential overparameterization. Here, we present a new branch-nonhomogeneous and nonstationary model of protein evolution that captures more accurately the high complexity of sequence evolution. This model, henceforth called Correspondence and likelihood analysis (COaLA), makes use of a correspondence analysis to reduce the number of parameters to be optimized through maximum likelihood, focusing on most of the compositional variation observed in the data. The model was thoroughly tested on both simulated and biological data sets to show its high performance in terms of data fitting and CPU time. COaLA efficiently estimates ancestral amino acid frequencies and sequences, making it relevant for studies aiming at reconstructing and resurrecting ancestral amino acid sequences. Finally, we applied COaLA on a concatenate of universal amino acid sequences to confirm previous results obtained with a nonhomogeneous Bayesian model regarding the early pattern of adaptation to optimal growth temperature, supporting the mesophilic nature of the Last Universal Common Ancestor.

  17. 3D representations of amino acids—applications to protein sequence comparison and classification

    Directory of Open Access Journals (Sweden)

    Jie Li

    2014-08-01

    Full Text Available The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the representation of such a protein sequence facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D space derived from amino acid substitution matrices provides an adequate representation that can be used for direct comparison of protein sequences based on geometry. We measure the performance of such a representation in the context of the protein structural fold prediction problem. We compare the results of classifying different sets of proteins belonging to distinct structural folds against classifications of the same proteins obtained from sequence alone or directly from structural information. We find that sequence alone performs poorly as a structure classifier. We show in contrast that the use of the three dimensional representation of the sequences significantly improves the classification accuracy. We conclude with a discussion of the current limitations of such a representation and with a description of potential improvements.

  18. FASH: A web application for nucleotides sequence search

    Directory of Open Access Journals (Sweden)

    Chew Paul

    2008-05-01

    Full Text Available Abstract FASH (Fourier Alignment Sequence Heuristics is a web application, based on the Fast Fourier Transform, for finding remote homologs within a long nucleic acid sequence. Given a query sequence and a long text-sequence (e.g, the human genome, FASH detects subsequences within the text that are remotely-similar to the query. FASH offers an alternative approach to Blast/Fasta for querying long RNA/DNA sequences. FASH differs from these other approaches in that it does not depend on the existence of contiguous seed-sequences in its initial detection phase. The FASH web server is user friendly and very easy to operate. Availability FASH can be accessed at https://fash.bgu.ac.il:8443/fash/default.jsp (secured website

  19. Identification of a cDNA encoding a parathyroid hormone-like peptide from a human tumor associated with humoral hypercalcemia of malignancy

    International Nuclear Information System (INIS)

    Mangin, M.; Webb, A.C.; Dreyer, B.E.

    1988-01-01

    Humoral hypercalcemia of malignancy is a common paraneoplastic syndrome that appears to be mediated in many instances by a parathyroid hormone-like peptide. Poly(A) + RNA from a human renal carcinoma associated with this syndrome was enriched by preparative electrophoresis and used to construct an enriched cDNA library in phage λgt10. The library was screened with a codon-preference oligonucleotide synthesized on the basis of a partial N-terminal amino acid sequence from a human tumor-derived peptide, and a 2.0 kilo-base cDNA was identified. The cDNA encodes a 177 amino acid protein consisting of a 36 amino acid leader sequence and a 141 amino acid mature peptide. The first 13 amino acids of the deduced sequence of the mature peptide display strong homology to human PTH, with complete divergence thereafter. RNA blot-hybridization analysis revealed multiple transcripts in mRNA from tumors associated with the humor syndrome and also in mRNA from normal human keratinocytes. Southern blot analysis of genomic DNA from humans and rodents revealed a simple pattern compatible with a single-copy gene. The gene has been mapped to chromosome 12

  20. Open questions in origin of life : Experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach

    NARCIS (Netherlands)

    Adamala, K.; Anella, F.M.; Wieczorek, R.; Stano, P.; Chiarabelli, C.; Luisi, P.L.

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the