WorldWideScience

Sample records for acid sequence based

  1. Chip-based sequencing nucleic acids

    Science.gov (United States)

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  2. Nucleic acid sequence-based amplification with oligochromatography for detection of Trypanosoma brucei in clinical samples

    NARCIS (Netherlands)

    C.M. Mugasa; T. Laurent; G.J. Schoone; P.A. Kager; G.W. Lubega; H.D.F.H. Schallig

    2009-01-01

    Molecular tools, such as real-time nucleic acid sequence-based amplification (NASBA) and PCR, have been developed to detect Trypanosoma brucei parasites in blood for the diagnosis of human African trypanosomiasis (HAT). Despite good sensitivity, these techniques are not implemented in HAT control pr

  3. Repetitive sequence based polymerase chain reaction to differentiate close bacteria strains in acidic sites

    Institute of Scientific and Technical Information of China (English)

    XIE Ming; YIN Hua-qun; LIU Yi; LIU Jie; LIU Xue-duan

    2008-01-01

    To study the diversity of bacteria strains newly isolated from several acid mine drainage(AMD) sites in China,repetitive sequence based polymerase chain reaction (rep-PCR),a well established technology for diversity analysis of closely related bacteria strains,was conducted on 30 strains of bacteria Leptospirillum ferriphilium,8 strains of bacteria Acidithiobacillus ferrooxidans,as well as the Acidithiobacillus ferrooxidans type strain ATCC (American Type Culture Collection) 23270.The results showed that,using ERIC and BOX primer sets,rep-PCR produced highly discriminatory banding patterns.Phylogenetic analysis based on ERIC-PCR banding types was made and the results indicated that rep-PCR could be used as a rapid and highly discriminatory screening technique in studying bacterial diversity,especially in differentiating bacteria within one species in AMD.

  4. Real-Time Nucleic Acid Sequence-Based Amplification Using Molecular Beacons for Detection of Enterovirus RNA in Clinical Specimens

    OpenAIRE

    Landry, Marie L.; Garner, Robin; Ferguson, David

    2005-01-01

    Real-time nucleic acid sequence-based amplification (NASBA) using molecular beacon technology (NASBA-beacon) was compared to standard NASBA with postamplification hybridization using electrochemiluminescently labeled probes (NASBA-ECL) for detection of enteroviruses (EV) in 133 cerebrospinal fluid and 27 stool samples. NASBA-ECL and NASBA-beacon were similar in sensitivity, detecting 55 (100%) and 52 (94.5%) EV-positive samples, respectively. There were no false positives. Both NASBA assays w...

  5. Construction Strategy for an Internal Amplification Control for Real-Time Diagnostic Assays Using Nucleic Acid Sequence-Based Amplification: Development and Clinical Application

    OpenAIRE

    Rodríguez-Lázaro, David; D'Agostino, Martin; Pla, Maria; Cook, Nigel

    2005-01-01

    An important analytical control in molecular amplification-based methods is an internal amplification control (IAC), which should be included in each reaction mixture. An IAC is a nontarget nucleic acid sequence which is coamplified simultaneously with the target sequence. With negative results for the target nucleic acid, the absence of an IAC signal indicates that amplification has failed. A general strategy for the construction of an IAC for inclusion in molecular beacon-based real-time nu...

  6. The structural analysis of protein sequences based on the quasi-amino acids code

    Institute of Scientific and Technical Information of China (English)

    Zhu Ping; Tang Xu-Qing; Xu Zhen-Yuan

    2009-01-01

    Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project, it comes the postgenome era when the proteomics technology is emerging. This paper studies protein molecule from the algebraic point of view. The algebraic system (∑, +, *) is introduced, where ∑ is the set of 64 codons. According to the characteristics of (∑,+, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation is revealed about quasi-amino acids. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlation relationships may play an important part in establishing the logic relationship between codons and the quasi-amino acids during the course of life origination. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties about amino acids code are very difficult to observe. The present paper shows that (ZU, +, ×) is a field. Furthermore, the operational results display that the codon tga has different property from other stop codons. In fact, in the mitochondrion from human and ox genomic codon, tga is just tryptophane, is not the stop codon like in other genetic code, it is the case of the Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code, in other words it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3).

  7. Detection of nucleic acid sequences by invader-directed cleavage

    Science.gov (United States)

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  8. Classification of base sequences

    CERN Document Server

    Djokovic, Dragomir Z

    2010-01-01

    Base sequences BS(n+1,n) are quadruples of {1,-1}-sequences (A;B;C;D), with A and B of length n+1 and C and D of length n, such that the sum of their nonperiodic autocorrelation functions is a delta-function. The base sequence conjecture, asserting that BS(n+1,n) exist for all n, is stronger than the famous Hadamard matrix conjecture. We introduce a new definition of equivalence for base sequences BS(n+1,n) and construct a canonical form. By using this canonical form, we have enumerated the equivalence classes of BS(n+1,n) for n <= 30. Due to excessive size of the equivalence classes, the tables in the paper cover only the cases n <= 12.

  9. Identification of Protein-Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information.

    Science.gov (United States)

    Ding, Yijie; Tang, Jijun; Guo, Fei

    2016-01-01

    Identification of protein-protein interactions (PPIs) is a difficult and important problem in biology. Since experimental methods for predicting PPIs are both expensive and time-consuming, many computational methods have been developed to predict PPIs and interaction networks, which can be used to complement experimental approaches. However, these methods have limitations to overcome. They need a large number of homology proteins or literature to be applied in their method. In this paper, we propose a novel matrix-based protein sequence representation approach to predict PPIs, using an ensemble learning method for classification. We construct the matrix of Amino Acid Contact (AAC), based on the statistical analysis of residue-pairing frequencies in a database of 6323 protein-protein complexes. We first represent the protein sequence as a Substitution Matrix Representation (SMR) matrix. Then, the feature vector is extracted by applying algorithms of Histogram of Oriented Gradient (HOG) and Singular Value Decomposition (SVD) on the SMR matrix. Finally, we feed the feature vector into a Random Forest (RF) for judging interaction pairs and non-interaction pairs. Our method is applied to several PPI datasets to evaluate its performance. On the S . c e r e v i s i a e dataset, our method achieves 94 . 83 % accuracy and 92 . 40 % sensitivity. Compared with existing methods, and the accuracy of our method is increased by 0 . 11 percentage points. On the H . p y l o r i dataset, our method achieves 89 . 06 % accuracy and 88 . 15 % sensitivity, the accuracy of our method is increased by 0 . 76 % . On the H u m a n PPI dataset, our method achieves 97 . 60 % accuracy and 96 . 37 % sensitivity, and the accuracy of our method is increased by 1 . 30 % . In addition, we test our method on a very important PPI network, and it achieves 92 . 71 % accuracy. In the Wnt-related network, the accuracy of our method is increased by 16 . 67 % . The source code and all datasets are available

  10. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    Directory of Open Access Journals (Sweden)

    Kager Piet A

    2006-10-01

    Full Text Available Abstract Background Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (Plasmodium antigen detection do often not allow for species discrimination as microscopy does, but also become insensitive at Methods This paper reports the development of a sensitive and specific real-time Quantitative Nucleic Acid Sequence Based Amplification (real-time QT-NASBA assays, based on the small-subunit 18S rRNA gene, to identify the four human Plasmodium species. Results The lower detection limit of the assay is 100 – 1000 molecules in vitro RNA for all species, which corresponds to 0.01 – 0.1 parasite per diagnostic sample (i.e. 50 μl of processed blood. The real-time QT-NASBA was further evaluated using 79 clinical samples from malaria patients: i.e. 11 Plasmodium. falciparum, 37 Plasmodium vivax, seven Plasmodium malariae, four Plasmodium ovale and 20 mixed infections. The initial diagnosis of 69 out of the 79 samples was confirmed with the developed real-time QT-NASBA. Re-analysis of seven available original slides resolved five mismatches. Three of those were initially identified as P. malariae mono-infection, but after re-reading the slides P. falciparum was found, confirming the real-time QT-NASBA result. The other two slides were of poor quality not allowing true species identification. The remaining five discordant results could not be explained by microscopy, but may be due to extreme low numbers of parasites present in the samples. In addition, 12 Plasmodium berghei isolates from mice and 20 blood samples from healthy donors did not show any reaction in the assay. Conclusion Real-time QT-NASBA is a very sensitive and specific technique with a detection limit of 0.1 Plasmodium parasite per diagnostic sample (50 μl of blood and can be used for the detection, identification and quantitative measurement of low parasitaemia of Plasmodium species, thus

  11. The origin of biased sequence depth in sequence-independent nucleic acid amplification and optimization for efficient massive parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Toon Rosseel

    Full Text Available Sequence Independent Single Primer Amplification is one of the most widely used random amplification approaches in virology for sequencing template preparation. This technique relies on oligonucleotides consisting of a 3' random part used to prime complementary DNA synthesis and a 5' defined tag sequence for subsequent amplification. Recently, this amplification method was combined with next generation sequencing to obtain viral sequences. However, these studies showed a biased distribution of the resulting sequence reads over the analyzed genomes. The aim of this study was to elucidate the mechanisms that lead to biased sequence depth when using random amplification. Avian paramyxovirus type 8 was used as a model RNA virus to investigate these mechanisms. We showed, based on in silico analysis of the sequence depth in relation to GC-content, predicted RNA secondary structure and sequence complementarity to the 3' part of the tag sequence, that the tag sequence has the main contribution to the observed bias in sequence depth. We confirmed this finding experimentally using both fragmented and non-fragmented viral RNAs as well as primers differing in random oligomer length (6 or 12 nucleotides and in the sequence of the amplification tag. The observed oligonucleotide annealing bias can be reduced by extending the random oligomer sequence and by in silico combining sequence data from SISPA experiments using different 5' defined tag sequences. These findings contribute to the optimization of random nucleic acid amplification protocols that are currently required for downstream applications such as viral metagenomics and microarray analysis.

  12. Phylogenetic analysis of evolutionary relationships of the planctomycete division of the domain bacteria based on amino acid sequences of elongation factor Tu.

    Science.gov (United States)

    Jenkins, C; Fuerst, J A

    2001-05-01

    Sequences from the tuf gene coding for the elongation factor EF-Tu were amplified and sequenced from the genomic DNA of Pirellula marina and Isosphaera pallida, two species of bacteria within the order Planctomycetales. A near-complete (1140-bp) sequence was obtained from Pi. marina and a partial (759-bp) sequence was obtained for I. pallida. Alignment of the deduced Pi. marina EF-Tu amino acid sequence against reference sequences demonstrated the presence of a unique 11-amino acid sequence motif not present in any other division of the domain Bacteria. Pi. marina shared the highest percentage amino acid sequence identity with I. pallida but showed only a low percentage identity with other members of the domain Bacteria. This is consistent with the concept of the planctomycetes as a unique division of the Bacteria. Neither primary sequence comparison of EF-Tu nor phylogenetic analysis supports any close relationship between planctomycetes and the chlamydiae, which has previously been postulated on the basis of 16S rRNA. Phylogenetic analysis of aligned EF-Tu amino acid sequences performed using distance, maximum-parsimony, and maximum-likelihood approaches yielded contradictory results with respect to the position of planctomycetes relative to other bacteria. It is hypothesized that long-branch attraction effects due to unequal evolutionary rates and mutational saturation effects may account for some of the contradictions. PMID:11443344

  13. Comparative Amino Acid Sequences of Dengue Viruses

    OpenAIRE

    Haishi, Shozo; TANAKA Mariko; Igarashi, Akira

    1990-01-01

    Amino acid (AA) sequences of 4 serotype of dengue viruses deduced from their nucleotide (nt) sequences of genomic RNA were analyzed for each genome segment and each stretch of 10 AA residues. Precursor of membrane protein (pM), and 4 nonstructural proteins (NS1, NS3, NS4B, NS5) were highly conserved, while another nonstructural protein (NS2A) was least conserved among 5 strains of dengue viruses. When homology was compared among heterotypic viruses, type 1 and type 3 dengue viruses showed clo...

  14. On Quantum Algorithm for Multiple Alignment of Amino Acid Sequences

    Science.gov (United States)

    Iriyama, Satoshi; Ohya, Masanori

    2009-02-01

    The alignment of genome sequences or amino acid sequences is one of fundamental operations for the study of life. Usual computational complexity for the multiple alignment of N sequences with common length L by dynamic programming is O(LN). This alignment is considered as one of the NP problems, so that it is desirable to find a nice algorithm of the multiple alignment. Thus in this paper we propose the quantum algorithm for the multiple alignment based on the works12,1,2 in which the NP complete problem was shown to be the P problem by means of quantum algorithm and chaos information dynamics.

  15. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Science.gov (United States)

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  16. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    NARCIS (Netherlands)

    P.F. Mens; G.J. Schoone; P.A. Kager; H.D.F.H. Schallig

    2006-01-01

    Background: Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (< 20 parasites/mu l) the technique becomes less sensitive and time consuming. Rapid diagnostic tests based on Plasmodium antigen detection

  17. Classification of Base Sequences (+1,

    Directory of Open Access Journals (Sweden)

    Dragomir Ž. Ðoković

    2010-01-01

    Full Text Available Base sequences BS(+1, are quadruples of {±1}-sequences (;;;, with A and B of length +1 and C and D of length n, such that the sum of their nonperiodic autocor-relation functions is a -function. The base sequence conjecture, asserting that BS(+1, exist for all n, is stronger than the famous Hadamard matrix conjecture. We introduce a new definition of equivalence for base sequences BS(+1, and construct a canonical form. By using this canonical form, we have enumerated the equivalence classes of BS(+1, for ≤30. As the number of equivalence classes grows rapidly (but not monotonically with n, the tables in the paper cover only the cases ≤13.

  18. On the base sequence conjecture

    CERN Document Server

    Djokovic, Dragomir Z

    2010-01-01

    Let BS(m,n) denote the set of base sequences (A;B;C;D), with A and B of length m and C and D of length n. The base sequence conjecture (BSC) asserts that BS(n+1,n) exist (i.e., are non-empty) for all n. This is known to be true for n <= 36 and when n is a Golay number. We show that it is also true for n=37 and n=38. It is worth pointing out that BSC is stronger than the famous Hadamard matrix conjecture. In order to demonstrate the abundance of base sequences, we have previously attached to BS(n+1,n) a graph Gamma_n and computed the Gamma_n for n <= 27. We now extend these computations and determine the Gamma_n for n=28,...,35. We also propose a conjecture describing these graphs in general.

  19. Acid-Base Homeostasis.

    Science.gov (United States)

    Hamm, L Lee; Nakhoul, Nazih; Hering-Smith, Kathleen S

    2015-12-01

    Acid-base homeostasis and pH regulation are critical for both normal physiology and cell metabolism and function. The importance of this regulation is evidenced by a variety of physiologic derangements that occur when plasma pH is either high or low. The kidneys have the predominant role in regulating the systemic bicarbonate concentration and hence, the metabolic component of acid-base balance. This function of the kidneys has two components: reabsorption of virtually all of the filtered HCO3(-) and production of new bicarbonate to replace that consumed by normal or pathologic acids. This production or generation of new HCO3(-) is done by net acid excretion. Under normal conditions, approximately one-third to one-half of net acid excretion by the kidneys is in the form of titratable acid. The other one-half to two-thirds is the excretion of ammonium. The capacity to excrete ammonium under conditions of acid loads is quantitatively much greater than the capacity to increase titratable acid. Multiple, often redundant pathways and processes exist to regulate these renal functions. Derangements in acid-base homeostasis, however, are common in clinical medicine and can often be related to the systems involved in acid-base transport in the kidneys.

  20. The amino acid sequence of Escherichia coli cyanase.

    Science.gov (United States)

    Chin, C C; Anderson, P M; Wold, F

    1983-01-10

    The amino acid sequence of the enzyme cyanase (cyanate hydrolase) from Escherichia coli has been determined by automatic Edman degradation of the intact protein and of its component peptides. The primary peptides used in the sequencing were produced by cyanogen bromide cleavage at the methionine residues, yielding 4 peptides plus free homoserine from the NH2-terminal methionine, and by trypsin cleavage at the 7 arginine residues after acetylation of the lysines. Secondary peptides required for overlaps and COOH-terminal sequences were produced by chymotrypsin or clostripain cleavage of some of the larger peptides. The complete sequence of the cyanase subunit consists of 156 amino acid residues (Mr 16,350). Based on the observation that the cysteine-containing peptide is obtained as a disulfide-linked dimer, it is proposed that the covalent structure of cyanase is made up of two subunits linked by a disulfide bond between the single cystine residue in each subunit. The native enzyme (Mr 150,000) then appears to be a complex of four or five such subunit dimers.

  1. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    Science.gov (United States)

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar and the compound identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella querinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae, tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov. PMID:26071988

  2. Development of an electrochemical DNA biosensor for detection of specific Mycobacterium tuberculosis sequence based on poly(L-glutamic acid) modified electrode

    Indian Academy of Sciences (India)

    MERVE YESIL; SONER DONMEZ; FATMA ARSLAN

    2016-11-01

    An electrochemical DNA biosensor was developed by avidin-biotin interaction of a biotinylated probe and avidin-attached, poly(L-glutamic) acid coated pencil graphite electrode (PGA/PGE) for detection of specific Mycobacterium tuberculosis DNA sequence. The discrimination of fully complementary hybridization and mismatch hybridization was carried out by electrochemical reduction current of Meldola’s Blue (MDB) in square wave voltammetry (SWV). The calibration graph of the DNA biosensor was linear between 1.5–12.5 nM and the detection limit was calculated as 1.3 nM. The proposed biosensor successfully discriminated short andlong oligonucleotides related to DNA sequence of Mycobacterium tuberculosis in optimal condition.

  3. Electrochemical deoxyribonucleic acid biosensor based on electrodeposited graphene and nickel oxide nanoparticle modified electrode for the detection of salmonella enteritidis gene sequence.

    Science.gov (United States)

    Sun, Wei; Wang, Xiuli; Lu, Yongxi; Gong, Shixing; Qi, Xiaowei; Lei, Bingxin; Sun, Zhenfan; Li, Guangjiu

    2015-04-01

    In this paper a new electrochemical DNA biosensor was prepared by using graphene (GR) and nickel oxide (NiO) nanocomposite modified carbon ionic liquid electrode (CILE) as the substrate electrode. GR and NiO nanoparticles were electrodeposited on the CILE surface step-by-step to get the nanocomposite. Due to the strong affinity of NiO with phosphate groups of ssDNA, oligonucleotide probe with a terminal 5'-phosphate group could be attached on the surface of NiO/GR/CILE, which could further hybridize with the target ssDNA sequence. Methylene blue (MB) was used as the electrochemical indicator for monitoring the hybridization reaction. Under the optimal conditions the reduction peak current of MB was proportional to the concentration of salmonella enteritidis gene sequence in the range from 1.0×10(-13) to 1.0×10(-6)molL(-1) with a detection limit as 3.12×10(-14)molL(-1). This electrochemical DNA sensor exhibited good discrimination ability to one-base and three-base mismatched ssDNA sequences, and the polymerase chain reaction amplification product of salmonella enteritidis gene sequences were further detected with satisfactory results.

  4. A conversational system for the computer analysis of nucleic acid sequences.

    OpenAIRE

    Sege, R; Söll, D.; Ruddle, F H; Queen, C

    1981-01-01

    We present a conversational system for the computer analysis of nucleic acid and protein sequences based on the well-known Queen and Korn program (1). The system can be used by persons with only minimal knowledge of computers.

  5. Nucleotide Sequence of a Chicken Vitellogenin Gene and Derived Amino Acid Sequence of the Encoded Yolk Precursor Protein

    NARCIS (Netherlands)

    Schip, Fred D. van het; Samallo, John; Broos, Jaap; Ophuis, Jan; Mojet, Mart; Gruber, Max; AB, Geert

    1987-01-01

    The gene encoding the major vitellogenin from chicken has been completely sequenced and its exon-intron organization has been established. The gene is 20,342 base-pairs long and contains 35 exons with a combined length of 5787 base-pairs. They encode the 1850-amino acid pre-peptide of vitellogenin,

  6. Peptide Nucleic Acids Having Enhanced Binding Affinity and Sequence Specificity

    DEFF Research Database (Denmark)

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and binding affinity. Methods of increasing binding affinity and sequence specificity of peptide nucleic aci...

  7. MEANS AND METHODS FOR CLONING NUCLEIC ACID SEQUENCES

    NARCIS (Netherlands)

    Geertsma, Eric Robin; Poolman, Berend

    2008-01-01

    The invention provides means and methods for efficiently cloning nucleic acid sequences of interest in micro-organisms that are less amenable to conventional nucleic acid manipulations, as compared to, for instance, E.coli. The present invention enables high-throughput cloning (and, preferably, expr

  8. Amino acid sequences of proteins from Leptospira serovar pomona

    Directory of Open Access Journals (Sweden)

    Alves Selmo F

    2000-01-01

    Full Text Available This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  9. Representation of protein-sequence information by amino acid subalphabets

    DEFF Research Database (Denmark)

    Andersen, C.A.F.; Brunak, Søren

    2004-01-01

    Within computational biology, algorithms are constructed with the aim of extracting knowledge from biological data, in particular, data generated by the large genome projects, where gene and protein sequences are produced in high volume. In this article, we explore new ways of representing protein......-sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...

  10. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences.

    Science.gov (United States)

    Derr, Julien; Manapat, Michael L; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A; Chen, Irene A

    2012-05-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.

  11. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    Science.gov (United States)

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobichydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  12. Acids and bases solvent effects on acid-base strenght

    CERN Document Server

    Cox, Brian G

    2013-01-01

    Acids and bases are ubiquitous in chemistry. Our understanding of them, however, is dominated by their behaviour in water. Transfer to non-aqueous solvents leads to profound changes in acid-base strengths and to the rates and equilibria of many processes: for example, synthetic reactions involving acids, bases and nucleophiles; isolation of pharmaceutical actives through salt formation; formation of zwitter- ions in amino acids; and chromatographic separation of substrates. This book seeks to enhance our understanding of acids and bases by reviewing and analysing their behaviour in non-aqueous solvents. The behaviour is related where possible to that in water, but correlations and contrasts between solvents are also presented.

  13. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    Science.gov (United States)

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  14. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    DEFF Research Database (Denmark)

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active...... related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein...

  15. Conceptual Change and Science Achievement Related to a Lesson Sequence on Acids and Bases Among African American Alternative High School Students: A Teacher's Practical Arguments and the Voice of the "Other"

    Science.gov (United States)

    Wood, Lynda Charese

    The study of teaching and learning during the period of translating ideals of reform into classroom practice enables us to understand student-teacher-researcher symbiotic learning. In line with this assumption, the purpose of this study is threefold:(1) observe effects of the Common Knowledge Construction Model (CKCM), a conceptual change inquiry model of teaching and learning, on African American students' conceptual change and achievement; (2) observe the shift in teacher's practical arguments; and (3) narrate the voice of "the Other" about teacher professional learning. This study uses retrospective data from a mixed-method approach consisting of Phenomenography, practical arguments and story-telling. Data sources include audio-recordings of a chemistry teacher's individual interviews of her students' prior- and post-intervention conceptions of acids and bases; results of Acid-Base Achievement Test (ABA-T); video-recordings of a chemistry teacher's enactment of CKCM acid-base lesson sequence; audio-recordings of teacher-researcher reflective discourse using classroom video-clips; teacher interviews; and teacher and researcher personal reflective journals. Students' conceptual changes reflect change in the number of categories of description; shift in language use from everyday talk to chemical talk; and development of a hierarchy of chemical knowledge. ABA-T results indicated 17 students in the experimental group achieved significantly higher scores than 22 students in the control group taught by traditional teaching methods. The teacher-researcher reflective discourse about enactment of the CKCM acid-base lesson sequence reveals three major shifts in teacher practical arguments: teacher inadequate preparedness to adequate preparedness; lack of confidence to gain in confidence; and surface learning to deep learning. The developing story uncovers several aspects about teaching and learning of African American students: teacher caring for the uncared; cultivating

  16. Establishment of the Nucleic Acid Sequence-based Amplification Method for Detecting Vibio Alginolyticus%溶藻弧菌的依赖于核酸序列恒温扩增检测方法的建立

    Institute of Scientific and Technical Information of China (English)

    秦胜利; 王建广

    2012-01-01

    A new method, based on Nucleic Acid Sequence-based Amplification (NAS-BA) to detect Vibio alginolyticus of samples, was established. A highly specific set of primers was synthesized to target the hsp60 gene of Vibio alginolyticus so as to establish Nucleic Acid Sequence-based Amplification method. Specificity and sensitivity were tested. The results showed that the sensitivity of NASBA was 6. 9×102cfu ? mL-1 which was higher than the result of PCR method. Detecting Vibio alginolyticus with NASBA was more specific and sensitive than PCR method and has lower instrumental requirement. So,there is a broad prospect.%采用自行建立和优化的依赖于核酸序列恒温扩增(NASBA)检测体系,对溶藻弧菌进行检测.采用溶藻弧菌的hsp60基因为目的片段设计特异性引物,建成可快速检测溶藻弧菌的NASBA检测法,并进行了特异性和灵敏度试验.结果表明:所建立起的NASBA检测方法,灵敏度为6.9× 102 cfu·mL-1,高于普通PCR方法.溶藻弧菌的依赖于核酸序列恒温扩增检测方法具有较高灵敏度和和良好特异性,并且对仪器要求更低,用普通恒温设备即可进行反应,具有广阔的推广前景.

  17. NGS-based deep bisulfite sequencing.

    Science.gov (United States)

    Lee, Suman; Kim, Joomyeong

    2016-01-01

    We have developed an NGS-based deep bisulfite sequencing protocol for the DNA methylation analysis of genomes. This approach allows the rapid and efficient construction of NGS-ready libraries with a large number of PCR products that have been individually amplified from bisulfite-converted DNA. This approach also employs a bioinformatics strategy to sort the raw sequence reads generated from NGS platforms and subsequently to derive DNA methylation levels for individual loci. The results demonstrated that this NGS-based deep bisulfite sequencing approach provide not only DNA methylation levels but also informative DNA methylation patterns that have not been seen through other existing methods.•This protocol provides an efficient method generating NGS-ready libraries from individually amplified PCR products.•This protocol provides a bioinformatics strategy sorting NGS-derived raw sequence reads.•This protocol provides deep bisulfite sequencing results that can measure DNA methylation levels and patterns of individual loci.

  18. Comparative genomics beyond sequence-based alignments

    DEFF Research Database (Denmark)

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.;

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment...... methods to misalign, or even refuse to align, homologous ncRNAs, consequently obscuring that structural signal. We have used CMfinder, a structure-oriented local alignment tool, to search the ENCODE regions of vertebrate multiple alignments. In agreement with other studies, we find a large number...... of potential RNA structures in the ENCODE regions. We report 6587 candidate regions with an estimated false-positive rate of 50%. More intriguingly, many of these candidates may be better represented by alignments taking the RNA secondary structure into account than those based on primary sequence alone, often...

  19. Covariance of charged amino acids at positions 322 and 440 of HIV-1 Env contributes to coreceptor specificity of subtype B viruses, and can be used to improve the performance of V3 sequence-based coreceptor usage prediction algorithms.

    Directory of Open Access Journals (Sweden)

    Kieran Cashin

    Full Text Available The ability to determine coreceptor usage of patient-derived human immunodeficiency virus type 1 (HIV-1 strains is clinically important, particularly for the administration of the CCR5 antagonist maraviroc. The envelope glycoprotein (Env determinants of coreceptor specificity lie primarily within the gp120 V3 loop region, although other Env determinants have been shown to influence gp120-coreceptor interactions. Here, we determined whether conserved amino acid alterations outside the V3 loop that contribute to coreceptor usage exist, and whether these alterations improve the performance of V3 sequence-based coreceptor usage prediction algorithms. We demonstrate a significant covariant association between charged amino acids at position 322 in V3 and position 440 in the C4 Env region that contributes to the specificity of HIV-1 subtype B strains for CCR5 or CXCR4. Specifically, positively charged Lys/Arg at position 322 and negatively charged Asp/Glu at position 440 occurred more frequently in CXCR4-using viruses, whereas negatively charged Asp/Glu at position 322 and positively charged Arg at position 440 occurred more frequently in R5 strains. In the context of CD4-bound gp120, structural models suggest that covariation of amino acids at Env positions 322 and 440 has the potential to alter electrostatic interactions that are formed between gp120 and charged amino acids in the CCR5 N-terminus. We further demonstrate that inclusion of a "440 rule" can improve the sensitivity of several V3 sequence-based genotypic algorithms for predicting coreceptor usage of subtype B HIV-1 strains, without compromising specificity, and significantly improves the AUROC of the geno2pheno algorithm when set to its recommended false positive rate of 5.75%. Together, our results provide further mechanistic insights into the intra-molecular interactions within Env that contribute to coreceptor specificity of subtype B HIV-1 strains, and demonstrate that incorporation

  20. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    Science.gov (United States)

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  1. Nucleotide sequence and corresponding amino acid sequence of the gene for the major antigen of foot and mouth disease virus.

    OpenAIRE

    Kurz, C; Forss, S; Küpper, H; K Strohmaier; Schaller, H

    1981-01-01

    A segment of 1160 nucleotides of the FMDV genome has been sequenced using three overlapping fragments of cloned cDNA from FMDV strain O1K. This sequence contains the coding sequence for the viral capsid protein VP1 as shown by its homology to known and newly determined amino acid sequences from this man antigenic polypeptide of the FMDV virion. The structural gene for VP1 comprises 639 nucleotides which specify a sequence of 213 amino acids for the VP1 protein. The coding sequence is not flan...

  2. Correlation between fibroin amino acid sequence and physical silk properties.

    Science.gov (United States)

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet. PMID:12816957

  3. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    Science.gov (United States)

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  4. Protein location prediction using atomic composition and global features of the amino acid sequence

    Energy Technology Data Exchange (ETDEWEB)

    Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India); Nair, Achuthsankar S. [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India)

    2010-01-22

    Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectively used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.

  5. Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

    Science.gov (United States)

    Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

    2003-01-01

    A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171

  6. SNAD: sequence name annotation-based designer

    Directory of Open Access Journals (Sweden)

    Gorbalenya Alexander E

    2009-08-01

    Full Text Available Abstract Background A growing diversity of biological data is tagged with unique identifiers (UIDs associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Results Here we introduce SNAD (Sequence Name Annotation-based Designer that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. Conclusion A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  7. Human apolipoprotein C-II: complete nucleic acid sequence of preapolipoprotein C-II.

    OpenAIRE

    Fojo, S S; Law, S W; Brewer, H B

    1984-01-01

    Apolipoprotein (apo) C-II is a cofactor for lipoprotein lipase, the enzyme that catalyzes the hydrolysis of triglycerides on plasma triglyceride-rich lipoproteins. The complete coding sequence of apoC-II mRNA has been determined from an apoC-II clone isolated from a human liver cDNA library. A 17-base-long synthetic oligonucleotide based on amino acid residues 5-10 of apoC-II was utilized as a hybridization probe to select recombinant plasmids containing the apoC-II sequence. Two thousand fou...

  8. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    Science.gov (United States)

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-01

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. PMID:27375218

  9. Molten fatty acid based microemulsions.

    Science.gov (United States)

    Noirjean, Cecile; Testard, Fabienne; Dejugnat, Christophe; Jestin, Jacques; Carriere, David

    2016-06-21

    We show that ternary mixtures of water (polar phase), myristic acid (MA, apolar phase) and cetyltrimethylammonium bromide (CTAB, cationic surfactant) studied above the melting point of myristic acid allow the preparation of microemulsions without adding a salt or a co-surfactant. The combination of SANS, SAXS/WAXS, DSC, and phase diagram determination allows a complete characterization of the structures and interactions between components in the molten fatty acid based microemulsions. For the different structures characterized (microemulsion, lamellar or hexagonal phases), a similar thermal behaviour is observed for all ternary MA/CTAB/water monophasic samples and for binary MA/CTAB mixtures without water: crystalline myristic acid melts at 52 °C, and a thermal transition at 70 °C is assigned to the breaking of hydrogen bounds inside the mixed myristic acid/CTAB complex (being the surfactant film in the ternary system). Water determines the film curvature, hence the structures observed at high temperature, but does not influence the thermal behaviour of the ternary system. Myristic acid is partitioned in two "species" that behave independently: pure myristic acid and myristic acid associated with CTAB to form an equimolar complex that plays the role of the surfactant film. We therefore show that myristic acid plays the role of a solvent (oil) and a co-surfactant allowing the fine tuning of the structure of oil and water mixtures. This solvosurfactant behaviour of long chain fatty acid opens the way for new formulations with a complex structure without the addition of any extra compound. PMID:27241163

  10. Nucleotide and amino acid sequences of human intestinal alkaline phosphatase: close homology to placental alkaline phosphatase

    International Nuclear Information System (INIS)

    A cDNA clone for human adult intestinal alkaline phosphatase (ALP) [orthophosphoric-monoester phosphohydrolase (alkaline optimum); EC 3.1.3.1] was isolated from a λgt11 expression library. The cDNA insert of this clone is 2513 base pairs in length and contains an open reading frame that encodes a 528-amino acid polypeptide. This deduced polypeptide contains the first 40 amino acids of human intestinal ALP, as determined by direct protein sequencing. Intestinal ALP shows 86.5% amino acid identity to placental (type 1) ALP and 56.6% amino acid identity to liver/bone/kidney ALP. In the 3'-untranslated regions, intestinal and placental ALP cDNAs are 73.5% identical (excluding gaps). The evolution of this multigene enzyme family is discussed

  11. Next-Generation Sequencing Techniques for Eukaryotic Microorganisms: Sequencing-Based Solutions to Biological Problems▿

    OpenAIRE

    Nowrousian, Minou

    2010-01-01

    Over the past 5 years, large-scale sequencing has been revolutionized by the development of several so-called next-generation sequencing (NGS) technologies. These have drastically increased the number of bases obtained per sequencing run while at the same time decreasing the costs per base. Compared to Sanger sequencing, NGS technologies yield shorter read lengths; however, despite this drawback, they have greatly facilitated genome sequencing, first for prokaryotic genomes and within the las...

  12. A repetitive sequence assembler based on next-generation sequencing.

    Science.gov (United States)

    Lian, S; Tu, Y; Wang, Y; Chen, X; Wang, L

    2016-01-01

    Repetitive sequences of variable length are common in almost all eukaryotic genomes, and most of them are presumed to have important biomedical functions and can cause genomic instability. Next-generation sequencing (NGS) technologies provide the possibility of identifying capturing these repetitive sequences directly from the NGS data. In this study, we assessed the performances in identifying capturing repeats of leading assemblers, such as Velvet, SOAPdenovo, SGA, MSR-CA, Bambus2, ALLPATHS-LG, and AByss using three real NGS datasets. Our results indicated that most of them performed poorly in capturing the repeats. Consequently, we proposed a repetitive sequence assembler, named NGSReper, for capturing repeats from NGS data. Simulated datasets were used to validate the feasibility of NGSReper. The results indicate that the completeness of capturing repeat is up to 99%. Cross validation was performed in three real NGS datasets, and extensive comparisons indicate that NGSReper performed best in terms of completeness and accuracy in capturing repeats. In conclusion, NGSReper is an appropriate and suitable tool for capturing repeats directly from NGS data. PMID:27525861

  13. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    Science.gov (United States)

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  14. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    Science.gov (United States)

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  15. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Science.gov (United States)

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  16. Repeat-based Sequence Typing of Carnobacterium maltaromaticum.

    Science.gov (United States)

    Rahman, Abdur; El Kheir, Sara M; Back, Alexandre; Mangavel, Cécile; Revol-Junelles, Anne-Marie; Borges, Frédéric

    2016-06-01

    Carnobacterium maltaromaticum is a Lactic Acid Bacterium (LAB) of technological interest for the food industry, especially the dairy as bioprotection and ripening flora. The industrial use of this LAB requires accurate and resolutive typing tools. A new typing method for C. maltaromaticum inspired from MLVA analysis and called Repeat-based Sequence Typing (RST) is described. Rather than electrophoresis analysis, our RST method is based on sequence analysis of multiple loci containing Variable-Number Tandem-Repeats (VNTRs). The method described here for C. maltaromaticum relies on the analysis of three VNTR loci, and was applied to a collection of 24 strains. For each strain, a PCR product corresponding to the amplification of each VNTR loci was sequenced. Sequence analysis allowed delineating 11, 11, and 12 alleles for loci VNTR-A, VNTR-B, and VNTR-C, respectively. Considering the allele combination exhibited by each strain allowed defining 15 genotypes, ending in a discriminatory index of 0.94. Comparison with MLST revealed that both methods were complementary for strain typing in C. maltaromaticum.

  17. Structure of the fully modified left-handed cyclohexene nucleic acid sequence GTGTACAC.

    Science.gov (United States)

    Robeyns, Koen; Herdewijn, Piet; Van Meervelt, Luc

    2008-02-13

    CeNA oligonucleotides consist of a phosphorylated backbone where the deoxyribose sugars are replaced by cyclohexene moieties. The X-ray structure determination and analysis of a fully modified octamer sequence GTGTACAC, which is the first crystal structure of a carbocyclic-based nucleic acid, is presented. This particular sequence was built with left-handed building blocks and crystallizes as a left-handed double helix. The helix can be characterized as belonging to the (mirrored) A-type family. Crystallographic data were processed up to 1.53 A, and the octamer sequence crystallizes in the space group R32. The sugar puckering is found to adopt the 3H2 half-chair conformation which mimics the C3'-endo conformation of the ribose sugar. The double helices stack on top of each other to form continuous helices, and static disorder is observed due to this end-to-end stacking.

  18. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States); Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section; Wain-Hobson, S. [Pasteur Inst., Paris (France)] [ed.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  19. Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Gclust Server Amino acid sequences used for clusterintg (Multi FASTA format) Data detail Data name Amino acid sequence...s used for clusterintg (Multi FASTA format) Description of data contents Amino acid sequences of p...a entries - FASTA format header: ID and the annotation information of the sequence... Site Policy | Contact Us Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive ...

  20. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    Science.gov (United States)

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  1. Entamoeba histolytica: observations on metabolism based on thegenome sequence

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, Iain J.; Loftus, Brendan J.

    2005-07-01

    The sequencing of the genome of Entamoeba histolytica has allowed a reconstruction of its metabolic pathways, many of which are unusual for a eukaryote. Based on the genome sequence, it appears that amino acids may play a larger role than previously thought in energy metabolism, with roles in both ATP synthesis and NAD regeneration. Arginine decarboxylase may be involved in survival of E. histolytica during its passage through the stomach. The usual pyrimidine synthesis pathway is absent, but a partial pyrimidine degradation pathway could be part of a novel pyrimidine synthesis pathway. Ribonucleotide reductase was not found in the E. histolytica genome, but it was found in the close relatives Entamoeba invadens and Entamoeba moshkovskii, suggesting a recent loss from E. histolytica. The usual eukaryotic glucose transporters are not present, but members of a prokaryotic monosaccharide transporter family are present.

  2. Stream cipher based on GSS sequences

    Institute of Scientific and Technical Information of China (English)

    HU Yupu; XIAO Guozhen

    2004-01-01

    Generalized self-shrinking sequences, simply named the GSS sequences,are novel periodic sequences that have many advantages in cryptography. In this paper,we give several results about GSS sequence's application to cryptography. First, we give a simple method for selecting those GSS sequences whose least periods reach the maximum. Second, we give a method for describing and computing the auto-correlation coefficients of GSS sequences. Finally, we point out that some GSS sequences, when used as stream ciphers, have a security weakness.

  3. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    Science.gov (United States)

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.

  4. Contig sequences and their annotation (amino acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us Dicty_cDB Contig sequence...s and their annotation (amino acid sequence and results of homology search), and ex...pression profile Data detail Data name Contig sequences and their annotation (amino acid sequence and result...s of homology search), and expression profile Description of data contents Contig sequences of cDNA sequen...ces of Dictyostelium discoideum and their annotation (amino acid sequence and resul

  5. Nucleic acid based logical systems.

    Science.gov (United States)

    Han, Da; Kang, Huaizhi; Zhang, Tao; Wu, Cuichen; Zhou, Cuisong; You, Mingxu; Chen, Zhuo; Zhang, Xiaobing; Tan, Weihong

    2014-05-12

    Researchers increasingly visualize a significant role for artificial biochemical logical systems in biological engineering, much like digital logic circuits in electrical engineering. Those logical systems could be utilized as a type of servomechanism to control nanodevices in vitro, monitor chemical reactions in situ, or regulate gene expression in vivo. Nucleic acids (NA), as carriers of genetic information with well-regulated and predictable structures, are promising materials for the design and engineering of biochemical circuits. A number of logical devices based on nucleic acids (NA) have been designed to handle various processes for technological or biotechnological purposes. This article focuses on the most recent and important developments in NA-based logical devices and their evolution from in vitro, through cellular, even towards in vivo biological applications.

  6. Prediction of protein motions from amino acid sequence and its application to protein-protein interaction

    Directory of Open Access Journals (Sweden)

    Wako Hiroshi

    2010-07-01

    Full Text Available Abstract Background Structural flexibility is an important characteristic of proteins because it is often associated with their function. The movement of a polypeptide segment in a protein can be broken down into two types of motions: internal and external ones. The former is deformation of the segment itself, but the latter involves only rotational and translational motions as a rigid body. Normal Model Analysis (NMA can derive these two motions, but its application remains limited because it necessitates the gathering of complete structural information. Results In this work, we present a novel method for predicting two kinds of protein motions in ordered structures. The prediction uses only information from the amino acid sequence. We prepared a dataset of the internal and external motions of segments in many proteins by application of NMA. Subsequently, we analyzed the relation between thermal motion assessed from X-ray crystallographic B-factor and internal/external motions calculated by NMA. Results show that attributes of amino acids related to the internal motion have different features from those related to the B-factors, although those related to the external motion are correlated strongly with the B-factors. Next, we developed a method to predict internal and external motions from amino acid sequences based on the Random Forest algorithm. The proposed method uses information associated with adjacent amino acid residues and secondary structures predicted from the amino acid sequence. The proposed method exhibited moderate correlation between predicted internal and external motions with those calculated by NMA. It has the highest prediction accuracy compared to a naïve model and three published predictors. Conclusions Finally, we applied the proposed method predicting the internal motion to a set of 20 proteins that undergo large conformational change upon protein-protein interaction. Results show significant overlaps between the

  7. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    Directory of Open Access Journals (Sweden)

    Ruan Jishou

    2007-04-01

    Full Text Available Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP; the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are

  8. SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY

    Directory of Open Access Journals (Sweden)

    Yoshihisa Udagawa

    2013-07-01

    Full Text Available Duplicate code adversely affects the quality of software systems and hence should be detected. We discuss an approach that improves source code retrieval using structural information of source code. A lexical parser is developed to extract control statements and method identifiers from Java programs. We propose a similarity measure that is defined by the ratio of the number of sequential fully matching statements to the number of sequential partially matching statements. The defined similarity measure is an extension of the set-based Sorensen-Dice similarity index. This research primarily contributes to the development of a similarity retrieval algorithm that derives meaningful search conditions from a given sequence, and then performs retrieval using all derived conditions. Experiments show that our retrieval model shows an improvement of up to 90.9% over other retrieval models relative to the number of retrieved methods.

  9. Source Code Retrieval Using Sequence Based Similarity

    Directory of Open Access Journals (Sweden)

    Yoshihisa Udagawa

    2013-07-01

    Full Text Available Duplicate code adversely affects the quality of software systems and hence should be detected. We discussan approach that improves source code retrieval using structural information of source code. A lexicalparser is developed to extract control statements and method identifiers from Java programs. We propose asimilarity measure that is defined by the ratio ofthe number of sequential fully matching statementsto thenumber of sequential partially matching statements.The defined similarity measure is an extension oftheset-based Sorensen-Dice similarity index. This research primarily contributes to the development of asimilarity retrieval algorithm that derives meaningful search conditions from a given sequence, and thenperforms retrieval using all derived conditions. Experiments show that our retrieval model shows animprovement of up to 90.9% over other retrieval models relative to the number of retrieved methods.

  10. The complete amino acid sequence of the basic nuclear protein of bull spermatozoa

    NARCIS (Netherlands)

    Coelingh, J.P.; Monfoort, Cornelis H.; Rozijn, Thomas H.; Gevers Leuven, Jan A.; Schiphof, R.; Steyn-Parvé, Elizabeth P.; Braunitzer, Gerhard; Schrank, Barbara; Ruhfus, Annette

    1972-01-01

    The complete amino acid sequence of the basic nuclear protein of bull spermatozoa has been established. The sequence was partially deduced by characterization of peptides isolated from thermolysine and chymotryptic digests of the reduced and S-aminoethylated protein. The complete sequence of the fir

  11. Amino acid selective unlabeling for sequence specific resonance assignments in proteins

    Energy Technology Data Exchange (ETDEWEB)

    Krishnarjuna, B.; Jaipuria, Garima; Thakur, Anushikha [Indian Institute of Science, NMR Research Centre (India); D' Silva, Patrick, E-mail: patrick@biochem.iisc.ernet.in [Indian Institute of Science, Department of Biochemistry (India); Atreya, Hanudatta S., E-mail: hsatreya@sif.iisc.ernet.in [Indian Institute of Science, NMR Research Centre (India)

    2011-01-15

    Sequence specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective 'unlabeling' or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly {sup 13}C/{sup 15}N labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named, {l_brace}{sup 12}CO{sub i}-{sup 15}N{sub i+1}{r_brace}-filtered HSQC, which aids in linking the {sup 1}H{sup N}/{sup 15}N resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues: i - 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, applicable to {sup 2}H labeled protein and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of {sup 14}N at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence specific resonance assignments and opens up new avenues to using this methodology in protein structural studies.

  12. Quick Trickle Permutation Based on Quick Trickle Characteristic Sequence

    Institute of Scientific and Technical Information of China (English)

    WangLi-na; FeiRu-chun; LiuZhu

    2003-01-01

    The concept of quick trickle characteristic sequence is presented, the properties and count of quick trickle characteristic sequence are researched, the mapping relationship between quick trickle characteristic sequence and quick trickle permutation is discussed. Finally, an efficient construction of quick trickle permutation based on quick trickle characteristic sequence is given, by which quick trickle permutation can be figured out after constructing quick trickle characteristic sequence. Quick trickle permutation has good cryptographic properties.

  13. Amino acid sequences of neuropeptides in the sinus gland of the land crab Cardisoma carnifex: a novel neuropeptide proteolysis site.

    Science.gov (United States)

    Newcomb, R W

    1987-08-01

    The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly- Ser-Ala-Glu- Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.

  14. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    Science.gov (United States)

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  15. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing.

    Science.gov (United States)

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G; Baylis, Sally A

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  16. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    Science.gov (United States)

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS)

  17. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    Energy Technology Data Exchange (ETDEWEB)

    Croteau, Rodney Bruce (Pullman, WA); Wildung, Mark Raymond (Colfax, WA); Lange, Bernd Markus (Pullman, WA); McCaskill, David G. (Pullman, WA)

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  18. The amino acid sequence of elephant (Elephas maximus) myoglobin and the phylogeny of Proboscidea.

    Science.gov (United States)

    Dene, H; Goodman, M; Romero-Herrera, A E

    1980-02-13

    The complete amino acid sequence of skeletal myoglobin from the Asian elephant (Elephas maximus) is reported. The functional significance of variations seen when this sequence is compared with that of sperm whale myoglobin is explored in the light of the crystallographic model available for the latter molecule. The phylogenetic implications of the elephant myoglobin amino acid sequence are evaluated by using the maximum parsimony technique. A similar analysis is also presented which incorporates all of the proteins sequenced from the elephant. These results are discussed with respect to current views on proboscidean phylogeny.

  19. Students' Understanding of Acids/Bases in Organic Chemistry Contexts

    Science.gov (United States)

    Cartrette, David P.; Mayo, Provi M.

    2011-01-01

    Understanding key foundational principles is vital to learning chemistry across different contexts. One such foundational principle is the acid/base behavior of molecules. In the general chemistry sequence, the Bronsted-Lowry theory is stressed, because it lends itself well to studying equilibrium and kinetics. However, the Lewis theory of…

  20. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Directory of Open Access Journals (Sweden)

    James B Howard

    Full Text Available Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification

  1. Simulation-Based Evaluation of Learning Sequences for Instructional Technologies

    Science.gov (United States)

    McEneaney, John E.

    2016-01-01

    Instructional technologies critically depend on systematic design, and learning hierarchies are a commonly advocated tool for designing instructional sequences. But hierarchies routinely allow numerous sequences and choosing an optimal sequence remains an unsolved problem. This study explores a simulation-based approach to modeling learning…

  2. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Science.gov (United States)

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697

  3. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Directory of Open Access Journals (Sweden)

    Chun-Tien Chang

    2012-01-01

    Full Text Available The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs, insertion-deletions (indels, short tandem repeats (STRs, and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR, which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS; (iii determine human papilloma virus (HPV genotypes by searching current viral databases in cases of double infections; (iv estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4 and its paralog HSPDP3.

  4. Mixed sequence reader: a program for analyzing DNA sequences with heterozygous base calling.

    Science.gov (United States)

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3.

  5. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    Science.gov (United States)

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  6. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    Science.gov (United States)

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  7. Amino Acid Sequence - KOME | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us ...ll_sequence_amino_db.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST...ta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_ine_full_sequence_amino_db.fasta.zip Fi...date History of This Database Site Policy | Contact Us Amino Acid Sequence - KOME | LSDB Archive ...

  8. Fusion protein predicted amino acid sequence of the first US avian pneumovirus isolate and lack of heterogeneity among other US isolates.

    Science.gov (United States)

    Seal, B S; Sellers, H S; Meinersmann, R J

    2000-02-01

    Avian pneumovirus (APV) was first isolated from turkeys in the west-central US following emergence of turkey rhinotracheitis (TRT) during 1996. Subsequently, several APV isolates were obtained from the north-central US. Matrix (M) and fusion (F) protein genes of these isolates were examined for sequence heterogeneity and compared with European APV subtypes A and B. Among US isolates the M gene shared greater than 98% nucleotide sequence identity with only one nonsynonymous change occurring in a single US isolate. Although the F gene among US APV isolates shared 98% nucleotide sequence identity, nine conserved substitutions were detected in the predicted amino acid sequence. The predicted amino acid sequence of the US APV isolate's F protein had 72% sequence identity to the F protein of APV subtype A and 71% sequence identity to the F protein of APV subtype B. This compares with 83% sequence identity between the APV subtype A and B predicted amino acid sequences of the F protein. The US isolates were phylogenetically distinguishable from their European counterparts based on F gene nucleotide or predicted amino acid sequences. Lack of sequence heterogeneity among US APV subtypes indicates these viruses have maintained a relatively stable population since the first outbreak of TRT. Phylogenetic analysis of the F protein among APV isolates supports classification of US isolates as a new APV subtype C.

  9. Identification of protein superfamily from structure- based sequence motif

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The structure-based sequence motif of the distant proteins in evolution, protein tyrosine phosphatases (PTP) Ⅰ and Ⅱ superfamilies, as an example, has been defined by the structural comparison, structure-based sequence alignment and analyses on substitution patterns of residues in common sequence conserved regions. And the phosphatases Ⅰ and Ⅱ can be correctly identified together by the structure-based PTP sequence motif from SWISS-PROT and TrEBML databases. The results show that the correct rates of identification are over 98%. This is the first time to identify PTP Ⅰ and Ⅱ together by this motif.

  10. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid

    Science.gov (United States)

    Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid. PMID:27660792

  11. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Science.gov (United States)

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid. PMID:27660792

  12. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    Science.gov (United States)

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  13. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Science.gov (United States)

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid.

  14. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M;

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250-555 p...

  15. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    Science.gov (United States)

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  16. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    Science.gov (United States)

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  17. Two distinct ferredoxins from Rhodobacter capsulatus: complete amino acid sequences and molecular evolution.

    Science.gov (United States)

    Saeki, K; Suetsugu, Y; Yao, Y; Horio, T; Marrs, B L; Matsubara, H

    1990-09-01

    Two distinct ferredoxins were purified from Rhodobacter capsulatus SB1003. Their complete amino acid sequences were determined by a combination of protease digestion, BrCN cleavage and Edman degradation. Ferredoxins I and II were composed of 64 and 111 amino acids, respectively, with molecular weights of 6,728 and 12,549 excluding iron and sulfur atoms. Both contained two Cys clusters in their amino acid sequences. The first cluster of ferredoxin I and the second cluster of ferredoxin II had a sequence, CxxCxxCxxxCP, in common with the ferredoxins found in Clostridia. The second cluster of ferredoxin I had a sequence, CxxCxxxxxxxxCxxxCM, with extra amino acids between the second and third Cys, which has been reported for other photosynthetic bacterial ferredoxins and putative ferredoxins (nif-gene products) from nitrogen-fixing bacteria, and with a unique occurrence of Met. The first cluster of ferredoxin II had a CxxCxxxxCxxxCP sequence, with two additional amino acids between the second and third Cys, a characteristics feature of Azotobacter-[3Fe-4S] [4Fe-4S]-ferredoxin. Ferredoxin II was also similar to Azotobacter-type ferredoxins with an extended carboxyl (C-) terminal sequence compared to the common Clostridium-type. The evolutionary relationship of the two together with a putative one recently found to be encoded in nifENXQ region in this bacterium [Moreno-Vivian et al. (1989) J. Bacteriol. 171, 2591-2598] is discussed. PMID:2277040

  18. Comparison of sequence-based and structure-based phylogenetic trees of homologous proteins: Inferences on protein evolution

    Indian Academy of Sciences (India)

    S Balaji; N Srinivasan

    2007-01-01

    Several studies based on the known three-dimensional (3-D) structures of proteins show that two homologous proteins with insignificant sequence similarity could adopt a common fold and may perform same or similar biochemical functions. Hence, it is appropriate to use similarities in 3-D structure of proteins rather than the amino acid sequence similarities in modelling evolution of distantly related proteins. Here we present an assessment of using 3-D structures in modelling evolution of homologous proteins. Using a dataset of 108 protein domain families of known structures with at least 10 members per family we present a comparison of extent of structural and sequence dissimilarities among pairs of proteins which are inputs into the construction of phylogenetic trees. We find that correlation between the structure-based dissimilarity measures and the sequence-based dissimilarity measures is usually good if the sequence similarity among the homologues is about 30% or more. For protein families with low sequence similarity among the members, the correlation coefficient between the sequence-based and the structure-based dissimilarities are poor. In these cases the structure-based dendrogram clusters proteins with most similar biochemical functional properties better than the sequence-similarity based dendrogram. In multi-domain protein families and disulphide-rich protein families the correlation coefficient for the match of sequence-based and structure-based dissimilarity (SDM) measures can be poor though the sequence identity could be higher than 30%. Hence it is suggested that protein evolution is best modelled using 3-D structures if the sequence similarities (SSM) of the homologues are very low.

  19. Image-based temporal alignment of echocardiographic sequences

    Science.gov (United States)

    Danudibroto, Adriyana; Bersvendsen, Jørn; Mirea, Oana; Gerard, Olivier; D'hooge, Jan; Samset, Eigil

    2016-04-01

    Temporal alignment of echocardiographic sequences enables fair comparisons of multiple cardiac sequences by showing corresponding frames at given time points in the cardiac cycle. It is also essential for spatial registration of echo volumes where several acquisitions are combined for enhancement of image quality or forming larger field of view. In this study, three different image-based temporal alignment methods were investigated. First, a method based on dynamic time warping (DTW). Second, a spline-based method that optimized the similarity between temporal characteristic curves of the cardiac cycle using 1D cubic B-spline interpolation. Third, a method based on the spline-based method with piecewise modification. These methods were tested on in-vivo data sets of 19 echo sequences. For each sequence, the mitral valve opening (MVO) time was manually annotated. The results showed that the average MVO timing error for all methods are well under the time resolution of the sequences.

  20. Protein chemotaxonomy. XIII. Amino acid sequence of ferredoxin from Panax ginseng.

    Science.gov (United States)

    Mino, Yoshiki

    2006-08-01

    The complete amino acid sequence of [2Fe-2S] ferredoxin from Panax ginseng (Araliaceae) has been determined by automated Edman degradation of the entire S-carboxymethylcysteinyl protein and of the peptides obtained by enzymatic digestion. This ferredoxin has a unique amino acid sequence, which includes an insertion of Tyr at the 3rd position from the amino-terminus and a deletion of two amino acid residues at the carboxyl terminus. This ferredoxin had 18 differences in its amino acid sequence compared to that of Petroselinum sativum (Umbelliferae). In contrast, 23-33 differences were observed compared to other dicotyledonous plants. This suggests that Panax ginseng is related taxonomically to umbelliferous plants. PMID:16880642

  1. Genetic characterization of three novel chicken parvovirus strains based on analysis of their coding sequences.

    Science.gov (United States)

    Koo, Bon-Sang; Lee, Hae-Rim; Jeon, Eun-Ok; Han, Moo-Sung; Min, Kyeong-Cheol; Lee, Seung-Baek; Bae, Yeon-Ji; Cho, Sun-Hyung; Mo, Jong-Suk; Kwon, Hyuk Moo; Sung, Haan Woo; Kim, Jong-Nyeo; Mo, In-Pil

    2015-01-01

    Chicken parvovirus (ChPV) is one of the causative agents of viral enteritis. Recently, the genome of the ABU-P1 strain of ChPV was fully sequenced and determined to have a distinct genomic composition compared with that of vertebrate parvoviruses. However, no comparative sequence analysis of coding regions of ChPVs was possible because of the lack of other sequence information. In this study, we obtained the nucleotide sequences of all genomic coding regions of three ChPVs by polymerase chain reaction using 13 primer sets, and deduced the amino acid sequences from the nucleotide sequences. The non-structural protein 1 (NS1) gene of the three ChPVs showed 95.0 to 95.5% nucleotide sequence identity and 96.5 to 98.1% amino acid sequence identity to those of NS1 from the ABU-P1 strain, respectively, and even higher nucleotide and amino acid similarities to one another. The viral proteins (VP) gene was more divergent between the three ChPV Korean strains and ABU-P1, with 88.1 to 88.3% nucleotide identity and 93.0% amino acid identity. Analysis of the putative tertiary structure of the ChPV VP2 protein showed that variable regions with less than 80% nucleotide similarity between the three Korean strains and ABU-P1 occurred in large loops of the VP2 protein believed to be involved in antigenicity, pathogenicity, and tissue tropism in other parvoviruses. Based on our analysis of full-length coding sequences, we discovered greater variation in ChPV strains than reported previously, especially in partial regions of the VP2 protein.

  2. Amino acid sequence heterogeneity of the chromosomal encoded Borrelia burgdorferi sensu lato major antigen P100.

    Science.gov (United States)

    Fellinger, W; Farencena, A; Redl, B; Sambri, V; Cevenini, R; Stöffler, G

    1995-04-01

    The entire nucleotide sequence of the chromosomal encoded major antigen p100 of the European Borrelia garinii isolate B29 was determined and the deduced amino acid sequence was compared to the homologous antigen p83 of the North American Borrelia burgdorferi sensu stricto strain B31 and the p100 of the European Borrelia afzelii (group VS461) strain PKo. p100 of strain B29 shows 87% amino acid sequence identity to strain B31 and 79.2% to strain PKo, p100 of strain B31 and PKo shows 62.5% identity to each other. In addition, partial nucleotide sequences of the most heterogeneous region of the p100 gene of two other Borrelia garinii isolates (PBi and VS286) have been determined and the deduced amino acid sequences were compared with all p100 of Borrelia garinii published so far. We found an amino acid sequence identity between 88.6 and 100% within the same genospecies. The N-terminal part of the p100 proteins is highly conserved whereas a striking heterogeneous region within the C-terminal part of the proteins was observed.

  3. An Incremental Algorithm of Text Clustering Based on Semantic Sequences

    Institute of Scientific and Technical Information of China (English)

    FENG Zhonghui; SHEN Junyi; BAO Junpeng

    2006-01-01

    This paper proposed an incremental textclustering algorithm based on semantic sequence.Using similarity relation of semantic sequences and calculating the cover of similarity semantic sequences set, the candidate cluster with minimum entropy overlap value was selected as a result cluster every time in this algorithm.The comparison of experimental results shows that the precision of the algorithm is higher than other algorithms under same conditions and this is obvious especially on long documents set.

  4. Feature-based Image Sequence Compression Coding

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    A novel compressing method for video teleconference applications is presented. Semantic-based coding based on human image feature is realized, where human features are adopted as parameters. Model-based coding and the concept of vector coding are combined with the work on image feature extraction to obtain the result.

  5. The Conjugate Acid-Base Chart.

    Science.gov (United States)

    Treptow, Richard S.

    1986-01-01

    Discusses the difficulties that beginning chemistry students have in understanding acid-base chemistry. Describes the use of conjugate acid-base charts in helping students visualize the conjugate relationship. Addresses chart construction, metal ions, buffers and pH titrations, and the organic functional groups and nonaqueous solvents. (TW)

  6. The Kidney and Acid-Base Regulation

    Science.gov (United States)

    Koeppen, Bruce M.

    2009-01-01

    Since the topic of the role of the kidneys in the regulation of acid base balance was last reviewed from a teaching perspective (Koeppen BM. Renal regulation of acid-base balance. Adv Physiol Educ 20: 132-141, 1998), our understanding of the specific membrane transporters involved in H+, HCO , and NH transport, and especially how these…

  7. An Ant-Based Model for Multiple Sequence Alignment

    CERN Document Server

    Guinand, Frédéric

    2008-01-01

    Multiple sequence alignment is a key process in today's biology, and finding a relevant alignment of several sequences is much more challenging than just optimizing some improbable evaluation functions. Our approach for addressing multiple sequence alignment focuses on the building of structures in a new graph model: the factor graph model. This model relies on block-based formulation of the original problem, formulation that seems to be one of the most suitable ways for capturing evolutionary aspects of alignment. The structures are implicitly built by a colony of ants laying down pheromones in the factor graphs, according to relations between blocks belonging to the different sequences.

  8. Introduction of restriction enzyme sites in protein-coding DNA sequences by site-specific mutagenesis not affecting the amino acid sequence: a computer program.

    OpenAIRE

    Arentzen, R; Ripka, W. C.

    1984-01-01

    Structure/function relationship studies of proteins are greatly facilitated by recombinant DNA technology which allows specific amino acid mutations to be made at the DNA sequence level by site-specific mutagenesis employing synthetic oligonucleotides. This technique has been successfully used to alter one or two amino acids in a protein. Replacement of existing DNA sequence coding for several amino acids with new synthetic DNA fragments would be facilitated by the presence of unique restrict...

  9. Protein sequence for clustering DNA based on Artificial Neural Networks

    Directory of Open Access Journals (Sweden)

    Gamal. F. Elhadi

    2012-01-01

    Full Text Available DNA is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organisms and some viruses. Clustering is a process that groups a set of objects into clusters so that the similarity among objects in the same cluster is high, while that among the objects in different clusters is low. In this paper, we proposed an approach for clustering DNA sequences using Self-Organizing Map (SOM algorithm and Protein Sequence. The main objective is to analyze biological data and to bunch DNA to many clusters more easily and efficiently. We use the proposed approach to analyze both large and small amount of input DNA sequences. The results show that the similarity of the sequences does not depend on the amount of input sequences. Our approach depends on evaluating the degree of the DNA sequences similarity using the hierarchal representation Dendrogram. Representing large amount of data using hierarchal tree gives the ability to compare large sequences efficiently

  10. Movement Pattern Analysis Based on Sequence Signatures

    Directory of Open Access Journals (Sweden)

    Seyed Hossein Chavoshi

    2015-09-01

    Full Text Available Increased affordability and deployment of advanced tracking technologies have led researchers from various domains to analyze the resulting spatio-temporal movement data sets for the purpose of knowledge discovery. Two different approaches can be considered in the analysis of moving objects: quantitative analysis and qualitative analysis. This research focuses on the latter and uses the qualitative trajectory calculus (QTC, a type of calculus that represents qualitative data on moving point objects (MPOs, and establishes a framework to analyze the relative movement of multiple MPOs. A visualization technique called sequence signature (SESI is used, which enables to map QTC patterns in a 2D indexed rasterized space in order to evaluate the similarity of relative movement patterns of multiple MPOs. The applicability of the proposed methodology is illustrated by means of two practical examples of interacting MPOs: cars on a highway and body parts of a samba dancer. The results show that the proposed method can be effectively used to analyze interactions of multiple MPOs in different domains.

  11. Swarm-based Sequencing Recommendations in E-learning

    NARCIS (Netherlands)

    Van den Berg, Bert; Tattersall, Colin; Janssen, José; Brouns, Francis; Kurvers, Hub; Koper, Rob

    2005-01-01

    Van den Berg, B., Tattersall, C., Janssen, J., Brouns, F., Kurvers, H., & Koper, R. (2006). Swarm-based Sequencing Recommendations in E-learning. International Journal of Computer Science & Applications, III(III), 1-11.

  12. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    KAUST Repository

    Chen, Peng

    2013-07-23

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  13. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Indian Academy of Sciences (India)

    Meng-Jun Li; Ai-Qin Li; Han Xia; Chuan-Zhi Zhao; Chang-Sheng Li; Shu-Bo Wan; Yu-Ping Bi; Xing-Jun Wang

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, -ketoacyl-ACP synthase (I, II, III), -ketoacyl-ACP reductase, -hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  14. Sequence Context Specific Mutagenesis and Base Excision Repair

    OpenAIRE

    Donigan, Katherine; Sweasy, Joann B.

    2009-01-01

    Base excision repair is critical for the maintenance of genome stability because it repairs at least 20,000 endogenously generated DNA lesions per cell per day. Several enzymes within the base excision repair pathway exhibit sequence context dependency during the excision and DNA synthesis steps of repair. New evidence is emerging that germ line and tumor-associated variants of enzymes in this repair pathway exhibit sequence context dependence that is different from their ancestral counterpar...

  15. Immune and Genetic Algorithm Based Assembly Sequence Planning

    Institute of Scientific and Technical Information of China (English)

    YANG Jian-guo; LI Bei-zhi; YU Lei; JIN Yu-song

    2004-01-01

    In this paper an assembly sequence planning model inspired by natural immune and genetic algorithm (ASPIG) based on the part degrees of freedom matrix (PDFM) is proposed, and a proto system - DSFAS based on the ASPIG is introduced to solve assembly sequence problem. The concept and generation of PDFM and DSFAS are also discussed. DSFAS can prevent premature convergence, and promote population diversity, and can accelerate the learning and convergence speed in behavior evolution problem.

  16. Novel Frequency Hopping Sequences Generator Based on AES Algorithm

    Institute of Scientific and Technical Information of China (English)

    李振荣; 庄奕琪; 张博; 张超

    2010-01-01

    A novel frequency hopping(FH) sequences generator based on advanced encryption standard(AES) iterated block cipher is proposed for FH communication systems.The analysis shows that the FH sequences based on AES algorithm have good performance in uniformity, correlation, complexity and security.A high-speed, low-power and low-cost ASIC of FH sequences generator is implemented by optimizing the structure of S-Box and MixColumns of AES algorithm, proposing a hierarchical power management strategy, and applying ...

  17. DNA sequence analysis with droplet-based microfluidics

    Science.gov (United States)

    Abate, Adam R.; Hung, Tony; Sperling, Ralph A.; Mary, Pascaline; Rotem, Assaf; Agresti, Jeremy J.; Weiner, Michael A.; Weitz, David A.

    2014-01-01

    Droplet-based microfluidic techniques can form and process micrometer scale droplets at thousands per second. Each droplet can house an individual biochemical reaction, allowing millions of reactions to be performed in minutes with small amounts of total reagent. This versatile approach has been used for engineering enzymes, quantifying concentrations of DNA in solution, and screening protein crystallization conditions. Here, we use it to read the sequences of DNA molecules with a FRET-based assay. Using probes of different sequences, we interrogate a target DNA molecule for polymorphisms. With a larger probe set, additional polymorphisms can be interrogated as well as targets of arbitrary sequence. PMID:24185402

  18. Sequence Pattern Correlation of Amino Acid in Collision-induced Dissociation Electrospray Ionization Mass Spectrometry

    Institute of Scientific and Technical Information of China (English)

    SONG,Hao-Wei(宋浩威); YUE,Gui-Hua(岳贵花); LU,Yu(陆宇); YANG,Peng-Yuan(杨芃原); WANG,Hong-Hai(王洪海)

    2002-01-01

    A novel approach of sequence pattern correlation has been applied to predict an expected amino acid sequence from CID ESI-MS spectra. The proposed approach deduces sequence patterns with no help from known protein database such that it is useful to identify an unknown peptide or new protein. The algorithm applies a cross-correlation to match an experimental CID spectrum with predicted sequence pattern generated from fragmentation information. The fragmentation knowledge of both y-series and other non y-series are utilized to generate the predicted sequence patterns. In contrast to the normal de novo approach, the proposed approach is insensitive to mass tolerance and non-susceptive to spectral integrality with no need for selection of a starting point.

  19. Base-acid hybrid water electrolysis.

    Science.gov (United States)

    Chen, Long; Dong, Xiaoli; Wang, Fei; Wang, Yonggang; Xia, Yongyao

    2016-02-21

    A base-acid hybrid electrolytic system with a low onset voltage of 0.78 V for water electrolysis was developed by using a ceramic Li-ion exchange membrane to separate the oxygen-evolving reaction (OER) in a basic electrolyte solution containing the Li-ion and hydrogen-evolving reaction (HER) in an acidic electrolyte solution. PMID:26804323

  20. Boronic acid-based autoligation of nucleic acids

    DEFF Research Database (Denmark)

    Barbeyron, R.; Vasseur, J.-J.; Smietana, M.;

    2013-01-01

    Abstract: The development of synthetic systems displaying dynamic and adaptive characteristics is a formidable challenge with wide applications from biotechnology to therapeutics. Recently, we described a dynamic and programmable nucleic acid-based system relying on the formation of reversible...... boronate internucleosidic linkages. The DNA- or RNA-templated system comprises a 5′-ended boronic acid probe connecting a 3′-ended ribonucleosidic oligonucleotide partner. To explore the dominant factors that control the reversible linkage, we synthesized a series of 3′-end modified ribonucleotidic strands...

  1. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Science.gov (United States)

    2010-07-01

    ... Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of WIPO... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences... Listing” referred to in paragraph (c) of this section must also be submitted in computer readable...

  2. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    Science.gov (United States)

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved into each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolution tree based on the phylogenic parsimony analysis of the C mu 4 sequences. PMID:1382992

  3. New chaos-based encryption scheme for digital sequence

    Institute of Scientific and Technical Information of China (English)

    Zhang Zhengwei; Fan Yangyu; Zeng Li

    2007-01-01

    To enhance the anti-breaking performance of privacy information, this article proposes a new encryption method utilizing the leaping peculiarity of the periodic orbits of chaos systems. This method maps the secret sequence to several chaos periodic orbits, and a short sequence obtained by evolving the system parameters of the periodic orbits in another nonlinear system will be the key to reconstruct these periodic orbits. In the decryption end, the shadowing method of chaos trajectory based on the modified Newton-Raphson algorithm is adopted to restore these system parameters. Through deciding which orbit each pair coordinate falls on, the original digital sequence can be decrypted.

  4. Nanopore-based Fourth-generation DNA Sequencing Technology

    Institute of Scientific and Technical Information of China (English)

    Yanxiao Feng; Yuechuan Zhang; Cuifeng Ying; Deqiang Wang; Chunlei Du

    2015-01-01

    Nanopore-based sequencers, as the fourth-generation DNA sequencing technology, have the potential to quickly and reliably sequence the entire human genome for less than $1000, and possibly for even less than$100. The single-molecule techniques used by this technology allow us to further study the interaction between DNA and protein, as well as between protein and protein. Nanopore analysis opens a new door to molecular biology investigation at the single-molecule scale. In this article, we have reviewed academic achievements in nanopore technology from the past as well as the latest advances, including both biological and solid-state nanopores, and discussed their recent and potential applications.

  5. Complete amino acid sequence of the human alpha 5 (IV) collagen chain and identification of a single-base mutation in exon 23 converting glycine 521 in the collagenous domain to cysteine in an Alport syndrome patient

    DEFF Research Database (Denmark)

    Zhou, J; Hertz, Jens Michael; Leinonen, A;

    1992-01-01

    of genomic DNA from members of a kindred with Alport syndrome revealed a new HindIII cleavage site within the coding sequence of one of the cDNA clones characterized. The proband had a new 1.25-kilobase HindIII fragment and a lack of a 1.35-kilobase fragment, and his mildly affected female cousin had both...

  6. Markov chaotic sequences for correlation based watermarking schemes

    Energy Technology Data Exchange (ETDEWEB)

    Tefas, A.; Nikolaidis, A.; Nikolaidis, N.; Solachidis, V.; Tsekeridou, S.; Pitas, I. E-mail: pitas@zeus.csd.auth.gr

    2003-07-01

    In this paper, statistical analysis of watermarking schemes based on correlation detection is presented. Statistical properties of watermark sequences generated by piecewise-linear Markov maps are exploited, resulting in superior watermark detection reliability. Correlation/spectral properties of such sequences are easily controllable, a fact that affects the watermarking system performance. A family of chaotic maps, namely the skew tent map family, is proposed for use in watermarking schemes.

  7. cDNA-derived amino-acid sequence of a land turtle (Geochelone carbonaria) beta-chain hemoglobin.

    Science.gov (United States)

    Bordin, S; Meza, A N; Saad, S T; Ogo, S H; Costa, F F

    1997-06-01

    The cDNA sequence encoding the turtle Geochelone carbonaria beta-chain was determinated. The isolation of hemoglobin mRNA was based on degenerate primers' PCR in combination with 5'- and 3'-RACE protocol. The full length cDNA is 615 bp with the ATG start codon at position 53 and TGA stop codon at position 495; The AATAAA polyadenylation signal is found at position 599. The deduced polypeptyde contains 146 amino-acid residues. The predicted amino acid sequence shares 83% identity with the beta-globin of a related specie, the aquatic turtle C. p. belli. Otherwise, identity is higher when compared with chicken beta-Hb (80%) than with other reptilian orders (Squamata, 69%, and Crocodilia, 61%). Compared with human HbA, there is 67% identity, and at least three amino acid substitutions could be of some functional significance (Glu43 beta-->Ser, His116 beta-->Thr and His143 beta-->Leu). To our knowledge this represents the first cDNA sequence of a reptile globin gene described. PMID:9238523

  8. Amino acid sequence of the serine-repeat antigen (SERA) of Plasmodium falciparum determined from cloned cDNA.

    Science.gov (United States)

    Bzik, D J; Li, W B; Horii, T; Inselburg, J

    1988-09-01

    We report the isolation of cDNA clones for a Plasmodium falciparum gene that encodes the complete amino acid sequence of a previously identified exported blood stage antigen. The Mr of this antigen protein had been determined by sodium dodecylsulphate-polyacrylamide gel electrophoresis analysis, by different workers, to be 113,000, 126,000, and 140,000. We show, by cDNA nucleotide sequence analysis, that this antigen gene encodes a 989 amino acid protein (111 kDa) that contains a potential signal peptide, but not a membrane anchor domain. In the FCR3 strain the serine content of the protein was 11%, of which 57% of the serine residues were localized within a 201 amino acid sequence that included 35 consecutive serine residues. The protein also contained three possible N-linked glycosylation sites and numerous possible O-linked glycosylation sites. The mRNA was abundant during late trophozoite-schizont parasite stages. We propose to identity this antigen, which had been called p126, by the acronym SERA, serine-repeat antigen, based on its complete structure. The usefulness of the cloned cDNA as a source of a possible malaria vaccine is considered in view of the previously demonstrated ability of the antigen to induce parasite-inhibitory antibodies and a protective immune response in Saimiri monkeys. PMID:2847041

  9. A Direct, Biomass-Based Synthesis of Benzoic Acid: Formic Acid-Mediated Deoxygenation of the Glucose-Derived Materials Quinic Acid and Shikimic Acid

    Energy Technology Data Exchange (ETDEWEB)

    Arceo, Elena; Ellman, Jonathan; Bergman, Robert

    2010-05-03

    An alternative biomass-based route to benzoic acid from the renewable starting materials quinic acid and shikimic acid is described. Benzoic acid is obtained selectively using a highly efficient, one-step formic acid-mediated deoxygenation method.

  10. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    Science.gov (United States)

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided. PMID:11414222

  11. Solexa sequencing based transcriptome analysis of Helicoverpa armigera larvae.

    Science.gov (United States)

    Li, Jigang; Li, Xiumin; Chen, Yongli; Yang, Zhongxiang; Guo, Sandui

    2012-12-01

    Helicoverpa armigera (Hübner) is a polyphagous Lepidoptera pest which causes great economic losses in crop production worldwide. In contrast to its agricultural importance, advances in the molecular aspects of this insect are quite limited. In the present study, Illumina's SOLEXA sequencing was adopted to determine the transcriptome of young H. armigera larvae. About 7 gigabases of raw sequence data was generated and assembled into 116,601 contigs with an average length of 389 base pairs after data preprocess. 37,352 of these contigs were annotated by searching against Uniref 100 of UniProt database. The annotated sequences were functionally classified into three groups including biological process (15,632 sequences), cellular component (9,562 sequences) and molecular function (19,258 sequences). KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis showed that 1,409 contigs predicted to encode enzymes with enzyme commission numbers were mapped into 220 KEGG pathways in total. Finally, contigs with simple sequence repeats were derived from this dataset. PMID:23065207

  12. The developmental transcriptome landscape of bovine skeletal muscle defined by Ribo-Zero ribonucleic acid sequencing.

    Science.gov (United States)

    Sun, X; Li, M; Sun, Y; Cai, H; Li, R; Wei, X; Lan, X; Huang, Y; Lei, C; Chen, H

    2015-12-01

    Ribonucleic acid sequencing (RNA-Seq) libraries are normally prepared with oligo(dT) selection of poly(A)+ mRNA, but it depends on intact total RNA samples. Recent studies have described Ribo-Zero technology, a novel method that can capture both poly(A)+ and poly(A)- transcripts from intact or fragmented RNA samples. We report here the first application of Ribo-Zero RNA-Seq for the analysis of the bovine embryonic, neonatal, and adult skeletal muscle whole transcriptome at an unprecedented depth. Overall, 19,893 genes were found to be expressed, with a high correlation of expression levels between the calf and the adult. Hundreds of genes were found to be highly expressed in the embryo and decreased at least 10-fold after birth, indicating their potential roles in embryonic muscle development. In addition, we present for the first time the analysis of global transcript isoform discovery in bovine skeletal muscle and identified 36,694 transcript isoforms. Transcriptomic data were also analyzed to unravel sequence variations; 185,036 putative SNP and 12,428 putative short insertions-deletions (InDel) were detected. Specifically, many stop-gain, stop-loss, and frameshift mutations were identified that probably change the relative protein production and sequentially affect the gene function. Notably, the numbers of stage-specific transcripts, alternative splicing events, SNP, and InDel were greater in the embryo than in the calf and the adult, suggesting that gene expression is most active in the embryo. The resulting view of the transcriptome at a single-base resolution greatly enhances the comprehensive transcript catalog and uncovers the global trends in gene expression during bovine skeletal muscle development. PMID:26641174

  13. Thermodynamics-based models of transcriptional regulation with gene sequence.

    Science.gov (United States)

    Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing

    2015-12-01

    Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.

  14. DNA sequence analysis using hierarchical ART-based classification networks

    Energy Technology Data Exchange (ETDEWEB)

    LeBlanc, C.; Hruska, S.I. [Florida State Univ., Tallahassee, FL (United States); Katholi, C.R.; Unnasch, T.R. [Univ. of Alabama, Birmingham, AL (United States)

    1994-12-31

    Adaptive resonance theory (ART) describes a class of artificial neural network architectures that act as classification tools which self-organize, work in real-time, and require no retraining to classify novel sequences. We have adapted ART networks to provide support to scientists attempting to categorize tandem repeat DNA fragments from Onchocerca volvulus. In this approach, sequences of DNA fragments are presented to multiple ART-based networks which are linked together into two (or more) tiers; the first provides coarse sequence classification while the sub- sequent tiers refine the classifications as needed. The overall rating of the resulting classification of fragments is measured using statistical techniques based on those introduced to validate results from traditional phylogenetic analysis. Tests of the Hierarchical ART-based Classification Network, or HABclass network, indicate its value as a fast, easy-to-use classification tool which adapts to new data without retraining on previously classified data.

  15. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    Science.gov (United States)

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand. PMID:21402111

  16. The amino acid sequence of cytochrome c from Cucurbita maxima L. (pumpkin)

    Science.gov (United States)

    Thompson, E. W.; Richardson, M.; Boulter, D.

    1971-01-01

    The amino acid sequence of pumpkin cytochrome c was determined on 2μmol of protein. Some evidence was found for the occurrence of two forms of cytochrome c, whose sequences differed in three positions. Pumpkin cytochrome c consists of 111 residues and is homologous with mitochondrial cytochromes c from other plants. Experimental details are given in a supplementary paper that has been deposited as Supplementary Publication SUP 50005 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1971), 121, 7. PMID:5131733

  17. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...... alignment by 'reverse translation' of the aligned protein sequences. In the resulting DNA alignment, gaps occur in groups of three corresponding to entire codons, and analogous codon positions are therefore always lined up. These features are useful when constructing multiple DNA alignments for phylogenetic...

  18. Site-directed gene mutation at mixed sequence targets by psoralen-conjugated pseudo-complementary peptide nucleic acids.

    Science.gov (United States)

    Kim, Ki-Hyun; Nielsen, Peter E; Glazer, Peter M

    2007-01-01

    Sequence-specific DNA-binding molecules such as triple helix-forming oligonucleotides (TFOs) provide a means for inducing site-specific mutagenesis and recombination at chromosomal sites in mammalian cells. However, the utility of TFOs is limited by the requirement for homopurine stretches in the target duplex DNA. Here, we report the use of pseudo-complementary peptide nucleic acids (pcPNAs) for intracellular gene targeting at mixed sequence sites. Due to steric hindrance, pcPNAs are unable to form pcPNA-pcPNA duplexes but can bind to complementary DNA sequences by Watson-Crick pairing via double duplex-invasion complex formation. We show that psoralen-conjugated pcPNAs can deliver site-specific photoadducts and mediate targeted gene modification within both episomal and chromosomal DNA in mammalian cells without detectable off-target effects. Most of the induced psoralen-pcPNA mutations were single-base substitutions and deletions at the predicted pcPNA-binding sites. The pcPNA-directed mutagenesis was found to be dependent on PNA concentration and UVA dose and required matched pairs of pcPNAs. Neither of the individual pcPNAs alone had any effect nor did complementary PNA pairs of the same sequence. These results identify pcPNAs as new tools for site-specific gene modification in mammalian cells without purine sequence restriction, thereby providing a general strategy for designing gene targeting molecules. PMID:17977869

  19. Accuracy of structure-based sequence alignment of automatic methods

    Directory of Open Access Journals (Sweden)

    Lee Byungkook

    2007-09-01

    similarity is low, structure-based methods produce better sequence alignments than by using sequence similarities alone. However, current structure-based methods still mis-align 11–19% of the conserved core residues when compared to the human-curated CDD alignments. The alignment quality of each program depends on the protein structural type and similarity, with DaliLite showing the most agreement with CDD on average.

  20. Sequence-specific nucleic acid detection from binary pore conductance measurement

    OpenAIRE

    Esfandiari, Leyla; Monbouquette, Harold G.; Jacob J. Schmidt

    2012-01-01

    We describe a platform for sequence-specific nucleic acid (NA) detection utilizing a micropipette tapered to a 2 μm diameter pore and 3 μm diameter polystyrene beads to which uncharged peptide nucleic acid (PNA) probe molecules have been conjugated. As the target NAs hybridize to the complementary PNA-beads, the beads acquire negative charge and become electrophoretically mobile. An applied electric field guides these NA-PNA-beads toward the pipette tip, which they obstruct, leading to an ind...

  1. Repeat Sequences and Base Correlations in Human Y Chromosome Palindromes

    Institute of Scientific and Technical Information of China (English)

    Neng-zhi Jin; Zi-xian Liu; Yan-jiao Qi; Wen-yuan Qiu

    2009-01-01

    On the basis of information theory and statistical methods, we use mutual information, n-tuple entropy and conditional entropy, combined with biological characteristics, to analyze the long range correlation and short range correlation in human Y chromosome palindromes. The magnitude distribution of the long range correlation which can be reflected by the mutual information is P5>P5a>P5b (P5a and P5b are the sequences that replace solely Alu repeats and all interspersed repeats with random uncorrelated sequences in human Y chromosome palindrome 5, respectively); and the magnitude distribution of the short range correlation which can be reflected by the n-tuple entropy and the conditional entropy is P5>P5a>P5b>random uncorrelated sequence. In other words, when the Alu repeats and all interspersed repeats replace with random uncorrelated sequence, the long range and short range correlation decrease gradually. However, the random uncorrelated sequence has no correlation. This research indicates that more repeat sequences result in stronger correlation between bases in human Y chromosome. The analyses may be helpful to understand the special structures of human Y chromosome palindromes profoundly.

  2. Self-Sequencing of Amino Acids and Origins of Polyfunctional Protocells

    Science.gov (United States)

    Fox, Sidney W.

    1984-12-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells.

  3. Purification and amino acid sequence of a bacteriocins produced by Lactobacillus salivarius K7 isolated from chicken intestine

    Directory of Open Access Journals (Sweden)

    Kenji Sonomoto

    2006-03-01

    Full Text Available A bacteriocin-producing strain, Lactobacillus K7, was isolated from a chicken intestine. The inhibitory activity was determined by spot-on-lawn technique. Identification of the strain was performed by morphological, biochemical (API 50 CH kit and molecular genetic (16S rDNA basis. Bacteriocin purification processes were carried out by amberlite adsorption, cation exchange and reverse-phase high perform- ance liquid chromatography. N-terminal amino acid sequences were performed by Edman degradation. Molecular mass was determined by electrospray-ionization (ESI mass spectrometry (MS. Lactobacillus K7 showed inhibitory activity against Lactobacillus sakei subsp. sakei JCM 1157T, Leuconostoc mesenteroides subsp. mesenteroides JCM 6124T and Bacillus coagulans JCM 2257T. This strain was identified as Lb. salivarius. The antimicrobial substance was destroyed by proteolytic enzymes, indicating its proteinaceous structure designated as a bacteriocin type. The purification of bacteriocin by amberlite adsorption, cation exchange, and reverse-phase chromatography resulted in only one single active peak, which was designated FK22. Molecular weight of this fraction was 4331.70 Da. By amino acid sequence, this peptide was homology to Abp 118 beta produced by Lb. salivarius UCC118. In addition, Lb. salivarius UCC118 produced 2-peptide bacteriocin, which was Abp 118 alpha and beta. Based on the partial amino acid sequences of Abp 118 beta, specific primers were designed from nucleotide sequences according to data from GenBank. The result showed that the deduced peptide was high homology to 2-peptide bacteriocin, Abp 118 alpha and beta.

  4. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.

    Directory of Open Access Journals (Sweden)

    Brejnev Muhizi Muhire

    Full Text Available The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV. There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT, a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms.

  5. Anti-peptide aptamers recognize amino acid sequence and bind a protein epitope.

    OpenAIRE

    Xu, W; Ellington, A. D.

    1996-01-01

    In vitro selection of nucleic acid binding species (aptamers) is superficially similar to the immune response. Both processes produce biopolymers that can recognize targets with high affinity and specificity. While antibodies are known to recognize the sequence and conformation of protein surface features (epitopes), very little is known about the precise interactions between aptamers and their epitopes. Therefore, aptamers that could recognize a particular epitope, a peptide fragment of huma...

  6. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  7. Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

    Directory of Open Access Journals (Sweden)

    Michael Andrew Meyer

    2013-07-01

    Full Text Available The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4 to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3. For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  8. Hypokalemic paralysis and acid-base balance

    Directory of Open Access Journals (Sweden)

    Ivo Casagranda

    2006-10-01

    Full Text Available Three cases of hypokalemic paralysis are reported, presenting to the Emergency Department. The first is a patient with a hypokalemic periodic paralysis with a normal acid-base status, the second is a case of hypokalemic flaccid paralysis of all extremities with a normal anion gap metabolic acidosis, the last is a patient with a hypokalemic distal paralysis of right upper arm with metabolic alkalosis. Afterwards some pathophysiologic principles and the clinical aspects of hypokalemia are discussed and an appropriate approach to do in Emergency Department, to identify the hypokalemic paralysis etiologies in the Emergency Department, is presented, beginning from the evaluation of acid-base status.

  9. Which Microbial Communities Are Present? Sequence-Based Metagenomics

    Science.gov (United States)

    Caffrey, Sean M.

    The use of metagenomic methods that directly sequence environmental samples has revealed the extraordinary microbial diversity missed by traditional culture-based methodologies. Therefore, to develop a complete and representative model of an environment's microbial community and activities, metagenomic analysis is an essential tool.

  10. Identification of Amino Acid Sequences with Good Folding Properties in an Off-Lattice Model

    CERN Document Server

    Irbäck, Anders; Potthast, Frank

    2008-01-01

    Folding properties of a two-dimensional toy protein model containing only two amino-acid types, hydrophobic and hydrophilic, respectively, are analyzed. An efficient Monte Carlo procedure is employed to ensure that the ground states are found. The thermodynamic properties are found to be strongly sequence dependent in contrast to the kinetic ones. Hence, criteria for good folders are defined entirely in terms of thermodynamic fluctuations. With these criteria sequence patterns that fold well are isolated. For 300 chains with 20 randomly chosen binary residues approximately 10% meet these criteria. Also, an analysis is performed by means of statistical and artificial neural network methods from which it is concluded that the folding properties can be predicted to a certain degree given the binary numbers characterizing the sequences.

  11. Sequence-selective targeting of duplex DNA by peptide nucleic acids

    DEFF Research Database (Denmark)

    Nielsen, Peter E

    2010-01-01

    Sequence-selective gene targeting constitutes an attractive drug-discovery approach for genetic therapy, with the aim of reducing or enhancing the activity of specific genes at the transcriptional level, or as part of a methodology for targeted gene repair. The pseudopeptide DNA mimic peptide...... nucleic acid (PNA) can recognize duplex DNA with high sequence specificity and affinity in triplex, duplex and double-duplex invasive modes or non-invasive triplex modes. Novel PNA modification has improved the affinity for DNA recognition via duplex invasion, double-duplex invasion and triplex...... recognition considerably. Such modifications have also resulted in new approaches to targeted gene repair and sequence-selective double-strand cleavage of genomic DNA....

  12. Random amino acid mutations and protein misfolding lead to Shannon limit in sequence-structure communication.

    Directory of Open Access Journals (Sweden)

    Andreas Martin Lisewski

    Full Text Available The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close to native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions and in structure (structural defects trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a sensitive to random errors and (b restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials.

  13. Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

    Directory of Open Access Journals (Sweden)

    Samat Kozhakhmetov

    2014-01-01

    Full Text Available Introduction: Lactobacilli are a bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microoganism, generally regarded as safe (GRAS. It is also a homofermentative L-(+-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of the phenotypic and genotypic diversity. However, the studies of the genus were conducted mostly with the unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of Center for life sciences. After screaning isolates for probiotic properties, a highly active Lactobacillus spp strain was chosen. Genomic DNA was extracted according to the manufacturing protocol (Wizard® Genomic DNA Purification Kit. The Lactobacillus rhamnosus strain was identified as the highly active Lactobacillus strain accoridng to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX platforms. The initial draft assembly was prepared from 14 large contigs (20 all contigs by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT. Conclusion: A full genome-sequencing of selected strains of lactic acid bacteria was made during the study.

  14. Multiple Base Substitution Corrections in DNA Sequence Evolution

    Science.gov (United States)

    Kowalczuk, M.; Mackiewicz, P.; Szczepanik, D.; Nowicka, A.; Dudkiewicz, M.; Dudek, M. R.; Cebrat, S.

    We discuss the Jukes and Cantor's one-parameter model and Kimura's two-parameter model unability to describe evolution of asymmetric DNA molecules. The standard distance measure between two DNA sequences, which is the number of substitutions per site, should include the effect of multiple base substitutions separately for each type of the base. Otherwise, the respective tables of substitutions cannot reconstruct the asymmetric DNA molecule with respect to the composition. Basing on Kimura's neutral theory, we have derived a linear law for the correlation of the mean survival time of nucleotides under constant mutation pressure and their fraction in the genome. According to the law, the corrections to Kimura's theory have been discussed to describe evolution of genomes with asymmetric nucleotide composition. We consider the particular case of the strongly asymmetric Borrelia burgdorferi genome and we discuss in detail the corrections, which should be introduced into the distance measure between two DNA sequences to include multiple base substitutions.

  15. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    International Nuclear Information System (INIS)

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of ≅ 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH2-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung

  16. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Warr, R.G.; Hawgood, S.; Buckley, D.I.; Crisp, T.M.; Schilling, J.; Benson, B.J.; Ballard, P.L.; Clements, J.A.; White, R.T.

    1987-11-01

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of approx. = 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH/sub 2/-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung.

  17. DNA sequence analysis with droplet-based microfluidics

    OpenAIRE

    Abate, Adam R.; Hung, Tony; Sperling, Ralph A.; Mary, Pascaline; Rotem, Assaf; Agresti, Jeremy J.; Weiner, Michael A.; Weitz, David A.

    2013-01-01

    Droplet-based microfluidic techniques can form and process micrometer scale droplets at thousands per second. Each droplet can house an individual biochemical reaction, allowing millions of reactions to be performed in minutes with small amounts of total reagent. This versatile approach has been used for engineering enzymes, quantifying concentrations of DNA in solution, and screening protein crystallization conditions. Here, we use it to read the sequences of DNA molecules with a FRET-based ...

  18. A New Images Hiding Scheme Based on Chaotic Sequences

    Institute of Scientific and Technical Information of China (English)

    LIU Nian-sheng; GUO Dong-hui; WU Bo-xi; Parr G

    2005-01-01

    We propose a data hidding technique in a still image. This technique is based on chaotic sequence in the transform domain of covert image. We use different chaotic random sequences multiplied by multiple sensitive images, respectively, to spread the spectrum of sensitive images. Multiple sensitive images are hidden in a covert image as a form of noise. The results of theoretical analysis and computer simulation show the new hiding technique have better properties with high security, imperceptibility and capacity for hidden information in comparison with the conventional scheme such as LSB (Least Significance Bit).

  19. 3D Motion Parameters Determination Based on Binocular Sequence Images

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Exactly capturing three dimensional (3D) motion information of an object is an essential and important task in computer vision, and is also one of the most difficult problems. In this paper, a binocular vision system and a method for determining 3D motion parameters of an object from binocular sequence images are introduced. The main steps include camera calibration, the matching of motion and stereo images, 3D feature point correspondences and resolving the motion parameters. Finally, the experimental results of acquiring the motion parameters of the objects with uniform velocity and acceleration in the straight line based on the real binocular sequence images by the mentioned method are presented.

  20. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker "retinoic acid" in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5'-AGGTCA-3') in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and embryological

  1. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker "retinoic acid" in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5'-AGGTCA-3') in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and embryological

  2. Local Sequence Information-based Support Vector Machine to Classify Voltage-gated Potassium Channels

    Institute of Scientific and Technical Information of China (English)

    Li-Xia LIU; Meng-Long LI; Fu-Yuan TAN; Min-Chun LU; Ke-Long WANG; Yan-Zhi GUO; Zhi-Ning WEN; Lin JIANG

    2006-01-01

    In our previous work, we developed a computational tool, PreK-ClassK-ClassKv, to predict and classify potassium (K+) channels. For K+ channel prediction (PreK) and classification at family level (ClassK), this method performs well. However, it does not perform so well in classifying voltage-gated potassium (Kv) channels (ClassKv). In this paper, a new method based on the local sequence information of Kv channels is introduced to classify Kv channels. Six transmembrane domains of a Kv channel protein are used to define a protein, and the dipeptide composition technique is used to transform an amino acid sequence to a numerical sequence. A Kv channel protein is represented by a vector with 2000 elements, and a support vector machine algorithm is applied to classify Kv channels. This method shows good performance with averages of total accuracy (Acc), sensitivity (SE), specificity (SP); reliability (R) and Matthews correlation coefficient (MCC) of 98.0%, 89.9%, 100%, 0.95 and 0.94 respectively. The results indicate that the local sequence information-based method is better than the global sequence information-based method to classify Kv channels.

  3. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    Science.gov (United States)

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  4. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    OpenAIRE

    Yan Koon-Kiu; Axelsen Jacob; Maslov Sergei

    2005-01-01

    Abstract Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model...

  5. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    NARCIS (Netherlands)

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which mi

  6. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    OpenAIRE

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which might shed light onto the enzymatic machineries that are involved in its production of biogenic amines.

  7. Steganalytic method based on short and repeated sequence distance statistics

    Institute of Scientific and Technical Information of China (English)

    WANG GuoXin; PING XiJian; XU ManKun; ZHANG Tao; BAO XiRui

    2008-01-01

    According to the distribution characteristics of short and repeated sequence (SRS),a steganalytic method based on the correlation of image bit planes is proposed.Firstly,we provide the conception of SRS distance statistics and deduce its statistical distribution.Because the SRS distance statistics can effectively reflect the correlation of the sequence,SRS has statistical features when the image bit plane sequence equals the image width.Using this characteristic,the steganalytic method is fulfilled by the distinct test of Poisson distribution.Experimental results show a good performance for detecting LSB matching steganographic method in still images.By the way,the proposed method is not designed for specific steganographic algorithms and has good generality.

  8. Nanopore-based Fourth-generation DNA Sequencing Technology

    Directory of Open Access Journals (Sweden)

    Yanxiao Feng

    2015-02-01

    Full Text Available Nanopore-based sequencers, as the fourth-generation DNA sequencing technology, have the potential to quickly and reliably sequence the entire human genome for less than $1000, and possibly for even less than $100. The single-molecule techniques used by this technology allow us to further study the interaction between DNA and protein, as well as between protein and protein. Nanopore analysis opens a new door to molecular biology investigation at the single-molecule scale. In this article, we have reviewed academic achievements in nanopore technology from the past as well as the latest advances, including both biological and solid-state nanopores, and discussed their recent and potential applications.

  9. Spectroscopic investigation on the telomeric DNA base sequence repeat

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Telomeres are protein-DNA complexes at the terminals of linear chromosomes, which protect chromosomal integrity and maintain cellular replicative capacity.From single-cell organisms to advanced animals and plants,structures and functions of telomeres are both very conservative. In cells of human and vertebral animals, telomeric DNA base sequences all are (TTAGGG)n. In the present work, we have obtained absorption and fluorescence spectra measured from seven synthesized oligonucleotides to simulate the telomeric DNA system and calculated their relative fluorescence quantum yields on which not only telomeric DNA characteristics are predicted but also possibly the shortened telomeric sequences during cell division are imrelative fluorescence quantum yield and remarkable excitation energy innerconversion, which tallies with the telomeric sequence of (TTAGGG)n. This result shows that telomeric DNA has a strong non-radiative or innerconvertible capability.``

  10. Fermented probiotic beverages based on acid whey

    Directory of Open Access Journals (Sweden)

    Katarzyna Skryplonek

    2015-12-01

    Full Text Available Background. Production of fermented probiotic beverages can be a good method for acid whey usage. The obtained products combine a high nutritional value of whey with health benefits claimed for probiotic bac- teria. The aim of the study was to define quality properties of beverages based on fresh acid whey and milk with addition of buttermilk powder or sweet whey powder. Material and methods. Samples were inoculated with two strains of commercial probiotic cultures: Lac- tobacillus acidophilus La-5 or Bifidobacterium animalis Bb-12. After fermentation, samples were stored at refrigerated conditions. After 1, 4, 7, 14 and 21 days sensory characteristics, hardness, acetaldehyde content, titratable acidity, pH acidity and count of bacteria cells were evaluated. Results. Throughout all storage period, the number of bacteria was higher than 8 log cfu/ml in the all sam- ples. Beverages with La-5 strain had higher hardness and acidity, whilst samples with Bb-12 contained more acetaldehyde. Samples with buttermilk powder had better sensory properties than with sweet whey powder. Conclusions. Obtained products made of acid whey combined with milk and fortified with buttermilk pow- der or sweet whey powder, are good medium for growth and survival of examined probiotic bacteria strains. The level of bacteria was sufficient to provide health benefits to consumers.

  11. DUK - A Fast and Efficient Kmer Based Sequence Matching Tool

    Energy Technology Data Exchange (ETDEWEB)

    Li, Mingkun; Copeland, Alex; Han, James

    2011-03-21

    A new tool, DUK, is developed to perform matching task. Matching is to find whether a query sequence partially or totally matches given reference sequences or not. Matching is similar to alignment. Indeed many traditional analysis tasks like contaminant removal use alignment tools. But for matching, there is no need to know which bases of a query sequence matches which position of a reference sequence, it only need know whether there exists a match or not. This subtle difference can make matching task much faster than alignment. DUK is accurate, versatile, fast, and has efficient memory usage. It uses Kmer hashing method to index reference sequences and Poisson model to calculate p-value. DUK is carefully implemented in C++ in object oriented design. The resulted classes can also be used to develop other tools quickly. DUK have been widely used in JGI for a wide range of applications such as contaminant removal, organelle genome separation, and assembly refinement. Many real applications and simulated dataset demonstrate its power.

  12. An Uncompressed Image Encryption Algorithm Based on DNA Sequences

    Directory of Open Access Journals (Sweden)

    Shima Ramesh Maniyath

    2011-07-01

    Full Text Available The rapid growth of the Internet and digitized content made image and video distribution simpler. Hence the need for image and video data protection is on the rise. In this paper, we propose a secure and computationally feasible image and video encryption/decryption algorithm based on DNA sequences. The main purpose of this algorithm is to reduce the big image encryption time. This algorithm is implemented by using the natural DNA sequences as main keys. The first part is the process of pixel scrambling. The original image is confused in the light of the scrambling sequence which is generated by the DNA sequence. The second part is the process of pixel replacement. The pixel gray values of the new image and the one of the three encryption templates generated by the other DNA sequence are XORed bit-by-bit in turn. The main scope of this paper is to propose an extension of this algorithm to videos and making it secure using modern Biological technology. A security analysis for the proposed system is performed and presented.

  13. Jigsaw Cooperative Learning: Acid-Base Theories

    Science.gov (United States)

    Tarhan, Leman; Sesen, Burcin Acar

    2012-01-01

    This study focused on investigating the effectiveness of jigsaw cooperative learning instruction on first-year undergraduates' understanding of acid-base theories. Undergraduates' opinions about jigsaw cooperative learning instruction were also investigated. The participants of this study were 38 first-year undergraduates in chemistry education…

  14. Glycine/Glycolic acid based copolymers

    NARCIS (Netherlands)

    Veld, in 't Peter J.A.; Shen, Zheng-Rong; Takens, Gijsbert A.J.; Dijkstra, Pieter J.; Feijen, Jan

    1994-01-01

    Glycine/glycolic acid based biodegradable copolymers have been prepared by ring-opening homopolymerization of morpholine-2,5-dione, and ring-opening copolymerization of morpholine-2,5-dione and glycolide. The homopolymerization of morpholine-2,5-dione was carried out in the melt at 200°C for 3 min u

  15. Solid-State Nanopore-Based DNA Sequencing Technology

    Directory of Open Access Journals (Sweden)

    Zewen Liu

    2016-01-01

    Full Text Available The solid-state nanopore-based DNA sequencing technology is becoming more and more attractive for its brand new future in gene detection field. The challenges that need to be addressed are diverse: the effective methods to detect base-specific signatures, the control of the nanopore’s size and surface properties, and the modulation of translocation velocity and behavior of the DNA molecules. Among these challenges, the realization of the high-quality nanopores with the help of modern micro/nanofabrication technologies is a crucial one. In this paper, typical technologies applied in the field of solid-state nanopore-based DNA sequencing have been reviewed.

  16. N-terminal amino acid sequences and some characteristics of fibrinolytic/hemorrhagic metalloproteinases purified from Bothrops jararaca venom.

    Science.gov (United States)

    Maruyama, Masugi; Sugiki, Masahiko; Anai, Keita; Yoshida, Etsuo

    2002-08-01

    We determined the N-terminal amino acid sequences of the fibrinolytic/hemorrhagic metalloproteinases (jararafibrases I, III and IV) purified from Bothrops jararaca venom. The N-terminal amino acid sequences of jararafibrase I and its degradation products were identical to those of jararhagin, another hemorrhagic metalloproteinase purified from the same snake venom. Together with enzymatic and immunological properties, we concluded that those two enzymes are identical. The N-terminal amino acid sequence of jararafibrase III was quite similar to C-type lectin isolated from Crotalus atrox, and the protein had a hemagglutinating activity on intact rat red blood cells. PMID:12165326

  17. Evolutionary insights from suffix array-based genome sequence analysis

    Indian Academy of Sciences (India)

    Anindya Poddar; Nagasuma Chandra; Madhavi Ganapathiraju; K Sekar; Judith Klein-Seetharaman; Raj Reddy; N Balakrishnan

    2007-08-01

    Gene and protein sequence analyses, central components of studies in modern biology are easily amenable to string matching and pattern recognition algorithms. The growing need of analysing whole genome sequences more efficiently and thoroughly, has led to the emergence of new computational methods. Suffix trees and suffix arrays are data structures, well known in many other areas and are highly suited for sequence analysis too. Here we report an improvement to the design of construction of suffix arrays. Enhancement in versatility and scalability, enabled by this approach, is demonstrated through the use of real-life examples. The scalability of the algorithm to whole genomes renders it suitable to address many biologically interesting problems. One example is the evolutionary insight gained by analysing unigrams, bi-grams and higher n-grams, indicating that the genetic code has a direct influence on the overall composition of the genome. Further, different proteomes have been analysed for the coverage of the possible peptide space, which indicate that as much as a quarter of the total space at the tetra-peptide level is left un-sampled in prokaryotic organisms, although almost all tri-peptides can be seen in one protein or another in a proteome. Besides, distinct patterns begin to emerge for the counts of particular tetra and higher peptides, indicative of a ‘meaning’ for tetra and higher n-grams. The toolkit has also been used to demonstrate the usefulness of identifying repeats in whole proteomes efficiently. As an example, 16 members of one COG, coded by the genome of Mycobacterium tuberculosis H37Rv have been found to contain a repeating sequence of 300 amino acids.

  18. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  19. Sequencing-based typing reveals new insight in HLA-DPA1 polymorphism.

    Science.gov (United States)

    Rozemuller, E H; Bouwens, A G; van Oort, E; Versluis, L F; Marsh, S G; Bodmer, J G; Tilanus, M G

    1995-01-01

    An HLA-DPA1 sequencing-based typing (SBT) system has been developed to identify DPA1 alleles. Up to now eight DPA1 alleles have been defined. Six can be discriminated based upon exon 2 polymorphism. The three subtypes of DPA1*01: DPA1*0101, DPA1*0102 and DPA1*0103, have identical exon 2 sequences but show differences in exon 4. Exon 4 sequences were known for only the three DPA1*01 subtypes and for DPA1*0201. We now present additional sequence information for exon 4 and the unknown segments at the 3' end of exon 2. Additionally with the use of this sequencing technique it is also possible to identify previously unidentified polymorphism. We have studied the exon 2 and exon 4 polymorphism of DPA1 in 40 samples which include all known DPA1 alleles. A new allele, DPA1*01 new, was identified which differs by one nucleotide in exon 2 from DPA1*0103, resulting in an aspartic acid at codon 28. The DPA1*01 subtypes DPA1*0101 and DPA1*0102 could not be confirmed in samples which previously were used to define these subtypes, and consequently they do not exist. The exon 4 sequence of DPA1*0201 is corrected based on sequence data of DAUDI, the cell line in which DPA1*0202 was originally defined. The exon 4 regions of the remaining four alleles were resolved: the exon 4 regions of the alleles DPA1*02021 and DPA1*02022 were found to be identical to the--corrected--DPA1*0201 whereas the exon 4 region of DPA1*0301 differs by one nucleotide compared to DPA1*0103. The DPA1*0401 exon 4 region differs by one nucleotide compared to the corrected DPA1*0201.(ABSTRACT TRUNCATED AT 250 WORDS)

  20. Revision of Begomovirus taxonomy based on pairwise sequence comparisons

    KAUST Repository

    Brown, Judith K.

    2015-04-18

    Viruses of the genus Begomovirus (family Geminiviridae) are emergent pathogens of crops throughout the tropical and subtropical regions of the world. By virtue of having a small DNA genome that is easily cloned, and due to the recent innovations in cloning and low-cost sequencing, there has been a dramatic increase in the number of available begomovirus genome sequences. Even so, most of the available sequences have been obtained from cultivated plants and are likely a small and phylogenetically unrepresentative sample of begomovirus diversity, a factor constraining taxonomic decisions such as the establishment of operationally useful species demarcation criteria. In addition, problems in assigning new viruses to established species have highlighted shortcomings in the previously recommended mechanism of species demarcation. Based on the analysis of 3,123 full-length begomovirus genome (or DNA-A component) sequences available in public databases as of December 2012, a set of revised guidelines for the classification and nomenclature of begomoviruses are proposed. The guidelines primarily consider a) genus-level biological characteristics and b) results obtained using a standardized classification tool, Sequence Demarcation Tool, which performs pairwise sequence alignments and identity calculations. These guidelines are consistent with the recently published recommendations for the genera Mastrevirus and Curtovirus of the family Geminiviridae. Genome-wide pairwise identities of 91 % and 94 % are proposed as the demarcation threshold for begomoviruses belonging to different species and strains, respectively. Procedures and guidelines are outlined for resolving conflicts that may arise when assigning species and strains to categories wherever the pairwise identity falls on or very near the demarcation threshold value.

  1. Generation of deviation parameters for amino acid singlets, doublets and triplets from three-dimensional structures of proteins and its implications for secondary structure prediction from amino acid sequences

    Indian Academy of Sciences (India)

    S A Mugilan; K Veluraja

    2000-03-01

    We present a new method, secondary structure prediction by deviation parameter (SSPDP) for predicting the secondary structure of proteins from amino acid sequence. Deviation parameters (DP) for amino acid singlets, doublets and triplets were computed with respect to secondary structural elements of proteins based on the dictionary of secondary structure prediction (DSSP)-generated secondary structure for 408 selected non-homologous proteins. To the amino acid triplets which are not found in the selected dataset, a DP value of zero is assigned with respect to the secondary structural elements of proteins. The total number of parameters generated is 15,432, in the possible parameters of 25,260. Deviation parameter is complete with respect to amino acid singlets, doublets, and partially complete with respect to amino acid triplets. These generated parameters were used to predict secondary structural elements from amino acid sequence. The secondary structure predicted by our method (SSPDP) was compared with that of single sequence (NNPREDICT) and multiple sequence (PHD) methods. The average value of the percentage of prediction accuracy for α-helix by SSPDP, NNPREDICT and PHD methods was found to be 57%, 44% and 69% respectively for the proteins in the selected dataset. For -strand the prediction accuracy is found to be 69%, 21% and 53% respectively by SSPDP, NNPREDICT and PHD methods. This clearly indicates that the secondary structure prediction by our method is as good as PHD method but much better than NNPREDICT method.

  2. Development of a SCAR (sequence-characterised amplified region) marker for acid resistance-related gene in Lactobacillus plantarum.

    Science.gov (United States)

    Liu, Shu-Wen; Li, Kai; Yang, Shi-Ling; Tian, Shu-Fen; He, Ling

    2015-03-01

    A sequence characterised amplified region marker was developed to determine an acid resistance-related gene in Lactobacillus plantarum. A random amplified polymorphic DNA marker named S116-680 was reported to be closely related to the acid resistance of the strains. The DNA band corresponding to this marker was cloned and sequenced with the induction of specific designed PCR primers. The results of PCR test helped to amplify a clear specific band of 680 bp in the tested acid-resistant strains. S116-680 marker would be useful to explore the acid-resistant mechanism of L. plantarum and to screen desirable malolactic fermentation strains.

  3. A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation

    OpenAIRE

    SAHA, AMRITA; Khapra, Mitesh M.; Chandar, Sarath; Rajendran, Janarthanan; Cho, Kyunghyun

    2016-01-01

    Interlingua based Machine Translation (MT) aims to encode multiple languages into a common linguistic representation and then decode sentences in multiple target languages from this representation. In this work we explore this idea in the context of neural encoder decoder architectures, albeit on a smaller scale and without MT as the end goal. Specifically, we consider the case of three languages or modalities X, Z and Y wherein we are interested in generating sequences in Y starting from inf...

  4. Development in Rice Genome Research Based on Accurate Genome Sequence

    OpenAIRE

    2008-01-01

    Rice is one of the most important crops in the world. Although genetic improvement is a key technology for the acceleration of rice breeding, a lack of genome information had restricted efforts in molecular-based breeding until the completion of the high-quality rice genome sequence, which opened new opportunities for research in various areas of genomics. The syntenic relationship of the rice genome to other cereal genomes makes the rice genome invaluable for understanding how cereal genomes...

  5. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    Directory of Open Access Journals (Sweden)

    Xiaoyu Wang

    Full Text Available Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  6. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences.

    Directory of Open Access Journals (Sweden)

    Philip J Tully

    2016-05-01

    Full Text Available Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx. We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison.

  7. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences.

    Science.gov (United States)

    Tully, Philip J; Lindén, Henrik; Hennig, Matthias H; Lansner, Anders

    2016-05-01

    Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN) learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx). We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison. PMID:27213810

  8. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    Science.gov (United States)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  9. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    Directory of Open Access Journals (Sweden)

    A.E. El Hakim

    2015-12-01

    Full Text Available Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including; gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were, 60, 56 kDa and 60, 53 kDa for LAAOI and LAAOII, respectively. The tryptic peptides of the two isoforms share high sequence homology with other snake venom l-amino acid oxidases. The optimal pH and temperature values of Cc-LAAOI and Cc-LAAOII were 7.8, 50 °C and 7, 60 °C, respectively. The two isoenzymes were thermally stable up to 70 °C. The Km and Vmax values were 0.67 mM, 0.135 μmol/min for LAAOI and 0.82 mM, 0.087 μmol/min for LAAOII. Both isoenzymes displayed high catalytic preference to long-chain, hydrophobic and aromatic amino acids. The Mn2+ ion markedly increased the LAAO activity for both purified isoforms, while Na+, K+, Ca2+, Mg2+ and Ba2+ ions showed a non-significant increase in the enzymatic activity of both isoforms. Furthermore, Zn2+, Ni2+, Co2+, Cu2+ and AL3+ ions markedly inhibited the LAAOI and LAAOII activities. l-Cysteine and reduced glutathione completely inhibited the LAAO activity of both isoenzymes, whereas, β-mercaptoethanol, O-phenanthroline and PMSF completely inhibited the enzymatic activity of LAAOII. Furthermore, iodoacitic acid inhibited the enzymatic activity of LAAOII by 46% and had no effect on the LAAOI activity.

  10. Amino acid sequence and variant forms of favin, a lectin from Vicia faba.

    Science.gov (United States)

    Hopp, T P; Hemperly, J J; Cunningham, B A

    1982-04-25

    We have determined the complete amino acid sequence (182 residues) of the beta chain of favin, the glucose-binding lectin from fava beans (Vicia faba), and have established that the carbohydrate moiety is attached to Asn 168. Together with the sequence of the alpha chain previously reported (Hemperly, J. J., Hopp, T. P., Becker, J. W., and Cunningham, B. A. (1979) J. Biol. Chem. 254, 6803-6810), these data complete the analysis of the primary structure of the lectin. We have also examined minor polypeptides that appear in all preparations of favin. Two lower molecular weight species (Mr = 9,500-11,600) appear to be fragments of the beta chain resulting from cleavage following Asn 76, whereas six high molecular weight forms (Mr = 25,000 or greater) appear to include aggregates of the beta chain and possibly some alternative products of chain processing. PMID:7068646

  11. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    Energy Technology Data Exchange (ETDEWEB)

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  12. Caption detection from video sequence based on fuzzy neural networks

    Science.gov (United States)

    Gao, Xinbo; Xin, Hong; Li, Jie

    2001-09-01

    Caption graphically superimposed in video frames can provide important indexing information. The automatic detection and recognition of video captions can be of great help in querying topics of interest in digital news library. To detect the caption from video sequence, we present algorithms based on fuzzy clustering neural networks. Since neural networks have the capabilities of learning and self-organizing and parallel computing mechanism, with the great increasing of digital images and video databases, neural networks based techniques become more efficient and popular tools for multimedia processing. Experimental results show that our caption detection scheme is effective and robust.

  13. Complete Genome Sequence of Lactococcus lactis IO-1, a Lactic Acid Bacterium That Utilizes Xylose and Produces High Levels of l-Lactic Acid

    OpenAIRE

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-01-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly l-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  14. The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

    DEFF Research Database (Denmark)

    Behrendt, N; Rønne, E; Ploug, M;

    1990-01-01

    The receptor for human urokinase-type plasminogen activator (u-PA) was purified from phorbol 12-myristate 13-acetate-stimulated U937 cells by temperature-induced phase separation of detergent extracts, followed by affinity chromatography with immobilized diisopropyl fluorophosphate-treated u......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported...... amino-terminal fragment; 2) the identical electrophoretic mobilities observed for cross-linked conjugates, formed between either the purified protein or the u-PA receptor on intact U937 cells and the above ligands; 3) the identity of the apparent molecular weight of the purified protein...

  15. Translating sanger-based routine DNA diagnostics into generic massive parallel ion semiconductor sequencing

    NARCIS (Netherlands)

    Diekstra, A.; Bosgoed, E.A.J.; Rikken, A.; Lier, B. van; Kamsteeg, E.J.; Tychon, M.W.J.; Derks, R.C.; Soest, R.A.; Mensenkamp, A.R.; Scheffer, H.; Neveling, K.; Nelen, M.R.

    2015-01-01

    BACKGROUND: Dideoxy-based chain termination sequencing developed by Sanger is the gold standard sequencing approach and allows clinical diagnostics of disorders with relatively low genetic heterogeneity. Recently, new next generation sequencing (NGS) technologies have found their way into diagnostic

  16. WAViS server for handling, visualization and presentation of multiple alignments of nucleotide or amino acids sequences.

    Science.gov (United States)

    Zika, Radek; Paces, Jan; Pavlícek, Adam; Paces, Václav

    2004-07-01

    Web Alignment Visualization Server contains a set of web-tools designed for quick generation of publication-quality color figures of multiple alignments of nucleotide or amino acids sequences. It can be used for identification of conserved regions and gaps within many sequences using only common web browsers. The server is accessible at http://wavis.img.cas.cz.

  17. WAViS server for handling, visualization and presentation of multiple alignments of nucleotide or amino acids sequences

    OpenAIRE

    Zika, Radek; Pačes, Jan; Pavlíček, Adam; Pačes, Václav

    2004-01-01

    Web Alignment Visualization Server contains a set of web-tools designed for quick generation of publication-quality color figures of multiple alignments of nucleotide or amino acids sequences. It can be used for identification of conserved regions and gaps within many sequences using only common web browsers. The server is accessible at http://wavis.img.cas.cz.

  18. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    OpenAIRE

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of cons...

  19. [Segmentation Method for Liver Organ Based on Image Sequence Context].

    Science.gov (United States)

    Zhang, Meiyun; Fang, Bin; Wang, Yi; Zhong, Nanchang

    2015-10-01

    In view of the problems of more artificial interventions and segmentation defects in existing two-dimensional segmentation methods and abnormal liver segmentation errors in three-dimensional segmentation methods, this paper presents a semi-automatic liver organ segmentation method based on the image sequence context. The method takes advantage of the existing similarity between the image sequence contexts of the prior knowledge of liver organs, and combines region growing and level set method to carry out semi-automatic segmentation of livers, along with the aid of a small amount of manual intervention to deal with liver mutation situations. The experiment results showed that the liver segmentation algorithm presented in this paper had a high precision, and a good segmentation effect on livers which have greater variability, and can meet clinical application demands quite well.

  20. Cladistic analysis of iridoviruses based on protein and DNA sequences.

    Science.gov (United States)

    Wang, J W; Deng, R Q; Wang, X Z; Huang, Y S; Xing, K; Feng, J H; He, J G; Long, Q X

    2003-11-01

    Cladograms of iridoviruses were inferred from bootstrap analysis of molecular data sets comprising all published protein and DNA sequences of the major capsid protein, ATPase and DNA polymerase genes of members of the Iridoviridae family Iridovirus. All data sets yielded cladograms supporting the separation of the Iridovirus, Ranavirus and Lymphocystivirus genera, and the cladogram based on data derived from major capsid proteins further divided both the Iridovirus and Ranavirus genera into two groups. Tests of alternative hypotheses of topological constraints were also performed to further investigate relationships between infectious spleen and kidney necrosis virus (ISKNV), an unclassified fish iridovirus for which the complete genome sequence data is available, and other iridoviruses. Cladograms inferred and results of Shimodaira-Hasegawa tests indicated that ISKNV is more closely related to the Ranavirus genus than it is to the other genera of the family.

  1. Watermarking scheme of colour image based on chaotic sequences

    Institute of Scientific and Technical Information of China (English)

    LIU Nian-sheng; GUO Dong-hui

    2009-01-01

    The proposed perceptual mask is based on the singularity of cover image and matches very well with the properties of the human visual system. The cover colour image is decomposed into several subbands by the wavelet transform. The water-mark composed of chaotic sequence and the covert image is embedded into the subband with the largest energy. The chaos system plays an important role in the security invisibility and robustness of the proposed scheme. The parameter and initial state of chaos system can directly influence the generation of watermark information as a key. Moreover, the watermark information has the property of spread spectrum signal by chaotic sequence to improve the invisibility and security of watermarked image. Experimental results and comparisons with other watermarking techniques prove that the proposed algorithm is effective and feasible, and improves the security, invisibility and robustness of watermarking information.

  2. Prediction of potential drug targets based on simple sequence properties

    Directory of Open Access Journals (Sweden)

    Lai Luhua

    2007-09-01

    Full Text Available Abstract Background During the past decades, research and development in drug discovery have attracted much attention and efforts. However, only 324 drug targets are known for clinical drugs up to now. Identifying potential drug targets is the first step in the process of modern drug discovery for developing novel therapeutic agents. Therefore, the identification and validation of new and effective drug targets are of great value for drug discovery in both academia and pharmaceutical industry. If a protein can be predicted in advance for its potential application as a drug target, the drug discovery process targeting this protein will be greatly speeded up. In the current study, based on the properties of known drug targets, we have developed a sequence-based drug target prediction method for fast identification of novel drug targets. Results Based on simple physicochemical properties extracted from protein sequences of known drug targets, several support vector machine models have been constructed in this study. The best model can distinguish currently known drug targets from non drug targets at an accuracy of 84%. Using this model, potential protein drug targets of human origin from Swiss-Prot were predicted, some of which have already attracted much attention as potential drug targets in pharmaceutical research. Conclusion We have developed a drug target prediction method based solely on protein sequence information without the knowledge of family/domain annotation, or the protein 3D structure. This method can be applied in novel drug target identification and validation, as well as genome scale drug target predictions.

  3. Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

    DEFF Research Database (Denmark)

    Radova, A.; Sebela, M.; Galuszka, P.;

    2001-01-01

    was further confirmed by measuring the fluorescence spectra, Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars: its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...

  4. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    Science.gov (United States)

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains to oxidize effectively d-glucose. PMID:27365351

  5. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Science.gov (United States)

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  6. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Science.gov (United States)

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/. PMID:26424080

  7. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    Directory of Open Access Journals (Sweden)

    Yan Koon-Kiu

    2007-11-01

    Full Text Available Abstract Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw" duplication and deletion rates rdup∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsgaKjabbwha1jabbchaWbqaaiabgEHiQaaaaaa@3283@, rdel∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsga

  8. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    Directory of Open Access Journals (Sweden)

    Pelletier Eric

    2010-10-01

    Full Text Available Abstract Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C

  9. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    International Nuclear Information System (INIS)

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species

  10. Complete amino acid sequence of the myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani.

    Science.gov (United States)

    Jones, B N; Wang, C C; Dwulet, F E; Lehman, L D; Meuth, J L; Bogardt, R A; Gurd, F R

    1979-04-25

    The complete amino acid sequence of the major component myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani, was determined by the automated Edman degradation of several large peptides obtained by specific cleavage of the protein. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. By subjecting four of these peptides and the apomyoglobin to automated Edman degradation, over 80% of the primary structure of the protein was obtained. The remainder of the covalent structure was determined by the sequence analysis of peptides that resulted from further digestion of the central cyanogen bromide fragment. This fragment was cleaved at its glutamyl residues with staphylococcal protease and its lysyl residues with trypsin. The action of trypsin was restricted to the lysyl residues by chemical modification of the single arginyl residue of the fragment with 1,2-cyclohexanedione. The primary structure of this myoglobin proved to be identical with that from the Atlantic bottlenosed dolphin and Pacific common dolphin but differs from the myoglobins of the killer whale and pilot whale at two positions. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea. PMID:454657

  11. Multifunctional hybrid networks based on self assembling peptide sequences

    Science.gov (United States)

    Sathaye, Sameer

    The overall aim of this dissertation is to achieve a comprehensive correlation between the molecular level changes in primary amino acid sequences of amphiphilic beta-hairpin peptides and their consequent solution-assembly properties and bulk network hydrogel behavior. This has been accomplished using two broad approaches. In the first approach, amino acid substitutions were made to peptide sequence MAX1 such that the hydrophobic surfaces of the folded beta-hairpins from the peptides demonstrate shape specificity in hydrophobic interactions with other beta-hairpins during the assembly process, thereby causing changes to the peptide nanostructure and bulk rheological properties of hydrogels formed from the peptides. Steric lock and key complementary hydrophobic interactions were designed to occur between two beta-hairpin molecules of a single molecule, LNK1 during beta-sheet fibrillar assembly of LNK1. Experimental results from circular dichroism, transmission electron microscopy and oscillatory rheology collectively indicate that the molecular design of the LNK1 peptide can be assigned the cause of the drastically different behavior of the networks relative to MAX1. The results indicate elimination or significant reduction of fibrillar branching due to steric complementarity in LNK1 that does not exist in MAX1, thus supporting the original hypothesis. As an extension of the designed steric lock and key complementarity between two beta-hairpin molecules of the same peptide molecule. LNK1, three new pairs of peptide molecules LP1-KP1, LP2-KP2 and LP3-KP3 that resemble complementary 'wedge' and 'trough' shapes when folded into beta-hairpins were designed and studied. All six peptides individually and when blended with their corresponding shape complement formed fibrillar nanostructures with non-uniform thickness values. Loose packing in the assembled structures was observed in all the new peptides as compared to the uniform tight packing in MAX1 by SANS analysis. This

  12. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Directory of Open Access Journals (Sweden)

    Meiler Arno

    2012-09-01

    Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  13. Generalization of entropy based divergence measures for symbolic sequence analysis.

    Directory of Open Access Journals (Sweden)

    Miguel A Ré

    Full Text Available Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD, is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms.

  14. Generalization of entropy based divergence measures for symbolic sequence analysis.

    Science.gov (United States)

    Ré, Miguel A; Azad, Rajeev K

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms.

  15. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach.

    Science.gov (United States)

    Adamala, Katarzyna; Anella, Fabrizio; Wieczorek, Rafal; Stano, Pasquale; Chiarabelli, Cristiano; Luisi, Pier Luigi

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the chemical viewpoint, macromolecules are formed by chemical pathways leading to the condensation of building blocks (amino acids, or nucleotides) in long-chain copolymers (proteins and nucleic acids, respectively). The second difficulty deals with a conceptual problem, namely with the emergence of specific sequences among a vast array of possible ones, the huge "sequence space", leading to the question "why these macromolecules, and not the others?" We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides, like Ser-His, with respect to peptide- and nucleotides-condensation, as a realistic model of primitive organocatalysis. We have also set up a strategy for exploring the sequence space of random proteins and RNAs (the so-called "never born biopolymer" project) with respect to the production of folded structures. Being still far from solved, the main aspects of these "open questions" are discussed here, by commenting on recent results obtained in our groups and by providing a unifying view on the problem and possible solutions. In particular, we propose a general scenario for macromolecule formation via fragment-condensation, as a scheme for the emergence of specific sequences based on molecular growth and selection.

  16. Next-Generation Sequencing-Based Molecular Diagnosis of Choroideremia

    Directory of Open Access Journals (Sweden)

    Kayo Shimizu

    2015-07-01

    Full Text Available We screened patients with choroideremia using next-generation sequencing (NGS and identified a novel mutation and a known mutation in the CHM gene. One patient presented an atypical fundus appearance for choroideremia. Another patient presented macular hole retinal detachment in the left eye. The present case series shows the utility of NGS-based screening in patients with choroideremia. In addition, the presence of macular hole in 1 of the 2 patients, together with a previous report, indicated the susceptibility of patients with choroideremia to macular hole.

  17. Quantitative sequence-function relationships in proteins based on gene ontology

    Directory of Open Access Journals (Sweden)

    Lesk Arthur M

    2007-08-01

    Full Text Available Abstract Background The relationship between divergence of amino-acid sequence and divergence of function among homologous proteins is complex. The assumption that homologs share function – the basis of transfer of annotations in databases – must therefore be regarded with caution. Here, we present a quantitative study of sequence and function divergence, based on the Gene Ontology classification of function. We determined the relationship between sequence divergence and function divergence in 6828 protein families from the PFAM database. Within families there is a broad range of sequence similarity from very closely related proteins – for instance, orthologs in different mammals – to very distantly-related proteins at the limit of reliable recognition of homology. Results We correlated the divergence in sequences determined from pairwise alignments, and the divergence in function determined by path lengths in the Gene Ontology graph, taking into account the fact that many proteins have multiple functions. Our results show that, among homologous proteins, the proportion of divergent functions decreases dramatically above a threshold of sequence similarity at about 50% residue identity. For proteins with more than 50% residue identity, transfer of annotation between homologs will lead to an erroneous attribution with a totally dissimilar function in fewer than 6% of cases. This means that for very similar proteins (about 50 % identical residues the chance of completely incorrect annotation is low; however, because of the phenomenon of recruitment, it is still non-zero. Conclusion Our results describe general features of the evolution of protein function, and serve as a guide to the reliability of annotation transfer, based on the closeness of the relationship between a new protein and its nearest annotated relative.

  18. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    Energy Technology Data Exchange (ETDEWEB)

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L. (Univ. of Massachusetts Medical School, Worcester (USA))

    1988-06-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity.

  19. EFFECT OF CASEIN-BASED SEMISYNTHETIC FOOD ON RENAL ACID EXCRETION AND ACID-BASE STATE OF BLOOD IN DOGS

    NARCIS (Netherlands)

    ZIJLSTRA, WG; LANGBROEK, AJM; KRAAN, J; RISPENS, P; NIJMEIJER, A

    1995-01-01

    Urinary acid excretion and blood acid-base stare were determined in dogs fed a casein-based semi-synthetic food (SSF), to which different amounts of salts had been added, in comparison with feeding normal dog food. Net acid excretion (NAE) and inorganic acid excretion (IAE) increased during SSF feed

  20. [Determination of body fluid based on analysis of nucleic acids].

    Science.gov (United States)

    Korabečná, Marie

    2015-01-01

    Recent methodological approaches of molecular genetics allow isolation of nucleic acids (DNA and RNA) from negligible forensic samples. Analysis of these molecules may be used not only for individual identification based on DNA profiling but also for the detection of origin of the body fluid which (alone or in mixture with other body fluids) forms the examined biological trace. Such an examination can contribute to the evaluation of procedural, technical and tactical value of the trace. Molecular genetic approaches discussed in the review offer new possibilities in comparison with traditional spectrum of chemical, immunological and spectroscopic tests especially with regard to the interpretation of mixtures of biological fluids and to the confirmatory character of the tests. Approaches based on reverse transcription of tissue specific mRNA and their subsequent polymerase chain reaction (PCR) and fragmentation analysis are applicable on samples containing minimal amounts of biological material. Methods for body fluid discrimination based on examination of microRNA in samples provided so far confusing results therefore further development in this field is needed. The examination of tissue specific methylation of nucleotides in selected gene sequences seems to represent a promising enrichment of the methodological spectrum. The detection of DNA sequences of tissue related bacteria has been established and it provides satisfactory results mainly in combination with above mentioned methodological approaches. PMID:26419517

  1. Sequence-structure based phylogeny of GPCR Class A Rhodopsin receptors.

    Science.gov (United States)

    Kakarala, Kavita Kumari; Jamil, Kaiser

    2014-05-01

    Current methods of G protein coupled receptors (GPCRs) phylogenetic classification are sequence based and therefore inappropriate for highly divergent sequences, sharing low sequence identity. In this study, sequence structure profile based alignment generated by PROMALS3D was used to understand the GPCR Class A Rhodopsin superfamily evolution using the MEGA 5 software. Phylogenetic analysis included a combination of Neighbor-Joining method and Maximum Likelihood method, with 1000 bootstrap replicates. Our study was able to identify potential ligand association for Class A Orphans and putative/unclassified Class A receptors with no cognate ligand information: GPR21 and GPR52 with fatty acids; GPR75 with Neuropeptide Y; GPR82, GPR18, GPR141 with N-arachidonylglycine; GPR176 with Free fatty acids, GPR10 with Tachykinin & Neuropeptide Y; GPR85 with ATP, ADP & UDP glucose; GPR151 with Galanin; GPR153 and GPR162 with Adrenalin, Noradrenalin; GPR146, GPR139, GPR142 with Neuromedin, Ghrelin, Neuromedin U-25 & Thyrotropin-releasing hormone; GPR171 with ATP, ADP & UDP Glucose; GPR88, GPR135, GPR161, GPR101with 11-cis-retinal; GPR83 with Tackykinin; GPR148 with Prostanoids, GPR109b, GPR81, GPR31with ATP & UTP and GPR150 with GnRH I & GnRHII. Furthermore, we suggest that this study would prove useful in re-classification of receptors, selecting templates for homology modeling and identifying ligands which may show cross reactivity with other GPCRs as signaling via multiple ligands play a significant role in disease modulation. PMID:24503482

  2. Rapid sequencing of DNA based on single-molecule detection

    Science.gov (United States)

    Soper, Steven A.; Davis, Lloyd M.; Fairfield, Frederick R.; Hammond, Mark L.; Harger, Carol A.; Jett, James H.; Keller, Richard A.; Marrone, Babetta L.; Martin, John C.; Nutter, Harvey L.; Shera, E. Brooks; Simpson, Daniel J.

    1991-07-01

    Sequencing the human genome is a major undertaking considering the large number of nucleotides present in the genome and the slow methods currently available to perform the task. The authors have recently reported on a scheme to sequence DNA rapidly using a non-gel based technique. The concept is based upon the incorporation of fluorescently labeled nucleotides into a strand of DNA, isolation and manipulation of a labeled DNA fragment and the detection of single nucleotides using ultra-sensitive laser-induced fluorescence detection following their cleavage from the fragment. Detection of individual fluorophores in the liquid phase was accomplished with time-gated detection following pulsed-laser excitation. The photon bursts from individual rhodamine 6G (R6G) molecules travelling through a laser beam have been observed, as have bursts from single fluorescently modified nucleotides. Using two different biotinylated nucleotides as a model system for fluorescently labeled nucleotides, the authors have observed synthesis of the complementary copy of M13 bacteriophage. Work with fluorescently labeled nucleotides is underway. Individual molecules of DNA attached to a microbead have been observed and manipulated with an epifluorescence microscope.

  3. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    Directory of Open Access Journals (Sweden)

    Selvaraj S

    2011-10-01

    Full Text Available Abstract Background Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching. Results We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix. Conclusions The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.

  4. Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

    Directory of Open Access Journals (Sweden)

    Nagib eAhsan

    2012-07-01

    Full Text Available The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.

  5. Teaching Acid/Base Physiology in the Laboratory

    Science.gov (United States)

    Friis, Ulla G.; Plovsing, Ronni; Hansen, Klaus; Laursen, Bent G.; Wallstedt, Birgitta

    2010-01-01

    Acid/base homeostasis is one of the most difficult subdisciplines of physiology for medical students to master. A different approach, where theory and practice are linked, might help students develop a deeper understanding of acid/base homeostasis. We therefore set out to develop a laboratory exercise in acid/base physiology that would provide…

  6. Enzyme-free translation of DNA into sequence-defined synthetic polymers structurally unrelated to nucleic acids

    Science.gov (United States)

    Niu, Jia; Hili, Ryan; Liu, David R.

    2013-04-01

    The translation of DNA sequences into corresponding biopolymers enables the production, function and evolution of the macromolecules of life. In contrast, methods to generate sequence-defined synthetic polymers with similar levels of control have remained elusive. Here, we report the development of a DNA-templated translation system that enables the enzyme-free translation of DNA templates into sequence-defined synthetic polymers that have no necessary structural relationship with nucleic acids. We demonstrate the efficiency, sequence-specificity and generality of this translation system by oligomerizing building blocks including polyethylene glycol, α-(D)-peptides, and β-peptides in a DNA-programmed manner. Sequence-defined synthetic polymers with molecular weights of 26 kDa containing 16 consecutively coupled building blocks and 90 densely functionalized β-amino acid residues were translated from DNA templates using this strategy. We integrated the DNA-templated translation system developed here into a complete cycle of translation, coding sequence replication, template regeneration and re-translation suitable for the iterated in vitro selection of functional sequence-defined synthetic polymers unrelated in structure to nucleic acids.

  7. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  8. Liquid Crystalline Furandicarboxylic Acid-based Aaromatic Polyesters

    NARCIS (Netherlands)

    WILSENS, CAROLUS HENRICUS R. MARIA; RASTOGI, SANJAY; VELD, MARTIJN ARNOLDUS JOHANNES; KLOP, ENNO ANTON; NOORDOVER, BART ADRIANUS JOHANNES

    2013-01-01

    The invention pertains to a fully aromatic liquid crystalline furandicarboxylic acid- based aromatic polyester obtainable from a mixture of monomers comprising 2,5- furandicarboxylic acid, p-hydroxybenzoic acid, an aromatic diol, and 5-40 mol% of an aromatic monocarboxylic acid selected from vanilli

  9. Establishment of the nucleic acid sequence-based amplificationmethod of detecting Vibrio parahaemolyticus%依赖于核酸序列恒温扩增技术快速检测副溶血性弧菌方法的建立

    Institute of Scientific and Technical Information of China (English)

    倪鑫; 王志聪; 雷质文; 梁成珠; 汪东风; 姜英辉; 王建广; 祝素贞

    2011-01-01

    为建立副溶血性弧菌的快速检测方法,本研究采用依赖于核酸序列恒温扩增(NASBA)技术,以副溶血性弧菌的tlh基因为扩增的靶基因设计特异性引物和探针,建立可快速扩增副溶血性弧菌的NASBA方法.特异性和灵敏度试验结果表明:该NASBA方法对副溶血性弧菌的最小检出量为5.1×102 cfu/mL,高于普通PCR方法,而且与其他种属的菌无任何交叉反应.此外,本研究将副溶血弧菌扩增产物采用通用型核酸扩增物快速检测板进行检测,实现了特异性强的快速副溶血弧菌的检测.该方法对仪器要求低,具有良好的应用前景.%In this study, we adopted a technology based on nucleic acid sequence-based amplification (NASBA), to establish a rapid test method of Vibrio parahaemolyticus. Specific primers and probes were designed for the tlh target gene of V. Parahaemolyticus and the detection method of NASBA was established. Its specificity and sensitivity were tested. The results showed that the sensitivity of the NASBA was 5.1 x 102 cfii/mL which was higher than that of PCR method and this method had no cross reaction with other species of bacteria. The amplification products of V. Pamhaemolyticus were rapidly detected by using the universal rapid detection board of nucleic acid amplification product. In conclusion, detecting V. Parahaemolyticus with NASBA method was more specific and sensitive than PCR method and has lower instrumental requirement and a broad application.

  10. Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

    Science.gov (United States)

    Rodrigue, Nicolas; Philippe, Hervé; Lartillot, Nicolas

    2010-03-01

    Modeling the interplay between mutation and selection at the molecular level is key to evolutionary studies. To this end, codon-based evolutionary models have been proposed as pertinent means of studying long-range evolutionary patterns and are widely used. However, these approaches have not yet consolidated results from amino acid level phylogenetic studies showing that selection acting on proteins displays strong site-specific effects, which translate into heterogeneous amino acid propensities across the columns of alignments; related codon-level studies have instead focused on either modeling a single selective context for all codon columns, or a separate selective context for each codon column, with the former strategy deemed too simplistic and the latter deemed overparameterized. Here, we integrate recent developments in nonparametric statistical approaches to propose a probabilistic model that accounts for the heterogeneity of amino acid fitness profiles across the coding positions of a gene. We apply the model to a dozen real protein-coding gene alignments and find it to produce biologically plausible inferences, for instance, as pertaining to site-specific amino acid constraints, as well as distributions of scaled selection coefficients. In their account of mutational features as well as the heterogeneous regimes of selection at the amino acid level, the modeling approaches studied here can form a backdrop for several extensions, accounting for other selective features, for variable population size, or for subtleties of mutational features, all with parameterizations couched within population-genetic theory. PMID:20176949

  11. Trypsin inhibitors from ridged gourd (Luffa acutangula Linn.) seeds: purification, properties, and amino acid sequences.

    Science.gov (United States)

    Haldar, U C; Saha, S K; Beavis, R C; Sinha, N K

    1996-02-01

    Two trypsin inhibitors, LA-1 and LA-2, have been isolated from ridged gourd (Luffa acutangula Linn.) seeds and purified to homogeneity by gel filtration followed by ion-exchange chromatography. The isoelectric point is at pH 4.55 for LA-1 and at pH 5.85 for LA-2. The Stokes radius of each inhibitor is 11.4 A. The fluorescence emission spectrum of each inhibitor is similar to that of the free tyrosine. The biomolecular rate constant of acrylamide quenching is 1.0 x 10(9) M-1 sec-1 for LA-1 and 0.8 x 10(9) M-1 sec-1 for LA-2 and that of K2HPO4 quenching is 1.6 x 10(11) M-1 sec-1 for LA-1 and 1.2 x 10(11) M-1 sec-1 for LA-2. Analysis of the circular dichroic spectra yields 40% alpha-helix and 60% beta-turn for La-1 and 45% alpha-helix and 55% beta-turn for LA-2. Inhibitors LA-1 and LA-2 consist of 28 and 29 amino acid residues, respectively. They lack threonine, alanine, valine, and tryptophan. Both inhibitors strongly inhibit trypsin by forming enzyme-inhibitor complexes at a molar ratio of unity. A chemical modification study suggests the involvement of arginine of LA-1 and lysine of LA-2 in their reactive sites. The inhibitors are very similar in their amino acid sequences, and show sequence homology with other squash family inhibitors. PMID:8924202

  12. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    OpenAIRE

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which might shed light onto the enzymatic machineries that are involved in its production of biogenic amines.

  13. Development of Sequence-Based Microsatellite Marker for Phalaenopsis Orchid

    Directory of Open Access Journals (Sweden)

    FATIMAH

    2011-06-01

    Full Text Available Phalaenopsis is one of the most interesting genera of orchids due to the members are often used as parents to produce hybrids. The establishment and development of highly reliable and discriminatory methods for identifying species and cultivars has become increasingly more important to plant breeders and members of the nursery industry. The aim of this research was to develop sequence-based microsatellite (eSSR markers for the Phalaenopsis orchid designed from the sequence of GenBank NCBI. Seventeen primers were designed and thirteen primers pairs could amplify the DNA giving the expected PCR product with polymorphism. A total of 51 alleles, with an average of 3 alleles per locus and polymorphism information content (PIC values at 0.674, were detected at the 16 SSR loci. Therefore, these markers could be used for identification of the Phalaenopsis orchid used in this study. Genetic similarity and principle coordinate analysis identified five major groups of Phalaenopsis sp. the first group consisted of P. amabilis, P. fuscata, P. javanica, and P. zebrine. The second group consisted of P. amabilis, P. amboinensis, P. bellina, P. floresens, and P. mannii. The third group consisted of P. bellina, P. cornucervi, P. cornucervi, P. violaceae sumatra, P. modesta. The forth group consisted of P. cornucervi and P. lueddemanniana, and the fifth group was P. amboinensis.

  14. New algorithm for iris recognition based on video sequences

    Science.gov (United States)

    Bourennane, Salah; Fossati, Caroline; Ketchantang, William

    2010-07-01

    Among existing biometrics, iris recognition systems are among the most accurate personal biometric identification systems. However, the acquisition of a workable iris image requires strict cooperation of the user; otherwise, the image will be rejected by a verification module because of its poor quality, inducing a high false reject rate (FRR). The FRR may also increase when iris localization fails or when the pupil is too dilated. To improve the existing methods, we propose to use video sequences acquired in real time by a camera. In order to keep the same computational load to identify the iris, we propose a new method to estimate the iris characteristics. First, we propose a new iris texture characterization based on Fourier-Mellin transform, which is less sensitive to pupil dilatations than previous methods. Then, we develop a new iris localization algorithm that is robust to variations of quality (partial occlusions due to eyelids and eyelashes, light reflects, etc.), and finally, we introduce a fast and new criterion of suitable image selection from an iris video sequence for an accurate recognition. The accuracy of each step of the algorithm in the whole proposed recognition process is tested and evaluated using our own iris video database and several public image databases, such as CASIA, UBIRIS, and BATH.

  15. CLUSS: Clustering of protein sequences based on a new similarity measure

    Directory of Open Access Journals (Sweden)

    Brzezinski Ryszard

    2007-08-01

    Full Text Available Abstract Background The rapid burgeoning of available protein data makes the use of clustering within families of proteins increasingly important. The challenge is to identify subfamilies of evolutionarily related sequences. This identification reveals phylogenetic relationships, which provide prior knowledge to help researchers understand biological phenomena. A good evolutionary model is essential to achieve a clustering that reflects the biological reality, and an accurate estimate of protein sequence similarity is crucial to the building of such a model. Most existing algorithms estimate this similarity using techniques that are not necessarily biologically plausible, especially for hard-to-align sequences such as proteins with different domain structures, which cause many difficulties for the alignment-dependent algorithms. In this paper, we propose a novel similarity measure based on matching amino acid subsequences. This measure, named SMS for Substitution Matching Similarity, is especially designed for application to non-aligned protein sequences. It allows us to develop a new alignment-free algorithm, named CLUSS, for clustering protein families. To the best of our knowledge, this is the first alignment-free algorithm for clustering protein sequences. Unlike other clustering algorithms, CLUSS is effective on both alignable and non-alignable protein families. In the rest of the paper, we use the term "phylogenetic" in the sense of "relatedness of biological functions". Results To show the effectiveness of CLUSS, we performed an extensive clustering on COG database. To demonstrate its ability to deal with hard-to-align sequences, we tested it on the GH2 family. In addition, we carried out experimental comparisons of CLUSS with a variety of mainstream algorithms. These comparisons were made on hard-to-align and easy-to-align protein sequences. The results of these experiments show the superiority of CLUSS in yielding clusters of proteins

  16. Canine amino acid transport system Xc(-): cDNA sequence, distribution and cystine transport activity in lens epithelial cells.

    Science.gov (United States)

    Maruo, Takuya; Kanemaki, Nobuyuki; Onda, Ken; Sato, Reiichiro; Ichihara, Nobuteru; Ochiai, Hideharu

    2014-04-01

    The cystine transport activity of a lens epithelial cell line originated from a canine mature cataract was investigated. The distinct cystine transport activity was observed, which was inhibited to 28% by extracellular 1 mM glutamate. The cDNA sequences of canine cysteine/glutamate exchanger (xCT) and 4F2hc were determined. The predicted amino acid sequences were 527 and 533 amino acid polypeptides, respectively. The amino acid sequences of canine xCT and 4F2hc showed high similarities (>80%) to those of humans. The expression of xCT in lens epithelial cell line was confirmed by western blot analysis. RT-PCR analysis revealed high level expression only in the brain, and it was below the detectable level in other tissues.

  17. Targeting of an Interrupted Polypurine:Polypyrimidine Sequence in Mammalian Cells by a Triplex-Forming Oligonucleotide Containing a Novel Base Analogue†

    OpenAIRE

    Semenyuk, A.; Darian, E.; Liu, J; Majumdar, A; Cuenoud, B; Miller, P S; MacKerell, A.D.; Seidman, M M

    2010-01-01

    The DNA triple helix consists of a third strand of nucleic acid lying in the major groove of an intact DNA duplex. The most stable triplexes form on polypurine:polypyrimidine sequences, and pyrimidine interruptions in the purine strand are destabilizing. Sequence stringency is imparted by specific Hoogsteen hydrogen bonds between third strand bases and the purine bases in the duplex. Appropriate base and sugar modifications of triple helix-forming oligonucleotides (TFOs) confer chromosome tar...

  18. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    Energy Technology Data Exchange (ETDEWEB)

    Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States); Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  19. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    Science.gov (United States)

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using set of 0.8L SBRs using gelatinized PPW at a solids content range from 30 to 50 g L(-1), solids retention time of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. PMID:25708409

  20. Application of peptide nucleic acids containing azobenzene self-assembled electrochemical biosensors in detecting DNA sequences

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Hybridization of peptide nucleic acids probe containing azobenzene (NH2-TNT4, N-PNAs) with DNA was performed by covalently immobilizing of NH2-TNT4 in sequence on the 3-mercaptopropionic acid self-assembled monolayer modified gold electrode with the helps of N-(3-dimethylaminopropy1)-N’-ethylcarbodiimide hydrochloride (EDC) and N-hydroxysuccinimide (NHS), and the hybrid was coded as N-PNAs/DNA. Using [Fe(CN)6]4-/3- (1:1) as the electrochemical indicator, the electrochemical properties of the N-PNAs self-assembled monolayer (N-PNAs-SAMs) and N-PNAs/DNA hybridization system under the conditions of before and after UV light irradiation were characterized with cyclic voltammetry (CV), differential pulse voltammetry (DPV), and electrochemical impedance spectra (EIS). Results showed that the redox currents decreased with the increase of irradiation time, suggesting that the ability of the charge transfer on the electrode surface was weakened and the conformation of hybrid system had been changed, and the control of PNAs/DNA hybridization could be realized by UV light irradiation.

  1. Similarity Measurement of Web Sessions Based on Sequence Alignment

    Institute of Scientific and Technical Information of China (English)

    LI Chaofeng; LU Yansheng

    2007-01-01

    The task of clustering Web sessions is to group Web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-group similarity.The first and foremost question needed to be considered in clustering Web sessions is how to measure the similarity between Web sessions. However, there are many shortcomings in traditional measurements. This paper introduces a new method for measuring similarities between Web pages that takes into account not only the URL but also the viewing time of the visited Web page. Then we give a new method to measure the similarity of Web sessions using sequence alignment and the similarity of Web page access in detail.Experiments have proved that our method is valid and efficient.

  2. The application of nucleic acid sequence?based amplification,real?time PCR and GM test in invasive aspergillosis diagnosis%核酸序列依赖性扩增、Real?time PCR及GM试验诊断侵袭性曲霉菌感染的临床应用评价

    Institute of Scientific and Technical Information of China (English)

    王立朋; 鲍翠霞; 于丽梅; 张晓录; 于威娟; 张霞; 李玮; 黄葆华; 李杰

    2015-01-01

    Objective To study the diagnostic performance of nucleic acid sequence?based amplification ( NASBA) assay,real?time PCR and GM test in detecting invasive aspergillosis for clinical diagnosis.Methods Blood samples from 80 patients at a high risk for IA were collected during from November 2013 to June 2014.These patients were categorized as 8 proven IA,26 probable IA, and 46 non?IA according to the 2008 revised definitions of EORTC/MSG.Blood samples were tested by NASBA,real?time PCR and GM test and their diagnostic parameters were calculated,respectively.Result The sensitivity of NASBA,real?time PCR and GM test was 76.47%,67.65% and 52.94%,while their specificity was 80.43%,89.13%,80.43%,respectively.The efficiency of various com?binations of tests was also evaluated.Perfect specificity (100%) and positive predictive value (100%) were achieved by combining NASBA and real?time PCR as a serial testing.A combination of NASBA and real?time PCR as a parallel testing was the most sensitive (94.12%).Conclusion The sensitivity and specificity of NASBA and real?time PCR were superior to GM test.Combination of these assays could be particularly useful in specific clinical situations.%目的 核酸序列依赖性扩增 ( nucleic acid sequence?based amplification,NASBA)、Real?time PCR及GM试验在侵袭性曲霉菌感染中的诊断价值. 方法 收集2013年11月~2014年6月临床上曲霉菌感染高危病患的血液标本80例,并根据EORTC/MSG诊断标准分为确诊组8例,拟诊组26例,非感染组46例,分别利用NASBA、real?time PCR及GM试验进行检测,计算3种方法的诊断指标并分析评价. 结果 NASBA、real?time PCR及GM试验3种方法的灵敏度分别为76.47%、67.65%、52.94%,特异度分别为80.43%、89.13%、80.43%. 联合诊断结果显示,NASBA与real?time PCR串联方案有最好的特异度 (100%)及阳性预测值(100%);NASBA与real?time PCR并联方案则最为灵敏(94.12%). 结论 NASBA用于诊断IA最为敏感,而real?time PCR

  3. Site-directed gene mutation at mixed sequence targets by psoralen-conjugated pseudo-complementary peptide nucleic acids

    OpenAIRE

    Kim, Ki-Hyun; Nielsen, Peter E.; Glazer, Peter M.

    2007-01-01

    Sequence-specific DNA-binding molecules such as triple helix-forming oligonucleotides (TFOs) provide a means for inducing site-specific mutagenesis and recombination at chromosomal sites in mammalian cells. However, the utility of TFOs is limited by the requirement for homopurine stretches in the target duplex DNA. Here, we report the use of pseudo-complementary peptide nucleic acids (pcPNAs) for intracellular gene targeting at mixed sequence sites. Due to steric hindrance, pcPNAs are unable ...

  4. Nucleotide and amino acid sequence coding for polypeptides of foot-and-mouth disease virus type A12.

    OpenAIRE

    Robertson, B H; Grubman, M J; Weddell, G N; Moore, D.M.; Welsh, J D; Fischer, T.; Dowbenko, D J; Yansura, D G; Small, B.; Kleid, D G

    1985-01-01

    The coding region for the structural and nonstructural polypeptides of the type A12 foot-and-mouth disease virus genome has been identified by nucleotide sequencing of cloned DNA derived from the viral RNA. In addition, 704 nucleotides in the 5' untranslated region between the polycytidylic acid tract and the probable initiation codon of the first translated gene, P16-L, have been sequenced. This region has several potential initiation codons, one of which appears to be a low-frequency altern...

  5. Physiological roles of acid-base sensors.

    Science.gov (United States)

    Levin, Lonny R; Buck, Jochen

    2015-01-01

    Acid-base homeostasis is essential for life. The macromolecules upon which living organisms depend are sensitive to pH changes, and physiological systems use the equilibrium between carbon dioxide, bicarbonate, and protons to buffer their pH. Biological processes and environmental insults are constantly challenging an organism's pH; therefore, to maintain a consistent and proper pH, organisms need sensors that measure pH and that elicit appropriate responses. Mammals use multiple sensors for measuring both intracellular and extracellular pH, and although some mammalian pH sensors directly measure protons, it has recently become apparent that many pH-sensing systems measure pH via bicarbonate-sensing soluble adenylyl cyclase. PMID:25340964

  6. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

    Directory of Open Access Journals (Sweden)

    Leitner Dietmar

    2005-04-01

    Full Text Available Abstract Background A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.

  7. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    Science.gov (United States)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  8. BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

    Science.gov (United States)

    Bahr, A; Thompson, J D; Thierry, J C; Poch, O

    2001-01-01

    BAliBASE is specifically designed to serve as an evaluation resource to address all the problems encountered when aligning complete sequences. The database contains high quality, manually constructed multiple sequence alignments together with detailed annotations. The alignments are all based on three-dimensional structural superpositions, with the exception of the transmembrane sequences. The first release provided sets of reference alignments dealing with the problems of high variability, unequal repartition and large N/C-terminal extensions and internal insertions. Here we describe version 2.0 of the database, which incorporates three new reference sets of alignments containing structural repeats, trans-membrane sequences and circular permutations to evaluate the accuracy of detection/prediction and alignment of these complex sequences. BAliBASE can be viewed at the web site http://www-igbmc.u-strasbg. fr/BioInfo/BAliBASE2/index.html or can be downloaded from ftp://ftp-igbmc.u-strasbg.fr/pub/BAliBASE2 /.

  9. GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH

    Directory of Open Access Journals (Sweden)

    Zeinab A. Fareed

    2016-02-01

    Full Text Available The alignment of two DNA sequences is a basic step in the analysis of biological data. Sequencing a long DNA sequence is one of the most interesting problems in bioinformatics. Several techniques have been developed to solve this sequence alignment problem like dynamic programming and heuristic algorithms. In this paper, we introduce (GPCodon alignment a pairwise DNA-DNA method for global sequence alignment that improves the accuracy of pairwise sequence alignment. We use a new scoring matrix to produce the final alignment called the empirical codon substitution matrix. Using this matrix in our technique enabled the discovery of new relationships between sequences that could not be discovered using traditional matrices. In addition, we present experimental results that show the performance of the proposed technique over eleven datasets of average length of 2967 bps. We compared the efficiency and accuracy of our techniques against a comparable tool called “Pairwise Align Codons” [1].

  10. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    International Nuclear Information System (INIS)

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution

  11. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    Energy Technology Data Exchange (ETDEWEB)

    Chang, Soo-Ik (Harvard Medical School, Boston, MA (USA)); Hammes, G.G. (Univ. of California, Santa Barbara (USA))

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the {beta}-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.

  12. The complete amino acid sequence of a mannose-binding lectin from "Kidachi Aloe" (Aloe arborescens Miller var. natalensis Berger).

    Science.gov (United States)

    Koike, T; Titani, K; Suzuki, M; Beppu, H; Kuzuya, H; Maruta, K; Shimpo, K; Fujita, K

    1995-09-01

    The complete amino acid sequence of a mannose-binding lectin purified from the leaf skin of "Kidachi Aloe" (Aloe arborescens Miller var. natalensis Berger) is presented. The 109-residue sequence of the subunit was determined by analysis of peptides of the intact or S-pyridylethylated protein generated by digestion with cyanogen bromide, BNPS-skatole, Achromobacter protease I, or trypsin. The subunit contains an intrachain disulfide bridge. The sequence is highly homologous to that of a mannose-binding lectin from snowdrop bulb. PMID:7669035

  13. AbCD: arbitrary coverage design for sequencing-based genetic studies

    OpenAIRE

    Kang, Jian; Huang, Kuan-Chieh; Xu, Zheng; Wang, Yunfei; Abecasis, Gonçalo R.; Li, Yun

    2013-01-01

    Summary: Recent advances in sequencing technologies have revolutionized genetic studies. Although high-coverage sequencing can uncover most variants present in the sequenced sample, low-coverage sequencing is appealing for its cost effectiveness. Here, we present AbCD (arbitrary coverage design) to aid the design of sequencing-based studies. AbCD is a user-friendly interface providing pre-estimated effective sample sizes, specific to each minor allele frequency category, for designs with arbi...

  14. A DNA Structure-Based Bionic Wavelet Transform and Its Application to DNA Sequence Analysis

    OpenAIRE

    Fei Chen; Yuan-Ting Zhang

    2003-01-01

    DNA sequence analysis is of great significance for increasing our understanding of genomic functions. An important task facing us is the exploration of hidden structural information stored in the DNA sequence. This paper introduces a DNA structure-based adaptive wavelet transform (WT) – the bionic wavelet transform (BWT) – for DNA sequence analysis. The symbolic DNA sequence can be separated into four channels of indicator sequences. An adaptive symbol-to-number mapping, determined from the s...

  15. SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCLEOTIDE BASES, FREQUENCY AND POSITION OF GROUP MUTATIONS

    Directory of Open Access Journals (Sweden)

    Fatima KABLI

    2016-01-01

    Full Text Available The DNA sequences similarity analysis approaches have been based on the representation and the frequency of sequences components; however, the position inside sequence is important information for the sequence data. Whereas, insufficient information in sequences representations is important reason that causes poor similarity results. Based on three classifications of the DNA bases according to their chemical properties, the frequencies and average positions of group mutations have been grouped into two twelve-components vectors, the Euclidean distances among introduced vectors applied to compare the coding sequences of the first exon of beta globin gene of 11 species.

  16. Java Implementation based Heterogeneous Video Sequence Automated Surveillance Monitoring

    Directory of Open Access Journals (Sweden)

    Sankari Muthukarupan

    2013-04-01

    Full Text Available Automated video based surveillance monitoring is an essential and computationally challenging task to resolve issues in the secure access localities. This paper deals with some of the issues which are encountered in the integration surveillance monitoring in the real-life circumstances. We have employed video frames which are extorted from heterogeneous video formats. Each video frame is chosen to identify the anomalous events which are occurred in the sequence of time-driven process. Background subtraction is essentially required based on the optimal threshold and reference frame. Rest of the frames are ablated from reference image, hence all the foreground images paradigms are obtained. The co-ordinate existing in the deducted images is found by scanning the images horizontally until the occurrence of first black pixel. Obtained coordinate is twinned with existing co-ordinates in the primary images. The twinned co-ordinate in the primary image is considered as an active-region-of-interest. At the end, the starred images are converted to temporal video that scrutinizes the moving silhouettes of human behaviors in a static background. The proposed model is implemented in Java. Results and performance analysis are carried out in the real-life environments.

  17. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE Consensus Sequence Repeats in the Viral Genome.

    Directory of Open Access Journals (Sweden)

    Ashutosh Kumar

    2016-08-01

    Full Text Available Owing to the reports of microcephaly as a consistent outcome in the foetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV - microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favour of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1 sequence homology between ZIKV genome and the response element of an early neural tube developmental marker ‘retinoic acid’ in human DNA and (2 comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE consensus sequence (5′–AGGTCA–3′ in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly and other viruses available in National Institute of Health genetic sequence database (GenBank for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause foetal brain defects (for which maternal-foetal transmission during developing stage may be required. The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although bioinformatic

  18. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    Science.gov (United States)

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and

  19. Multiple sequence alignment based on combining genetic algorithm with chaotic sequences.

    Science.gov (United States)

    Gao, C; Wang, B; Zhou, C J; Zhang, Q

    2016-06-24

    In bioinformatics, sequence alignment is one of the most common problems. Multiple sequence alignment is an NP (nondeterministic polynomial time) problem, which requires further study and exploration. The chaos optimization algorithm is a type of chaos theory, and a procedure for combining the genetic algorithm (GA), which uses ergodicity, and inherent randomness of chaotic iteration. It is an efficient method to solve the basic premature phenomenon of the GA. Applying the Logistic map to the GA and using chaotic sequences to carry out the chaotic perturbation can improve the convergence of the basic GA. In addition, the random tournament selection and optimal preservation strategy are used in the GA. Experimental evidence indicates good results for this process.

  20. Complete chloroplast genome sequence of Fritillaria unibracteata var. wabuensis based on SMRT Sequencing Technology.

    Science.gov (United States)

    Li, Ying; Li, Qiushi; Li, Xiwen; Song, Jingyuan; Sun, Chao

    2016-09-01

    Fritillaria unibracteata var. wabuensis is an important medicinal plant used for the treatment of cough symptoms related to the respiratory system. The chloroplast genome of F. unibracteata var. wabuensis (GenBank accession no. KF769142) was assembled using the PacBio RS platform (Pacific Biosciences, Beverly, MA) as a circle sequence with 151 009 bp. The assembled genome contains 133 genes, including 88 protein-coding, 37 tRNA, and eight rRNA genes. This genome sequence will provide important resource for further studies on the evolution of Fritillaria genus and molecular identification of Fritillaria herbs and their adulterants. This work suggests that PacBio RS is a powerful tool to sequence and assemble chloroplast genomes. PMID:26370383

  1. The phylogenetic status of Paxillosida (Asteroidea) based on complete mitochondrial DNA sequences.

    Science.gov (United States)

    Matsubara, Mioko; Komatsu, Miéko; Araki, Takeyoshi; Asakawa, Shuichi; Yokobori, Shin-ichi; Watanabe, Kimitsuna; Wada, Hiroshi

    2005-09-01

    One of the most important issues in asteroid phylogeny is the phylogenetic status of Paxillosida. This group lacks an anus and suckers on the tube feet in adults and does not develop the brachiolaria stage in early development. Two controversial hypotheses have been proposed for the phylogenetic status of Paxillosida, i.e., Paxillosida is primitive or rather specialized in asteroids. In this study, we determined the complete mitochondrial DNA nucleotide sequences from two paxillosidans (Astropecten polyacanthus and Luidia quinaria) and one forcipulatidan (Asterias amurensis). The mitochondrial genomes of the three asteroids were identical with respect to gene order and transcription direction, and were identical to the previously reported mitochondrial genomes of Asterina pectinifera (Valvatida) and Pisaster ochraceus (Forcipulatida) in this respect. Therefore, the comparison of genome structures was uninformative for the purposes of asteroid phylogeny. However, molecular phylogenetic analyses based on the amino acid sequences and the nucleotide sequences from the five asteroids supported the monophyly of the clade that included the two paxillosidans and Asterina. This suggests that the paxillosidan characters are secondarily derived ones.

  2. Amino acid classification based spectrum kernel fusion for protein subnuclear localization

    Directory of Open Access Journals (Sweden)

    Fei Wang

    2010-01-01

    Full Text Available Abstract Background Prediction of protein localization in subnuclear organelles is more challenging than general protein subcelluar localization. There are only three computational models for protein subnuclear localization thus far, to the best of our knowledge. Two models were based on protein primary sequence only. The first model assumed homogeneous amino acid substitution pattern across all protein sequence residue sites and used BLOSUM62 to encode k-mer of protein sequence. Ensemble of SVM based on different k-mers drew the final conclusion, achieving 50% overall accuracy. The simplified assumption did not exploit protein sequence profile and ignored the fact of heterogeneous amino acid substitution patterns across sites. The second model derived the PsePSSM feature representation from protein sequence by simply averaging the profile PSSM and combined the PseAA feature representation to construct a kNN ensemble classifier Nuc-PLoc, achieving 67.4% overall accuracy. The two models based on protein primary sequence only both achieved relatively poor predictive performance. The third model required that GO annotations be available, thus restricting the model's applicability. Methods In this paper, we only use the amino acid information of protein sequence without any other information to design a widely-applicable model for protein subnuclear localization. We use K-spectrum kernel to exploit the contextual information around an amino acid and the conserved motif information. Besides expanding window size, we adopt various amino acid classification approaches to capture diverse aspects of amino acid physiochemical properties. Each amino acid classification generates a series of spectrum kernels based on different window size. Thus, (I window expansion can capture more contextual information and cover size-varying motifs; (II various amino acid classifications can exploit multi-aspect biological information from the protein sequence. Finally

  3. Genome Sequence of Corynebacterium glutamicum ATCC 14067, Which Provides Insight into Amino Acid Biosynthesis in Coryneform Bacteria

    OpenAIRE

    Lv, Yangyong; Liao, Juanjun; Wu, Zhanhong; Han, Shuangyan; Lin, Ying; Zheng, Suiping

    2012-01-01

    We report the genome sequence of Corynebacterium glutamicum ATCC 14067 (once named Brevibacterium flavum), which is useful for taxonomy research and further molecular breeding in amino acid production. Preliminary comparison with those of the reported coryneform strains revealed some notable differences that might be related to the difficulties in molecular manipulation.

  4. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    Science.gov (United States)

    Meneghel, Julie; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  5. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  6. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Science.gov (United States)

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  7. N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

    OpenAIRE

    Kuhn, H; Fietzek, P P; Lampen, J. O.

    1982-01-01

    The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.

  8. Amino acid sequence of Coprinus macrorhizus peroxidase and cDNA sequence encoding Coprinus cinereus peroxidase. A new family of fungal peroxidases.

    Science.gov (United States)

    Baunsgaard, L; Dalbøge, H; Houen, G; Rasmussen, E M; Welinder, K G

    1993-04-01

    Sequence analysis and cDNA cloning of Coprinus peroxidase (CIP) were undertaken to expand the understanding of the relationships of structure, function and molecular genetics of the secretory heme peroxidases from fungi and plants. Amino acid sequencing of Coprinus macrorhizus peroxidase, and cDNA sequencing of Coprinus cinereus peroxidase showed that the mature proteins are identical in amino acid sequence, 343 residues in size and preceded by a 20-residue signal peptide. Their likely identity to peroxidase from Arthromyces ramosus is discussed. CIP has an 8-residue, glycine-rich N-terminal extension blocked with a pyroglutamate residue which is absent in other fungal peroxidases. The presence of pyroglutamate, formed by cyclization of glutamine, and the finding of a minor fraction of a variant form lacking the N-terminal residue, indicate that signal peptidase cleavage is followed by further enzymic processing. CIP is 40-45% identical in amino-acid sequence to 11 lignin peroxidases from four fungal species, and 42-43% identical to the two known Mn-peroxidases. Like these white-rot fungal peroxidases, CIP has an additional segment of approximately 40 residues at the C-terminus which is absent in plant peroxidases. Although CIP is much more similar to horseradish peroxidase (HRP C) in substrate specificity, specific activity and pH optimum than to white-rot fungal peroxidases, the sequences of CIP and HRP C showed only 18% identity. Hence, CIP qualifies as the first member of a new family of fungal peroxidases. The nine invariant residues present in all plant, fungal and bacterial heme peroxidases are also found in CIP. The present data support the hypothesis that only one chromosomal CIP gene exists. In contrast, a large number of secretory plant and fungal peroxidases are expressed from several peroxidase gene clusters. Analyses of three batches of CIP protein and of 49 CIP clones revealed the existence of only two highly similar alleles indicating less

  9. Roche genome sequencer FLX based high-throughput sequencing of ancient DNA

    DEFF Research Database (Denmark)

    Alquezar-Planas, David E; Fordyce, Sarah Louise

    2012-01-01

    Since the development of so-called "next generation" high-throughput sequencing in 2005, this technology has been applied to a variety of fields. Such applications include disease studies, evolutionary investigations, and ancient DNA. Each application requires a specialized protocol to ensure tha...

  10. ACID-BASE INTERACTIONS BETWEEN POLYMERS AND FILLERS

    Institute of Scientific and Technical Information of China (English)

    WANG Qingguo; CHEN Fute; HUANG Yuanfu; ZHOU Qingli

    1987-01-01

    Inverse gas chromatography(IGC) and Fourier-transform infrared (FT-IR) techniques were applied to determining the relative acid-base strength of polymers and coupling agents. The acid-base characteristics of fillers such as CaCO3 could be altered by treatment with different coupling agents. It was shown that some mechanical properties of filled polymers were obviously associated with acid-base interactions between polymers and fillers.

  11. Extraction of Sequence Conservation Features for the Prioritization of Candidate Single Amino Acid Polymorphisms

    Directory of Open Access Journals (Sweden)

    Jiaxin Wu

    2011-03-01

    Full Text Available Although remarkable success has been achieved by genome-wide association (GWA studies over the past few years, genetic variants discovered in GWA studies can typically account for only a small fraction of heritability of most common diseases. As such, the identification of multiple rare variants that are associated with complex diseases has been receiving more and more attentions. However, most of the recently developed statistical approaches for detecting association of rare variants with diseases require the selection of functional variants before the successive analysis, making an effective bioinformatics method for filtering out non-relevant rare variants indispensible. In this paper, we focus on a specific type of genetic variants called single amino acid polymorphisms (SAAPs. We propose to prioritize candidate SAAPs for a specific disease according to their association scores that are calculated using a guilt-by-association model with a set of features derived from protein sequences. We validate the proposed approach in a systematic way and demonstrate that the proposed model is powerful in distinguishing disease-associated SAAPs for the specific disease of interest.

  12. Moving Away from the Reference Genome: Evaluating a Peptide Sequencing Tagging Approach for Single Amino Acid Polymorphism Identifications in the Genus Populus

    Energy Technology Data Exchange (ETDEWEB)

    Abraham, Paul E [ORNL; Adams, Rachel M [ORNL; Tuskan, Gerald A [ORNL; Hettich, Robert {Bob} L [ORNL

    2013-01-01

    The genetic diversity across natural populations of the model organism, Populus, is extensive, containing a single nucleotide polymorphism roughly every 200 base pairs. When deviations from the reference genome occur in coding regions, they can impact protein sequences. Rather than relying on a static reference database to profile protein expression, we employed a peptide sequence tagging (PST) approach capable of decoding the plasticity of the Populus proteome. Using shotgun proteomics data from two genotypes of P. trichocarpa, a tag-based approach enabled the detection of 6,653 unexpected sequence variants. Through manual validation, our study investigated how the most abundant chemical modification (methionine oxidation) could masquerade as a sequence variant (AlaSer) when few site-determining ions existed. In fact, precise localization of an oxidation site for peptides with more than one potential placement was indeterminate for 70% of the MS/MS spectra. We demonstrate that additional fragment ions made available by high energy collisional dissociation enhances the robustness of the peptide sequence tagging approach (81% of oxidation events could be exclusively localized to a methionine). We are confident that augmenting fragmentation processes for a PST approach will further improve the identification of single amino acid polymorphism in Populus and potentially other species as well.

  13. STING Millennium: a web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence

    Science.gov (United States)

    Neshich, Goran; Togawa, Roberto C.; Mancini, Adauto L.; Kuser, Paula R.; Yamagishi, Michel E. B.; Pappas, Georgios; Torres, Wellington V.; Campos, Tharsis Fonseca e; Ferreira, Leonardo L.; Luna, Fabio M.; Oliveira, Adilton G.; Miura, Ronald T.; Inoue, Marcus K.; Horita, Luiz G.; de Souza, Dimas F.; Dominiquini, Fabiana; Álvaro, Alexandre; Lima, Cleber S.; Ogawa, Fabio O.; Gomes, Gabriel B.; Palandrani, Juliana F.; dos Santos, Gabriela F.; de Freitas, Esther M.; Mattiuz, Amanda R.; Costa, Ivan C.; de Almeida, Celso L.; Souza, Savio; Baudet, Christian; Higa, Roberto H.

    2003-01-01

    STING Millennium Suite (SMS) is a new web-based suite of programs and databases providing visualization and a complex analysis of molecular sequence and structure for the data deposited at the Protein Data Bank (PDB). SMS operates with a collection of both publicly available data (PDB, HSSP, Prosite) and its own data (contacts, interface contacts, surface accessibility). Biologists find SMS useful because it provides a variety of algorithms and validated data, wrapped-up in a user friendly web interface. Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment, indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts and Cα–Cα and Cβ–Cβ distance geometry. Specific emphasis in SMS is given to interface forming residues (IFR)—amino acids that define the interactive portion of the protein surfaces. SMS may simultaneously display and analyze previously superimposed structures. PDB updates trigger SMS updates in a synchronized fashion. SMS is freely accessible for public data at http://www.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS and http://trantor.bioc.columbia.edu/SMS. PMID:12824333

  14. Acid-base strengths in pyridine

    NARCIS (Netherlands)

    Bos, M.; Dahmen, E.A.M.F.

    1971-01-01

    Although pyridine is a solvent with a low dielectric constant, spectrophotometric determinations show simple dissociation without ion pairs as intermediates for some sulfonphthaleins and polynitrophenols in pyridine. The salts of a number of amines and hydrochloric acid, perchloric acid and picric

  15. Enhanced Acid/Base Catalysis in High Temperature Liquid Water

    Institute of Scientific and Technical Information of China (English)

    Xiu Yang LU; Qi JING; Zhun LI; Lei YUAN; Fei GAO; Xin LIU

    2006-01-01

    Two novel and environmentally benign solvent systems, organic acids-enriched high temperature liquid water (HTLW) and NH3-enriched HTLW, were developed, which can enhance the reaction rate of acid/base-catalyzed organic reactions in HTLW. We investigated the decomposition of fructose in organic acids-enriched HTLW, hydrolysis of cinnamaldehyde and aldol condensation of phenylaldehyde with acetaldehyde in NH3-enriched HTLW. The experimental results demonstrated that organic acids-enriched or NH3-enriched HTLW can greatly accelerate acid/base-catalyzed organic reactions in HTLW.

  16. Cloning, DNA sequencing and heterologous expression of the gene for thermostable N-acylamino acid racemase from Amycolatopsis sp. TS-1-60 in Escherichia coli.

    Science.gov (United States)

    Tokuyama, S; Hatano, K

    1995-03-01

    The gene encoding the novel enzyme N-acylamino acid racemase (AAR) was cloned in recombinant phage lambda-4 from the DNA library of Amycolatopsis sp. TS-1-60, a rare actinomycete, using antiserum against the enzyme. The cloned gene was subcloned and transformed in Escherichia coli JM105 using pUC118 as a vector. The AAR gene consists of an open-reading frame of 1104 nucleotides, which specifies a 368-amino-acid protein with a molecular mass of 39411Da. The molecular mass deduced from the AAR gene is in good agreement with the subunit molecular mass (40kDa) of AAR from Amycolatopsis sp. TS-1-60. The guanosine plus cytosine content of the AAR gene was about 70%. Although the AAR gene uses the unusual initiation codon GTG, the gene was expressed in Escherichia coli using the lac promoter of pUC118. The amount of the enzyme produced by the transformant was 16 times that produced by Amycolatopsis sp. TS-1-60. When the unusual initiation codon GTG was changed to ATG, the enzyme productivity of the transformant increased to more than 37 times that of Amycolatopsis sp. TS-1-60. In the comparison of the DNA sequence and the deduced amino acid sequence of AAR with those of known racemases and epimerases in data bases, no significant sequence homology was found. However, AAR resembles mandelate racemase in that requires metal ions for enzyme activity.(ABSTRACT TRUNCATED AT 250 WORDS)

  17. Sequence and structure-based prediction of fructosyltransferase activity for functional subclassification of fungal GH32 enzymes.

    Science.gov (United States)

    Trollope, Kim M; van Wyk, Niël; Kotjomela, Momo A; Volschenk, Heinrich

    2015-12-01

    Sucrolytic enzymes catalyse sucrose hydrolysis or the synthesis of fructooligosaccharides (FOSs), a prebiotic in human and animal nutrition. FOS synthesis capacity differs between sucrolytic enzymes. Amino-acid-sequence-based classification of FOS synthesizing enzymes would greatly facilitate the in silico identification of novel catalysts, as large amounts of sequence data lie untapped. The development of a bioinformatics tool to rapidly distinguish between high-level FOSs synthesizing predominantly sucrose hydrolysing enzymes from fungal genomic data is presented. Sequence comparison of functionally characterized enzymes displaying low- and high-level FOS synthesis revealed conserved motifs unique to each group. New light is shed on the sequence context of active site residues in three previously identified conserved motifs. We characterized two enzymes predicted to possess low- and high-level FOS synthesis activities based on their conserved motif sequences. FOS data for the enzymes confirmed our successful prediction of their FOS synthesis capacity. Structural comparison of enzymes displaying low- and high-level FOS synthesis identified steric hindrance between nystose and a long loop region present only in low-level FOS synthesizers. This loop is proposed to limit the synthesis of FOS species with higher degrees of polymerization, a phenomenon observed among enzymes displaying low-level FOS synthesis. Conserved sequence motifs surrounding catalytic residues and a distant structural determinant were identifiers of FOS synthesis capacity and allow for functional annotation of sucrolytic enzymes directly from amino acid sequence. The tool presented may also be useful to study the structure-function relationships of β-fructofuranosidases by identifying mutations present in a group of closely related enzymes displaying similar function. PMID:26426731

  18. Sequence and structure-based prediction of fructosyltransferase activity for functional subclassification of fungal GH32 enzymes.

    Science.gov (United States)

    Trollope, Kim M; van Wyk, Niël; Kotjomela, Momo A; Volschenk, Heinrich

    2015-12-01

    Sucrolytic enzymes catalyse sucrose hydrolysis or the synthesis of fructooligosaccharides (FOSs), a prebiotic in human and animal nutrition. FOS synthesis capacity differs between sucrolytic enzymes. Amino-acid-sequence-based classification of FOS synthesizing enzymes would greatly facilitate the in silico identification of novel catalysts, as large amounts of sequence data lie untapped. The development of a bioinformatics tool to rapidly distinguish between high-level FOSs synthesizing predominantly sucrose hydrolysing enzymes from fungal genomic data is presented. Sequence comparison of functionally characterized enzymes displaying low- and high-level FOS synthesis revealed conserved motifs unique to each group. New light is shed on the sequence context of active site residues in three previously identified conserved motifs. We characterized two enzymes predicted to possess low- and high-level FOS synthesis activities based on their conserved motif sequences. FOS data for the enzymes confirmed our successful prediction of their FOS synthesis capacity. Structural comparison of enzymes displaying low- and high-level FOS synthesis identified steric hindrance between nystose and a long loop region present only in low-level FOS synthesizers. This loop is proposed to limit the synthesis of FOS species with higher degrees of polymerization, a phenomenon observed among enzymes displaying low-level FOS synthesis. Conserved sequence motifs surrounding catalytic residues and a distant structural determinant were identifiers of FOS synthesis capacity and allow for functional annotation of sucrolytic enzymes directly from amino acid sequence. The tool presented may also be useful to study the structure-function relationships of β-fructofuranosidases by identifying mutations present in a group of closely related enzymes displaying similar function.

  19. Acid-Base Pairs in Lewis Acidic Zeolites Promote Direct Aldol Reactions by Soft Enolization.

    Science.gov (United States)

    Lewis, Jennifer D; Van de Vyver, Stijn; Román-Leshkov, Yuriy

    2015-08-17

    Hf-, Sn-, and Zr-Beta zeolites catalyze the cross-aldol condensation of aromatic aldehydes with acetone under mild reaction conditions with near quantitative yields. NMR studies with isotopically labeled molecules confirm that acid-base pairs in the Si-O-M framework ensemble promote soft enolization through α-proton abstraction. The Lewis acidic zeolites maintain activity in the presence of water and, unlike traditional base catalysts, in acidic solutions. PMID:26138135

  20. Phylogenetic relationships of Salmonella based on rRNA sequences

    DEFF Research Database (Denmark)

    Christensen, H.; Nordentoft, Steen; Olsen, J.E.

    1998-01-01

    To establish the phylogenetic relationships between the subspecies of Salmonella enterica (official name Salmonella choleraesuis), Salmonella bongori and related members of Enterobacteriaceae, sequence comparison of rRNA was performed by maximum-likelihood analysis. The two Salmonella species were...

  1. PHARMACOGENETIC TESTING OPPORTUNITIES IN CARDIOLOGY BASED ON EXOME SEQUENCING

    Directory of Open Access Journals (Sweden)

    N. V. Shcherbakova

    2014-01-01

    Full Text Available Aim. To study what cardiac drugs currently have any comments on biomarkers and what information can be obtained by pharmacogenetic testing using data exome sequencing in patients with cardiac diseases.Material and methods. Exome sequencing in random participant of the ATEROGEN IVANOVO study and bioinformatics analysis of the data were performed. Point mutations were annotated using ANNOVAR program, as well as comparison with a number of specialized databases was done on the basis of user protocols.Results. 11 cardiac drugs and 7 genes which variants can influence cardiac drug metabolism were analyzed. According to exome sequencing of the participant we did not reveal allelic variants that require dose regime correction and careful efficacy control.Conclusion. The exome sequencing application is the next step to a wide range of personalized therapy. Future opportunities for improvement of the risk-benefit ratio in each patient are the main purpose of the collection and analysis of pharmacogenetic data.

  2. EST sequences and their annotation (amino acid sequence and results of homology search) - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available and VS) derived from five developmental stages. Clone ID ID of cDNA clone Atlas ID ID of Atlas database ( ht...tp://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ) and link to Atlas database NBRP ID ID of cDNA c...ir annotations (amino acid sequence, homology search results (with target DBs: dicty EST-DB, DNA-DB and prot...ein-DB)). Links to the Atlas database ( http://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ), whic

  3. Antibiotic Selection Pressure Determination through Sequence-Based Metagenomics.

    Science.gov (United States)

    Willmann, Matthias; El-Hadidi, Mohamed; Huson, Daniel H; Schütz, Monika; Weidenmaier, Christopher; Autenrieth, Ingo B; Peter, Silke

    2015-12-01

    The human gut forms a dynamic reservoir of antibiotic resistance genes (ARGs). Treatment with antimicrobial agents has a significant impact on the intestinal resistome and leads to enhanced horizontal transfer and selection of resistance. We have monitored the development of intestinal ARGs over a 6-day course of ciprofloxacin (Cp) treatment in two healthy individuals by using sequenced-based metagenomics and different ARG quantification methods. Fixed- and random-effect models were applied to determine the change in ARG abundance per defined daily dose of Cp as an expression of the respective selection pressure. Among various shifts in the composition of the intestinal resistome, we found in one individual a strong positive selection for class D beta-lactamases which were partly located on a mobile genetic element. Furthermore, a trend to a negative selection has been observed with class A beta-lactamases (-2.66 hits per million sample reads/defined daily dose; P = 0.06). By 4 weeks after the end of treatment, the composition of ARGs returned toward their initial state but to a different degree in both subjects. We present here a novel analysis algorithm for the determination of antibiotic selection pressure which can be applied in clinical settings to compare therapeutic regimens regarding their effect on the intestinal resistome. This information is of critical importance for clinicians to choose antimicrobial agents with a low selective force on their patients' intestinal ARGs, likely resulting in a diminished spread of resistance and a reduced burden of hospital-acquired infections with multidrug-resistant pathogens. PMID:26369961

  4. Antibiotic Selection Pressure Determination through Sequence-Based Metagenomics.

    Science.gov (United States)

    Willmann, Matthias; El-Hadidi, Mohamed; Huson, Daniel H; Schütz, Monika; Weidenmaier, Christopher; Autenrieth, Ingo B; Peter, Silke

    2015-12-01

    The human gut forms a dynamic reservoir of antibiotic resistance genes (ARGs). Treatment with antimicrobial agents has a significant impact on the intestinal resistome and leads to enhanced horizontal transfer and selection of resistance. We have monitored the development of intestinal ARGs over a 6-day course of ciprofloxacin (Cp) treatment in two healthy individuals by using sequenced-based metagenomics and different ARG quantification methods. Fixed- and random-effect models were applied to determine the change in ARG abundance per defined daily dose of Cp as an expression of the respective selection pressure. Among various shifts in the composition of the intestinal resistome, we found in one individual a strong positive selection for class D beta-lactamases which were partly located on a mobile genetic element. Furthermore, a trend to a negative selection has been observed with class A beta-lactamases (-2.66 hits per million sample reads/defined daily dose; P = 0.06). By 4 weeks after the end of treatment, the composition of ARGs returned toward their initial state but to a different degree in both subjects. We present here a novel analysis algorithm for the determination of antibiotic selection pressure which can be applied in clinical settings to compare therapeutic regimens regarding their effect on the intestinal resistome. This information is of critical importance for clinicians to choose antimicrobial agents with a low selective force on their patients' intestinal ARGs, likely resulting in a diminished spread of resistance and a reduced burden of hospital-acquired infections with multidrug-resistant pathogens.

  5. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    Science.gov (United States)

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  6. Ionisation constants of inorganic acids and bases in aqueous solution

    CERN Document Server

    Perrin, D D

    2013-01-01

    Ionisation Constants of Inorganic Acids and Bases in Aqueous Solution, Second Edition provides a compilation of tables that summarize relevant data recorded in the literature up to the end of 1980 for the ionization constants of inorganic acids and bases in aqueous solution. This book includes references to acidity functions for strong acids and bases, as well as details about the formation of polynuclear species. This text then explains the details of each column of the tables, wherein column 1 gives the name of the substance and the negative logarithm of the ionization constant and column 2

  7. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    Science.gov (United States)

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  8. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    Science.gov (United States)

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered. PMID:11281267

  9. Autonomously generating operations sequences for a Mars Rover using AI-based planning

    Science.gov (United States)

    Sherwood, Rob; Mishkin, Andrew; Estlin, Tara; Chien, Steve; Backes, Paul; Cooper, Brian; Maxwell, Scott; Rabideau, Gregg

    2001-01-01

    This paper discusses a proof-of-concept prototype for ground-based automatic generation of validated rover command sequences from highlevel science and engineering activities. This prototype is based on ASPEN, the Automated Scheduling and Planning Environment. This Artificial Intelligence (AI) based planning and scheduling system will automatically generate a command sequence that will execute within resource constraints and satisfy flight rules.

  10. The Research of Chaos-based M-ary Spreading Sequences

    Directory of Open Access Journals (Sweden)

    YANG Hongye

    2012-12-01

    Full Text Available This paper is devoted to the generation and evaluation of the Chaos-based M-ary spreading sequences on communications systems. Sequences obtained by repeating a truncated and multi-ary quantized chaotic series are compared with classical m-sequences by means of the autocorrelation and cross-correlation properties and power-spectral features. Anti-noise performance of binary sequences and chaotic-based M-ary spreading sequences has been compared in the case of the same single-frequency interferences. Studies have shown that spectral features and anti-noise performance of chaotic-based M-ary spreading sequences which have great researching value are better than binary sequences.

  11. A Closer Look at Acid-Base Olfactory Titrations

    Science.gov (United States)

    Neppel, Kerry; Oliver-Hoyo, Maria T.; Queen, Connie; Reed, Nicole

    2005-01-01

    Olfactory titrations using raw onions and eugenol as acid-base indicators are reported. An in-depth investigation on olfactory titrations is presented to include requirements for potential olfactory indicators and protocols for using garlic, onions, and vanillin as acid-base olfactory indicators are tested.

  12. Teaching acid/base physiology in the laboratory

    DEFF Research Database (Denmark)

    Friis, Ulla G; Plovsing, Ronni; Hansen, Klaus;

    2010-01-01

    Acid/base homeostasis is one of the most difficult subdisciplines of physiology for medical students to master. A different approach, where theory and practice are linked, might help students develop a deeper understanding of acid/base homeostasis. We therefore set out to develop a laboratory...

  13. The Roles of Acids and Bases in Enzyme Catalysis

    Science.gov (United States)

    Weiss, Hilton M.

    2007-01-01

    Many organic reactions are catalyzed by strong acids or bases that protonate or deprotonate neutral reactants leading to reactive cations or anions that proceed to products. In enzyme reactions, only weak acids and bases are available to hydrogen bond to reactants and to transfer protons in response to developing charges. Understanding this…

  14. A dipstick sensor for coulometric acid-base titrations

    NARCIS (Netherlands)

    Olthuis, W.; Schoot, van der B.H.; Chavez, F.; Bergveld, P.

    1989-01-01

    By performing an acid-base titration by coulometric generation of OH− or H+ ions at an inert electrode in close proximity to the pH-sensitive gate of an ISFET, it is possible to determine the acid or base concentration of a solution using the ISFET as an indicator device for the equivalence point in

  15. Connecting Acids and Bases with Encapsulation... and Chemistry with Nanotechnology

    Science.gov (United States)

    Criswell, Brett

    2007-01-01

    The features and the development of various new acids and bases activity sets that combines chemistry with nanotechnology are being described. These sets lead to the generation of many nanotechnology-based pharmaceuticals for the treatment of various diseases.

  16. Novel Sequence Number Based Secure Authentication Scheme for Wireless LANs

    Institute of Scientific and Technical Information of China (English)

    Rajeev Singh; Teek Parval Sharma

    2015-01-01

    Authentication per frame is an implicit necessity for security in wireless local area networks (WLANs). We propose a novel per frame secure authentication scheme which provides authentication to data frames in WLANs. The scheme involves no cryptographic overheads for authentication of frames. It utilizes the sequence number of the frame along with the authentication stream generators for authentication. Hence, it requires no extra bits or messages for the authentication purpose and also no change in the existing frame format is required. The scheme provides authentication by modifying the sequence number of the frame at the sender, and that the modification is verified at the receiver. The modified sequence number is protected by using the XOR operation with a random number selected from the random stream. The authentication is lightweight due to the fact that it requires only trivial arithmetic operations like the subtraction and XOR operation.

  17. Predicting tissue-specific expressions based on sequence characteristics

    KAUST Repository

    Paik, Hyojung

    2011-04-30

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  18. Spherical Nucleic Acids as Intracellular Agents for Nucleic Acid Based Therapeutics

    Science.gov (United States)

    Hao, Liangliang

    Recent functional discoveries on the noncoding sequences of human genome and transcriptome could lead to revolutionary treatment modalities because the noncoding RNAs (ncRNAs) can be applied as therapeutic agents to manipulate disease-causing genes. To date few nucleic acid-based therapeutics have been translated into the clinic due to challenges in the delivery of the oligonucleotide agents in an effective, cell specific, and non-toxic fashion. Unmodified oligonucleotide agents are destroyed rapidly in biological fluids by enzymatic degradation and have difficulty crossing the plasma membrane without the aid of transfection reagents, which often cause inflammatory, cytotoxic, or immunogenic side effects. Spherical nucleic acids (SNAs), nanoparticles consisting of densely organized and highly oriented oligonucleotides, pose one possible solution to circumventing these problems in both the antisense and RNA interference (RNAi) pathways. The unique three dimensional architecture of SNAs protects the bioactive oligonucleotides from unspecific degradation during delivery and supports their targeting of class A scavenger receptors and endocytosis via a lipid-raft-dependent, caveolae-mediated pathway. Owing to their unique structure, SNAs are able to cross cell membranes and regulate target genes expression as a single entity, without triggering the cellular innate immune response. Herein, my thesis has focused on understanding the interactions between SNAs and cellular components and developing SNA-based nanostructures to improve therapeutic capabilities. Specifically, I developed a novel SNA-based, nanoscale agent for delivery of therapeutic oligonucleotides to manipulate microRNAs (miRNAs), the endogenous post-transcriptional gene regulators. I investigated the role of SNAs involving miRNAs in anti-cancer or anti-inflammation responses in cells and in in vivo murine disease models via systemic injection. Furthermore, I explored using different strategies to construct

  19. JRC GMO-Amplicons, a collection of nucleic acid sequences related to genetically modified organisms

    OpenAIRE

    PETRILLO MAURO; ANGERS ALEXANDRE; HENRIKSSON PETER; Bonfini, Laura; PATAK DENNSTEDT Alexandre; KREYSA JOACHIM

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico...

  20. Analysis of amino acids network based on distance matrix

    Science.gov (United States)

    Ali, Tazid; Akhtar, Adil; Gohain, Nisha

    2016-06-01

    In this paper we have constructed a distance matrix of the amino acids. The distance is defined based on the relative evolutionary importance of the base position of the corresponding codons. From this distance matrix a network of the amino acids is obtained. We have argued that this network depicts the evolutionary pattern of the amino acids. To examine the relative importance of the amino acids with respect to this network we have discussed different measures of centrality. We have also investigated the correlation coefficients between different measures of centrality. Further we have explored clustering coefficient as well as degree of distribution.

  1. A DNA Structure-Based Bionic Wavelet Transform and Its Application to DNA Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Fei Chen

    2003-01-01

    Full Text Available DNA sequence analysis is of great significance for increasing our understanding of genomic functions. An important task facing us is the exploration of hidden structural information stored in the DNA sequence. This paper introduces a DNA structure-based adaptive wavelet transform (WT – the bionic wavelet transform (BWT – for DNA sequence analysis. The symbolic DNA sequence can be separated into four channels of indicator sequences. An adaptive symbol-to-number mapping, determined from the structural feature of the DNA sequence, was introduced into WT. It can adjust the weight value of each channel to maximise the useful energy distribution of the whole BWT output. The performance of the proposed BWT was examined by analysing synthetic and real DNA sequences. Results show that BWT performs better than traditional WT in presenting greater energy distribution. This new BWT method should be useful for the detection of the latent structural features in future DNA sequence analysis.

  2. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Science.gov (United States)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  3. Purification, characterization, and amino acid sequencing of a. delta. /sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B

    Energy Technology Data Exchange (ETDEWEB)

    Linden, K.G.

    1986-01-01

    Studies were performed on the ..delta../sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B. The studies have involved three broad areas: improvement in the purification of the enzyme, further characterization of the purified enzyme, and completion of the amino acid sequence of the enzyme. For the purification of the enzyme, techniques for removing the isomerase from whole cells were studied, the effects of ionic strength on the binding of the isomerase to steroidal affinity resins was explored, and a new affinity resin was developed. Absorption spectra and the proton NMR spectra of the isomerase were obtained. Amino acid sequencing of the oxosteroid isomerase indicates that the enzyme is a dimeric protein consisting of two identical subunits each consisting of a polypeptide chain of 131 residues and a M/sub r/ = 14,536.

  4. Safety assessment of Bifidobacterium longum J DM301 based on complete genome sequences

    Institute of Scientific and Technical Information of China (English)

    Yan-Xia Wei; Zhuo-Yang Zhang; Chang Liu; Xiao-Kui Guo; Pradeep K Malakar

    2012-01-01

    AIM: To assess the safety of Bifidobacterium longum (B.longum) JDM301 based on complete genome sequences. METHODS: The complete genome sequences of JDM301 were determined using the GS 20 system. Putative virulence factors, putative antibiotic resistance genes and genes encoding enzymes responsible for harmful metabolites were identified by blast with virulence factors database, antibiotic resistance genes database and genes associated with harmful metabolites in previous reports. Minimum inhibitory concentration of 16 common antimicrobial agents was evaluated by E-test. RESULTS: JDM301 was shown to contain 36 genes associated with antibiotic resistance, 5 enzymes related to harmful metabolites and 162 nonspecific virulence factors mainly associated with transcriptional regulation, adhesion, sugar and amino acid transport. B. longum JDM301 was intrinsically resistant tocipro ciprofloxacin,amikacin, gentamicin and streptomycin and susceptible to vancomycin, amoxicillin, cephalothin, chloramphenicol, erythromycin, ampicillin, cefotaxime, rifampicin, imipenemandtrimethoprim and trimethoprim-sulphamethoxazol. JDM301.JDM301 was moderately resistant to bacitracin, while an earlier study showed that bifidobacteria were susceptible to this antibiotic. A tetracycline resistance gene with the risk of transfer was found in JDM301, which needs to be experimentally validated. CONCLUSION: The safety assessment of JDM301 using information derived from complete bacterial genome will contribute to a wider and deeper insight into the safety of probiotic bacteria.

  5. Robin Sequence: The road to evidence based personalized treatment

    NARCIS (Netherlands)

    H. Basart

    2016-01-01

    Robin Sequence (RS) is characterized by micrognathia and upper airway obstruction (UAO) caused by glossoptosis resulting in respiratory and feeding problems of varying severity. According to the original RS definition a cleft palate is associated with RS, but not part of the definition. Reported inc

  6. Whole-genome sequence-based analysis of thyroid function

    DEFF Research Database (Denmark)

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby;

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome seque...

  7. General base-general acid catalysis by terpenoid cyclases.

    Science.gov (United States)

    Pemberton, Travis A; Christianson, David W

    2016-07-01

    Terpenoid cyclases catalyze the most complex reactions in biology, in that more than half of the substrate carbon atoms often undergo changes in bonding during the course of a multistep cyclization cascade that proceeds through multiple carbocation intermediates. Many cyclization mechanisms require stereospecific deprotonation and reprotonation steps, and most cyclization cascades are terminated by deprotonation to yield an olefin product. The first bacterial terpenoid cyclase to yield a crystal structure was pentalenene synthase from Streptomyces exfoliatus UC5319. This cyclase generates the hydrocarbon precursor of the pentalenolactone family of antibiotics. The structures of pentalenene synthase and other terpenoid cyclases reveal predominantly nonpolar active sites typically lacking amino acid side chains capable of serving general base-general acid functions. What chemical species, then, enables the Brønsted acid-base chemistry required in the catalytic mechanisms of these enzymes? The most likely candidate for such general base-general acid chemistry is the co-product inorganic pyrophosphate. Here, we briefly review biological and nonbiological systems in which phosphate and its derivatives serve general base and general acid functions in catalysis. These examples highlight the fact that the Brønsted acid-base activities of phosphate derivatives are comparable to the Brønsted acid-base activities of amino acid side chains.

  8. Using Triple Helix Forming Peptide Nucleic Acids for Sequence-selective Recognition of Double-stranded RNA

    OpenAIRE

    Hnedzko, Dziyana; Cheruiyot, Samwel K.; Rozners, Eriks

    2014-01-01

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple helix forming peptide nucleic acids (PNAs) that bind in the major grov...

  9. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids

    OpenAIRE

    Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

    2015-01-01

    Background Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. Results The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of thre...

  10. Carbon-based strong solid acid for cornstarch hydrolysis

    Science.gov (United States)

    Nata, Iryanti Fatyasari; Irawan, Chairul; Mardina, Primata; Lee, Cheng-Kang

    2015-10-01

    Highly sulfonated carbonaceous spheres with diameter of 100-500 nm can be generated by hydrothermal carbonization of glucose in the presence of hydroxyethylsulfonic acid and acrylic acid at 180 °C for 4 h. The acidity of the prepared carbonaceous sphere C4-SO3H can reach 2.10 mmol/g. It was used as a solid acid catalyst for the hydrolysis of cornstarch. Total reducing sugar (TRS) concentration of 19.91 mg/mL could be obtained by hydrolyzing 20 mg/mL cornstarch at 150 °C for 6 h using C4-SO3H as solid acid catalyst. The solid acid catalyst demonstrated good stability that only 9% decrease in TRS concentration was observed after five repeat uses. The as-prepared carbon-based solid acid catalyst can be an environmentally benign replacement for homogeneous catalyst.

  11. Amino acid sequence and glycosylation of functional unit RtH2-e from Rapana thomasiana (gastropod) hemocyanin.

    Science.gov (United States)

    Stoeva, Stanka; Idakieva, Krasimira; Betzel, Christian; Genov, Nicolay; Voelter, Wolfgang

    2002-03-15

    The complete amino acid sequence of Rapana thomasiana hemocyanin functional unit RtH2-e was determined by direct sequencing and matrix-assisted laser desorption ionization mass spectrometry of peptides obtained by cleavage with EndoLysC proteinase, chymotrypsin, and trypsin. The single-polypeptide chain of RtH2-e consists of 413 amino acid residues and contains two consensus sequences NXS/T (positions 11-19 and 127-129), potential sites for N-glycosylation. Monosaccharide analysis of RtH2-e revealed a carbohydrate content of about 1.1% and the presence of xylose, fucose, mannose, and N-acetylglucosamine, demonstrating that only N-linked carbohydrate chains of high-mannose type seem to be present. On basis of the monosaccharide composition and MALDI-MS analysis of native and PNGase-F-treated chymotryptic glycopeptide fragment of RtH2-e the oligosaccharide Man(5)GlcNAc(2), attached to Asn(127), is suggested. Multiple sequence alignments with other molluscan hemocyanin e functional units revealed an identity of 63% to the cephalopod Octopus dofleini and of 69% to the gastropod Haliotis tuberculata. The present results are discussed in view of the recently determined X-ray structure of the functional unit g of the O. dofleini hemocyanin. PMID:11888200

  12. Reactive Distillation for Esterification of Bio-based Organic Acids

    Energy Technology Data Exchange (ETDEWEB)

    Fields, Nathan; Miller, Dennis J.; Asthana, Navinchandra S.; Kolah, Aspi K.; Vu, Dung; Lira, Carl T.

    2008-09-23

    The following is the final report of the three year research program to convert organic acids to their ethyl esters using reactive distillation. This report details the complete technical activities of research completed at Michigan State University for the period of October 1, 2003 to September 30, 2006, covering both reactive distillation research and development and the underlying thermodynamic and kinetic data required for successful and rigorous design of reactive distillation esterification processes. Specifically, this project has led to the development of economical, technically viable processes for ethyl lactate, triethyl citrate and diethyl succinate production, and on a larger scale has added to the overall body of knowledge on applying fermentation based organic acids as platform chemicals in the emerging biorefinery. Organic acid esters constitute an attractive class of biorenewable chemicals that are made from corn or other renewable biomass carbohydrate feedstocks and replace analogous petroleum-based compounds, thus lessening U.S. dependence on foreign petroleum and enhancing overall biorefinery viability through production of value-added chemicals in parallel with biofuels production. Further, many of these ester products are candidates for fuel (particularly biodiesel) components, and thus will serve dual roles as both industrial chemicals and fuel enhancers in the emerging bioeconomy. The technical report from MSU is organized around the ethyl esters of four important biorenewables-based acids: lactic acid, citric acid, succinic acid, and propionic acid. Literature background on esterification and reactive distillation has been provided in Section One. Work on lactic acid is covered in Sections Two through Five, citric acid esterification in Sections Six and Seven, succinic acid in Section Eight, and propionic acid in Section Nine. Section Ten covers modeling of ester and organic acid vapor pressure properties using the SPEAD (Step Potential

  13. Effect of acute acid loading on acid-base and calcium metabolism

    DEFF Research Database (Denmark)

    Osther, Palle J

    2006-01-01

    OBJECTIVE: To investigate the acid-base and calcium metabolic responses to acute non-carbonic acid loading in idiopathic calcium stone-formers and healthy males using a quantitative organ physiological approach. MATERIAL AND METHODS: Five-h ammonium chloride loading studies were performed in 12 m...

  14. A homogeneous nucleic acid hybridization assay based on strand displacement.

    OpenAIRE

    Vary, C P

    1987-01-01

    A homogeneous nucleic acid hybridization assay which is conducted in solution and requires no separation steps is described. The assay is based on the concept of strand displacement. In the strand displacement assay, an RNA "signal strand" is hybridized within a larger DNA strand termed the "probe strand", which is, in turn, complementary to the target nucleic acid of interest. Hybridization of the target nucleic acid with the probe strand ultimately results in displacement of the RNA signal ...

  15. Cloning, Sequence Analysis, and Expression in Escherichia coli of the Gene Encoding an α-Amino Acid Ester Hydrolase from Acetobacter turbidans

    NARCIS (Netherlands)

    Polderman-Tijmes, Jolanda J.; Jekel, P; de Vries, Erik; van Merode, Annet; Floris, René; Laan, Jan-Metske van der; Sonke, Theo; Janssen, Dick B.

    2002-01-01

    The α-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing β-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified α-amino acid ester hydrolase allowed cloning and genetic characterization of the

  16. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    Science.gov (United States)

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  17. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Science.gov (United States)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  18. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans

    NARCIS (Netherlands)

    Polderman-Tijmes, JJ; Jekel, PA; de Vries, EJ; van Merode, Annet; Floris, R; van der Laan, JM; Sonke, T; Janssen, DB

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterizat

  19. Phylogeny of Vibrio cholerae Based on recA Sequence

    OpenAIRE

    Stine, O. Colin; Sozhamannan, Shanmuga; Gou, Qing; Zheng, Siqen; Morris, J. Glenn; Johnson, Judith A.

    2000-01-01

    We sequenced a 705-bp fragment of the recA gene from 113 Vibrio cholerae strains and closely related species. One hundred eighty-seven nucleotides were phylogenetically informative, 55 were phylogenetically uninformative, and 463 were invariant. Not unexpectedly, Vibrio parahaemolyticus and Vibrio vulnificus strains formed out-groups; we also identified isolates which resembled V. cholerae biochemically but which did not cluster with V. cholerae. In many instances, V. cholerae serogroup desig...

  20. Identification Lactic acid bacteria isolated from mongolia fermented milk products bases on 16S rDNA sequence analysis and tuf-RFLP technology%利用16S rDNA序列及tuf-RFLP鉴定蒙古国发酵乳中的乳酸菌

    Institute of Scientific and Technical Information of China (English)

    王宏梅; 于洁; 包秋华; 吕嫱; 孟和毕力格; 张和平

    2011-01-01

    One hundred and ten strains of lactic acid bacteria (LAB) isolated from twenty five samples of naturally fermented yoghurt in Dza-vhan of Mongolia were classified by 16S rDNA sequence analysis and tuf- RFLP. They were preliminary identified as Streptococcus thermophilus (41 strains), Lactobacillus helvetkus (40 strains), Lactobarillus delbrueckii subsp. Bulgaricus (11 strains), Lactobadllus fermentum (2 strains), Leuconostoc mesenteroides subsp. Mesenteroides (2 strains), Lactococcus lactis subsp. Lactis (1 strain) and Lactobadllus casei group (12 strains) by 16S rDNA sequence analysis. However, the 16S rDNA sequence of Lactobadllus casei group existed very small difference, tuf-RFLP was used to confirm results given by 16S rDNA sequencing. The tuf- RFLP profile pattern revealed that the twelve strains were accurately identified as Lactobadllus casei.%运用16S rDNA序列分析和tuf-RFLP技术对采于蒙古国扎布汗省的25份发酵乳样中分离出的110株乳酸菌进行鉴定.首先将分离的110株乳酸菌的16S rRNA基因进行扩增,测序并构建系统发育树,初步鉴定为41株嗜热链球菌,40株瑞士乳杆菌,11株德氏乳杆菌保加利亚亚种,2株发酵乳杆菌,1株乳明串珠菌,2株肠膜明串肠膜亚种,1株乳酸乳球乳酸亚种和12株属于干酪族的菌株.由于干酪乳杆菌族的16S rDNA序列差异很小,故采用tuf-RFLP 技术对这12株进行了进一步的验证,通过分离菌株与模式菌株tuf-RFLP图谱的比较分析,结果表明这12株菌均为干酪乳杆菌.

  1. Carbon-based strong solid acid for cornstarch hydrolysis

    Energy Technology Data Exchange (ETDEWEB)

    Nata, Iryanti Fatyasari, E-mail: yanti_tkunlam@yahoo.com [Chemical Engineering Study Program, Faculty of Engineering, Lambung Mangkurat University, Jl. A. Yani Km. 36 Banjarbaru, South Kalimantan 70714 (Indonesia); Irawan, Chairul; Mardina, Primata [Chemical Engineering Study Program, Faculty of Engineering, Lambung Mangkurat University, Jl. A. Yani Km. 36 Banjarbaru, South Kalimantan 70714 (Indonesia); Lee, Cheng-Kang, E-mail: cklee@mail.ntust.edu.tw [Department of Chemical Engineering, National Taiwan University of Science and Technology, 43 Keelung Rd. Sec.4, Taipei 106, Taiwan (China)

    2015-10-15

    Highly sulfonated carbonaceous spheres with diameter of 100–500 nm can be generated by hydrothermal carbonization of glucose in the presence of hydroxyethylsulfonic acid and acrylic acid at 180 °C for 4 h. The acidity of the prepared carbonaceous sphere C4-SO{sub 3}H can reach 2.10 mmol/g. It was used as a solid acid catalyst for the hydrolysis of cornstarch. Total reducing sugar (TRS) concentration of 19.91 mg/mL could be obtained by hydrolyzing 20 mg/mL cornstarch at 150 °C for 6 h using C4-SO{sub 3}H as solid acid catalyst. The solid acid catalyst demonstrated good stability that only 9% decrease in TRS concentration was observed after five repeat uses. The as-prepared carbon-based solid acid catalyst can be an environmentally benign replacement for homogeneous catalyst. - Highlights: • Carbon solid acid was successfully prepared by one-step hydrothermal carbonization. • The acrylic acid as monomer was effectively reduce the diameter size of particle. • The solid acid catalyst show good catalytic performance of starch hydrolysis. • The solid acid catalyst is not significantly deteriorated after repeated use.

  2. mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences

    Science.gov (United States)

    2013-01-01

    Background Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. Results Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. Conclusions mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly. PMID:24451012

  3. Nucleic Acid-Based Nanodevices in Biological Imaging.

    Science.gov (United States)

    Chakraborty, Kasturi; Veetil, Aneesh T; Jaffrey, Samie R; Krishnan, Yamuna

    2016-06-01

    The nanoscale engineering of nucleic acids has led to exciting molecular technologies for high-end biological imaging. The predictable base pairing, high programmability, and superior new chemical and biological methods used to access nucleic acids with diverse lengths and in high purity, coupled with computational tools for their design, have allowed the creation of a stunning diversity of nucleic acid-based nanodevices. Given their biological origin, such synthetic devices have a tremendous capacity to interface with the biological world, and this capacity lies at the heart of several nucleic acid-based technologies that are finding applications in biological systems. We discuss these diverse applications and emphasize the advantage, in terms of physicochemical properties, that the nucleic acid scaffold brings to these contexts. As our ability to engineer this versatile scaffold increases, its applications in structural, cellular, and organismal biology are clearly poised to massively expand. PMID:27294440

  4. Nucleic acid based fluorescent sensor for mercury detection

    Science.gov (United States)

    Lu, Yi; Liu, Juewen

    2013-02-05

    A nucleic acid enzyme comprises an oligonucleotide containing thymine bases. The nucleic acid enzyme is dependent on both Hg.sup.2+and a second ion as cofactors, to produce a product from a substrate. The substrate comprises a ribonucleotide, a deoxyribonucleotide, or both.

  5. INDUS - a composition-based approach for rapid and accurate taxonomic classification of metagenomic sequences

    OpenAIRE

    Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Reddy, Rachamalla Maheedhar; Reddy, Chennareddy Venkata Siva Kumar; Singh, Nitin Kumar; Sharmila S Mande

    2011-01-01

    Background Taxonomic classification of metagenomic sequences is the first step in metagenomic analysis. Existing taxonomic classification approaches are of two types, similarity-based and composition-based. Similarity-based approaches, though accurate and specific, are extremely slow. Since, metagenomic projects generate millions of sequences, adopting similarity-based approaches becomes virtually infeasible for research groups having modest computational resources. In this study, we present ...

  6. Sequence specificity of mutagen-nucleic acid complexes in solution: intercalation and mutagen-base pair overlap geometries for proflavine binding to dC-dC-dG-dG and dG-dG-dC-dC self-complementary duplexes.

    Science.gov (United States)

    Patel, D J; Canuel, L L

    1977-07-01

    The complex formed between the mutagen proflavine and the dC-dC-dG-dG and dG-dG-dC-dC self-complementary tetranucleotide duplexes has been monitored by proton high resolution nuclear magnetic resonance spectroscopy in 0.1 M phosphate solution at high nucleotide/drug ratios. The large upfield shifts (0.5 to 0.85 ppm) observed at all the proflavine ring nonexchangeable protons on complex formation are consistent with intercalation of the mutagen between base pairs of the tetranucleotide duplex. We have proposed an approximate overlap geometry between the proflavine ring and nearest neighbor base pairs at the intercalation site from a comparison between experimental shifts and those calculated for various stacking orientations. We have compared the binding of actinomycin D, propidium diiodide, and proflavine to self-complementary tetranucleotide sequences dC-dC-dG-dG and dG-dG-dC-dC by UV absorbance changes in the drug bands between 400 and 500 nm. Actinomycin D exhibits a pronounced specificity for sequences with dG-dC sites (dG-dG-dC-dC), while propidium diiodide and proflavine exhibit a specificity for sequences with dC-dG sites (dC-dC-dG-dG). Actinomycin D binds more strongly than propidium diiodide and proflavine to dC-dG-dC-dG (contains dC-dG and dG-dC binding sites), indicative of the additional stabilization from hydrogen bonding and hydrophobic interactions between the pentapeptide lactone rings of actinomycin D and the base pair edges and sugar-phosphate backbone of the tetranucleotide duplex.

  7. Comparison of illumina and 454 deep sequencing in participants failing raltegravir-based antiretroviral therapy.

    Directory of Open Access Journals (Sweden)

    Jonathan Z Li

    Full Text Available The impact of raltegravir-resistant HIV-1 minority variants (MVs on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs.A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegravir resistance at the time of virologic failure. A control library was created by pooling integrase clones at predefined proportions. Multiplexed sequencing was performed with Illumina and 454 platforms at comparable costs. Illumina sequence analysis was performed with the novel snp-assess tool and 454 sequencing was analyzed with V-Phaser.Illumina sequencing resulted in significantly higher sequence coverage and a 0.095% limit of detection. Illumina accurately detected all MVs in the control library at ≥0.5% and 7/10 MVs expected at 0.1%. 454 sequencing failed to detect any MVs at 0.1% with 5 false positive calls. For MVs detected in the patient samples by both 454 and Illumina, the correlation in the detected variant frequencies was high (R2 = 0.92, P<0.001. Illumina sequencing detected 2.4-fold greater nucleotide MVs and 2.9-fold greater amino acid MVs compared to 454. The only raltegravir-resistant MV detected was an E138K mutation in one participant by Illumina sequencing, but not by 454.In participants of A5262 with raltegravir resistance at virologic failure, baseline raltegravir-resistant MVs were rarely detected. At comparable costs to 454 sequencing, Illumina demonstrated greater depth of coverage, increased sensitivity for detecting HIV MVs, and fewer false positive variant calls.

  8. DNA Lossless Differential Compression Algorithm based on Similarity of Genomic Sequence Database

    CERN Document Server

    Afify, Heba; Wahed, Manal Abdel

    2011-01-01

    Modern biological science produces vast amounts of genomic sequence data. This is fuelling the need for efficient algorithms for sequence compression and analysis. Data compression and the associated techniques coming from information theory are often perceived as being of interest for data communication and storage. In recent years, a substantial effort has been made for the application of textual data compression techniques to various computational biology tasks, ranging from storage and indexing of large datasets to comparison of genomic databases. This paper presents a differential compression algorithm that is based on production of difference sequences according to op-code table in order to optimize the compression of homologous sequences in dataset. Therefore, the stored data are composed of reference sequence, the set of differences, and differences locations, instead of storing each sequence individually. This algorithm does not require a priori knowledge about the statistics of the sequence set. The...

  9. Syntheses and characterizations of three acid-base supramolecular complexes

    International Nuclear Information System (INIS)

    Three acid-base compounds with supramolecular architectures, namely, (1,2-H2bdc)(dmt) (1), (trans-1,4-H2ccdc)0.5(phdat) (2) and (1,3-H2bdc)(phdat) (3) (1,2-H2bdc = 1,2-benzenedicarboxylic acid, trans-1, 4-H2ccdc = trans-1, 4-cyclohexanedicarboxylic acid, 1,3-H2bdc = 1,3-benzenedicarboxylic acid, dmt = 2,4-diamino-6-methyl-s-triazine, phdat = 2,4-diamino-6-phenyl-s-triazine) have been synthesized and characterized by IR spectra, elemental analyses, single-crystal X-ray diffractions and TGA. (author)

  10. A locked nucleic Acid-based nanocrawler

    DEFF Research Database (Denmark)

    Astakhova, I Kira; Pasternak, Karol; Campbell, Meghan A;

    2013-01-01

    excimer formation and pyrene-perylene interstrand Förster resonance energy transfer. We furthermore demonstrate that the nanocrawler selectively and reversibly moves along the road, followed by a bright and consistent fluorescence response for up to 10 cycles without any loss of signal.......Herein we introduce a novel fluorescent LNA/DNA machine, a nanocrawler, which reversibly moves along a directionally polar complementary road controlled by affinity-enhancing locked nucleic acid (LNA) monomers and additional regulatory strands. Polyaromatic hydrocarbon (PAH) dyes attached to 2...

  11. Cluster-Based Multipolling Sequencing Algorithm for Collecting RFID Data in Wireless LANs

    Science.gov (United States)

    Choi, Woo-Yong; Chatterjee, Mainak

    2015-03-01

    With the growing use of RFID (Radio Frequency Identification), it is becoming important to devise ways to read RFID tags in real time. Access points (APs) of IEEE 802.11-based wireless Local Area Networks (LANs) are being integrated with RFID networks that can efficiently collect real-time RFID data. Several schemes, such as multipolling methods based on the dynamic search algorithm and random sequencing, have been proposed. However, as the number of RFID readers associated with an AP increases, it becomes difficult for the dynamic search algorithm to derive the multipolling sequence in real time. Though multipolling methods can eliminate the polling overhead, we still need to enhance the performance of the multipolling methods based on random sequencing. To that extent, we propose a real-time cluster-based multipolling sequencing algorithm that drastically eliminates more than 90% of the polling overhead, particularly so when the dynamic search algorithm fails to derive the multipolling sequence in real time.

  12. Research on Nucleic Acid Sequence Design Methods for DNA Computing%DNA计算中核酸序列设计方法比较研究

    Institute of Scientific and Technical Information of China (English)

    张凯; 耿修堂; 肖建华; 赵东明

    2008-01-01

    DNA computing maps the instances of a reality problem onto specific nucleic acid mol-ecules and protocols so that the result contains the answers to the problem's instances to enable successful extraction.Good DNA sequences prevent unwanted hybridization errors during the computation and enable easy retrieval the answers in the extraction phase.At first,the paper in-troduces two typical nucleic acid sequence methods,free energy based method and Hamming dis-tance based method,and analyzes the influence of constraints on the quality of DNA sequences.In addition,the paper has compared the integrity and computational cost of two methods for ex-cluding the non-specific hybridization sequence.At last,the paper analyzes the efficiency of two nucleic acid sequence design methods.Through the comparison we can see that the Hamming dis-tance based DNA sequence design method can only approximately estimate the thermodynamic stability of DNA hybridization,and can not be a perfect substitute for the minimum free energy method.Further analysis shows that when only consider the free energy of Watson-crick base pairs,the calculation of free energy will be equal to the Hamming distance method.So,if the precision satisfied the requirement of experiment,Hamming distance based DNA sequence design method,not only can effectively distinguish between specific and non-specific hybridization,and effectively reduce the amount of calculation,improve the efficiency of the DNA sequence designing.%DNA计算是将现实问题进行编码,映射到DNA分子上,然后通过分子生物实验产生出代表问题解的DNA分子.最后通过检测技术提取出该DNA分子.高质量的DNA编码可以尽可能避免或减少计算过程中出现的错误,并使检测阶段易于提取出代表问题解的DNA分子.文中对基于汉明距离和基于自由能的DNA核酸编码方法进行研究,分析了两类方法的约束条件对DNA编码质量的影响,比较了两类方法排除非特异性

  13. Combined sequence and sequence-structure based methods for analyzing FGF23, CYP24A1 and VDR genes.

    Science.gov (United States)

    Nagamani, Selvaraman; Singh, Kh Dhanachandra; Muthusamy, Karthikeyan

    2016-09-01

    FGF23, CYP24A1 and VDR altogether play a significant role in genetic susceptibility to chronic kidney disease (CKD). Identification of possible causative mutations may serve as therapeutic targets and diagnostic markers for CKD. Thus, we adopted both sequence and sequence-structure based SNP analysis algorithm in order to overcome the limitations of both methods. We explore the functional significance towards the prediction of risky SNPs associated with CKD. We assessed the performance of four widely used pathogenicity prediction methods. We compared the performances of the programs using Mathews correlation Coefficient ranged from poor (MCC = 0.39) to reasonably good (MCC = 0.42). However, we got the best results for the combined sequence and structure based analysis method (MCC = 0.45). 4 SNPs from FGF23 gene, 8 SNPs from VDR gene and 13 SNPs from CYP24A1 gene were predicted to be the causative agents for human diseases. This study will be helpful in selecting potential SNPs for experimental study from the SNP pool and also will reduce the cost for identification of potential SNPs as a genetic marker. PMID:27114920

  14. H.264 MOTION ESTIMATION ALGORITHM BASED ON VIDEO SEQUENCES ACTIVITY

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Motion estimation is an important part of H.264/AVC encoding progress, with high computational complexity. Therefore, it is quite necessary to find a fast motion estimation algorithm for real-time applications. The algorithm proposed in this letter adjudges the macroblocks activity degree first; then classifies different video sequences, and applies different search strategies according to the result. Experiments show that this method obtains almost the same video quality with the Full Search (FS) algorithm but with reduced more than 95% computation cost.

  15. An Olfactory Indicator for Acid-Base Titrations.

    Science.gov (United States)

    Flair, Mark N.; Setzer, William N.

    1990-01-01

    The use of an olfactory acid-base indicator in titrations for visually impaired students is discussed. Potential olfactory indicators include eugenol, thymol, vanillin, and thiophenol. Titrations performed with each indicator with eugenol proved to be successful. (KR)

  16. Acid-base strengths in m-cresol

    NARCIS (Netherlands)

    Bos, M.; Dahmen, E.A.M.F.

    1971-01-01

    For various acids and bases dissociation constants were determined conductimetrically in m-cresol. A glass electrode was calibrated by means of some compounds with dissociation constants known from conductivity measurements. Potentiometric titrations with this calibrated glass electrode gave dissoci

  17. Towards lactic acid bacteria-based biorefineries.

    Science.gov (United States)

    Mazzoli, Roberto; Bosco, Francesca; Mizrahi, Itzhak; Bayer, Edward A; Pessione, Enrica

    2014-11-15

    Lactic acid bacteria (LAB) have long been used in industrial applications mainly as starters for food fermentation or as biocontrol agents or as probiotics. However, LAB possess several characteristics that render them among the most promising candidates for use in future biorefineries in converting plant-derived biomass-either from dedicated crops or from municipal/industrial solid wastes-into biofuels and high value-added products. Lactic acid, their main fermentation product, is an attractive building block extensively used by the chemical industry, owing to the potential for production of polylactides as biodegradable and biocompatible plastic alternative to polymers derived from petrochemicals. LA is but one of many high-value compounds which can be produced by LAB fermentation, which also include biofuels such as ethanol and butanol, biodegradable plastic polymers, exopolysaccharides, antimicrobial agents, health-promoting substances and nutraceuticals. Furthermore, several LAB strains have ascertained probiotic properties, and their biomass can be considered a high-value product. The present contribution aims to provide an extensive overview of the main industrial applications of LAB and future perspectives concerning their utilization in biorefineries. Strategies will be described in detail for developing LAB strains with broader substrate metabolic capacity for fermentation of cheaper biomass.

  18. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Science.gov (United States)

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  19. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Science.gov (United States)

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  20. A sequence-based variation map of zebrafish.

    Science.gov (United States)

    Patowary, Ashok; Purkanti, Ramya; Singh, Meghna; Chauhan, Rajendra; Singh, Angom Ramcharan; Swarnkar, Mohit; Singh, Naresh; Pandey, Vikas; Torroja, Carlos; Clark, Matthew D; Kocher, Jean-Pierre; Clark, Karl J; Stemple, Derek L; Klee, Eric W; Ekker, Stephen C; Scaria, Vinod; Sivasubbu, Sridhar

    2013-03-01

    Zebrafish (Danio rerio) is a popular vertebrate model organism largely deployed using outbred laboratory animals. The nonisogenic nature of the zebrafish as a model system offers the opportunity to understand natural variations and their effect in modulating phenotype. In an effort to better characterize the range of natural variation in this model system and to complement the zebrafish reference genome project, the whole genome sequence of a wild zebrafish at 39-fold genome coverage was determined. Comparative analysis with the zebrafish reference genome revealed approximately 5.2 million single nucleotide variations and over 1.6 million insertion-deletion variations. This dataset thus represents a new catalog of genetic variations in the zebrafish genome. Further analysis revealed selective enrichment for variations in genes involved in immune function and response to the environment, suggesting genome-level adaptations to environmental niches. We also show that human disease gene orthologs in the sequenced wild zebrafish genome show a lower ratio of nonsynonymous to synonymous single nucleotide variations.

  1. DNA sequence-based analysis of the Pseudomonas species.

    Science.gov (United States)

    Mulet, Magdalena; Lalucat, Jorge; García-Valdés, Elena

    2010-06-01

    Partial sequences of four core 'housekeeping' genes (16S rRNA, gyrB, rpoB and rpoD) of the type strains of 107 Pseudomonas species were analysed in order to obtain a comprehensive view regarding the phylogenetic relationships within the Pseudomonas genus. Gene trees allowed the discrimination of two lineages or intrageneric groups (IG), called IG P. aeruginosa and IG P. fluorescens. The first IG P. aeruginosa, was divided into three main groups, represented by the species P. aeruginosa, P. stutzeri and P. oleovorans. The second IG was divided into six groups, represented by the species P. fluorescens, P. syringae, P. lutea, P. putida, P. anguilliseptica and P. straminea. The P. fluorescens group was the most complex and included nine subgroups, represented by the species P. fluorescens, P. gessardi, P. fragi, P. mandelii, P. jesseni, P. koreensis, P. corrugata, P. chlororaphis and P. asplenii. Pseudomonas rhizospherae was affiliated with the P. fluorescens IG in the phylogenetic analysis but was independent of any group. Some species were located on phylogenetic branches that were distant from defined clusters, such as those represented by the P. oryzihabitans group and the type strains P. pachastrellae, P. pertucinogena and P. luteola. Additionally, 17 strains of P. aeruginosa, 'P. entomophila', P. fluorescens, P. putida, P. syringae and P. stutzeri, for which genome sequences have been determined, have been included to compare the results obtained in the analysis of four housekeeping genes with those obtained from whole genome analyses.

  2. LookSeq: A browser-based viewer for deep sequencing data

    OpenAIRE

    Manske, Heinrich Magnus; Dominic P Kwiatkowski

    2009-01-01

    Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an ov...

  3. Neural network predicts sequence of TP53 gene based on DNA chip

    DEFF Research Database (Denmark)

    Spicker, J.S.; Wikman, F.; Lu, M.L.;

    2002-01-01

    We have trained an artificial neural network to predict the sequence of the human TP53 tumor suppressor gene based on a p53 GeneChip. The trained neural network uses as input the fluorescence intensities of DNA hybridized to oligonucleotides on the surface of the chip and makes between zero...... and four errors in the predicted 1300 bp sequence when tested on wild-type TP53 sequence....

  4. Parallel divide and conquer bio-sequence comparison based on Smith-Waterman algorithm

    Institute of Scientific and Technical Information of China (English)

    ZHANG Fa; QIAO Xiangzhen; LIU Zhiyong

    2004-01-01

    Tools for pair-wise bio-sequence alignment have for long played a central role in computation biology. Several algorithms for bio-sequence alignment have been developed. The Smith-Waterman algorithm, based on dynamic programming, is considered the most fundamental alignment algorithm in bioinformatics. However the existing parallel Smith-Waterman algorithm needs large memory space, and this disadvantage limits the size of a sequence to be handled. As the data of biological sequences expand rapidly, the memory requirement of the existing parallel SmithWaterman algorithm has become a critical problem. For solving this problem, we develop a new parallel bio-sequence alignment algorithm, using the strategy of divide and conquer, named PSW-DC algorithm. In our algorithm, first, we partition the query sequence into several subsequences and distribute them to every processor respectively,then compare each subsequence with the whole subject sequence in parallel, using the Smith-Waterman algorithm, and get an interim result, finally obtain the optimal alignment between the query sequence and subject sequence, through the special combination and extension method. Memory space required in our algorithm is reduced significantly in comparison with existing ones. We also develop a key technique of combination and extension, named the C&E method, to manipulate the interim results and obtain the final sequences alignment. We implement the new parallel bio-sequences alignment algorithm,the PSW-DC, in a cluster parallel system.

  5. Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus.

    Science.gov (United States)

    Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

    2012-01-01

    Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function.

  6. Synthesis and antimicrobial activities of new higher amino acid Schiff base derivatives of 6-aminopenicillanic acid and 7-aminocephalosporanic acid

    Science.gov (United States)

    Özdemir (nee Güngör), Özlem; Gürkan, Perihan; Özçelik, Berrin; Oyardı, Özlem

    2016-02-01

    Novel β-lactam derivatives (1c-3c) (1d-3d) were produced by using 6-aminopenicillanic acid (6-APA), 7-aminocephalosporanic acid (7-ACA) and the higher amino acid Schiff bases. The synthesized compounds were characterized by elemental analysis, IR, 1H/13C NMR and UV-vis spectra. Antibacterial activities of all the higher amino acid Schiff bases (1a-3a) (1b-3b) and β-lactam derivatives were screened against three gram negative bacteria (Escherichia coli ATCC 25922, Pseudomonas aeruginosa ATCC 27853, Acinetobacter baumannii RSKK 02026), three gram positive bacteria (Staphylococcus aureus ATCC 25923, Enterococcus faecalis ATCC 07005, Bacillus subtilis ATCC 6633) and their drug-resistant isolates by using broth microdilution method. Two fungi (Candida albicans and Candida krusei) were used for antifungal activity.

  7. Acid-base homeostasis in the human system

    Science.gov (United States)

    White, R. J.

    1974-01-01

    Acid-base regulation is a cooperative phenomena in vivo with body fluids, extracellular and intracellular buffers, lungs, and kidneys all playing important roles. The present account is much too brief to be considered a review of present knowledge of these regulatory systems, and should be viewed, instead, as a guide to the elements necessary to construct a simple model of the mutual interactions of the acid-base regulatory systems of the body.

  8. Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

    KAUST Repository

    Zhang, Zhang

    2010-11-08

    Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution and a large number of publically-available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge.Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences.Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms.Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.

  9. Synthesis of polyacrylic-acid-based thermochromic polymers

    Science.gov (United States)

    Srivastava, Jyoti; Alam, Sarfaraz; Mathur, G. N.

    2003-10-01

    Smart materials respond to environmental stimuli with particular changes in some variables (for example temperature, pressure and electric field etc), for that reason they are often called responsive materials. In the present work, we have synthesized thermochromic polymer based on poly acrylic acid cobalt chloride (CoCl2) and phosphoric acid (H3PO4) that visually and reversibly changes color in the temperature range (70 - 130°C). These thermochromic materials can be used as visual sensors of temperature. Thermochromic polymers are based on polyacrylic acid and CoCl2 complex.

  10. "De-novo" amino acid sequence elucidation of protein G'e by combined "Top-Down" and "Bottom-Up" mass spectrometry

    Science.gov (United States)

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F. M.; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L.; Glocker, Michael O.

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein Ǵ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α- N-gluconoylation and α- N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α- N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant ( K d ) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.

  11. Amino acid sequence of phospholipase A/sub 2/-. cap alpha. from the venom of Crotalus adamanteus

    Energy Technology Data Exchange (ETDEWEB)

    Heinrikson, R.L.; Krueger, E.T.; Keim, P.S.

    1977-07-25

    The complete amino acid sequence of Crotalus adamanteus venom phospholipase A/sub 2/-..cap alpha.. has been determined by analysis of the five tryptic peptides from the citraconylated, reduced, and S-(/sup 14/C)carboxamidomethylated enzyme. Earlier studies provided the information necessary to align the tryptic fragments so that secondary cleavage procedures to establish overlaps were unnecessary. The subunit in the phospholipase A/sub 2/-..cap alpha.. dimer is a single polypeptide chain containing 122 amino acids and seven disulfide bonds. The histidine residue implicated in the active site of mammalian phospholipases is at position 47 in the C. adamanteus enzyme and is located in a domain of the molecule which is highly homologous in sequence with corresponding regions of phospholipases from a variety of venom and pancreatic sources. Comparative sequence analysis has revealed insights with regard to the function and evolution of phospholipases A/sub 2/. Primary structural relationships observed among the snake venom enzymes parallel the phylogenetic classification of the venomous reptiles from which they were derived. It is proposed that phospholipases A/sub 2/ of this general type be divided into two groups depending upon the presence or absence of distinctive structural features elucidated in this study.

  12. Amino acid sequence and posttranslational modifications of human factor VIIa from plasma and transfected baby hamster kidney cells

    International Nuclear Information System (INIS)

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VIIa, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca2+ and tissue factor. Three types of potential posttranslational modifications exist in the human factor VIIa molecule, namely, 10 γ-carboxylated, N-terminally located glutamic acid residues, 1 β-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VIIa as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VIIa. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VIIa was found to be identical with human factor VIIa. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VIIa. In the recombinant factor VIIa, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VIIa and human plasma factor VIIa. These results show that factor VIIa as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VIIa and that this cell line thus might represent an alternative source for human factor VIIa

  13. Characterization of DNA-binding sequences for CcaR in the cephamycin-clavulanic acid supercluster of Streptomyces clavuligerus.

    Science.gov (United States)

    Santamarta, I; López-García, M T; Kurt, A; Nárdiz, N; Alvarez-Álvarez, R; Pérez-Redondo, R; Martín, J F; Liras, P

    2011-08-01

    RT-PCR analysis of the genes in the clavulanic acid cluster revealed three transcriptional polycistronic units that comprised the ceaS2-bls2-pah2-cas2, cyp-fd-orf12-orf13 and oppA2-orf16 genes, whereas oat2, car, oppA1, claR, orf14, gcaS and pbpA were expressed as monocistronic transcripts. Quantitative RT-PCR of Streptomyces clavuligerus ATCC 27064 and the mutant S. clavuligerus ccaR::aph showed that, in the mutant, there was a 1000- to 10,000-fold lower transcript level for the ceaS2 to cas2 polycistronic transcript that encoded CeaS2, the first enzyme of the clavulanic acid pathway that commits arginine to clavulanic acid biosynthesis. Smaller decreases in expression were observed in the ccaR mutant for other genes in the cluster. Two-dimensional electrophoresis and MALDI-TOF analysis confirmed the absence in the mutant strain of proteins CeaS2, Bls2, Pah2 and Car that are required for clavulanic acid biosynthesis, and CefF and IPNS that are required for cephamycin biosynthesis. Gel shift electrophoresis using recombinant r-CcaR protein showed that it bound to the ceaS2 and claR promoter regions in the clavulanic acid cluster, and to the lat, cefF, cefD-cmcI and ccaR promoter regions in the cephamycin C gene cluster. Footprinting experiments indicated that triple heptameric conserved sequences were protected by r-CcaR, and allowed identification of heptameric sequences as CcaR binding sites.

  14. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    OpenAIRE

    Chen, Chunxian; Gmitter Jr, Fred G

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for...

  15. Sequence-Length Requirement of Distance-Based Phylogeny Reconstruction: Breaking the Polynomial Barrier

    CERN Document Server

    Roch, Sebastien

    2009-01-01

    We introduce a new distance-based phylogeny reconstruction technique which provably achieves, at sufficiently short branch lengths, a polylogarithmic sequence-length requirement -- improving significantly over previous polynomial bounds for distance-based methods. The technique is based on an averaging procedure that implicitly reconstructs ancestral sequences. In the same token, we extend previous results on phase transitions in phylogeny reconstruction to general time-reversible models. More precisely, we show that in the so-called Kesten-Stigum zone (roughly, a region of the parameter space where ancestral sequences are well approximated by ``linear combinations'' of the observed sequences) sequences of length $\\poly(\\log n)$ suffice for reconstruction when branch lengths are discretized. Here $n$ is the number of extant species. Our results challenge, to some extent, the conventional wisdom that estimates of evolutionary distances alone carry significantly less information about phylogenies than full sequ...

  16. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    OpenAIRE

    A.E. El Hakim; W.H. Salama; M.B. Hamed; Ali, A. A.; N.M. Ibrahim

    2015-01-01

    Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including; gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were, 60, 56 kDa and 60, 53 kDa for LAAOI and LAAOII, respectively. The t...

  17. The nucleotide sequence of HLA-B{sup *}2704 reveals a new amino acid substitution in exon 4 which is also present in HLA-B{sup *}2706

    Energy Technology Data Exchange (ETDEWEB)

    Rudwaleit, M.; Bowness, P.; Wordsworth, P. [John Radcliffe Hospital, Oxford (United Kingdom)

    1996-12-31

    The HLA-B27 subtype HLA-B{sup *}2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B{sup *}2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B{sup *}2705, HLA-B{sup *}2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.

  18. Phylogeny of Pelargonium (Geraniaceae) based on DNA sequences from three genomes

    NARCIS (Netherlands)

    Bakker, F.T.; Culham, A.; Hettiarachi, P.; Touloumendidou, T.; Gibby, M.

    2004-01-01

    Phylogenetic hypotheses for the largely South African genus Pelargonium L'Hér. (Geraniaceae) were derived based on DNA sequence data from nuclear, chloroplast and mitochondrial encoded regions. The datasets were unequally represented and comprised cpDNA trnL-F sequences for 152 taxa, nrDNA ITS seque

  19. Genotyping of Histomonas meleagridis isolates based on Internal Transcribed Spacer-1 sequences

    NARCIS (Netherlands)

    H.M.J.F. van der Heijden; W.J.M. Landman; S. Greve; R. Peek

    2006-01-01

    C-profiling is a novel genotyping method for protozoan pathogens, based on polymerase chain reaction and sequencing of AT-rich Internal Transcribed Spacer-1 sequences. It was applied to various Histomonas meleagridis isolates originating from outbreaks of histomoniasis in six Dutch turkey and chicke

  20. Main: Sequences [KOME

    Lifescience Database Archive (English)

    Full Text Available Sequences Amino Acid Sequence Amino Acid sequence of full length cDNA (Longest ORF) kome_ine_full_sequence..._amino_db.fasta.zip kome_ine_full_sequence_amino_db.zip kome_ine_full_sequence_amino_db ...

  1. THE CONSTRUCTIONS OF ALMOST BINARY SEQUENCE PAIRS WITH THREE-LEVEL CORRELATION BASED ON CYCLOTOMY

    Institute of Scientific and Technical Information of China (English)

    Peng Xiuping; Xu Chengqian

    2012-01-01

    In this paper,a new class of almost binary sequence pair with a single zero element is presented.The almost binary sequence pairs with three-level correlation are constructed based on cyclotomic numbers of order 2,4,and 6.Most of them have good correlation and balance property,whose maximum nontrivial correlation magnitudes are 2 and the difference between the numbers of occurrence of +1's and -1's are 0 or 1.In addition,the corresponding binary sequence pairs are investigated as well and we can also get some kinds of binary sequence pairs with optimum balance and good correlation.

  2. A method for amplification of unknown flanking sequences based on touchdown PCR and suppression-PCR.

    Science.gov (United States)

    Gao, Song; He, Dan; Li, Guangquan; Zhang, Yanhua; Lv, Huiying; Wang, Li

    2016-09-15

    Thermal asymmetric staggered PCR is the most widely used technique to obtain the flanking sequences. However, it has some limitations, including a low rate of positivity, and complex operation. In this study, a improved method of it was made based on suppression-PCR and touchdown PCR. The PCR fragment obtained by the amplification was used directly for sequencing after gel purification. Using this improved method, the positive rate of amplified flanking sequences of the ATMT mutants reached 99%. In addition, the time from DNA extraction to flanking sequence analysis was shortened to 2 days with about 6 dollars each sample. PMID:27393656

  3. A method for amplification of unknown flanking sequences based on touchdown PCR and suppression-PCR.

    Science.gov (United States)

    Gao, Song; He, Dan; Li, Guangquan; Zhang, Yanhua; Lv, Huiying; Wang, Li

    2016-09-15

    Thermal asymmetric staggered PCR is the most widely used technique to obtain the flanking sequences. However, it has some limitations, including a low rate of positivity, and complex operation. In this study, a improved method of it was made based on suppression-PCR and touchdown PCR. The PCR fragment obtained by the amplification was used directly for sequencing after gel purification. Using this improved method, the positive rate of amplified flanking sequences of the ATMT mutants reached 99%. In addition, the time from DNA extraction to flanking sequence analysis was shortened to 2 days with about 6 dollars each sample.

  4. An Approach to Assembly Sequence Plannning Based on Hierarchical Strategy and Genetic Algorithm

    Institute of Scientific and Technical Information of China (English)

    Niu Xinwen; Ding Han; Xiong Youlun

    2001-01-01

    Using group and subassembly cluster methods, the hierarchical structure of a product is.generated automatically, which largely reduces the complexity of planning. Based on genetic algofithn the optimal of assembly sequence of each stracture level can be obtained by sequence-bysequence search. As a result, a better assembly sequence of the product can be generated by combining the assembly sequences of all hierarchical structures, which provides more parallelism and flexibility for assembly operations. An industrial example is solved by this new approach.

  5. A Quantitative Tool to Distinguish Isobaric Leucine and Isoleucine Residues for Mass Spectrometry-Based De Novo Monoclonal Antibody Sequencing

    Science.gov (United States)

    Poston, Chloe N.; Higgs, Richard E.; You, Jinsam; Gelfanova, Valentina; Hale, John E.; Knierman, Michael D.; Siegel, Robert; Gutierrez, Jesus A.

    2014-07-01

    De novo sequencing by mass spectrometry (MS) allows for the determination of the complete amino acid (AA) sequence of a given protein based on the mass difference of detected ions from MS/MS fragmentation spectra. The technique relies on obtaining specific masses that can be attributed to characteristic theoretical masses of AAs. A major limitation of de novo sequencing by MS is the inability to distinguish between the isobaric residues leucine (Leu) and isoleucine (Ile). Incorrect identification of Ile as Leu or vice versa often results in loss of activity in recombinant antibodies. This functional ambiguity is commonly resolved with costly and time-consuming AA mutation and peptide sequencing experiments. Here, we describe a set of orthogonal biochemical protocols, which experimentally determine the identity of Ile or Leu residues in monoclonal antibodies (mAb) based on the selectivity that leucine aminopeptidase shows for n-terminal Leu residues and the cleavage preference for Leu by chymotrypsin. The resulting observations are combined with germline frequencies and incorporated into a logistic regression model, called Predictor for Xle Sites (PXleS) to provide a statistical likelihood for the identity of Leu at an ambiguous site. We demonstrate that PXleS can generate a probability for an Xle site in mAbs with 96% accuracy. The implementation of PXleS precludes the expression of several possible sequences and, therefore, reduces the overall time and resources required to go from spectra generation to a biologically active sequence for a mAb when an Ile or Leu residue is in question.

  6. Complete amino acid sequence of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris.

    Science.gov (United States)

    Lehman, L D; Jones, B N; Dwulet, F E; Bogardt, R A; Gurd, F R

    1980-10-21

    The complete primary structure of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at its two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage with trypsin of the citraconylated apomyoglobin at its three arginine residues. Further digestion of the central cyanogen bromide peptide with S. aureus strain V8 protease and the 1,2-cyclohexanedione-treated central cyanogen bromide peptide with trypsin enabled the determination of the remainder of the covalent structure. This myoglobin differs from the cetacean myoglobins determined to date at 12 to 17 positions. These large sequence differences reflect the distant taxonomic relationships between the goose-beaked whale and the other species of Cetacea the myoglobin sequences of which have previously been determined.

  7. Pattern of acid base abnormalities in critically ill patinets

    International Nuclear Information System (INIS)

    To find out the pattern of acid base abnormalities in critically ill patients in a tertiary care health facility. Study Design: A descriptive study. Place and Duration of Study: The study was carried out in the department of pathology, Combined Military Hospital Kharian from January 2013 to June 2013. Patients and Methods: Two hundred and fifty patients suffering from various diseases and presenting with exacerbation of their clinical conditions were studied. These patients were hospitalized and managed in acute care units of the hospital. Arterial blood gases were analysed to detect acid base status and their correlation with their clinical condition. Concomitant analysis of electrolytes was carried out. Tests related to concurrent illnesses e.g. renal and liver function tests, cardiac enzymes and plasma glucose were assayed by routine end point and kinetic methods. Standard reference materials were used to ensure internal quantify control of analyses. Results: Two hundred and fifteen patients out of 250 studied suffered from acid base disorders. Gender distribution showed a higher percentage of male patients and the mean age was 70.5 ± 17.4 years. Double acid base disorders were the commonest disorders (34%) followed by metabolic acidosis (30%). Anion gap was calculated to further stratify metabolic acidosis and cases of diabetic ketoacidosis were the commonest in this category (47%). Other simple acid base disorders were relatively less frequent. Delta bicarbonate was calculated to unmask the superimposition of respiratory alkalosis or acidosis with metabolic acidosis and metabolic alkalosis. Though triple acid base disorders were noted in a small percentage of cases (05%), but were found to be the most complicated and challenging. Mixed acid base disorders were associated with high mortality. Conclusion: A large number of critically ill patients manifested acid base abnormalities over the full spectrum of these disorders. Mixed acid base disorders were

  8. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ID ID of a sequence Cluster ID ID of cluster. gclust_cluster is referenced. Annotation in original... database Annotation at the original website Species Species name Length Amino acid sequen

  9. CAPS satellite spread spectrum communication blind multi-user detecting system based on chaotic sequences

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Multiple Path Interference (MPI) and Multiple Access Interference (MAI) are important factors that affect the performance of Chinese Area Positioning System (CAPS). These problems can be solved by using spreading sequences with ideal properties and multi-user detectors. Chaotic sequences based on Chebyshev map are studied and the satellite communication system model is set up to investigate the application of chaotic sequences for CAPS in this paper. Simulation results show that chaotic sequences have desirable correlation properties and it is easy to generate a large number of chaotic sequences with good security. It has great practical value to apply chaotic sequences to CAPS together with multi-user detecting technology and the system performance can be improved greatly.

  10. High-throughput-sequencing-based identification of a grapevine fanleaf virus satellite RNA in Vitis vinifera.

    Science.gov (United States)

    Chiumenti, Michela; Mohorianu, Irina; Roseti, Vincenzo; Saldarelli, Pasquale; Dalmay, Tamas; Minafra, Angelantonio

    2016-05-01

    A new satellite RNA (satRNA) of grapevine fanleaf virus (GFLV) was identified by high-throughput sequencing of high-definition (HD) adapter libraries from grapevine plants of the cultivar Panse precoce (PPE) affected by enation disease. The complete nucleotide sequence was obtained by automatic sequencing using primers designed based on next-generation sequencing (NGS) data. The full-length sequence, named satGFLV-PPE, consisted of 1119 nucleotides with a single open reading frame from position 15 to 1034. This satRNA showed maximum nucleotide sequence identity of 87 % to satArMV-86 and satGFLV-R6. Symptomatic grapevines were surveyed for the presence of the satRNA, and no correlation was found between detection of the satRNA and enation symptom expression. PMID:26873812

  11. CAPS satellite spread spectrum communication blind multi-user detecting system based on chaotic sequences

    Institute of Scientific and Technical Information of China (English)

    LEI LiHua; SHI HuLi; MA GuanYi

    2009-01-01

    Multiple Path Interference (MPI) and Multiple Access Interference (MAI) are Important factors that affect the performance of Chinese Area Positioning System (CAPS),These problems can be solved by using spreading sequences with ideal properties and multi-user detectors.Chaotic sequences based on Chebyshev map are studied and the satellite communication system model is set up to investigate the application of chaotic sequences for CAPS in this paper,Simulation results show that chaotic sequences have desirable correlation properties and it is easy to generate a large number of chaotic sequences with good security.It has great practical value to apply chaotic sequences to CAPS together with multi-user detecting technology and the system performance can be improved greatly.

  12. Phylogeny of Sym plasmids of rhizobia by PCR-based sequencing of a nodC segment.

    OpenAIRE

    Ueda, T.; Suga, Y; Yahiro, N; Matsuguchi, T

    1995-01-01

    To understand the host specificity of rhizobia and the relationship between the evolution of Sym plasmids and that of host plants, we determined partial nodC sequences of 10 representative rhizobium strains and then constructed an evolutionary tree for the deduced amino acid sequences with four published sequences. These coding sequences yield a phylogenetic tree similar to that for leghemoglobin of host plants, suggesting that the evolution of common nodulation genes may be linked to host le...

  13. Isolation and amino acid sequence of crustacean hyperglycemic hormone precursor-related peptides.

    Science.gov (United States)

    Tensen, C P; Verhoeven, A H; Gaus, G; Janssen, K P; Keller, R; Van Herp, F

    1991-01-01

    The crustacean hyperglycemic hormone (CHH) is synthesized as part of a larger preprohormone in which the sequence of CHH is N-terminally flanked by a peptide for which the name CPRP (CHH precursor-related peptide) is proposed. Both CHH and CPRP are present in the sinus gland, the neurohemal organ of neurosecretory cells located in the eyestalk of decapod crustaceans. This paper describes the isolation and sequence analysis of CPRPs isolated from sinus glands of the crab Carcinus maenas, the crayfish Orconectes limosus and the lobster Homarus americanus. The published sequence of "peptide H" isolated from the land crab, Cardisoma carnifex, has now been recognized as a CPRP in this species. Sequence comparison reveals a high level of identity for the N-terminal region (residues 1-13) between all four peptides, while identity in the C-terminal domain is high between lobster and crayfish CPRP on the one hand, and between both crab species on the other. Conserved N-terminal residues include a putative monobasic processing site at position 11, which suggests that CPRP may be a biosynthetic intermediate from which a potentially bioactive decapeptide can be derived.

  14. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer

    Science.gov (United States)

    Zambanini, Thiemo; Buescher, Joerg M.; Meurer, Guido; Blank, Lars M.

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  15. Amino Acid Sequence Requirements of the Transmembrane and Cytoplasmic Domains of Influenza Virus Hemagglutinin for Viable Membrane Fusion

    OpenAIRE

    Melikyan, Grigory B.; Lin, Sasa; Roth, Michael G.; Cohen, Fredric S.

    1999-01-01

    The amino acid sequence requirements of the transmembrane (TM) domain and cytoplasmic tail (CT) of the hemagglutinin (HA) of influenza virus in membrane fusion have been investigated. Fusion properties of wild-type HA were compared with those of chimeras consisting of the ectodomain of HA and the TM domain and/or CT of polyimmunoglobulin receptor, a nonviral integral membrane protein. The presence of a CT was not required for fusion. But when a TM domain and CT were present, fusion activity w...

  16. A comparison of single molecule and amplification based sequencing of cancer transcriptomes.

    Directory of Open Access Journals (Sweden)

    Lee T Sam

    Full Text Available The second wave of next generation sequencing technologies, referred to as single-molecule sequencing (SMS, carries the promise of profiling samples directly without employing polymerase chain reaction steps used by amplification-based sequencing (AS methods. To examine the merits of both technologies, we examine mRNA sequencing results from single-molecule and amplification-based sequencing in a set of human cancer cell lines and tissues. We observe a characteristic coverage bias towards high abundance transcripts in amplification-based sequencing. A larger fraction of AS reads cover highly expressed genes, such as those associated with translational processes and housekeeping genes, resulting in relatively lower coverage of genes at low and mid-level abundance. In contrast, the coverage of high abundance transcripts plateaus off using SMS. Consequently, SMS is able to sequence lower- abundance transcripts more thoroughly, including some that are undetected by AS methods; however, these include many more mapping artifacts. A better understanding of the technical and analytical factors introducing platform specific biases in high throughput transcriptome sequencing applications will be critical in cross platform meta-analytic studies.

  17. Control allocation and management of redundant control effectors based on bases sequenced optimal method

    Institute of Scientific and Technical Information of China (English)

    2010-01-01

    For an advanced aircraft, the amount of its effectors is much more than that for a traditional one, the functions of effectors are more complex and the coupling between each other is more severe. Based on the current control allocation research, this paper puts forward the concept and framework of the control allocation and management system for aircrafts with redundancy con-trol effectors. A new optimal control allocation method, bases sequenced optimal (BSO) method, is then presented. By analyz-ing the physical meaning of the allocation process of BSO method, four types of management strategies are adopted by the system, which act on the control allocation process under different flight conditions, mission requirements and effectors work-ing conditions. Simulation results show that functions of the control allocation system are extended and the system adaptability to flight status, mission requirements and effector failure conditions is improved.

  18. Genomic Variance Estimation Based on Genotyping-by-Sequencing with Different Coverage in Perennial Ryegrass

    DEFF Research Database (Denmark)

    Ashraf, Bilal; Fé, Dario; Jensen, Just;

    2014-01-01

    Advancement in next generation sequencing (NGS) technologies has significantly decreased the cost of DNA sequencing enabling increased use of genotyping by sequencing (GBS) in several plant species. In contrast to array-based genotyping GBS also allows for easy estimation of allele frequencies...... at each SNP in family pools or polyploids. There are, however, several statistical challenges associated with this method, including low sequencing depth and missing values. Low sequencing depth results in inaccuracies in estimates of allele frequencies for each SNP. In this work we have focused...... on optimizing methods and models utilizing F2 family phenotype records and NGS information from F2 family pools in perennial ryegrass. Genomic variance was estimated using genomic relationship matrices based on different coverage depths to verify effects of coverage depth. Example traits were seed yield, rust...

  19. On Properties of Update Sequences Based on Causal Rejection

    OpenAIRE

    Eiter, T.; Fink, M; Sabbatini, G; Tompits, H.

    2001-01-01

    We consider an approach to update nonmonotonic knowledge bases represented as extended logic programs under answer set semantics. New information is incorporated into the current knowledge base subject to a causal rejection principle enforcing that, in case of conflicts, more recent rules are preferred and older rules are overridden. Such a rejection principle is also exploited in other approaches to update logic programs, e.g., in dynamic logic programming by Alferes et al. We give a thoroug...

  20. Evaluation of Hybridization Capture Versus Amplicon‐Based Methods for Whole‐Exome Sequencing

    Science.gov (United States)

    Samorodnitsky, Eric; Jewell, Benjamin M.; Hagopian, Raffi; Miya, Jharna; Wing, Michele R.; Lyon, Ezra; Damodaran, Senthilkumar; Bhatt, Darshna; Reeser, Julie W.; Datta, Jharna

    2015-01-01

    ABSTRACT Next‐generation sequencing has aided characterization of genomic variation. While whole‐genome sequencing may capture all possible mutations, whole‐exome sequencing remains cost‐effective and captures most phenotype‐altering mutations. Initial strategies for exome enrichment utilized a hybridization‐based capture approach. Recently, amplicon‐based methods were designed to simplify preparation and utilize smaller DNA inputs. We evaluated two hybridization capture‐based and two amplicon‐based whole‐exome sequencing approaches, utilizing both Illumina and Ion Torrent sequencers, comparing on‐target alignment, uniformity, and variant calling. While the amplicon methods had higher on‐target rates, the hybridization capture‐based approaches demonstrated better uniformity. All methods identified many of the same single‐nucleotide variants, but each amplicon‐based method missed variants detected by the other three methods and reported additional variants discordant with all three other technologies. Many of these potential false positives or negatives appear to result from limited coverage, low variant frequency, vicinity to read starts/ends, or the need for platform‐specific variant calling algorithms. All methods demonstrated effective copy‐number variant calling when evaluated against a single‐nucleotide polymorphism array. This study illustrates some differences between whole‐exome sequencing approaches, highlights the need for selecting appropriate variant calling based on capture method, and will aid laboratories in selecting their preferred approach. PMID:26110913

  1. Evaluation of Hybridization Capture Versus Amplicon-Based Methods for Whole-Exome Sequencing.

    Science.gov (United States)

    Samorodnitsky, Eric; Jewell, Benjamin M; Hagopian, Raffi; Miya, Jharna; Wing, Michele R; Lyon, Ezra; Damodaran, Senthilkumar; Bhatt, Darshna; Reeser, Julie W; Datta, Jharna; Roychowdhury, Sameek

    2015-09-01

    Next-generation sequencing has aided characterization of genomic variation. While whole-genome sequencing may capture all possible mutations, whole-exome sequencing remains cost-effective and captures most phenotype-altering mutations. Initial strategies for exome enrichment utilized a hybridization-based capture approach. Recently, amplicon-based methods were designed to simplify preparation and utilize smaller DNA inputs. We evaluated two hybridization capture-based and two amplicon-based whole-exome sequencing approaches, utilizing both Illumina and Ion Torrent sequencers, comparing on-target alignment, uniformity, and variant calling. While the amplicon methods had higher on-target rates, the hybridization capture-based approaches demonstrated better uniformity. All methods identified many of the same single-nucleotide variants, but each amplicon-based method missed variants detected by the other three methods and reported additional variants discordant with all three other technologies. Many of these potential false positives or negatives appear to result from limited coverage, low variant frequency, vicinity to read starts/ends, or the need for platform-specific variant calling algorithms. All methods demonstrated effective copy-number variant calling when evaluated against a single-nucleotide polymorphism array. This study illustrates some differences between whole-exome sequencing approaches, highlights the need for selecting appropriate variant calling based on capture method, and will aid laboratories in selecting their preferred approach. PMID:26110913

  2. Base J glucosyltransferase does not regulate the sequence specificity of J synthesis in trypanosomatid telomeric DNA.

    Science.gov (United States)

    Bullard, Whitney; Cliffe, Laura; Wang, Pengcheng; Wang, Yinsheng; Sabatini, Robert

    2015-12-01

    Telomeric DNA of trypanosomatids possesses a modified thymine base, called base J, that is synthesized in a two-step process; the base is hydroxylated by a thymidine hydroxylase forming hydroxymethyluracil (hmU) and a glucose moiety is then attached by the J-associated glucosyltransferase (JGT). To examine the importance of JGT in modifiying specific thymine in DNA, we used a Leishmania episome system to demonstrate that the telomeric repeat (GGGTTA) stimulates J synthesis in vivo while mutant telomeric sequences (GGGTTT, GGGATT, and GGGAAA) do not. Utilizing an in vitro GT assay we find that JGT can glycosylate hmU within any sequence with no significant change in Km or kcat, even mutant telomeric sequences that are unable to be J-modified in vivo. The data suggests that JGT possesses no DNA sequence specificity in vitro, lending support to the hypothesis that the specificity of base J synthesis is not at the level of the JGT reaction.

  3. Group Graded Associated Ideals with Flat Base Change of Rings and Short Exact Sequences

    Indian Academy of Sciences (India)

    Srinivas Behara; Shiv Datt Kumar

    2011-05-01

    This paper deals with the study of behaviour of -associated ideals and strong Krull -associated ideals with flat base change of rings and behaviour of -associated ideals with short exact sequences over rings graded by finitely generated abelian group .

  4. Design and Evaluation of a Research-Based Teaching Sequence: The Superposition of Electric Field.

    Science.gov (United States)

    Viennot, L.; Rainson, S.

    1999-01-01

    Illustrates an approach to research-based teaching strategies and their evaluation. Addresses a teaching sequence on the superposition of electric fields implemented at the college level in an institutional framework subject to severe constraints. Contains 28 references. (DDR)

  5. Base J glucosyltransferase does not regulate the sequence specificity of J synthesis in trypanosomatid telomeric DNA.

    Science.gov (United States)

    Bullard, Whitney; Cliffe, Laura; Wang, Pengcheng; Wang, Yinsheng; Sabatini, Robert

    2015-12-01

    Telomeric DNA of trypanosomatids possesses a modified thymine base, called base J, that is synthesized in a two-step process; the base is hydroxylated by a thymidine hydroxylase forming hydroxymethyluracil (hmU) and a glucose moiety is then attached by the J-associated glucosyltransferase (JGT). To examine the importance of JGT in modifiying specific thymine in DNA, we used a Leishmania episome system to demonstrate that the telomeric repeat (GGGTTA) stimulates J synthesis in vivo while mutant telomeric sequences (GGGTTT, GGGATT, and GGGAAA) do not. Utilizing an in vitro GT assay we find that JGT can glycosylate hmU within any sequence with no significant change in Km or kcat, even mutant telomeric sequences that are unable to be J-modified in vivo. The data suggests that JGT possesses no DNA sequence specificity in vitro, lending support to the hypothesis that the specificity of base J synthesis is not at the level of the JGT reaction. PMID:26815240

  6. Prediction of the Occurrence of the ADP-binding βαβ-fold in Proteins, Using an Amino Acid Sequence Fingerprint

    NARCIS (Netherlands)

    Wierenga, Rik K.; Terpstra, Peter; Hol, Wim G.J.

    1986-01-01

    An amino acid sequence "fingerprint” has been derived that can be used to test if a particular sequence will fold into a βαβ-unit with ADP-binding properties. It was deduced from a careful analysis of the known three-dimensional structures of ADP-binding βαβ-folds. This fingerprint is in fact a set

  7. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Rhee, Mun Su [University of Florida, Gainesville; Moritz, Brelan E. [University of Florida, Gainesville; Xie, Gary [Los Alamos National Laboratory (LANL); Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Chertkov, Olga [Los Alamos National Laboratory (LANL); Brettin, Thomas S [ORNL; Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Patel, Milind [University of Florida, Gainesville; Ou, Mark [University of Florida, Gainesville; Harbrucker, Roberta [University of Florida, Gainesville; Ingram, Lonnie O. [University of Florida; Shanmugam, Keelnathan T. [University of Florida

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer- ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this spo- rogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attrac- tive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi- cellulose. This bacterium is also considered as a potential probiotic. Complete genome se- quence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  8. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Energy Technology Data Exchange (ETDEWEB)

    Xie, Gary [Los Alamos National Laboratory (LANL); Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer-ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi-cellulose. This bacterium is also considered as a potential probiotic. Complete genome squence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  9. Acid-base disturbance in patients with cirrhosis

    DEFF Research Database (Denmark)

    Henriksen, Jens H; Bendtsen, Flemming; Møller, Søren

    2015-01-01

    PURPOSE: Acid-base disturbances were investigated in patients with cirrhosis in relation to hemodynamic derangement to analyze the hyperventilatory effects and the metabolic compensation. METHODS: A total of 66 patients with cirrhosis and 44 controls were investigated during a hemodynamic study. ...... alterations in serum albumin and water retention that may result in a delicate acid-base balance in these patients.......PURPOSE: Acid-base disturbances were investigated in patients with cirrhosis in relation to hemodynamic derangement to analyze the hyperventilatory effects and the metabolic compensation. METHODS: A total of 66 patients with cirrhosis and 44 controls were investigated during a hemodynamic study......, and effects of unidentified ions (all Pacid-base disturbances could not be identified. CONCLUSION: Hypocapnic alkalosis is related to disease severity and hyperdynamic systemic circulation in patients with cirrhosis. The metabolic compensation includes...

  10. Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns

    OpenAIRE

    Comin, Matteo; Schimd, Michele

    2014-01-01

    Background With the advent of Next-Generation Sequencing technologies (NGS), a large amount of short read data has been generated. If a reference genome is not available, the assembly of a template sequence is usually challenging because of repeats and the short length of reads. When NGS reads cannot be mapped onto a reference genome alignment-based methods are not applicable. However it is still possible to study the evolutionary relationship of unassembled genomes based on NGS data. Results...

  11. The coulometric titration of acids and bases in dimethylsulfoxide media

    NARCIS (Netherlands)

    Bos, M.; IJpma, S.T.; Dahmen, E.A.M.F.

    1976-01-01

    The coulometric titration of 20–200 μeq of acids and bases in DMSO media is described. In the titration of bases, the electro-oxidation of hydrogen at a platinized platinum electrode is used as the source of protons. The conditions for 100 % current efficiency at this electrode are low current densi

  12. A European Acid Rain Program based on the US experience

    DEFF Research Database (Denmark)

    Brandt, U. Steiner; Svendsen, Gert Tinggaard

    2000-01-01

    The paper shows that cost-effective involvement of the source location involves utmost difficulty in practice. Based on the RAINS model, it is recommended that source location should be ignored in a European market for SO2, as is the case in the US Acid Rain Program. Based on the political target...

  13. A new approach based on PSO algorithm to find good computational encoding sequences

    Institute of Scientific and Technical Information of China (English)

    Cui Guangzhao; Niu Yunyun; Wang Yanfeng; Zhang Xuncai; Pan Linqiang

    2007-01-01

    Computational encoding DNA sequence design is one of the most important steps in molecular computation. A lot of research work has been done to design reliable sequence library. A revised method based on the support system developed by Tanaka et al.is proposed here with different criteria to construct fitness function. Then we adapt particle swarm optimization (PSO) algorithm to our encoding problem. By using the new algorithm, a set of sequences with good quality is generated. The result also shows that our PSO- based approach could rapidly converge at the minimum level for an output of the simulation model. The celerity of the algorithm fits our requirements.

  14. Metal-based chemosensors for amino acids, peptides, and nucleotides

    OpenAIRE

    Buryak, Andrey

    2007-01-01

    An organometallic 4d transition metal complex [Cp*RhCl2]2, together with commercially available dyes, was used to construct indicator displacement assays (IDAs) for the detection of peptides, amino acids, and nucleotides. The combination of the Cp*Rh complex with the dye azophloxine was found to form a chemosensing ensemble for the sequence-selective detection of histidine- and methionine-containing peptides in water at neutral pH. A strong interaction of the rhodium complex with peptides bea...

  15. Metal-based chemosensors for amino acids, peptides, and nucleotides

    OpenAIRE

    Buryak, Andrey; Severin, Kay

    2008-01-01

    An organometallic 4d transition metal complex [Cp*RhCl2]2, together with commercially available dyes, was used to construct indicator displacement assays (IDAs) for the detection of peptides, amino acids, and nucleotides. The combination of the Cp*Rh complex with the dye azophloxine was found to form a chemosensing ensemble for the sequence-selective detection of histidine- and methionine-containing peptides in water at neutral pH. A strong interaction of the rhodium complex with peptides bea...

  16. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    OpenAIRE

    Selvaraj S; Jayaram B; Saranya N; Gromiha M; Fukui Kazuhiko

    2011-01-01

    Abstract Background Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such...

  17. [Comparison between Astragalus membranaceus var. mongholicus and Hedysarum polybotrys based on ITS sequences and metabolomics].

    Science.gov (United States)

    Jiao, Mei-li; Li, Zhen-yu; Zhang, Fu-sheng; Qin, Xue-mei

    2015-12-01

    Astragalus membranaceus var. mongholicus and Hedysarum polybotrys belong to different genera, but have similar drug efficacy in traditional Chinese medicine theory, and H. polybotrys was used as the legal A. membranaceus var. mongholicus previously. In this study, similarities and differences between them were analyzed via their ITS/ITS2 fragments information. The ITS (internal transcribed spacer) regions were amplified using polymerase chain reaction and then sequenced in two-way. The alignment lengths of ITS regions were 616 bp, in which 508 loci were consistent, and 103 loci were different, accounting for 82.47% and 16.72% of the total ITS nucleotides in length, respectively. As genotype determines phenotype, 1HNMR-based metabolomic approach was further used to reveal the chemical similarities and differences between them. Thirty-four metabolites were identified in the 1H NMR spectra, and twenty-seven metabolites were the common components. Amino acids, carbohydrates and other primary metabolites were similar, while a large difference existed in the flavonoids and astragalosides. This study suggests that A. membranaceus var. mongholicus and H. polybotrys show similarities and differences from molecular and chemical perspectives, which has laid a foundation for elucidating the effective material basis of drug with similar efficacy and resources utilization. PMID:27169287

  18. Expanding the diversity of oenococcal bacteriophages: insights into a novel group based on the integrase sequence.

    Science.gov (United States)

    Jaomanjaka, Fety; Ballestra, Patricia; Dols-lafargue, Marguerite; Le Marrec, Claire

    2013-09-01

    Temperate bacteriophages are a contributor of the genetic diversity in the lactic acid bacterium Oenococcus oeni. We used a classification scheme for oenococcal prophages based on integrase gene polymorphism, to analyze a collection of Oenococcus strains mostly isolated in the area of Bordeaux, which represented the major lineages identified through MLST schemes in the species. Genome sequences of oenococcal prophages were clustered into four integrase groups (A to D) which were related to the chromosomal integration site. The prevalence of each group was determined and we could show that members of the intB- and intC-prophage groups were rare in our panel of strains. Our study focused on the so far uncharacterized members of the intD-group. Various intD viruses could be easily isolated from wine samples, while intD lysogens could be induced to produce phages active against two permissive O. oeni isolates. These data support the role of this prophage group in the biology of O. oeni. Global alignment of three relevant intD-prophages revealed significant conservation and highlighted a number of unique ORFs that may contribute to phage and lysogen fitness.

  19. Sequence-specific inhibition of duck hepatitis B virus reverse transcription by peptide nucleic acids (PNA)

    DEFF Research Database (Denmark)

    Robaczewska, Magdalena; Narayan, Ramamurthy; Seigneres, Beatrice;

    2005-01-01

    BACKGROUND/AIMS: Peptide nucleic acids (PNAs) appear as promising new antisense agents, that have not yet been examined as hepatitis B virus (HBV) inhibitors. Our aim was to study the ability of PNAs targeting the duck HBV (DHBV) encapsidation signal epsilon to inhibit reverse transcription (RT...

  20. Amino acid sequence of the cold-active alkaline phosphatase from Atlantic cod (Gadus morhua)

    DEFF Research Database (Denmark)

    Asgeirsson, Bjarni; Nielsen, Berit Noesgaard; Højrup, Peter

    2003-01-01

    -linked glycosylation sites were found. The glycan structure was determined as complex biantennary in type with fucose and sialic acid attached, although a trace of complex tri-antennary structure was also observed. A three-dimensional model was obtained by homology modelling using the human placental AP scaffold. Cod...

  1. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry.

    Science.gov (United States)

    Carter, Charles W; Wolfenden, Richard

    2016-01-01

    The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.

  2. Deoxyribonucleic acid base compositions of dermatophytes.

    Science.gov (United States)

    Davison, F D; Mackenzie, D W; Owen, R J

    1980-06-01

    DNA was extracted and purified from 55 dermatophyte isolates representing 34 species of Trichophyton, Microsporum and Epidermophyton. The base compositions of the chromosomal DNA were determined by CsCl density gradient centrifugation and were found to be in the narrow range of 48.7 to 50.3 mol % G + C. A satellite DNA component assumed to be of mitochondrial origin was present in most strains, with a G + C content ranging from 14.7 to 30.8 mol % G + C. Heterogeneity in microscopic and colonial characteristics was not reflected in differences in the mean G + C content of the chromosomal DNAs. Strains varied in the G + C contents of satelite DNA, but these did not correlate with traditional species concepts.

  3. Complete Genome Sequence of Moraxella osloensis Strain KMC41, a Producer of 4-Methyl-3-Hexenoic Acid, a Major Malodor Compound in Laundry.

    Science.gov (United States)

    Goto, Takatsugu; Hirakawa, Hideki; Morita, Yuji; Tomida, Junko; Sato, Jun; Matsumura, Yuta; Mitani, Asako; Niwano, Yu; Takeuchi, Kohei; Kubota, Hiromi; Kawamura, Yoshiaki

    2016-01-01

    We report the complete genome sequence of Moraxella osloensis strain KMC41, isolated from laundry with malodor. The KMC41 genome comprises a 2,445,556-bp chromosome and three plasmids. A fatty acid desaturase and at least four β-oxidation-related genes putatively associated with 4-methyl-3-hexenoic acid generation were detected in the KMC41 chromosome. PMID:27445387

  4. Identification of a novel HMW glutenin subunit and comparison of its amino acid sequence with those of homologous subunits

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Aegilops tauschii is the donor of the D genome of common wheat (Triticum aestivum). Genetic variation of HMW glutenin subunits encoded by the Glu-1Dt locus of Ae. tauschii has been found to be higher than that specified by the Glu-1D locus in common wheat. In the present note, we report the identification of a novel HMW glutenin subunit, Dy13t, from Ae. tauschii. The newly identified subunit possessed an electrophoretic mobility that was faster than that of the Dy12 subunit of common wheat. The complete ORF of encoding the Dy13t subunit contained 624 codons (excluding the stop codons). The amino acid sequence deduced from the Dy13t gene ORF was the shortest among those of the previously reported subunits derived by the D genome. A further comparison of Dy13t amino acid sequence with those of the subunits characterized from the A, B, D, R genomes of Triticeae showed that the smaller size of the Dy13t subunit was associated with a reduction in the size of its repetitive domain.

  5. Rapid Conversion of Traditional Introductory Physics Sequences to an Activity-Based Format

    Science.gov (United States)

    Yoder, Garett; Cook, Jerry

    2014-01-01

    The Department of Physics at EKU [Eastern Kentucky University] with support from the National Science Foundations Course Curriculum and Laboratory Improvement Program has successfully converted our entire introductory physics sequence, both algebra-based and calculus-based courses, to an activity-based format where laboratory activities,…

  6. GrabCut-Based Human Segmentation in Video Sequences

    Directory of Open Access Journals (Sweden)

    Sergio Escalera

    2012-11-01

    Full Text Available In this paper, we present a fully-automatic Spatio-Temporal GrabCut human segmentation methodology that combines tracking and segmentation. GrabCut initialization is performed by a HOG-based subject detection, face detection, and skin color model. Spatial information is included by Mean Shift clustering whereas temporal coherence is considered by the historical of Gaussian Mixture Models. Moreover, full face and pose recovery is obtained by combining human segmentation with Active Appearance Models and Conditional Random Fields. Results over public datasets and in a new Human Limb dataset show a robust segmentation and recovery of both face and pose using the presented methodology.

  7. An algorithm for computing nucleic acid base-pairing probabilities including pseudoknots.

    Science.gov (United States)

    Dirks, Robert M; Pierce, Niles A

    2004-07-30

    Given a nucleic acid sequence, a recent algorithm allows the calculation of the partition function over secondary structure space including a class of physically relevant pseudoknots. Here, we present a method for computing base-pairing probabilities starting from the output of this partition function algorithm. The approach relies on the calculation of recursion probabilities that are computed by backtracking through the partition function algorithm, applying a particular transformation at each step. This transformation is applicable to any partition function algorithm that follows the same basic dynamic programming paradigm. Base-pairing probabilities are useful for analyzing the equilibrium ensemble properties of natural and engineered nucleic acids, as demonstrated for a human telomerase RNA and a synthetic DNA nanostructure. PMID:15139042

  8. Lactic acid bacteria fermentations in oat-based suspensions

    OpenAIRE

    Mårtensson, Olof

    2002-01-01

    This thesis deals with the fermentation characteristics of lactic acid bacteria (LAB) in oat-based suspensions, with formulation work of fermented products based on oat and with nutritional studies of these products. Changes in structure in terms of viscosity and ropiness were studied when exopolysaccharide (EPS)-producing LAB strains, namely, Lactobacillus delbrueckii subsp. bulgaricus NCFB 2772, Lactobacillus brevis G-77 and Pediococcus damnosus 2.6 were grown in these oat-based suspensions...

  9. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    OpenAIRE

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young; Lee, Sang Yup

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kb...

  10. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    Directory of Open Access Journals (Sweden)

    Kathleen S. Rein

    2008-05-01

    Full Text Available Okadaic acid (OA and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum.

  11. Complete genome sequence of probiotic Bacillus coagulans HM-08: A potential lactic acid producer.

    Science.gov (United States)

    Yao, Guoqiang; Gao, Pengfei; Zhang, Wenyi

    2016-06-20

    Bacillus coagulans HM-08 is a commercialized probiotic strain in China. Its genome contains a 3.62Mb circular chromosome with an average GC content of 46.3%. In silico analysis revealed the presence of one xyl operon as well as several other genes that are correlated to xylose utilization. The genetic information provided here may help to expand its future biotechnology potential in lactic acid production. PMID:27130497

  12. Carbon Dot Based Sensing of Dopamine and Ascorbic Acid

    Directory of Open Access Journals (Sweden)

    Upama Baruah

    2014-01-01

    Full Text Available We demonstrate carbon dot based sensor of catecholamine, namely, dopamine and ascorbic acid. Carbon dots (CDs were prepared from a green source: commercially available Assam tea. The carbon dots prepared from tea had particle sizes of ∼0.8 nm and are fluorescent. Fluorescence of the carbon dots was found to be quenched in the presence of dopamine and ascorbic acid with greater sensitivity for dopamine. The minimum detectable limits were determined to be 33 μM and 98 μM for dopamine and ascorbic acid, respectively. The quenching constants determined from Stern-Volmer plot were determined to be 5 × 10−4 and 1 × 10−4 for dopamine and ascorbic acid, respectively. A probable mechanism of quenching has been discussed in the paper.

  13. A new EPR dosimeter based on sulfanilic acid

    Energy Technology Data Exchange (ETDEWEB)

    Maghraby, A. [Radiation Dosimetry Department, National Institute for Standards (NIS), Ministry of Scientific Research, Haram, Giza, Tersa Street, P.O. Box 136 (Egypt)]. E-mail: maghrabism@yahoo.com; Tarek, E. [Physics Department, Faculty of Science, Fayoum University, Fayoum (Egypt)

    2006-02-15

    A new radiation dosimeter based on sulfanilic acid was suggested and tested. Radiation-induced radical EPR spectrum was recorded and characterized. EPR spectrum of sulfanilic acid is characterized by its simple single signal, which is located at g-factor=2.0053+/-0.00017. Its microwave absorption curve and effect of modulation amplitude on signal intensity and line width were studied. The radiation-induced radical was found to be stable after about 24h following irradiation within +/-1.2%. Energy dependence curves of sulfanilic acid compared to soft tissue and alanine were plotted. Sulfanilic acid is characterized also by its isotropic behavior and its calibration curve in {sup 137}Cs gamma radiation is plotted compared to that of alanine. Its sensitivity is about 85% of alanine, while its detection limit is about 100+/-30mGy.

  14. Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains

    Directory of Open Access Journals (Sweden)

    Branislava Gemovic

    2013-01-01

    Full Text Available There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM, a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.

  15. First-principles study of physisorption of nucleic acid bases on small-diameter carbon nanotubes

    International Nuclear Information System (INIS)

    We report the results of our first-principles study based on density functional theory on the interaction of the nucleic acid base molecules adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U), with a single-walled carbon nanotube (CNT). Specifically, the focus is on the physisorption of base molecules on the outer wall of a (5, 0) metallic CNT possessing one of the smallest diameters possible. Compared to the case for CNTs with large diameters, the physisorption energy is found to be reduced in the high-curvature case. The base molecules exhibit significantly different interaction strengths and the calculated binding energies follow the hierarchy G>A>T>C>U, which appears to be independent of the tube curvature. The stabilizing factor in the interaction between the base molecule and CNT is dominated by the molecular polarizability that allows a weakly attractive dispersion force to be induced between them. The present study provides an improved understanding of the role of the base sequence in deoxyribonucleic acid (DNA) or ribonucleic acid (RNA) in their interactions with carbon nanotubes of varying diameters

  16. MRI-Based Thermometry for Tumor Thermal Ablation: A Comparison of Different MR Sequences

    Directory of Open Access Journals (Sweden)

    T. J. Vogl

    2010-05-01

    Full Text Available Background/Objective: To evaluate T1 and PRF thermometry methods utilizing fast MR sequences and fluoroptic thermometer."nMaterials and Methods: The MR-guided LITT (Laser-Induced Interstitial Thermotherapy with a laser wavelength/power of 1064nm/30W was applied to pig liver and a gel phantom. During the ablation process, the temperature was measured using a fluoroptic thermometer and MR imaging was performed applying a 1.5-Tesla tomograph with an EPI (Echo Planar Imaging sequence for PRF (Proton Resonance Frequency method and FLASH, IRTF, SRTF and TRUFI sequences for T1 method. Plotting MR signal intensity against measured temperature determined the temperature constant for each of the T1 sequences. To determine the PRF temperature constant, phase values were recorded from phase images and then plotted against temperature. The PRF temperature constant was verified comparing the MR temperature with the measured one obtained from a second LITT experiment on gel phantom."nResults: The experiments determining the temperature constant for T1 method showed that the IRTF and FLASH sequences have the highest temperature sensitivity and the most linear relationship between MR signal intensity and measured temperature. SRTF sequence presented relatively good linearity but inferior temperature sensitivity compared to IRTF and FLASH sequences. Conversely, TRUFI sequence exhibited the lowest temperature sensitivity and linearity of data points. Concerning the PRF method, the measured and the MR-based temperatures agreed up to approximately 70 C."nConclusion: To demonstrate and control temperature in target tissue during the LITT process, the PRF method with an EPI sequence is preferred for temperatures below 70 C due to its acceptable accuracy. Among the T1 sequences, FLASH is preferable as the most robust, though not the most accurate T1 sequence.

  17. Park-based and zero sequence-based relaying techniques with application to transformers protection

    Energy Technology Data Exchange (ETDEWEB)

    Diaz, G.; Arboleya, P.; Gomez-Aleixandre, J. [University of Oviedo (Spain). Dept. of Electrical Engineering

    2004-09-01

    Two relaying techniques for protecting power transformers are presented and discussed. Very often, differential relaying is used for this purpose. A comparison between the two proposed techniques and conventional differential relaying is thus presented. The first technique, based on the measurements of zero sequence current within a delta winding, performs best in multiwinding transformers, since only measurement of the coil currents is needed. Thus, great simplicity is achieved. The second one is based on the differential procedure, but its analysis of asymmetries in the plot in Park's plane avoids problems related to spectral analysis in conventional differential relaying. The technique is justified from the analysis of symmetrical components. Misoperation in conventional differential relaying has been observed in some cases as a function of switching instant and fault location. This issue is discussed in the paper, and a statistical analysis of a large number of laboratory tests, in which both factors were controlled, is presented. As a conclusion, both relaying techniques proposed succeed in protecting the transformer. Additionally, the Park-based relay exhibits three characteristics of most importance: fastest performance, robustness and simplicity in its formulation. (author)

  18. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    Science.gov (United States)

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  19. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    Science.gov (United States)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids and evolutionary trees will be derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins; ferredoxin and cytochrome c.

  20. Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

    Directory of Open Access Journals (Sweden)

    Zijian Li

    2014-02-01

    Full Text Available Amino acid substitutions in influenza A virus are the main reasons for both antigenic shift and virulence change, which result from non-synonymous mutations in the viral genome. Nucleocapsid protein (NP, one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digestion of nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1, which was isolated and purified by SDS poly-acrylamide gel electrophoresis. Thus, LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues R452K, T423A and N430T in two tryptic peptides. The obtained results provided experimental evidence that amino acid substitutions resulted from non-synonymous gene mutations could be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.

  1. Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics.

    Science.gov (United States)

    Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

    2016-06-17

    With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy.

  2. [Identification of plantaginis semen based on ITS2 and psbA-trnH sequences].

    Science.gov (United States)

    Song, Ming; Zhang, Ya-Qin; Lin, Yun-Han; Tu, Yuan; Ma, Xiao-Xi; Sun, Wei; Xiang, Li; Jiao, Wen-Jing; Liu, Xia

    2014-06-01

    In order to evaluate the efficiency of ITS2 and psbA-trnH sequences used as DNA barcodes to distinguish Plantaginis Semen from its adulterants, we collected 71 samples of Plantaginis Semen and its adulterants. The ITS2 and psbA-trnH sequences were aligned through Clustal W, and the genetic distances were calculated by kimura 2-parameter (K2P) model and the Neighbor-Joining (NJ) phylogenetic trees were constructed using MEGA 5.1. The results indicated that the ITS2 sequence lengths of Plantago asiatica and P. depressa were 199 bp and 200 bp, respectively; the maximum intra-specific K2P distance were lower than the minimum inter-specific K2P distance; the NJ tree based on ITS2 sequence indicated that Plantaginis Semen and its adulterants could be distinguished clearly. The sequence lengths of psbA-trnH of both P. asiatica and P. depressa were 340 bp; the maximum intra-specific K2P distances were lower than the minimum inter-specific K2P distance; the NJ tree based on psbA-trnH sequence showed that Plantaginis Semen can be distinguished clearly from its adulterants except for P. major. Therefore, ITS2 sequences can be used as an ideal DNA barcode to distinguish Plantaginis Semen from its adulterants. PMID:25244750

  3. Phylogeny of the Zygomycota based on nuclear ribosomal sequence data.

    Science.gov (United States)

    White, Merlin M; James, Timothy Y; O'Donnell, Kerry; Cafaro, Matías J; Tanabe, Yuuhiko; Sugiyama, Junta

    2006-01-01

    The Zygomycota is an ecologically heterogenous assemblage of nonzoosporic fungi comprising two classes, Zygomycetes and Trichomycetes. Phylogenetic analyses have suggested that the phylum is polyphyletic; two of four orders of Trichomycetes are related to the Mesomycetozoa (protists) that diverged near the fungal/animal split. Current circumscription of the Zygomycota includes only orders with representatives that produce zygospores. We present a molecular-based phylogeny including recognized representatives of the Zygomycetes and Trichomycetes with a combined dataset for nuclear rRNA 18S (SSU), 5.8S and 28S (LSU) genes. Tree reconstruction by Bayesian analyses suggests the Zygomycota is paraphyletic. Although 12 clades were identified only some of these correspond to the nine orders of Zygomycota currently recognized. A large superordinal clade, comprising the Dimargaritales, Harpellales, Kickxellales and Zoopagales, grouping together many symbiotic fungi, also is identified in part by a unique septal structure. Although Harpellales and Kickxellales are not monophyletic, these lineages are distinct from the Mucorales, Endogonales and Mortierellales, which appear more closely related to the Ascomycota + Basidiomycota + Glomeromycota. The final major group, the insect-associated Entomophthorales, appears to be polyphyletic. In the present analyses Basidiobolus and Neozygites group within Zygomycota but not with the Entomophthorales. Clades are discussed with special reference to traditional classifications, mapping morphological characters and ecology, where possible, as a snapshot of our current phylogenetic perspective of the Zygomycota.

  4. Efficient Simulation of Quantum States Based on Classical Fields Modulated with Pseudorandom Phase Sequences

    CERN Document Server

    Fu, Jian

    2010-01-01

    We demonstrate that a tensor product structure could be obtained by introducing pseudorandom phase sequences into classical fields with two orthogonal modes. Using classical fields modulated with pseudorandom phase sequences, we discuss efficient simulation of several typical quantum states, including product state, Bell states, GHZ state, and W state. By performing quadrature demodulation scheme, we could obtain the mode status matrix of the simulating classical fields, based on which we propose a sequence permutation mechanism to reconstruct the simulated quantum states. The research on classical simulation of quantum states is important, for it not only enables potential practical applications in quantum computation, but also provides useful insights into fundamental concepts of quantum mechanics.

  5. Predicting effects of noncoding variants with deep learning–based sequence model

    Science.gov (United States)

    Zhou, Jian; Troyanskaya, Olga G

    2016-01-01

    Identifying functional effects of noncoding variants is a major challenge in human genetics. To predict the noncoding-variant effects de novo from sequence, we developed a deep learning–based algorithmic framework, DeepSEA (http://deepsea.princeton.edu/), that directly learns a regulatory sequence code from large-scale chromatin-profiling data, enabling prediction of chromatin effects of sequence alterations with single-nucleotide sensitivity. We further used this capability to improve prioritization of functional variants including expression quantitative trait loci (eQTLs) and disease-associated variants. PMID:26301843

  6. Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

    Directory of Open Access Journals (Sweden)

    Martin Mascher

    Full Text Available The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS, a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data; the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, paucity in the number of SNP that were jointly discovered by the different pipelines indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new

  7. Amino Acid sequence analysis of the two major outer Capsid Proteins (VP7 and VP4 from human-derived canine G3P[3] Rotavirus Strain Detected in Brazil

    Directory of Open Access Journals (Sweden)

    Adriana Luchs

    2013-12-01

    Full Text Available Introduction: A close look at the rotavirus group A (RVA genotypes in Brazil revealed the detection of a rare G3P[3] strain close related to canine strains. The aim of this study was to add to the already known genetic analysis by the description of the G3P[3] (IAL-R2638 strain amino acid characteristics. Methods: Amino acid sequence analysis and protein based trees were conducted using BioEdit and MEGA 4.0. Results: The VP7 and VP4 protein of the IAL-R2638 strain displayed the highest amino acid identity to the canine-derived human strain HCR3A (99.2%, and to the canine strain RV52/96 (96.4%, respectively. IAL-R2638 strain did not possess an extra VP7 N-linked glycosylation site at amino acid 238 recently described for some G3 strains, as well as RotaTeqTM G3 vaccine strain. The topology exhibited by phylogenetic trees in previous analysis were maintained in the present amino acid-based trees, reinforcing a stable relationship between G3P[3] strains. Conclusions: Amino acid analysis data were consistent with the previous sequence of data obtained for the IAL-R2638 strain, supporting its possible canine origin. Theoretically, RotaTeqTM vaccine could efficiently protect against G3P[3] infections based on the lack of the extra VP7 N-linked glycosylation site at amino acid 238. Phylogenetic analysis hypothesizes that all features undergo evolution independently of each other; however, unfavorable effects of nucleotide substitutions may be compensated by substitutions in other positions. The present study raises the question as to whether the amino acid-based trees could be applied as an approach to the study of RVA evolution, avoiding incorrect phylogenetic reconstructions.

  8. Acid-base transport in pancreas-new challenges

    DEFF Research Database (Denmark)

    Novak, Ivana; Haanes, Kristian Agmund; Wang, Jing

    2013-01-01

    Along the gastrointestinal tract a number of epithelia contribute with acid or basic secretions in order to aid digestive processes. The stomach and pancreas are the most extreme examples of acid (H+) and base (HCO-3) transporters, respectively. Nevertheless, they share the same challenges...... to consider in pancreas are the proton pumps (H-K-ATPases), as well as the calcium-activated K and Cl channels, such as K3.1 and TMEM16A/ANO1. Local regulators, such as purinergic signaling, fine-tune, and coordinate pancreatic secretion. Lastly, we speculate whether dys-regulation of acid-base transport...... contributes to pancreatic diseases including cystic fibrosis, pancreatitis, and cancer. © 2013 Novak, Haanes and Wang....

  9. MOST: a modified MLST typing tool based on short read sequencing.

    Science.gov (United States)

    Tewolde, Rediat; Dallman, Timothy; Schaefer, Ulf; Sheppard, Carmen L; Ashton, Philip; Pichon, Bruno; Ellington, Matthew; Swift, Craig; Green, Jonathan; Underwood, Anthony

    2016-01-01

    Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches. PMID:27602279

  10. Sequence diversity and functional conformity

    OpenAIRE

    de Lange, Orlando

    2015-01-01

    At least four phylogenetically distinct groups of bacteria encode repeat proteins with the common ability to bind specific DNA sequences with a unique but conserved code. Each repeat binds a single DNA base, and specificity is determined by the amino acid residue at position 13 of each repeat. Repeats are typically 33-35 amino acids long. Comparing repeat sequences across all groups reveals that only three positions are hyper-conserved. Repeats are in most cases functionally compatible such t...

  11. Statistical framework for detection of genetically modified organisms based on Next Generation Sequencing

    OpenAIRE

    Willems, Sander; Fraiture, Marie-Alice; Deforce, Dieter; De Keersmaecker, Sigrid; Herman, Philippe; De Loose, Marc; Ruttink, Tom; Van Nieuwerburgh, Filip; Roosens, Nancy

    2016-01-01

    Because the number and diversity of genetically modified (GM) crops has significantly increased, their analysis based on real-time PCR (qPCR) methods is becoming increasingly complex and laborious. While several pioneers already investigated Next Generation Sequencing (NGS) as an alternative to qPCR, its practical use has not been assessed for routine analysis. In this study a statistical framework was developed to predict the number of NGS reads needed to detect transgene sequences, to prove...

  12. CloudMap: A Cloud-Based Pipeline for Analysis of Mutant Genome Sequences

    OpenAIRE

    Minevich, Gregory; Park, Danny S.; Blankenberg, Daniel; Richard J Poole; Hobert, Oliver

    2012-01-01

    Whole genome sequencing (WGS) allows researchers to pinpoint genetic differences between individuals and significantly shortcuts the costly and time-consuming part of forward genetic analysis in model organism systems. Currently, the most effort-intensive part of WGS is the bioinformatic analysis of the relatively short reads generated by second generation sequencing platforms. We describe here a novel, easily accessible and cloud-based pipeline, called CloudMap, which greatly simplifies the ...

  13. Evolution of early life inferred from protein and ribonucleic acid sequences

    Science.gov (United States)

    Dayhoff, M. O.; Schwartz, R. M.

    1978-01-01

    The chemical structures of ferredoxin, 5S ribosomal RNA, and c-type cytochrome sequences have been employed to construct a phylogenetic tree which connects all major photosynthesizing organisms: the three types of bacteria, blue-green algae, and chloroplasts. Anaerobic and aerobic bacteria, eukaryotic cytoplasmic components and mitochondria are also included in the phylogenetic tree. Anaerobic nonphotosynthesizing bacteria similar to Clostridium were the earliest organisms, arising more than 3.2 billion years ago. Bacterial photosynthesis evolved nearly 3.0 billion years ago, while oxygen-evolving photosynthesis, originating in the blue-green algal line, came into being about 2.0 billion years ago. The phylogenetic tree supports the symbiotic theory of the origin of eukaryotes.

  14. Identification and Characterization of Second-Generation Invader Locked Nucleic Acids (LNAs) for Mixed-Sequence Recognition of Double-Stranded DNA

    DEFF Research Database (Denmark)

    Sau, Sujay P; Madsen, Andreas S; Podbevsek, Peter;

    2013-01-01

    , but substantial efforts are currently devoted to the development of alternative strategies that overcome the limitations observed with the classic approaches. In 2005, we introduced Invader locked nucleic acids (LNAs), i.e., double-stranded probes that are activated for mixed-sequence recognition of dsDNA through...... modification with "+1 interstrand zippers" of 2'-N-(pyren-1-yl)methyl-2'-amino-α-l-LNA monomers. Despite promising preliminary results, progress has been slow because of the synthetic complexity of the building blocks. Here we describe a study that led to the identification of two simpler classes of Invader...... monomers. We compare the thermal denaturation characteristics of double-stranded probes featuring different interstrand zippers of pyrene-functionalized monomers based on 2'-amino-α-l-LNA, 2'-N-methyl-2'-amino-DNA, and RNA scaffolds. Insights from fluorescence spectroscopy, molecular modeling, and NMR...

  15. [Clinical Application of Extraction and Analysis of the Key Frames Based on IVUS Sequences].

    Science.gov (United States)

    Mao, Haiqun; Yang, Feng; Huang, Zheng; Cui, Kai; Wang, Xinxin

    2015-08-01

    In this paper, we propose an image-based key frame gating method to reduce motion artifacts in intravascular ultrasound (IVUS) longitudinal cuts. The artifacts are mainly caused by the periodic relative displacement between blood vessels and the IVUS catheter due to cardiac motion. The method is achieved in four steps as following. Firstly, we convert IVUS image sequences to polar coordinates to cut down the amount of calculation. Secondly, we extracted a one-dimensional signal cluster reflecting cardiac motion by spectral analysis and filtering techniques. Thirdly, we designed a Butterworth band-pass filter for filtering the one-dimensional signal clusters. Fourthly, we retrieved the extremes of the filtered signal clusters to seek key frames to compose key-frames gated sequences. Experimental results showed that our algorithm was fast and the average frame processing time was 17ms. Observing the longitudinal viewpictures, we found that comparing to the original ones, the gated sequences had similar trend, less saw tooth shape, and good continuity. We selected 12 groups of clinical IVUS sequences [images (876 +/- 65 frames), coronary segments length (14.61 +/- 1.08 mm)] to calculate vessel volume, lumen volume, mean plaque burden of the original and gated sequences. Statistical results showed that, on one hand, both vessel volume and lumen volume measured of the gated sequences were significantly smaller than those of the original ones, and there was no significant difference on mean plaque burden between original and gated sequences, which met the need of the clinical diagnosis and treatment. On the other hand, variances of vessel area and lumen area of the gated sequences were significantly smaller than those of the original sequences, indicating that the gated sequences would be more stable than the original ones.

  16. Effect of benzoic acid supplementation on acid-base status and mineralmetabolism in catheterized growing pigs

    DEFF Research Database (Denmark)

    Nørgaard, Jan Værum; Fernández, José Adalberto; Sørensen, Kristina Ulrich;

    2010-01-01

    Benzoic acid (BA) in diets for growing pigs results in urinary acidification and reduced ammonia emission. The objective was to study the impact of BA supplementation on the acid-base status and mineral metabolism in pigs. Eight female 50-kg pigs, fitted with a catheter in the abdominal aorta, were...... samples of faeces and urine for determination of P, Ca, Na, K, and N balance. Blood samples from the abdominal aorta and urine were collected every 2 h for the first 24 h. Blood and urine were immediately analyzed for pH. Urinary pH decreased (P 

  17. Interactions between acid- and base-functionalized surfaces

    NARCIS (Netherlands)

    Giesbers, M.; Kleijn, J.M.; Cohen Stuart, M.A.

    2002-01-01

    In this paper we present an AFM force study on interactions between chemically modified surfaces. Surfaces with terminal groups of either NH2 or COOH were obtained by chemisorption of a silane-based compound (3-amino-propyltriethoxysilane) on silica or a thiol compound (11-mercapto undecanoic acid)

  18. A Schiff base formed from sulfanilic acid and dimethylformamide.

    Science.gov (United States)

    Hempel, A; Camerman, N; Mastropaolo, D; Camerman, A

    1999-04-15

    The crystal structure the Schiff base contains one 4-dimethylaminomethyleneaminobenzenesulfonic acid molecule in zwitterionic form [4-(dimethylaminomethyleneammonio)benzenesulfonate], and one water molecule in the asymmetric unit (C9H12N2O3S.H2O). Protonation occurs at nitrogen atom N1, but the charge is delocalized.

  19. Photoresponsive self-assemblies based on fatty acids.

    Science.gov (United States)

    Fameau, A-L; Arnould, A; Lehmann, M; von Klitzing, R

    2015-02-18

    Photoresponsive surfactant system based on fatty acids has been developed by the introduction in aqueous solution of a photoacid generator (PAG). Self-assembly transitions are triggered by UV irradiation due to a pH change induced by the presence of PAG.

  20. Fatty acid-based polyurethane films for wound dressing applications

    NARCIS (Netherlands)

    Gultekin, G.; Atalay-Oral, C.; Erkal, S.; Sahin, F.; Karastova, D.; Tantekin-Ersolmaz, S.B.; Guner, F.S.

    2008-01-01

    Fatty acid-based polyurethane films were prepared for use as potential wound dressing material. The polymerization reaction was carried out with or without catalyst. Polymer films were prepared by casting-evaporation technique with or without crosslink-catalyst. The film prepared from uncatalyzed re

  1. High Interlaboratory Reprocucibility of DNA Sequence-based Typing of Bacteria in a Multicenter Study

    DEFF Research Database (Denmark)

    Sousa, MA de; Boye, Kit; Lencastre, H de;

    2006-01-01

    Current DNA amplification-based typing methods for bacterial pathogens often lack interlaboratory reproducibility. In this international study, DNA sequence-based typing of the Staphylococcus aureus protein A gene (spa, 110 to 422 bp) showed 100% intra- and interlaboratory reproducibility without...

  2. Parallel proton transfer pathways in aqueous acid-base reactions

    OpenAIRE

    Cox, M. J.; Bakker, H.J.

    2008-01-01

    We study the mechanism of proton transfer (PT) between the photoacid 8-hydroxy-1,3, 6-pyrenetrisulfonic acid (HPTS) and the base chloroacetate in aqueous solution. We investigate both proton and deuteron transfer reactions in solutions with base concentrations ranging from 0.25M to 4M. Using femtosecond midinfrared spectroscopy, we probe the vibrational responses of HPTS, its conjugate photobase, the hydrated proton/deuteron, and chloroacetate. The measurement of these four resonances allows ...

  3. A next generation semiconductor based sequencing approach for the identification of meat species in DNA mixtures.

    Directory of Open Access Journals (Sweden)

    Francesca Bertolini

    Full Text Available The identification of the species of origin of meat and meat products is an important issue to prevent and detect frauds that might have economic, ethical and health implications. In this paper we evaluated the potential of the next generation semiconductor based sequencing technology (Ion Torrent Personal Genome Machine for the identification of DNA from meat species (pig, horse, cattle, sheep, rabbit, chicken, turkey, pheasant, duck, goose and pigeon as well as from human and rat in DNA mixtures through the sequencing of PCR products obtained from different couples of universal primers that amplify 12S and 16S rRNA mitochondrial DNA genes. Six libraries were produced including PCR products obtained separately from 13 species or from DNA mixtures containing DNA from all species or only avian or only mammalian species at equimolar concentration or at 1:10 or 1:50 ratios for pig and horse DNA. Sequencing obtained a total of 33,294,511 called nucleotides of which 29,109,688 with Q20 (87.43% in a total of 215,944 reads. Different alignment algorithms were used to assign the species based on sequence data. Error rate calculated after confirmation of the obtained sequences by Sanger sequencing ranged from 0.0003 to 0.02 for the different species. Correlation about the number of reads per species between different libraries was high for mammalian species (0.97 and lower for avian species (0.70. PCR competition limited the efficiency of amplification and sequencing for avian species for some primer pairs. Detection of low level of pig and horse DNA was possible with reads obtained from different primer pairs. The sequencing of the products obtained from different universal PCR primers could be a useful strategy to overcome potential problems of amplification. Based on these results, the Ion Torrent technology can be applied for the identification of meat species in DNA mixtures.

  4. A next generation semiconductor based sequencing approach for the identification of meat species in DNA mixtures.

    Science.gov (United States)

    Bertolini, Francesca; Ghionda, Marco Ciro; D'Alessandro, Enrico; Geraci, Claudia; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    The identification of the species of origin of meat and meat products is an important issue to prevent and detect frauds that might have economic, ethical and health implications. In this paper we evaluated the potential of the next generation semiconductor based sequencing technology (Ion Torrent Personal Genome Machine) for the identification of DNA from meat species (pig, horse, cattle, sheep, rabbit, chicken, turkey, pheasant, duck, goose and pigeon) as well as from human and rat in DNA mixtures through the sequencing of PCR products obtained from different couples of universal primers that amplify 12S and 16S rRNA mitochondrial DNA genes. Six libraries were produced including PCR products obtained separately from 13 species or from DNA mixtures containing DNA from all species or only avian or only mammalian species at equimolar concentration or at 1:10 or 1:50 ratios for pig and horse DNA. Sequencing obtained a total of 33,294,511 called nucleotides of which 29,109,688 with Q20 (87.43%) in a total of 215,944 reads. Different alignment algorithms were used to assign the species based on sequence data. Error rate calculated after confirmation of the obtained sequences by Sanger sequencing ranged from 0.0003 to 0.02 for the different species. Correlation about the number of reads per species between different libraries was high for mammalian species (0.97) and lower for avian species (0.70). PCR competition limited the efficiency of amplification and sequencing for avian species for some primer pairs. Detection of low level of pig and horse DNA was possible with reads obtained from different primer pairs. The sequencing of the products obtained from different universal PCR primers could be a useful strategy to overcome potential problems of amplification. Based on these results, the Ion Torrent technology can be applied for the identification of meat species in DNA mixtures.

  5. Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

    Science.gov (United States)

    Freimuth, Paul I.

    2010-04-06

    The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.

  6. Carbon Dot Based Sensing of Dopamine and Ascorbic Acid

    OpenAIRE

    Upama Baruah; Neelam Gogoi; Achyut Konwar; Manash Jyoti Deka; Devasish Chowdhury; Gitanjali Majumdar

    2014-01-01

    We demonstrate carbon dot based sensor of catecholamine, namely, dopamine and ascorbic acid. Carbon dots (CDs) were prepared from a green source: commercially available Assam tea. The carbon dots prepared from tea had particle sizes of ∼0.8 nm and are fluorescent. Fluorescence of the carbon dots was found to be quenched in the presence of dopamine and ascorbic acid with greater sensitivity for dopamine. The minimum detectable limits were determined to be 33 μM and 98 μM for dopamine and ascor...

  7. A rational approach in probe design for nucleic acid-based biosensing.

    Science.gov (United States)

    Ermini, M L; Scarano, S; Bini, R; Banchelli, M; Berti, D; Mascini, M; Minunni, M

    2011-08-15

    Development of nucleic acid-based sensing attracts the interest of many researchers in the field of both basic and applied research in chemistry. Major factors for the fabrication of a successful nucleic acid sensor include the design of probes for target sequence hybridization and their immobilization on the chip surface. Here we demonstrate that a rational choice of bioprobes has important impact on the sensor's analytical performances. Computational evaluations, by a simple and freely available program, successfully led to the design of the best probes for a given target, with direct application to nucleic acid-based sensing. We developed here an optimized and reproducible strategy for in silico probe design supported by optical transduction experiments. In particular Surface Plasmon Resonance imaging (SPRi), at the forefront of optical sensing, was used here as proof of principle. Five probes were selected, immobilized on gold chip surfaces by widely consolidated thiol chemistry and tested to validate the computational model. Using SPRi as the transducting component, real-time and label free analysis was performed, taking the Homo sapiens actin beta (ACTB) gene fragment as model system in nucleic acid detection. The experimental sensor behavior was further studied by evaluating the strength of the secondary structure of probes using melting experiments. Dedicated software was also used to evaluate probes' folding, to support our criteria. The SPRi experimental results fully validate the computational evaluations, revealing this approach highly promising as a useful tool to design biosensor probes with optimized performances. PMID:21719269

  8. Acid-base transport in pancreas – new challenges

    Directory of Open Access Journals (Sweden)

    Ivana eNovak

    2013-12-01

    Full Text Available Along the gastrointestinal tract a number of epithelia contribute with acid or basic secretions in order to aid digestive processes. The stomach and pancreas are the most extreme examples of acid (H+ and base (HCO3- transporters, respectively. Nevertheless, they share the same challenges of transporting acid and bases across epithelia and effectively regulating their intracellular pH. In this review, we will make use of comparative physiology to enlighten the cellular mechanisms of pancreatic HCO3- and fluid secretion, which is still challenging physiologists. Some of the novel transporters to consider in pancreas are the proton pumps (H+-K+-ATPases, as well as the calcium-activated K+ and Cl- channels, such as KCa3.1 and TMEM16A/ANO1. Local regulators, such as purinergic signalling, fine-tune and coordinate pancreatic secretion. Lastly, we speculate whether dys-regulation of acid-base transport contributes to pancreatic diseases including cystic fibrosis, pancreatitis and cancer.

  9. JiffyNet: a web-based instant protein network modeler for newly sequenced species.

    Science.gov (United States)

    Kim, Eiru; Kim, Hanhae; Lee, Insuk

    2013-07-01

    Revolutionary DNA sequencing technology has enabled affordable genome sequencing for numerous species. Thousands of species already have completely decoded genomes, and tens of thousands more are in progress. Naturally, parallel expansion of the functional parts list library is anticipated, yet genome-level understanding of function also requires maps of functional relationships, such as functional protein networks. Such networks have been constructed for many sequenced species including common model organisms. Nevertheless, the majority of species with sequenced genomes still have no protein network models available. Moreover, biologists might want to obtain protein networks for their species of interest on completion of the genome projects. Therefore, there is high demand for accessible means to automatically construct genome-scale protein networks based on sequence information from genome projects only. Here, we present a public web server, JiffyNet, specifically designed to instantly construct genome-scale protein networks based on associalogs (functional associations transferred from a template network by orthology) for a query species with only protein sequences provided. Assessment of the networks by JiffyNet demonstrated generally high predictive ability for pathway annotations. Furthermore, JiffyNet provides network visualization and analysis pages for wide variety of molecular concepts to facilitate network-guided hypothesis generation. JiffyNet is freely accessible at http://www.jiffynet.org.

  10. Study design requirements for RNA sequencing-based breast cancer diagnostics.

    Science.gov (United States)

    Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias

    2016-01-01

    Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic. PMID:26830453

  11. Construction of a phylogenetic tree of photosynthetic prokaryotes based on average similarities of whole genome sequences.

    Directory of Open Access Journals (Sweden)

    Soichirou Satoh

    Full Text Available Phylogenetic trees have been constructed for a wide range of organisms using gene sequence information, especially through the identification of orthologous genes that have been vertically inherited. The number of available complete genome sequences is rapidly increasing, and many tools for construction of genome trees based on whole genome sequences have been proposed. However, development of a reasonable method of using complete genome sequences for construction of phylogenetic trees has not been established. We have developed a method for construction of phylogenetic trees based on the average sequence similarities of whole genome sequences. We used this method to examine the phylogeny of 115 photosynthetic prokaryotes, i.e., cyanobacteria, Chlorobi, proteobacteria, Chloroflexi, Firmicutes and nonphotosynthetic organisms including Archaea. Although the bootstrap values for the branching order of phyla were low, probably due to lateral gene transfer and saturated mutation, the obtained tree was largely consistent with the previously reported phylogenetic trees, indicating that this method is a robust alternative to traditional phylogenetic methods.

  12. ParticleCall: A particle filter for base calling in next-generation sequencing systems

    Directory of Open Access Journals (Sweden)

    Shen Xiaohu

    2012-07-01

    Full Text Available Abstract Background Next-generation sequencing systems are capable of rapid and cost-effective DNA sequencing, thus enabling routine sequencing tasks and taking us one step closer to personalized medicine. Accuracy and lengths of their reads, however, are yet to surpass those provided by the conventional Sanger sequencing method. This motivates the search for computationally efficient algorithms capable of reliable and accurate detection of the order of nucleotides in short DNA fragments from the acquired data. Results In this paper, we consider Illumina’s sequencing-by-synthesis platform which relies on reversible terminator chemistry and describe the acquired signal by reformulating its mathematical model as a Hidden Markov Model. Relying on this model and sequential Monte Carlo methods, we develop a parameter estimation and base calling scheme called ParticleCall. ParticleCall is tested on a data set obtained by sequencing phiX174 bacteriophage using Illumina’s Genome Analyzer II. The results show that the developed base calling scheme is significantly more computationally efficient than the best performing unsupervised method currently available, while achieving the same accuracy. Conclusions The proposed ParticleCall provides more accurate calls than the Illumina’s base calling algorithm, Bustard. At the same time, ParticleCall is significantly more computationally efficient than other recent schemes with similar performance, rendering it more feasible for high-throughput sequencing data analysis. Improvement of base calling accuracy will have immediate beneficial effects on the performance of downstream applications such as SNP and genotype calling. ParticleCall is freely available at https://sourceforge.net/projects/particlecall.

  13. An Optimal Sorting of Pulse Amplitude Sequence Based on the Phased Array Radar Beam Tasks

    Institute of Scientific and Technical Information of China (English)

    Chuan Sheng∗,Yongshun Zhang; Wenlong Lu

    2016-01-01

    The study of phased array radar ( PAR) pulse amplitude sequence characteristics is the key to understand the radar’s working state and its beam’s scanning manner. According to the principle of antenna pattern formation and the searching and tracking modes of beams, this paper analyzes the characteristics and differences of pulse amplitude sequence when the radar beams work in searching and tracking modes respectively. Then an optimal sorting model of pulse amplitude sequence is established based on least⁃squares and curve⁃fitting methods. This method is helpful for acquiring the current working state of the radar and recognizing its instantaneous beam pointing by sorting the pulse amplitude sequence without the necessity to estimate the antenna pattern.

  14. Fast interactive segmentation algorithm of image sequences based on relative fuzzy connectedness

    Institute of Scientific and Technical Information of China (English)

    Tian Chunna; Gao Xinbo

    2005-01-01

    A fast interactive segmentation algorithm of image-sequences based on relative fuzzy connectedness is presented. In comparison with the original algorithm, the proposed one, with the same accuracy, accelerates the segmentation speed by three times for single image. Meanwhile, this fast segmentation algorithm is extended from single object to multiple objects and from single-image to image-sequences. Thus the segmentation of multiple objects from complex background and batch segmentation of image-sequences can be achieved. In addition, a post-processing scheme is incorporated in this algorithm, which extracts smooth edge with one-pixel-width for each segmented object. The experimental results illustrate that the proposed algorithm can obtain the object regions of interest from medical image or image-sequences as well as man-made images quickly and reliably with only a little interaction.

  15. Weather data analysis based on typical weather sequence analysis. Application: energy building simulation

    CERN Document Server

    David, Mathieu; Garde, Francois; Boyer, Harry

    2014-01-01

    In building studies dealing about energy efficiency and comfort, simulation software need relevant weather files with optimal time steps. Few tools generate extreme and mean values of simultaneous hourly data including correlation between the climatic parameters. This paper presents the C++ Runeole software based on typical weather sequences analysis. It runs an analysis process of a stochastic continuous multivariable phenomenon with frequencies properties applied to a climatic database. The database analysis associates basic statistics, PCA (Principal Component Analysis) and automatic classifications. Different ways of applying these methods will be presented. All the results are stored in the Runeole internal database that allows an easy selection of weather sequences. The extreme sequences are used for system and building sizing and the mean sequences are used for the determination of the annual cooling loads as proposed by Audrier-Cros (Audrier-Cros, 1984). This weather analysis was tested with the datab...

  16. Acid-base cooperativity of heterogeneous catalyst containing acidic framework and sterically hindered base for aldol condensation

    Institute of Scientific and Technical Information of China (English)

    Hua Li; Shu Tao Xu; Xiao Bing Lu; Wei Ping Zhang

    2009-01-01

    A bifunctional heterogeneous catalyst containing two mutually incompatible acidic and basic sites,which exhibits cooperative catalytic behavior in the aldol condensation of acetone and various aldehydes,was synthesized by postgrafting of 1,5,7-triazabicyclo[4.4.0]dec-5-ene(TBD,a sterically hindered organic base)onto Al-MCM-41 molecular sieve.

  17. Triplex formation at single-stranded nucleic acid target sites of unrestricted sequence by two added strands of oligonucleotides: A proposed model

    Energy Technology Data Exchange (ETDEWEB)

    Trapane, T.L.; Ts' o, P.O.P. (John Hopkins Univ., Baltimore, MD (United States))

    1994-11-16

    By using the standard purine nucleosides, guanosine and adenosine, and the pyrimidine C-nucleosides, pseudoisocytidine and pseudouridine, as complements on a probe strand, it is possible to construct a regular Watson-Crick helix with a single-stranded target sequence having any arrangement of the four naturally-occurring bases found in nucleic acids. The major groove of this helix will have a unique configuration of hydrogen-bonding sites on the probe strand for each of these four base pairs. By using this duplex as a framework, an ensemble of recognition patterns composed of base triads may be constructed. In these patterns, either a homopyrimidine or homopurine third strand binds in the major groove of the duplex formed by the target and probe strands. Ten distinct geometries, or motifs, are shown, each one consisting of four isomorphic base triads built upon recognition of C, G, A, or U(T) residues in the target strand. In order to maintain specific hydrogen bonding and to construct isomorphous triads, the use of several nonstandard bases is proposed. 33 refs., 9 figs., 3 tabs.

  18. Molecular identification based on coat protein sequences of the Barley yellow dwarf virus from Brazil

    Directory of Open Access Journals (Sweden)

    Talita Bernardon Mar

    2013-12-01

    Full Text Available Yellow dwarf disease, one of the most important diseases of cereal crops worldwide, is caused by virus species belonging to the Luteoviridae family. Forty-two virus isolates obtained from oat (Avena sativa L., wheat (Triticum aestivum L., barley (Hordeum vulgare L., corn (Zea mays L., and ryegrass (Lolium multiflorum Lam. collected between 2007 and 2008 from winter cereal crop regions in southern Brazil were screened by polymerase chain reaction (PCR with primers designed on ORF 3 (coat protein - CP for the presence of Barley yellow dwarf virus and Cereal yellow dwarf virus (B/CYDV. PCR products of expected size (~357 bp for subgroup II and (~831 bp for subgroup I were obtained for three and 39 samples, respectively. These products were cloned and sequenced. The subgroup II 3' partial CP amino acid deduced sequences were identified as BYDV-RMV (92 - 93 % of identity with "Illinois" Z14123 isolate. The complete CP amino acid deduced sequences of subgroup I isolates were confirmed as BYDV-PAV (94 - 99 % of identity and established a very homogeneous group (identity higher than 99 %. These results support the prevalence of BYDV-PAV in southern Brazil as previously diagnosed by Enzyme-Linked Immunosorbent Assay (ELISA and suggest that this population is very homogeneous. To our knowledge, this is the first report of BYDV-RMV in Brazil and the first genetic diversity study on B/CYDV in South America.

  19. BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark.

    Science.gov (United States)

    Thompson, Julie D; Koehl, Patrice; Ripp, Raymond; Poch, Olivier

    2005-10-01

    Multiple sequence alignment is one of the cornerstones of modern molecular biology. It is used to identify conserved motifs, to determine protein domains, in 2D/3D structure prediction by homology and in evolutionary studies. Recently, high-throughput technologies such as genome sequencing and structural proteomics have lead to an explosion in the amount of sequence and structure information available. In response, several new multiple alignment methods have been developed that improve both the efficiency and the quality of protein alignments. Consequently, the benchmarks used to evaluate and compare these methods must also evolve. We present here the latest release of the most widely used multiple alignment benchmark, BAliBASE, which provides high quality, manually refined, reference alignments based on 3D structural superpositions. Version 3.0 of BAliBASE includes new, more challenging test cases, representing the real problems encountered when aligning large sets of complex sequences. Using a novel, semiautomatic update protocol, the number of protein families in the benchmark has been increased and representative test cases are now available that cover most of the protein fold space. The total number of proteins in BAliBASE has also been significantly increased from 1444 to 6255 sequences. In addition, full-length sequences are now provided for all test cases, which represent difficult cases for both global and local alignment programs. Finally, the BAliBASE Web site (http://www-bio3d-igbmc.u-strasbg.fr/balibase) has been completely redesigned to provide a more user-friendly, interactive interface for the visualization of the BAliBASE reference alignments and the associated annotations.

  20. Enhancing Students Motivation towards School Science with an Inquiry - Based Site Visit Teaching Sequence: A Design - Based Research Approach

    OpenAIRE

    Anni Loukomies

    2013-01-01

    An inquiry-based site visit teaching sequence for school science was designed in co-operation with researchers and science teachers, according to the principles of Design Based Research (DBR). Out-of-school industry site visits were central in the design. Theory-based conjectures arising from the literature on motivation, interest and inquiry-based science teaching (IBST) were embodied in the design solution, and these embodied conjectures were studied in order to uncover the aspects of the d...

  1. Deep sequencing-based identification of small regulatory RNAs in Synechocystis sp. PCC 6803.

    Directory of Open Access Journals (Sweden)

    Wen Xu

    Full Text Available Synechocystis sp. PCC 6803 is a genetically tractable model organism for photosynthesis research. The genome of Synechocystis sp. PCC 6803 consists of a circular chromosome and seven plasmids. The importance of small regulatory RNAs (sRNAs as mediators of a number of cellular processes in bacteria has begun to be recognized. However, little is known regarding sRNAs in Synechocystis sp. PCC 6803. To provide a comprehensive overview of sRNAs in this model organism, the sRNAs of Synechocystis sp. PCC 6803 were analyzed using deep sequencing, and 7,951,189 reads were obtained. High quality mapping reads (6,127,890 were mapped onto the genome and assembled into 16,192 transcribed regions (clusters based on read overlap. A total number of 5211 putative sRNAs were revealed from the genome and the 4 megaplasmids, and 27 of these molecules, including four from plasmids, were confirmed by RT-PCR. In addition, possible target genes regulated by all of the putative sRNAs identified in this study were predicted by IntaRNA and analyzed for functional categorization and biological pathways, which provided evidence that sRNAs are indeed involved in many different metabolic pathways, including basic metabolic pathways, such as glycolysis/gluconeogenesis, the citrate cycle, fatty acid metabolism and adaptations to environmentally stress-induced changes. The information from this study provides a valuable reservoir for understanding the sRNA-mediated regulation of the complex physiology and metabolic processes of cyanobacteria.

  2. PDMS-based gold electrode for sensing ascorbic acid.

    Science.gov (United States)

    Xu, Qin; Bi, Lianhua; Zheng, Huxiang; Fan, Dahe; Wang, Wei

    2011-11-01

    Electrode with optical shapes is appreciated in microfluidics. In this article, we reported a flexible poly(dimethylsiloxane) (PDMS)-based gold electrode for ascorbic acid detection. Gold nanoparticles were chemically deposited on PDMS and the composite film was applied as working electrode. The electrode could undergo deformation and display good response performance without damage. This biosensor could give quick response to ascorbic acid (AA) (<5s) and the currents were linear with concentrations of AA in range of 0.023-7.00 mM and 30-100 mM, respectively. Limit of detection was 0.008 mM (S/N=3). This biosensor has been applied to determine ascorbic acid content in vitamin C tablets and the results were consistent with traditional iodometric method. PMID:21807485

  3. Cloning, nucleotide sequence, and expression of the Rhodobacter sphaeroides Y thioredoxin gene.

    OpenAIRE

    Pille, S.; Chuat, J C; Breton, A M; Clément-Métral, J D; Galibert, F

    1990-01-01

    Synthetic oligodeoxynucleotide probes based on the known amino acid sequence of Rhodobacter sphaeroides Y thioredoxin were used to identify, clone, and sequence the structural gene. The amino acid sequence derived from the DNA sequence of the R. sphaeroides gene was identical to the known amino acid sequence of R. sphaeroides thioredoxin. An NcoI site was created by directed mutagenesis at the beginning of the thioredoxin gene, inducing in the encoded protein the replacement of serine in posi...

  4. Phylogeny of plastids based on cladistic analysis of gene loss inferred from complete plastid genome sequences.

    Science.gov (United States)

    Nozaki, Hisayoshi; Ohta, Njij; Matsuzaki, Motomichi; Misumi, Osami; Kuroiwa, Tsuneyoshi

    2003-10-01

    Based on the recent hypothesis on the origin of eukaryotic phototrophs, red algae, green plants, and glaucophytes constitute the "primary photosynthetic eukaryotes" (whose plastids may have originated directly from a cyanobacterium-like prokaryote via primary endosymbiosis), whereas the plastids of other lineages of eukaryotic phototrophs appear to be the result of secondary or tertiary endosymbiotic events (involving a phototrophic eukaryote and a host cell). Although phylogenetic analyses using multiple plastid genes from a wide range of eukaryotic lineages have been carried out, some of the major phylogenetic relationships of plastids remain ambiguous or conflict between different phylogenetic methods used for nucleotide or amino acid substitutions. Therefore, an alternative methodology to infer the plastid phylogeny is needed. Here, we carried out a cladistic analysis of the "loss of plastid genes" after primary endosymbiosis using complete plastid genome sequences from a wide range of eukaryotic phototrophs. Since it is extremely unlikely that plastid genes are regained during plastid evolution, we used the irreversible Camin-Sokal model for our cladistic analysis of the loss of plastid genes. The cladistic analysis of the 274 plastid protein-coding genes resolved the 20 operational taxonomic units representing a wide range of eukaryotic lineages (including three secondary plastid-containing groups) into two large monophyletic groups with high bootstrap values: one corresponded to the red lineage and the other consisted of a large clade composed of the green lineage (green plants and Euglena) and the basal glaucophyte plastid. Although the sister relationship between the green lineage and the Glaucophyta was not resolved in recent phylogenetic studies using amino acid substitutions from multiple plastid genes, it is consistent with the rbcL gene phylogeny and with a recent phylogenetic study using multiple nuclear genes. In addition, our analysis robustly

  5. Amino Acid Sequence and Structural Comparison of BACE1 and BACE2 Using Evolutionary Trace Method

    Directory of Open Access Journals (Sweden)

    Hoda Mirsafian

    2014-01-01

    Full Text Available Beta-amyloid precursor protein cleavage enzyme 1 (BACE1 and beta-amyloid precursor protein cleavage enzyme 2 (BACE2, members of aspartyl protease family, are close homologues and have high similarity in their protein crystal structures. However, their enzymatic properties differ leading to disparate clinical consequences. In order to identify the residues that are responsible for such differences, we used evolutionary trace (ET method to compare the amino acid conservation patterns of BACE1 and BACE2 in several mammalian species. We found that, in BACE1 and BACE2 structures, most of the ligand binding sites are conserved which indicate their enzymatic property of aspartyl protease family members. The other conserved residues are more or less randomly localized in other parts of the structures. Four group-specific residues were identified at the ligand binding site of BACE1 and BACE2. We postulated that these residues would be essential for selectivity of BACE1 and BACE2 biological functions and could be sites of interest for the design of selective inhibitors targeting either BACE1 or BACE2.

  6. A Model for Protein Sequence Evolution Based on Selective Pressure for Protein Stability: Application to Hemoglobins

    OpenAIRE

    Lorraine Marsh

    2009-01-01

    Negative selection against protein instability is a central influence on evolution of proteins. Protein stability is maintained over evolution despite changes in underlying sequences. An empirical all-site stability-based model of evolution was developed to focus on the selection of residues arising from their contributions to protein stability. In this model, site rates could vary. A structure-based method was used to predict stationary frequencies of hemoglobin residues based on their prope...

  7. A Novel Abundance-Based Algorithm for Binning Metagenomic Sequences Using l-Tuples

    Science.gov (United States)

    Wu, Yu-Wei; Ye, Yuzhen

    Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. Among the computational tools recently developed for metagenomic sequence analysis, binning tools attempt to classify all (or most) of the sequences in a metagenomic dataset into different bins (i.e., species), based on various DNA composition patterns (e.g., the tetramer frequencies) of various genomes. Composition-based binning methods, however, cannot be used to classify very short fragments, because of the substantial variation of DNA composition patterns within a single genome. We developed a novel approach (AbundanceBin) for metagenomics binning by utilizing the different abundances of species living in the same environment. AbundanceBin is an application of the Lander-Waterman model to metagenomics, which is based on the l-tuple content of the reads. AbundanceBin achieved accurate, unsupervised, clustering of metagenomic sequences into different bins, such that the reads classified in a bin belong to species of identical or very similar abundances in the sample. In addition, AbundanceBin gave accurate estimations of species abundances, as well as their genome sizes - two important parameters for characterizing a microbial community. We also show that AbundanceBin performed well when the sequence lengths are very short (e.g. 75 bp) or have sequencing errors.

  8. DNA Sequence Optimization Based on Continuous Particle Swarm Optimization for Reliable DNA Computing and DNA Nanotechnology

    Directory of Open Access Journals (Sweden)

    N. K. Khalid

    2008-01-01

    Full Text Available Problem statement: In DNA based computation and DNA nanotechnology, the design of good DNA sequences has turned out to be an essential problem and one of the most practical and important research topics. Basically, the DNA sequence design problem is a multi-objective problem and it can be evaluated using four objective functions, namely, Hmeasure, similarity, continuity and hairpin. Approach: There are several ways to solve multi-objective problem, however, in order to evaluate the correctness of PSO algorithm in DNA sequence design, this problem is converted into single objective problem. Particle Swarm Optimization (PSO is proposed to minimize the objective in the problem, subjected to two constraints: melting temperature and GCcontent. A model is developed to present the DNA sequence design based on PSO computation. Results: Based on experiments and researches done, 20 particles are used in the implementation of the optimization process, where the average values and the standard deviation for 100 runs are shown along with comparison to other existing methods. Conclusion: The results achieve verified that PSO can suitably solves the DNA sequence design problem using the proposed method and model, comparatively better than other approaches.

  9. Pigs in sequence space: A 0.66X coverage pig genome survey based on shotgun sequencing

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Schierup, M.H.; Jorgensen, F.G.;

    2005-01-01

    sequences (0.66X coverage) from the pig genome. The data are hereby released (NCBI Trace repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project") together with an initial evolutionary analysis. The non-repetitive fraction of the sequences was aligned to the UCSC human...

  10. RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

    Science.gov (United States)

    Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

    2015-01-01

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition. PMID:25860951

  11. RNA Sequencing Identifies Upregulated Kyphoscoliosis Peptidase and Phosphatidic Acid Signaling Pathways in Muscle Hypertrophy Generated by Transgenic Expression of Myostatin Propeptide

    Directory of Open Access Journals (Sweden)

    Yuanxin Miao

    2015-04-01

    Full Text Available Myostatin (MSTN, a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph, and zinc metallopeptidase STE24 (Zmpste24. In addition, kyphoscoliosis peptidase (Ky, which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA pathways (Dgki, Dgkz, Plcd4 were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  12. RNA sequencing identifies upregulated kyphoscoliosis peptidase and phosphatidic acid signaling pathways in muscle hypertrophy generated by transgenic expression of myostatin propeptide.

    Science.gov (United States)

    Miao, Yuanxin; Yang, Jinzeng; Xu, Zhong; Jing, Lu; Zhao, Shuhong; Li, Xinyun

    2015-04-09

    Myostatin (MSTN), a member of the transforming growth factor-β superfamily, plays a crucial negative role in muscle growth. MSTN mutations or inhibitions can dramatically increase muscle mass in most mammal species. Previously, we generated a transgenic mouse model of muscle hypertrophy via the transgenic expression of the MSTN N-terminal propeptide cDNA under the control of the skeletal muscle-specific MLC1 promoter. Here, we compare the mRNA profiles between transgenic mice and wild-type littermate controls with a high-throughput RNA sequencing method. The results show that 132 genes were significantly differentially expressed between transgenic mice and wild-type control mice; 97 of these genes were up-regulated, and 35 genes were down-regulated in the skeletal muscle. Several genes that had not been reported to be involved in muscle hypertrophy were identified, including up-regulated myosin binding protein H (mybph), and zinc metallopeptidase STE24 (Zmpste24). In addition, kyphoscoliosis peptidase (Ky), which plays a vital role in muscle growth, was also up-regulated in the transgenic mice. Interestingly, a pathway analysis based on grouping the differentially expressed genes uncovered that cardiomyopathy-related pathways and phosphatidic acid (PA) pathways (Dgki, Dgkz, Plcd4) were up-regulated. Increased PA signaling may increase mTOR signaling, resulting in skeletal muscle growth. The findings of the RNA sequencing analysis help to understand the molecular mechanisms of muscle hypertrophy caused by MSTN inhibition.

  13. Molecular phylogeny of Pamphagidae (Acridoidea, Orthoptera) from China based on mitochondrial cytochrome oxidase Ⅱ sequences

    Institute of Scientific and Technical Information of China (English)

    Dao-Chuan Zhang; Hong-Yan Han; Hong Yin; Xin-Jiang Li; Zhan Yin; Xiang-Chu Yin

    2011-01-01

    Phylogenetic relationships of Pamphagidae were examined using cytochrome oxidase subunit Ⅱ (COII) mtDNA sequences (684 bp). Twenty-seven species of Acridoidea from 20 genera were sequenced to obtain mtDNA data, along with four species from the GenBank nucleotide database. The purpose of this study was analyzing the phylogenetic relationships among subfamilies within Pamphagidae and interpreting the phylogenetic position of this family within the Acridoidea superfamily. Phylogenetic trees were reconstructed using neighbor-joining (NJ), maximum parsimony (MP) and Bayesian inference (BI) methods. The 684 bp analyzed fragment included 126 parsimony informative sites. Sequences diverged 1.0%-l1.1% between genera within subfamilies, and 8.8%-12.3% between subfamilies. Amino acid sequence diverged 0-6.1% between genera within subfamilies, and 0.4%-7.5% between subfamilies. Our phylogenetic trees revealed the monophyly of Pamphagidae and three distinct major groups within this family. Moreover, several well supported and stable clades were found in Pamphagidae. The global clustering results were similar to that obtained through classical morphological classification: Prionotropisinae, Thrinchinae and Pamphaginae were monophyletic groups. However, the current genus Filchnerella (Prionotropisinae) was not a monophyletic group and the genus Asiotmethis (Prionotropisinae) was a sister group of the genus Thrinchus (Thrinchinae). Further molecular and morphological studies are required to clarify the phylogenetic relationships of the genera Filchnerella and Asiotmethis.

  14. Comparative analysis of four essential Gracilariaceae species in China based on whole transcriptomic sequencing

    Institute of Scientific and Technical Information of China (English)

    XU Jiayue; WU Shuangxiu; YU Jun; SUN Jing; YIN Jinlong; WANG Liang; WANG Xumin; LIU Tao; CHI Shan; LIU Cui; REN Lufeng

    2014-01-01

    Three Gracilaria species, G. chouae, G. blodgettii, G. vermiculophylla and a close relative species, Gracilari-opsis lemaneiformis which is now nominated as Gracilaria lemaneiformis, are the typically indigenous spe-cies which are important resources for the production of special proteins, phycobilisomes, special carbo-hydrates, and agar in China. In this study, de novo transcriptome sequencing on these four species using the next generation sequencing technology was performed for the first time. Functional annotations on assembled sequencing reads showed that the transcriptomic profiles were quite different between G. lema-neiformis and other three Gracilaria species. Comparative analysis of differential gene expression related to carbohydrate and phycobiliprotein metabolisms also showed that the expression profiles of these essential genes were different in four species. The genes encoding allophycocyanin, phycocyanin and phycoerythrin were further examined in four species and their deduced amino acid sequences were used for phylogenetic analysis to confirm that G. lemaneiformis had close relationship to genus Gracilaria, as well as that within genus Gracilaria, G. chouae had closer relationship to G. vermiculophylla rather than to G. blodgettii. The de novo transcriptome study on four species provided a valuable genomic resource for further understanding and analysis on biological and evolutionary study among marine algae.

  15. Experimental design-based functional mining and characterization of high-throughput sequencing data in the sequence read archive.

    Directory of Open Access Journals (Sweden)

    Takeru Nakazato

    Full Text Available High-throughput sequencing technology, also called next-generation sequencing (NGS, has the potential to revolutionize the whole process of genome sequencing, transcriptomics, and epigenetics. Sequencing data is captured in a public primary data archive, the Sequence Read Archive (SRA. As of January 2013, data from more than 14,000 projects have been submitted to SRA, which is double that of the previous year. Researchers can download raw sequence data from SRA website to perform further analyses and to compare with their own data. However, it is extremely difficult to search entries and download raw sequences of interests with SRA because the data structure is complicated, and experimental conditions along with raw sequences are partly described in natural language. Additionally, some sequences are of inconsistent quality because anyone can submit sequencing data to SRA with no quality check. Therefore, as a criterion of data quality, we focused on SRA entries that were cited in journal articles. We extracted SRA IDs and PubMed IDs (PMIDs from SRA and full-text versions of journal articles and retrieved 2748 SRA ID-PMID pairs. We constructed a publication list referring to SRA entries. Since, one of the main themes of -omics analyses is clarification of disease mechanisms, we also characterized SRA entries by disease keywords, according to the Medical Subject Headings (MeSH extracted from articles assigned to each SRA entry. We obtained 989 SRA ID-MeSH disease term pairs, and constructed a disease list referring to SRA data. We previously developed feature profiles of diseases in a system called "Gendoo". We generated hyperlinks between diseases extracted from SRA and the feature profiles of it. The developed project, publication and disease lists resulting from this study are available at our web service, called "DBCLS SRA" (http://sra.dbcls.jp/. This service will improve accessibility to high-quality data from SRA.

  16. Genotyping-in-Thousands by sequencing (GT-seq): A cost effective SNP genotyping method based on custom amplicon sequencing.

    Science.gov (United States)

    Campbell, Nathan R; Harmon, Stephanie A; Narum, Shawn R

    2015-07-01

    Genotyping-in-Thousands by sequencing (GT-seq) is a method that uses next-generation sequencing of multiplexed PCR products to generate genotypes from relatively small panels (50-500) of targeted single-nucleotide polymorphisms (SNPs) for thousands of individuals in a single Illumina HiSeq lane. This method uses only unlabelled oligos and PCR master mix in two thermal cycling steps for amplification of targeted SNP loci. During this process, sequencing adapters and dual barcode sequence tags are incorporated into the amplicons enabling thousands of individuals to be pooled into a single sequencing library. Post sequencing, reads from individual samples are split into individual files using their unique combination of barcode sequences. Genotyping is performed with a simple perl script which counts amplicon-specific sequences for each allele, and allele ratios are used to determine the genotypes. We demonstrate this technique by genotyping 2068 individual steelhead trout (Oncorhynchus mykiss) samples with a set of 192 SNP markers in a single library sequenced in a single Illumina HiSeq lane. Genotype data were 99.9% concordant to previously collected TaqMan(™) genotypes at the same 192 loci, but call rates were slightly lower with GT-seq (96.4%) relative to Taqman (99.0%). Of the 192 SNPs, 187 were genotyped in ≥90% of the individual samples and only 3 SNPs were genotyped in <70% of samples. This study demonstrates amplicon sequencing with GT-seq greatly reduces the cost of genotyping hundreds of targeted SNPs relative to existing methods by utilizing a simple library preparation method and massive efficiency of scale.

  17. Thimet oligopeptidase: similarity to 'soluble angiotensin II-binding protein' and some corrections to the published amino acid sequence of the rat testis enzyme.

    Science.gov (United States)

    McKie, N; Dando, P M; Rawlings, N D; Barrett, A J

    1993-01-01

    The deduced amino acid sequence of pig liver soluble angiotensin II-binding protein [Sugiura, Hagiwara and Hirose (1992) J. Biol. Chem. 267, 18067-18072] is similar over most of its length to that reported for rat testis thimet oligopeptidase (EC 3.4.24.15) by Pierotti, Dong, Glucksman, Orlowski and Roberts [(1990) (Biochemistry 29, 10323-10329]. We have found that homogeneous rat testis thimet oligopeptidase binds angiotensin II with the same distinctive characteristics as the pig liver protein. Analysis of the nucleotide sequences reported for the two proteins pointed to the likelihood that sequencing errors had caused two segments of the amino acid sequence of the rat protein to be translated out of frame, and re-sequencing of selected parts of the clone (kindly provided by the previous authors) confirmed this. The revised deduced amino acid sequence of rat thimet oligopeptidase contains 687 residues, representing a protein of 78,308 Da, and is more closely related to those of the pig liver protein and other known homologues of thimet oligopeptidase than that described previously. Images Figure 1 PMID:8216239

  18. Mitochondrial DNA sequence-based phylogenetic relationship among flesh flies of the genus Sarcophaga (Sarcophagidae: Diptera)

    Indian Academy of Sciences (India)

    Neelam Bajpai; Raghav Ram Tewari

    2010-04-01

    The phylogenetic relationships among flesh flies of the family Sarcophagidae has been based mainly on the morphology of male genitalia. However, the male genitalic character-based relationships are far from satisfactory. Therefore, in the present study mitochondrial DNA has been used as marker to unravel genetic relatedness and to construct phylogeny among five sympatric species of the genus Sarcophaga. Two mitochondrial genes viz., cytochrome oxidase subunit 1 (COI) and NAD dehydrogenase subunit 5 (ND5) were sequenced and genetic distance values were calculated on the basis of sequence differences in both the mitochondrial genes. The data revealed very few genetic difference among the five species for the COI and ND5 gene sequences.

  19. Host suppression and bioinformatics for sequence-based characterization of unknown pathogens.

    Energy Technology Data Exchange (ETDEWEB)

    Branda, Steven S.; Lane, Todd W.; Misra, Milind; Meagher, Robert J.; Patel, Kamlesh D.; Kaiser, Julia N.

    2009-11-01

    Bioweapons and emerging infectious diseases pose formidable and growing threats to our national security. Rapid advances in biotechnology and the increasing efficiency of global transportation networks virtually guarantee that the United States will face potentially devastating infectious disease outbreaks caused by novel ('unknown') pathogens either intentionally or accidentally introduced into the population. Unfortunately, our nation's biodefense and public health infrastructure is primarily designed to handle previously characterized ('known') pathogens. While modern DNA assays can identify known pathogens quickly, identifying unknown pathogens currently depends upon slow, classical microbiological methods of isolation and culture that can take weeks to produce actionable information. In many scenarios that delay would be costly, in terms of casualties and economic damage; indeed, it can mean the difference between a manageable public health incident and a full-blown epidemic. To close this gap in our nation's biodefense capability, we will develop, validate, and optimize a system to extract nucleic acids from unknown pathogens present in clinical samples drawn from infected patients. This system will extract nucleic acids from a clinical sample, amplify pathogen and specific host response nucleic acid sequences. These sequences will then be suitable for ultra-high-throughput sequencing (UHTS) carried out by a third party. The data generated from UHTS will then be processed through a new data assimilation and Bioinformatic analysis pipeline that will allow us to characterize an unknown pathogen in hours to days instead of weeks to months. Our methods will require no a priori knowledge of the pathogen, and no isolation or culturing; therefore it will circumvent many of the major roadblocks confronting a clinical microbiologist or virologist when presented with an unknown or engineered pathogen.

  20. DNA LOSSLESS DIFFERENTIAL COMPRESSION ALGORITHM BASED ON SIMILARITY OF GENOMIC SEQUENCE DATABASE

    Directory of Open Access Journals (Sweden)

    Heba Afify

    2011-09-01

    Full Text Available Modern biological science produces vast amounts of genomic sequence data. This is fuelling the need forefficient algorithms for sequence compression and analysis. Data compression and the associatedtechniques coming from information theory are often perceived as being of interest for datacommunication and storage. In recent years, a substantial effort has been made for the application oftextual data compression techniques to various computational biology tasks, ranging from storage andindexing of large datasets to comparison of genomic databases. This paper presents a differentialcompression algorithm that is based on production of difference sequences according to op-code table inorder to optimize the compression of homologous sequences in dataset. Therefore, the stored data arecomposed of reference sequence, the set of differences, and differences locations, instead of storing eachsequence individually. This algorithm does not require a priori knowledge about the statistics of thesequence set. The algorithm was applied to three different datasets of genomic sequences, it achieved upto 195-fold compression rate corresponding to 99.4% space saving.

  1. Sequence Comparison Alignment-Free Approach Based on Suffix Tree and L-Words Frequency

    Directory of Open Access Journals (Sweden)

    Inês Soares

    2012-01-01

    Full Text Available The vast majority of methods available for sequence comparison rely on a first sequence alignment step, which requires a number of assumptions on evolutionary history and is sometimes very difficult or impossible to perform due to the abundance of gaps (insertions/deletions. In such cases, an alternative alignment-free method would prove valuable. Our method starts by a computation of a generalized suffix tree of all sequences, which is completed in linear time. Using this tree, the frequency of all possible words with a preset length L—L-words—in each sequence is rapidly calculated. Based on the L-words frequency profile of each sequence, a pairwise standard Euclidean distance is then computed producing a symmetric genetic distance matrix, which can be used to generate a neighbor joining dendrogram or a multidimensional scaling graph. We present an improvement to word counting alignment-free approaches for sequence comparison, by determining a single optimal word length and combining suffix tree structures to the word counting tasks. Our approach is, thus, a fast and simple application that proved to be efficient and powerful when applied to mitochondrial genomes. The algorithm was implemented in Python language and is freely available on the web.

  2. Mapping Protein-DNA Interactions Using ChIP-exo and Illumina-Based Sequencing.

    Science.gov (United States)

    Barfeld, Stefan J; Mills, Ian G

    2016-01-01

    Chromatin immunoprecipitation (ChIP) provides a means of enriching DNA associated with transcription factors, histone modifications, and indeed any other proteins for which suitably characterized antibodies are available. Over the years, sequence detection has progressed from quantitative real-time PCR and Southern blotting to microarrays (ChIP-chip) and now high-throughput sequencing (ChIP-seq). This progression has vastly increased the sequence coverage and data volumes generated. This in turn has enabled informaticians to predict the identity of multi-protein complexes on DNA based on the overrepresentation of sequence motifs in DNA enriched by ChIP with a single antibody against a single protein. In the course of the development of high-throughput sequencing, little has changed in the ChIP methodology until recently. In the last three years, a number of modifications have been made to the ChIP protocol with the goal of enhancing the sensitivity of the method and further reducing the levels of nonspecific background sequences in ChIPped samples. In this chapter, we provide a brief commentary on these methodological changes and describe a detailed ChIP-exo method able to generate narrower peaks and greater peak coverage from ChIPped material.

  3. Enzymes/non-enzymes classification model complexity based on composition, sequence, 3D and topological indices.

    Science.gov (United States)

    Munteanu, Cristian Robert; González-Díaz, Humberto; Magalhães, Alexandre L

    2008-09-21

    The huge amount of new proteins that need a fast enzymatic activity characterization creates demands of protein QSAR theoretical models. The protein parameters that can be used for an enzyme/non-enzyme classification includes the simpler indices such as composition, sequence and connectivity, also called topological indices (TIs) and the computationally expensive 3D descriptors. A comparison of the 3D versus lower dimension indices has not been reported with respect to the power of discrimination of proteins according to enzyme action. A set of 966 proteins (enzymes and non-enzymes) whose structural characteristics are provided by PDB/DSSP files was analyzed with Python/Biopython scripts, STATISTICA and Weka. The list of indices includes, but it is not restricted to pure composition indices (residue fractions), DSSP secondary structure protein composition and 3D indices (surface and access). We also used mixed indices such as composition-sequence indices (Chou's pseudo-amino acid compositions or coupling numbers), 3D-composition (surface fractions) and DSSP secondary structure amino acid composition/propensities (obtained with our Prot-2S Web tool). In addition, we extend and test for the first time several classic TIs for the Randic's protein sequence Star graphs using our Sequence to Star Graph (S2SG) Python application. All the indices were processed with general discriminant analysis models (GDA), neural networks (NN) and machine learning (ML) methods and the results are presented versus complexity, average of Shannon's information entropy (Sh) and data/method type. This study compares for the first time all these classes of indices to assess the ratios between model accuracy and indices/model complexity in enzyme/non-enzyme discrimination. The use of different methods and complexity of data shows that one cannot establish a direct relation between the complexity and the accuracy of the model. PMID:18606172

  4. Nucleic acid-based approaches to STAT inhibition.

    Science.gov (United States)

    Sen, Malabika; Grandis, Jennifer R

    2012-10-01

    Silencing of abnormally activated genes can be accomplished in a highly specific manner using nucleic acid based approaches. The focus of this review includes the different nucleic acid based inhibition strategies such as antisense oligodeoxynucleotides, small interfering RNA (siRNA), dominant-negative constructs, G-quartet oligonucleotides and decoy oligonucleotides, their mechanism of action and the effectiveness of these approaches to targeting the STAT (signal transducer and activator of transcription) proteins in cancer. Among the STAT proteins, especially STAT3, followed by STAT5, are the most frequently activated oncogenic STATs, which have emerged as plausible therapeutic cancer targets. Both STAT3 and STAT5 have been shown to regulate numerous oncogenic signaling pathways including proliferation, survival, angiogenesis and migration/invasion. PMID:24058785

  5. Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome.

    Science.gov (United States)

    Singh, Vinod Kumar; Krishnamachari, Annangarachari

    2016-09-01

    Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme. PMID:27508123

  6. Context based computational analysis and characterization of ARS consensus sequences (ACS of Saccharomyces cerevisiae genome

    Directory of Open Access Journals (Sweden)

    Vinod Kumar Singh

    2016-09-01

    Full Text Available Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS requires an essential consensus sequence (ACS for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC denoted as ORC-ACS and non-replicating ACS sequences (nrACS, that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.

  7. Simultaneous genomic identification and profiling of a single cell using semiconductor-based next generation sequencing

    Directory of Open Access Journals (Sweden)

    Manabu Watanabe

    2014-09-01

    Full Text Available Combining single-cell methods and next-generation sequencing should provide a powerful means to understand single-cell biology and obviate the effects of sample heterogeneity. Here we report a single-cell identification method and seamless cancer gene profiling using semiconductor-based massively parallel sequencing. A549 cells (adenocarcinomic human alveolar basal epithelial cell line were used as a model. Single-cell capture was performed using laser capture microdissection (LCM with an Arcturus® XT system, and a captured single cell and a bulk population of A549 cells (≈106 cells were subjected to whole genome amplification (WGA. For cell identification, a multiplex PCR method (AmpliSeq™ SNP HID panel was used to enrich 136 highly discriminatory SNPs with a genotype concordance probability of 1031–35. For cancer gene profiling, we used mutation profiling that was performed in parallel using a hotspot panel for 50 cancer-related genes. Sequencing was performed using a semiconductor-based bench top sequencer. The distribution of sequence reads for both HID and Cancer panel amplicons was consistent across these samples. For the bulk population of cells, the percentages of sequence covered at coverage of more than 100× were 99.04% for the HID panel and 98.83% for the Cancer panel, while for the single cell percentages of sequence covered at coverage of more than 100× were 55.93% for the HID panel and 65.96% for the Cancer panel. Partial amplification failure or randomly distributed non-amplified regions across samples from single cells during the WGA procedures or random allele drop out probably caused these differences. However, comparative analyses showed that this method successfully discriminated a single A549 cancer cell from a bulk population of A549 cells. Thus, our approach provides a powerful means to overcome tumor sample heterogeneity when searching for somatic mutations.

  8. Complete Genome Sequence of the d-Amino Acid Catabolism Bacterium Phaeobacter sp. Strain JL2886, Isolated from Deep Seawater of the South China Sea.

    Science.gov (United States)

    Fu, Yingnan; Wang, Rui; Zhang, Zilian; Jiao, Nianzhi

    2016-01-01

    Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. PMID:27587825

  9. Convective mixing induced by acid-base reactions.

    OpenAIRE

    Almarcha, Christophe; R'Honi, Yasmina; De Decker, Yannick; Trevelyan, Philip; Eckert, Kerstin; De Wit, Anne

    2011-01-01

    When two miscible solutions, each containing a reactive species, are put in contact in the gravity field, local variations in the density due to the reaction can induce convective motion and mixing. We characterize here both experimentally and theoretically such buoyancy-driven instabilities induced by the neutralization of a strong acid by a strong base in aqueous solutions. The diverse patterns obtained are shown to depend on the type of reactants used and on their relative concentrations. ...

  10. A study of archaeal enzymes involved in polar lipid synthesis linking amino acid sequence information, genomic contexts and lipid composition

    Directory of Open Access Journals (Sweden)

    Hiromi Daiyasu

    2005-01-01

    Full Text Available Cellular membrane lipids, of which phospholipids are the major constituents, form one of the characteristic features that distinguish Archaea from other organisms. In this study, we focused on the steps in archaeal phospholipid synthetic pathways that generate polar lipids such as archaetidylserine, archaetidylglycerol, and archaetidylinositol. Only archaetidylserine synthase (ASS, from Methanothermobacter thermautotrophicus, has been experimentally identified. Other enzymes have not been fully examined. Through database searching, we detected many archaeal hypothetical proteins that show sequence similarity to members of the CDP alcohol phosphatidyltransferase family, such as phosphatidylserine synthase (PSS, phosphatidylglycerol synthase (PGS and phosphatidylinositol synthase (PIS derived from Bacteria and Eukarya. The archaeal hypothetical proteins were classified into two groups, based on the sequence similarity. Members of the first group, including ASS from M. thermautotrophicus, were closely related to PSS. The rough agreement between PSS homologue distribution within Archaea and the experimentally identified distribution of archaetidylserine suggested that the hypothetical proteins are ASSs. We found that an open reading frame (ORF tends to be adjacent to that of ASS in the genome, and that the order of the two ORFs is conserved. The sequence similarity of phosphatidylserine decarboxylase to the product of the ORF next to the ASS gene, together with the genomic context conservation, suggests that the ORF encodes archaetidylserine decarboxylase, which may transform archaetidylserine to archaetidylethanolamine. The second group of archaeal hypothetical proteins was related to PGS and PIS. The members of this group were subjected to molecular phylogenetic analysis, together with PGSs and PISs and it was found that they formed two distinct clusters in the molecular phylogenetic tree. The distribution of members of each cluster within Archaea

  11. Molecular phylogeny of western Atlantic Farfantepenaeus and Litopenaeus shrimp based on mitochondrial 16S partial sequences.

    Science.gov (United States)

    Maggioni, R; Rogers, A D; Maclean, N; D'Incao, F

    2001-01-01

    Partial sequences for the 16S rRNA mitochondrial gene were obtained from 10 penaeid shrimp species: Farfantepenaeus paulensis, F. brasiliensis, F. subtilis, F. duorarum, F. aztecus, Litopenaeus schmitti, L. setiferus, and Xiphopenaeus kroyeri from the western Atlantic and L. vannamei and L. stylirostris from the eastern Pacific. Sequences were also obtained from an undescribed morphotype of pink shrimp (morphotype II) usually identified as F. subtilis. The phylogeny resulting from the 16S partial sequences showed that these species form two well-supported monophyletic clades consistent with the two genera proposed in a recent systematic review of the suborder Dendrobranchiata. This contrasted with conclusions drawn from recent molecular phylogenetic work on penaeid shrimps based on partial sequences of the mitochondrial COI region that failed to support recent revisions of the Dendrobranchiata based on morphological analysis. Consistent differences observed in the sequences for morphotype II, coupled with previous allozyme data, support the conclusion that this is a previously undescribed species of Farfantepenaeus. PMID:11161743

  12. Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach

    Science.gov (United States)

    Hofmann, Hansjörg; Sakti, Sakriani; Hori, Chiori; Kashioka, Hideki; Nakamura, Satoshi; Minker, Wolfgang

    The performance of English automatic speech recognition systems decreases when recognizing spontaneous speech mainly due to multiple pronunciation variants in the utterances. Previous approaches address this problem by modeling the alteration of the pronunciation on a phoneme to phoneme level. However, the phonetic transformation effects induced by the pronunciation of the whole sentence have not yet been considered. In this article, the sequence-based pronunciation variation is modeled using a noisy channel approach where the spontaneous phoneme sequence is considered as a “noisy” string and the goal is to recover the “clean” string of the word sequence. Hereby, the whole word sequence and its effect on the alternation of the phonemes will be taken into consideration. Moreover, the system not only learns the phoneme transformation but also the mapping from the phoneme to the word directly. In this study, first the phonemes will be recognized with the present recognition system and afterwards the pronunciation variation model based on the noisy channel approach will map from the phoneme to the word level. Two well-known natural language processing approaches are adopted and derived from the noisy channel model theory: Joint-sequence models and statistical machine translation. Both of them are applied and various experiments are conducted using microphone and telephone of spontaneous speech.

  13. An Alternative to Synthetic Acid Base Indicator-Tagetes Erecta Linn

    Directory of Open Access Journals (Sweden)

    *A. Elumalai

    2012-06-01

    Full Text Available The present work highlights the use of the methanolic extract of the flowers of Tagetes erecta as an acid-base indicator in acid-base titrations. This natural indicator is easy to extract as well as easily available. Indicators used in titration show well marked changes of colour in certain intervals of pH. Most of these indicators are organic dyes and are of synthetic origin. Today synthetic indicators are the choice of acid-base titrations. But due to environmental pollution, availability and cost, the search for natural compounds as an acid-base indicator was started. Herbal indicators are evaluated by using strong acid-strong base, strong acid-weak base, weak acid-strong base and weak acid weak base. In all these titrations the methanolic extract of the flowers of Tagetes erecta was found to be very useful, economical, simple and accurate for acid base titration.

  14. Full Genome Sequence-Based Comparative Study of Wild-Type and Vaccine Strains of Infectious Laryngotracheitis Virus from Italy.

    Directory of Open Access Journals (Sweden)

    Alessandra Piccirillo

    Full Text Available Infectious laryngotracheitis (ILT is an acute and highly contagious respiratory disease of chickens caused by an alphaherpesvirus, infectious laryngotracheitis virus (ILTV. Recently, full genome sequences of wild-type and vaccine strains have been determined worldwide, but none was from Europe. The aim of this study was to determine and analyse the complete genome sequences of five ILTV strains. Sequences were also compared to reveal the similarity of strains across time and to discriminate between wild-type and vaccine strains. Genomes of three ILTV field isolates from outbreaks occurred in Italy in 1980, 2007 and 2011, and two commercial chicken embryo origin (CEO vaccines were sequenced using the 454 Life Sciences technology. The comparison with the Serva genome showed that 35 open reading frames (ORFs differed across the five genomes. Overall, 54 single nucleotide polymorphisms (SNPs and 27 amino acid differences in 19 ORFs and two insertions in the UL52 and ORFC genes were identified. Similarity among the field strains and between the field and the vaccine strains ranged from 99.96% to 99.99%. Phylogenetic analysis revealed a close relationship among them, as well. This study generated data on genomic variation among Italian ILTV strains revealing that, even though the genetic variability of the genome is well conserved across time and between wild-type and vaccine strains, some mutations may help in differentiating among them and may be involved in ILTV virulence/attenuation. The results of this study can contribute to the understanding of the molecular bases of ILTV pathogenicity and provide genetic markers to differentiate between wild-type and vaccine strains.

  15. Teaching Research Methodology Using a Project-Based Three Course Sequence Critical Reflections on Practice

    Science.gov (United States)

    Braguglia, Kay H.; Jackson, Kanata A.

    2012-01-01

    This article presents a reflective analysis of teaching research methodology through a three course sequence using a project-based approach. The authors reflect critically on their experiences in teaching research methods courses in an undergraduate business management program. The introduction of a range of specific techniques including student…

  16. Model-Based Requirements Analysis for Reactive Systems with UML Sequence Diagrams and Coloured Petri Nets

    DEFF Research Database (Denmark)

    Tjell, Simon; Lassen, Kristian Bisgaard

    2008-01-01

    of a derivative of UML 2.0 high-level Sequence Diagrams. The automated requirement checking is part of a bigger tool framework in which VDM++ is applied to automatically generate initial CPN models based on Problem Diagrams. These models are manually enhanced to provide behavioral descriptions of the environment...

  17. Predicting and understanding transcription factor interactions based on sequence level determinants of combinatorial control

    NARCIS (Netherlands)

    Dijk, van A.D.J.; Braak, ter C.J.F.; Immink, G.H.; Angenent, G.C.; Ham, van R.C.H.J.

    2008-01-01

    Motivation: Transcription factor interactions are the cornerstone of combinatorial control, which is a crucial aspect of the gene regulatory system. Understanding and predicting transcription factor interactions based on their sequence alone is difficult since they are often part of families of fact

  18. Magnetism Teaching Sequences Based on an Inductive Approach for First-Year Thai University Science Students

    Science.gov (United States)

    Narjaikaew, Pattawan; Emarat, Narumon; Arayathanitkul, Kwan; Cowie, Bronwen

    2010-01-01

    The study investigated the impact on student motivation and understanding of magnetism of teaching sequences based on an inductive approach. The study was conducted in large lecture classes. A pre- and post-Conceptual Survey of Electricity and Magnetism was conducted with just fewer than 700 Thai undergraduate science students, before and after…

  19. Molecular phylogeny of Edraianthus (Grassy Bells; Campanulaceae) based on non-coding plastid DNA sequences

    DEFF Research Database (Denmark)

    Stefanovic, Sasa; Lakusic, Dmitar; Kuzmina, Maria;

    2008-01-01

    divided into three sections: E. sect. Edraianthus, E. sect. Uniflori, and E. sect. Spathulati. We present here the first phylogenetic study of Edraianthus based on multiple plastid DNA sequences (trnL-F region and rbcL-atpB spacer) derived from a wide taxonomic sampling and geographic range. While...

  20. Nuclemeter: A Reaction-Diffusion Based Method for Quantifying Nucleic Acids Undergoing Enzymatic Amplification

    Science.gov (United States)

    Liu, Changchun; Sadik, Mohamed M.; Mauk, Michael G.; Edelstein, Paul H.; Bushman, Frederic D.; Gross, Robert; Bau, Haim H.

    2014-01-01

    Real-time amplification and quantification of specific nucleic acid sequences plays a major role in medical and biotechnological applications. In the case of infectious diseases, such as HIV, quantification of the pathogen-load in patient specimens is critical to assess disease progression and effectiveness of drug therapy. Typically, nucleic acid quantification requires expensive instruments, such as real-time PCR machines, which are not appropriate for on-site use and for low-resource settings. This paper describes a simple, low-cost, reaction-diffusion based method for end-point quantification of target nucleic acids undergoing enzymatic amplification. The number of target molecules is inferred from the position of the reaction-diffusion front, analogous to reading temperature in a mercury thermometer. The method was tested for HIV viral load monitoring and performed on par with conventional benchtop methods. The proposed method is suitable for nucleic acid quantification at point of care, compatible with multiplexing and high-throughput processing, and can function instrument-free. PMID:25477046