WorldWideScience

Sample records for acid sequence based

  1. Chip-based sequencing nucleic acids

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  2. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Xiaoxia Yang

    Full Text Available Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  3. Nucleic acid sequence-based amplification with oligochromatography for detection of Trypanosoma brucei in clinical samples

    C.M. Mugasa; T. Laurent; G.J. Schoone; P.A. Kager; G.W. Lubega; H.D.F.H. Schallig

    2009-01-01

    Molecular tools, such as real-time nucleic acid sequence-based amplification (NASBA) and PCR, have been developed to detect Trypanosoma brucei parasites in blood for the diagnosis of human African trypanosomiasis (HAT). Despite good sensitivity, these techniques are not implemented in HAT control pr

  4. Repetitive sequence based polymerase chain reaction to differentiate close bacteria strains in acidic sites

    XIE Ming; YIN Hua-qun; LIU Yi; LIU Jie; LIU Xue-duan

    2008-01-01

    To study the diversity of bacteria strains newly isolated from several acid mine drainage(AMD) sites in China,repetitive sequence based polymerase chain reaction (rep-PCR),a well established technology for diversity analysis of closely related bacteria strains,was conducted on 30 strains of bacteria Leptospirillum ferriphilium,8 strains of bacteria Acidithiobacillus ferrooxidans,as well as the Acidithiobacillus ferrooxidans type strain ATCC (American Type Culture Collection) 23270.The results showed that,using ERIC and BOX primer sets,rep-PCR produced highly discriminatory banding patterns.Phylogenetic analysis based on ERIC-PCR banding types was made and the results indicated that rep-PCR could be used as a rapid and highly discriminatory screening technique in studying bacterial diversity,especially in differentiating bacteria within one species in AMD.

  5. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  6. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  7. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  8. iTriplet, a rule-based nucleic acid sequence motif finder

    Gunderson Samuel I

    2009-10-01

    Full Text Available Abstract Background With the advent of high throughput sequencing techniques, large amounts of sequencing data are readily available for analysis. Natural biological signals are intrinsically highly variable making their complete identification a computationally challenging problem. Many attempts in using statistical or combinatorial approaches have been made with great success in the past. However, identifying highly degenerate and long (>20 nucleotides motifs still remains an unmet challenge as high degeneracy will diminish statistical significance of biological signals and increasing motif size will cause combinatorial explosion. In this report, we present a novel rule-based method that is focused on finding degenerate and long motifs. Our proposed method, named iTriplet, avoids costly enumeration present in existing combinatorial methods and is amenable to parallel processing. Results We have conducted a comprehensive assessment on the performance and sensitivity-specificity of iTriplet in analyzing artificial and real biological sequences in various genomic regions. The results show that iTriplet is able to solve challenging cases. Furthermore we have confirmed the utility of iTriplet by showing it accurately predicts polyA-site-related motifs using a dual Luciferase reporter assay. Conclusion iTriplet is a novel rule-based combinatorial or enumerative motif finding method that is able to process highly degenerate and long motifs that have resisted analysis by other methods. In addition, iTriplet is distinguished from other methods of the same family by its parallelizability, which allows it to leverage the power of today's readily available high-performance computing systems.

  9. The structural analysis of protein sequences based on the quasi-amino acids code

    Zhu Ping; Tang Xu-Qing; Xu Zhen-Yuan

    2009-01-01

    Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project, it comes the postgenome era when the proteomics technology is emerging. This paper studies protein molecule from the algebraic point of view. The algebraic system (∑, +, *) is introduced, where ∑ is the set of 64 codons. According to the characteristics of (∑,+, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation is revealed about quasi-amino acids. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlation relationships may play an important part in establishing the logic relationship between codons and the quasi-amino acids during the course of life origination. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties about amino acids code are very difficult to observe. The present paper shows that (ZU, +, ×) is a field. Furthermore, the operational results display that the codon tga has different property from other stop codons. In fact, in the mitochondrion from human and ox genomic codon, tga is just tryptophane, is not the stop codon like in other genetic code, it is the case of the Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code, in other words it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3).

  10. First case series of emerging Rickettsial neonatal sepsis identified by polymerase chain reaction-based deoxyribonucleic acid sequencing

    P Aarthi

    2013-01-01

    Full Text Available Purpose: To detect and identify the aetiological agent in the peripheral blood from the cases of neonatal sepsis. Materials and Methods: Four neonates from geographically different regions of South India presented with signs of neonatal sepsis and all the routine clinical and laboratory investigations were performed. Blood culture by Bac T Alert 3D was negative. To establish the aetiology, polymerase chain reaction (PCR for eubacterial genome and subsequent amplification with Gram positive and Gram negative primers were performed followed by deoxyribonucleic acid (DNA sequencing. Results: PCR for the detection of eubacterial genome was positive in all the four neonates and further amplification with designed Gram positive and Gram negative primers revealed the presence of Gram negative bacteria. The amplicons were identified as Orientia tsutsugamushi in three neonates and Coxiella burnetti in the other neonate. Multalin analysis was done to further characterise the strain variation among the three strains. Conclusion: PCR-based DNA sequencing is a rapid and reliable diagnostic tool to identify the aetiological agents of neonatal sepsis. This is the first case series of emerging Rickettsial neonatal sepsis in India .

  11. Comparison of nucleic acid sequence-based amplification and loop-mediated isothermal amplification for diagnosis of human African trypanosomiasis.

    Mugasa, Claire M; Katiti, Diana; Boobo, Alex; Lubega, George W; Schallig, Henk D F H; Matovu, Enock

    2014-02-01

    Diagnosis of human African trypanosomiasis (HAT) using molecular tests should ideally achieve high sensitivity without compromising specificity. This study compared 2 simplified tests, nucleic acid sequence-based amplification (NASBA) combined with oligochromatography (OC) and loop-mediated isothermal amplification (LAMP), executed on 181 blood samples from 65 Trypanosoma brucei gambiense HAT patients, 86 controls, and 30 serological suspects from Uganda. Basing on the composite reference standard, the diagnostic sensitivity and specificity of NASBA were 93.9% (95% confidence interval [CI] = 84.9-98.3%) and 100% (95% CI = 94.9-100%), respectively. The same parameters for LAMP were 76.9% (95% CI = 64.8-86.5%) and 100% (95% CI = 91.6-100%), respectively. The level of agreement between LAMP and microscopy was good with a kappa (κ) value of 79.2% (95% CI = 69.4-88.9%), while that of NASBA-OC/microscopy was very good (κ value 94.6%; 95% CI = 89.3-99.8%). The sensitivity of NASBA-OC was significantly higher than that of LAMP (Z = 2.723; P = 0.007). These tests have potential application to HAT surveillance.

  12. Differentiation of acetic acid bacteria based on sequence analysis of 16S-23S rRNA gene internal transcribed spacer sequences.

    González, Angel; Mas, Albert

    2011-06-30

    The 16S-23S gene internal transcribed spacer sequence of sixty-four strains belonging to different acetic acid bacteria genera were analyzed, and phylogenetic trees were generated for each genera. The topologies of the different trees were in accordance with the 16S rRNA gene trees, although the similarity percentages obtained between the species was shown to be much lower. These values suggest the usefulness of including the 16S-23S gene internal transcribed spacer region as a part of the polyphasic approach required for the further classification of acetic acid bacteria. Furthermore, the region could be a good target for primer and probe design. It has also been validated for use in the identification of unknown samples of this bacterial group from wine vinegar and fruit condiments.

  13. Using scores of amino acid topological descriptors for quantitative sequence-mobility modeling of peptides based on support vector machine

    LIANG Guizhao; YANG Shanbin; ZHOU Yuan; ZHOU Peng; LI Zhiliang

    2006-01-01

    Scores of amino acid topological descriptors (SATD) derived from principle components analysis of a matrix of 1262 structural variables related to 23 amino acids were employed to express the structure of 125 peptides in different length.Quantitative sequence-mobility modelings (QSMMs)were constructed using partial least square (PLS)and support vector machine (SVM), respectively. As new amino acid scales, SATD including plentiful information related to biological activity were easily manipulated. Better results were obtained compared to those obtained with PLS, which indicated that SVM presented robust stability and excellent predictive ability for electrophoretic mobilities. These results show that there is a wide prospect for the applications of SATD and SVM regression in QSMMs.

  14. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    Kager Piet A

    2006-10-01

    Full Text Available Abstract Background Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (Plasmodium antigen detection do often not allow for species discrimination as microscopy does, but also become insensitive at Methods This paper reports the development of a sensitive and specific real-time Quantitative Nucleic Acid Sequence Based Amplification (real-time QT-NASBA assays, based on the small-subunit 18S rRNA gene, to identify the four human Plasmodium species. Results The lower detection limit of the assay is 100 – 1000 molecules in vitro RNA for all species, which corresponds to 0.01 – 0.1 parasite per diagnostic sample (i.e. 50 μl of processed blood. The real-time QT-NASBA was further evaluated using 79 clinical samples from malaria patients: i.e. 11 Plasmodium. falciparum, 37 Plasmodium vivax, seven Plasmodium malariae, four Plasmodium ovale and 20 mixed infections. The initial diagnosis of 69 out of the 79 samples was confirmed with the developed real-time QT-NASBA. Re-analysis of seven available original slides resolved five mismatches. Three of those were initially identified as P. malariae mono-infection, but after re-reading the slides P. falciparum was found, confirming the real-time QT-NASBA result. The other two slides were of poor quality not allowing true species identification. The remaining five discordant results could not be explained by microscopy, but may be due to extreme low numbers of parasites present in the samples. In addition, 12 Plasmodium berghei isolates from mice and 20 blood samples from healthy donors did not show any reaction in the assay. Conclusion Real-time QT-NASBA is a very sensitive and specific technique with a detection limit of 0.1 Plasmodium parasite per diagnostic sample (50 μl of blood and can be used for the detection, identification and quantitative measurement of low parasitaemia of Plasmodium species, thus

  15. CANADA: designing nucleic acid sequences for nanobiotechnology applications.

    Feldkamp, Udo

    2010-02-01

    The design of nucleic acid sequences for a highly specific and efficient hybridization is a crucial step in DNA computing and DNA-based nanotechnology applications. The CANADA package contains software tools for designing DNA sequences that meet these and other requirements, as well as for analyzing and handling sequences. CANADA is freely available, including a detailed manual and example input files, at http://ls11-www.cs.uni-dortmund.de/molcomp/downloads.

  16. Detection of Campylobacter jejuni and Campylobacter coli in chicken meat samples by real-time nucleic acid sequence-based amplification with molecular beacons.

    Churruca, E; Girbau, C; Martínez, I; Mateo, E; Alonso, R; Fernández-Astorga, A

    2007-06-10

    A nucleic acid sequence-based amplification (NASBA) assay based on molecular beacons was used for real-time detection of Campylobacter jejuni and Campylobacter coli in samples of chicken meat. A set of specific primers and beacon probe were designed to target the 16S rRNA of both species. The real-time NASBA protocol including the RNA isolation was valid for both of the cell suspensions in buffered saline and the artificially contaminated chicken meat samples. The presence of rRNA could be correlated with cellular viability, following inactivation of the bacteria by heating, in inoculated chicken meat samples but not in RNase-free cell suspensions.

  17. The origin of biased sequence depth in sequence-independent nucleic acid amplification and optimization for efficient massive parallel sequencing.

    Toon Rosseel

    Full Text Available Sequence Independent Single Primer Amplification is one of the most widely used random amplification approaches in virology for sequencing template preparation. This technique relies on oligonucleotides consisting of a 3' random part used to prime complementary DNA synthesis and a 5' defined tag sequence for subsequent amplification. Recently, this amplification method was combined with next generation sequencing to obtain viral sequences. However, these studies showed a biased distribution of the resulting sequence reads over the analyzed genomes. The aim of this study was to elucidate the mechanisms that lead to biased sequence depth when using random amplification. Avian paramyxovirus type 8 was used as a model RNA virus to investigate these mechanisms. We showed, based on in silico analysis of the sequence depth in relation to GC-content, predicted RNA secondary structure and sequence complementarity to the 3' part of the tag sequence, that the tag sequence has the main contribution to the observed bias in sequence depth. We confirmed this finding experimentally using both fragmented and non-fragmented viral RNAs as well as primers differing in random oligomer length (6 or 12 nucleotides and in the sequence of the amplification tag. The observed oligonucleotide annealing bias can be reduced by extending the random oligomer sequence and by in silico combining sequence data from SISPA experiments using different 5' defined tag sequences. These findings contribute to the optimization of random nucleic acid amplification protocols that are currently required for downstream applications such as viral metagenomics and microarray analysis.

  18. Methods for analyzing nucleic acid sequences

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  19. Genotyping-by-Sequencing-Based Investigation of the Genetic Architecture Responsible for a ∼Sevenfold Increase in Soybean Seed Stearic Acid.

    Heim, Crystal B; Gillman, Jason D

    2017-01-05

    Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats are conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1) to reduce or eliminate the need for chemical hydrogenation, and (2) to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines, one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene resulting in ∼11% seed stearic acid crossed to another mutant, A6, which has 24-28% seed stearic acid. Genotyping-by-sequencing (GBS)-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for largest effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s); loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability.

  20. Genotyping-by-Sequencing-Based Investigation of the Genetic Architecture Responsible for a ∼Sevenfold Increase in Soybean Seed Stearic Acid

    Crystal B. Heim

    2017-01-01

    Full Text Available Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats are conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1 to reduce or eliminate the need for chemical hydrogenation, and (2 to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines, one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene resulting in ∼11% seed stearic acid crossed to another mutant, A6, which has 24–28% seed stearic acid. Genotyping-by-sequencing (GBS-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for largest effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s; loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability.

  1. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  2. Classification of Base Sequences (+1,

    Dragomir Ž. Ðoković

    2010-01-01

    Full Text Available Base sequences BS(+1, are quadruples of {±1}-sequences (;;;, with A and B of length +1 and C and D of length n, such that the sum of their nonperiodic autocor-relation functions is a -function. The base sequence conjecture, asserting that BS(+1, exist for all n, is stronger than the famous Hadamard matrix conjecture. We introduce a new definition of equivalence for base sequences BS(+1, and construct a canonical form. By using this canonical form, we have enumerated the equivalence classes of BS(+1, for ≤30. As the number of equivalence classes grows rapidly (but not monotonically with n, the tables in the paper cover only the cases ≤13.

  3. Detection and identification of human Plasmodium species with real-time quantitative nucleic acid sequence-based amplification

    P.F. Mens; G.J. Schoone; P.A. Kager; H.D.F.H. Schallig

    2006-01-01

    Background: Decisions concerning malaria treatment depend on species identification causing disease. Microscopy is most frequently used, but at low parasitaemia (< 20 parasites/mu l) the technique becomes less sensitive and time consuming. Rapid diagnostic tests based on Plasmodium antigen detection

  4. Higher specificity of nucleic acid sequence-based amplification isothermal technology than of real-time PCR for quantification of HIV-1 RNA on dried blood spots.

    Mercier-Delarue, Severine; Vray, Muriel; Plantier, Jean Christophe; Maillard, Theodora; Adjout, Zidan; de Olivera, Fabienne; Schnepf, Nathalie; Maylin, Sarah; Simon, Francois; Delaugerre, Constance

    2014-01-01

    Dried blood spots (DBS) are widely proposed as a plasma surrogate for monitoring antiretroviral treatment efficacy based on the HIV-1 RNA level (viral load [VL]) in resource-limited settings. Interfering coamplification of cell-associated HIV-1 DNA during reverse transcription (RT)-PCR can be avoided by using nucleic acid sequence-based amplification (NASBA) technology, which is based on an RNA template and isothermic conditions. We analyzed VL values obtained with DBS and plasma samples by comparing isothermic NASBA (NucliSENS EasyQ HIV-1 V2.0; bioMérieux) with real-time RT-PCR (Cobas TaqMan HIV-1 V2.0; Roche). Samples from 197 HIV-1-infected patients were tested (non-B subtypes in 51% of the cases). Nucleic acid extractions were performed by use of NucliSENS EasyMAG (bioMérieux) and Cobas AmpliPrep (Roche) before the NASBA and RT-PCR quantifications, respectively. Both quantification assays have lower limits of detection of 20 (1.3) and 800 (2.9) log10 copies/ml (log) in plasma and DBS, respectively. The mean (DBS minus plasma) differences were -0.39 and -0.46 log, respectively, for RT-PCR and NASBA. RT-PCR on DBS identified virological failure in 122 of 126 patients (sensitivity, 97%) and viral suppression in 58 of 70 patients (specificity, 83%), yielding 12 false-positive results (median, 3.2 log). NASBA on DBS identified virological failure in 85 of 96 patients (sensitivity, 89%) and viral suppression in 95 of 97 patients (specificity, 98%) and yielded 2 false-positive results (3.0 log for both). Both technologies detected HIV-1 RNA in DBS at a threshold of 800 copies/ml. This higher specificity of NASBA technology could avoid overestimation of poor compliance or the emergence of resistance when monitoring antiretroviral efficacy with the DBS method.

  5. On the base sequence conjecture

    Djokovic, Dragomir Z

    2010-01-01

    Let BS(m,n) denote the set of base sequences (A;B;C;D), with A and B of length m and C and D of length n. The base sequence conjecture (BSC) asserts that BS(n+1,n) exist (i.e., are non-empty) for all n. This is known to be true for n <= 36 and when n is a Golay number. We show that it is also true for n=37 and n=38. It is worth pointing out that BSC is stronger than the famous Hadamard matrix conjecture. In order to demonstrate the abundance of base sequences, we have previously attached to BS(n+1,n) a graph Gamma_n and computed the Gamma_n for n <= 27. We now extend these computations and determine the Gamma_n for n=28,...,35. We also propose a conjecture describing these graphs in general.

  6. Visual detection and differentiation of Classic Swine Fever Virus strains using nucleic acid sequence-based amplification (NASBA) and G-quadruplex DNAzyme assay

    Lu, Xiaolu; Shi, Xueyao; Wu, Gege; Wu, Tiantian; Qin, Rui; Wang, Yi

    2017-01-01

    The split G-quadruplex DNAzyme has emerged as a valuable tool for visual DNA detection. Here, we successfully integrated colorimetric split G-quadruplex DNAzyme assay with nucleic acid sequence-based amplification to generate a novel detection approach, allowing visual and rapid detection for the RNA of Shimen and HCLV strains of Classic Swine Fever Virus (CSFV). CSFV is a RNA virus that causes a highly contagious disease in domestic pigs and wild boar. With this method, we were able to detect as little as 10 copies/ml of CSF viral RNA within 3 h in serum samples taken from the field. No interference was encountered in the amplification and detection of Classic Swine Fever Virus in the presence of non-target RNA or DNA. Moreover, Shimen and HCLV strains of Classic Swine Fever Virus could be easily differentiated using the NASBA-DNAzyme system. These findings indicate the NASBA-DNAzyme system is a rapid and practical technique for detecting and discriminating CSFV strains and may be applied to the detection of other RNA viruses. PMID:28287135

  7. The amino acid sequence of Escherichia coli cyanase.

    Chin, C C; Anderson, P M; Wold, F

    1983-01-10

    The amino acid sequence of the enzyme cyanase (cyanate hydrolase) from Escherichia coli has been determined by automatic Edman degradation of the intact protein and of its component peptides. The primary peptides used in the sequencing were produced by cyanogen bromide cleavage at the methionine residues, yielding 4 peptides plus free homoserine from the NH2-terminal methionine, and by trypsin cleavage at the 7 arginine residues after acetylation of the lysines. Secondary peptides required for overlaps and COOH-terminal sequences were produced by chymotrypsin or clostripain cleavage of some of the larger peptides. The complete sequence of the cyanase subunit consists of 156 amino acid residues (Mr 16,350). Based on the observation that the cysteine-containing peptide is obtained as a disulfide-linked dimer, it is proposed that the covalent structure of cyanase is made up of two subunits linked by a disulfide bond between the single cystine residue in each subunit. The native enzyme (Mr 150,000) then appears to be a complex of four or five such subunit dimers.

  8. Acid-Base Homeostasis.

    Hamm, L Lee; Nakhoul, Nazih; Hering-Smith, Kathleen S

    2015-12-07

    Acid-base homeostasis and pH regulation are critical for both normal physiology and cell metabolism and function. The importance of this regulation is evidenced by a variety of physiologic derangements that occur when plasma pH is either high or low. The kidneys have the predominant role in regulating the systemic bicarbonate concentration and hence, the metabolic component of acid-base balance. This function of the kidneys has two components: reabsorption of virtually all of the filtered HCO3(-) and production of new bicarbonate to replace that consumed by normal or pathologic acids. This production or generation of new HCO3(-) is done by net acid excretion. Under normal conditions, approximately one-third to one-half of net acid excretion by the kidneys is in the form of titratable acid. The other one-half to two-thirds is the excretion of ammonium. The capacity to excrete ammonium under conditions of acid loads is quantitatively much greater than the capacity to increase titratable acid. Multiple, often redundant pathways and processes exist to regulate these renal functions. Derangements in acid-base homeostasis, however, are common in clinical medicine and can often be related to the systems involved in acid-base transport in the kidneys.

  9. Development of an electrochemical DNA biosensor for detection of specific Mycobacterium tuberculosis sequence based on poly(L-glutamic acid) modified electrode

    MERVE YESIL; SONER DONMEZ; FATMA ARSLAN

    2016-11-01

    An electrochemical DNA biosensor was developed by avidin-biotin interaction of a biotinylated probe and avidin-attached, poly(L-glutamic) acid coated pencil graphite electrode (PGA/PGE) for detection of specific Mycobacterium tuberculosis DNA sequence. The discrimination of fully complementary hybridization and mismatch hybridization was carried out by electrochemical reduction current of Meldola’s Blue (MDB) in square wave voltammetry (SWV). The calibration graph of the DNA biosensor was linear between 1.5–12.5 nM and the detection limit was calculated as 1.3 nM. The proposed biosensor successfully discriminated short andlong oligonucleotides related to DNA sequence of Mycobacterium tuberculosis in optimal condition.

  10. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  11. Electrochemical deoxyribonucleic acid biosensor based on electrodeposited graphene and nickel oxide nanoparticle modified electrode for the detection of salmonella enteritidis gene sequence.

    Sun, Wei; Wang, Xiuli; Lu, Yongxi; Gong, Shixing; Qi, Xiaowei; Lei, Bingxin; Sun, Zhenfan; Li, Guangjiu

    2015-04-01

    In this paper a new electrochemical DNA biosensor was prepared by using graphene (GR) and nickel oxide (NiO) nanocomposite modified carbon ionic liquid electrode (CILE) as the substrate electrode. GR and NiO nanoparticles were electrodeposited on the CILE surface step-by-step to get the nanocomposite. Due to the strong affinity of NiO with phosphate groups of ssDNA, oligonucleotide probe with a terminal 5'-phosphate group could be attached on the surface of NiO/GR/CILE, which could further hybridize with the target ssDNA sequence. Methylene blue (MB) was used as the electrochemical indicator for monitoring the hybridization reaction. Under the optimal conditions the reduction peak current of MB was proportional to the concentration of salmonella enteritidis gene sequence in the range from 1.0×10(-13) to 1.0×10(-6)molL(-1) with a detection limit as 3.12×10(-14)molL(-1). This electrochemical DNA sensor exhibited good discrimination ability to one-base and three-base mismatched ssDNA sequences, and the polymerase chain reaction amplification product of salmonella enteritidis gene sequences were further detected with satisfactory results.

  12. Peptide Nucleic Acids Having Enhanced Binding Affinity and Sequence Specificity

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and binding affinity. Methods of increasing binding affinity and sequence specificity of peptide nucleic aci...

  13. Representation of protein-sequence information by amino acid subalphabets

    Andersen, C.A.F.; Brunak, Søren

    2004-01-01

    -sequence information, using machine learning strategies, where the primary goal is the discovery of novel powerful representations for use in AI techniques. In the case of proteins and the 20 different amino acids they typically contain, it is also a secondary goal to discover how the current selection of amino acids...

  14. Comparative genomics beyond sequence-based alignments

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.;

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...

  15. Complete amino acid sequence of the Aspergillus cytotoxin mitogillin

    Fernandez-Luna, J.L.; Lopez-Otin, C.; Soriano, F.; Mendez, E.

    1985-02-12

    The complete amino acid sequence of the cytotoxin mitogillin has been determined by sequencing the intact chain and peptide fragments produced by cleavage at methionyl, arginyl, lysyl, and tryptophanyl residues and at one aspartic acid-proline bond. The protein consists of 149 amino acid residues with alanine at the NH/sub 2/ terminus and histidine at the COOH terminus. The calculated Mr of the native mitogillin was 16,867. The native molecule presents two disulfide bridges, one between cysteine residues at positions 5 and 147 and another one between cysteine residues at positions 75 and 131. The amino acid sequence of mitogillin shows 86% homology with another cytotoxic protein called alpha-sarcin.

  16. Isolation of a human anti-haemophilic factor IX cDNA clone using a unique 52-base synthetic oligonucleotide probe deduced from the amino acid sequence of bovine factor IX.

    Jaye, M; de la Salle, H; Schamber, F; Balland, A; Kohli, V; Findeli, A; Tolstoshev, P; Lecocq, J P

    1983-04-25

    A unique 52mer oligonucleotide deduced from the amino acid sequence of bovine Factor IX was synthesized and used as a probe to screen a human liver cDNA bank. The Factor IX clone isolated shows 5 differences in nucleotide and deduced amino acid sequence as compared to a previously isolated clone. In addition, precisely one codon has been deleted.Images

  17. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  18. Amino acid sequences of proteins from Leptospira serovar pomona

    Alves Selmo F

    2000-01-01

    Full Text Available This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  19. A design for computer nucleic-acid-sequence storage, retrieval, and manipulation

    1982-01-01

    We have designed and built a data-base system for the storage of nucleic-acid sequences. The system consists of a data base (“the library”) and software that manages and provides access to that data base (“the Librarian”).

  20. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences.

    Derr, Julien; Manapat, Michael L; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A; Chen, Irene A

    2012-05-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.

  1. Electrochemical microfluidic biosensor for the detection of nucleic acid sequences.

    Goral, Vasiliy N; Zaytseva, Natalya V; Baeumner, Antje J

    2006-03-01

    A microfluidic biosensor with electrochemical detection for the quantification of nucleic acid sequences was developed. In contrast to most microbiosensors that are based on fluorescence for signal generation, it takes advantage of the simplicity and high sensitivity provided by an amperometric and coulorimetric detection system. An interdigitated ultramicroelectrode array (IDUA) was fabricated in a glass chip and integrated directly with microchannels made of poly(dimethylsiloxane) (PDMS). The assembly was packaged into a Plexiglas housing providing fluid and electrical connections. IDUAs were characterized amperometrically and using cyclic voltammetry with respect to static and dynamic responses for the presence of a reversible redox couple-potassium hexacyanoferrate (ii)/hexacyanoferrate (iii) (ferri/ferrocyanide). A combined concentration of 0.5 microM of ferro/ferricyanide was determined as lower limit of detection with a dynamic range of 5 orders of magnitude. Background signals were negligible and the IDUA responded in a highly reversible manner to the injection of various volumes and various concentrations of the electrochemical marker. For the detection of nucleic acid sequences, liposomes entrapping the electrochemical marker were tagged with a DNA probe, and superparamagnetic beads were coated with a second DNA probe. A single stranded DNA target sequence hybridized with both probes. The sandwich was captured in the microfluidic channel just upstream of the IDUA via a magnet located in the outside housing. Liposomes were lysed using a detergent and the amount of released ferro/ferricyanide was quantified while passing by the IDUA. Optimal location of the magnet with respect to the IDUA was investigated, the effect of dextran sulfate on the hybridization reaction was studied and the amount of magnetic beads used in the assay was optimized. A dose response curve using varying concentrations of target DNA molecules was carried out demonstrating a limit of

  2. The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

    Behrendt, N; Rønne, E; Ploug, M;

    1990-01-01

    -PA. The purified protein shows a single 55-60 kDa band after sodium dodecyl sulfate-polyacrylamide gel electrophoresis and silver staining. It is a heavily glycosylated protein, the deglycosylated polypeptide chain comprising only 35 kDa. The glycosylated protein contains N-acetyl-D-glucosamine and sialic acid......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported....... The protein has a high content of cysteine residues. The NH2-terminal sequence is not closely related to any known sequence. The identification of the purified and sequenced protein with the human u-PA receptor is based on the following findings: 1) the ability of the purified protein to bind u-PA and its...

  3. Acids and bases solvent effects on acid-base strenght

    Cox, Brian G

    2013-01-01

    Acids and bases are ubiquitous in chemistry. Our understanding of them, however, is dominated by their behaviour in water. Transfer to non-aqueous solvents leads to profound changes in acid-base strengths and to the rates and equilibria of many processes: for example, synthetic reactions involving acids, bases and nucleophiles; isolation of pharmaceutical actives through salt formation; formation of zwitter- ions in amino acids; and chromatographic separation of substrates. This book seeks to enhance our understanding of acids and bases by reviewing and analysing their behaviour in non-aqueous solvents. The behaviour is related where possible to that in water, but correlations and contrasts between solvents are also presented.

  4. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active...... sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally...... valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects...

  5. Conceptual Change and Science Achievement Related to a Lesson Sequence on Acids and Bases Among African American Alternative High School Students: A Teacher's Practical Arguments and the Voice of the "Other"

    Wood, Lynda Charese

    The study of teaching and learning during the period of translating ideals of reform into classroom practice enables us to understand student-teacher-researcher symbiotic learning. In line with this assumption, the purpose of this study is threefold:(1) observe effects of the Common Knowledge Construction Model (CKCM), a conceptual change inquiry model of teaching and learning, on African American students' conceptual change and achievement; (2) observe the shift in teacher's practical arguments; and (3) narrate the voice of "the Other" about teacher professional learning. This study uses retrospective data from a mixed-method approach consisting of Phenomenography, practical arguments and story-telling. Data sources include audio-recordings of a chemistry teacher's individual interviews of her students' prior- and post-intervention conceptions of acids and bases; results of Acid-Base Achievement Test (ABA-T); video-recordings of a chemistry teacher's enactment of CKCM acid-base lesson sequence; audio-recordings of teacher-researcher reflective discourse using classroom video-clips; teacher interviews; and teacher and researcher personal reflective journals. Students' conceptual changes reflect change in the number of categories of description; shift in language use from everyday talk to chemical talk; and development of a hierarchy of chemical knowledge. ABA-T results indicated 17 students in the experimental group achieved significantly higher scores than 22 students in the control group taught by traditional teaching methods. The teacher-researcher reflective discourse about enactment of the CKCM acid-base lesson sequence reveals three major shifts in teacher practical arguments: teacher inadequate preparedness to adequate preparedness; lack of confidence to gain in confidence; and surface learning to deep learning. The developing story uncovers several aspects about teaching and learning of African American students: teacher caring for the uncared; cultivating

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    2010-07-01

    ... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences... sequences are specifically excluded from this definition. Sequences with fewer than four specifically... acids are not intended to be embraced by this definition. Any amino acid sequence that contains...

  7. NGS-based deep bisulfite sequencing.

    Lee, Suman; Kim, Joomyeong

    2016-01-01

    We have developed an NGS-based deep bisulfite sequencing protocol for the DNA methylation analysis of genomes. This approach allows the rapid and efficient construction of NGS-ready libraries with a large number of PCR products that have been individually amplified from bisulfite-converted DNA. This approach also employs a bioinformatics strategy to sort the raw sequence reads generated from NGS platforms and subsequently to derive DNA methylation levels for individual loci. The results demonstrated that this NGS-based deep bisulfite sequencing approach provide not only DNA methylation levels but also informative DNA methylation patterns that have not been seen through other existing methods.•This protocol provides an efficient method generating NGS-ready libraries from individually amplified PCR products.•This protocol provides a bioinformatics strategy sorting NGS-derived raw sequence reads.•This protocol provides deep bisulfite sequencing results that can measure DNA methylation levels and patterns of individual loci.

  8. Establishment of the Nucleic Acid Sequence-based Amplification Method for Detecting Vibio Alginolyticus%溶藻弧菌的依赖于核酸序列恒温扩增检测方法的建立

    秦胜利; 王建广

    2012-01-01

    A new method, based on Nucleic Acid Sequence-based Amplification (NAS-BA) to detect Vibio alginolyticus of samples, was established. A highly specific set of primers was synthesized to target the hsp60 gene of Vibio alginolyticus so as to establish Nucleic Acid Sequence-based Amplification method. Specificity and sensitivity were tested. The results showed that the sensitivity of NASBA was 6. 9×102cfu ? mL-1 which was higher than the result of PCR method. Detecting Vibio alginolyticus with NASBA was more specific and sensitive than PCR method and has lower instrumental requirement. So,there is a broad prospect.%采用自行建立和优化的依赖于核酸序列恒温扩增(NASBA)检测体系,对溶藻弧菌进行检测.采用溶藻弧菌的hsp60基因为目的片段设计特异性引物,建成可快速检测溶藻弧菌的NASBA检测法,并进行了特异性和灵敏度试验.结果表明:所建立起的NASBA检测方法,灵敏度为6.9× 102 cfu·mL-1,高于普通PCR方法.溶藻弧菌的依赖于核酸序列恒温扩增检测方法具有较高灵敏度和和良好特异性,并且对仪器要求更低,用普通恒温设备即可进行反应,具有广阔的推广前景.

  9. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  10. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...

  11. Covariance of charged amino acids at positions 322 and 440 of HIV-1 Env contributes to coreceptor specificity of subtype B viruses, and can be used to improve the performance of V3 sequence-based coreceptor usage prediction algorithms.

    Kieran Cashin

    Full Text Available The ability to determine coreceptor usage of patient-derived human immunodeficiency virus type 1 (HIV-1 strains is clinically important, particularly for the administration of the CCR5 antagonist maraviroc. The envelope glycoprotein (Env determinants of coreceptor specificity lie primarily within the gp120 V3 loop region, although other Env determinants have been shown to influence gp120-coreceptor interactions. Here, we determined whether conserved amino acid alterations outside the V3 loop that contribute to coreceptor usage exist, and whether these alterations improve the performance of V3 sequence-based coreceptor usage prediction algorithms. We demonstrate a significant covariant association between charged amino acids at position 322 in V3 and position 440 in the C4 Env region that contributes to the specificity of HIV-1 subtype B strains for CCR5 or CXCR4. Specifically, positively charged Lys/Arg at position 322 and negatively charged Asp/Glu at position 440 occurred more frequently in CXCR4-using viruses, whereas negatively charged Asp/Glu at position 322 and positively charged Arg at position 440 occurred more frequently in R5 strains. In the context of CD4-bound gp120, structural models suggest that covariation of amino acids at Env positions 322 and 440 has the potential to alter electrostatic interactions that are formed between gp120 and charged amino acids in the CCR5 N-terminus. We further demonstrate that inclusion of a "440 rule" can improve the sensitivity of several V3 sequence-based genotypic algorithms for predicting coreceptor usage of subtype B HIV-1 strains, without compromising specificity, and significantly improves the AUROC of the geno2pheno algorithm when set to its recommended false positive rate of 5.75%. Together, our results provide further mechanistic insights into the intra-molecular interactions within Env that contribute to coreceptor specificity of subtype B HIV-1 strains, and demonstrate that incorporation

  12. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  13. Protein location prediction using atomic composition and global features of the amino acid sequence

    Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India); Nair, Achuthsankar S. [Centre for Bioinformatics, University of Kerala, Kariyavattom Campus, Thiruvananthapuram, Kerala (India)

    2010-01-22

    Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectively used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.

  14. Nucleotide and Predicted Amino Acid Sequence-Based Analysis of the Avian Metapneumovirus Type C Cell Attachment Glycoprotein Gene: Phylogenetic Analysis and Molecular Epidemiology of U.S. Pneumoviruses

    Alvarez, Rene; Lwamba, Humphrey M.; Kapczynski, Darrell R.; Njenga, M. Kariuki; Seal, Bruce S.

    2003-01-01

    A serologically distinct avian metapneumovirus (aMPV) was isolated in the United States after an outbreak of turkey rhinotracheitis (TRT) in February 1997. The newly recognized U.S. virus was subsequently demonstrated to be genetically distinct from European subtypes and was designated aMPV serotype C (aMPV/C). We have determined the nucleotide sequence of the gene encoding the cell attachment glycoprotein (G) of aMPV/C (Colorado strain and three Minnesota isolates) and predicted amino acid sequence by sequencing cloned cDNAs synthesized from intracellular RNA of aMPV/C-infected cells. The nucleotide sequence comprised 1,321 nucleotides with only one predicted open reading frame encoding a protein of 435 amino acids, with a predicted Mr of 48,840. The structural characteristics of the predicted G protein of aMPV/C were similar to those of the human respiratory syncytial virus (hRSV) attachment G protein, including two mucin-like regions (heparin-binding domains) flanking both sides of a CX3C chemokine motif present in a conserved hydrophobic pocket. Comparison of the deduced G-protein amino acid sequence of aMPV/C with those of aMPV serotypes A, B, and D, as well as hRSV revealed overall predicted amino acid sequence identities ranging from 4 to 16.5%, suggesting a distant relationship. However, G-protein sequence identities ranged from 72 to 97% when aMPV/C was compared to other members within the aMPV/C subtype or 21% for the recently identified human MPV (hMPV) G protein. Ratios of nonsynonymous to synonymous nucleotide changes were greater than one in the G gene when comparing the more recent Minnesota isolates to the original Colorado isolate. Epidemiologically, this indicates positive selection among U.S. isolates since the first outbreak of TRT in the United States. PMID:12682171

  15. SNAD: sequence name annotation-based designer

    Gorbalenya Alexander E

    2009-08-01

    Full Text Available Abstract Background A growing diversity of biological data is tagged with unique identifiers (UIDs associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Results Here we introduce SNAD (Sequence Name Annotation-based Designer that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. Conclusion A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  16. Detailed protein sequence alignment based on Spectral Similarity Score (SSS

    Thomas Dina

    2005-04-01

    Full Text Available Abstract Background The chemical property and biological function of a protein is a direct consequence of its primary structure. Several algorithms have been developed which determine alignment and similarity of primary protein sequences. However, character based similarity cannot provide insight into the structural aspects of a protein. We present a method based on spectral similarity to compare subsequences of amino acids that behave similarly but are not aligned well by considering amino acids as mere characters. This approach finds a similarity score between sequences based on any given attribute, like hydrophobicity of amino acids, on the basis of spectral information after partial conversion to the frequency domain. Results Distance matrices of various branches of the human kinome, that is the full complement of human kinases, were developed that matched the phylogenetic tree of the human kinome establishing the efficacy of the global alignment of the algorithm. PKCd and PKCe kinases share close biological properties and structural similarities but do not give high scores with character based alignments. Detailed comparison established close similarities between subsequences that do not have any significant character identity. We compared their known 3D structures to establish that the algorithm is able to pick subsequences that are not considered similar by character based matching algorithms but share structural similarities. Similarly many subsequences with low character identity were picked between xyna-theau and xyna-clotm F/10 xylanases. Comparison of 3D structures of the subsequences confirmed the claim of similarity in structure. Conclusion An algorithm is developed which is inspired by successful application of spectral similarity applied to music sequences. The method captures subsequences that do not align by traditional character based alignment tools but give rise to similar secondary and tertiary structures. The Spectral

  17. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  18. Peptide based diagnostics: are random-sequence peptides more useful than tiling proteome sequences?

    Navalkar, Krupa Arun; Johnston, Stephan Albert; Stafford, Phillip

    2015-02-01

    Diagnostics using peptide ligands have been available for decades. However, their adoption in diagnostics has been limited, not because of poor sensitivity but in many cases due to diminished specificity. Numerous reports suggest that protein-based rather than peptide-based disease detection is more specific. We examined two different approaches to peptide-based diagnostics using Coccidioides (aka Valley Fever) as the disease model. Although the pathogen was discovered more than a century ago, a highly sensitive diagnostic remains unavailable. We present a case study where two different approaches to diagnosing Valley Fever were used: first, overlapping Valley Fever epitopes representing immunodominant Coccidioides antigens were tiled using a microarray format of presynthesized peptides. Second, a set of random sequence peptides identified using a 10,000 peptide immunosignaturing microarray was compared for sensitivity and specificity. The scientific hypothesis tested was that actual epitope peptides from Coccidioides would provide sufficient sensitivity and specificity as a diagnostic. Results demonstrated that random sequence peptides exhibited higher accuracy when classifying different stages of Valley Fever infection vs. epitope peptides. The epitope peptide array did provide better performance than the existing immunodiffusion array, but when directly compared to the random sequence peptides, reported lower overall accuracy. This study suggests that there are competing aspects of antibody recognition that involve conservation of pathogen sequence and aspects of mimotope recognition and amino acid substitutions. These factors may prove critical when developing the next generation of high-performance immunodiagnostics.

  19. PHYSIOLOGY OF ACID BASE BALANCE

    Awati

    2014-12-01

    Full Text Available Acid-base, electrolyte, and metabolic disturbances are common in the intensive care unit. Almost all critically ill patients often suffer from compound acid-base and electrolyte disorders. Successful evaluation and management of such patients requires recognition of common patterns (e.g., metabolic acidosis and the ability to dissect one disorder from another. The intensivists needs to identify and correct these condition with the easiest available tools as they are the associated with multiorgan failure. Understanding the elements of normal physiology in these areas is very important so as to diagnose the pathological condition and take adequate measures as early as possible. Arterial blood gas analysis is one such tool for early detection of acid base disorder. Physiology of acid base is complex and here is the attempt to simplify it in our day to day application for the benefit of critically ill patients.

  20. Comparison of levels of human immunodeficiency virus type 1 RNA in plasma as measured by the NucliSens nucleic acid sequence-based amplification and Quantiplex branched-DNA assays.

    Ginocchio, C C; Tetali, S; Washburn, D; Zhang, F; Kaplan, M H

    1999-04-01

    This study compared levels of human immunodeficiency virus type 1 RNA in plasma as measured by the Quantiplex branched-DNA and NucliSens nucleic acid sequence-based amplification assays. RNA was detectable in 118 of 184 samples (64.13%) by the Quantiplex assay and in 171 of 184 samples (92.94%) by the NucliSens assay. Regression analysis indicated that a linear relationship existed between the two sets of values (P < 0.0001), although the Quantiplex and NucliSens values were significantly different (P < 0.001), with the NucliSens values being approximately 0.323 log higher. Spearman correlation analysis indicated that the overall changes in patient viral load patterns were highly correlative between the two assays: r = 0.912, P < 0.0001. The lower limits of sensitivity were determined to be approximately 100 copies/ml and 1,200 to 1,400 copies/ml for the NucliSens and Quantiplex assays, respectively.

  1. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    Kasarda, D.D.; Okita, T W; Bernardin, J. E.; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with a...

  2. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  3. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  4. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  5. Molten fatty acid based microemulsions.

    Noirjean, Cecile; Testard, Fabienne; Dejugnat, Christophe; Jestin, Jacques; Carriere, David

    2016-06-21

    We show that ternary mixtures of water (polar phase), myristic acid (MA, apolar phase) and cetyltrimethylammonium bromide (CTAB, cationic surfactant) studied above the melting point of myristic acid allow the preparation of microemulsions without adding a salt or a co-surfactant. The combination of SANS, SAXS/WAXS, DSC, and phase diagram determination allows a complete characterization of the structures and interactions between components in the molten fatty acid based microemulsions. For the different structures characterized (microemulsion, lamellar or hexagonal phases), a similar thermal behaviour is observed for all ternary MA/CTAB/water monophasic samples and for binary MA/CTAB mixtures without water: crystalline myristic acid melts at 52 °C, and a thermal transition at 70 °C is assigned to the breaking of hydrogen bounds inside the mixed myristic acid/CTAB complex (being the surfactant film in the ternary system). Water determines the film curvature, hence the structures observed at high temperature, but does not influence the thermal behaviour of the ternary system. Myristic acid is partitioned in two "species" that behave independently: pure myristic acid and myristic acid associated with CTAB to form an equimolar complex that plays the role of the surfactant film. We therefore show that myristic acid plays the role of a solvent (oil) and a co-surfactant allowing the fine tuning of the structure of oil and water mixtures. This solvosurfactant behaviour of long chain fatty acid opens the way for new formulations with a complex structure without the addition of any extra compound.

  6. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  7. Repeat-based Sequence Typing of Carnobacterium maltaromaticum.

    Rahman, Abdur; El Kheir, Sara M; Back, Alexandre; Mangavel, Cécile; Revol-Junelles, Anne-Marie; Borges, Frédéric

    2016-06-01

    Carnobacterium maltaromaticum is a Lactic Acid Bacterium (LAB) of technological interest for the food industry, especially the dairy as bioprotection and ripening flora. The industrial use of this LAB requires accurate and resolutive typing tools. A new typing method for C. maltaromaticum inspired from MLVA analysis and called Repeat-based Sequence Typing (RST) is described. Rather than electrophoresis analysis, our RST method is based on sequence analysis of multiple loci containing Variable-Number Tandem-Repeats (VNTRs). The method described here for C. maltaromaticum relies on the analysis of three VNTR loci, and was applied to a collection of 24 strains. For each strain, a PCR product corresponding to the amplification of each VNTR loci was sequenced. Sequence analysis allowed delineating 11, 11, and 12 alleles for loci VNTR-A, VNTR-B, and VNTR-C, respectively. Considering the allele combination exhibited by each strain allowed defining 15 genotypes, ending in a discriminatory index of 0.94. Comparison with MLST revealed that both methods were complementary for strain typing in C. maltaromaticum.

  8. Structure of the fully modified left-handed cyclohexene nucleic acid sequence GTGTACAC.

    Robeyns, Koen; Herdewijn, Piet; Van Meervelt, Luc

    2008-02-13

    CeNA oligonucleotides consist of a phosphorylated backbone where the deoxyribose sugars are replaced by cyclohexene moieties. The X-ray structure determination and analysis of a fully modified octamer sequence GTGTACAC, which is the first crystal structure of a carbocyclic-based nucleic acid, is presented. This particular sequence was built with left-handed building blocks and crystallizes as a left-handed double helix. The helix can be characterized as belonging to the (mirrored) A-type family. Crystallographic data were processed up to 1.53 A, and the octamer sequence crystallizes in the space group R32. The sugar puckering is found to adopt the 3H2 half-chair conformation which mimics the C3'-endo conformation of the ribose sugar. The double helices stack on top of each other to form continuous helices, and static disorder is observed due to this end-to-end stacking.

  9. Cloning and expression of Chromobacterium violaceum phenylalanine hydroxylase in Escherichia coli and comparison of amino acid sequence with mammalian aromatic amino acid hydroxylases.

    Onishi, A; Liotta, L J; Benkovic, S J

    1991-10-05

    The complete amino acid sequence (296 amino acids) of Chromobacterium violaceum phenylalanine hydroxylase (PAH) was determined by nucleotide analysis of a DNA clone isolated using both a synthetic oligonucleotide probe based on the NH2-terminal amino acid sequence and an antibody against this enzyme. The ApaL I fragment (approximately 1.9 kilobase pairs) containing the entire PAH gene was subcloned in pBluescript II and induced by isopropyl-beta-D-thiogalactopyranoside. In order to eliminate fusion proteins the XbaI/ClaI fragment which contained the PAH gene from the Bluescript construct was subcloned into pMAC 5-8 containing the TAC promoter. The recombinant protein reacts with antibody raised to authentic C. violaceum PAH and its NH2-terminal 20-amino acid sequence and COOH-terminal amino acid residue were identical with the wild-type protein. Key physical and chemical characteristics of the recombinant protein, i.e. its copper content and Michaelis-Menten parameters, were the same as wild-type. Comparison of amino acid sequences revealed a highly conserved region between C. violaceum PAH and three different mammalian aromatic amino acid hydroxylases. This conserved area may well be a catalytically important domain of these pterin- and metal-requiring aromatic amino acid hydroxylases. The over-expression of C. violaceum PAH in Escherichia coli will facilitate the analysis of the enzyme mechanism by various spectroscopic methods.

  10. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    Myers, G.; Foley, B.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States). Theoretical Div.; Mellors, J.W. [ed.] [Univ. of Pittsburgh, PA (United States); Jeang, K.T. [ed.] [National Institutes of Health, Bethesda, MD (United States). Molecular Virology Section; Wain-Hobson, S. [Pasteur Inst., Paris (France)] [ed.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  11. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  12. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2016-10-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  13. Entamoeba histolytica: observations on metabolism based on thegenome sequence

    Anderson, Iain J.; Loftus, Brendan J.

    2005-07-01

    The sequencing of the genome of Entamoeba histolytica has allowed a reconstruction of its metabolic pathways, many of which are unusual for a eukaryote. Based on the genome sequence, it appears that amino acids may play a larger role than previously thought in energy metabolism, with roles in both ATP synthesis and NAD regeneration. Arginine decarboxylase may be involved in survival of E. histolytica during its passage through the stomach. The usual pyrimidine synthesis pathway is absent, but a partial pyrimidine degradation pathway could be part of a novel pyrimidine synthesis pathway. Ribonucleotide reductase was not found in the E. histolytica genome, but it was found in the close relatives Entamoeba invadens and Entamoeba moshkovskii, suggesting a recent loss from E. histolytica. The usual eukaryotic glucose transporters are not present, but members of a prokaryotic monosaccharide transporter family are present.

  14. Stream cipher based on GSS sequences

    HU Yupu; XIAO Guozhen

    2004-01-01

    Generalized self-shrinking sequences, simply named the GSS sequences,are novel periodic sequences that have many advantages in cryptography. In this paper,we give several results about GSS sequence's application to cryptography. First, we give a simple method for selecting those GSS sequences whose least periods reach the maximum. Second, we give a method for describing and computing the auto-correlation coefficients of GSS sequences. Finally, we point out that some GSS sequences, when used as stream ciphers, have a security weakness.

  15. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.

  16. Quick identification of acetic acid bacteria based on nucleotide sequences of the 16S-23S rDNA internal transcribed spacer region and of the PQQ-dependent alcohol dehydrogenase gene.

    Trcek, Janja

    2005-10-01

    Acetic acid bacteria (AAB) are well known for oxidizing different ethanol-containing substrates into various types of vinegar. They are also used for production of some biotechnologically important products, such as sorbose and gluconic acids. However, their presence is not always appreciated since certain species also spoil wine, juice, beer and fruits. To be able to follow AAB in all these processes, the species involved must be identified accurately and quickly. Because of inaccuracy and very time-consuming phenotypic analysis of AAB, the application of molecular methods is necessary. Since the pairwise comparison among the 16S rRNA gene sequences of AAB shows very high similarity (up to 99.9%) other DNA-targets should be used. Our previous studies showed that the restriction analysis of 16S-23S rDNA internal transcribed spacer region is a suitable approach for quick affiliation of an acetic acid bacterium to a distinct group of restriction types and also for quick identification of a potentially novel species of acetic acid bacterium (Trcek & Teuber 2002; Trcek 2002). However, with the exception of two conserved genes, encoding tRNAIle and tRNAAla, the sequences of 16S-23S rDNA are highly divergent among AAB species. For this reason we analyzed in this study a gene encoding PQQ-dependent ADH as a possible DNA-target. First we confirmed the expression of subunit I of PQQ-dependent ADH (AdhA) also in Asaia, the only genus of AAB which exhibits little or no ADH-activity. Further we analyzed the partial sequences of adhA among some representative species of the genera Acetobacter, Gluconobacter and Gluconacetobacter. The conserved and variable regions in these sequences made possible the construction of A. acetispecific oligonucleotide the specificity of which was confirmed in PCR-reaction using 45 well-defined strains of AAB as DNA-templates. The primer was also successfully used in direct identification of A. aceti from home made cider vinegar as well as for

  17. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    Ruan Jishou

    2007-04-01

    Full Text Available Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP; the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are

  18. The complete amino acid sequence of the basic nuclear protein of bull spermatozoa

    Coelingh, J.P.; Monfoort, Cornelis H.; Rozijn, Thomas H.; Gevers Leuven, Jan A.; Schiphof, R.; Steyn-Parvé, Elizabeth P.; Braunitzer, Gerhard; Schrank, Barbara; Ruhfus, Annette

    1972-01-01

    The complete amino acid sequence of the basic nuclear protein of bull spermatozoa has been established. The sequence was partially deduced by characterization of peptides isolated from thermolysine and chymotryptic digests of the reduced and S-aminoethylated protein. The complete sequence of the fir

  19. Nucleic acid based logical systems.

    Han, Da; Kang, Huaizhi; Zhang, Tao; Wu, Cuichen; Zhou, Cuisong; You, Mingxu; Chen, Zhuo; Zhang, Xiaobing; Tan, Weihong

    2014-05-12

    Researchers increasingly visualize a significant role for artificial biochemical logical systems in biological engineering, much like digital logic circuits in electrical engineering. Those logical systems could be utilized as a type of servomechanism to control nanodevices in vitro, monitor chemical reactions in situ, or regulate gene expression in vivo. Nucleic acids (NA), as carriers of genetic information with well-regulated and predictable structures, are promising materials for the design and engineering of biochemical circuits. A number of logical devices based on nucleic acids (NA) have been designed to handle various processes for technological or biotechnological purposes. This article focuses on the most recent and important developments in NA-based logical devices and their evolution from in vitro, through cellular, even towards in vivo biological applications.

  20. Quick Trickle Permutation Based on Quick Trickle Characteristic Sequence

    Wang Li-na; Fei Ru-chun; Liu Zhu

    2003-01-01

    The concept of quick trickle characteristic sequence is presented, the properties and count of quick trickle characteristic sequence are researched, the mapping relationship between quick trickle characteristic sequence and quick trickle permutation is discussed. Finally, an efficient construction of quick trickle permutation based on quick trickle characteristic sequence is given, by which quick trickle permutation can be figured out after constructing quick trickle characteristic sequence. Quick trickle permutation has good cryptographic properties.

  1. Period of the d-Sequence Based Random Number Generator

    Thippireddy, Suresh; Chalasani, Sandeep

    2007-01-01

    This paper presents an expression to compute the exact period of a recursive random number generator based on d-sequences. Using the multi-recursive version of this generator we can produce large number of pseudorandom sequences.

  2. Amino acid sequences of neuropeptides in the sinus gland of the land crab Cardisoma carnifex: a novel neuropeptide proteolysis site.

    Newcomb, R W

    1987-08-01

    The sinus gland is a major neurosecretory structure in Crustacea. Five peptides, labeled C, D, E, F, and I, isolated from the sinus gland of the land crab have been hypothesized to arise from the incomplete proteolysis at two internal sites on a single biosynthetic intermediate peptide "H", based on amino acid composition additivities and pulse-chase radiolabeling studies. The presence of only a single major precursor for the sinus gland peptides implies that peptide H may be synthesized on a common precursor with crustacean hyperglycemic hormone forms, "J" and "L," and a peptide, "K," similar to peptides with molt inhibiting activity. Here I report amino acid sequences of these peptides. The amino terminal sequence of the parent peptide, H, (and the homologous fragments) proved refractory to Edman degradation. Data from amino acid analysis and carboxypeptidase digestion of the naturally occurring fragments and of fragments produced by endopeptidase digestion were used together with Edman degradation to obtain the sequences. Amino acid analysis of fragments of the naturally occurring "overlap" peptides (those produced by internal cleavage at one site on H) was used to obtain the sequences across the cleavage sites. The amino acid sequence of the land crab peptide H is Arg-Ser-Ala-Asp-Gly-Phe-Gly-Arg-Met-Glu-Ser-Leu-Leu-Thr-Ser-Leu-Arg-Gly- Ser-Ala-Glu- Ser-Pro-Ala-Ala-Leu-Gly-Glu-Ala-Ser-Ala-Ala-His-Pro-Leu-Glu. In vivo cleavage at one site involves excision of arginine from the sequence Leu-Arg-Gly, whereas cleavage at the other site involves excision of serine from the sequence Glu-Ser-Leu. Proteolysis at the latter sequence has not been previously reported in intact secretory granules. The aspartate at position 4 is possibly covalently modified.

  3. Amino acid sequence of Japanese quail (Coturnix japonica) and northern bobwhite (Colinus virginianus) myoglobin.

    Goodson, John; Beckstead, Robert B; Payne, Jason; Singh, Rakesh K; Mohan, Anand

    2015-08-15

    Myoglobin has an important physiological role in vertebrates, and as the primary sarcoplasmic pigment in meat, influences quality perception and consumer acceptability. In this study, the amino acid sequences of Japanese quail and northern bobwhite myoglobin were deduced by cDNA cloning of the coding sequence from mRNA. Japanese quail myoglobin was isolated from quail cardiac muscles, purified using ammonium sulphate precipitation and gel-filtration, and subjected to multiple enzymatic digestions. Mass spectrometry corroborated the deduced protein amino acid sequence at the protein level. Sequence analysis revealed both species' myoglobin structures consist of 153 amino acids, differing at only three positions. When compared with chicken myoglobin, Japanese quail showed 98% sequence identity, and northern bobwhite 97% sequence identity. The myoglobin in both quail species contained eight histidine residues instead of the nine present in chicken and turkey.

  4. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  5. Spot-Based Generations for Meta-Fibonacci Sequences

    Dalton, Barnaby; Tanny, Stephen

    2011-01-01

    For many meta-Fibonacci sequences it is possible to identify a partition of the sequence into successive intervals (sometimes called blocks) with the property that the sequence behaves "similarly" in each block. This partition provides insights into the sequence properties. To date, for any given sequence, only ad hoc methods have been available to identify this partition. We apply a new concept - the spot-based generation sequence - to derive a general methodology for identifying this partition for a large class of meta-Fibonacci sequences. This class includes the Conolly and Conway sequences and many of their well-behaved variants, and even some highly chaotic sequences, such as Hofstadter's famous Q-sequence.

  6. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS)

  7. Synthesis of circular double-stranded DNA having single-stranded recognition sequence as molecular-physical probe for nucleic acid hybridization detection based on atomic force microscopy imaging.

    Nakano, Koji; Matsunaga, Hideshi; Murata, Masaharu; Soh, Nobuaki; Imato, Toshihiko

    2009-08-01

    A new class of DNA probes having a mechanically detectable tag is reported. The DNA probe, which consists of a single-stranded recognition sequence and a double-stranded circular DNA entity, was prepared by polymerase reaction. M13mp18 single strand and a 32mer oligodeoxynucleotide whose 5'-end is decorated with the recognition sequence were used in combination as template and primer, respectively. We have successfully demonstrated that the DNA probe is useful for bioanalytical purposes: by deliberately attaching target DNA molecules onto Au(111) substrates and by mechanically reading out the tag-entity using a high-resolution microscopy including atomic force microscopy, visualization/detection of the individual target/probe DNA conjugate was possible simply yet straightforwardly. The present DNA probe can be characterized as a 100%-nucleic acid product material. It is simply available by one-pod synthesis. A surface topology parameter, image roughness, has witnessed its importance as a quantitative analysis index with particular usability in the present visualization/detection method.

  8. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    Croteau, Rodney Bruce (Pullman, WA); Wildung, Mark Raymond (Colfax, WA); Lange, Bernd Markus (Pullman, WA); McCaskill, David G. (Pullman, WA)

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  9. The amino acid sequence of elephant (Elephas maximus) myoglobin and the phylogeny of Proboscidea.

    Dene, H; Goodman, M; Romero-Herrera, A E

    1980-02-13

    The complete amino acid sequence of skeletal myoglobin from the Asian elephant (Elephas maximus) is reported. The functional significance of variations seen when this sequence is compared with that of sperm whale myoglobin is explored in the light of the crystallographic model available for the latter molecule. The phylogenetic implications of the elephant myoglobin amino acid sequence are evaluated by using the maximum parsimony technique. A similar analysis is also presented which incorporates all of the proteins sequenced from the elephant. These results are discussed with respect to current views on proboscidean phylogeny.

  10. The Acid-Base Titration of a Very Weak Acid: Boric Acid

    Celeste, M.; Azevedo, C.; Cavaleiro, Ana M. V.

    2012-01-01

    A laboratory experiment based on the titration of boric acid with strong base in the presence of d-mannitol is described. Boric acid is a very weak acid and direct titration with NaOH is not possible. An auxiliary reagent that contributes to the release of protons in a known stoichiometry facilitates the acid-base titration. Students obtain the…

  11. Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix

    Yu, Lulu; Zhang, Yusen; Gutman, Ivan; Shi, Yongtang; Dehmer, Matthias

    2017-01-01

    We develop a novel position-feature-based model for protein sequences by employing physicochemical properties of 20 amino acids and the measure of graph energy. The method puts the emphasis on sequence order information and describes local dynamic distributions of sequences, from which one can get a characteristic B-vector. Afterwards, we apply the relative entropy to the sequences representing B-vectors to measure their similarity/dissimilarity. The numerical results obtained in this study show that the proposed methods leads to meaningful results compared with competitors such as Clustal W. PMID:28393857

  12. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    James B Howard

    Full Text Available Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification

  13. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    Howard, James B; Kechris, Katerina J; Rees, Douglas C; Glazer, Alexander N

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases for

  14. A sequence based synteny map between soybean and Arabidopsis thaliana

    Lightfoot David A

    2007-01-01

    Full Text Available Abstract Background Soybean (Glycine max, L. Merr. is one of the world's most important crops, however, its complete genomic sequence has yet to be determined. Nonetheless, a large body of sequence information exists, particularly in the form of expressed sequence tags (ESTs. Herein, we report the use of the model organism Arabidopsis thaliana (thale cress for which the entire genomic sequence is available as a framework to align thousands of short soybean sequences. Results A series of JAVA-based programs were created that processed and compared 341,619 soybean DNA sequences against A. thaliana chromosomal DNA. A. thaliana DNA was probed for short, exact matches (15 bp to each soybean sequence, and then checked for the number of additional 7 bp matches in the adjacent 400 bp region. The position of these matches was used to order soybean sequences in relation to the A. thaliana genome. Conclusion Reported associations between soybean sequences and A. thaliana were within a 95% confidence interval of e-30 – e-100. In addition, the clustering of soybean expressed sequence tags (ESTs based on A. thaliana sequence was accurate enough to identify potential single nucleotide polymorphisms (SNPs within the soybean sequence clusters. An EST, bacterial artificial chromosome (BAC end sequence and marker amplicon sequence synteny map of soybean and A. thaliana is presented. In addition, all JAVA programs used to create this map are available upon request and on the WEB.

  15. Simulation-Based Evaluation of Learning Sequences for Instructional Technologies

    McEneaney, John E.

    2016-01-01

    Instructional technologies critically depend on systematic design, and learning hierarchies are a commonly advocated tool for designing instructional sequences. But hierarchies routinely allow numerous sequences and choosing an optimal sequence remains an unsolved problem. This study explores a simulation-based approach to modeling learning…

  16. Students' Understanding of Acids/Bases in Organic Chemistry Contexts

    Cartrette, David P.; Mayo, Provi M.

    2011-01-01

    Understanding key foundational principles is vital to learning chemistry across different contexts. One such foundational principle is the acid/base behavior of molecules. In the general chemistry sequence, the Bronsted-Lowry theory is stressed, because it lends itself well to studying equilibrium and kinetics. However, the Lewis theory of…

  17. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697

  18. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Chun-Tien Chang

    2012-01-01

    Full Text Available The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs, insertion-deletions (indels, short tandem repeats (STRs, and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR, which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS; (iii determine human papilloma virus (HPV genotypes by searching current viral databases in cases of double infections; (iv estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4 and its paralog HSPDP3.

  19. Mixed sequence reader: a program for analyzing DNA sequences with heterozygous base calling.

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3.

  20. Efficient Quantum Private Communication Based on Dynamic Control Code Sequence

    Cao, Zheng-Wen; Feng, Xiao-Yi; Peng, Jin-Ye; Zeng, Gui-Hua; Qi, Jin

    2017-04-01

    Based on chaos and quantum properties, we propose a quantum private communication scheme with dynamic control code sequence. The initial sequence is obtained via chaotic systems, and the control code sequence is derived by grouping, XOR and extracting. A shift cycle algorithm is designed to enable the dynamic change of control code sequence. Analysis shows that transmission efficiency could reach 100 % with high dynamics and security.

  1. Efficient Quantum Private Communication Based on Dynamic Control Code Sequence

    Cao, Zheng-Wen; Feng, Xiao-Yi; Peng, Jin-Ye; Zeng, Gui-Hua; Qi, Jin

    2016-12-01

    Based on chaos and quantum properties, we propose a quantum private communication scheme with dynamic control code sequence. The initial sequence is obtained via chaotic systems, and the control code sequence is derived by grouping, XOR and extracting. A shift cycle algorithm is designed to enable the dynamic change of control code sequence. Analysis shows that transmission efficiency could reach 100 % with high dynamics and security.

  2. Recursive organizer (ROR): an analytic framework for sequence-based association analysis.

    Zhao, Lue Ping; Huang, Xin

    2013-07-01

    The advent of next-generation sequencing technologies affords the ability to sequence thousands of subjects cost-effectively, and is revolutionizing the landscape of genetic research. With the evolving genotyping/sequencing technologies, it is not unrealistic to expect that we will soon obtain a pair of diploidic fully phased genome sequences from each subject in the near future. Here, in light of this potential, we propose an analytic framework called, recursive organizer (ROR), which recursively groups sequence variants based upon sequence similarities and their empirical disease associations, into fewer and potentially more interpretable super sequence variants (SSV). As an illustration, we applied ROR to assess an association between HLA-DRB1 and type 1 diabetes (T1D), discovering SSVs of HLA-DRB1 with sequence data from the Wellcome Trust Case Control Consortium. Specifically, ROR reduces 36 observed unique HLA-DRB1 sequences into 8 SSVs that empirically associate with T1D, a fourfold reduction of sequence complexity. Using HLA-DRB1 data from Type 1 Diabetes Genetics Consortium as cases and data from Fred Hutchinson Cancer Research Center as controls, we are able to validate associations of these SSVs with T1D. Further, SSVs consist of nine nucleotides, and each associates with its corresponding amino acids. Detailed examination of these selected amino acids reveals their potential functional roles in protein structures and possible implication to the mechanism of T1D.

  3. Peptide Nucleic Acids Having Enhanced Binding Affinity, Sequence Specificity and Solubility

    1998-01-01

    A novel class of compounds, known as peptide nucleic acids, bind complementary DNA and RNA strands more strongly than a corresponding DNA strand, and exhibit increased sequence specificity and solubility. The peptide nucleic acids comprise ligands selected from a group consisting of naturally......-occurring nucleobases and non-naturally-occurring nucleobases attached to a polyamide backbone, and contain C1-C8 alkylamine side chains. Methods of enhancing the solubility, binding affinity and sequence specificity of PNAs are provided....

  4. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  5. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  6. Genome Sequence of Clostridium tyrobutyricum ATCC 25755, a Butyric Acid-Overproducing Strain.

    Jiang, Ling; Zhu, Liying; Xu, Xian; Li, Yanping; Li, Shuang; Huang, He

    2013-05-30

    Clostridium tyrobutyricum ATCC 25755 is an efficient producer of butyric acid. Here we report a 3.01-Mb assembly of its genome sequence and other useful information, including the coding sequences (CDSs) responsible for an alternative pathway leading to acetate synthesis as well as a series of membrane transport systems.

  7. Genome Sequence of Clostridium tyrobutyricum ATCC 25755, a Butyric Acid-Overproducing Strain

    2013-01-01

    Clostridium tyrobutyricum ATCC 25755 is an efficient producer of butyric acid. Here we report a 3.01-Mb assembly of its genome sequence and other useful information, including the coding sequences (CDSs) responsible for an alternative pathway leading to acetate synthesis as well as a series of membrane transport systems.

  8. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    Geissler, Andreas J.; Behr, Jürgen; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability.

  9. Amino Acid Sequence - KOME | LSDB Archive [Life Science Database Archive metadata

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us ...ll_sequence_amino_db.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST...ta.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kome/LATEST/kome_ine_full_sequence_amino_db.fasta.zip Fi...date History of This Database Site Policy | Contact Us Amino Acid Sequence - KOME | LSDB Archive ...

  10. Fusion protein predicted amino acid sequence of the first US avian pneumovirus isolate and lack of heterogeneity among other US isolates.

    Seal, B S; Sellers, H S; Meinersmann, R J

    2000-02-01

    Avian pneumovirus (APV) was first isolated from turkeys in the west-central US following emergence of turkey rhinotracheitis (TRT) during 1996. Subsequently, several APV isolates were obtained from the north-central US. Matrix (M) and fusion (F) protein genes of these isolates were examined for sequence heterogeneity and compared with European APV subtypes A and B. Among US isolates the M gene shared greater than 98% nucleotide sequence identity with only one nonsynonymous change occurring in a single US isolate. Although the F gene among US APV isolates shared 98% nucleotide sequence identity, nine conserved substitutions were detected in the predicted amino acid sequence. The predicted amino acid sequence of the US APV isolate's F protein had 72% sequence identity to the F protein of APV subtype A and 71% sequence identity to the F protein of APV subtype B. This compares with 83% sequence identity between the APV subtype A and B predicted amino acid sequences of the F protein. The US isolates were phylogenetically distinguishable from their European counterparts based on F gene nucleotide or predicted amino acid sequences. Lack of sequence heterogeneity among US APV subtypes indicates these viruses have maintained a relatively stable population since the first outbreak of TRT. Phylogenetic analysis of the F protein among APV isolates supports classification of US isolates as a new APV subtype C.

  11. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  12. Identification of protein superfamily from structure- based sequence motif

    2002-01-01

    The structure-based sequence motif of the distant proteins in evolution, protein tyrosine phosphatases (PTP) Ⅰ and Ⅱ superfamilies, as an example, has been defined by the structural comparison, structure-based sequence alignment and analyses on substitution patterns of residues in common sequence conserved regions. And the phosphatases Ⅰ and Ⅱ can be correctly identified together by the structure-based PTP sequence motif from SWISS-PROT and TrEBML databases. The results show that the correct rates of identification are over 98%. This is the first time to identify PTP Ⅰ and Ⅱ together by this motif.

  13. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  14. Complete Genome Sequence of Streptomyces clavuligerus F613-1, an Industrial Producer of Clavulanic Acid.

    Cao, Guangxiang; Zhong, Chuanqing; Zong, Gongli; Fu, Jiafang; Liu, Zhong; Zhang, Guimin; Qin, Ronghuo

    2016-01-01

    Streptomyces clavuligerus strain F613-1 is an industrial strain with high-yield clavulanic acid production. In this study, the complete genome sequence of S. clavuligerus strain F613-1 was determined, including one linear chromosome and one linear plasmid, carrying numerous sets of genes involving in the biosynthesis of clavulanic acid.

  15. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    Cowell, G M; Kønigshøfer, E; Danielsen, E M;

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250-555 p...

  16. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  17. Persisting viral sequences shape microbial CRISPR-based immunity.

    Ariel D Weinberger

    Full Text Available Well-studied innate immune systems exist throughout bacteria and archaea, but a more recently discovered genomic locus may offer prokaryotes surprising immunological adaptability. Mediated by a cassette-like genomic locus termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR, the microbial adaptive immune system differs from its eukaryotic immune analogues by incorporating new immunities unidirectionally. CRISPR thus stores genomically recoverable timelines of virus-host coevolution in natural organisms refractory to laboratory cultivation. Here we combined a population genetic mathematical model of CRISPR-virus coevolution with six years of metagenomic sequencing to link the recoverable genomic dynamics of CRISPR loci to the unknown population dynamics of virus and host in natural communities. Metagenomic reconstructions in an acid-mine drainage system document CRISPR loci conserving ancestral immune elements to the base-pair across thousands of microbial generations. This 'trailer-end conservation' occurs despite rapid viral mutation and despite rapid prokaryotic genomic deletion. The trailer-ends of many reconstructed CRISPR loci are also largely identical across a population. 'Trailer-end clonality' occurs despite predictions of host immunological diversity due to negative frequency dependent selection (kill the winner dynamics. Statistical clustering and model simulations explain this lack of diversity by capturing rapid selective sweeps by highly immune CRISPR lineages. Potentially explaining 'trailer-end conservation,' we record the first example of a viral bloom overwhelming a CRISPR system. The polyclonal viruses bloom even though they share sequences previously targeted by host CRISPR loci. Simulations show how increasing random genomic deletions in CRISPR loci purges immunological controls on long-lived viral sequences, allowing polyclonal viruses to bloom and depressing host fitness. Our results thus link

  18. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  19. Image-based temporal alignment of echocardiographic sequences

    Danudibroto, Adriyana; Bersvendsen, Jørn; Mirea, Oana; Gerard, Olivier; D'hooge, Jan; Samset, Eigil

    2016-04-01

    Temporal alignment of echocardiographic sequences enables fair comparisons of multiple cardiac sequences by showing corresponding frames at given time points in the cardiac cycle. It is also essential for spatial registration of echo volumes where several acquisitions are combined for enhancement of image quality or forming larger field of view. In this study, three different image-based temporal alignment methods were investigated. First, a method based on dynamic time warping (DTW). Second, a spline-based method that optimized the similarity between temporal characteristic curves of the cardiac cycle using 1D cubic B-spline interpolation. Third, a method based on the spline-based method with piecewise modification. These methods were tested on in-vivo data sets of 19 echo sequences. For each sequence, the mitral valve opening (MVO) time was manually annotated. The results showed that the average MVO timing error for all methods are well under the time resolution of the sequences.

  20. Comparison of sequence-based and structure-based phylogenetic trees of homologous proteins: Inferences on protein evolution

    S Balaji; N Srinivasan

    2007-01-01

    Several studies based on the known three-dimensional (3-D) structures of proteins show that two homologous proteins with insignificant sequence similarity could adopt a common fold and may perform same or similar biochemical functions. Hence, it is appropriate to use similarities in 3-D structure of proteins rather than the amino acid sequence similarities in modelling evolution of distantly related proteins. Here we present an assessment of using 3-D structures in modelling evolution of homologous proteins. Using a dataset of 108 protein domain families of known structures with at least 10 members per family we present a comparison of extent of structural and sequence dissimilarities among pairs of proteins which are inputs into the construction of phylogenetic trees. We find that correlation between the structure-based dissimilarity measures and the sequence-based dissimilarity measures is usually good if the sequence similarity among the homologues is about 30% or more. For protein families with low sequence similarity among the members, the correlation coefficient between the sequence-based and the structure-based dissimilarities are poor. In these cases the structure-based dendrogram clusters proteins with most similar biochemical functional properties better than the sequence-similarity based dendrogram. In multi-domain protein families and disulphide-rich protein families the correlation coefficient for the match of sequence-based and structure-based dissimilarity (SDM) measures can be poor though the sequence identity could be higher than 30%. Hence it is suggested that protein evolution is best modelled using 3-D structures if the sequence similarities (SSM) of the homologues are very low.

  1. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  2. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  3. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons

    Mehtap Abu-Qarn

    2006-01-01

    Full Text Available Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  4. ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors

    2009-01-01

    This article presents the design of a sequence-based predictor named ProteDNA for identifying the sequence-specific binding residues in a transcription factor (TF). Concerning protein–DNA interactions, there are two types of binding mechanisms involved, namely sequence-specific binding and nonspecific binding. Sequence-specific bindings occur between protein sidechains and nucleotide bases and correspond to sequence-specific recognition of genes. Therefore, sequence-specific bindings are esse...

  5. Genetic characterization of three novel chicken parvovirus strains based on analysis of their coding sequences.

    Koo, Bon-Sang; Lee, Hae-Rim; Jeon, Eun-Ok; Han, Moo-Sung; Min, Kyeong-Cheol; Lee, Seung-Baek; Bae, Yeon-Ji; Cho, Sun-Hyung; Mo, Jong-Suk; Kwon, Hyuk Moo; Sung, Haan Woo; Kim, Jong-Nyeo; Mo, In-Pil

    2015-01-01

    Chicken parvovirus (ChPV) is one of the causative agents of viral enteritis. Recently, the genome of the ABU-P1 strain of ChPV was fully sequenced and determined to have a distinct genomic composition compared with that of vertebrate parvoviruses. However, no comparative sequence analysis of coding regions of ChPVs was possible because of the lack of other sequence information. In this study, we obtained the nucleotide sequences of all genomic coding regions of three ChPVs by polymerase chain reaction using 13 primer sets, and deduced the amino acid sequences from the nucleotide sequences. The non-structural protein 1 (NS1) gene of the three ChPVs showed 95.0 to 95.5% nucleotide sequence identity and 96.5 to 98.1% amino acid sequence identity to those of NS1 from the ABU-P1 strain, respectively, and even higher nucleotide and amino acid similarities to one another. The viral proteins (VP) gene was more divergent between the three ChPV Korean strains and ABU-P1, with 88.1 to 88.3% nucleotide identity and 93.0% amino acid identity. Analysis of the putative tertiary structure of the ChPV VP2 protein showed that variable regions with less than 80% nucleotide similarity between the three Korean strains and ABU-P1 occurred in large loops of the VP2 protein believed to be involved in antigenicity, pathogenicity, and tissue tropism in other parvoviruses. Based on our analysis of full-length coding sequences, we discovered greater variation in ChPV strains than reported previously, especially in partial regions of the VP2 protein.

  6. An Incremental Algorithm of Text Clustering Based on Semantic Sequences

    FENG Zhonghui; SHEN Junyi; BAO Junpeng

    2006-01-01

    This paper proposed an incremental textclustering algorithm based on semantic sequence.Using similarity relation of semantic sequences and calculating the cover of similarity semantic sequences set, the candidate cluster with minimum entropy overlap value was selected as a result cluster every time in this algorithm.The comparison of experimental results shows that the precision of the algorithm is higher than other algorithms under same conditions and this is obvious especially on long documents set.

  7. RNA-RNA interaction prediction based on multiple sequence alignments

    Li, Andrew X; Qin, Jing; Reidys, Christian M

    2010-01-01

    Recently, $O(N^6)$ time and $O(N^4)$ space dynamic programming algorithms have become available that compute the partition function of RNA-RNA interaction complexes for pairs of RNA sequences. These algorithms and the biological requirement of more reliable interactions motivate to utilize the additional information contained in multiple sequence alignments and to generalize the above framework to the partition function and base pairing probabilities for multiple sequence alignments.

  8. Contig sequences and their annotation (amino acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Full Text Available Dicty_cDB Contig sequences and their annotation (amino acid sequence and results of homology search), and expre...s of homology search), and expression profile Description of data contents Contig...TogoDB edition only. Annotation Translated Amino Acid sequence Representative ami....u-tokyo.ac.jp/ ), which is the program to predict the localization of proteins in the cell. Expression prof...o acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive ...

  9. Feature-based Image Sequence Compression Coding

    2001-01-01

    A novel compressing method for video teleconference applications is presented. Semantic-based coding based on human image feature is realized, where human features are adopted as parameters. Model-based coding and the concept of vector coding are combined with the work on image feature extraction to obtain the result.

  10. Rare variant detection using family-based sequencing analysis.

    Peng, Gang; Fan, Yu; Palculict, Timothy B; Shen, Peidong; Ruteshouser, E Cristy; Chi, Aung-Kyaw; Davis, Ronald W; Huff, Vicki; Scharfe, Curt; Wang, Wenyi

    2013-03-05

    Next-generation sequencing is revolutionizing genomic analysis, but this analysis can be compromised by high rates of missing true variants. To develop a robust statistical method capable of identifying variants that would otherwise not be called, we conducted sequence data simulations and both whole-genome and targeted sequencing data analysis of 28 families. Our method (Family-Based Sequencing Program, FamSeq) integrates Mendelian transmission information and raw sequencing reads. Sequence analysis using FamSeq reduced the number of false negative variants by 14-33% as assessed by HapMap sample genotype confirmation. In a large family affected with Wilms tumor, 84% of variants uniquely identified by FamSeq were confirmed by Sanger sequencing. In children with early-onset neurodevelopmental disorders from 26 families, de novo variant calls in disease candidate genes were corrected by FamSeq as mendelian variants, and the number of uniquely identified variants in affected individuals increased proportionally as additional family members were included in the analysis. To gain insight into maximizing variant detection, we studied factors impacting actual improvements of family-based calling, including pedigree structure, allele frequency (common vs. rare variants), prior settings of minor allele frequency, sequence signal-to-noise ratio, and coverage depth (∼20× to >200×). These data will help guide the design, analysis, and interpretation of family-based sequencing studies to improve the ability to identify new disease-associated genes.

  11. Identifying Affinity Classes of Inorganic Materials Binding Sequences via a Graph-Based Model.

    Du, Nan; Knecht, Marc R; Swihart, Mark T; Tang, Zhenghua; Walsh, Tiffany R; Zhang, Aidong

    2015-01-01

    Rapid advances in bionanotechnology have recently generated growing interest in identifying peptides that bind to inorganic materials and classifying them based on their inorganic material affinities. However, there are some distinct characteristics of inorganic materials binding sequence data that limit the performance of many widely-used classification methods when applied to this problem. In this paper, we propose a novel framework to predict the affinity classes of peptide sequences with respect to an associated inorganic material. We first generate a large set of simulated peptide sequences based on an amino acid transition matrix tailored for the specific inorganic material. Then the probability of test sequences belonging to a specific affinity class is calculated by minimizing an objective function. In addition, the objective function is minimized through iterative propagation of probability estimates among sequences and sequence clusters. Results of computational experiments on two real inorganic material binding sequence data sets show that the proposed framework is highly effective for identifying the affinity classes of inorganic material binding sequences. Moreover, the experiments on the structural classification of proteins (SCOP) data set shows that the proposed framework is general and can be applied to traditional protein sequences.

  12. An Ant-Based Model for Multiple Sequence Alignment

    Guinand, Frédéric

    2008-01-01

    Multiple sequence alignment is a key process in today's biology, and finding a relevant alignment of several sequences is much more challenging than just optimizing some improbable evaluation functions. Our approach for addressing multiple sequence alignment focuses on the building of structures in a new graph model: the factor graph model. This model relies on block-based formulation of the original problem, formulation that seems to be one of the most suitable ways for capturing evolutionary aspects of alignment. The structures are implicitly built by a colony of ants laying down pheromones in the factor graphs, according to relations between blocks belonging to the different sequences.

  13. Facility Layout Based on Sequence Analysis: Design of Flowshops

    ZHOU Jin; WU Zhi-ming

    2009-01-01

    A computer-aided method to design a hybrid layout-tree-shape planar flowlines is presented. In new-type flowshop layout, the common machines shared by several flowlines could be located together in functional sections. The approach combines traditional cell formation techniques with sequence alignment algorithms. Firstly, a sequence analysis based cell formation procedure is adopted; then the operation sequences for parts are aligned to maximize machines adjacency in hyperedge representations; finally a tree-shape planar flowline will be obtained for each part family. With the help of a sample of operation sequences obtained from industry, this algorithm is illustrated.

  14. Nucleotide and amino acid sequences of a coat protein of an Ukrainian isolate of Potato virus Y: comparison with homologous sequences of other isolates and phylogenetic analysis

    Budzanivska I. G.

    2014-03-01

    Full Text Available Aim. Identification of the widespread Ukrainian isolate(s of PVY (Potato virus Y in different potato cultivars and subsequent phylogenetic analysis of detected PVY isolates based on NA and AA sequences of coat protein. Methods. ELISA, RT-PCR, DNA sequencing and phylogenetic analysis. Results. PVY has been identified serologically in potato cultivars of Ukrainian selection. In this work we have optimized a method for total RNA extraction from potato samples and offered a sensitive and specific PCR-based test system of own design for diagnostics of the Ukrainian PVY isolates. Part of the CP gene of the Ukrainian PVY isolate has been sequenced and analyzed phylogenetically. It is demonstrated that the Ukrainian isolate of Potato virus Y (CP gene has a higher percentage of homology with the recombinant isolates (strains of this pathogen (approx. 98.8– 99.8 % of homology for both nucleotide and translated amino acid sequences of the CP gene. The Ukrainian isolate of PVY is positioned in the separate cluster together with the isolates found in Syria, Japan and Iran; these isolates possibly have common origin. The Ukrainian PVY isolate is confirmed to be recombinant. Conclusions. This work underlines the need and provides the means for accurate monitoring of Potato virus Y in the agroecosystems of Ukraine. Most importantly, the phylogenetic analysis demonstrated the recombinant nature of this PVY isolate which has been attributed to the strain group O, subclade N:O.

  15. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  16. Protein sequence for clustering DNA based on Artificial Neural Networks

    Gamal. F. Elhadi

    2012-01-01

    Full Text Available DNA is a nucleic acid that contains the genetic instructions used in the development and functioning of all known living organisms and some viruses. Clustering is a process that groups a set of objects into clusters so that the similarity among objects in the same cluster is high, while that among the objects in different clusters is low. In this paper, we proposed an approach for clustering DNA sequences using Self-Organizing Map (SOM algorithm and Protein Sequence. The main objective is to analyze biological data and to bunch DNA to many clusters more easily and efficiently. We use the proposed approach to analyze both large and small amount of input DNA sequences. The results show that the similarity of the sequences does not depend on the amount of input sequences. Our approach depends on evaluating the degree of the DNA sequences similarity using the hierarchal representation Dendrogram. Representing large amount of data using hierarchal tree gives the ability to compare large sequences efficiently

  17. Movement Pattern Analysis Based on Sequence Signatures

    Seyed Hossein Chavoshi

    2015-09-01

    Full Text Available Increased affordability and deployment of advanced tracking technologies have led researchers from various domains to analyze the resulting spatio-temporal movement data sets for the purpose of knowledge discovery. Two different approaches can be considered in the analysis of moving objects: quantitative analysis and qualitative analysis. This research focuses on the latter and uses the qualitative trajectory calculus (QTC, a type of calculus that represents qualitative data on moving point objects (MPOs, and establishes a framework to analyze the relative movement of multiple MPOs. A visualization technique called sequence signature (SESI is used, which enables to map QTC patterns in a 2D indexed rasterized space in order to evaluate the similarity of relative movement patterns of multiple MPOs. The applicability of the proposed methodology is illustrated by means of two practical examples of interacting MPOs: cars on a highway and body parts of a samba dancer. The results show that the proposed method can be effectively used to analyze interactions of multiple MPOs in different domains.

  18. The Kidney and Acid-Base Regulation

    Koeppen, Bruce M.

    2009-01-01

    Since the topic of the role of the kidneys in the regulation of acid base balance was last reviewed from a teaching perspective (Koeppen BM. Renal regulation of acid-base balance. Adv Physiol Educ 20: 132-141, 1998), our understanding of the specific membrane transporters involved in H+, HCO , and NH transport, and especially how these…

  19. The Conjugate Acid-Base Chart.

    Treptow, Richard S.

    1986-01-01

    Discusses the difficulties that beginning chemistry students have in understanding acid-base chemistry. Describes the use of conjugate acid-base charts in helping students visualize the conjugate relationship. Addresses chart construction, metal ions, buffers and pH titrations, and the organic functional groups and nonaqueous solvents. (TW)

  20. Swarm-based Sequencing Recommendations in E-learning

    Van den Berg, Bert; Tattersall, Colin; Janssen, José; Brouns, Francis; Kurvers, Hub; Koper, Rob

    2005-01-01

    Van den Berg, B., Tattersall, C., Janssen, J., Brouns, F., Kurvers, H., & Koper, R. (2006). Swarm-based Sequencing Recommendations in E-learning. International Journal of Computer Science & Applications, III(III), 1-11.

  1. Immune and Genetic Algorithm Based Assembly Sequence Planning

    YANG Jian-guo; LI Bei-zhi; YU Lei; JIN Yu-song

    2004-01-01

    In this paper an assembly sequence planning model inspired by natural immune and genetic algorithm (ASPIG) based on the part degrees of freedom matrix (PDFM) is proposed, and a proto system - DSFAS based on the ASPIG is introduced to solve assembly sequence problem. The concept and generation of PDFM and DSFAS are also discussed. DSFAS can prevent premature convergence, and promote population diversity, and can accelerate the learning and convergence speed in behavior evolution problem.

  2. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Meng-Jun Li; Ai-Qin Li; Han Xia; Chuan-Zhi Zhao; Chang-Sheng Li; Shu-Bo Wan; Yu-Ping Bi; Xing-Jun Wang

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, -ketoacyl-ACP synthase (I, II, III), -ketoacyl-ACP reductase, -hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  3. Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences

    Chen, Peng

    2013-07-23

    Hot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.

  4. Cloning and sequence analysis of putative type II fatty acid synthase genes from Arachis hypogaea L.

    Li, Meng-Jun; Li, Ai-Qin; Xia, Han; Zhao, Chuan-Zhi; Li, Chang-Sheng; Wan, Shu-Bo; Bi, Yu-Ping; Wang, Xing-Jun

    2009-06-01

    The cultivated peanut is a valuable source of dietary oil and ranks fifth among the world oil crops. Plant fatty acid biosynthesis is catalysed by type II fatty acid synthase (FAS) in plastids and mitochondria. By constructing a full-length cDNA library derived from immature peanut seeds and homology-based cloning, candidate genes of acyl carrier protein (ACP), malonyl-CoA:ACP transacylase, beta-ketoacyl-ACP synthase (I, II, III), beta-ketoacyl-ACP reductase, beta-hydroxyacyl-ACP dehydrase and enoyl-ACP reductase were isolated. Sequence alignments revealed that primary structures of type II FAS enzymes were highly conserved in higher plants and the catalytic residues were strictly conserved in Escherichia coli and higher plants. Homologue numbers of each type II FAS gene expressing in developing peanut seeds varied from 1 in KASII, KASIII and HD to 5 in ENR. The number of single-nucleotide polymorphisms (SNPs) was quite different in each gene. Peanut type II FAS genes were predicted to target plastids except ACP2 and ACP3. The results suggested that peanut may contain two type II FAS systems in plastids and mitochondria. The type II FAS enzymes in higher plants may have similar functions as those in E. coli.

  5. Novel Frequency Hopping Sequences Generator Based on AES Algorithm

    李振荣; 庄奕琪; 张博; 张超

    2010-01-01

    A novel frequency hopping(FH) sequences generator based on advanced encryption standard(AES) iterated block cipher is proposed for FH communication systems.The analysis shows that the FH sequences based on AES algorithm have good performance in uniformity, correlation, complexity and security.A high-speed, low-power and low-cost ASIC of FH sequences generator is implemented by optimizing the structure of S-Box and MixColumns of AES algorithm, proposing a hierarchical power management strategy, and applying ...

  6. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66

    Bin Liu

    2016-06-01

    Full Text Available Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA. Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276, with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.

  7. Sequence Pattern Correlation of Amino Acid in Collision-induced Dissociation Electrospray Ionization Mass Spectrometry

    SONG,Hao-Wei(宋浩威); YUE,Gui-Hua(岳贵花); LU,Yu(陆宇); YANG,Peng-Yuan(杨芃原); WANG,Hong-Hai(王洪海)

    2002-01-01

    A novel approach of sequence pattern correlation has been applied to predict an expected amino acid sequence from CID ESI-MS spectra. The proposed approach deduces sequence patterns with no help from known protein database such that it is useful to identify an unknown peptide or new protein. The algorithm applies a cross-correlation to match an experimental CID spectrum with predicted sequence pattern generated from fragmentation information. The fragmentation knowledge of both y-series and other non y-series are utilized to generate the predicted sequence patterns. In contrast to the normal de novo approach, the proposed approach is insensitive to mass tolerance and non-susceptive to spectral integrality with no need for selection of a starting point.

  8. Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Full Text Available Gclust Server Amino acid sequences used for clusterintg (Multi FASTA format) Data detail Data name Amino aci...d sequences used for clusterintg (Multi FASTA format) Description of data contents Amino acid sequences of p...redicted proteins and their annotation for 95 organism species. FASTA format file...5.fa.zip File size: 161MB Simple search URL - Data acquisition method - Data analysis method - Number of data entries - FAST... Site Policy | Contact Us Amino acid sequences used for clusterintg (Multi FASTA format) - Gclust Server | LSDB Archive ...

  9. Molecular Systematics of Polygonum minus Huds. Based on ITS Sequences

    Normah Mohd Noor

    2011-11-01

    Full Text Available Plastid trnL-trnF and nuclear ribosomal ITS sequences were obtained from selected wild-type individuals of Polygonum minus Huds. in Peninsular Malaysia. The 380 bp trnL-trnF sequences of the Polygonum minus accessions were identical. Therefore, the trnL-trnF failed to distinguish between the Polygonum minus accessions. However, the divergence of ITS sequences (650 bp among the Polygonum minus accessions was 1%, indicating that these accessions could be distinguished by the ITS sequences. A phylogenetic relationship based on the ITS sequences was inferred using neighbor-joining, maximum parsimony and Bayesian inference. All of the tree topologies indicated that Polygonum minus from Peninsular Malaysia is unique and different from the synonymous Persicaria minor (Huds. Opiz and Polygonum kawagoeanum Makino.

  10. A comparative analysis of HIV drug resistance interpretation based on short reverse transcriptase sequences versus full sequences

    Stevens Wendy

    2010-10-01

    Full Text Available Abstract Background As second-line antiretroviral treatment (ART becomes more accessible in resource-limited settings (RLS, the need for more affordable monitoring tools such as point-of-care viral load assays and simplified genotypic HIV drug resistance (HIVDR tests increases substantially. The prohibitive expenses of genotypic HIVDR assays could partly be addressed by focusing on a smaller region of the HIV reverse transcriptase gene (RT that encompasses the majority of HIVDR mutations for people on ART in RLS. In this study, an in silico analysis of 125,329 RT sequences was performed to investigate the effect of submitting short RT sequences (codon 41 to 238 to the commonly used virco®TYPE and Stanford genotype interpretation tools. Results Pair-wise comparisons between full-length and short RT sequences were performed. Additionally, a non-inferiority approach with a concordance limit of 95% and two-sided 95% confidence intervals was used to demonstrate concordance between HIVDR calls based on full-length and short RT sequences. The results of this analysis showed that HIVDR interpretations based on full-length versus short RT sequences, using the Stanford algorithms, had concordance significantly above 95%. When using the virco®TYPE algorithm, similar concordance was demonstrated (>95%, but some differences were observed for d4T, AZT and TDF, where predictions were affected in more than 5% of the sequences. Most differences in interpretation, however, were due to shifts from fully susceptible to reduced susceptibility (d4T or from reduced response to minimal response (AZT, TDF or vice versa, as compared to the predicted full RT sequence. The virco®TYPE prediction uses many more mutations outside the RT 41-238 amino acid domain, which significantly contribute to the HIVDR prediction for these 3 antiretroviral agents. Conclusions This study illustrates the acceptability of using a shortened RT sequences (codon 41-238 to obtain reliable

  11. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923).

    Wasels, François; Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-03-03

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain.

  12. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain.

  13. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  14. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  15. An optical CDMA system based on chaotic sequences

    Liu, Xiao-lei; En, De; Wang, Li-guo

    2014-03-01

    In this paper, a coherent asynchronous optical code division multiple access (OCDMA) system is proposed, whose encoder/decoder is an all-optical generator. This all-optical generator can generate analog and bipolar chaotic sequences satisfying the logistic maps. The formula of bit error rate (BER) is derived, and the relationship of BER and the number of simultaneous transmissions is analyzed. Due to the good property of correlation, this coherent OCDMA system based on these bipolar chaotic sequences can support a large number of simultaneous users, which shows that these chaotic sequences are suitable for asynchronous OCDMA system.

  16. New chaos-based encryption scheme for digital sequence

    Zhang Zhengwei; Fan Yangyu; Zeng Li

    2007-01-01

    To enhance the anti-breaking performance of privacy information, this article proposes a new encryption method utilizing the leaping peculiarity of the periodic orbits of chaos systems. This method maps the secret sequence to several chaos periodic orbits, and a short sequence obtained by evolving the system parameters of the periodic orbits in another nonlinear system will be the key to reconstruct these periodic orbits. In the decryption end, the shadowing method of chaos trajectory based on the modified Newton-Raphson algorithm is adopted to restore these system parameters. Through deciding which orbit each pair coordinate falls on, the original digital sequence can be decrypted.

  17. Detection of DBD-carbamoyl amino acids in amino acid sequence and D/L configuration determination of peptides with fluorogenic Edman reagent 7-[(N,N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate.

    Huang, Y; Matsunaga, H; Toriba, A; Santa, T; Fukushima, T; Imai, K

    1999-06-01

    A method for amino acid sequence and D/L configuration identification of peptides by using fluorogenic Edman reagent 7-[(N, N-dimethylamino)sulfonyl]-2,1,3-benzoxadiazol-4-yl isothiocyanate (DBD-NCS) has been developed. This method was based on the Edman degradation principle with some modifications. A peptide or protein was coupled with DBD-NCS under basic conditions and then cyclized/cleaved to produce DBD-thiazolinone (TZ) derivative by BF3, a Lewis acid, which could significantly suppress the amino acid racemization. The liberated DBD-TZ amino acid was hydrolyzed to DBD-thiocarbamoyl (TC) amino acid under a weakly acidic condition and then oxidized by NaNO2/H+ to DBD-carbamoyl (CA) amino acid which was a stable and had a strong fluorescence intensity. The individual DBD-CA amino acids were separated on a reversed-phase high-performance liquid chromatography (RP-HPLC) for amino acid sequencing and their enantiomers were resolved on a chiral stationary-phase HPLC for identifying their D/L configurations. Combination of the two HPLC systems, the amino acid sequence and D/L configuration of peptides could be determined. This method will be useful for searching D-amino-acid-containing peptides in animals.

  18. Markov chaotic sequences for correlation based watermarking schemes

    Tefas, A.; Nikolaidis, A.; Nikolaidis, N.; Solachidis, V.; Tsekeridou, S.; Pitas, I. E-mail: pitas@zeus.csd.auth.gr

    2003-07-01

    In this paper, statistical analysis of watermarking schemes based on correlation detection is presented. Statistical properties of watermark sequences generated by piecewise-linear Markov maps are exploited, resulting in superior watermark detection reliability. Correlation/spectral properties of such sequences are easily controllable, a fact that affects the watermarking system performance. A family of chaotic maps, namely the skew tent map family, is proposed for use in watermarking schemes.

  19. Purification, amino acid sequence, and cDNA cloning of trypsin inhibitors from onion (Allium cepa L.) bulbs.

    Deshimaru, Masanobu; Watanabe, Akira; Suematsu, Keiko; Hatano, Maki; Terada, Shigeyuki

    2003-08-01

    Three protease inhibitors (OTI-1-3) have been purified from onion (Allium cepa L.) bulbs. Molecular masses of these inhibitors were found to be 7,370.2, 7,472.2, and 7,642.6 Da by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), respectively. Based on amino acid composition and N-terminal sequence, OTI-1 and -2 are the N-terminal truncated proteins of OTI-3. All the inhibitors are stable to heat and extreme pH. OTI-3 inhibited trypsin, chymotrypsin, and plasmin with dissociation constants of 1.3 x 10(-9) M, 2.3 x 10(-7) M, and 3.1 x 10(-7) M, respectively. The complete amino acid sequence of OTI-3 showed a significant homology to Bowman-Birk family inhibitors, and the first reactive site (P1) was found to be Arg17 by limited proteolysis by trypsin. The second reactive site (P1) was estimated to be Leu46, that may inhibit chymotrypsin. OTI-3 lacks an S-S bond near the second reactive site, resulting in a low affinity for the enzyme. The sequence of OTI-3 was also ascertained by the nucleotide sequence of a cDNA clone encoding a 101-residue precursor of the onion inhibitor.

  20. Complete amino acid sequence of the human alpha 5 (IV) collagen chain and identification of a single-base mutation in exon 23 converting glycine 521 in the collagenous domain to cysteine in an Alport syndrome patient

    Zhou, J; Hertz, Jens Michael; Leinonen, A;

    1992-01-01

    alleles. The mutation which was located to exon 23 was sequenced from a polymerase chain reaction-amplified product, and shown to be a G----T change in the coding strand. The mutation changed the GGT codon of glycine 521 to cysteine. The same mutation was found in one allele of the female cousin...

  1. Boronic acid-based autoligation of nucleic acids

    Barbeyron, R.; Vasseur, J.-J.; Smietana, M.;

    2013-01-01

    Abstract: The development of synthetic systems displaying dynamic and adaptive characteristics is a formidable challenge with wide applications from biotechnology to therapeutics. Recently, we described a dynamic and programmable nucleic acid-based system relying on the formation of reversible...... boronate internucleosidic linkages. The DNA- or RNA-templated system comprises a 5′-ended boronic acid probe connecting a 3′-ended ribonucleosidic oligonucleotide partner. To explore the dominant factors that control the reversible linkage, we synthesized a series of 3′-end modified ribonucleotidic strands...

  2. Amino acid sequences and structures of chicken and turkey beta 2-microglobulin

    Welinder, K G; Jespersen, H M; Walther-Rasmussen, J;

    1991-01-01

    The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11......,048. The higher apparent Mr obtained for the avian beta 2-microglobulins as compared to human beta 2-microglobulin by SDS-PAGE is not understood. Chicken and turkey beta 2-microglobulin consist of 98 residues and deviate at seven positions: 60, 66, 74-76, 78 and 82. The chicken and turkey sequences are identical...... suggest that the seven chicken to turkey differences are exposed to solvent in the avian MHC class I complex. The key residues of beta 2-microglobulin involved in alpha chain contacts within the MHC class I molecule are highly conserved between chicken and man. This explains that heterologous human beta 2...

  3. Whole body acid-base modeling revisited.

    Ring, Troels; Nielsen, Søren

    2017-04-01

    The textbook account of whole body acid-base balance in terms of endogenous acid production, renal net acid excretion, and gastrointestinal alkali absorption, which is the only comprehensive model around, has never been applied in clinical practice or been formally validated. To improve understanding of acid-base modeling, we managed to write up this conventional model as an expression solely on urine chemistry. Renal net acid excretion and endogenous acid production were already formulated in terms of urine chemistry, and we could from the literature also see gastrointestinal alkali absorption in terms of urine excretions. With a few assumptions it was possible to see that this expression of net acid balance was arithmetically identical to minus urine charge, whereby under the development of acidosis, urine was predicted to acquire a net negative charge. The literature already mentions unexplained negative urine charges so we scrutinized a series of seminal papers and confirmed empirically the theoretical prediction that observed urine charge did acquire negative charge as acidosis developed. Hence, we can conclude that the conventional model is problematic since it predicts what is physiologically impossible. Therefore, we need a new model for whole body acid-base balance, which does not have impossible implications. Furthermore, new experimental studies are needed to account for charge imbalance in urine under development of acidosis.

  4. Probabilistic model based error correction in a set of various mutant sequences analyzed by next-generation sequencing.

    Aita, Takuyo; Ichihashi, Norikazu; Yomo, Tetsuya

    2013-12-01

    To analyze the evolutionary dynamics of a mutant population in an evolutionary experiment, it is necessary to sequence a vast number of mutants by high-throughput (next-generation) sequencing technologies, which enable rapid and parallel analysis of multikilobase sequences. However, the observed sequences include many errors of base call. Therefore, if next-generation sequencing is applied to analysis of a heterogeneous population of various mutant sequences, it is necessary to discriminate between true bases as point mutations and errors of base call in the observed sequences, and to subject the sequences to error-correction processes. To address this issue, we have developed a novel method of error correction based on the Potts model and a maximum a posteriori probability (MAP) estimate of its parameters corresponding to the "true sequences". Our method of error correction utilizes (1) the "quality scores" which are assigned to individual bases in the observed sequences and (2) the neighborhood relationship among the observed sequences mapped in sequence space. The computer experiments of error correction of artificially generated sequences supported the effectiveness of our method, showing that 50-90% of errors were removed. Interestingly, this method is analogous to a probabilistic model based method of image restoration developed in the field of information engineering.

  5. Thermodynamics-based models of transcriptional regulation with gene sequence.

    Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing

    2015-12-01

    Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.

  6. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  7. A Direct, Biomass-Based Synthesis of Benzoic Acid: Formic Acid-Mediated Deoxygenation of the Glucose-Derived Materials Quinic Acid and Shikimic Acid

    Arceo, Elena; Ellman, Jonathan; Bergman, Robert

    2010-05-03

    An alternative biomass-based route to benzoic acid from the renewable starting materials quinic acid and shikimic acid is described. Benzoic acid is obtained selectively using a highly efficient, one-step formic acid-mediated deoxygenation method.

  8. Accuracy of structure-based sequence alignment of automatic methods

    Lee Byungkook

    2007-09-01

    similarity is low, structure-based methods produce better sequence alignments than by using sequence similarities alone. However, current structure-based methods still mis-align 11–19% of the conserved core residues when compared to the human-curated CDD alignments. The alignment quality of each program depends on the protein structural type and similarity, with DaliLite showing the most agreement with CDD on average.

  9. Alignment of the amino terminal amino acid sequence of human cytochrome c oxidase subunits I and II with the sequence of their putative mRNAs.

    CHOMYN, A.; Hunkapiller, M W; Attardi, G

    1981-01-01

    Thirteen of the first fifteen amino acids from the NH2-terminus of the primary sequence of human cytochrome c oxidase subunit I and eleven of the first twelve amino acids of subunit II have been identified by microsequencing procedures. These sequences have been compared with the recently determined 5'-end proximal sequences of the HeLa cell mitochondrial mRNAs and unambiguously aligned with two of them. This alignment has allowed the identification of the putative mRNA for subunit I, and has...

  10. Repeat Sequences and Base Correlations in Human Y Chromosome Palindromes

    Neng-zhi Jin; Zi-xian Liu; Yan-jiao Qi; Wen-yuan Qiu

    2009-01-01

    On the basis of information theory and statistical methods, we use mutual information, n-tuple entropy and conditional entropy, combined with biological characteristics, to analyze the long range correlation and short range correlation in human Y chromosome palindromes. The magnitude distribution of the long range correlation which can be reflected by the mutual information is P5>P5a>P5b (P5a and P5b are the sequences that replace solely Alu repeats and all interspersed repeats with random uncorrelated sequences in human Y chromosome palindrome 5, respectively); and the magnitude distribution of the short range correlation which can be reflected by the n-tuple entropy and the conditional entropy is P5>P5a>P5b>random uncorrelated sequence. In other words, when the Alu repeats and all interspersed repeats replace with random uncorrelated sequence, the long range and short range correlation decrease gradually. However, the random uncorrelated sequence has no correlation. This research indicates that more repeat sequences result in stronger correlation between bases in human Y chromosome. The analyses may be helpful to understand the special structures of human Y chromosome palindromes profoundly.

  11. The complete amino acid sequence of the major component myoglobin of Amazon river dolphin (Inia geoffrensis).

    Dwulet, F E; Bogardt, R A; Jones, B N; Lehman, L D; Gurd, F R

    1975-12-02

    The complete amino acid sequence of the major component myoglobin from Amazon River dolphin, Inia geoffrensis, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Three easily separable peptides were obtained by cleaving the protein with cyanogen bromide at the methionine residues and four peptides were obtained by cleaving the methyl-acetimidated protein with trypsin at the arginine residues. From these peptides over 85% of the sequence was completed. The remainder of the sequence was obtained by fragmentation of the large cyanogen bromide peptide with trypsin. This protein differs from that of the common porpoise, Phocoena phocoena, at seven positions, from that of the common dolphin, Delphinus delphis, at 11 positions, and from that of the sperm whale, Physeter catodon, at 15 positions. By comparison of this sequence with the three-dimensional structure of sperm whale myoglobin it appears that those residues close to the heme group are most conserved followed by those in nonhelical regions and lastly by those in the helical segments. All of the substitutions observed in this sequence fit easily into the three-dimensional structure of the sperm whale myoglobin.

  12. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.

  13. Protein Function Prediction Based on Sequence and Structure Information

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  14. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.

    Brejnev Muhizi Muhire

    Full Text Available The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV. There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT, a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms.

  15. Amino acid sequences mediating vascular cell adhesion molecule 1 binding to integrin alpha 4: homologous DSP sequence found for JC polyoma VP1 coat protein

    Michael Andrew Meyer

    2013-07-01

    Full Text Available The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4 to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3. For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  16. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  17. Which Microbial Communities Are Present? Sequence-Based Metagenomics

    Caffrey, Sean M.

    The use of metagenomic methods that directly sequence environmental samples has revealed the extraordinary microbial diversity missed by traditional culture-based methodologies. Therefore, to develop a complete and representative model of an environment's microbial community and activities, metagenomic analysis is an essential tool.

  18. Identification of Amino Acid Sequences with Good Folding Properties in an Off-Lattice Model

    Irbäck, Anders; Potthast, Frank

    2008-01-01

    Folding properties of a two-dimensional toy protein model containing only two amino-acid types, hydrophobic and hydrophilic, respectively, are analyzed. An efficient Monte Carlo procedure is employed to ensure that the ground states are found. The thermodynamic properties are found to be strongly sequence dependent in contrast to the kinetic ones. Hence, criteria for good folders are defined entirely in terms of thermodynamic fluctuations. With these criteria sequence patterns that fold well are isolated. For 300 chains with 20 randomly chosen binary residues approximately 10% meet these criteria. Also, an analysis is performed by means of statistical and artificial neural network methods from which it is concluded that the folding properties can be predicted to a certain degree given the binary numbers characterizing the sequences.

  19. Sequence-selective targeting of duplex DNA by peptide nucleic acids

    Nielsen, Peter E

    2010-01-01

    Sequence-selective gene targeting constitutes an attractive drug-discovery approach for genetic therapy, with the aim of reducing or enhancing the activity of specific genes at the transcriptional level, or as part of a methodology for targeted gene repair. The pseudopeptide DNA mimic peptide...... nucleic acid (PNA) can recognize duplex DNA with high sequence specificity and affinity in triplex, duplex and double-duplex invasive modes or non-invasive triplex modes. Novel PNA modification has improved the affinity for DNA recognition via duplex invasion, double-duplex invasion and triplex...... recognition considerably. Such modifications have also resulted in new approaches to targeted gene repair and sequence-selective double-strand cleavage of genomic DNA....

  20. Complete Genome Sequence of the Probiotic Lactic Acid Bacterium Lactobacillus Rhamnosus

    Samat Kozhakhmetov

    2014-01-01

    Full Text Available Introduction: Lactobacilli are a bacteria commonly found in the gastrointestinal tract. Some species of this genus have probiotic properties. The most common of these is Lactobacillus rhamnosus, a microoganism, generally regarded as safe (GRAS. It is also a homofermentative L-(+-lactic acid producer. The genus Lactobacillus is characterized by an extraordinary degree of the phenotypic and genotypic diversity. However, the studies of the genus were conducted mostly with the unequally distributed, non-random choice of species for sequencing; thus, there is only one representative genome from the Lactobacillus rhamnosus clade available to date. The aim of this study was to characterize the genome sequencing of selected strains of Lactobacilli. Methods: 109 samples were isolated from national domestic dairy products in the laboratory of Center for life sciences. After screaning isolates for probiotic properties, a highly active Lactobacillus spp strain was chosen. Genomic DNA was extracted according to the manufacturing protocol (Wizard® Genomic DNA Purification Kit. The Lactobacillus rhamnosus strain was identified as the highly active Lactobacillus strain accoridng to its morphological, cultural, physiological, and biochemical properties, and a genotypic analysis. Results: The genome of Lactobacillus rhamnosus was sequenced using the Roche 454 GS FLX (454 GS FLX platforms. The initial draft assembly was prepared from 14 large contigs (20 all contigs by the Newbler gsAssembler 2.3 (454 Life Sciences, Branford, CT. Conclusion: A full genome-sequencing of selected strains of lactic acid bacteria was made during the study.

  1. Metazoan Remaining Genes for Essential Amino Acid Biosynthesis: Sequence Conservation and Evolutionary Analyses

    Igor R. Costa

    2014-12-01

    Full Text Available Essential amino acids (EAA consist of a group of nine amino acids that animals are unable to synthesize via de novo pathways. Recently, it has been found that most metazoans lack the same set of enzymes responsible for the de novo EAA biosynthesis. Here we investigate the sequence conservation and evolution of all the metazoan remaining genes for EAA pathways. Initially, the set of all 49 enzymes responsible for the EAA de novo biosynthesis in yeast was retrieved. These enzymes were used as BLAST queries to search for similar sequences in a database containing 10 complete metazoan genomes. Eight enzymes typically attributed to EAA pathways were found to be ubiquitous in metazoan genomes, suggesting a conserved functional role. In this study, we address the question of how these genes evolved after losing their pathway partners. To do this, we compared metazoan genes with their fungal and plant orthologs. Using phylogenetic analysis with maximum likelihood, we found that acetolactate synthase (ALS and betaine-homocysteine S-methyltransferase (BHMT diverged from the expected Tree of Life (ToL relationships. High sequence conservation in the paraphyletic group Plant-Fungi was identified for these two genes using a newly developed Python algorithm. Selective pressure analysis of ALS and BHMT protein sequences showed higher non-synonymous mutation ratios in comparisons between metazoans/fungi and metazoans/plants, supporting the hypothesis that these two genes have undergone non-ToL evolution in animals.

  2. Constructing Breaker Sequence based System Restoration Strategy with Graph Theory

    Peng, C.; Qin, Z.; Wang, C.; Hou, Y

    2014-01-01

    This paper has proposed a mapping approach to serve as an interface between the branch-bus model and the breaker-based model. In order to find the specific optimal operation for breakers in substations according to the restoration strategies, firstly, the paper has established the breaker-based model for the substation by using graphic theory, and then the optimal operation sequence for breakers has been figured out by adopting Dijkstra algorithm. Finally, a case study for a realistic power s...

  3. A New Images Hiding Scheme Based on Chaotic Sequences

    LIU Nian-sheng; GUO Dong-hui; WU Bo-xi; Parr G

    2005-01-01

    We propose a data hidding technique in a still image. This technique is based on chaotic sequence in the transform domain of covert image. We use different chaotic random sequences multiplied by multiple sensitive images, respectively, to spread the spectrum of sensitive images. Multiple sensitive images are hidden in a covert image as a form of noise. The results of theoretical analysis and computer simulation show the new hiding technique have better properties with high security, imperceptibility and capacity for hidden information in comparison with the conventional scheme such as LSB (Least Significance Bit).

  4. 3D Motion Parameters Determination Based on Binocular Sequence Images

    2006-01-01

    Exactly capturing three dimensional (3D) motion information of an object is an essential and important task in computer vision, and is also one of the most difficult problems. In this paper, a binocular vision system and a method for determining 3D motion parameters of an object from binocular sequence images are introduced. The main steps include camera calibration, the matching of motion and stereo images, 3D feature point correspondences and resolving the motion parameters. Finally, the experimental results of acquiring the motion parameters of the objects with uniform velocity and acceleration in the straight line based on the real binocular sequence images by the mentioned method are presented.

  5. Genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia

    Anastasiia Kovaliova

    2017-03-01

    Full Text Available Here we report the draft genome sequence of the acid-tolerant Desulfovibrio sp. DV isolated from the sediments of a Pb-Zn mine tailings dam in the Chita region, Russia. The draft genome has a size of 4.9 Mb and encodes multiple K+-transporters and proton-consuming decarboxylases. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain DV clusters together with the acid-tolerant Desulfovibrio sp. TomC and Desulfovibrio magneticus. The draft genome sequence and annotation have been deposited at GenBank under the accession number MLBG00000000.

  6. Steganalytic method based on short and repeated sequence distance statistics

    WANG GuoXin; PING XiJian; XU ManKun; ZHANG Tao; BAO XiRui

    2008-01-01

    According to the distribution characteristics of short and repeated sequence (SRS),a steganalytic method based on the correlation of image bit planes is proposed.Firstly,we provide the conception of SRS distance statistics and deduce its statistical distribution.Because the SRS distance statistics can effectively reflect the correlation of the sequence,SRS has statistical features when the image bit plane sequence equals the image width.Using this characteristic,the steganalytic method is fulfilled by the distinct test of Poisson distribution.Experimental results show a good performance for detecting LSB matching steganographic method in still images.By the way,the proposed method is not designed for specific steganographic algorithms and has good generality.

  7. Genetic interaction mapping with microfluidic-based single cell sequencing

    Haliburton, John R.; Shao, Wenjun; Deutschbauer, Adam; Arkin, Adam; Abate, Adam R.

    2017-01-01

    Genetic interaction mapping is useful for understanding the molecular basis of cellular decision making, but elucidating interactions genome-wide is challenging due to the massive number of gene combinations that must be tested. Here, we demonstrate a simple approach to thoroughly map genetic interactions in bacteria using microfluidic-based single cell sequencing. Using single cell PCR in droplets, we link distinct genetic information into single DNA sequences that can be decoded by next generation sequencing. Our approach is scalable and theoretically enables the pooling of entire interaction libraries to interrogate multiple pairwise genetic interactions in a single culture. The speed, ease, and low-cost of our approach makes genetic interaction mapping viable for routine characterization, allowing the interaction network to be used as a universal read out for a variety of biology experiments, and for the elucidation of interaction networks in non-model organisms. PMID:28170417

  8. Spectroscopic investigation on the telomeric DNA base sequence repeat

    2002-01-01

    Telomeres are protein-DNA complexes at the terminals of linear chromosomes, which protect chromosomal integrity and maintain cellular replicative capacity.From single-cell organisms to advanced animals and plants,structures and functions of telomeres are both very conservative. In cells of human and vertebral animals, telomeric DNA base sequences all are (TTAGGG)n. In the present work, we have obtained absorption and fluorescence spectra measured from seven synthesized oligonucleotides to simulate the telomeric DNA system and calculated their relative fluorescence quantum yields on which not only telomeric DNA characteristics are predicted but also possibly the shortened telomeric sequences during cell division are imrelative fluorescence quantum yield and remarkable excitation energy innerconversion, which tallies with the telomeric sequence of (TTAGGG)n. This result shows that telomeric DNA has a strong non-radiative or innerconvertible capability.``

  9. The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

    Mir, Rafia; Jallu, Shais; Singh, T P

    2015-06-01

    The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.

  10. Genome Sequence of Lactobacillus saerimneri 30a (Formerly Lactobacillus sp. Strain 30a), a Reference Lactic Acid Bacterium Strain Producing Biogenic Amines

    Romano, Andrea; Trip, Hein; Campbell-Sills, Hugo; Bouchez, Olivier; Sherman, David; Lolkema, Juke S.; Lucas, Patrick M.

    2013-01-01

    Lactobacillus sp. strain 30a (Lactobacillus saerimneri) produces the biogenic amines histamine, putrescine, and cadaverine by decarboxylating their amino acid precursors. We report its draft genome sequence (1,634,278 bases, 42.6% G+C content) and the principal findings from its annotation, which mi

  11. Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.

    Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier

    2014-01-01

    Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.

  12. DUK - A Fast and Efficient Kmer Based Sequence Matching Tool

    Li, Mingkun; Copeland, Alex; Han, James

    2011-03-21

    A new tool, DUK, is developed to perform matching task. Matching is to find whether a query sequence partially or totally matches given reference sequences or not. Matching is similar to alignment. Indeed many traditional analysis tasks like contaminant removal use alignment tools. But for matching, there is no need to know which bases of a query sequence matches which position of a reference sequence, it only need know whether there exists a match or not. This subtle difference can make matching task much faster than alignment. DUK is accurate, versatile, fast, and has efficient memory usage. It uses Kmer hashing method to index reference sequences and Poisson model to calculate p-value. DUK is carefully implemented in C++ in object oriented design. The resulted classes can also be used to develop other tools quickly. DUK have been widely used in JGI for a wide range of applications such as contaminant removal, organelle genome separation, and assembly refinement. Many real applications and simulated dataset demonstrate its power.

  13. Solid-State Nanopore-Based DNA Sequencing Technology

    Zewen Liu

    2016-01-01

    Full Text Available The solid-state nanopore-based DNA sequencing technology is becoming more and more attractive for its brand new future in gene detection field. The challenges that need to be addressed are diverse: the effective methods to detect base-specific signatures, the control of the nanopore’s size and surface properties, and the modulation of translocation velocity and behavior of the DNA molecules. Among these challenges, the realization of the high-quality nanopores with the help of modern micro/nanofabrication technologies is a crucial one. In this paper, typical technologies applied in the field of solid-state nanopore-based DNA sequencing have been reviewed.

  14. Hypokalemic paralysis and acid-base balance

    Ivo Casagranda

    2006-10-01

    Full Text Available Three cases of hypokalemic paralysis are reported, presenting to the Emergency Department. The first is a patient with a hypokalemic periodic paralysis with a normal acid-base status, the second is a case of hypokalemic flaccid paralysis of all extremities with a normal anion gap metabolic acidosis, the last is a patient with a hypokalemic distal paralysis of right upper arm with metabolic alkalosis. Afterwards some pathophysiologic principles and the clinical aspects of hypokalemia are discussed and an appropriate approach to do in Emergency Department, to identify the hypokalemic paralysis etiologies in the Emergency Department, is presented, beginning from the evaluation of acid-base status.

  15. Evolutionary insights from suffix array-based genome sequence analysis

    Anindya Poddar; Nagasuma Chandra; Madhavi Ganapathiraju; K Sekar; Judith Klein-Seetharaman; Raj Reddy; N Balakrishnan

    2007-08-01

    Gene and protein sequence analyses, central components of studies in modern biology are easily amenable to string matching and pattern recognition algorithms. The growing need of analysing whole genome sequences more efficiently and thoroughly, has led to the emergence of new computational methods. Suffix trees and suffix arrays are data structures, well known in many other areas and are highly suited for sequence analysis too. Here we report an improvement to the design of construction of suffix arrays. Enhancement in versatility and scalability, enabled by this approach, is demonstrated through the use of real-life examples. The scalability of the algorithm to whole genomes renders it suitable to address many biologically interesting problems. One example is the evolutionary insight gained by analysing unigrams, bi-grams and higher n-grams, indicating that the genetic code has a direct influence on the overall composition of the genome. Further, different proteomes have been analysed for the coverage of the possible peptide space, which indicate that as much as a quarter of the total space at the tetra-peptide level is left un-sampled in prokaryotic organisms, although almost all tri-peptides can be seen in one protein or another in a proteome. Besides, distinct patterns begin to emerge for the counts of particular tetra and higher peptides, indicative of a ‘meaning’ for tetra and higher n-grams. The toolkit has also been used to demonstrate the usefulness of identifying repeats in whole proteomes efficiently. As an example, 16 members of one COG, coded by the genome of Mycobacterium tuberculosis H37Rv have been found to contain a repeating sequence of 300 amino acids.

  16. Speeding disease gene discovery by sequence based candidate prioritization

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  17. Revision of Begomovirus taxonomy based on pairwise sequence comparisons

    Brown, Judith K.

    2015-04-18

    Viruses of the genus Begomovirus (family Geminiviridae) are emergent pathogens of crops throughout the tropical and subtropical regions of the world. By virtue of having a small DNA genome that is easily cloned, and due to the recent innovations in cloning and low-cost sequencing, there has been a dramatic increase in the number of available begomovirus genome sequences. Even so, most of the available sequences have been obtained from cultivated plants and are likely a small and phylogenetically unrepresentative sample of begomovirus diversity, a factor constraining taxonomic decisions such as the establishment of operationally useful species demarcation criteria. In addition, problems in assigning new viruses to established species have highlighted shortcomings in the previously recommended mechanism of species demarcation. Based on the analysis of 3,123 full-length begomovirus genome (or DNA-A component) sequences available in public databases as of December 2012, a set of revised guidelines for the classification and nomenclature of begomoviruses are proposed. The guidelines primarily consider a) genus-level biological characteristics and b) results obtained using a standardized classification tool, Sequence Demarcation Tool, which performs pairwise sequence alignments and identity calculations. These guidelines are consistent with the recently published recommendations for the genera Mastrevirus and Curtovirus of the family Geminiviridae. Genome-wide pairwise identities of 91 % and 94 % are proposed as the demarcation threshold for begomoviruses belonging to different species and strains, respectively. Procedures and guidelines are outlined for resolving conflicts that may arise when assigning species and strains to categories wherever the pairwise identity falls on or very near the demarcation threshold value.

  18. Sequencing-based typing reveals new insight in HLA-DPA1 polymorphism.

    Rozemuller, E H; Bouwens, A G; van Oort, E; Versluis, L F; Marsh, S G; Bodmer, J G; Tilanus, M G

    1995-01-01

    An HLA-DPA1 sequencing-based typing (SBT) system has been developed to identify DPA1 alleles. Up to now eight DPA1 alleles have been defined. Six can be discriminated based upon exon 2 polymorphism. The three subtypes of DPA1*01: DPA1*0101, DPA1*0102 and DPA1*0103, have identical exon 2 sequences but show differences in exon 4. Exon 4 sequences were known for only the three DPA1*01 subtypes and for DPA1*0201. We now present additional sequence information for exon 4 and the unknown segments at the 3' end of exon 2. Additionally with the use of this sequencing technique it is also possible to identify previously unidentified polymorphism. We have studied the exon 2 and exon 4 polymorphism of DPA1 in 40 samples which include all known DPA1 alleles. A new allele, DPA1*01 new, was identified which differs by one nucleotide in exon 2 from DPA1*0103, resulting in an aspartic acid at codon 28. The DPA1*01 subtypes DPA1*0101 and DPA1*0102 could not be confirmed in samples which previously were used to define these subtypes, and consequently they do not exist. The exon 4 sequence of DPA1*0201 is corrected based on sequence data of DAUDI, the cell line in which DPA1*0202 was originally defined. The exon 4 regions of the remaining four alleles were resolved: the exon 4 regions of the alleles DPA1*02021 and DPA1*02022 were found to be identical to the--corrected--DPA1*0201 whereas the exon 4 region of DPA1*0301 differs by one nucleotide compared to DPA1*0103. The DPA1*0401 exon 4 region differs by one nucleotide compared to the corrected DPA1*0201.(ABSTRACT TRUNCATED AT 250 WORDS)

  19. Fermented probiotic beverages based on acid whey

    Katarzyna Skryplonek

    2015-12-01

    Full Text Available Background. Production of fermented probiotic beverages can be a good method for acid whey usage. The obtained products combine a high nutritional value of whey with health benefits claimed for probiotic bac- teria. The aim of the study was to define quality properties of beverages based on fresh acid whey and milk with addition of buttermilk powder or sweet whey powder. Material and methods. Samples were inoculated with two strains of commercial probiotic cultures: Lac- tobacillus acidophilus La-5 or Bifidobacterium animalis Bb-12. After fermentation, samples were stored at refrigerated conditions. After 1, 4, 7, 14 and 21 days sensory characteristics, hardness, acetaldehyde content, titratable acidity, pH acidity and count of bacteria cells were evaluated. Results. Throughout all storage period, the number of bacteria was higher than 8 log cfu/ml in the all sam- ples. Beverages with La-5 strain had higher hardness and acidity, whilst samples with Bb-12 contained more acetaldehyde. Samples with buttermilk powder had better sensory properties than with sweet whey powder. Conclusions. Obtained products made of acid whey combined with milk and fortified with buttermilk pow- der or sweet whey powder, are good medium for growth and survival of examined probiotic bacteria strains. The level of bacteria was sufficient to provide health benefits to consumers.

  20. The Magic Sign: Acids, Bases, and Indicators.

    Phillips, Donald B.

    1986-01-01

    Presents an approach that is used to introduce elementary and junior high students to a series of activities that will provide concrete experiences with acids, bases, and indicators. Provides instructions and listings of needed solutions and materials for developing this "magic sign" device. Includes background information and several…

  1. Jigsaw Cooperative Learning: Acid-Base Theories

    Tarhan, Leman; Sesen, Burcin Acar

    2012-01-01

    This study focused on investigating the effectiveness of jigsaw cooperative learning instruction on first-year undergraduates' understanding of acid-base theories. Undergraduates' opinions about jigsaw cooperative learning instruction were also investigated. The participants of this study were 38 first-year undergraduates in chemistry education…

  2. Glycine/Glycolic acid based copolymers

    Veld, in 't Peter J.A.; Shen, Zheng-Rong; Takens, Gijsbert A.J.; Dijkstra, Pieter J.; Feijen, Jan

    1994-01-01

    Glycine/glycolic acid based biodegradable copolymers have been prepared by ring-opening homopolymerization of morpholine-2,5-dione, and ring-opening copolymerization of morpholine-2,5-dione and glycolide. The homopolymerization of morpholine-2,5-dione was carried out in the melt at 200°C for 3 min u

  3. alpha-Amylase gene of Streptomyces limosus: nucleotide sequence, expression motifs, and amino acid sequence homology to mammalian and invertebrate alpha-amylases.

    1987-01-01

    The nucleotide sequence of the coding and regulatory regions of the alpha-amylase gene (aml) of Streptomyces limosus was determined. High-resolution S1 mapping was used to locate the 5' end of the transcript and demonstrated that the gene is transcribed from a unique promoter. The predicted amino acid sequence has considerable identity to mammalian and invertebrate alpha-amylases, but not to those of plant, fungal, or eubacterial origin. Consistent with this is the susceptibility of the enzym...

  4. Generation of deviation parameters for amino acid singlets, doublets and triplets from three-dimensional structures of proteins and its implications for secondary structure prediction from amino acid sequences

    S A Mugilan; K Veluraja

    2000-03-01

    We present a new method, secondary structure prediction by deviation parameter (SSPDP) for predicting the secondary structure of proteins from amino acid sequence. Deviation parameters (DP) for amino acid singlets, doublets and triplets were computed with respect to secondary structural elements of proteins based on the dictionary of secondary structure prediction (DSSP)-generated secondary structure for 408 selected non-homologous proteins. To the amino acid triplets which are not found in the selected dataset, a DP value of zero is assigned with respect to the secondary structural elements of proteins. The total number of parameters generated is 15,432, in the possible parameters of 25,260. Deviation parameter is complete with respect to amino acid singlets, doublets, and partially complete with respect to amino acid triplets. These generated parameters were used to predict secondary structural elements from amino acid sequence. The secondary structure predicted by our method (SSPDP) was compared with that of single sequence (NNPREDICT) and multiple sequence (PHD) methods. The average value of the percentage of prediction accuracy for α-helix by SSPDP, NNPREDICT and PHD methods was found to be 57%, 44% and 69% respectively for the proteins in the selected dataset. For -strand the prediction accuracy is found to be 69%, 21% and 53% respectively by SSPDP, NNPREDICT and PHD methods. This clearly indicates that the secondary structure prediction by our method is as good as PHD method but much better than NNPREDICT method.

  5. AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences

    Claros M Gonzalo

    2010-06-01

    Full Text Available Abstract Background Multiple sequence alignments are used to study gene or protein function, phylogenetic relations, genome evolution hypotheses and even gene polymorphisms. Virtually without exception, all available tools focus on conserved segments or residues. Small divergent regions, however, are biologically important for specific quantitative polymerase chain reaction, genotyping, molecular markers and preparation of specific antibodies, and yet have received little attention. As a consequence, they must be selected empirically by the researcher. AlignMiner has been developed to fill this gap in bioinformatic analyses. Results AlignMiner is a Web-based application for detection of conserved and divergent regions in alignments of conserved sequences, focusing particularly on divergence. It accepts alignments (protein or nucleic acid obtained using any of a variety of algorithms, which does not appear to have a significant impact on the final results. AlignMiner uses different scoring methods for assessing conserved/divergent regions, Entropy being the method that provides the highest number of regions with the greatest length, and Weighted being the most restrictive. Conserved/divergent regions can be generated either with respect to the consensus sequence or to one master sequence. The resulting data are presented in a graphical interface developed in AJAX, which provides remarkable user interaction capabilities. Users do not need to wait until execution is complete and can.even inspect their results on a different computer. Data can be downloaded onto a user disk, in standard formats. In silico and experimental proof-of-concept cases have shown that AlignMiner can be successfully used to designing specific polymerase chain reaction primers as well as potential epitopes for antibodies. Primer design is assisted by a module that deploys several oligonucleotide parameters for designing primers "on the fly". Conclusions AlignMiner can be used

  6. Development of a SCAR (sequence-characterised amplified region) marker for acid resistance-related gene in Lactobacillus plantarum.

    Liu, Shu-Wen; Li, Kai; Yang, Shi-Ling; Tian, Shu-Fen; He, Ling

    2015-03-01

    A sequence characterised amplified region marker was developed to determine an acid resistance-related gene in Lactobacillus plantarum. A random amplified polymorphic DNA marker named S116-680 was reported to be closely related to the acid resistance of the strains. The DNA band corresponding to this marker was cloned and sequenced with the induction of specific designed PCR primers. The results of PCR test helped to amplify a clear specific band of 680 bp in the tested acid-resistant strains. S116-680 marker would be useful to explore the acid-resistant mechanism of L. plantarum and to screen desirable malolactic fermentation strains.

  7. Amino acid sequences of predicted proteins and their annotation for 95 organism species. - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us ...Gclust Server Amino acid sequences of predicted proteins and their annotation for 95 organism species. Data ...detail Data name Amino acid sequences of predicted proteins and their annotation for 95 organism species. De...scription of data contents Amino acid sequences of predicted proteins and their a...nload License Update History of This Database Site Policy | Contact Us Amino acid sequences of predicted pro

  8. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    Xiaoyu Wang

    Full Text Available Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  9. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences.

    Tully, Philip J; Lindén, Henrik; Hennig, Matthias H; Lansner, Anders

    2016-05-01

    Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN) learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx). We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison.

  10. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences.

    Philip J Tully

    2016-05-01

    Full Text Available Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx. We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison.

  11. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  12. Amino acid sequence alignment of vertebrate CAPN3/calpain-3/p94

    Yasuko Ono

    2015-12-01

    Full Text Available CAPN3 is a calpain superfamily member that is predominantly expressed in skeletal muscle. So far, clear CAPN3 orthologs were found only in vertebrates. CAPN3 is a unique protease in that it undergoes extremely rapid and exhaustive autolysis and that autolyzed fragments spontaneously associate each other to reconstitute the proteolytic activity. These unique properties of CAPN3 are dependent on IS1 and IS2, two CAPN3-characterizing sequences that do not exist in other calpains or any other proteases. To understand how IS1 and IS2 are conserved among vertebrates, this data article provides amino acid sequence alignment of representative vertebrate CAPN3s. For further analysis and discussion, see Ono et al. [1

  13. Heterodimeric l-amino acid oxidase enzymes from Egyptian Cerastes cerastes venom: Purification, biochemical characterization and partial amino acid sequencing

    A.E. El Hakim

    2015-12-01

    Full Text Available Two l-amino acid oxidase enzyme isoforms, Cc-LAAOI and Cc-LAAOII were purified to apparent homogeneity from Cerastes cerastes venom in a sequential two-step chromatographic protocol including; gel filtration and anion exchange chromatography. The native molecular weights of the isoforms were 115 kDa as determined by gel filtration on calibrated Sephacryl S-200 column, while the monomeric molecular weights of the enzymes were, 60, 56 kDa and 60, 53 kDa for LAAOI and LAAOII, respectively. The tryptic peptides of the two isoforms share high sequence homology with other snake venom l-amino acid oxidases. The optimal pH and temperature values of Cc-LAAOI and Cc-LAAOII were 7.8, 50 °C and 7, 60 °C, respectively. The two isoenzymes were thermally stable up to 70 °C. The Km and Vmax values were 0.67 mM, 0.135 μmol/min for LAAOI and 0.82 mM, 0.087 μmol/min for LAAOII. Both isoenzymes displayed high catalytic preference to long-chain, hydrophobic and aromatic amino acids. The Mn2+ ion markedly increased the LAAO activity for both purified isoforms, while Na+, K+, Ca2+, Mg2+ and Ba2+ ions showed a non-significant increase in the enzymatic activity of both isoforms. Furthermore, Zn2+, Ni2+, Co2+, Cu2+ and AL3+ ions markedly inhibited the LAAOI and LAAOII activities. l-Cysteine and reduced glutathione completely inhibited the LAAO activity of both isoenzymes, whereas, β-mercaptoethanol, O-phenanthroline and PMSF completely inhibited the enzymatic activity of LAAOII. Furthermore, iodoacitic acid inhibited the enzymatic activity of LAAOII by 46% and had no effect on the LAAOI activity.

  14. Caption detection from video sequence based on fuzzy neural networks

    Gao, Xinbo; Xin, Hong; Li, Jie

    2001-09-01

    Caption graphically superimposed in video frames can provide important indexing information. The automatic detection and recognition of video captions can be of great help in querying topics of interest in digital news library. To detect the caption from video sequence, we present algorithms based on fuzzy clustering neural networks. Since neural networks have the capabilities of learning and self-organizing and parallel computing mechanism, with the great increasing of digital images and video databases, neural networks based techniques become more efficient and popular tools for multimedia processing. Experimental results show that our caption detection scheme is effective and robust.

  15. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  16. Comparative amino acid sequence analysis of hemolysins produced by Vibrio hollisae and Vibrio parahaemolyticus.

    Yoh, M; Honda, T.; Miwatani, T; Tsunasawa, S; Sakiyama, F

    1989-01-01

    Vibrio hollisae produces a hemolysin (Vh-rTDH) that is related to the thermostable direct hemolysin of Vibrio parahaemolyticus (Vp-TDH). Although both hemolysins are essentially similar biologically and immunologically, they differ markedly in heat stability; Vp-TDH is heat stable, whereas Vh-rTDH is heat labile. To elucidate the relationships between their characteristics and molecular structures, we analyzed the amino acid sequence of Vh-rTDH and compared it with that of Vp-TDH. Vh-rTDH con...

  17. Leveraging Sequence Classification by Taxonomy-Based Multitask Learning

    Widmer, Christian; Leiva, Jose; Altun, Yasemin; Rätsch, Gunnar

    In this work we consider an inference task that biologists are very good at: deciphering biological processes by bringing together knowledge that has been obtained by experiments using various organisms, while respecting the differences and commonalities of these organisms. We look at this problem from an sequence analysis point of view, where we aim at solving the same classification task in different organisms. We investigate the challenge of combining information from several organisms, whereas we consider the relation between the organisms to be defined by a tree structure derived from their phylogeny. Multitask learning, a machine learning technique that recently received considerable attention, considers the problem of learning across tasks that are related to each other. We treat each organism as one task and present three novel multitask learning methods to handle situations in which the relationships among tasks can be described by a hierarchy. These algorithms are designed for large-scale applications and are therefore applicable to problems with a large number of training examples, which are frequently encountered in sequence analysis. We perform experimental analyses on synthetic data sets in order to illustrate the properties of our algorithms. Moreover, we consider a problem from genomic sequence analysis, namely splice site recognition, to illustrate the usefulness of our approach. We show that intelligently combining data from 15 eukaryotic organisms can indeed significantly improve the prediction performance compared to traditional learning approaches. On a broader perspective, we expect that algorithms like the ones presented in this work have the potential to complement and enrich the strategy of homology-based sequence analysis that are currently the quasi-standard in biological sequence analysis.

  18. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  19. New monoclonal antibodies to the Ebola virus glycoprotein: Identification and analysis of the amino acid sequence of the variable domains.

    Panina, A A; Aliev, T K; Shemchukova, O B; Dement'yeva, I G; Varlamov, N E; Pozdnyakova, L P; Bokov, M N; Dolgikh, D A; Sveshnikov, P G; Kirpichnikov, M P

    2016-03-01

    We determined the nucleotide and amino acid sequences of variable domains of three new monoclonal antibodies to the glycoprotein of Ebola virus capsid. The framework and hypervariable regions of immunoglobulin heavy and light chains were identified. The primary structures were confirmed using massspectrometry analysis. Immunoglobulin database search showed the uniqueness of the sequences obtained.

  20. Watermarking scheme of colour image based on chaotic sequences

    LIU Nian-sheng; GUO Dong-hui

    2009-01-01

    The proposed perceptual mask is based on the singularity of cover image and matches very well with the properties of the human visual system. The cover colour image is decomposed into several subbands by the wavelet transform. The water-mark composed of chaotic sequence and the covert image is embedded into the subband with the largest energy. The chaos system plays an important role in the security invisibility and robustness of the proposed scheme. The parameter and initial state of chaos system can directly influence the generation of watermark information as a key. Moreover, the watermark information has the property of spread spectrum signal by chaotic sequence to improve the invisibility and security of watermarked image. Experimental results and comparisons with other watermarking techniques prove that the proposed algorithm is effective and feasible, and improves the security, invisibility and robustness of watermarking information.

  1. [Segmentation Method for Liver Organ Based on Image Sequence Context].

    Zhang, Meiyun; Fang, Bin; Wang, Yi; Zhong, Nanchang

    2015-10-01

    In view of the problems of more artificial interventions and segmentation defects in existing two-dimensional segmentation methods and abnormal liver segmentation errors in three-dimensional segmentation methods, this paper presents a semi-automatic liver organ segmentation method based on the image sequence context. The method takes advantage of the existing similarity between the image sequence contexts of the prior knowledge of liver organs, and combines region growing and level set method to carry out semi-automatic segmentation of livers, along with the aid of a small amount of manual intervention to deal with liver mutation situations. The experiment results showed that the liver segmentation algorithm presented in this paper had a high precision, and a good segmentation effect on livers which have greater variability, and can meet clinical application demands quite well.

  2. The first proton sponge-based amino acids: synthesis, acid-base properties and some reactivity.

    Ozeryanskii, Valery A; Gorbacheva, Anastasia Yu; Pozharskii, Alexander F; Vlasenko, Marina P; Tereznikov, Alexander Yu; Chernov'yants, Margarita S

    2015-08-21

    The first hybrid base constructed from 1,8-bis(dimethylamino)naphthalene (proton sponge or DMAN) and glycine, N-methyl-N-(8-dimethylamino-1-naphthyl)aminoacetic acid, was synthesised in high yield and its hydrobromide was structurally characterised and used to determine the acid-base properties via potentiometric titration. It was found that the basic strength of the DMAN-glycine base (pKa = 11.57, H2O) is on the level of amidine amino acids like arginine and creatine and its structure, zwitterionic vs. neutral, based on the spectroscopic (IR, NMR, mass) and theoretical (DFT) approaches has a strong preference to the zwitterionic form. Unlike glycine, the DMAN-glycine zwitterion is N-chiral and is hydrolytically cleaved with the loss of glycolic acid on heating in DMSO. This reaction together with the mild decarboxylative conversion of proton sponge-based amino acids into 2,3-dihydroperimidinium salts under air-oxygen was monitored with the help of the DMAN-alanine amino acid. The newly devised amino acids are unique as they combine fluorescence, strongly basic and redox-active properties.

  3. Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

    D. Satyanarayana Rao

    2007-02-01

    Full Text Available We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1 57-amino-acid-residue PxV domain, (2 122-amino-acid-residue FxF domain, (3 111-amino-acid-residue YEFF domain, (4 109-amino-acid-residue IMxxH domain, (5 103-amino-acid-residue VxxT domain, (6 84-amino-acid-residue ExW domain, (7 104-amino-acid-residue NTGFIG domain, (8 36-amino-acid-residue NxGK repeat, (9 95-amino-acid-residue VYV domain, (10 75-amino-acid-residue KEWE domain, (11 59-amino-acid-residue AFL domain, (12 53-amino-acid-residue RIDVK repeat, (13 (a 41-amino-acid-residue AGQF repeat and (b 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure.

  4. Kinetic investigation of a solvent-free, chemoenzymatic reaction sequence towards enantioselective synthesis of a β-amino acid ester.

    Strompen, Simon; Weiss, Markus; Ingram, Thomas; Smirnova, Irina; Gröger, Harald; Hilterhaus, Lutz; Liese, Andreas

    2012-06-01

    A solvent-free, chemoenzymatic reaction sequence for the enantioselective synthesis of β-amino acid esters has been kinetically and thermodynamically characterized. The coupled sequence comprises a thermal aza-Michael addition of cheap starting materials and a lipase catalyzed aminolysis for the kinetic resolution of the racemic ester. Excellent ee values of >99% were obtained for the β-amino acid ester at 60% conversion. Kinetic constants for the aza-Michael addition were obtained by straightforward numerical integration of second-order rate equations and nonlinear fitting of the progress curves. A different strategy had to be devised for the biocatalytic reaction. Initially, a simplified Michaelis-Menten model including product inhibition was developed for the reaction running in THF as an organic solvent. Activity based parameters were used instead of concentrations in order to facilitate the transfer of the kinetic model to the solvent-free system. Observed solvent effects not accounted for by the use of thermodynamic activities were incorporated into the kinetic model. Enzyme deactivation was observed to depend on the ratio of the applied substrates and also included in the kinetic model. The developed simple model is in very good agreement with the experimental data and allows the simulation and optimization of the solvent-free process.

  5. Phylogenetic analysis of dicyemid mesozoans (phylum Dicyemida) from innexin amino acid sequences: dicyemids are not related to Platyhelminthes.

    Suzuki, Takahito G; Ogino, Kazutoyo; Tsuneki, Kazuhiko; Furuya, Hidetaka

    2010-06-01

    Dicyemid mesozoans are endoparasites, or endosymbionts, found only in the renal sac of benthic cephalopod molluscs. The body organization of dicyemids is very simple, consisting of usually 10 to 40 cells, with neither body cavities nor differentiated organs. Dicyemids were considered as primitive animals, and the out-group of all metazoans, or as occupying a basal position of lophotrochozoans close to flatworms. We cloned cDNAs encoding for the gap junction component proteins, innexin, from the dicyemids. Its expression pattern was observed by whole-mount in situ hybridization. In adult individuals, the innexin was expressed in calottes, infusorigens, and infusoriform embryos. The unique temporal pattern was observed in the developing infusoriform embryos. Innexin amino acid sequences had taxon-specific indels which enabled identification of the 3 major protostome lineages, i.e., 2 ecdysozoans (arthropods and nematodes) and the lophotrochozoans. The dicyemids show typical, lophotrochozoan-type indels. In addition, the Bayesian and maximum likelihood trees based on the innexin amino acid sequences suggested dicyemids to be more closely related to the higher lophotrochozoans than to the flatworms. Flatworms were the sister group, or consistently basal, to the other lophotrochozoan clade that included dicyemids, annelids, molluscs, and brachiopods.

  6. Diversity of the 47-kD HtrA nucleic acid and translated amino acid sequences from 17 recent human isolates of Orientia.

    Jiang, Ju; Paris, Daniel H; Blacksell, Stuart D; Aukkanit, Nuntipa; Newton, Paul N; Phetsouvanh, Rattanaphone; Izzard, Leonard; Stenos, John; Graves, Stephen R; Day, Nicholas P J; Richards, Allen L

    2013-06-01

    Orientia tsutsugamushi, the etiologic agent of potentially fatal scrub typhus, is characterized by a high antigenic diversity, which complicates the development of a broadly protective vaccine. Efficacy studies in murine and nonhuman primate models demonstrated the DNA vaccine candidate pKarp47, based upon the O. tsutsugamushi Karp 47-kD HtrA protein gene, to be a successful immunoprophylactic against scrub typhus. To characterize 47-kD HtrA protein diversity among human isolates of Orientia, we sequenced the full open reading frame (ORF) of the 47-kD HtrA gene and analyzed the translated amino acid sequences of 17 patient isolates from Thailand (n=13), Laos (n=2), Australia (n=1), and the United Arab Emirates (UAE) (n=1) and 9 reference strains: Karp (New Guinea), Kato (Japan), Ikeda (Japan), Gilliam (Burma), Boryong (Korea), TA763, TH1811 and TH1817 (Thailand), and MAK243 (China). The percentage identity (similarity) of translated amino acid sequences between 16 new isolates and 9 reference strains of O. tsutsugamushi ranged from 96.4% to 100% (97.4% to 100%). However, inclusion of the recently identified Orientia chuto sp. nov. reduced identity (similarity) values to 82.2% to 83.3% (90.4% to 91.4%). These results demonstrate the diversity of Orientia 47-kD HtrA among isolates encountered by humans and therefore provide support for the necessity of developing a broadly protective scrub typhus vaccine that takes this diversity into account.

  7. Model-based quality assessment and base-calling for second-generation sequencing data.

    Bravo, Héctor Corrada; Irizarry, Rafael A

    2010-09-01

    Second-generation sequencing (sec-gen) technology can sequence millions of short fragments of DNA in parallel, making it capable of assembling complex genomes for a small fraction of the price and time of previous technologies. In fact, a recently formed international consortium, the 1000 Genomes Project, plans to fully sequence the genomes of approximately 1200 people. The prospect of comparative analysis at the sequence level of a large number of samples across multiple populations may be achieved within the next five years. These data present unprecedented challenges in statistical analysis. For instance, analysis operates on millions of short nucleotide sequences, or reads-strings of A,C,G, or T's, between 30 and 100 characters long-which are the result of complex processing of noisy continuous fluorescence intensity measurements known as base-calling. The complexity of the base-calling discretization process results in reads of widely varying quality within and across sequence samples. This variation in processing quality results in infrequent but systematic errors that we have found to mislead downstream analysis of the discretized sequence read data. For instance, a central goal of the 1000 Genomes Project is to quantify across-sample variation at the single nucleotide level. At this resolution, small error rates in sequencing prove significant, especially for rare variants. Sec-gen sequencing is a relatively new technology for which potential biases and sources of obscuring variation are not yet fully understood. Therefore, modeling and quantifying the uncertainty inherent in the generation of sequence reads is of utmost importance. In this article, we present a simple model to capture uncertainty arising in the base-calling procedure of the Illumina/Solexa GA platform. Model parameters have a straightforward interpretation in terms of the chemistry of base-calling allowing for informative and easily interpretable metrics that capture the variability in

  8. Prediction of potential drug targets based on simple sequence properties

    Lai Luhua

    2007-09-01

    Full Text Available Abstract Background During the past decades, research and development in drug discovery have attracted much attention and efforts. However, only 324 drug targets are known for clinical drugs up to now. Identifying potential drug targets is the first step in the process of modern drug discovery for developing novel therapeutic agents. Therefore, the identification and validation of new and effective drug targets are of great value for drug discovery in both academia and pharmaceutical industry. If a protein can be predicted in advance for its potential application as a drug target, the drug discovery process targeting this protein will be greatly speeded up. In the current study, based on the properties of known drug targets, we have developed a sequence-based drug target prediction method for fast identification of novel drug targets. Results Based on simple physicochemical properties extracted from protein sequences of known drug targets, several support vector machine models have been constructed in this study. The best model can distinguish currently known drug targets from non drug targets at an accuracy of 84%. Using this model, potential protein drug targets of human origin from Swiss-Prot were predicted, some of which have already attracted much attention as potential drug targets in pharmaceutical research. Conclusion We have developed a drug target prediction method based solely on protein sequence information without the knowledge of family/domain annotation, or the protein 3D structure. This method can be applied in novel drug target identification and validation, as well as genome scale drug target predictions.

  9. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  10. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  11. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains to oxidize effectively d-glucose. PMID:27365351

  12. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  13. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    Yan Koon-Kiu

    2007-11-01

    Full Text Available Abstract Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw" duplication and deletion rates rdup∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsgaKjabbwha1jabbchaWbqaaiabgEHiQaaaaaa@3283@, rdel∗ MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabbsga

  14. Complete amino acid sequence of the myoglobin from the Atlantic bottlenosed dolphin, Tursiops truncatus.

    Jones, B N; Vigna, R A; Dwulet, F E; Bogardt, R A; Lehman, L D; Gurd, F R

    1976-10-05

    The complete amino acid sequence of the major component myoglobin from the Atlantic bottlenosed dolphin, Tursiops truncatus, was determined by specific cleavage of the protein to obtain large peptides that are readily degraded by the automatic sequencer. Three easily separable peptides were obtained by cleaving the protein with cyanogen bromide at the 2 methionine residues and 4 peptides were obtained by cleaving the methyl acetimidated protein with trypsin at the 3 arginine residues. By subjecting 4 of these peptides and the apomyoglobin to automatic Edman degradation, over 80% of the covalent structure of the protein was obtained. The remainder of the primary structure was determined by further digestion of the central cyanogen bromide peptide with trypsin and staphylococcal protease. This myoglobin differs from that of the sperm whale, Physter catodon, at 15 positions, from that of the California gray whale, Eschrichtius gibbosus, at 14 positions, from that of the common porpoise, Phocoena phocoena, at 6 positions, and from the myoglobin of the Black Sea dolphin, Delphinus delphis and the Amazon River dolphin, Inia goeffrensis, at 5 and 7 positions, respecitvely. All substitutions observed in this sequence fit easily into the tertiary structure of sperm whale myoglobin.

  15. Secondary Structure Predictions for Long RNA Sequences Based on Inversion Excursions and MapReduce.

    Yehdego, Daniel T; Zhang, Boyu; Kodimala, Vikram K R; Johnson, Kyle L; Taufer, Michela; Leung, Ming-Ying

    2013-05-01

    Secondary structures of ribonucleic acid (RNA) molecules play important roles in many biological processes including gene expression and regulation. Experimental observations and computing limitations suggest that we can approach the secondary structure prediction problem for long RNA sequences by segmenting them into shorter chunks, predicting the secondary structures of each chunk individually using existing prediction programs, and then assembling the results to give the structure of the original sequence. The selection of cutting points is a crucial component of the segmenting step. Noting that stem-loops and pseudoknots always contain an inversion, i.e., a stretch of nucleotides followed closely by its inverse complementary sequence, we developed two cutting methods for segmenting long RNA sequences based on inversion excursions: the centered and optimized method. Each step of searching for inversions, chunking, and predictions can be performed in parallel. In this paper we use a MapReduce framework, i.e., Hadoop, to extensively explore meaningful inversion stem lengths and gap sizes for the segmentation and identify correlations between chunking methods and prediction accuracy. We show that for a set of long RNA sequences in the RFAM database, whose secondary structures are known to contain pseudoknots, our approach predicts secondary structures more accurately than methods that do not segment the sequence, when the latter predictions are possible computationally. We also show that, as sequences exceed certain lengths, some programs cannot computationally predict pseudoknots while our chunking methods can. Overall, our predicted structures still retain the accuracy level of the original prediction programs when compared with known experimental secondary structure.

  16. Multifunctional hybrid networks based on self assembling peptide sequences

    Sathaye, Sameer

    The overall aim of this dissertation is to achieve a comprehensive correlation between the molecular level changes in primary amino acid sequences of amphiphilic beta-hairpin peptides and their consequent solution-assembly properties and bulk network hydrogel behavior. This has been accomplished using two broad approaches. In the first approach, amino acid substitutions were made to peptide sequence MAX1 such that the hydrophobic surfaces of the folded beta-hairpins from the peptides demonstrate shape specificity in hydrophobic interactions with other beta-hairpins during the assembly process, thereby causing changes to the peptide nanostructure and bulk rheological properties of hydrogels formed from the peptides. Steric lock and key complementary hydrophobic interactions were designed to occur between two beta-hairpin molecules of a single molecule, LNK1 during beta-sheet fibrillar assembly of LNK1. Experimental results from circular dichroism, transmission electron microscopy and oscillatory rheology collectively indicate that the molecular design of the LNK1 peptide can be assigned the cause of the drastically different behavior of the networks relative to MAX1. The results indicate elimination or significant reduction of fibrillar branching due to steric complementarity in LNK1 that does not exist in MAX1, thus supporting the original hypothesis. As an extension of the designed steric lock and key complementarity between two beta-hairpin molecules of the same peptide molecule. LNK1, three new pairs of peptide molecules LP1-KP1, LP2-KP2 and LP3-KP3 that resemble complementary 'wedge' and 'trough' shapes when folded into beta-hairpins were designed and studied. All six peptides individually and when blended with their corresponding shape complement formed fibrillar nanostructures with non-uniform thickness values. Loose packing in the assembled structures was observed in all the new peptides as compared to the uniform tight packing in MAX1 by SANS analysis. This

  17. Analysis of Sequence Based Classifier Prediction for HIV Subtypes

    S. Santhosh Kumar

    2012-10-01

    Full Text Available Human immunodeficiency virus (HIV is a lent virus that causes acquired immunodeficiency syndrome (AIDS. The main drawback in HIV treatment process is its sub type prediction. The sub type and group classification of HIV is based on its genetic variability and location. HIV can be divided into two major types, HIV type 1 (HIV-1 and HIV type 2 (HIV-2. Many classifier approaches have been used to classify HIV subtypes based on their group, but some of cases are having two groups in one; in such cases the classification becomes more complex. The methodology used is this paper based on the HIV sequences. For this work several classifier approaches are used to classify the HIV1 and HIV2. For implementation of the work a real time patient database is taken and the patient records are experimented and the final best classifier is identified with quick response time and least error rate.

  18. Next-Generation Sequencing-Based Molecular Diagnosis of Choroideremia

    Kayo Shimizu

    2015-07-01

    Full Text Available We screened patients with choroideremia using next-generation sequencing (NGS and identified a novel mutation and a known mutation in the CHM gene. One patient presented an atypical fundus appearance for choroideremia. Another patient presented macular hole retinal detachment in the left eye. The present case series shows the utility of NGS-based screening in patients with choroideremia. In addition, the presence of macular hole in 1 of the 2 patients, together with a previous report, indicated the susceptibility of patients with choroideremia to macular hole.

  19. All-optical pseudorandom bit sequences generator based on TOADs

    Sun, Zhenchao; Wang, Zhi; Wu, Chongqing; Wang, Fu; Li, Qiang

    2016-03-01

    A scheme for all-optical pseudorandom bit sequences (PRBS) generator is demonstrated with optical logic gate 'XNOR' and all-optical wavelength converter based on cascaded Tera-Hertz Optical Asymmetric Demultiplexer (TOADs). Its feasibility is verified by generation of return-to-zero on-off keying (RZ-OOK) 263-1 PRBS at the speed of 1 Gb/s with 10% duty radio. The high randomness of ultra-long cycle PRBS is validated by successfully passing the standard benchmark test.

  20. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    Meiler Arno

    2012-09-01

    Full Text Available Abstract Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  1. Open questions in origin of life: experimental studies on the origin of nucleic acids and proteins with specific and functional sequences by a chemical synthetic biology approach.

    Adamala, Katarzyna; Anella, Fabrizio; Wieczorek, Rafal; Stano, Pasquale; Chiarabelli, Cristiano; Luisi, Pier Luigi

    2014-01-01

    In this mini-review we present some experimental approaches to the important issue in the origin of life, namely the origin of nucleic acids and proteins with specific and functional sequences. The formation of macromolecules on prebiotic Earth faces practical and conceptual difficulties. From the chemical viewpoint, macromolecules are formed by chemical pathways leading to the condensation of building blocks (amino acids, or nucleotides) in long-chain copolymers (proteins and nucleic acids, respectively). The second difficulty deals with a conceptual problem, namely with the emergence of specific sequences among a vast array of possible ones, the huge "sequence space", leading to the question "why these macromolecules, and not the others?" We have recently addressed these questions by using a chemical synthetic biology approach. In particular, we have tested the catalytic activity of small peptides, like Ser-His, with respect to peptide- and nucleotides-condensation, as a realistic model of primitive organocatalysis. We have also set up a strategy for exploring the sequence space of random proteins and RNAs (the so-called "never born biopolymer" project) with respect to the production of folded structures. Being still far from solved, the main aspects of these "open questions" are discussed here, by commenting on recent results obtained in our groups and by providing a unifying view on the problem and possible solutions. In particular, we propose a general scenario for macromolecule formation via fragment-condensation, as a scheme for the emergence of specific sequences based on molecular growth and selection.

  2. Localization of an O-glycosylated site in the recombinant barley alpha-amylase 1 produced in yeast and correction of the amino acid sequence using matrix-assisted laser desorption/ionization mass spectrometry of peptide mixtures

    Andersen, Jens S.; Søgaard, M; Svensson, B

    1994-01-01

    , and analyzed directly by MALDI-MS. Based on the three mass spectrometric peptide maps, an error in the sequence deduced from cDNA, resulting in a mass difference of 28 Da, was located to a sequence stretch of 5 amino acid residues; furthermore, a dihexose substituent was identified on Thr410. Subsequent Edman...

  3. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L. (Univ. of Massachusetts Medical School, Worcester (USA))

    1988-06-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity.

  4. Amino acid sequence of the oligomycin sensitivity-conferring protein (OSCP) of beef-heart mitochondria and its homology with the delta-subunit of the F1-ATPase of Escherichia coli.

    Ovchinnikov, Y A; Modyanov, N N; Grinkevich, V A; Aldanova, N A; Trubetskaya, O E; Nazimov, I V; Hundal, T; Ernster, L

    1984-01-23

    The complete amino acid sequence of the oligomycin sensitivity-conferring protein (OSCP) of beef-heart mitochondria is reported. The protein contains 190 amino acids and has a molecular mass of 20 967. Its structure is characterized by a concentration of charged amino acids in the two terminal segments (N 1-77 and C 128-190) of the protein, whereas its central region is more hydrophobic. The earlier reported homology of the protein with the delta-subunit of E. coli F1, based on the terminal amino acid sequences of OSCP, is further substantiated.

  5. Investigating Students' Reasoning about Acid-Base Reactions

    Cooper, Melanie M.; Kouyoumdjian, Hovig; Underwood, Sonia M.

    2016-01-01

    Acid-base chemistry is central to a wide range of reactions. If students are able to understand how and why acid-base reactions occur, it should provide a basis for reasoning about a host of other reactions. Here, we report the development of a method to characterize student reasoning about acid-base reactions based on their description of…

  6. Analysis of chimpanzee history based on genome sequence alignments.

    Jennifer L Caswell

    2008-04-01

    Full Text Available Population geneticists often study small numbers of carefully chosen loci, but it has become possible to obtain orders of magnitude for more data from overlaps of genome sequences. Here, we generate tens of millions of base pairs of multiple sequence alignments from combinations of three western chimpanzees, three central chimpanzees, an eastern chimpanzee, a bonobo, a human, an orangutan, and a macaque. Analysis provides a more precise understanding of demographic history than was previously available. We show that bonobos and common chimpanzees were separated approximately 1,290,000 years ago, western and other common chimpanzees approximately 510,000 years ago, and eastern and central chimpanzees at least 50,000 years ago. We infer that the central chimpanzee population size increased by at least a factor of 4 since its separation from western chimpanzees, while the western chimpanzee effective population size decreased. Surprisingly, in about one percent of the genome, the genetic relationships between humans, chimpanzees, and bonobos appear to be different from the species relationships. We used PCR-based resequencing to confirm 11 regions where chimpanzees and bonobos are not most closely related. Study of such loci should provide information about the period of time 5-7 million years ago when the ancestors of humans separated from those of the chimpanzees.

  7. Rapid sequencing of DNA based on single-molecule detection

    Soper, Steven A.; Davis, Lloyd M.; Fairfield, Frederick R.; Hammond, Mark L.; Harger, Carol A.; Jett, James H.; Keller, Richard A.; Marrone, Babetta L.; Martin, John C.; Nutter, Harvey L.; Shera, E. Brooks; Simpson, Daniel J.

    1991-07-01

    Sequencing the human genome is a major undertaking considering the large number of nucleotides present in the genome and the slow methods currently available to perform the task. The authors have recently reported on a scheme to sequence DNA rapidly using a non-gel based technique. The concept is based upon the incorporation of fluorescently labeled nucleotides into a strand of DNA, isolation and manipulation of a labeled DNA fragment and the detection of single nucleotides using ultra-sensitive laser-induced fluorescence detection following their cleavage from the fragment. Detection of individual fluorophores in the liquid phase was accomplished with time-gated detection following pulsed-laser excitation. The photon bursts from individual rhodamine 6G (R6G) molecules travelling through a laser beam have been observed, as have bursts from single fluorescently modified nucleotides. Using two different biotinylated nucleotides as a model system for fluorescently labeled nucleotides, the authors have observed synthesis of the complementary copy of M13 bacteriophage. Work with fluorescently labeled nucleotides is underway. Individual molecules of DNA attached to a microbead have been observed and manipulated with an epifluorescence microscope.

  8. Streaming Support for Data Intensive Cloud-Based Sequence Analysis

    Shadi A. Issa

    2013-01-01

    Full Text Available Cloud computing provides a promising solution to the genomics data deluge problem resulting from the advent of next-generation sequencing (NGS technology. Based on the concepts of “resources-on-demand” and “pay-as-you-go”, scientists with no or limited infrastructure can have access to scalable and cost-effective computational resources. However, the large size of NGS data causes a significant data transfer latency from the client’s site to the cloud, which presents a bottleneck for using cloud computing services. In this paper, we provide a streaming-based scheme to overcome this problem, where the NGS data is processed while being transferred to the cloud. Our scheme targets the wide class of NGS data analysis tasks, where the NGS sequences can be processed independently from one another. We also provide the elastream package that supports the use of this scheme with individual analysis programs or with workflow systems. Experiments presented in this paper show that our solution mitigates the effect of data transfer latency and saves both time and cost of computation.

  9. A machine-learning approach for predicting palmitoylation sites from integrated sequence-based features.

    Li, Liqi; Luo, Qifa; Xiao, Weidong; Li, Jinhui; Zhou, Shiwen; Li, Yongsheng; Zheng, Xiaoqi; Yang, Hua

    2017-02-01

    Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to the protein transportation, organelle localization, and functions, therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein-protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is in urgent need. In this study, a support vector machine (SVM)-based method was proposed through integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and Matthews Correlation Coefficient of 0.9773 for a benchmark dataset. The result indicates the efficiency and accuracy of our method in prediction of palmitoylation sites based on protein sequences.

  10. Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

    Nagib eAhsan

    2012-07-01

    Full Text Available The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.

  11. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    Selvaraj S

    2011-10-01

    Full Text Available Abstract Background Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology. Methods We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching. Results We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix. Conclusions The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.

  12. EFFECT OF CASEIN-BASED SEMISYNTHETIC FOOD ON RENAL ACID EXCRETION AND ACID-BASE STATE OF BLOOD IN DOGS

    ZIJLSTRA, WG; LANGBROEK, AJM; KRAAN, J; RISPENS, P; NIJMEIJER, A

    1995-01-01

    Urinary acid excretion and blood acid-base stare were determined in dogs fed a casein-based semi-synthetic food (SSF), to which different amounts of salts had been added, in comparison with feeding normal dog food. Net acid excretion (NAE) and inorganic acid excretion (IAE) increased during SSF feed

  13. A protein block based fold recognition method for the annotation of twilight zone sequences.

    Suresh, V; Ganesan, K; Parthasarathy, S

    2013-03-01

    The description of protein backbone was recently improved with a group of structural fragments called Structural Alphabets instead of the regular three states (Helix, Sheet and Coil) secondary structure description. Protein Blocks is one of the Structural Alphabets used to describe each and every region of protein backbone including the coil. According to de Brevern (2000) the Protein Blocks has 16 structural fragments and each one has 5 residues in length. Protein Blocks fragments are highly informative among the available Structural Alphabets and it has been used for many applications. Here, we present a protein fold recognition method based on Protein Blocks for the annotation of twilight zone sequences. In our method, we align the predicted Protein Blocks of a query amino acid sequence with a library of assigned Protein Blocks of 953 known folds using the local pair-wise alignment. The alignment results with z-value ≥ 2.5 and P-value ≤ 0.08 are predicted as possible folds. Our method is able to recognize the possible folds for nearly 35.5% of the twilight zone sequences with their predicted Protein Block sequence obtained by pb_prediction, which is available at Protein Block Export server.

  14. Sequence-based classification using discriminatory motif feature selection.

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  15. The myoglobin of Emperor penguin (Aptenodytes forsteri): amino acid sequence and functional adaptation to extreme conditions.

    Tamburrini, M; Romano, M; Giardina, B; di Prisco, G

    1999-02-01

    In the framework of a study on molecular adaptations of the oxygen-transport and storage systems to extreme conditions in Antarctic marine organisms, we have investigated the structure/function relationship in Emperor penguin (Aptenodytes forsteri) myoglobin, in search of correlation with the bird life style. In contrast with previous reports, the revised amino acid sequence contains one additional residue and 15 differences. The oxygen-binding parameters seem well adapted to the diving behaviour of the penguin and to the environmental conditions of the Antarctic habitat. Addition of lactate has no major effect on myoglobin oxygenation over a large temperature range. Therefore, metabolic acidosis does not impair myoglobin function under conditions of prolonged physical effort, such as diving.

  16. Bipolar Membranes for Acid Base Flow Batteries

    Anthamatten, Mitchell; Roddecha, Supacharee; Jorne, Jacob; Coughlan, Anna

    2011-03-01

    Rechargeable batteries can provide grid-scale electricity storage to match power generation with consumption and promote renewable energy sources. Flow batteries offer modular and flexible design, low cost per kWh and high efficiencies. A novel flow battery concept will be presented based on acid-base neutralization where protons (H+) and hydroxyl (OH-) ions react electrochemically to produce water. The large free energy of this highly reversible reaction can be stored chemically, and, upon discharge, can be harvested as usable electricity. The acid-base flow battery concept avoids the use of a sluggish oxygen electrode and utilizes the highly reversible hydrogen electrode, thus eliminating the need for expensive noble metal catalysts. The proposed flow battery is a hybrid of a battery and a fuel cell---hydrogen gas storing chemical energy is produced at one electrode and is immediately consumed at the other electrode. The two electrodes are exposed to low and high pH solutions, and these solutions are separated by a hybrid membrane containing a hybrid cation and anion exchange membrane (CEM/AEM). Membrane design will be discussed, along with ion-transport data for synthesized membranes.

  17. Development of Sequence-Based Microsatellite Marker for Phalaenopsis Orchid

    FATIMAH

    2011-06-01

    Full Text Available Phalaenopsis is one of the most interesting genera of orchids due to the members are often used as parents to produce hybrids. The establishment and development of highly reliable and discriminatory methods for identifying species and cultivars has become increasingly more important to plant breeders and members of the nursery industry. The aim of this research was to develop sequence-based microsatellite (eSSR markers for the Phalaenopsis orchid designed from the sequence of GenBank NCBI. Seventeen primers were designed and thirteen primers pairs could amplify the DNA giving the expected PCR product with polymorphism. A total of 51 alleles, with an average of 3 alleles per locus and polymorphism information content (PIC values at 0.674, were detected at the 16 SSR loci. Therefore, these markers could be used for identification of the Phalaenopsis orchid used in this study. Genetic similarity and principle coordinate analysis identified five major groups of Phalaenopsis sp. the first group consisted of P. amabilis, P. fuscata, P. javanica, and P. zebrine. The second group consisted of P. amabilis, P. amboinensis, P. bellina, P. floresens, and P. mannii. The third group consisted of P. bellina, P. cornucervi, P. cornucervi, P. violaceae sumatra, P. modesta. The forth group consisted of P. cornucervi and P. lueddemanniana, and the fifth group was P. amboinensis.

  18. Biodegradable polyesters based on succinic acid

    Nikolić Marija S.

    2003-01-01

    Full Text Available Two series of aliphatic polyesters based on succinic acid were synthesized by copolymerization with adipic acid for the first series of saturated polyesters, and with fumaric acid for the second series. Polyesters were prepared starting from the corresponding dimethyl esters and 1,4-butanediol by melt transesterification in the presence of a highly effective catalyst tetra-n-butyl-titanate, Ti(0Bu4. The molecular structure and composition of the copolyesters was determined by 1H NMR spectroscopy. The effect of copolymer composition on the physical and thermal properties of these random polyesters were investigated using differential scanning calorimetry. The degree of crystallinity was determined by DSC and wide angle X-ray. The degrees of crystallinity of the saturated and unsaturated copolyesters were generally reduced with respect to poly(butylene succinate, PBS. The melting temperatures of the saturated polyesters were lower, while the melting temperatures of the unsaturated copolyesters were higher than the melting temperature of PBS. The biodegradability of the polyesters was investigated by enzymatic degradation tests. The enzymatic degradation tests were performed in a buffer solution with Candida cylindracea lipase and for the unsaturated polyesters with Rhizopus arrhizus lipase. The extent of biodegradation was quantified as the weight loss of polyester films. Also the surface of the polyester films after degradation was observed using optical microscopy. It could be concluded that the biodegradability depended strongly on the degree of crystallinity, but also on the flexibility of the chain backbone. The highest biodegradation was observed for copolyesters containing 50 mol.% of adipic acid units, and in the series of unsaturated polyesters for copolyesters containing 5 and 10 mol.% of fumarate units. Although the degree of crystallinity of the unsaturated polyesters decreased slightly with increasing unsaturation, the biodegradation

  19. Establishment of the nucleic acid sequence-based amplificationmethod of detecting Vibrio parahaemolyticus%依赖于核酸序列恒温扩增技术快速检测副溶血性弧菌方法的建立

    倪鑫; 王志聪; 雷质文; 梁成珠; 汪东风; 姜英辉; 王建广; 祝素贞

    2011-01-01

    为建立副溶血性弧菌的快速检测方法,本研究采用依赖于核酸序列恒温扩增(NASBA)技术,以副溶血性弧菌的tlh基因为扩增的靶基因设计特异性引物和探针,建立可快速扩增副溶血性弧菌的NASBA方法.特异性和灵敏度试验结果表明:该NASBA方法对副溶血性弧菌的最小检出量为5.1×102 cfu/mL,高于普通PCR方法,而且与其他种属的菌无任何交叉反应.此外,本研究将副溶血弧菌扩增产物采用通用型核酸扩增物快速检测板进行检测,实现了特异性强的快速副溶血弧菌的检测.该方法对仪器要求低,具有良好的应用前景.%In this study, we adopted a technology based on nucleic acid sequence-based amplification (NASBA), to establish a rapid test method of Vibrio parahaemolyticus. Specific primers and probes were designed for the tlh target gene of V. Parahaemolyticus and the detection method of NASBA was established. Its specificity and sensitivity were tested. The results showed that the sensitivity of the NASBA was 5.1 x 102 cfii/mL which was higher than that of PCR method and this method had no cross reaction with other species of bacteria. The amplification products of V. Pamhaemolyticus were rapidly detected by using the universal rapid detection board of nucleic acid amplification product. In conclusion, detecting V. Parahaemolyticus with NASBA method was more specific and sensitive than PCR method and has lower instrumental requirement and a broad application.

  20. Using Willie's Acid-Base Box for Blood Gas Analysis

    Dietz, John R.

    2011-01-01

    In this article, the author describes a method developed by Dr. William T. Lipscomb for teaching blood gas analysis of acid-base status and provides three examples using Willie's acid-base box. Willie's acid-base box is constructed using three of the parameters of standard arterial blood gas analysis: (1) pH; (2) bicarbonate; and (3) CO[subscript…

  1. Teaching Acid/Base Physiology in the Laboratory

    Friis, Ulla G.; Plovsing, Ronni; Hansen, Klaus; Laursen, Bent G.; Wallstedt, Birgitta

    2010-01-01

    Acid/base homeostasis is one of the most difficult subdisciplines of physiology for medical students to master. A different approach, where theory and practice are linked, might help students develop a deeper understanding of acid/base homeostasis. We therefore set out to develop a laboratory exercise in acid/base physiology that would provide…

  2. Hybridization probe for femtomolar quantification of selected nucleic acid sequences on a disposable electrode.

    Jenkins, Daniel M; Chami, Bilal; Kreuzer, Matthias; Presting, Gernot; Alvarez, Anne M; Liaw, Bor Yann

    2006-04-01

    Mixed monolayers of electroactive hybridization probes on gold surfaces of a disposable electrode were investigated as a technology for simple, sensitive, selective, and rapid gene identification. Hybridization to the ferrocene-labeled hairpin probes reproducibly diminished cyclic redox currents, presumably due to a displacement of the label from the electrode. Observed peak current densities were roughly 1000x greater than those observed in previous studies, such that results could easily be interpreted without the use of algorithms to correct for background polarization currents. Probes were sensitive to hybridization with a number of oligonucleotide sequences with varying homology, but target oligonucleotides could be distinguished from competing nontarget sequences based on unique "melting" profiles from the probe. Detection limits were demonstrated down to nearly 100 fM, which may be low enough to identify certain genetic conditions or infections without amplification. This technology has rich potential for use in field devices for gene identification as well as in gene microarrays.

  3. Application of peptide nucleic acids containing azobenzene self-assembled electrochemical biosensors in detecting DNA sequences

    2009-01-01

    Hybridization of peptide nucleic acids probe containing azobenzene (NH2-TNT4, N-PNAs) with DNA was performed by covalently immobilizing of NH2-TNT4 in sequence on the 3-mercaptopropionic acid self-assembled monolayer modified gold electrode with the helps of N-(3-dimethylaminopropy1)-N’-ethylcarbodiimide hydrochloride (EDC) and N-hydroxysuccinimide (NHS), and the hybrid was coded as N-PNAs/DNA. Using [Fe(CN)6]4-/3- (1:1) as the electrochemical indicator, the electrochemical properties of the N-PNAs self-assembled monolayer (N-PNAs-SAMs) and N-PNAs/DNA hybridization system under the conditions of before and after UV light irradiation were characterized with cyclic voltammetry (CV), differential pulse voltammetry (DPV), and electrochemical impedance spectra (EIS). Results showed that the redox currents decreased with the increase of irradiation time, suggesting that the ability of the charge transfer on the electrode surface was weakened and the conformation of hybrid system had been changed, and the control of PNAs/DNA hybridization could be realized by UV light irradiation.

  4. Similarity Measurement of Web Sessions Based on Sequence Alignment

    LI Chaofeng; LU Yansheng

    2007-01-01

    The task of clustering Web sessions is to group Web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-group similarity.The first and foremost question needed to be considered in clustering Web sessions is how to measure the similarity between Web sessions. However, there are many shortcomings in traditional measurements. This paper introduces a new method for measuring similarities between Web pages that takes into account not only the URL but also the viewing time of the visited Web page. Then we give a new method to measure the similarity of Web sessions using sequence alignment and the similarity of Web page access in detail.Experiments have proved that our method is valid and efficient.

  5. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences

    Tully, Philip J; Lindén, Henrik; Hennig, Matthias H

    2016-01-01

    of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx). We show that the learning......Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed...... in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN) learning rule. We find that the formation of distributed memories, embodied by increased periods...

  6. Canine amino acid transport system Xc(-): cDNA sequence, distribution and cystine transport activity in lens epithelial cells.

    Maruo, Takuya; Kanemaki, Nobuyuki; Onda, Ken; Sato, Reiichiro; Ichihara, Nobuteru; Ochiai, Hideharu

    2014-04-01

    The cystine transport activity of a lens epithelial cell line originated from a canine mature cataract was investigated. The distinct cystine transport activity was observed, which was inhibited to 28% by extracellular 1 mM glutamate. The cDNA sequences of canine cysteine/glutamate exchanger (xCT) and 4F2hc were determined. The predicted amino acid sequences were 527 and 533 amino acid polypeptides, respectively. The amino acid sequences of canine xCT and 4F2hc showed high similarities (>80%) to those of humans. The expression of xCT in lens epithelial cell line was confirmed by western blot analysis. RT-PCR analysis revealed high level expression only in the brain, and it was below the detectable level in other tissues.

  7. A novel phytase with sequence similarity to purple acid phosphatases is expressed in cotyledons of germinating soybean seedlings.

    Hegeman, C E; Grabau, E A

    2001-08-01

    Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases.

  8. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    Myers, G.; Korber, B. [eds.] [Los Alamos National Lab., NM (United States); Wain-Hobson, S. [ed.] [Laboratory of Molecular Retrovirology, Pasteur Inst.; Smith, R.F. [ed.] [Baylor Coll. of Medicine, Houston, TX (United States). Dept. of Pharmacology; Pavlakis, G.N. [ed.] [National Cancer Inst., Frederick, MD (United States). Cancer Research Facility

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  9. Liquid Crystalline Furandicarboxylic Acid-based Aaromatic Polyesters

    WILSENS, CAROLUS HENRICUS R. MARIA; RASTOGI, SANJAY; VELD, MARTIJN ARNOLDUS JOHANNES; KLOP, ENNO ANTON; NOORDOVER, BART ADRIANUS JOHANNES

    2013-01-01

    The invention pertains to a fully aromatic liquid crystalline furandicarboxylic acid- based aromatic polyester obtainable from a mixture of monomers comprising 2,5- furandicarboxylic acid, p-hydroxybenzoic acid, an aromatic diol, and 5-40 mol% of an aromatic monocarboxylic acid selected from vanilli

  10. ProteDNA: a sequence-based predictor of sequence-specific DNA-binding residues in transcription factors.

    Chu, Wen-Yi; Huang, Yu-Feng; Huang, Chun-Chin; Cheng, Yi-Sheng; Huang, Chien-Kang; Oyang, Yen-Jen

    2009-07-01

    This article presents the design of a sequence-based predictor named ProteDNA for identifying the sequence-specific binding residues in a transcription factor (TF). Concerning protein-DNA interactions, there are two types of binding mechanisms involved, namely sequence-specific binding and nonspecific binding. Sequence-specific bindings occur between protein sidechains and nucleotide bases and correspond to sequence-specific recognition of genes. Therefore, sequence-specific bindings are essential for correct gene regulation. In this respect, ProteDNA is distinctive since it has been designed to identify sequence-specific binding residues. In order to accommodate users with different application needs, ProteDNA has been designed to operate under two modes, namely, the high-precision mode and the balanced mode. According to the experiments reported in this article, under the high-precision mode, ProteDNA has been able to deliver precision of 82.3%, specificity of 99.3%, sensitivity of 49.8% and accuracy of 96.5%. Meanwhile, under the balanced mode, ProteDNA has been able to deliver precision of 60.8%, specificity of 97.6%, sensitivity of 60.7% and accuracy of 95.4%. ProteDNA is available at the following websites: http://protedna.csbb.ntu.edu.tw/, http://protedna.csie.ntu.edu.tw/, http://bio222.esoe.ntu.edu.tw/ProteDNA/.

  11. The application of nucleic acid sequence?based amplification,real?time PCR and GM test in invasive aspergillosis diagnosis%核酸序列依赖性扩增、Real?time PCR及GM试验诊断侵袭性曲霉菌感染的临床应用评价

    王立朋; 鲍翠霞; 于丽梅; 张晓录; 于威娟; 张霞; 李玮; 黄葆华; 李杰

    2015-01-01

    Objective To study the diagnostic performance of nucleic acid sequence?based amplification ( NASBA) assay,real?time PCR and GM test in detecting invasive aspergillosis for clinical diagnosis.Methods Blood samples from 80 patients at a high risk for IA were collected during from November 2013 to June 2014.These patients were categorized as 8 proven IA,26 probable IA, and 46 non?IA according to the 2008 revised definitions of EORTC/MSG.Blood samples were tested by NASBA,real?time PCR and GM test and their diagnostic parameters were calculated,respectively.Result The sensitivity of NASBA,real?time PCR and GM test was 76.47%,67.65% and 52.94%,while their specificity was 80.43%,89.13%,80.43%,respectively.The efficiency of various com?binations of tests was also evaluated.Perfect specificity (100%) and positive predictive value (100%) were achieved by combining NASBA and real?time PCR as a serial testing.A combination of NASBA and real?time PCR as a parallel testing was the most sensitive (94.12%).Conclusion The sensitivity and specificity of NASBA and real?time PCR were superior to GM test.Combination of these assays could be particularly useful in specific clinical situations.%目的 核酸序列依赖性扩增 ( nucleic acid sequence?based amplification,NASBA)、Real?time PCR及GM试验在侵袭性曲霉菌感染中的诊断价值. 方法 收集2013年11月~2014年6月临床上曲霉菌感染高危病患的血液标本80例,并根据EORTC/MSG诊断标准分为确诊组8例,拟诊组26例,非感染组46例,分别利用NASBA、real?time PCR及GM试验进行检测,计算3种方法的诊断指标并分析评价. 结果 NASBA、real?time PCR及GM试验3种方法的灵敏度分别为76.47%、67.65%、52.94%,特异度分别为80.43%、89.13%、80.43%. 联合诊断结果显示,NASBA与real?time PCR串联方案有最好的特异度 (100%)及阳性预测值(100%);NASBA与real?time PCR并联方案则最为灵敏(94.12%). 结论 NASBA用于诊断IA最为敏感,而real?time PCR

  12. Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

    Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

    2017-04-01

    The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes among other several distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLPs) encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP-genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for better understanding of the mechanisms involved in SLPs functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.

  13. Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

    Leitner Dietmar

    2005-04-01

    Full Text Available Abstract Background A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.

  14. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  15. Mapping the molecular characteristics of Brazilian human T-cell lymphotropic virus type 1 Env (gp46 and Pol amino acid sequences for vaccine design

    Aline Cristina Mota-Miranda

    2007-09-01

    Full Text Available This study was carried out to evaluate the molecular pattern of all available Brazilian human T-cell lymphotropic virus type 1 Env (n = 15 and Pol (n = 43 nucleotide sequences via epitope prediction, physico-chemical analysis, and protein potential sites identification, giving support to the Brazilian AIDS vaccine program. In 12 previously described peptides of the Env sequences we found 12 epitopes, while in 4 peptides of the Pol sequences we found 4 epitopes. The total variation on the amino acid composition was 9 and 17% for human leukocyte antigen (HLA class I and class II Env epitopes, respectively. After analyzing the Pol sequences, results revealed a total amino acid variation of 0.75% for HLA-I and HLA-II epitopes. In 5 of the 12 Env epitopes the physico-chemical analysis demonstrated that the mutations magnified the antigenicity profile. The potential protein domain analysis of Env sequences showed the loss of a CK-2 phosphorylation site caused by D197N mutation in one epitope, and a N-glycosylation site caused by S246Y and V247I mutations in another epitope. Besides, the analysis of selection pressure have found 8 positive selected sites (w = 9.59 using the codon-based substitution models and maximum-likelihood methods. These studies underscore the importance of this Env region for the virus fitness, for the host immune response and, therefore, for the development of vaccine candidates.

  16. BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

    Bahr, A; Thompson, J D; Thierry, J C; Poch, O

    2001-01-01

    BAliBASE is specifically designed to serve as an evaluation resource to address all the problems encountered when aligning complete sequences. The database contains high quality, manually constructed multiple sequence alignments together with detailed annotations. The alignments are all based on three-dimensional structural superpositions, with the exception of the transmembrane sequences. The first release provided sets of reference alignments dealing with the problems of high variability, unequal repartition and large N/C-terminal extensions and internal insertions. Here we describe version 2.0 of the database, which incorporates three new reference sets of alignments containing structural repeats, trans-membrane sequences and circular permutations to evaluate the accuracy of detection/prediction and alignment of these complex sequences. BAliBASE can be viewed at the web site http://www-igbmc.u-strasbg. fr/BioInfo/BAliBASE2/index.html or can be downloaded from ftp://ftp-igbmc.u-strasbg.fr/pub/BAliBASE2 /.

  17. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    Chang, Soo-Ik (Harvard Medical School, Boston, MA (USA)); Hammes, G.G. (Univ. of California, Santa Barbara (USA))

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the {beta}-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.

  18. Multiple sequence alignment using multi-objective based bacterial foraging optimization algorithm.

    Rani, R Ranjani; Ramyachitra, D

    2016-12-01

    Multiple sequence alignment (MSA) is a widespread approach in computational biology and bioinformatics. MSA deals with how the sequences of nucleotides and amino acids are sequenced with possible alignment and minimum number of gaps between them, which directs to the functional, evolutionary and structural relationships among the sequences. Still the computation of MSA is a challenging task to provide an efficient accuracy and statistically significant results of alignments. In this work, the Bacterial Foraging Optimization Algorithm was employed to align the biological sequences which resulted in a non-dominated optimal solution. It employs Multi-objective, such as: Maximization of Similarity, Non-gap percentage, Conserved blocks and Minimization of gap penalty. BAliBASE 3.0 benchmark database was utilized to examine the proposed algorithm against other methods In this paper, two algorithms have been proposed: Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC) and Bacterial Foraging Optimization Algorithm. It was found that Hybrid Genetic Algorithm with Artificial Bee Colony performed better than the existing optimization algorithms. But still the conserved blocks were not obtained using GA-ABC. Then BFO was used for the alignment and the conserved blocks were obtained. The proposed Multi-Objective Bacterial Foraging Optimization Algorithm (MO-BFO) was compared with widely used MSA methods Clustal Omega, Kalign, MUSCLE, MAFFT, Genetic Algorithm (GA), Ant Colony Optimization (ACO), Artificial Bee Colony (ABC), Particle Swarm Optimization (PSO) and Hybrid Genetic Algorithm with Artificial Bee Colony (GA-ABC). The final results show that the proposed MO-BFO algorithm yields better alignment than most widely used methods.

  19. Sequence-dependent elasticity and electrostatics of single-stranded DNA: signatures of base-stacking.

    McIntosh, Dustin B; Duggan, Gina; Gouil, Quentin; Saleh, Omar A

    2014-02-04

    Base-stacking is a key factor in the energetics that determines nucleic acid structure. We measure the tensile response of single-stranded DNA as a function of sequence and monovalent salt concentration to examine the effects of base-stacking on the mechanical and thermodynamic properties of single-stranded DNA. By comparing the elastic response of highly stacked poly(dA) and that of a polypyrimidine sequence with minimal stacking, we find that base-stacking in poly(dA) significantly enhances the polymer's rigidity. The unstacking transition of poly(dA) at high force reveals that the intrinsic electrostatic tension on the molecule varies significantly more weakly on salt concentration than mean-field predictions. Further, we provide a model-independent estimate of the free energy difference between stacked poly(dA) and unstacked polypyrimidine, finding it to be ∼-0.25 kBT/base and nearly constant over three orders of magnitude in salt concentration.

  20. Purification, amino-acid sequence and partial characterization of two toxins with anti-insect activity from the venom of the South American scorpion Tityus bahiensis (Buthidae).

    Pimenta, A M; Martin-Eauclaire, M; Rochat, H; Figueiredo, S G; Kalapothakis, E; Afonso, L C; De Lima, M E

    2001-07-01

    We report here the isolation by a two-step chromatographic procedure of two new toxins from the South American scorpion Tityus bahiensis. Their amino-acid sequences and some of their biological features were established. The two toxins have different biological properties. Toxin TbIT-I had almost no activity or pharmacological effects in vertebrate tissues whereas it was lethal to house flies (LD50 80.0 ng/house fly). In contrast, Tb2-II was active against both mammals (intracerebroventricular injection of 100 ng/mouse was lethal) and insects (LD50 40.0 ng/house fly). The amino-acid sequences of these toxins were established and found to be similar (60-95%) to previously described beta-toxins from the Tityus genus. Based on the available comparative information, this study attempts identify possible structure-function relationships that may be responsible for the differences in bioactivity displayed by these toxins.

  1. Mink serum amyloid A protein. Expression and primary structure based on cDNA sequences.

    Marhaug, G; Husby, G; Dowton, S B

    1990-06-15

    The nucleotide sequences of two mink serum amyloid A (SAA) cDNA clones have been analyzed, one (SAA1) 776 base pairs long and the other (SAA2) 552 base pairs long. Significant differences were discovered when derived amino acid sequences were compared with data for apoSAA isolated from high density lipoprotein. Previous studies of mink protein SAA and amyloid protein A (AA) suggest that only one SAA isotype is amyloidogenic. The cDNA clone for SAA2 defines the "amyloid prone" isotype while SAA1 is found only in serum. Mink SAA1 has alanine in position 10, isoleucine in positions 24, 67, and 71, lysine in position 27, and proline in position 105. Residue 10 in mink SAA2 is valine while arginine and asparagine are at positions 24 and 27, respectively, all characteristics of protein AA isolated from mink amyloid fibrils. Mink SAA2 also has valine in position 67, phenylalanine in position 71, and amino acid 105 is serine. It remains unknown why these six amino acid substitutions render SAA2 more amyloidogenic than SAA1. Eighteen hours after lipopolysaccharide stimulation, mink SAA mRNA is abundant in liver with relatively minor accumulations in brain and lung. Genes encoding both SAA isotypes are expressed in all three organs while no SAA mRNA was detectable in amyloid prone organs, including spleen and intestine, indicating that deposition of AA from locally synthesized SAA is unlikely. A third mRNA species (2.2 kilobases) was identified and hybridizes with cDNA probes for mink SAA1 and SAA2. In addition to a major primary translation product (molecular mass 14,400 Da) an additional product with molecular mass 28,000 Da was immunoprecipitable.

  2. Multiple sequence alignment based on combining genetic algorithm with chaotic sequences.

    Gao, C; Wang, B; Zhou, C J; Zhang, Q

    2016-06-24

    In bioinformatics, sequence alignment is one of the most common problems. Multiple sequence alignment is an NP (nondeterministic polynomial time) problem, which requires further study and exploration. The chaos optimization algorithm is a type of chaos theory, and a procedure for combining the genetic algorithm (GA), which uses ergodicity, and inherent randomness of chaotic iteration. It is an efficient method to solve the basic premature phenomenon of the GA. Applying the Logistic map to the GA and using chaotic sequences to carry out the chaotic perturbation can improve the convergence of the basic GA. In addition, the random tournament selection and optimal preservation strategy are used in the GA. Experimental evidence indicates good results for this process.

  3. Pseudo-Random Sequences Generator Based on Discrete Hyperchaotic Systems

    李昌刚; 韩正之

    2003-01-01

    We first design a discrete hyperchaotic system via piecewise linear state feedback. The states of the closed loop system are locally expanding in two directions but absolutely bounded on the whole, which implies hyperchaos. Then, we use three suchlike hyperchaotie systems with different feedback gain matrices to design a pseudo-random sequence generator (PRSG). Through a threshold function, three sub-sequences generated from the output of piecewise linear functions are changed into 0-1 sequences. Then, followed by XOR operation, an unpredictable pseudo-random sequence (PRS) is ultimately obtained. The analysis and simulation results indicate that the PRS, generated with hyperchaotic systems, has desirable statistical features.

  4. The phylogenetic status of Paxillosida (Asteroidea) based on complete mitochondrial DNA sequences.

    Matsubara, Mioko; Komatsu, Miéko; Araki, Takeyoshi; Asakawa, Shuichi; Yokobori, Shin-ichi; Watanabe, Kimitsuna; Wada, Hiroshi

    2005-09-01

    One of the most important issues in asteroid phylogeny is the phylogenetic status of Paxillosida. This group lacks an anus and suckers on the tube feet in adults and does not develop the brachiolaria stage in early development. Two controversial hypotheses have been proposed for the phylogenetic status of Paxillosida, i.e., Paxillosida is primitive or rather specialized in asteroids. In this study, we determined the complete mitochondrial DNA nucleotide sequences from two paxillosidans (Astropecten polyacanthus and Luidia quinaria) and one forcipulatidan (Asterias amurensis). The mitochondrial genomes of the three asteroids were identical with respect to gene order and transcription direction, and were identical to the previously reported mitochondrial genomes of Asterina pectinifera (Valvatida) and Pisaster ochraceus (Forcipulatida) in this respect. Therefore, the comparison of genome structures was uninformative for the purposes of asteroid phylogeny. However, molecular phylogenetic analyses based on the amino acid sequences and the nucleotide sequences from the five asteroids supported the monophyly of the clade that included the two paxillosidans and Asterina. This suggests that the paxillosidan characters are secondarily derived ones.

  5. A FRET Biosensor for ROCK Based on a Consensus Substrate Sequence Identified by KISS Technology.

    Li, Chunjie; Imanishi, Ayako; Komatsu, Naoki; Terai, Kenta; Amano, Mutsuki; Kaibuchi, Kozo; Matsuda, Michiyuki

    2017-01-11

    Genetically-encoded biosensors based on Förster/fluorescence resonance energy transfer (FRET) are versatile tools for studying the spatio-temporal regulation of signaling molecules within not only the cells but also tissues. Perhaps the hardest task in the development of a FRET biosensor for protein kinases is to identify the kinase-specific substrate peptide to be used in the FRET biosensor. To solve this problem, we took advantage of kinase-interacting substrate screening (KISS) technology, which deduces a consensus substrate sequence for the protein kinase of interest. Here, we show that a consensus substrate sequence for ROCK identified by KISS yielded a FRET biosensor for ROCK, named Eevee-ROCK, with high sensitivity and specificity. By treating HeLa cells with inhibitors or siRNAs against ROCK, we show that a substantial part of the basal FRET signal of Eevee-ROCK was derived from the activities of ROCK1 and ROCK2. Eevee-ROCK readily detected ROCK activation by epidermal growth factor, lysophosphatidic acid, and serum. When cells stably-expressing Eevee-ROCK were time-lapse imaged for three days, ROCK activity was found to increase after the completion of cytokinesis, concomitant with the spreading of cells. Eevee-ROCK also revealed a gradual increase in ROCK activity during apoptosis. Thus, Eevee-ROCK, which was developed from a substrate sequence predicted by the KISS technology, will pave the way to a better understanding of the function of ROCK in a physiological context.

  6. SVM-PB-Pred: SVM based protein block prediction method using sequence profiles and secondary structures.

    Suresh, V; Parthasarathy, S

    2014-01-01

    We developed a support vector machine based web server called SVM-PB-Pred, to predict the Protein Block for any given amino acid sequence. The input features of SVM-PB-Pred include i) sequence profiles (PSSM) and ii) actual secondary structures (SS) from DSSP method or predicted secondary structures from NPS@ and GOR4 methods. There were three combined input features PSSM+SS(DSSP), PSSM+SS(NPS@) and PSSM+SS(GOR4) used to test and train the SVM models. Similarly, four datasets RS90, DB433, LI1264 and SP1577 were used to develop the SVM models. These four SVM models developed were tested using three different benchmarking tests namely; (i) self consistency, (ii) seven fold cross validation test and (iii) independent case test. The maximum possible prediction accuracy of ~70% was observed in self consistency test for the SVM models of both LI1264 and SP1577 datasets, where PSSM+SS(DSSP) input features was used to test. The prediction accuracies were reduced to ~53% for PSSM+SS(NPS@) and ~43% for PSSM+SS(GOR4) in independent case test, for the SVM models of above two same datasets. Using our method, it is possible to predict the protein block letters for any query protein sequence with ~53% accuracy, when the SP1577 dataset and predicted secondary structure from NPS@ server were used. The SVM-PB-Pred server can be freely accessed through http://bioinfo.bdu.ac.in/~svmpbpred.

  7. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE Consensus Sequence Repeats in the Viral Genome.

    Ashutosh Kumar

    2016-08-01

    Full Text Available Owing to the reports of microcephaly as a consistent outcome in the foetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV - microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favour of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1 sequence homology between ZIKV genome and the response element of an early neural tube developmental marker ‘retinoic acid’ in human DNA and (2 comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE consensus sequence (5′–AGGTCA–3′ in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly and other viruses available in National Institute of Health genetic sequence database (GenBank for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause foetal brain defects (for which maternal-foetal transmission during developing stage may be required. The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although bioinformatic

  8. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and

  9. Whole-body acid-base modeling revisited

    Ring, Troels; Nielsen, Søren

    2017-01-01

    The textbook account of whole-body acid-base balance in terms of endogenous acid production, renal net acid excretion and gastrointestinal alkali absorption which is the only comprehensive model around, has never been applied in clinical practice or been formally validated. In order to improve...... understanding of acid-base modeling, we managed to write up this conventional model as an expression solely on urine chemistry. Renal net acid excretion and endogenous acid production was already formulated in terms of urine chemistry, and we could from the literature also see gastrointestinal alkali absorption...

  10. Roche genome sequencer FLX based high-throughput sequencing of ancient DNA

    Alquezar-Planas, David E; Fordyce, Sarah Louise

    2012-01-01

    Since the development of so-called "next generation" high-throughput sequencing in 2005, this technology has been applied to a variety of fields. Such applications include disease studies, evolutionary investigations, and ancient DNA. Each application requires a specialized protocol to ensure tha...

  11. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  12. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-03-03

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes.

  13. N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

    Kuhn, H.; Fietzek, P P; Lampen, J O

    1982-01-01

    The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.

  14. High-Throughput Sequencing Based Methods of RNA Structure Investigation

    Kielpinski, Lukasz Jan

    describe several computational methods. One that alleviates PCR bias by estimating number of unique molecules existing before the amplification, and two methods for data normalization: one applicable when the paired end sequencing is performed, and the other that works with the single read sequencing...

  15. Amino acid sequence of Coprinus macrorhizus peroxidase and cDNA sequence encoding Coprinus cinereus peroxidase. A new family of fungal peroxidases.

    Baunsgaard, L; Dalbøge, H; Houen, G; Rasmussen, E M; Welinder, K G

    1993-04-01

    Sequence analysis and cDNA cloning of Coprinus peroxidase (CIP) were undertaken to expand the understanding of the relationships of structure, function and molecular genetics of the secretory heme peroxidases from fungi and plants. Amino acid sequencing of Coprinus macrorhizus peroxidase, and cDNA sequencing of Coprinus cinereus peroxidase showed that the mature proteins are identical in amino acid sequence, 343 residues in size and preceded by a 20-residue signal peptide. Their likely identity to peroxidase from Arthromyces ramosus is discussed. CIP has an 8-residue, glycine-rich N-terminal extension blocked with a pyroglutamate residue which is absent in other fungal peroxidases. The presence of pyroglutamate, formed by cyclization of glutamine, and the finding of a minor fraction of a variant form lacking the N-terminal residue, indicate that signal peptidase cleavage is followed by further enzymic processing. CIP is 40-45% identical in amino-acid sequence to 11 lignin peroxidases from four fungal species, and 42-43% identical to the two known Mn-peroxidases. Like these white-rot fungal peroxidases, CIP has an additional segment of approximately 40 residues at the C-terminus which is absent in plant peroxidases. Although CIP is much more similar to horseradish peroxidase (HRP C) in substrate specificity, specific activity and pH optimum than to white-rot fungal peroxidases, the sequences of CIP and HRP C showed only 18% identity. Hence, CIP qualifies as the first member of a new family of fungal peroxidases. The nine invariant residues present in all plant, fungal and bacterial heme peroxidases are also found in CIP. The present data support the hypothesis that only one chromosomal CIP gene exists. In contrast, a large number of secretory plant and fungal peroxidases are expressed from several peroxidase gene clusters. Analyses of three batches of CIP protein and of 49 CIP clones revealed the existence of only two highly similar alleles indicating less

  16. Phylogenetic Analysis of Cynoglossidae in the Yangtze Estuary Based on Partial Sequence of Mitochondrial COⅠ

    Song Chao; Yu Yanan; Zhang Tao; Yang Gang; Zhang Longzhen

    2015-01-01

    To determine the role of mitochondrial COⅠgene in classification and identification of species,a total of 39 single individuals from nine species pertaining to two genera of Cynoglossidae in the Yangtze Estuary were barcoded by COⅠ,sequenced and compared with that of other Cynoglossidae species recorded in the Gen Bank. Total genomic DNA was extracted from each scale sample using the classic phenol / chloroform extraction method. Six hundred and fifty base pairs( bp)COⅠfragments were amplified using the primers ’i. e. ’ F1: 5’- TCA ACC AAC CAC AAA GAC ATT GGC AC- 3’,R1: 5’- TAG ACT TCT GGG TGG CCA AAG AAT CA- 3’. Every PCR amplification was performed in a total volume of 50 μL of PCR mixture. PCR products were purified and then sequenced in both forward and reverse directions using an ABI PRISMTM 3730 XL Automated Sequencer. DNA sequences were aligned with clustal W using default parameters. Base composition,variable and parsimony informative sites were determined using MEGA 5. 0. Neighbor- joining( NJ) and Maximum parsimony( MP) phylogenetic trees were constructed for COⅠhaplotypes( Kimura 2 Parameter substitution model,K2P; 1 000 bootstraps pseudoreplications) using MEGA 5. 0. Using the MEGA5. 0 software for statistical analysis,the averaged AT content was greater than the GC content( Tab. 2). The GC content of codon position 1 averaged 53. 8%( 51. 8%-57. 3%),which of position 2 for 42. 0%,and that of position 3 ranged from 28. 1% to 37. 8% in average of 32. 4%( Tab. 4). The transitional pairs( si) was slightly more than the transversional pairs( sv),and the ratio( R = si/sv) was 1. 45( Tab. 3). Analysis of the frequency of amino acids in COⅠgene encoding protein showed that the highest frequency of amino acid was leucine,and the lowest frequency of amino acid was tryptophan( Tab. 5). The average K2 P distances pairwise-species and within-species were 0. 191 and 0. 003,respectively( Tab. 6). The K2 P distance pairwise-species was 63. 7

  17. Sequence-specific high mobility group box factors recognize 10-12-base pair minor groove motifs

    van Beest, M; Dooijes, D; van De Wetering, M;

    2000-01-01

    of promoter elements controlled by the yeast genes ste11 and Rox1 has indicated strict conservation of a larger DNA motif. By site selection, we identify a highly specific 12-base pair motif for Ste11, AGAACAAAGAAA. Similarly, we show that Tcf1, MatMc, and Sox4 bind unique, highly specific DNA motifs of 12......, 12, and 10 base pairs, respectively. Footprinting with a deletion mutant of Ste11 reveals a novel interaction between the 3' base pairs of the extended DNA motif and amino acids C-terminal to the HMG domain. The sequence-specific interaction of Ste11 with these 3' base pairs contributes significantly......Sequence-specific high mobility group (HMG) box factors bind and bend DNA via interactions in the minor groove. Three-dimensional NMR analyses have provided the structural basis for this interaction. The cognate HMG domain DNA motif is generally believed to span 6-8 bases. However, alignment...

  18. Extraction of Sequence Conservation Features for the Prioritization of Candidate Single Amino Acid Polymorphisms

    Jiaxin Wu

    2011-03-01

    Full Text Available Although remarkable success has been achieved by genome-wide association (GWA studies over the past few years, genetic variants discovered in GWA studies can typically account for only a small fraction of heritability of most common diseases. As such, the identification of multiple rare variants that are associated with complex diseases has been receiving more and more attentions. However, most of the recently developed statistical approaches for detecting association of rare variants with diseases require the selection of functional variants before the successive analysis, making an effective bioinformatics method for filtering out non-relevant rare variants indispensible. In this paper, we focus on a specific type of genetic variants called single amino acid polymorphisms (SAAPs. We propose to prioritize candidate SAAPs for a specific disease according to their association scores that are calculated using a guilt-by-association model with a set of features derived from protein sequences. We validate the proposed approach in a systematic way and demonstrate that the proposed model is powerful in distinguishing disease-associated SAAPs for the specific disease of interest.

  19. Acid-base bifunctional catalytic surfaces for nucleophilic addition reactions.

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2008-09-01

    This article illustrates the modification of oxide surfaces with organic amine functional groups to create acid-base bifunctional catalysts, summarizing our previous reports and also presenting new data. Immobilization of organic amines as bases on inorganic solid-acid surfaces afforded highly active acid-base bifunctional catalysts, which enabled various organic transformations including C--C coupling reactions, though these reactions did not proceed with either the homogeneous amine precursors or the acidic supports alone. Spectroscopic characterization, such as by solid-state MAS NMR and FTIR, revealed not only the interactions between acidic and basic sites but also bifunctional catalytic reaction mechanisms.

  20. Whole genome semiconductor based sequencing of farmed European sea bass (Dicentrarchus labrax) Mediterranean genetic stocks using a DNA pooling approach.

    Bertolini, Francesca; Geraci, Claudia; Schiavo, Giuseppina; Sardina, Maria Teresa; Chiofalo, Vincenzo; Fontanesi, Luca

    2016-08-01

    European sea bass (Dicentrarchus labrax) is an important marine species for commercial and sport fisheries and aquaculture production. Recently, the European sea bass genome has been sequenced and assembled. This resource can open new opportunities to evaluate and monitor variability and identify variants that could contribute to the adaptation to farming conditions. In this work, two DNA pools constructed from cultivated European sea bass were sequenced using a next generation semiconductor sequencing approach based on Ion Proton sequencer. Using the first draft version of the D. labrax genome as reference, sequenced reads obtained a total of about 1.6 million of single nucleotide polymorphisms (SNPs), spread all over the chromosomes. Transition/transversion (Ti/Tv) was equal to 1.28, comparable to what was already reported in Salmon species. A pilot homozygosity analysis across the D. labrax genome using DNA pool sequence datasets indicated that this approach can identify chromosome regions with putative signatures of selection, including genes involved in ion transport and chloride channel functions, amino acid metabolism and circadian clock and related neurological systems. This is the first study that reported genome wide polymorphisms in a fish species obtained with the Ion Proton sequencer. Moreover, this study provided a methodological approach for selective sweep analysis in this species.

  1. STING Millennium: a web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence

    Neshich, Goran; Togawa, Roberto C.; Mancini, Adauto L.; Kuser, Paula R.; Yamagishi, Michel E. B.; Pappas, Georgios; Torres, Wellington V.; Campos, Tharsis Fonseca e; Ferreira, Leonardo L.; Luna, Fabio M.; Oliveira, Adilton G.; Miura, Ronald T.; Inoue, Marcus K.; Horita, Luiz G.; de Souza, Dimas F.; Dominiquini, Fabiana; Álvaro, Alexandre; Lima, Cleber S.; Ogawa, Fabio O.; Gomes, Gabriel B.; Palandrani, Juliana F.; dos Santos, Gabriela F.; de Freitas, Esther M.; Mattiuz, Amanda R.; Costa, Ivan C.; de Almeida, Celso L.; Souza, Savio; Baudet, Christian; Higa, Roberto H.

    2003-01-01

    STING Millennium Suite (SMS) is a new web-based suite of programs and databases providing visualization and a complex analysis of molecular sequence and structure for the data deposited at the Protein Data Bank (PDB). SMS operates with a collection of both publicly available data (PDB, HSSP, Prosite) and its own data (contacts, interface contacts, surface accessibility). Biologists find SMS useful because it provides a variety of algorithms and validated data, wrapped-up in a user friendly web interface. Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment, indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts and Cα–Cα and Cβ–Cβ distance geometry. Specific emphasis in SMS is given to interface forming residues (IFR)—amino acids that define the interactive portion of the protein surfaces. SMS may simultaneously display and analyze previously superimposed structures. PDB updates trigger SMS updates in a synchronized fashion. SMS is freely accessible for public data at http://www.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS and http://trantor.bioc.columbia.edu/SMS. PMID:12824333

  2. Cloning, DNA sequencing and heterologous expression of the gene for thermostable N-acylamino acid racemase from Amycolatopsis sp. TS-1-60 in Escherichia coli.

    Tokuyama, S; Hatano, K

    1995-03-01

    The gene encoding the novel enzyme N-acylamino acid racemase (AAR) was cloned in recombinant phage lambda-4 from the DNA library of Amycolatopsis sp. TS-1-60, a rare actinomycete, using antiserum against the enzyme. The cloned gene was subcloned and transformed in Escherichia coli JM105 using pUC118 as a vector. The AAR gene consists of an open-reading frame of 1104 nucleotides, which specifies a 368-amino-acid protein with a molecular mass of 39411Da. The molecular mass deduced from the AAR gene is in good agreement with the subunit molecular mass (40kDa) of AAR from Amycolatopsis sp. TS-1-60. The guanosine plus cytosine content of the AAR gene was about 70%. Although the AAR gene uses the unusual initiation codon GTG, the gene was expressed in Escherichia coli using the lac promoter of pUC118. The amount of the enzyme produced by the transformant was 16 times that produced by Amycolatopsis sp. TS-1-60. When the unusual initiation codon GTG was changed to ATG, the enzyme productivity of the transformant increased to more than 37 times that of Amycolatopsis sp. TS-1-60. In the comparison of the DNA sequence and the deduced amino acid sequence of AAR with those of known racemases and epimerases in data bases, no significant sequence homology was found. However, AAR resembles mandelate racemase in that requires metal ions for enzyme activity.(ABSTRACT TRUNCATED AT 250 WORDS)

  3. Sequence and structure-based prediction of fructosyltransferase activity for functional subclassification of fungal GH32 enzymes.

    Trollope, Kim M; van Wyk, Niël; Kotjomela, Momo A; Volschenk, Heinrich

    2015-12-01

    Sucrolytic enzymes catalyse sucrose hydrolysis or the synthesis of fructooligosaccharides (FOSs), a prebiotic in human and animal nutrition. FOS synthesis capacity differs between sucrolytic enzymes. Amino-acid-sequence-based classification of FOS synthesizing enzymes would greatly facilitate the in silico identification of novel catalysts, as large amounts of sequence data lie untapped. The development of a bioinformatics tool to rapidly distinguish between high-level FOSs synthesizing predominantly sucrose hydrolysing enzymes from fungal genomic data is presented. Sequence comparison of functionally characterized enzymes displaying low- and high-level FOS synthesis revealed conserved motifs unique to each group. New light is shed on the sequence context of active site residues in three previously identified conserved motifs. We characterized two enzymes predicted to possess low- and high-level FOS synthesis activities based on their conserved motif sequences. FOS data for the enzymes confirmed our successful prediction of their FOS synthesis capacity. Structural comparison of enzymes displaying low- and high-level FOS synthesis identified steric hindrance between nystose and a long loop region present only in low-level FOS synthesizers. This loop is proposed to limit the synthesis of FOS species with higher degrees of polymerization, a phenomenon observed among enzymes displaying low-level FOS synthesis. Conserved sequence motifs surrounding catalytic residues and a distant structural determinant were identifiers of FOS synthesis capacity and allow for functional annotation of sucrolytic enzymes directly from amino acid sequence. The tool presented may also be useful to study the structure-function relationships of β-fructofuranosidases by identifying mutations present in a group of closely related enzymes displaying similar function.

  4. PHARMACOGENETIC TESTING OPPORTUNITIES IN CARDIOLOGY BASED ON EXOME SEQUENCING

    N. V. Shcherbakova

    2014-01-01

    Full Text Available Aim. To study what cardiac drugs currently have any comments on biomarkers and what information can be obtained by pharmacogenetic testing using data exome sequencing in patients with cardiac diseases.Material and methods. Exome sequencing in random participant of the ATEROGEN IVANOVO study and bioinformatics analysis of the data were performed. Point mutations were annotated using ANNOVAR program, as well as comparison with a number of specialized databases was done on the basis of user protocols.Results. 11 cardiac drugs and 7 genes which variants can influence cardiac drug metabolism were analyzed. According to exome sequencing of the participant we did not reveal allelic variants that require dose regime correction and careful efficacy control.Conclusion. The exome sequencing application is the next step to a wide range of personalized therapy. Future opportunities for improvement of the risk-benefit ratio in each patient are the main purpose of the collection and analysis of pharmacogenetic data.

  5. Phylogenetic relationships of Salmonella based on rRNA sequences

    Christensen, H.; Nordentoft, Steen; Olsen, J.E.

    1998-01-01

    To establish the phylogenetic relationships between the subspecies of Salmonella enterica (official name Salmonella choleraesuis), Salmonella bongori and related members of Enterobacteriaceae, sequence comparison of rRNA was performed by maximum-likelihood analysis. The two Salmonella species wer...

  6. EST sequences and their annotation (amino acid sequence and results of homology search) - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Full Text Available and VS) derived from five developmental stages. Clone ID ID of cDNA clone Atlas ID ID of Atlas database ( ht...tp://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ) and link to Atlas database NBRP ID ID of cDNA c...ir annotations (amino acid sequence, homology search results (with target DBs: dicty EST-DB, DNA-DB and prot...ein-DB)). Links to the Atlas database ( http://dictycdb.biol.tsukuba.ac.jp/~tools/bin/ISH/index.html ), whic

  7. ACID-BASE INTERACTIONS BETWEEN POLYMERS AND FILLERS

    WANG Qingguo; CHEN Fute; HUANG Yuanfu; ZHOU Qingli

    1987-01-01

    Inverse gas chromatography(IGC) and Fourier-transform infrared (FT-IR) techniques were applied to determining the relative acid-base strength of polymers and coupling agents. The acid-base characteristics of fillers such as CaCO3 could be altered by treatment with different coupling agents. It was shown that some mechanical properties of filled polymers were obviously associated with acid-base interactions between polymers and fillers.

  8. A novel chaos-based image encryption algorithm using DNA sequence operations

    Chai, Xiuli; Chen, Yiran; Broyde, Lucie

    2017-01-01

    An image encryption algorithm based on chaotic system and deoxyribonucleic acid (DNA) sequence operations is proposed in this paper. First, the plain image is encoded into a DNA matrix, and then a new wave-based permutation scheme is performed on it. The chaotic sequences produced by 2D Logistic chaotic map are employed for row circular permutation (RCP) and column circular permutation (CCP). Initial values and parameters of the chaotic system are calculated by the SHA 256 hash of the plain image and the given values. Then, a row-by-row image diffusion method at DNA level is applied. A key matrix generated from the chaotic map is used to fuse the confused DNA matrix; also the initial values and system parameters of the chaotic system are renewed by the hamming distance of the plain image. Finally, after decoding the diffused DNA matrix, we obtain the cipher image. The DNA encoding/decoding rules of the plain image and the key matrix are determined by the plain image. Experimental results and security analyses both confirm that the proposed algorithm has not only an excellent encryption result but also resists various typical attacks.

  9. Antibiotic Selection Pressure Determination through Sequence-Based Metagenomics.

    Willmann, Matthias; El-Hadidi, Mohamed; Huson, Daniel H; Schütz, Monika; Weidenmaier, Christopher; Autenrieth, Ingo B; Peter, Silke

    2015-12-01

    The human gut forms a dynamic reservoir of antibiotic resistance genes (ARGs). Treatment with antimicrobial agents has a significant impact on the intestinal resistome and leads to enhanced horizontal transfer and selection of resistance. We have monitored the development of intestinal ARGs over a 6-day course of ciprofloxacin (Cp) treatment in two healthy individuals by using sequenced-based metagenomics and different ARG quantification methods. Fixed- and random-effect models were applied to determine the change in ARG abundance per defined daily dose of Cp as an expression of the respective selection pressure. Among various shifts in the composition of the intestinal resistome, we found in one individual a strong positive selection for class D beta-lactamases which were partly located on a mobile genetic element. Furthermore, a trend to a negative selection has been observed with class A beta-lactamases (-2.66 hits per million sample reads/defined daily dose; P = 0.06). By 4 weeks after the end of treatment, the composition of ARGs returned toward their initial state but to a different degree in both subjects. We present here a novel analysis algorithm for the determination of antibiotic selection pressure which can be applied in clinical settings to compare therapeutic regimens regarding their effect on the intestinal resistome. This information is of critical importance for clinicians to choose antimicrobial agents with a low selective force on their patients' intestinal ARGs, likely resulting in a diminished spread of resistance and a reduced burden of hospital-acquired infections with multidrug-resistant pathogens.

  10. Molecular cloning and chromosomal localization of the nucleic acid sequences encoding the cerebrovascular and plaque amyloid peptide

    Robakis, N.K.; Ramakrishna, N.; Wolfe, G.; Wisniewski, H.M.

    1987-05-01

    Amyloid deposits in vessels and neuritic plaques are found in large numbers in the brains of Alzheimer's Disease (AD) and adult Downs Syndrome (DS) patients. The partial amino acid sequence of the amyloid peptide has been determined. They used this amino acid sequence to synthesize an oligonucleotide probe specific for the amyloid peptide gene. Screening of a human brain cDNA library with this probe, yielded a clone which contained an insert 1.8 kb. This clone contains a long open reading frame including a region which encodes the 28 amino acids of the amyloid peptide. Northern blots of human brain mRNA detected a transcript of 3.3 kb long which hybridized to their cDNA clone. A similar mRNA was detected in the hamster, mouse, sheep and rabbit brains. Southern blots under stringent hybridization conditions detected sequences homologous to the amyloid gene in the genomes of hamster, mouse, sheep and rabbit suggesting that this gene has been conserved during mammalian evolution. Hybridization under reduced stringency revealed the presence of additional sequences related to the amyloid gene in the genome of the above organisms. Hybridization analysis of human x chinese hamster cell lines DNA showed that the gene encoding the amyloid peptide is located on chromosome 21, suggesting a genetic relationship between AD and DS.

  11. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  12. Next-generation re-sequencing of genes involved in increased platelet reactivity in diabetic patients on acetylsalicylic acid.

    Postula, Marek; Janicki, Piotr K; Eyileten, Ceren; Rosiak, Marek; Kaplon-Cieslicka, Agnieszka; Sugino, Shigekazu; Wilimski, Radosław; Kosior, Dariusz A; Opolski, Grzegorz; Filipiak, Krzysztof J; Mirowska-Guzel, Dagmara

    2016-06-01

    The objective of this study was to investigate whether rare missense genetic variants in several genes related to platelet functions and acetylsalicylic acid (ASA) response are associated with the platelet reactivity in patients with diabetes type 2 (T2D) on ASA therapy. Fifty eight exons and corresponding introns of eight selected genes, including PTGS1, PTGS2, TXBAS1, PTGIS, ADRA2A, ADRA2B, TXBA2R, and P2RY1 were re-sequenced in 230 DNA samples from T2D patients by using a pooled PCR amplification and next-generation sequencing by Illumina HiSeq2000. The observed non-synonymous variants were confirmed by individual genotyping of 384 DNA samples comprising of the individuals from the original discovery pools and additional verification cohort of 154 ASA-treated T2DM patients. The association between investigated phenotypes (ASA induced changes in platelets reactivity by PFA-100, VerifyNow and serum thromboxane B2 level [sTxB2]), and accumulation of rare missense variants (genetic burden) in investigated genes was tested using statistical collapsing tests. We identified a total of 35 exonic variants, including 3 common missense variants, 15 rare missense variants, and 17 synonymous variants in 8 investigated genes. The rare missense variants exhibited statistically significant difference in the accumulation pattern between a group of patients with increased and normal platelet reactivity based on PFA-100 assay. Our study suggests that genetic burden of the rare functional variants in eight genes may contribute to differences in the platelet reactivity measured with the PFA-100 assay in the T2DM patients treated with ASA.

  13. Quinoline based receptor in fluorometric discrimination of carboxylic acids

    2008-12-01

    Full Text Available Quinoline and naphthalene-based fluororeceptors 1 and 2 have been designed and synthesized for detection of hydroxy carboxylic acids in less polar solvents. The receptor 1 shows monomer emission quenching followed by excimer emission upon hydrogen bond-mediated complexation of carboxylic acids. The excimer emission distinguishes aromatic dicarboxylic acids from aliphatic dicarboxylic acids and even long chain aliphatic dicarboxylic acids from short chain aliphatic dicarboxylic acids. The receptor 1 is found to be selective for citric acid with a strong excimer emission in CHCl3. On the contrary, the receptor 2 exhibited less binding constant value and did not form any excimer upon complexation with the same acids under similar conditions. This established the role of quinoline ring nitrogen in binding with the acids.

  14. A Concise Synthesis of Glycolipids Based on Aspartic Acid Building Blocks

    Lorna Abbey

    2012-09-01

    Full Text Available L-Aspartic acid building blocks bearing galactosyl moieties were used to synthesise glycolipid mimetics of variable hydrocarbon chain length. The glycolipids were readily prepared through amide bond formation using the TBTU/HOBt coupling methodology. It was observed that, under these conditions, activation of the α-carboxylic acid of the intermediates led to near complete racemisation of the chiral centre if the reaction was carried out in the presence of a base such as triethylamine. The enantiomerically pure glycolipids were obtained after careful consideration of the synthetic sequence and by performing the coupling reactions in the absence of base.

  15. Enhanced Acid/Base Catalysis in High Temperature Liquid Water

    Xiu Yang LU; Qi JING; Zhun LI; Lei YUAN; Fei GAO; Xin LIU

    2006-01-01

    Two novel and environmentally benign solvent systems, organic acids-enriched high temperature liquid water (HTLW) and NH3-enriched HTLW, were developed, which can enhance the reaction rate of acid/base-catalyzed organic reactions in HTLW. We investigated the decomposition of fructose in organic acids-enriched HTLW, hydrolysis of cinnamaldehyde and aldol condensation of phenylaldehyde with acetaldehyde in NH3-enriched HTLW. The experimental results demonstrated that organic acids-enriched or NH3-enriched HTLW can greatly accelerate acid/base-catalyzed organic reactions in HTLW.

  16. Acid-base strengths in pyridine

    Bos, M.; Dahmen, E.A.M.F.

    1971-01-01

    Although pyridine is a solvent with a low dielectric constant, spectrophotometric determinations show simple dissociation without ion pairs as intermediates for some sulfonphthaleins and polynitrophenols in pyridine. The salts of a number of amines and hydrochloric acid, perchloric acid and picric

  17. Random Coding Bounds for DNA Codes Based on Fibonacci Ensembles of DNA Sequences

    2008-07-01

    COVERED (From - To) 6 Jul 08 – 11 Jul 08 4. TITLE AND SUBTITLE RANDOM CODING BOUNDS FOR DNA CODES BASED ON FIBONACCI ENSEMBLES OF DNA SEQUENCES ... sequences which are generalizations of the Fibonacci sequences . 15. SUBJECT TERMS DNA Codes, Fibonacci Ensembles, DNA Computing, Code Optimization 16...coding bound on the rate of DNA codes is proved. To obtain the bound, we use some ensembles of DNA sequences which are generalizations of the Fibonacci

  18. Recognition of Nucleic Acid Junctions Using Triptycene-Based Molecules

    Barros, Stephanie A.; Chenoweth, David M.

    2014-01-01

    Nucleic acid modulation by small molecules is an essential process across the kingdoms of life. Targeting nucleic acids with small molecules represents a significant challenge at the forefront of chemical biology. Nucleic acid junctions are ubiquitous structural motifs in nature and in designed materials. Herein, we describe a new class of structure specific nucleic acid junction stabilizers based on a triptycene scaffold. Triptycenes provide significant stabilization of DNA and RNA three-way...

  19. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  20. Acid-Base Pairs in Lewis Acidic Zeolites Promote Direct Aldol Reactions by Soft Enolization.

    Lewis, Jennifer D; Van de Vyver, Stijn; Román-Leshkov, Yuriy

    2015-08-17

    Hf-, Sn-, and Zr-Beta zeolites catalyze the cross-aldol condensation of aromatic aldehydes with acetone under mild reaction conditions with near quantitative yields. NMR studies with isotopically labeled molecules confirm that acid-base pairs in the Si-O-M framework ensemble promote soft enolization through α-proton abstraction. The Lewis acidic zeolites maintain activity in the presence of water and, unlike traditional base catalysts, in acidic solutions.

  1. Novel Sequence Number Based Secure Authentication Scheme for Wireless LANs

    Rajeev Singh; Teek Parval Sharma

    2015-01-01

    Authentication per frame is an implicit necessity for security in wireless local area networks (WLANs). We propose a novel per frame secure authentication scheme which provides authentication to data frames in WLANs. The scheme involves no cryptographic overheads for authentication of frames. It utilizes the sequence number of the frame along with the authentication stream generators for authentication. Hence, it requires no extra bits or messages for the authentication purpose and also no change in the existing frame format is required. The scheme provides authentication by modifying the sequence number of the frame at the sender, and that the modification is verified at the receiver. The modified sequence number is protected by using the XOR operation with a random number selected from the random stream. The authentication is lightweight due to the fact that it requires only trivial arithmetic operations like the subtraction and XOR operation.

  2. Whole-genome sequence-based analysis of thyroid function

    Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J.; Traglia, Michela; Brown, Suzanne J.; Mullin, Benjamin H; Shihab, Hashem A.; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R.; Beilby, John P.; Charoen, Pimphen

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 1...

  3. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences

    Tully, Philip J; Lindén, Henrik; Hennig, Matthias H;

    2016-01-01

    of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx). We show that the learning...... and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times...

  4. Predicting tissue-specific expressions based on sequence characteristics

    Paik, Hyojung

    2011-04-30

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  5. Ionisation constants of inorganic acids and bases in aqueous solution

    Perrin, D D

    2013-01-01

    Ionisation Constants of Inorganic Acids and Bases in Aqueous Solution, Second Edition provides a compilation of tables that summarize relevant data recorded in the literature up to the end of 1980 for the ionization constants of inorganic acids and bases in aqueous solution. This book includes references to acidity functions for strong acids and bases, as well as details about the formation of polynuclear species. This text then explains the details of each column of the tables, wherein column 1 gives the name of the substance and the negative logarithm of the ionization constant and column 2

  6. Tentative revision of the global Pliocene-Pleistocene sequences based on the sequence stratigraphy in the Gulf of Mexico

    Wornardt, W.W. Jr. (Micro-Strat, Inc., Houston, TX (United States) Rice Univ., Houston, TX (United States)); Vail, P.R. (Rice Univ., Houston, TX (United States))

    1991-08-01

    The Pliocene-Pleistocene sequence chronostratigraphy presented in this paper is based on a vast amount of data obtained from more than 100 wells drilled over the past eight years in the south additions and deep-water areas in offshore Texas and Louisiana, Gulf of Mexico. This high-resolution biostratigraphic data base consists of individual checklist with the abundance, diversity, occurrences, and ranges of planktonic and benthic foraminifers and calcareous nannofossils plotted against depth. The benthic foraminifers have been interpreted largely for their paleobathymetric significance and result in a water-depth curve for each well studied. These wells have been further calibrated by having a portion of the study wells tied to sequence stratigraphic interpretations of seismic record sections through a two-way-time log or synthetic seismogram. The Pliocene and Pleistocene is tentatively divided into 14 fourth-order sequences from 3.0 to 0 Ma and three third-order cycles from 5.5 to 3.0 Ma. Each of the cycles is bounded by a sequence boundary and has an age-dateable maximum flooding surface. Depending on location, each sequence may have lowstand, transgressive, and highstand systems tracts within the basin. The age (Ma) of the sequence boundaries are third-order, 5.5, 4.2, 3.8, fourth-order, 3.0, 2.6, 2.4, 1.86, 1.4., 1.0, 0.82, 0.72, 0.62, 0.52, 0.42., 0.32, 0.22, 0.12, and 0.02. Within these sequence boundaries are the 5.0, 4.0, 3.4, 2.7, 2.45, 2.0, 1.47, 1.3, 0.92, 0.76, 0.66, 0.56, 0.46, 0.36, 0.26, 0.16, and 0.06 Ma maximum flooding surfaces, respectively. All of the condensed sections associated with the maximum flooding surfaces, systems tract boundaries, and sequences boundaries in the Pliocene-Pleistocene can be recognized and traced on well logs and seismic record sections in the offshore Texas and Louisiana areas.

  7. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  8. Measurement of word frequencies in genomic DNA sequences based on partial alignment and fuzzy set.

    Shida, Fumiya; Mizuta, Satoshi

    2014-08-01

    Accompanied with the rapid increase of the amount of data registered in the databases of biological sequences, the need for a fast method of sequence comparison applicable to sequences of large size is also increasing. In general, alignment is used for sequence comparison. However, the alignment may not be appropriate for comparison of sequences of large size such as whole genome sequences due to its large time complexity. In this article, we propose a semi alignment-free method of sequence comparison based on word frequency distributions, in which we partially use the alignment to measure word frequencies along with the idea of fuzzy set theory. Experiments with ten bacterial genome sequences demonstrated that the fuzzy measurements has the effect that facilitates discrimination between close relatives and distant relatives.

  9. A DNA Structure-Based Bionic Wavelet Transform and Its Application to DNA Sequence Analysis

    Fei Chen

    2003-01-01

    Full Text Available DNA sequence analysis is of great significance for increasing our understanding of genomic functions. An important task facing us is the exploration of hidden structural information stored in the DNA sequence. This paper introduces a DNA structure-based adaptive wavelet transform (WT – the bionic wavelet transform (BWT – for DNA sequence analysis. The symbolic DNA sequence can be separated into four channels of indicator sequences. An adaptive symbol-to-number mapping, determined from the structural feature of the DNA sequence, was introduced into WT. It can adjust the weight value of each channel to maximise the useful energy distribution of the whole BWT output. The performance of the proposed BWT was examined by analysing synthetic and real DNA sequences. Results show that BWT performs better than traditional WT in presenting greater energy distribution. This new BWT method should be useful for the detection of the latent structural features in future DNA sequence analysis.

  10. Angiosperm phylogeny based on matK sequence information

    Hilu, K.W.; Borsch, T.; Müller, K.; Soltis, D.E.; Savolainen, V.; Chase, M.W.; Powell, M.; Alice, L.A.; Evans, R.; Sauquet, H.; Neinhuis, C.; Slotta, T.A.B.; Rohwer, J.G.; Campbell, C.; Chatrou, L.W.

    2003-01-01

    Plastid matK gene sequences for 374 genera representing all angiosperm orders and 12 genera of gymnosperms were analyzed using parsimony (MP) and Bayesian inference (BI) approaches. Traditionally, slowly evolving genomic regions have been preferred for deep-level phylogenetic inference in angiosperm

  11. Customer Clustering Based on Customer Purchasing Sequence Data

    Yen-Chung Liu

    2017-01-01

    Full Text Available Customer clustering has become a priority for enterprises because of the importance of customer relationship management. Customer clustering can improve understanding of the composition and characteristics of customers, thereby enabling the creation of appropriate marketing strategies for each customer group. Previously, different customer clustering approaches have been proposed according to data type, namely customer profile data, customer value data, customer transaction data, and customer purchasing sequence data. This paper considers the customer clustering problem in the context of customer purchasing sequence data. However, two major aspects distinguish this paper from past research: (1 in our model, a customer sequence contains itemsets, which is a more realistic configuration than previous models, which assume a customer sequence would merely consist of items; and (2 in our model, a customer may belong to multiple clusters or no cluster, whereas in existing models a customer is limited to only one cluster. The second difference implies that each cluster discovered using our model represents a crucial type of customer behavior and that a customer can exhibit several types of behavior simultaneously. Finally, extensive experiments are conducted through a retail data set, and the results show that the clusters obtained by our model can provide more accurate descriptions of customer purchasing behaviors.

  12. Whole-genome sequence-based analysis of thyroid function

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome seque...

  13. Identification of the new HLA-DRB1{sup *}0812 allele detected by sequencing based typing

    Versluis, L.F.; Zwan, A.W. van der; Tilanus, M.G.J. [Univ. Hospital Utrecht (Netherlands); Savelkoul, P.H.M.; Berg-Loonen, E.M. van den [Univ. Hospital Maastricht (Netherlands)] [and others

    1996-12-31

    HLA-DRB typing by polymerase chain reaction-sequence specific priming (PCR-SSP) and sequencing based typing (SBT) was studied within the framework of the Antigen and Haplotype Society 11 and the Sequencing Based Typing Component of the Twelfth International HLA workshop. Sequencing was performed as described by McGinnis and co-workers in 1995 on coded samples, including most DR2 subtypes, resulting in high resolution HLA-DR typing. Sequences were compared with a database containing 107 DRB1, four DRB3, and five DRB5 alleles in a similar way as described for HLA-DPB. One sample showed a new DR8 sequence, indicating the presence of a new allele. This individual (4390) is of Indonesian origin. The specific amplification of the DR8 allele and subsequent sequencing resulted in a sequence which did not match the database and new polymorphism was identified. The complementary strand was sequenced and confirmed the presence of a new DRB1 allele. Cloning and subsequent sequencing of the polymerase chain reaction fragment resulted in confirmation of the direct sequence data. Later this variant was officially named DRB1{sup *}0812. The complete nucleotide sequence of exon 2 of this new allele is shown. This allele differs from DRB1{sup *}0810 by one nucleotide at codon 85, resulting in an alanine (GTT), whereas DRB1{sup *}0810 carries a valine (GCT). 5 refs., 1 fig.

  14. Bayesian prediction of bacterial growth temperature range based on genome sequences

    Jensen Dan B

    2012-12-01

    Full Text Available Abstract Background The preferred habitat of a given bacterium can provide a hint of which types of enzymes of potential industrial interest it might produce. These might include enzymes that are stable and active at very high or very low temperatures. Being able to accurately predict this based on a genomic sequence, would thus allow for an efficient and targeted search for production organisms, reducing the need for culturing experiments. Results This study found a total of 40 protein families useful for distinction between three thermophilicity classes (thermophiles, mesophiles and psychrophiles. The predictive performance of these protein families were compared to those of 87 basic sequence features (relative use of amino acids and codons, genomic and 16S rDNA AT content and genome size. When using naïve Bayesian inference, it was possible to correctly predict the optimal temperature range with a Matthews correlation coefficient of up to 0.68. The best predictive performance was always achieved by including protein families as well as structural features, compared to either of these alone. A dedicated computer program was created to perform these predictions. Conclusions This study shows that protein families associated with specific thermophilicity classes can provide effective input data for thermophilicity prediction, and that the naïve Bayesian approach is effective for such a task. The program created for this study is able to efficiently distinguish between thermophilic, mesophilic and psychrophilic adapted bacterial genomes.

  15. Safety assessment of Bifidobacterium longum J DM301 based on complete genome sequences

    Yan-Xia Wei; Zhuo-Yang Zhang; Chang Liu; Xiao-Kui Guo; Pradeep K Malakar

    2012-01-01

    AIM: To assess the safety of Bifidobacterium longum (B.longum) JDM301 based on complete genome sequences. METHODS: The complete genome sequences of JDM301 were determined using the GS 20 system. Putative virulence factors, putative antibiotic resistance genes and genes encoding enzymes responsible for harmful metabolites were identified by blast with virulence factors database, antibiotic resistance genes database and genes associated with harmful metabolites in previous reports. Minimum inhibitory concentration of 16 common antimicrobial agents was evaluated by E-test. RESULTS: JDM301 was shown to contain 36 genes associated with antibiotic resistance, 5 enzymes related to harmful metabolites and 162 nonspecific virulence factors mainly associated with transcriptional regulation, adhesion, sugar and amino acid transport. B. longum JDM301 was intrinsically resistant tocipro ciprofloxacin,amikacin, gentamicin and streptomycin and susceptible to vancomycin, amoxicillin, cephalothin, chloramphenicol, erythromycin, ampicillin, cefotaxime, rifampicin, imipenemandtrimethoprim and trimethoprim-sulphamethoxazol. JDM301.JDM301 was moderately resistant to bacitracin, while an earlier study showed that bifidobacteria were susceptible to this antibiotic. A tetracycline resistance gene with the risk of transfer was found in JDM301, which needs to be experimentally validated. CONCLUSION: The safety assessment of JDM301 using information derived from complete bacterial genome will contribute to a wider and deeper insight into the safety of probiotic bacteria.

  16. An amphipathic trans-acting phosphorothioate DNA element delivers uncharged PNA and PMO nucleic acid sequences in mammalian cells.

    Jain, Harsh V; Beaucage, Serge L

    An innovative approach to the delivery of uncharged peptide nucleic acids (PNA) and phosphorodiamidate morpholino (PMO) oligomers in mammalian cells is described and consists of extending the sequence of those oligomers with a short PNA-polyA or PMO-polyA tail. Recognition of the polyA-tailed PNA or PMO oligomers by an amphipathic trans-acting polythymidylic thiophosphate triester element (dTtaPS) results in efficient internalization of those oligomers in several cell lines. Our findings indicate that cellular uptake of the oligomers occurs through an energy-dependent mechanism and macropinocytosis appears to be the predo-minant endocytic pathway used for internalization. The functionality of the internalized oligomers is demonstrated by alternate splicing of the pre-mRNA encoding luciferase in HeLa pLuc 705 cells. Amphipathic phosphorothioate DNA elements may represent a unique class of cellular transporters for robust delivery of uncharged nucleic acid sequences in live mammalian cells.

  17. A new representation of acid-base disturbances

    M. Hekking (Marcel); E.S. Gelsema; J. Lindemans (Jan)

    1994-01-01

    textabstractThe acid-base status of intensive care patients is monitored on the basis of three quantities. The graphical representation which may be of help for the monitoring task is therefore cumbersome. The classical Siggaard-Andersen acid-base chart is such a representation, but it is only suite

  18. A Closer Look at Acid-Base Olfactory Titrations

    Neppel, Kerry; Oliver-Hoyo, Maria T.; Queen, Connie; Reed, Nicole

    2005-01-01

    Olfactory titrations using raw onions and eugenol as acid-base indicators are reported. An in-depth investigation on olfactory titrations is presented to include requirements for potential olfactory indicators and protocols for using garlic, onions, and vanillin as acid-base olfactory indicators are tested.

  19. What is the Ultimate Goal in Acid-Base Regulation?

    Balakrishnan, Selvakumar; Gopalakrishnan, Maya; Alagesan, Murali; Prakash, E. Sankaranarayanan

    2007-01-01

    It is common to see chapters on acid-base physiology state that the goal of acid-base regulatory mechanisms is to maintain the pH of arterial plasma and not arterial PCO [subscript 2] (Pa[subscript CO[subscript 2

  20. The Roles of Acids and Bases in Enzyme Catalysis

    Weiss, Hilton M.

    2007-01-01

    Many organic reactions are catalyzed by strong acids or bases that protonate or deprotonate neutral reactants leading to reactive cations or anions that proceed to products. In enzyme reactions, only weak acids and bases are available to hydrogen bond to reactants and to transfer protons in response to developing charges. Understanding this…

  1. Asynchronous symmetry-based sequences for homonuclear dipolar recoupling in solid-state nuclear magnetic resonance

    Tan, Kong Ooi; Ernst, Matthias, E-mail: madhu@tifr.res.in, E-mail: maer@ethz.ch [Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich (Switzerland); Rajeswari, M. [Department of Chemical Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai 400 005 (India); Madhu, P. K., E-mail: madhu@tifr.res.in, E-mail: maer@ethz.ch [Department of Chemical Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai 400 005 (India); TIFR Centre for Interdisciplinary Sciences, 21 Brundavan Colony, Narsingi, Hyderabad 500 075 (India)

    2015-02-14

    We show a theoretical framework, based on triple-mode Floquet theory, to analyze recoupling sequences derived from symmetry-based pulse sequences, which have a non-vanishing effective field and are not rotor synchronized. We analyze the properties of one such sequence, a homonuclear double-quantum recoupling sequence derived from the C7{sub 2}{sup 1} sequence. The new asynchronous sequence outperforms the rotor-synchronized version for spin pairs with small dipolar couplings in the presence of large chemical-shift anisotropy. The resonance condition of the new sequence is analyzed using triple-mode Floquet theory. Analytical calculations of second-order effective Hamiltonian are performed to compare the efficiency in suppressing second-order cross terms. Experiments and numerical simulations are shown to corroborate the results of the theoretical analysis.

  2. Connecting Acids and Bases with Encapsulation... and Chemistry with Nanotechnology

    Criswell, Brett

    2007-01-01

    The features and the development of various new acids and bases activity sets that combines chemistry with nanotechnology are being described. These sets lead to the generation of many nanotechnology-based pharmaceuticals for the treatment of various diseases.

  3. Sequence Alignment with Dynamic Divisor Generation for Keystroke Dynamics Based User Authentication

    Jiacang Ho; Dae-Ki Kang

    2015-01-01

    Keystroke dynamics based authentication is one of the prevention mechanisms used to protect one’s account from criminals’ illegal access. In this authentication mechanism, keystroke dynamics are used to capture patterns in a user typing behavior. Sequence alignment is shown to be one of effective algorithms for keystroke dynamics based authentication, by comparing the sequences of keystroke data to detect imposter’s anomalous sequences. In previous research, static divisor has been used for s...

  4. A Hybrid Time Synchronization Algorithm Based on Broadcast Sequencing for Wireless Sensor Networks

    2014-09-01

    sequence per the flow charts detailed in Figures 43–45 located in Appendix A. The input 1 in Figure 12 is a recursive step from some of the...SYNCHRONIZATION ALGORITHM BASED ON BROADCAST SEQUENCING FOR WIRELESS SENSOR NETWORKS by Sung C. Park September 2014 Thesis Co-Advisors...REPORT TYPE AND DATES COVERED Master’s Thesis 4. TITLE AND SUBTITLE A HYBRID TIME SYNCHRONIZATION ALGORITHM BASED ON BROADCAST SEQUENCING FOR

  5. MR-based attenuation correction in brain PET based on UTE sequences

    Cabello, Jorge; Nekolla, Stephan G; Ziegler, Sibylle I [Department of Nuclear Medicine, Klinikum rechts der Isar, Technische Universität München (Germany)

    2014-07-29

    Attenuation correction (AC) in brain PET/MR has recently emerged as one of the challenging tasks in the PET/MR field. It has been shown that to ignore the attenuation produced by bone can lead to errors ranging from 5-30% in regions close to bone structures. Since the information provided by the MR signal is not directly related to tissue attenuation, alternative methods have to be developed. Signal from bone tissue is difficult to measure given its short transverse relaxation time (T2). Ultrashort-echo time (UTE) pulse sequences were developed to measure signal from tissues with short T2. A combination of two consecutive UTE echoes has been used in several works to measure signal from bone tissue. The first echo is able to measure signal from bone tissue in addition to soft tissue, while the second echo contains most of the soft tissue contained in the first echo but not bone. In this work we extract the attenuation information from the difference between the logarithm of two images obtained after applying two consecutive UTE pulse sequences using the mMR scanner (Siemens Healthcare). Subsequently, image processing techniques are applied to reduce the noise and extract air cavities within the head. The resulting image is converted to linear attenuation coefficients, generating what is known as µ-map, to be used during reconstruction. For comparison purposes PET/CT scans of the same patients were acquired prior to the PET/MR scan. Additional µ-maps obtained for comparison were extracted from a Dixon sequence (used in clinical routine) and an additional µ-map calculated by the scanner based on UTE pulse sequences. Preliminary quantitative results measured in the cerebellum, using the value obtained with CT-based AC as reference, show differences of 34% without AC, 13% using the Dixon-based and UTE-based provided by the scanner, and 0.8% with the AC strategy presented here.

  6. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    Joungmin Lee

    2016-06-01

    Full Text Available Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755, which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain.

  7. A novel chaotic based image encryption using a hybrid model of deoxyribonucleic acid and cellular automata

    Enayatifar, Rasul; Sadaei, Hossein Javedani; Abdullah, Abdul Hanan; Lee, Malrey; Isnin, Ismail Fauzi

    2015-08-01

    Currently, there are many studies have conducted on developing security of the digital image in order to protect such data while they are sending on the internet. This work aims to propose a new approach based on a hybrid model of the Tinkerbell chaotic map, deoxyribonucleic acid (DNA) and cellular automata (CA). DNA rules, DNA sequence XOR operator and CA rules are used simultaneously to encrypt the plain-image pixels. To determine rule number in DNA sequence and also CA, a 2-dimension Tinkerbell chaotic map is employed. Experimental results and computer simulations, both confirm that the proposed scheme not only demonstrates outstanding encryption, but also resists various typical attacks.

  8. Spherical Nucleic Acids as Intracellular Agents for Nucleic Acid Based Therapeutics

    Hao, Liangliang

    Recent functional discoveries on the noncoding sequences of human genome and transcriptome could lead to revolutionary treatment modalities because the noncoding RNAs (ncRNAs) can be applied as therapeutic agents to manipulate disease-causing genes. To date few nucleic acid-based therapeutics have been translated into the clinic due to challenges in the delivery of the oligonucleotide agents in an effective, cell specific, and non-toxic fashion. Unmodified oligonucleotide agents are destroyed rapidly in biological fluids by enzymatic degradation and have difficulty crossing the plasma membrane without the aid of transfection reagents, which often cause inflammatory, cytotoxic, or immunogenic side effects. Spherical nucleic acids (SNAs), nanoparticles consisting of densely organized and highly oriented oligonucleotides, pose one possible solution to circumventing these problems in both the antisense and RNA interference (RNAi) pathways. The unique three dimensional architecture of SNAs protects the bioactive oligonucleotides from unspecific degradation during delivery and supports their targeting of class A scavenger receptors and endocytosis via a lipid-raft-dependent, caveolae-mediated pathway. Owing to their unique structure, SNAs are able to cross cell membranes and regulate target genes expression as a single entity, without triggering the cellular innate immune response. Herein, my thesis has focused on understanding the interactions between SNAs and cellular components and developing SNA-based nanostructures to improve therapeutic capabilities. Specifically, I developed a novel SNA-based, nanoscale agent for delivery of therapeutic oligonucleotides to manipulate microRNAs (miRNAs), the endogenous post-transcriptional gene regulators. I investigated the role of SNAs involving miRNAs in anti-cancer or anti-inflammation responses in cells and in in vivo murine disease models via systemic injection. Furthermore, I explored using different strategies to construct

  9. DeNovoID: a web-based tool for identifying peptides from sequence and mass tags deduced from de novo peptide sequencing by mass spectroscopy.

    Halligan, Brian D; Ruotti, Victor; Twigger, Simon N; Greene, Andrew S

    2005-07-01

    One of the core activities of high-throughput proteomics is the identification of peptides from mass spectra. Some peptides can be identified using spectral matching programs like Sequest or Mascot, but many spectra do not produce high quality database matches. De novo peptide sequencing is an approach to determine partial peptide sequences for some of the unidentified spectra. A drawback of de novo peptide sequencing is that it produces a series of ordered and disordered sequence tags and mass tags rather than a complete, non-degenerate peptide amino acid sequence. This incomplete data is difficult to use in conventional search programs such as BLAST or FASTA. DeNovoID is a program that has been specifically designed to use degenerate amino acid sequence and mass data derived from MS experiments to search a peptide database. Since the algorithm employed depends on the amino acid composition of the peptide and not its sequence, DeNovoID does not have to consider all possible sequences, but rather a smaller number of compositions consistent with a spectrum. DeNovoID also uses a geometric indexing scheme that reduces the number of calculations required to determine the best peptide match in the database. DeNovoID is available at http://proteomics.mcw.edu/denovoid.

  10. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  11. General base-general acid catalysis by terpenoid cyclases.

    Pemberton, Travis A; Christianson, David W

    2016-07-01

    Terpenoid cyclases catalyze the most complex reactions in biology, in that more than half of the substrate carbon atoms often undergo changes in bonding during the course of a multistep cyclization cascade that proceeds through multiple carbocation intermediates. Many cyclization mechanisms require stereospecific deprotonation and reprotonation steps, and most cyclization cascades are terminated by deprotonation to yield an olefin product. The first bacterial terpenoid cyclase to yield a crystal structure was pentalenene synthase from Streptomyces exfoliatus UC5319. This cyclase generates the hydrocarbon precursor of the pentalenolactone family of antibiotics. The structures of pentalenene synthase and other terpenoid cyclases reveal predominantly nonpolar active sites typically lacking amino acid side chains capable of serving general base-general acid functions. What chemical species, then, enables the Brønsted acid-base chemistry required in the catalytic mechanisms of these enzymes? The most likely candidate for such general base-general acid chemistry is the co-product inorganic pyrophosphate. Here, we briefly review biological and nonbiological systems in which phosphate and its derivatives serve general base and general acid functions in catalysis. These examples highlight the fact that the Brønsted acid-base activities of phosphate derivatives are comparable to the Brønsted acid-base activities of amino acid side chains.

  12. Identification Lactic acid bacteria isolated from mongolia fermented milk products bases on 16S rDNA sequence analysis and tuf-RFLP technology%利用16S rDNA序列及tuf-RFLP鉴定蒙古国发酵乳中的乳酸菌

    王宏梅; 于洁; 包秋华; 吕嫱; 孟和毕力格; 张和平

    2011-01-01

    One hundred and ten strains of lactic acid bacteria (LAB) isolated from twenty five samples of naturally fermented yoghurt in Dza-vhan of Mongolia were classified by 16S rDNA sequence analysis and tuf- RFLP. They were preliminary identified as Streptococcus thermophilus (41 strains), Lactobacillus helvetkus (40 strains), Lactobarillus delbrueckii subsp. Bulgaricus (11 strains), Lactobadllus fermentum (2 strains), Leuconostoc mesenteroides subsp. Mesenteroides (2 strains), Lactococcus lactis subsp. Lactis (1 strain) and Lactobadllus casei group (12 strains) by 16S rDNA sequence analysis. However, the 16S rDNA sequence of Lactobadllus casei group existed very small difference, tuf-RFLP was used to confirm results given by 16S rDNA sequencing. The tuf- RFLP profile pattern revealed that the twelve strains were accurately identified as Lactobadllus casei.%运用16S rDNA序列分析和tuf-RFLP技术对采于蒙古国扎布汗省的25份发酵乳样中分离出的110株乳酸菌进行鉴定.首先将分离的110株乳酸菌的16S rRNA基因进行扩增,测序并构建系统发育树,初步鉴定为41株嗜热链球菌,40株瑞士乳杆菌,11株德氏乳杆菌保加利亚亚种,2株发酵乳杆菌,1株乳明串珠菌,2株肠膜明串肠膜亚种,1株乳酸乳球乳酸亚种和12株属于干酪族的菌株.由于干酪乳杆菌族的16S rDNA序列差异很小,故采用tuf-RFLP 技术对这12株进行了进一步的验证,通过分离菌株与模式菌株tuf-RFLP图谱的比较分析,结果表明这12株菌均为干酪乳杆菌.

  13. Generation of novel cationic antimicrobial peptides from natural non-antimicrobial sequences by acid-amide substitution

    Tamada Yasushi

    2011-03-01

    Full Text Available Abstract Background Cationic antimicrobial peptides (CAMPs are well recognized to be promising as novel antimicrobial and antitumor agents. To obtain novel skeletons of CAMPs, we propose a simple strategy using acid-amide substitution (i.e. Glu→Gln, Asp→Asn to confer net positive charge to natural non-antimicrobial sequences that have structures distinct from known CAMPs. The potential of this strategy was verified by a trial study. Methods The pro-regions of nematode cecropin P1-P3 (P1P-P3P were selected as parent sequences. P1P-P3P and their acid-amide-substituted mutants (NP1P-NP3P were chemically synthesized. Bactericidal and membrane-disruptive activities of these peptides were evaluated. Conformational changes were estimated from far-ultraviolet circular dichroism (CD spectra. Results NP1P-NP3P acquired potent bactericidal activities via membrane-disruption although P1P-P3P were not antimicrobial. Far-ultraviolet CD spectra of NP1P-NP3P were similar to those of their parent peptides P1P-P3P, suggesting that NP1P-NP3P acquire microbicidal activity without remarkable conformational changes. NP1P-NP3P killed bacteria in almost parallel fashion with their membrane-disruptive activities, suggesting that the mode of action of those peptides was membrane-disruption. Interestingly, membrane-disruptive activity of NP1P-NP3P were highly diversified against acidic liposomes, indicating that the acid-amide-substituted nematode cecropin pro-region was expected to be a unique and promising skeleton for novel synthetic CAMPs with diversified membrane-discriminative properties. Conclusions The acid-amide substitution successfully generated some novel CAMPs in our trial study. These novel CAMPs were derived from natural non-antimicrobial sequences, and their sequences were completely distinct from any categories of known CAMPs, suggesting that such mutated natural sequences could be a promising source of novel skeletons of CAMPs.

  14. A locked nucleic Acid-based nanocrawler

    Astakhova, I Kira; Pasternak, Karol; Campbell, Meghan A;

    2013-01-01

    Herein we introduce a novel fluorescent LNA/DNA machine, a nanocrawler, which reversibly moves along a directionally polar complementary road controlled by affinity-enhancing locked nucleic acid (LNA) monomers and additional regulatory strands. Polyaromatic hydrocarbon (PAH) dyes attached to 2...

  15. Evaluation of T-cell responses to peptides and lipopeptides with MHC class I binding motifs derived from the amino acid sequence of the19-kDa lipoprotein of Mycobacterium tuberculosis

    Fonseca, DPAJ; Joosten, D; Snippe, H; Verheul, AFM

    2000-01-01

    Cytotoxic T-lymphocyte (CTL) epitopes on the 19-kDa lipoprotein from Mycobacterium tuberculosis were identified by the use of lipopeptides and their cytokine profile studied. Selection of candidate CTL epitopes was based on synthetic peptides derived from the amino acid sequence of the 19-kDa lipopr

  16. Carbon-based strong solid acid for cornstarch hydrolysis

    Nata, Iryanti Fatyasari; Irawan, Chairul; Mardina, Primata; Lee, Cheng-Kang

    2015-10-01

    Highly sulfonated carbonaceous spheres with diameter of 100-500 nm can be generated by hydrothermal carbonization of glucose in the presence of hydroxyethylsulfonic acid and acrylic acid at 180 °C for 4 h. The acidity of the prepared carbonaceous sphere C4-SO3H can reach 2.10 mmol/g. It was used as a solid acid catalyst for the hydrolysis of cornstarch. Total reducing sugar (TRS) concentration of 19.91 mg/mL could be obtained by hydrolyzing 20 mg/mL cornstarch at 150 °C for 6 h using C4-SO3H as solid acid catalyst. The solid acid catalyst demonstrated good stability that only 9% decrease in TRS concentration was observed after five repeat uses. The as-prepared carbon-based solid acid catalyst can be an environmentally benign replacement for homogeneous catalyst.

  17. BAC-end sequence-based SNP mining in Allotetraploid Cotton (Gossypium) utilizing re-sequencing data, phylogenetic inferences and perspectives for genetic mapping

    A bacterial artificial chromosome (BAC) library and BAC-end sequences for Gossypium hirsutum L. have recently been developed. Here we report on genomic-based genome-wide SNP mining utilizing re-sequencing data with a BAC-end sequence reference for twelve G. hirsutum L. lines, one G. barbadense L. li...

  18. Comparison of illumina and 454 deep sequencing in participants failing raltegravir-based antiretroviral therapy.

    Jonathan Z Li

    Full Text Available The impact of raltegravir-resistant HIV-1 minority variants (MVs on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs.A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegravir resistance at the time of virologic failure. A control library was created by pooling integrase clones at predefined proportions. Multiplexed sequencing was performed with Illumina and 454 platforms at comparable costs. Illumina sequence analysis was performed with the novel snp-assess tool and 454 sequencing was analyzed with V-Phaser.Illumina sequencing resulted in significantly higher sequence coverage and a 0.095% limit of detection. Illumina accurately detected all MVs in the control library at ≥0.5% and 7/10 MVs expected at 0.1%. 454 sequencing failed to detect any MVs at 0.1% with 5 false positive calls. For MVs detected in the patient samples by both 454 and Illumina, the correlation in the detected variant frequencies was high (R2 = 0.92, P<0.001. Illumina sequencing detected 2.4-fold greater nucleotide MVs and 2.9-fold greater amino acid MVs compared to 454. The only raltegravir-resistant MV detected was an E138K mutation in one participant by Illumina sequencing, but not by 454.In participants of A5262 with raltegravir resistance at virologic failure, baseline raltegravir-resistant MVs were rarely detected. At comparable costs to 454 sequencing, Illumina demonstrated greater depth of coverage, increased sensitivity for detecting HIV MVs, and fewer false positive variant calls.

  19. DNA Lossless Differential Compression Algorithm based on Similarity of Genomic Sequence Database

    Afify, Heba; Wahed, Manal Abdel

    2011-01-01

    Modern biological science produces vast amounts of genomic sequence data. This is fuelling the need for efficient algorithms for sequence compression and analysis. Data compression and the associated techniques coming from information theory are often perceived as being of interest for data communication and storage. In recent years, a substantial effort has been made for the application of textual data compression techniques to various computational biology tasks, ranging from storage and indexing of large datasets to comparison of genomic databases. This paper presents a differential compression algorithm that is based on production of difference sequences according to op-code table in order to optimize the compression of homologous sequences in dataset. Therefore, the stored data are composed of reference sequence, the set of differences, and differences locations, instead of storing each sequence individually. This algorithm does not require a priori knowledge about the statistics of the sequence set. The...

  20. Sequence specificity of mutagen-nucleic acid complexes in solution: intercalation and mutagen-base pair overlap geometries for proflavine binding to dC-dC-dG-dG and dG-dG-dC-dC self-complementary duplexes.

    Patel, D J; Canuel, L L

    1977-07-01

    The complex formed between the mutagen proflavine and the dC-dC-dG-dG and dG-dG-dC-dC self-complementary tetranucleotide duplexes has been monitored by proton high resolution nuclear magnetic resonance spectroscopy in 0.1 M phosphate solution at high nucleotide/drug ratios. The large upfield shifts (0.5 to 0.85 ppm) observed at all the proflavine ring nonexchangeable protons on complex formation are consistent with intercalation of the mutagen between base pairs of the tetranucleotide duplex. We have proposed an approximate overlap geometry between the proflavine ring and nearest neighbor base pairs at the intercalation site from a comparison between experimental shifts and those calculated for various stacking orientations. We have compared the binding of actinomycin D, propidium diiodide, and proflavine to self-complementary tetranucleotide sequences dC-dC-dG-dG and dG-dG-dC-dC by UV absorbance changes in the drug bands between 400 and 500 nm. Actinomycin D exhibits a pronounced specificity for sequences with dG-dC sites (dG-dG-dC-dC), while propidium diiodide and proflavine exhibit a specificity for sequences with dC-dG sites (dC-dC-dG-dG). Actinomycin D binds more strongly than propidium diiodide and proflavine to dC-dG-dC-dG (contains dC-dG and dG-dC binding sites), indicative of the additional stabilization from hydrogen bonding and hydrophobic interactions between the pentapeptide lactone rings of actinomycin D and the base pair edges and sugar-phosphate backbone of the tetranucleotide duplex.

  1. The Effects of White Light on Chlorogenic Acid Contents in Eucommia Ulmoides Calli Based on Transcriptome High-Throughput Sequencing%基于转录组高通量测序分析白光对杜仲愈伤组织中绿原酸含量的影响

    张俊娥; 邓华锋; 郑文强; 江京; 金聪; 赵夫凯

    2015-01-01

    In this study,Eucommia ulmoides calli for 18d under white light( light intensity was 12 000 lux,16 h light and 8 h darkness)were cultured and 24 h darkness culture was as control. Then transcriptome high-throughput sequencing for the Eucommia ulmoides calli were performed using the Illumina HiSeqTM 2500 sequencing platform. Functional annotation and enrichment analysis of differentially expressed genes were also carried out. Effect of white light on the content of chlorogenic acid of Eucommia ulmoides calli was studied. The results showed de novo assem-bly generated 62 030 unigenes. 25 417(40. 98%)annotated unigenes were obtained by selecting BLAST parameter ( E-value ≤10 -5 )and HMMER parameter( E-value ≤10 -10 ). Metabolic pathway analysis revealed that 3 uni-genes were predicted to be responsible for the chlorogenic acid,the 3 unigenes were up-regulated and in the calli under light culture,which was meaning that the production capacity of chlorogenic acid increased. The three en-zymes dominated chlorogenic acid production in Eucommia ulmoides calli. The three enzymes were phenylalanine ammonia-lyase(EC 4. 3. 1. 24,PAL),trans-cinnamate 4-monooxygenase(EC 1. 14. 13. 11,C4H)and Shikimate o-hydroxyl-cinnamoyltransferase(EC 2. 3. 1. 133,HCT). The study showed that the white light could improve the production capacity of chlorogenic acid components of Eucommia ulmoides calli and represents a valuable resource for future genomic studies on Eucommia ulmoides.%利用Illumina HiSeqTM 2500平台对光暗培养的杜仲愈伤组织进行了转录组高通量测序,利用BLAST软件进行差异表达基因的功能注释和富集分析,就白光(光强为12000 lux,16 h 光照,8 h 黑暗)对杜仲愈伤组织中绿原酸含量的影响进行了研究.结果表明:通过Trinity软件合并组装后共获得62030个Unigenes,通过BLASTX比对,共获得25417(40.98%)个有注释信息的Unigenes.通过对KEGG通路进行深入分析

  2. Reactive Distillation for Esterification of Bio-based Organic Acids

    Fields, Nathan; Miller, Dennis J.; Asthana, Navinchandra S.; Kolah, Aspi K.; Vu, Dung; Lira, Carl T.

    2008-09-23

    The following is the final report of the three year research program to convert organic acids to their ethyl esters using reactive distillation. This report details the complete technical activities of research completed at Michigan State University for the period of October 1, 2003 to September 30, 2006, covering both reactive distillation research and development and the underlying thermodynamic and kinetic data required for successful and rigorous design of reactive distillation esterification processes. Specifically, this project has led to the development of economical, technically viable processes for ethyl lactate, triethyl citrate and diethyl succinate production, and on a larger scale has added to the overall body of knowledge on applying fermentation based organic acids as platform chemicals in the emerging biorefinery. Organic acid esters constitute an attractive class of biorenewable chemicals that are made from corn or other renewable biomass carbohydrate feedstocks and replace analogous petroleum-based compounds, thus lessening U.S. dependence on foreign petroleum and enhancing overall biorefinery viability through production of value-added chemicals in parallel with biofuels production. Further, many of these ester products are candidates for fuel (particularly biodiesel) components, and thus will serve dual roles as both industrial chemicals and fuel enhancers in the emerging bioeconomy. The technical report from MSU is organized around the ethyl esters of four important biorenewables-based acids: lactic acid, citric acid, succinic acid, and propionic acid. Literature background on esterification and reactive distillation has been provided in Section One. Work on lactic acid is covered in Sections Two through Five, citric acid esterification in Sections Six and Seven, succinic acid in Section Eight, and propionic acid in Section Nine. Section Ten covers modeling of ester and organic acid vapor pressure properties using the SPEAD (Step Potential

  3. Effect of benzoic acid supplementation on acid-base status and mineralmetabolism in catheterized growing pigs

    Nørgaard, Jan Værum; Fernández, José Adalberto; Sørensen, Kristina Ulrich;

    2010-01-01

    Benzoic acid (BA) in diets for growing pigs results in urinary acidification and reduced ammonia emission. The objective was to study the impact of BA supplementation on the acid-base status and mineral metabolism in pigs. Eight female 50-kg pigs, fitted with a catheter in the abdominal aorta, were...

  4. Facile syntheses of dissymmetric ferrocene-functionalized Lewis acids and acid-base pairs.

    Morgan, Ian R; Di Paolo, Angela; Vidovic, Dragoslav; Fallis, Ian A; Aldridge, Simon

    2009-12-21

    A facile synthetic approach is reported for the synthesis of dissymmetric 1,2-ferrocenediyl Lewis acids and mixed acid-base pairs including the first example of a 1-phosphino-2-borylferrocene; the use of non-racemic electrophiles allows for the isolation of single diastereomer products.

  5. H.264 MOTION ESTIMATION ALGORITHM BASED ON VIDEO SEQUENCES ACTIVITY

    2008-01-01

    Motion estimation is an important part of H.264/AVC encoding progress, with high computational complexity. Therefore, it is quite necessary to find a fast motion estimation algorithm for real-time applications. The algorithm proposed in this letter adjudges the macroblocks activity degree first; then classifies different video sequences, and applies different search strategies according to the result. Experiments show that this method obtains almost the same video quality with the Full Search (FS) algorithm but with reduced more than 95% computation cost.

  6. Cluster-Based Multipolling Sequencing Algorithm for Collecting RFID Data in Wireless LANs

    Choi, Woo-Yong; Chatterjee, Mainak

    2015-03-01

    With the growing use of RFID (Radio Frequency Identification), it is becoming important to devise ways to read RFID tags in real time. Access points (APs) of IEEE 802.11-based wireless Local Area Networks (LANs) are being integrated with RFID networks that can efficiently collect real-time RFID data. Several schemes, such as multipolling methods based on the dynamic search algorithm and random sequencing, have been proposed. However, as the number of RFID readers associated with an AP increases, it becomes difficult for the dynamic search algorithm to derive the multipolling sequence in real time. Though multipolling methods can eliminate the polling overhead, we still need to enhance the performance of the multipolling methods based on random sequencing. To that extent, we propose a real-time cluster-based multipolling sequencing algorithm that drastically eliminates more than 90% of the polling overhead, particularly so when the dynamic search algorithm fails to derive the multipolling sequence in real time.

  7. Research on Nucleic Acid Sequence Design Methods for DNA Computing%DNA计算中核酸序列设计方法比较研究

    张凯; 耿修堂; 肖建华; 赵东明

    2008-01-01

    DNA computing maps the instances of a reality problem onto specific nucleic acid mol-ecules and protocols so that the result contains the answers to the problem's instances to enable successful extraction.Good DNA sequences prevent unwanted hybridization errors during the computation and enable easy retrieval the answers in the extraction phase.At first,the paper in-troduces two typical nucleic acid sequence methods,free energy based method and Hamming dis-tance based method,and analyzes the influence of constraints on the quality of DNA sequences.In addition,the paper has compared the integrity and computational cost of two methods for ex-cluding the non-specific hybridization sequence.At last,the paper analyzes the efficiency of two nucleic acid sequence design methods.Through the comparison we can see that the Hamming dis-tance based DNA sequence design method can only approximately estimate the thermodynamic stability of DNA hybridization,and can not be a perfect substitute for the minimum free energy method.Further analysis shows that when only consider the free energy of Watson-crick base pairs,the calculation of free energy will be equal to the Hamming distance method.So,if the precision satisfied the requirement of experiment,Hamming distance based DNA sequence design method,not only can effectively distinguish between specific and non-specific hybridization,and effectively reduce the amount of calculation,improve the efficiency of the DNA sequence designing.%DNA计算是将现实问题进行编码,映射到DNA分子上,然后通过分子生物实验产生出代表问题解的DNA分子.最后通过检测技术提取出该DNA分子.高质量的DNA编码可以尽可能避免或减少计算过程中出现的错误,并使检测阶段易于提取出代表问题解的DNA分子.文中对基于汉明距离和基于自由能的DNA核酸编码方法进行研究,分析了两类方法的约束条件对DNA编码质量的影响,比较了两类方法排除非特异性

  8. Construction of a phylogenetic tree of photosynthetic prokaryotes based on average similarities of whole genome sequences.

    Satoh, Soichirou; Mimuro, Mamoru; Tanaka, Ayumi

    2013-01-01

    Phylogenetic trees have been constructed for a wide range of organisms using gene sequence information, especially through the identification of orthologous genes that have been vertically inherited. The number of available complete genome sequences is rapidly increasing, and many tools for construction of genome trees based on whole genome sequences have been proposed. However, development of a reasonable method of using complete genome sequences for construction of phylogenetic trees has not been established. We have developed a method for construction of phylogenetic trees based on the average sequence similarities of whole genome sequences. We used this method to examine the phylogeny of 115 photosynthetic prokaryotes, i.e., cyanobacteria, Chlorobi, proteobacteria, Chloroflexi, Firmicutes and nonphotosynthetic organisms including Archaea. Although the bootstrap values for the branching order of phyla were low, probably due to lateral gene transfer and saturated mutation, the obtained tree was largely consistent with the previously reported phylogenetic trees, indicating that this method is a robust alternative to traditional phylogenetic methods.

  9. Modulation of anti-endotoxin property of Temporin L by minor amino acid substitution in identified phenylalanine zipper sequence.

    Srivastava, Saurabh; Kumar, Amit; Tripathi, Amit Kumar; Tandon, Anshika; Ghosh, Jimut Kanti

    2016-11-01

    A 13-residue frog antimicrobial peptide Temporin L (TempL) possesses versatile antimicrobial activities and is considered a lead molecule for the development of new antimicrobial agents. To find out the amino acid sequences that influence the anti-microbial property of TempL, a phenylalanine zipper-like sequence was identified in it which was not reported earlier. Several alanine-substituted analogs and a scrambled peptide having the same composition of TempL were designed for evaluating the role of this motif. To investigate whether leucine residues instead of phenylalanine residues at 'a' and/or 'd' position(s) of the heptad repeat sequence could alter its antimicrobial property, several TempL analogs were synthesized after replacing these phenylalanine residues with leucine residues. Replacing phenylalanine residues with alanine residues in the phenylalanine zipper sequence significantly compromised the anti-endotoxin property of TempL. This is evident from the higher production of tumor necrosis factor-α and interleukin-6 in lipopolysaccharide (LPS)-stimulated rat bone-marrow-derived macrophage cells in the presence of its alanine-substituted analogs than TempL itself. However, replacement of these phenylalanine residues with leucine residues significantly augmented anti-endotoxin property of TempL. A single alanine-substituted TempL analog (F8A-TempL) showed significantly reduced cytotoxicity but retained the antibacterial activity of TempL, while the two single leucine-substituted analogs (F5L-TempL and F8L-TempL), although exhibiting lower cytotoxicity, were able to retain the antibacterial activity of the parent peptide. The results demonstrate how minor amino acid substitutions in the identified phenylalanine zipper sequence in TempL could yield analogs with better antibacterial and/or anti-endotoxin properties with their plausible mechanism of action.

  10. DNA sequence-based analysis of the Pseudomonas species.

    Mulet, Magdalena; Lalucat, Jorge; García-Valdés, Elena

    2010-06-01

    Partial sequences of four core 'housekeeping' genes (16S rRNA, gyrB, rpoB and rpoD) of the type strains of 107 Pseudomonas species were analysed in order to obtain a comprehensive view regarding the phylogenetic relationships within the Pseudomonas genus. Gene trees allowed the discrimination of two lineages or intrageneric groups (IG), called IG P. aeruginosa and IG P. fluorescens. The first IG P. aeruginosa, was divided into three main groups, represented by the species P. aeruginosa, P. stutzeri and P. oleovorans. The second IG was divided into six groups, represented by the species P. fluorescens, P. syringae, P. lutea, P. putida, P. anguilliseptica and P. straminea. The P. fluorescens group was the most complex and included nine subgroups, represented by the species P. fluorescens, P. gessardi, P. fragi, P. mandelii, P. jesseni, P. koreensis, P. corrugata, P. chlororaphis and P. asplenii. Pseudomonas rhizospherae was affiliated with the P. fluorescens IG in the phylogenetic analysis but was independent of any group. Some species were located on phylogenetic branches that were distant from defined clusters, such as those represented by the P. oryzihabitans group and the type strains P. pachastrellae, P. pertucinogena and P. luteola. Additionally, 17 strains of P. aeruginosa, 'P. entomophila', P. fluorescens, P. putida, P. syringae and P. stutzeri, for which genome sequences have been determined, have been included to compare the results obtained in the analysis of four housekeeping genes with those obtained from whole genome analyses.

  11. A sequence-based variation map of zebrafish.

    Patowary, Ashok; Purkanti, Ramya; Singh, Meghna; Chauhan, Rajendra; Singh, Angom Ramcharan; Swarnkar, Mohit; Singh, Naresh; Pandey, Vikas; Torroja, Carlos; Clark, Matthew D; Kocher, Jean-Pierre; Clark, Karl J; Stemple, Derek L; Klee, Eric W; Ekker, Stephen C; Scaria, Vinod; Sivasubbu, Sridhar

    2013-03-01

    Zebrafish (Danio rerio) is a popular vertebrate model organism largely deployed using outbred laboratory animals. The nonisogenic nature of the zebrafish as a model system offers the opportunity to understand natural variations and their effect in modulating phenotype. In an effort to better characterize the range of natural variation in this model system and to complement the zebrafish reference genome project, the whole genome sequence of a wild zebrafish at 39-fold genome coverage was determined. Comparative analysis with the zebrafish reference genome revealed approximately 5.2 million single nucleotide variations and over 1.6 million insertion-deletion variations. This dataset thus represents a new catalog of genetic variations in the zebrafish genome. Further analysis revealed selective enrichment for variations in genes involved in immune function and response to the environment, suggesting genome-level adaptations to environmental niches. We also show that human disease gene orthologs in the sequenced wild zebrafish genome show a lower ratio of nonsynonymous to synonymous single nucleotide variations.

  12. Carbon-based strong solid acid for cornstarch hydrolysis

    Nata, Iryanti Fatyasari, E-mail: yanti_tkunlam@yahoo.com [Chemical Engineering Study Program, Faculty of Engineering, Lambung Mangkurat University, Jl. A. Yani Km. 36 Banjarbaru, South Kalimantan 70714 (Indonesia); Irawan, Chairul; Mardina, Primata [Chemical Engineering Study Program, Faculty of Engineering, Lambung Mangkurat University, Jl. A. Yani Km. 36 Banjarbaru, South Kalimantan 70714 (Indonesia); Lee, Cheng-Kang, E-mail: cklee@mail.ntust.edu.tw [Department of Chemical Engineering, National Taiwan University of Science and Technology, 43 Keelung Rd. Sec.4, Taipei 106, Taiwan (China)

    2015-10-15

    Highly sulfonated carbonaceous spheres with diameter of 100–500 nm can be generated by hydrothermal carbonization of glucose in the presence of hydroxyethylsulfonic acid and acrylic acid at 180 °C for 4 h. The acidity of the prepared carbonaceous sphere C4-SO{sub 3}H can reach 2.10 mmol/g. It was used as a solid acid catalyst for the hydrolysis of cornstarch. Total reducing sugar (TRS) concentration of 19.91 mg/mL could be obtained by hydrolyzing 20 mg/mL cornstarch at 150 °C for 6 h using C4-SO{sub 3}H as solid acid catalyst. The solid acid catalyst demonstrated good stability that only 9% decrease in TRS concentration was observed after five repeat uses. The as-prepared carbon-based solid acid catalyst can be an environmentally benign replacement for homogeneous catalyst. - Highlights: • Carbon solid acid was successfully prepared by one-step hydrothermal carbonization. • The acrylic acid as monomer was effectively reduce the diameter size of particle. • The solid acid catalyst show good catalytic performance of starch hydrolysis. • The solid acid catalyst is not significantly deteriorated after repeated use.

  13. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  14. Teaching acid/base physiology in the laboratory

    Friis, Ulla G; Plovsing, Ronni; Hansen, Klaus;

    2010-01-01

    Acid/base homeostasis is one of the most difficult subdisciplines of physiology for medical students to master. A different approach, where theory and practice are linked, might help students develop a deeper understanding of acid/base homeostasis. We therefore set out to develop a laboratory...... exercise in acid/base physiology that would provide students with unambiguous and reproducible data that clearly would illustrate the theory in practice. The laboratory exercise was developed to include both metabolic acidosis and respiratory alkalosis. Data were collected from 56 groups of medical...

  15. Neural network predicts sequence of TP53 gene based on DNA chip

    Spicker, J.S.; Wikman, F.; Lu, M.L.;

    2002-01-01

    We have trained an artificial neural network to predict the sequence of the human TP53 tumor suppressor gene based on a p53 GeneChip. The trained neural network uses as input the fluorescence intensities of DNA hybridized to oligonucleotides on the surface of the chip and makes between zero...... and four errors in the predicted 1300 bp sequence when tested on wild-type TP53 sequence....

  16. Parallel divide and conquer bio-sequence comparison based on Smith-Waterman algorithm

    ZHANG Fa; QIAO Xiangzhen; LIU Zhiyong

    2004-01-01

    Tools for pair-wise bio-sequence alignment have for long played a central role in computation biology. Several algorithms for bio-sequence alignment have been developed. The Smith-Waterman algorithm, based on dynamic programming, is considered the most fundamental alignment algorithm in bioinformatics. However the existing parallel Smith-Waterman algorithm needs large memory space, and this disadvantage limits the size of a sequence to be handled. As the data of biological sequences expand rapidly, the memory requirement of the existing parallel SmithWaterman algorithm has become a critical problem. For solving this problem, we develop a new parallel bio-sequence alignment algorithm, using the strategy of divide and conquer, named PSW-DC algorithm. In our algorithm, first, we partition the query sequence into several subsequences and distribute them to every processor respectively,then compare each subsequence with the whole subject sequence in parallel, using the Smith-Waterman algorithm, and get an interim result, finally obtain the optimal alignment between the query sequence and subject sequence, through the special combination and extension method. Memory space required in our algorithm is reduced significantly in comparison with existing ones. We also develop a key technique of combination and extension, named the C&E method, to manipulate the interim results and obtain the final sequences alignment. We implement the new parallel bio-sequences alignment algorithm,the PSW-DC, in a cluster parallel system.

  17. Environmentally Benign Bifunctional Solid Acid and Base Catalysts

    Elmekawy, A.; Shiju, N.R.; Rothenberg, G.; Brown, D.R.

    2014-01-01

    Solid bifunctional acid-​base catalysts were prepd. in two ways on an amorphous silica support: (1) by grafting mercaptopropyl units (followed by oxidn. to propylsulfonic acid) and aminopropyl groups to the silica surface (NH2-​SiO2-​SO3H)​, and (2) by grafting only aminopropyl groups and then parti

  18. Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

    Zhang, Zhang

    2010-11-08

    Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution and a large number of publically-available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge.Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences.Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms.Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.

  19. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia.

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  20. Boronic acid-tethered amphiphilic hyaluronic acid derivative-based nanoassemblies for tumor targeting and penetration.

    Jeong, Jae Young; Hong, Eun-Hye; Lee, Song Yi; Lee, Jae-Young; Song, Jae-Hyoung; Ko, Seung-Hak; Shim, Jae-Seong; Choe, Sunghwa; Kim, Dae-Duk; Ko, Hyun-Jeong; Cho, Hyun-Jong

    2017-02-16

    (3-Aminomethylphenyl)boronic acid (AMPB)-installed hyaluronic acid-ceramide (HACE)-based nanoparticles (NPs), including manassantin B (MB), were fabricated for tumor-targeted delivery. The amine group of AMPB was conjugated to the carboxylic acid group of hyaluronic acid (HA) via amide bond formation, and synthesis was confirmed by spectroscopic methods. HACE-AMPB/MB NPs with a 239-nm mean diameter, narrow size distribution, negative zeta potential, and >90% drug encapsulation efficiency were fabricated. Exposed AMPB in the outer surface of HACE-AMPB NPs (in the aqueous environment) may react with sialic acid of cancer cells. The improved cellular accumulation efficiency, in vitro antitumor efficacy, and tumor penetration efficiency of HACE-AMPB/MB NPs, compared with HACE/MB NPs, in MDA-MB-231 cells (CD44 receptor-positive human breast adenocarcinoma cells) may be based on the CD44 receptor-mediated endocytosis and phenylboronic acid-sialic acid interaction. Enhanced in vivo tumor targetability, infiltration efficiency, and antitumor efficacies of HACE-AMPB NPs, compared with HACE NPs, were observed in a MDA-MB-231 tumor-xenografted mouse model. In addition to passive tumor targeting (based on an enhanced permeability and retention effect) and active tumor targeting (interaction between HA and CD44 receptor), the phenylboronic acid-sialic acid interaction can play important roles in augmented tumor targeting and penetration of HACE-AMPB NPs. STATEMENT OF SIGNIFICANCE: (3-Aminomethylphenyl)boronic acid (AMPB)-tethered hyaluronic acid-ceramide (HACE)-based nanoparticles (NPs), including manassantin B (MB), were fabricated and their tumor targeting and penetration efficiencies were assessed in MDA-MB-231 (CD44 receptor-positive human adenocarcinoma) tumor models. MB, which exhibited antitumor efficacies via the inhibition of angiogenesis and hypoxia inducible factor (HIF)-1, was entrapped in HACE-AMPB NPs in this study. Phenylboronic acid located in the outer surface

  1. Complete Genome Sequence of the Unclassified Iron-Oxidizing, Chemolithoautotrophic Burkholderiales Bacterium GJ-E10, Isolated from an Acidic River.

    Fukushima, Jun; Tojo, Fuyumi; Asano, Ryoki; Kobayashi, Yayoi; Shimura, Yoichiro; Okano, Kunihiro; Miyata, Naoyuki

    2015-02-05

    Burkholderiales bacterium GJ-E10, isolated from the Tamagawa River in Akita Prefecture, Japan, is an unclassified, iron-oxidizing chemolithoautotrophic bacterium. Its single circular genome, consisting of 3,276,549 bp, was sequenced by using three types of next-generation sequencers and the sequences were then confirmed by PCR-based Sanger sequencing.

  2. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    Chen, Chunxian; Gmitter Jr, Fred G

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for...

  3. Marker-Based Human Motion Capture in Multiview Sequences

    Canton-Ferrer Cristian

    2010-01-01

    Full Text Available This paper presents a low-cost real-time alternative to available commercial human motion capture systems. First, a set of distinguishable markers are placed on several human body landmarks, and the scene is captured by a number of calibrated and synchronized cameras. In order to establish a physical relation among markers, a human body model is defined. Markers are detected on all camera views and delivered as the input of an annealed particle filter scheme where every particle encodes an instance of the pose of the body model to be estimated. Likelihood between particles and input data is performed through the robust generalized symmetric epipolar distance and kinematic constrains are enforced in the propagation step towards avoiding impossible poses. Tests over the HumanEva annotated data set yield quantitative results showing the effectiveness of the proposed algorithm. Results over sequences involving fast and complex motions are also presented.

  4. Characterization of DNA-binding sequences for CcaR in the cephamycin-clavulanic acid supercluster of Streptomyces clavuligerus.

    Santamarta, I; López-García, M T; Kurt, A; Nárdiz, N; Alvarez-Álvarez, R; Pérez-Redondo, R; Martín, J F; Liras, P

    2011-08-01

    RT-PCR analysis of the genes in the clavulanic acid cluster revealed three transcriptional polycistronic units that comprised the ceaS2-bls2-pah2-cas2, cyp-fd-orf12-orf13 and oppA2-orf16 genes, whereas oat2, car, oppA1, claR, orf14, gcaS and pbpA were expressed as monocistronic transcripts. Quantitative RT-PCR of Streptomyces clavuligerus ATCC 27064 and the mutant S. clavuligerus ccaR::aph showed that, in the mutant, there was a 1000- to 10,000-fold lower transcript level for the ceaS2 to cas2 polycistronic transcript that encoded CeaS2, the first enzyme of the clavulanic acid pathway that commits arginine to clavulanic acid biosynthesis. Smaller decreases in expression were observed in the ccaR mutant for other genes in the cluster. Two-dimensional electrophoresis and MALDI-TOF analysis confirmed the absence in the mutant strain of proteins CeaS2, Bls2, Pah2 and Car that are required for clavulanic acid biosynthesis, and CefF and IPNS that are required for cephamycin biosynthesis. Gel shift electrophoresis using recombinant r-CcaR protein showed that it bound to the ceaS2 and claR promoter regions in the clavulanic acid cluster, and to the lat, cefF, cefD-cmcI and ccaR promoter regions in the cephamycin C gene cluster. Footprinting experiments indicated that triple heptameric conserved sequences were protected by r-CcaR, and allowed identification of heptameric sequences as CcaR binding sites.

  5. Sequence-Length Requirement of Distance-Based Phylogeny Reconstruction: Breaking the Polynomial Barrier

    Roch, Sebastien

    2009-01-01

    We introduce a new distance-based phylogeny reconstruction technique which provably achieves, at sufficiently short branch lengths, a polylogarithmic sequence-length requirement -- improving significantly over previous polynomial bounds for distance-based methods. The technique is based on an averaging procedure that implicitly reconstructs ancestral sequences. In the same token, we extend previous results on phase transitions in phylogeny reconstruction to general time-reversible models. More precisely, we show that in the so-called Kesten-Stigum zone (roughly, a region of the parameter space where ancestral sequences are well approximated by ``linear combinations'' of the observed sequences) sequences of length $\\poly(\\log n)$ suffice for reconstruction when branch lengths are discretized. Here $n$ is the number of extant species. Our results challenge, to some extent, the conventional wisdom that estimates of evolutionary distances alone carry significantly less information about phylogenies than full sequ...

  6. Evaluation of cleaved amplified polymorphic sequence markers for Chamaecyparis obtusa based on expressed sequence tag information from Cryptomeria japonica.

    Matsumoto, A; Tsumura, Y

    2004-12-01

    We have developed and evaluated sequence-tagged site (STS) primers based on expressed sequence-tag information derived from sugi (Cryptomeria japonica) for use in hinoki (Chamaecyparis obtusa), a species that belongs to a different family (although it appears to be fairly closely related to sugi). Of the 417 C. japonica STS primer pairs we screened, 120 (approximately 30%) were transferable and provided specific PCR amplification products from 16 C. obtusa plus trees. We used haploid megagametophytes to investigate the homology of 80 STS fragments between C. obtusa and C. japonica and to identify orthologous loci. Nearly 90% of the fragments showed high (>70%) degrees of similarity between the species, and 35 STSs indicated homology to entries with the same putative function in a public DNA database. Of the 120 STS fragments amplified, 72 showed restriction fragment length polymorphisms; in addition, the CC2430 primers detected amplicon length polymorphism. We assessed the inheritance pattern of 27 cleaved amplified polymorphic sequence markers, using 20 individuals from the segregation population. All the markers analyzed were consistent with the marker inheritance patterns obtained from the screening panel, and no markers (except CC2716) showed significant (Pobtusa. Most of the markers should also provide reliable anchor loci for comparative mapping studies of the C. obtusa and C. japonica genomes.

  7. Nucleotide sequence and spatial expression pattern of a drought- and abscisic Acid-induced gene of tomato.

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicates that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  8. Phylogeny of Pelargonium (Geraniaceae) based on DNA sequences from three genomes

    Bakker, F.T.; Culham, A.; Hettiarachi, P.; Touloumendidou, T.; Gibby, M.

    2004-01-01

    Phylogenetic hypotheses for the largely South African genus Pelargonium L'Hér. (Geraniaceae) were derived based on DNA sequence data from nuclear, chloroplast and mitochondrial encoded regions. The datasets were unequally represented and comprised cpDNA trnL-F sequences for 152 taxa, nrDNA ITS seque

  9. Rapid detection of a norovirus pseudo-outbreak by using real-time sequence based information

    Rahamat-Langendoen, J. C.; Lokate, M.; Scholvinck, E. H.; Friedrich, A. W.; Niesters, H. G. M.

    2013-01-01

    Background: Sequence based information is increasingly used to study the epidemiology of viruses, not only to provide insight in viral evolution, but also to understand transmission patterns during outbreaks. However, sequence analysis is not yet routinely performed by diagnostic laboratories, limit

  10. Genotyping of Histomonas meleagridis isolates based on Internal Transcribed Spacer-1 sequences

    H.M.J.F. van der Heijden; W.J.M. Landman; S. Greve; R. Peek

    2006-01-01

    C-profiling is a novel genotyping method for protozoan pathogens, based on polymerase chain reaction and sequencing of AT-rich Internal Transcribed Spacer-1 sequences. It was applied to various Histomonas meleagridis isolates originating from outbreaks of histomoniasis in six Dutch turkey and chicke

  11. Geometric Patterns for Neighboring Bases Near the Stacked State in Nucleic Acid Strands.

    Sedova, Ada; Banavali, Nilesh K

    2017-03-14

    Structural variation in base stacking has been analyzed frequently in isolated double helical contexts for nucleic acids, but not as often in nonhelical geometries or in complex biomolecular environments. In this study, conformations of two neighboring bases near their stacked state in any environment are comprehensively characterized for single-strand dinucleotide (SSD) nucleic acid crystal structure conformations. An ensemble clustering method is used to identify a reduced set of representative stacking geometries based on pairwise distances between select atoms in consecutive bases, with multiple separable conformational clusters obtained for categories divided by nucleic acid type (DNA/RNA), SSD sequence, stacking face orientation, and the presence or absence of a protein environment. For both DNA and RNA, SSD conformations are observed that are either close to the A-form, or close to the B-form, or intermediate between the two forms, or further away from either form, illustrating the local structural heterogeneity near the stacked state. Among this large variety of distinct conformations, several common stacking patterns are observed between DNA and RNA, and between nucleic acids in isolation or in complex with proteins, suggesting that these might be stable stacking orientations. Noncanonical face/face orientations of the two bases are also observed for neighboring bases in the same strand, but their frequency is much lower, with multiple SSD sequences across categories showing no occurrences of such unusual stacked conformations. The resulting reduced set of stacking geometries is directly useful for stacking-energy comparisons between empirical force fields, prediction of plausible localized variations in single-strand structures near their canonical states, and identification of analogous stacking patterns in newly solved nucleic acid containing structures.

  12. Synthesis and catalytic application of amino acid based dendritic macromolecules

    Koten, G. van; Gossage, R.A.; Jastrzebski, J.T.B.H.; Ameijde, J. van; Mulders, S.J.E.; Brouwer, Arwin J.; Liskamp, R.M.J.

    1999-01-01

    The use of amino acid based dendrimers as molecular scaffolds for the attachment of catalytically active organometallic Ni ''pincer'' complexes, via a urea functionality, is described; the dendrimer catalysts have comparable activity to their mononuclear (NCN)NiX analogues.

  13. An Olfactory Indicator for Acid-Base Titrations.

    Flair, Mark N.; Setzer, William N.

    1990-01-01

    The use of an olfactory acid-base indicator in titrations for visually impaired students is discussed. Potential olfactory indicators include eugenol, thymol, vanillin, and thiophenol. Titrations performed with each indicator with eugenol proved to be successful. (KR)

  14. Identification of the novel HLA-DPB1*5801 allele detected by sequenced based typing

    Versluis, L.F.; Zwan, A.W. van der; Tilanus, M.G.J. [Academic Hospital Utrecht (Netherlands); Daly, L.N.; Degli-Esposti, M.A.; Dawkins, R.L. [Royal Perth Hospital, Perth (Australia)

    1995-01-11

    Within the framework of HLA-DPB typing of the Fourth Asia-Oceanic Histocompatibility workshop (4AOH) we have typed the A, B, and E panels representing 101 samples. Sequenced based typing (SBT) was used as the method for typing described by Versluis and co-workers, but the sequencing chemistry was modified; in this study we have used the Sequenase enzyme instead of the thermal stable Taq enzyme. Sequences obtained were compared to a database containing all known 55 HLA-DPB1 alleles. One sample showed a new heterozygous sequence, indicating the presence of a new allele. 4 refs., 1 fig.

  15. THE CONSTRUCTIONS OF ALMOST BINARY SEQUENCE PAIRS WITH THREE-LEVEL CORRELATION BASED ON CYCLOTOMY

    Peng Xiuping; Xu Chengqian

    2012-01-01

    In this paper,a new class of almost binary sequence pair with a single zero element is presented.The almost binary sequence pairs with three-level correlation are constructed based on cyclotomic numbers of order 2,4,and 6.Most of them have good correlation and balance property,whose maximum nontrivial correlation magnitudes are 2 and the difference between the numbers of occurrence of +1's and -1's are 0 or 1.In addition,the corresponding binary sequence pairs are investigated as well and we can also get some kinds of binary sequence pairs with optimum balance and good correlation.

  16. An Approach to Assembly Sequence Plannning Based on Hierarchical Strategy and Genetic Algorithm

    Niu Xinwen; Ding Han; Xiong Youlun

    2001-01-01

    Using group and subassembly cluster methods, the hierarchical structure of a product is.generated automatically, which largely reduces the complexity of planning. Based on genetic algofithn the optimal of assembly sequence of each stracture level can be obtained by sequence-bysequence search. As a result, a better assembly sequence of the product can be generated by combining the assembly sequences of all hierarchical structures, which provides more parallelism and flexibility for assembly operations. An industrial example is solved by this new approach.

  17. A method for amplification of unknown flanking sequences based on touchdown PCR and suppression-PCR.

    Gao, Song; He, Dan; Li, Guangquan; Zhang, Yanhua; Lv, Huiying; Wang, Li

    2016-09-15

    Thermal asymmetric staggered PCR is the most widely used technique to obtain the flanking sequences. However, it has some limitations, including a low rate of positivity, and complex operation. In this study, a improved method of it was made based on suppression-PCR and touchdown PCR. The PCR fragment obtained by the amplification was used directly for sequencing after gel purification. Using this improved method, the positive rate of amplified flanking sequences of the ATMT mutants reached 99%. In addition, the time from DNA extraction to flanking sequence analysis was shortened to 2 days with about 6 dollars each sample.

  18. An efficient binomial model-based measure for sequence comparison and its application.

    Liu, Xiaoqing; Dai, Qi; Li, Lihua; He, Zerong

    2011-04-01

    Sequence comparison is one of the major tasks in bioinformatics, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations. There are several similarity/dissimilarity measures for sequence comparison, but challenges remains. This paper presented a binomial model-based measure to analyze biological sequences. With help of a random indicator, the occurrence of a word at any position of sequence can be regarded as a random Bernoulli variable, and the distribution of a sum of the word occurrence is well known to be a binomial one. By using a recursive formula, we computed the binomial probability of the word count and proposed a binomial model-based measure based on the relative entropy. The proposed measure was tested by extensive experiments including classification of HEV genotypes and phylogenetic analysis, and further compared with alignment-based and alignment-free measures. The results demonstrate that the proposed measure based on binomial model is more efficient.

  19. Towards lactic acid bacteria-based biorefineries.

    Mazzoli, Roberto; Bosco, Francesca; Mizrahi, Itzhak; Bayer, Edward A; Pessione, Enrica

    2014-11-15

    Lactic acid bacteria (LAB) have long been used in industrial applications mainly as starters for food fermentation or as biocontrol agents or as probiotics. However, LAB possess several characteristics that render them among the most promising candidates for use in future biorefineries in converting plant-derived biomass-either from dedicated crops or from municipal/industrial solid wastes-into biofuels and high value-added products. Lactic acid, their main fermentation product, is an attractive building block extensively used by the chemical industry, owing to the potential for production of polylactides as biodegradable and biocompatible plastic alternative to polymers derived from petrochemicals. LA is but one of many high-value compounds which can be produced by LAB fermentation, which also include biofuels such as ethanol and butanol, biodegradable plastic polymers, exopolysaccharides, antimicrobial agents, health-promoting substances and nutraceuticals. Furthermore, several LAB strains have ascertained probiotic properties, and their biomass can be considered a high-value product. The present contribution aims to provide an extensive overview of the main industrial applications of LAB and future perspectives concerning their utilization in biorefineries. Strategies will be described in detail for developing LAB strains with broader substrate metabolic capacity for fermentation of cheaper biomass.

  20. Complete amino acid sequence of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris.

    Lehman, L D; Jones, B N; Dwulet, F E; Bogardt, R A; Gurd, F R

    1980-10-21

    The complete primary structure of the major component myoglobin from the goose-beaked whale, Ziphius cavirostris, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at its two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage with trypsin of the citraconylated apomyoglobin at its three arginine residues. Further digestion of the central cyanogen bromide peptide with S. aureus strain V8 protease and the 1,2-cyclohexanedione-treated central cyanogen bromide peptide with trypsin enabled the determination of the remainder of the covalent structure. This myoglobin differs from the cetacean myoglobins determined to date at 12 to 17 positions. These large sequence differences reflect the distant taxonomic relationships between the goose-beaked whale and the other species of Cetacea the myoglobin sequences of which have previously been determined.

  1. Synthesis and antimicrobial activities of new higher amino acid Schiff base derivatives of 6-aminopenicillanic acid and 7-aminocephalosporanic acid

    Özdemir (nee Güngör), Özlem; Gürkan, Perihan; Özçelik, Berrin; Oyardı, Özlem

    2016-02-01

    Novel β-lactam derivatives (1c-3c) (1d-3d) were produced by using 6-aminopenicillanic acid (6-APA), 7-aminocephalosporanic acid (7-ACA) and the higher amino acid Schiff bases. The synthesized compounds were characterized by elemental analysis, IR, 1H/13C NMR and UV-vis spectra. Antibacterial activities of all the higher amino acid Schiff bases (1a-3a) (1b-3b) and β-lactam derivatives were screened against three gram negative bacteria (Escherichia coli ATCC 25922, Pseudomonas aeruginosa ATCC 27853, Acinetobacter baumannii RSKK 02026), three gram positive bacteria (Staphylococcus aureus ATCC 25923, Enterococcus faecalis ATCC 07005, Bacillus subtilis ATCC 6633) and their drug-resistant isolates by using broth microdilution method. Two fungi (Candida albicans and Candida krusei) were used for antifungal activity.

  2. Kislinsko-bazno ravnovesje: Acid-base balance:

    2002-01-01

    Acids are donors, while bases are acceptors of protons in solutions. Many various acids and bases are incorporated into organisms by everyday alimentation and cell metabolism, while the output routes are the respiratory system, kidneys and gastrointestinal system. The concentration of free protonsin the blood (expressed as pH value) is strictly regulated, as it has agreat impact on cell metabolism. There are many ways in which organisms defend themselves against harmful pH changes. The first ...

  3. Acid-base homeostasis in the human system

    White, R. J.

    1974-01-01

    Acid-base regulation is a cooperative phenomena in vivo with body fluids, extracellular and intracellular buffers, lungs, and kidneys all playing important roles. The present account is much too brief to be considered a review of present knowledge of these regulatory systems, and should be viewed, instead, as a guide to the elements necessary to construct a simple model of the mutual interactions of the acid-base regulatory systems of the body.

  4. Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads

    Chengxi Ye

    2016-06-01

    Full Text Available Motivation. The third generation sequencing (3GS technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15–40%, significantly higher than those of the prevalent next generation sequencing (NGS technologies (less than 1%. Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality sequences that need to be extracted from these long but erroneous 3GS sequences. Results. We describe a versatile and efficient linear complexity consensus algorithm Sparc to facilitate de novo genome assembly. Sparc builds a sparse k-mer graph using a collection of sequences from a targeted genomic region. The heaviest path which approximates the most likely genome sequence is searched through a sparsity-induced reweighted graph as the consensus sequence. Sparc supports using NGS and 3GS data together, which leads to significant improvements in both cost efficiency and computational efficiency. Experiments with Sparc show that our algorithm can efficiently provide high-quality consensus sequences using both PacBio and Oxford Nanopore sequencing technologies. With only 30× PacBio data, Sparc can reach a consensus with error rate <0.5%. With the more challenging Oxford Nanopore data, Sparc can also achieve similar error rate when combined with NGS data. Compared with the existing approaches, Sparc calculates the consensus with higher accuracy, and uses approximately 80% less memory and time. Availability. The source code is available for download at https://github.com/yechengxi/Sparc.

  5. CAPS satellite spread spectrum communication blind multi-user detecting system based on chaotic sequences

    2009-01-01

    Multiple Path Interference (MPI) and Multiple Access Interference (MAI) are important factors that affect the performance of Chinese Area Positioning System (CAPS). These problems can be solved by using spreading sequences with ideal properties and multi-user detectors. Chaotic sequences based on Chebyshev map are studied and the satellite communication system model is set up to investigate the application of chaotic sequences for CAPS in this paper. Simulation results show that chaotic sequences have desirable correlation properties and it is easy to generate a large number of chaotic sequences with good security. It has great practical value to apply chaotic sequences to CAPS together with multi-user detecting technology and the system performance can be improved greatly.

  6. CAPS satellite spread spectrum communication blind multi-user detecting system based on chaotic sequences

    LEI LiHua; SHI HuLi; MA GuanYi

    2009-01-01

    Multiple Path Interference (MPI) and Multiple Access Interference (MAI) are Important factors that affect the performance of Chinese Area Positioning System (CAPS),These problems can be solved by using spreading sequences with ideal properties and multi-user detectors.Chaotic sequences based on Chebyshev map are studied and the satellite communication system model is set up to investigate the application of chaotic sequences for CAPS in this paper,Simulation results show that chaotic sequences have desirable correlation properties and it is easy to generate a large number of chaotic sequences with good security.It has great practical value to apply chaotic sequences to CAPS together with multi-user detecting technology and the system performance can be improved greatly.

  7. Biodegradation and Osteosarcoma Cell Cultivation on Poly(aspartic acid) Based Hydrogels.

    Juriga, Dávid; Nagy, Krisztina; Jedlovszky-Hajdú, Angéla; Perczel-Kovách, Katalin; Chen, Yong Mei; Varga, Gábor; Zrínyi, Miklós

    2016-09-14

    Development of novel biodegradable and biocompatible scaffold materials with optimal characteristics is important for both preclinical and clinical applications. The aim of the present study was to analyze the biodegradability of poly(aspartic acid)-based hydrogels, and to test their usability as scaffolds for MG-63 osteoblast-like cells. Poly(aspartic acid) was fabricated from poly(succinimide) and hydrogels were prepared using natural amines as cross-linkers (diaminobutane and cystamine). Disulfide bridges were cleaved to thiol groups and the polymer backbone was further modified with RGD sequence. Biodegradability of the hydrogels was evaluated by experiments on the base of enzymes and cell culture medium. Poly(aspartic acid) hydrogels possessing only disulfide bridges as cross-links proved to be degradable by collagenase I. The MG-63 cells showed healthy, fibroblast-like morphology on the double cross-linked and RGD modified hydrogels. Thiolated poly(aspartic acid) based hydrogels provide ideal conditions for adhesion, survival, proliferation, and migration of osteoblast-like cells. The highest viability was found on the thiolated PASP gels while the RGD motif had influence on compacted cluster formation of the cells. These biodegradable and biocompatible poly(aspartic acid)-based hydrogels are promising scaffolds for cell cultivation.

  8. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer

    Zambanini, Thiemo; Buescher, Joerg M.; Meurer, Guido; Blank, Lars M.

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  9. Isolation and amino acid sequence of crustacean hyperglycemic hormone precursor-related peptides.

    Tensen, C P; Verhoeven, A H; Gaus, G; Janssen, K P; Keller, R; Van Herp, F

    1991-01-01

    The crustacean hyperglycemic hormone (CHH) is synthesized as part of a larger preprohormone in which the sequence of CHH is N-terminally flanked by a peptide for which the name CPRP (CHH precursor-related peptide) is proposed. Both CHH and CPRP are present in the sinus gland, the neurohemal organ of neurosecretory cells located in the eyestalk of decapod crustaceans. This paper describes the isolation and sequence analysis of CPRPs isolated from sinus glands of the crab Carcinus maenas, the crayfish Orconectes limosus and the lobster Homarus americanus. The published sequence of "peptide H" isolated from the land crab, Cardisoma carnifex, has now been recognized as a CPRP in this species. Sequence comparison reveals a high level of identity for the N-terminal region (residues 1-13) between all four peptides, while identity in the C-terminal domain is high between lobster and crayfish CPRP on the one hand, and between both crab species on the other. Conserved N-terminal residues include a putative monobasic processing site at position 11, which suggests that CPRP may be a biosynthetic intermediate from which a potentially bioactive decapeptide can be derived.

  10. Complete genome sequence of the prototype lactic acid bacterium Lactococcus lactis subsp cremoris MG1363

    Wegmann, Udo; O'Connell-Motherwy, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 ps

  11. Speed limits for acid-base chemistry in aqueous solutions.

    Donten, Mateusz L; Vandevondele, Joost; Hamm, Peter

    2012-01-01

    Proton transfer reactions, including acid-base recombination, are commonly considered to occur 'nearly instantaneously'. However, their actual time scales may stretch far into the microsecond range, as acid-base reactions are diffusion controlled and the concentrations are low near neutral pH. The interplay of competing bases in the pH relaxation is illustrated using a model acid-base system consisting of o-nitrobenzaldehyde (oNBA) as a proton cage and acetate ions and hydroxyl ions as bases. The kinetically controlled behavior leads to highly counterintuitive states, i.e. acetate ions are transiently protonated for hundreds of nanoseconds despite the presence of a much stronger base OH-.

  12. Graph-based sequence annotation using a data integration approach.

    Pesch, Robert; Lysenko, Artem; Hindle, Matthew; Hassani-Pak, Keywan; Thiele, Ralf; Rawlings, Christopher; Köhler, Jacob; Taubert, Jan

    2008-08-25

    The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database Ara-Cyc) which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation. The methods and algorithms presented in this publication are an integral part of the ONDEX system which is freely available from http://ondex.sf.net/.

  13. Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

    Radova, A.; Sebela, M.; Galuszka, P.

    2001-01-01

    This paper reports the first purification method developed for the isolation of an homogeneous polyamine oxidase (PAO) from etiolated barley seedlings. The crude enzyme preparation was obtained after initial precipitation of the extract with protamine sulphate and ammonium sulphate. The enzyme...... was further purified to a final homogeneity (by the criteria of isoelectric focusing and SDS-PAGE) using techniques of low pressure chromatography followed by two FPLC steps. The purified yellow enzyme showed visible absorption maxima of a flavoprotein at 380 and 450 nm: the presence of FAD as the cofactor...... was further confirmed by measuring the fluorescence spectra, Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars: its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...

  14. Control allocation and management of redundant control effectors based on bases sequenced optimal method

    2010-01-01

    For an advanced aircraft, the amount of its effectors is much more than that for a traditional one, the functions of effectors are more complex and the coupling between each other is more severe. Based on the current control allocation research, this paper puts forward the concept and framework of the control allocation and management system for aircrafts with redundancy con-trol effectors. A new optimal control allocation method, bases sequenced optimal (BSO) method, is then presented. By analyz-ing the physical meaning of the allocation process of BSO method, four types of management strategies are adopted by the system, which act on the control allocation process under different flight conditions, mission requirements and effectors work-ing conditions. Simulation results show that functions of the control allocation system are extended and the system adaptability to flight status, mission requirements and effector failure conditions is improved.

  15. Generating Multiple Base-Resolution DNA Methylomes Using Reduced Representation Bisulfite Sequencing.

    Chatterjee, Aniruddha; Rodger, Euan J; Stockwell, Peter A; Le Mée, Gwenn; Morison, Ian M

    2017-01-01

    Reduced representation bisulfite sequencing (RRBS) is an effective technique for profiling genome-wide DNA methylation patterns in eukaryotes. RRBS couples size selection, bisulfite conversion, and second-generation sequencing to enrich for CpG-dense regions of the genome. The progressive improvement of second-generation sequencing technologies and reduction in cost provided an opportunity to examine the DNA methylation patterns of multiple genomes. Here, we describe a protocol for sequencing multiple RRBS libraries in a single sequencing reaction to generate base-resolution methylomes. Furthermore, we provide a brief guideline for base-calling and data analysis of multiplexed RRBS libraries. These strategies will be useful to perform large-scale, genome-wide DNA methylation analysis.

  16. E-probe Diagnostic Nucleic acid Analysis (EDNA): A theoretical approach for handling of next generation sequencing data for diagnostics

    There are many plant pathogen-specific diagnostic assays, based on PCR and immune-detection. However, the ability to test for large numbers of pathogens simultaneously is lacking. Next generation sequencing (NGS) allows one to detect all organisms within a given sample, but has computational limitat...

  17. Transgenerational inheritance: Models and mechanisms of non-DNA sequence-based inheritance.

    Miska, Eric A; Ferguson-Smith, Anne C

    2016-10-07

    Heritability has traditionally been thought to be a characteristic feature of the genetic material of an organism-notably, its DNA. However, it is now clear that inheritance not based on DNA sequence exists in multiple organisms, with examples found in microbes, plants, and invertebrate and vertebrate animals. In mammals, the molecular mechanisms have been challenging to elucidate, in part due to difficulties in designing robust models and approaches. Here we review some of the evidence, concepts, and potential mechanisms of non-DNA sequence-based transgenerational inheritance. We highlight model systems and discuss whether phenotypes are replicated or reconstructed over successive generations, as well as whether mechanisms operate at transcriptional and/or posttranscriptional levels. Finally, we explore the short- and long-term implications of non-DNA sequence-based inheritance. Understanding the effects of non-DNA sequence-based mechanisms is key to a full appreciation of heritability in health and disease.

  18. Base J glucosyltransferase does not regulate the sequence specificity of J synthesis in trypanosomatid telomeric DNA.

    Bullard, Whitney; Cliffe, Laura; Wang, Pengcheng; Wang, Yinsheng; Sabatini, Robert

    2015-12-01

    Telomeric DNA of trypanosomatids possesses a modified thymine base, called base J, that is synthesized in a two-step process; the base is hydroxylated by a thymidine hydroxylase forming hydroxymethyluracil (hmU) and a glucose moiety is then attached by the J-associated glucosyltransferase (JGT). To examine the importance of JGT in modifiying specific thymine in DNA, we used a Leishmania episome system to demonstrate that the telomeric repeat (GGGTTA) stimulates J synthesis in vivo while mutant telomeric sequences (GGGTTT, GGGATT, and GGGAAA) do not. Utilizing an in vitro GT assay we find that JGT can glycosylate hmU within any sequence with no significant change in Km or kcat, even mutant telomeric sequences that are unable to be J-modified in vivo. The data suggests that JGT possesses no DNA sequence specificity in vitro, lending support to the hypothesis that the specificity of base J synthesis is not at the level of the JGT reaction.

  19. Group Graded Associated Ideals with Flat Base Change of Rings and Short Exact Sequences

    Srinivas Behara; Shiv Datt Kumar

    2011-05-01

    This paper deals with the study of behaviour of -associated ideals and strong Krull -associated ideals with flat base change of rings and behaviour of -associated ideals with short exact sequences over rings graded by finitely generated abelian group .

  20. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Rhee, Mun Su [University of Florida, Gainesville; Moritz, Brelan E. [University of Florida, Gainesville; Xie, Gary [Los Alamos National Laboratory (LANL); Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Chertkov, Olga [Los Alamos National Laboratory (LANL); Brettin, Thomas S [ORNL; Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Patel, Milind [University of Florida, Gainesville; Ou, Mark [University of Florida, Gainesville; Harbrucker, Roberta [University of Florida, Gainesville; Ingram, Lonnie O. [University of Florida; Shanmugam, Keelnathan T. [University of Florida

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer- ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this spo- rogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attrac- tive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi- cellulose. This bacterium is also considered as a potential probiotic. Complete genome se- quence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  1. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    Xie, Gary [Los Alamos National Laboratory (LANL); Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer-ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi-cellulose. This bacterium is also considered as a potential probiotic. Complete genome squence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  2. Prediction of the Occurrence of the ADP-binding βαβ-fold in Proteins, Using an Amino Acid Sequence Fingerprint

    Wierenga, Rik K.; Terpstra, Peter; Hol, Wim G.J.

    1986-01-01

    An amino acid sequence "fingerprint” has been derived that can be used to test if a particular sequence will fold into a βαβ-unit with ADP-binding properties. It was deduced from a careful analysis of the known three-dimensional structures of ADP-binding βαβ-folds. This fingerprint is in fact a set

  3. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    2010-07-01

    ... “Sequence Listing” file. (6) All computer readable forms must have a label permanently affixed thereto on...) Computer readable form files submitted may be in any of the following media: (1) Diskette: 3.50 inch, 1.44... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824...

  4. Diverse bacterial PKS sequences derived from okadaic acid-producing dinoflagellates.

    Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S

    2008-05-22

    Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum.

  5. Adaptation of a Fault-Tolerant Fpga-Based Launch Sequencer as a Cubesat Payload Processor

    2014-06-01

    FAULT–TOLERANT FPGA –BASED LAUNCH SEQUENCER AS A CUBESAT PAYLOAD PROCESSOR by Jordan K. Goff June 2014 Thesis Co-Advisors: Herschel H...TYPE AND DATES COVERED Master’s Thesis 4. TITLE AND SUBTITLE ADAPTATION OF A FAULT–TOLERANT FPGA –BASED LAUNCH SEQUENCER AS A CUBESAT PAYLOAD...set. This processor is implemented on a field programmable gate array ( FPGA ) and will be used as the foundation for a payload processor on a cube

  6. Amino acid sequence diversity within the family of antibodies bearing the major antiarsonate cross-reactive idiotype of the A strain mouse

    1983-01-01

    VH region amino acid sequences are described for five A/J anti-p- azophenylarsonate (anti-Ars) hybridoma antibodies for which the VL region sequences have previously been determined, thus completing the V domain sequences of these molecules. These antibodies all belong to the family designated Ars-A which bears the major anti-arsonate cross- reactive idiotype (CRI) of the A strain mouse. However, they differ in the degree to which they express the CRI in standard competition radioimmunoassays...

  7. Cofactory: Sequence-based prediction of cofactor specificity of Rossmann folds

    Geertz-Hansen, Henrik Marcus; Blom, Nikolaj; Feist, Adam;

    2014-01-01

    cofactor specificity using only primary amino acid sequence information. The algorithm identifies potential cofactor binding Rossinann folds and predicts the specificity for the cofactors FAD(H2), NAD(H), and NADP(H) The Rossmann fold sequence search is carried out using hidden Markov models whereas...... artificial neural networks are used for specificity prediction. Training was carried out using experimental data from protein cofactor structure complexes. The overall performance was benchmarked against an independent evaluation set obtaining Matthews correlation coefficients of 0.94, 0.79, and 0.65 for FAD...

  8. A new approach based on PSO algorithm to find good computational encoding sequences

    Cui Guangzhao; Niu Yunyun; Wang Yanfeng; Zhang Xuncai; Pan Linqiang

    2007-01-01

    Computational encoding DNA sequence design is one of the most important steps in molecular computation. A lot of research work has been done to design reliable sequence library. A revised method based on the support system developed by Tanaka et al.is proposed here with different criteria to construct fitness function. Then we adapt particle swarm optimization (PSO) algorithm to our encoding problem. By using the new algorithm, a set of sequences with good quality is generated. The result also shows that our PSO- based approach could rapidly converge at the minimum level for an output of the simulation model. The celerity of the algorithm fits our requirements.

  9. A sequencing-based linkage map of cucumber

    Genetic maps are important tools for molecular breeding, gene cloning, and study of meiotic recombination. In cucumber (Cucumis sativus L.), the marker density, resolution and genome coverage of previously developed genetic maps using PCR-based molecular markers are relatively low. In this study we ...

  10. Molecular identification of veterinary yeast isolates by use of sequence-based analysis of the D1/D2 region of the large ribosomal subunit.

    Garner, Cherilyn D; Starr, Jennifer K; McDonough, Patrick L; Altier, Craig

    2010-06-01

    Conventional methods of yeast identification are often time-consuming and difficult; however, recent studies of sequence-based identification methods have shown promise. Additionally, little is known about the diversity of yeasts identified from various animal species in veterinary diagnostic laboratories. Therefore, in this study, we examined three methods of identification by using 109 yeast samples isolated during a 1-year period from veterinary clinical samples. Comparison of the three methods-traditional substrate assimilation, fatty acid profile analysis, and sequence-based analysis of the region spanning the D1 and D2 regions (D1/D2) of the large ribosomal subunit-showed that sequence analysis provided the highest percent identification among the three. Sequence analysis identified 87% of isolates to the species level, whereas substrate assimilation and fatty acid profile analysis identified only 54% and 47%, respectively. Less-stringent criteria for identification increased the percentage of isolates identified to 98% for sequence analysis, 62% for substrate assimilation, and 55% for fatty acid profile analysis. We also found that sequence analysis of the internal transcribed spacer 2 (ITS2) region provided further identification for 36% of yeast not identified to the species level by D1/D2 sequence analysis. Additionally, we identified a large variety of yeast from animal sources, with at least 30 different species among the isolates tested, and with the majority not belonging to the common Candida spp., such as C. albicans, C. glabrata, C. tropicalis, and the C. parapsilosis group. Thus, we determined that sequence analysis of the D1/D2 region was the best method for identification of the variety of yeasts found in a veterinary population.

  11. Amino acid sequence of the cold-active alkaline phosphatase from Atlantic cod (Gadus morhua)

    Asgeirsson, Bjarni; Nielsen, Berit Noesgaard; Højrup, Peter

    2003-01-01

    -linked glycosylation sites were found. The glycan structure was determined as complex biantennary in type with fucose and sialic acid attached, although a trace of complex tri-antennary structure was also observed. A three-dimensional model was obtained by homology modelling using the human placental AP scaffold. Cod...

  12. Sequence-specific inhibition of duck hepatitis B virus reverse transcription by peptide nucleic acids (PNA)

    Robaczewska, Magdalena; Narayan, Ramamurthy; Seigneres, Beatrice

    2005-01-01

    BACKGROUND/AIMS: Peptide nucleic acids (PNAs) appear as promising new antisense agents, that have not yet been examined as hepatitis B virus (HBV) inhibitors. Our aim was to study the ability of PNAs targeting the duck HBV (DHBV) encapsidation signal epsilon to inhibit reverse transcription (RT...

  13. Expanding the diversity of oenococcal bacteriophages: insights into a novel group based on the integrase sequence.

    Jaomanjaka, Fety; Ballestra, Patricia; Dols-lafargue, Marguerite; Le Marrec, Claire

    2013-09-01

    Temperate bacteriophages are a contributor of the genetic diversity in the lactic acid bacterium Oenococcus oeni. We used a classification scheme for oenococcal prophages based on integrase gene polymorphism, to analyze a collection of Oenococcus strains mostly isolated in the area of Bordeaux, which represented the major lineages identified through MLST schemes in the species. Genome sequences of oenococcal prophages were clustered into four integrase groups (A to D) which were related to the chromosomal integration site. The prevalence of each group was determined and we could show that members of the intB- and intC-prophage groups were rare in our panel of strains. Our study focused on the so far uncharacterized members of the intD-group. Various intD viruses could be easily isolated from wine samples, while intD lysogens could be induced to produce phages active against two permissive O. oeni isolates. These data support the role of this prophage group in the biology of O. oeni. Global alignment of three relevant intD-prophages revealed significant conservation and highlighted a number of unique ORFs that may contribute to phage and lysogen fitness.

  14. Sequence-based analysis of the bacterial and fungal compositions of multiple kombucha (tea fungus) samples.

    Marsh, Alan J; O'Sullivan, Orla; Hill, Colin; Ross, R Paul; Cotter, Paul D

    2014-04-01

    Kombucha is a sweetened tea beverage that, as a consequence of fermentation, contains ethanol, carbon dioxide, a high concentration of acid (gluconic, acetic and lactic) as well as a number of other metabolites and is thought to contain a number of health-promoting components. The sucrose-tea solution is fermented by a symbiosis of bacteria and yeast embedded within a cellulosic pellicle, which forms a floating mat in the tea, and generates a new layer with each successful fermentation. The specific identity of the microbial populations present has been the focus of attention but, to date, the majority of studies have relied on culture-based analyses. To gain a more comprehensive insight into the kombucha microbiota we have carried out the first culture-independent, high-throughput sequencing analysis of the bacterial and fungal populations of 5 distinct pellicles as well as the resultant fermented kombucha at two time points. Following the analysis it was established that the major bacterial genus present was Gluconacetobacter, present at >85% in most samples, with only trace populations of Acetobacter detected (95% in the fermented beverage, with a greater fungal diversity present in the cellulosic pellicle, including numerous species not identified in kombucha previously. Ultimately, this study represents the most accurate description of the microbiology of kombucha to date.

  15. Case-Based Plan Recognition Using Action Sequence Graphs

    2014-10-01

    while probabilistic algorithms include those that use stochastic grammars and probabilistic relational models. Both these approaches are sensitive to...Proceedings of the Fifth Game-On International Conference (pp. 36-40). Reading, UK: University of Wolverhampton Press. Cox, M. T., & Kerkez, B...algorithm based on plan tree grammars . Artificial Intelligence, 173(11), 1101-1132. Ghallab,M., Nau, D., & Traverso, P. (2004). Automated planning: Theory

  16. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2232 from Karijini National Park, Australia.

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Reddy, Tatiparthi; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  17. Identification of a novel HMW glutenin subunit and comparison of its amino acid sequence with those of homologous subunits

    2002-01-01

    Aegilops tauschii is the donor of the D genome of common wheat (Triticum aestivum). Genetic variation of HMW glutenin subunits encoded by the Glu-1Dt locus of Ae. tauschii has been found to be higher than that specified by the Glu-1D locus in common wheat. In the present note, we report the identification of a novel HMW glutenin subunit, Dy13t, from Ae. tauschii. The newly identified subunit possessed an electrophoretic mobility that was faster than that of the Dy12 subunit of common wheat. The complete ORF of encoding the Dy13t subunit contained 624 codons (excluding the stop codons). The amino acid sequence deduced from the Dy13t gene ORF was the shortest among those of the previously reported subunits derived by the D genome. A further comparison of Dy13t amino acid sequence with those of the subunits characterized from the A, B, D, R genomes of Triticeae showed that the smaller size of the Dy13t subunit was associated with a reduction in the size of its repetitive domain.

  18. Rapid Conversion of Traditional Introductory Physics Sequences to an Activity-Based Format

    Yoder, Garett; Cook, Jerry

    2014-01-01

    The Department of Physics at EKU [Eastern Kentucky University] with support from the National Science Foundations Course Curriculum and Laboratory Improvement Program has successfully converted our entire introductory physics sequence, both algebra-based and calculus-based courses, to an activity-based format where laboratory activities,…

  19. Role of acids and bases in nanoparticle growth

    Yli-Juuti, Taina; Barsanti, Kelley; Bzdek, Bryan; Hildebrandt Ruiz, Lea; Jokinen, Tuija; Kieloaho, Antti-Jussi; Makkonen, Ulla; Petäjä, Tuukka; Ruuskanen, Taina; Johnston, Murray; Kulmala, Markku; Riipinen, Ilona

    2014-05-01

    Secondary aerosol particles that are formed in atmosphere by gas-to-particle conversion during new particle formation events have potential to affect climate significantly due to their typically high number concentrations. This, however, requires that the freshly formed nanoparticles of about 1 nm in diameter grow tens of nanometers and reach climatically relevant sizes, i.e. sizes where they can act as cloud condensation nuclei. During the growth towards larger sizes the nanoparticles are subject to coagulational losses, and the rate at which the nanoparticles grow by condensation of vapors is a key factor affecting their probability to survive to climatically relevant sizes. Vapors that condense on the nanoparticles can be produced in the atmosphere from volatile compounds through gas phase chemical reactions, and their volatility can also be further lowered by particle phase processes. Therefore, particle composition and particle phase processes may influence nanoparticle growth. We study the growth of atmospheric nanoparticles and especially the role of particle phase salt formation in the nanoparticle growth using MABNAG model (Model for Acid-Base chemistry in NAnoparticle Growth) and by comparing to atmospheric measurements. MABNAG is a condensation growth model for aqueous solution particles. In MABNAG the dynamics of gas phase mass transport of vapors to particle are coupled with thermodynamics of particle phase acid-base chemistry, and both the composition and size dependence of equilibrium vapor pressures are accounted for. The model is applied especially for boreal forest environment. Here nanoparticle growth is modeled with a system of water, two acids (sulfuric acid and an organic acid) and two bases (ammonia and an amine) as condensing vapors. Focus is on the neutralization of acids by the bases and the related effects on the particle growth. According to the model predictions the enhancement of condensation of organic acid due to salt formation is

  20. GrabCut-based human segmentation in video sequences.

    Hernández-Vela, Antonio; Reyes, Miguel; Ponce, Víctor; Escalera, Sergio

    2012-11-09

    In this paper, we present a fully-automatic Spatio-Temporal GrabCut human segmentation methodology that combines tracking and segmentation. GrabCut initialization is performed by a HOG-based subject detection, face detection, and skin color model. Spatial information is included by Mean Shift clustering whereas temporal coherence is considered by the historical of Gaussian Mixture Models. Moreover, full face and pose recovery is obtained by combining human segmentation with Active Appearance Models and Conditional Random Fields. Results over public datasets and in a new Human Limb dataset show a robust segmentation and recovery of both face and pose using the presented methodology.

  1. GrabCut-Based Human Segmentation in Video Sequences

    Sergio Escalera

    2012-11-01

    Full Text Available In this paper, we present a fully-automatic Spatio-Temporal GrabCut human segmentation methodology that combines tracking and segmentation. GrabCut initialization is performed by a HOG-based subject detection, face detection, and skin color model. Spatial information is included by Mean Shift clustering whereas temporal coherence is considered by the historical of Gaussian Mixture Models. Moreover, full face and pose recovery is obtained by combining human segmentation with Active Appearance Models and Conditional Random Fields. Results over public datasets and in a new Human Limb dataset show a robust segmentation and recovery of both face and pose using the presented methodology.

  2. The complete amino acid sequence of the major component myoglobin from the arctic minke whale, Balaenoptera acutorostrata.

    Lehman, L D; Dwulet, F E; Bogardt, R A; Jones, B N; Gurd, F R

    1977-02-22

    The complete primary structure of the major component myoglobin from the Arctic minke whale, Balaenoptera acutorostrata, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. Over 80% of the amino acid sequence was established from the three peptides resulting from the cleavage of the apomyoglobin at the two methionine residues with cyanogen bromide along with the four peptides resulting from the cleavage of the methylacetimidated apomyoglobin at the three arginine residues with trypsin. The further digestion of the central cyanogen bromide peptide with trypsin and S. aureus strain V8 protease enabled the determining of the remainder of the covalent structure. This myoglobin differs from that of the dwarf sperm whale, Kogia simus, at 16 positions, and the common dolphin, Delphinus delphis, at 14 positions, from that of the common porpoise, Phocaena phocaena, and the bottlenosed dolphin, Tursiops truncatus at 13 positions, from that of the Amazon River dolphin, Inia geoffrensis, at 10 positions, and from that of California gray whale, Eschrichtius gibbosus, at 3 positions- All of the substitutions observed in this sequence fit easily into the three-dimensional structure of the sperm whale myoglobin.

  3. Amino acid sequence and disulfide bond assignment of myotoxin a isolated from the venom of prairie rattlesnake (Crotalus viridis viridis)

    Fox, J.W.; Elzinga, M.; Tu, A.T.

    1979-02-20

    The primary structure of myotoxin a, a myotoxin protein from the venom of the North American rattlesnake Crotalus viridis viridis, was determined and the position of the disulfide bonds assigned. The toxin was isolated, carboxymethylated, and cleaved by cyanogen bromide, and the resultant peptides were isolated. The cyanogen bromide peptides were subjected to amino acid sequence analysis. In order to assign the positions of the three disulfide bonds, the native toxin was cleaved sequentially with cyanogen bromide and trypsin. A two peptide unit connected by one disulfide bond was isolated and characterized, and a three-peptide unit connected by two disulfide bonds was isolated. One peptide in the three-peptide unit was identified as Cys-Cys-Lys. In order to establish the linkages between the peptides and Cys-Cys-Lys, one cycle of Edman degradation was carried out such that the Cys-Cys bond was cleaved. Upon isolation and analysis of the cleavage products, the disulfide bonds connecting the three peptides were determined. The positions of the disulfide bridges of myotoxin a were determined to be totally different from those of neurotoxins isolated from snake venoms. The sequence of myotoxin a was compared with the sequences of other snake venom toxins using the computer program RELATE to determine whether myotoxin a is similar to any other types of toxins. From the computer analysis, myotoxin a did not show any close relationship to other toxins except crotamine from the South American rattlesnake Crotalus durissus terrificus.

  4. Acid-base disturbance in patients with cirrhosis

    Henriksen, Jens H; Bendtsen, Flemming; Møller, Søren

    2015-01-01

    PURPOSE: Acid-base disturbances were investigated in patients with cirrhosis in relation to hemodynamic derangement to analyze the hyperventilatory effects and the metabolic compensation. METHODS: A total of 66 patients with cirrhosis and 44 controls were investigated during a hemodynamic study......, and effects of unidentified ions (all Pacid-base disturbances could not be identified. CONCLUSION: Hypocapnic alkalosis is related to disease severity and hyperdynamic systemic circulation in patients with cirrhosis. The metabolic compensation includes...... alterations in serum albumin and water retention that may result in a delicate acid-base balance in these patients....

  5. Poly (ricinoleic acid) based novel thermosetting elastomer.

    Ebata, Hiroki; Yasuda, Mayumi; Toshima, Kazunobu; Matsumura, Shuichi

    2008-01-01

    A novel bio-based thermosetting elastomer was prepared by the lipase-catalyzed polymerization of methyl ricinoleate with subsequent vulcanization. Some mechanical properties of the cured carbon black-filled polyricinoleate compounds were evaluated as a thermosetting elastomer. It was found that the carbon black-filled polyricinoleate compounds were readily cured by sulfur curatives to produce a thermosetting elastomer that formed a rubber-like sheet with a smooth and non-sticky surface. The curing behaviors and mechanical properties were dependent on both the molecular weight of the polyricinoleate and the amount of the sulfur curatives. Cured compounds consisting of polyricinoleate with a molecular weight of 100,800 showed good mechanical properties, such as a hardness of 48 A based on the durometer A measurements, a tensile strength at break of 6.91 MPa and an elongation at break of 350%.

  6. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    Kathleen S. Rein

    2008-05-01

    Full Text Available Okadaic acid (OA and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum.

  7. Soil Studies: Applying Acid-Base Chemistry to Environmental Analysis.

    West, Donna M.; Sterling, Donna R.

    2001-01-01

    Laboratory activities for chemistry students focus attention on the use of acid-base chemistry to examine environmental conditions. After using standard laboratory procedures to analyze soil and rainwater samples, students use web-based resources to interpret their findings. Uses CBL probes and graphing calculators to gather and analyze data and…

  8. Parallel proton transfer pathways in aqueous acid-base reactions

    Cox, M.J.; Bakker, H.J.

    2008-01-01

    We study the mechanism of proton transfer (PT) between the photoacid 8-hydroxy-1,3, 6-pyrenetrisulfonic acid (HPTS) and the base chloroacetate in aqueous solution. We investigate both proton and deuteron transfer reactions in solutions with base concentrations ranging from 0.25M to 4M. Using femtose

  9. A European Acid Rain Program based on the US experience

    Brandt, U. Steiner; Svendsen, Gert Tinggaard

    2000-01-01

    The paper shows that cost-effective involvement of the source location involves utmost difficulty in practice. Based on the RAINS model, it is recommended that source location should be ignored in a European market for SO2, as is the case in the US Acid Rain Program. Based on the political target...

  10. Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

    Chunsheng Gao; Pengfei Xin; Chaohua Cheng; Qing Tang; Ping Chen; Changbiao Wang; Gonggu Zang; Lining Zhao

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SS...

  11. Ionic liquid supported acid/base-catalyzed production of biodiesel.

    Lapis, Alexandre A M; de Oliveira, Luciane F; Neto, Brenno A D; Dupont, Jairton

    2008-01-01

    The transesterification (alcoholysis) reaction was successfully applied to synthesize biodiesel from vegetable oils using imidazolium-based ionic liquids under multiphase acidic and basic conditions. Under basic conditions, the combination of the ionic liquid 1-n-butyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide (BMINTf2), alcohols, and K2CO3 (40 mol %) results in the production of biodiesel from soybean oil in high yields (>98%) and purity. H2SO4 immobilized in BMINTf2 efficiently promotes the transesterification reaction of soybean oil and various primary and secondary alcohols. In this multiphase process the acid is almost completely retained in the ionic liquid phase, while the biodiesel forms a separate phase. The recovered ionic liquid containing the acid could be reused at least six times without any significant loss in the biodiesel yield or selectivity. In both catalytic processes (acid and base), the reactions proceed as typical multiphasic systems in which the formed biodiesel accumulates as the upper phase and the glycerol by-product is selectively captured by the alcohol-ionic liquid-acid/base phase. Classical ionic liquids such as 1-n-butyl-3-methylimidazolium tetrafluoroborate and hexafluorophosphate are not stable under these acidic or basic conditions and decompose.

  12. MRI-Based Thermometry for Tumor Thermal Ablation: A Comparison of Different MR Sequences

    T. J. Vogl

    2010-05-01

    Full Text Available Background/Objective: To evaluate T1 and PRF thermometry methods utilizing fast MR sequences and fluoroptic thermometer."nMaterials and Methods: The MR-guided LITT (Laser-Induced Interstitial Thermotherapy with a laser wavelength/power of 1064nm/30W was applied to pig liver and a gel phantom. During the ablation process, the temperature was measured using a fluoroptic thermometer and MR imaging was performed applying a 1.5-Tesla tomograph with an EPI (Echo Planar Imaging sequence for PRF (Proton Resonance Frequency method and FLASH, IRTF, SRTF and TRUFI sequences for T1 method. Plotting MR signal intensity against measured temperature determined the temperature constant for each of the T1 sequences. To determine the PRF temperature constant, phase values were recorded from phase images and then plotted against temperature. The PRF temperature constant was verified comparing the MR temperature with the measured one obtained from a second LITT experiment on gel phantom."nResults: The experiments determining the temperature constant for T1 method showed that the IRTF and FLASH sequences have the highest temperature sensitivity and the most linear relationship between MR signal intensity and measured temperature. SRTF sequence presented relatively good linearity but inferior temperature sensitivity compared to IRTF and FLASH sequences. Conversely, TRUFI sequence exhibited the lowest temperature sensitivity and linearity of data points. Concerning the PRF method, the measured and the MR-based temperatures agreed up to approximately 70 C."nConclusion: To demonstrate and control temperature in target tissue during the LITT process, the PRF method with an EPI sequence is preferred for temperatures below 70 C due to its acceptable accuracy. Among the T1 sequences, FLASH is preferable as the most robust, though not the most accurate T1 sequence.

  13. A computational study of ultrafast acid dissociation and acid-base neutralization reactions. I. The model.

    Maurer, Patrick; Thomas, Vibin; Rivard, Ugo; Iftimie, Radu

    2010-07-28

    Ultrafast, time-resolved investigations of acid-base neutralization reactions have recently been performed using systems containing the photoacid 8-hydroxypyrene-1,3,6-trisulfonic acid trisodium salt (HPTS) and various Bronsted bases. Two conflicting neutralization mechanisms have been formulated by Mohammed et al. [Science 310, 83 (2005)] and Siwick et al. [J. Am. Chem. Soc. 129, 13412 (2007)] for the same acid-base system. Herein an ab initio molecular dynamics based computational model is formulated, which is able to investigate the validity of the proposed mechanisms in the general context of ground-state acid-base neutralization reactions. Our approach consists of using 2,4,6-tricyanophenol (exp. pKa congruent with 1) as a model for excited-state HPTS( *) (pKa congruent with 1.4) and carboxylate ions for the accepting base. We employ our recently proposed dipole-field/quantum mechanics (QM) treatment [P. Maurer and R. Iftimie, J. Chem. Phys. 132, 074112 (2010)] of the proton donor and acceptor molecules. This approach allows one to tune the free energy of neutralization to any desired value as well as model initial nonequilibrium hydration effects caused by a sudden increase in acidity, making it possible to achieve a more realistic comparison with experimental data than could be obtained via a full-QM treatment of the entire system. It is demonstrated that the dipole-field/QM model reproduces correctly key properties of the 2,4,6-tricyanophenol acid molecule including gas-phase proton dissociation energies and dipole moments, and condensed-phase hydration structure and pKa values.

  14. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry.

    Carter, Charles W; Wolfenden, Richard

    2016-01-01

    The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.

  15. Integrated and sequence-ordered BAC- and YAC-based physical maps for the rat genome.

    Krzywinski, Martin; Wallis, John; Gösele, Claudia; Bosdet, Ian; Chiu, Readman; Graves, Tina; Hummel, Oliver; Layman, Dan; Mathewson, Carrie; Wye, Natasja; Zhu, Baoli; Albracht, Derek; Asano, Jennifer; Barber, Sarah; Brown-John, Mabel; Chan, Susanna; Chand, Steve; Cloutier, Alison; Davito, Jonathon; Fjell, Chris; Gaige, Tony; Ganten, Detlev; Girn, Noreen; Guggenheimer, Kurtis; Himmelbauer, Heinz; Kreitler, Thomas; Leach, Stephen; Lee, Darlene; Lehrach, Hans; Mayo, Michael; Mead, Kelly; Olson, Teika; Pandoh, Pawan; Prabhu, Anna-Liisa; Shin, Heesun; Tänzer, Simone; Thompson, Jason; Tsai, Miranda; Walker, Jason; Yang, George; Sekhon, Mandeep; Hillier, LaDeana; Zimdahl, Heike; Marziali, Andre; Osoegawa, Kazutoyo; Zhao, Shaying; Siddiqui, Asim; de Jong, Pieter J; Warren, Wes; Mardis, Elaine; McPherson, John D; Wilson, Richard; Hübner, Norbert; Jones, Steven; Marra, Marco; Schein, Jacqueline

    2004-04-01

    As part of the effort to sequence the genome of Rattus norvegicus, we constructed a physical map comprised of fingerprinted bacterial artificial chromosome (BAC) clones from the CHORI-230 BAC library. These BAC clones provide approximately 13-fold redundant coverage of the genome and have been assembled into 376 fingerprint contigs. A yeast artificial chromosome (YAC) map was also constructed and aligned with the BAC map via fingerprinted BAC and P1 artificial chromosome clones (PACs) sharing interspersed repetitive sequence markers with the YAC-based physical map. We have annotated 95% of the fingerprint map clones in contigs with coordinates on the version 3.1 rat genome sequence assembly, using BAC-end sequences and in silico mapping methods. These coordinates have allowed anchoring 358 of the 376 fingerprint map contigs onto the sequence assembly. Of these, 324 contigs are anchored to rat genome sequences localized to chromosomes, and 34 contigs are anchored to unlocalized portions of the rat sequence assembly. The remaining 18 contigs, containing 54 clones, still require placement. The fingerprint map is a high-resolution integrative data resource that provides genome-ordered associations among BAC, YAC, and PAC clones and the assembled sequence of the rat genome.

  16. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  17. Phylogeny of the Zygomycota based on nuclear ribosomal sequence data.

    White, Merlin M; James, Timothy Y; O'Donnell, Kerry; Cafaro, Matías J; Tanabe, Yuuhiko; Sugiyama, Junta

    2006-01-01

    The Zygomycota is an ecologically heterogenous assemblage of nonzoosporic fungi comprising two classes, Zygomycetes and Trichomycetes. Phylogenetic analyses have suggested that the phylum is polyphyletic; two of four orders of Trichomycetes are related to the Mesomycetozoa (protists) that diverged near the fungal/animal split. Current circumscription of the Zygomycota includes only orders with representatives that produce zygospores. We present a molecular-based phylogeny including recognized representatives of the Zygomycetes and Trichomycetes with a combined dataset for nuclear rRNA 18S (SSU), 5.8S and 28S (LSU) genes. Tree reconstruction by Bayesian analyses suggests the Zygomycota is paraphyletic. Although 12 clades were identified only some of these correspond to the nine orders of Zygomycota currently recognized. A large superordinal clade, comprising the Dimargaritales, Harpellales, Kickxellales and Zoopagales, grouping together many symbiotic fungi, also is identified in part by a unique septal structure. Although Harpellales and Kickxellales are not monophyletic, these lineages are distinct from the Mucorales, Endogonales and Mortierellales, which appear more closely related to the Ascomycota + Basidiomycota + Glomeromycota. The final major group, the insect-associated Entomophthorales, appears to be polyphyletic. In the present analyses Basidiobolus and Neozygites group within Zygomycota but not with the Entomophthorales. Clades are discussed with special reference to traditional classifications, mapping morphological characters and ecology, where possible, as a snapshot of our current phylogenetic perspective of the Zygomycota.

  18. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids and evolutionary trees will be derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins; ferredoxin and cytochrome c.

  19. Comparative RNA-Sequence Transcriptome Analysis of Phenolic Acid Metabolism in Salvia miltiorrhiza, a Traditional Chinese Medicine Model Plant

    Zhenqiao Song

    2017-01-01

    Full Text Available Salvia miltiorrhiza Bunge is an important traditional Chinese medicine (TCM. In this study, two S. miltiorrhiza genotypes (BH18 and ZH23 with different phenolic acid concentrations were used for de novo RNA sequencing (RNA-seq. A total of 170,787 transcripts and 56,216 unigenes were obtained. There were 670 differentially expressed genes (DEGs identified between BH18 and ZH23, 250 of which were upregulated in ZH23, with genes involved in the phenylpropanoid biosynthesis pathway being the most upregulated genes. Nine genes involved in the lignin biosynthesis pathway were upregulated in BH18 and thus result in higher lignin content in BH18. However, expression profiles of most genes involved in the core common upstream phenylpropanoid biosynthesis pathway were higher in ZH23 than that in BH18. These results indicated that genes involved in the core common upstream phenylpropanoid biosynthesis pathway might play an important role in downstream secondary metabolism and demonstrated that lignin biosynthesis was a putative partially competing pathway with phenolic acid biosynthesis. The results of this study expanded our understanding of the regulation of phenolic acid biosynthesis in S. miltiorrhiza.

  20. Mass Spectrometry Analysis Coupled with de novo Sequencing Reveals Amino Acid Substitutions in Nucleocapsid Protein from Influenza A Virus

    Zijian Li

    2014-02-01

    Full Text Available Amino acid substitutions in influenza A virus are the main reasons for both antigenic shift and virulence change, which result from non-synonymous mutations in the viral genome. Nucleocapsid protein (NP, one of the major structural proteins of influenza virus, is responsible for regulation of viral RNA synthesis and replication. In this report we used LC-MS/MS to analyze tryptic digestion of nucleocapsid protein of influenza virus (A/Puerto Rico/8/1934 H1N1, which was isolated and purified by SDS poly-acrylamide gel electrophoresis. Thus, LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three substituted amino acid residues R452K, T423A and N430T in two tryptic peptides. The obtained results provided experimental evidence that amino acid substitutions resulted from non-synonymous gene mutations could be directly characterized by mass spectrometry in proteins of RNA viruses such as influenza A virus.

  1. Comparative RNA-Sequence Transcriptome Analysis of Phenolic Acid Metabolism in Salvia miltiorrhiza, a Traditional Chinese Medicine Model Plant

    Song, Zhenqiao; Guo, Linlin; Liu, Tian; Lin, Caicai; Wang, Jianhua

    2017-01-01

    Salvia miltiorrhiza Bunge is an important traditional Chinese medicine (TCM). In this study, two S. miltiorrhiza genotypes (BH18 and ZH23) with different phenolic acid concentrations were used for de novo RNA sequencing (RNA-seq). A total of 170,787 transcripts and 56,216 unigenes were obtained. There were 670 differentially expressed genes (DEGs) identified between BH18 and ZH23, 250 of which were upregulated in ZH23, with genes involved in the phenylpropanoid biosynthesis pathway being the most upregulated genes. Nine genes involved in the lignin biosynthesis pathway were upregulated in BH18 and thus result in higher lignin content in BH18. However, expression profiles of most genes involved in the core common upstream phenylpropanoid biosynthesis pathway were higher in ZH23 than that in BH18. These results indicated that genes involved in the core common upstream phenylpropanoid biosynthesis pathway might play an important role in downstream secondary metabolism and demonstrated that lignin biosynthesis was a putative partially competing pathway with phenolic acid biosynthesis. The results of this study expanded our understanding of the regulation of phenolic acid biosynthesis in S. miltiorrhiza. PMID:28194403

  2. Deoxyribonucleic acid base compositions of dermatophytes.

    Davison, F D; Mackenzie, D W; Owen, R J

    1980-06-01

    DNA was extracted and purified from 55 dermatophyte isolates representing 34 species of Trichophyton, Microsporum and Epidermophyton. The base compositions of the chromosomal DNA were determined by CsCl density gradient centrifugation and were found to be in the narrow range of 48.7 to 50.3 mol % G + C. A satellite DNA component assumed to be of mitochondrial origin was present in most strains, with a G + C content ranging from 14.7 to 30.8 mol % G + C. Heterogeneity in microscopic and colonial characteristics was not reflected in differences in the mean G + C content of the chromosomal DNAs. Strains varied in the G + C contents of satelite DNA, but these did not correlate with traditional species concepts.

  3. Sequence Alignment with Dynamic Divisor Generation for Keystroke Dynamics Based User Authentication

    Jiacang Ho

    2015-01-01

    Full Text Available Keystroke dynamics based authentication is one of the prevention mechanisms used to protect one’s account from criminals’ illegal access. In this authentication mechanism, keystroke dynamics are used to capture patterns in a user typing behavior. Sequence alignment is shown to be one of effective algorithms for keystroke dynamics based authentication, by comparing the sequences of keystroke data to detect imposter’s anomalous sequences. In previous research, static divisor has been used for sequence generation from the keystroke data, which is a number used to divide a time difference of keystroke data into an equal-length subinterval. After the division, the subintervals are mapped to alphabet letters to form sequences. One major drawback of this static divisor is that the amount of data for this subinterval generation is often insufficient, which leads to premature termination of subinterval generation and consequently causes inaccurate sequence alignment. To alleviate this problem, we introduce sequence alignment of dynamic divisor (SADD in this paper. In SADD, we use mean of Horner’s rule technique to generate dynamic divisors and apply them to produce the subintervals with different length. The comparative experimental results with SADD and other existing algorithms indicate that SADD is usually comparable to and often outperforms other existing algorithms.

  4. Main: Sequences [KOME

    Full Text Available Sequences Amino Acid Sequence Amino Acid sequence of full length cDNA (Longest ORF) kome_ine_full_seq...uence_amino_db.fasta.zip kome_ine_full_sequence_amino_db.zip kome_ine_full_sequence_amino_db ...

  5. Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

    Martin Mascher

    Full Text Available The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS, a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data; the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, paucity in the number of SNP that were jointly discovered by the different pipelines indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new

  6. Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

    Mascher, Martin; Wu, Shuangye; Amand, Paul St; Stein, Nils; Poland, Jesse

    2013-01-01

    The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL) population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data; the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, paucity in the number of SNP that were jointly discovered by the different pipelines indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new sequencing

  7. Efficient Simulation of Quantum States Based on Classical Fields Modulated with Pseudorandom Phase Sequences

    Fu, Jian

    2010-01-01

    We demonstrate that a tensor product structure could be obtained by introducing pseudorandom phase sequences into classical fields with two orthogonal modes. Using classical fields modulated with pseudorandom phase sequences, we discuss efficient simulation of several typical quantum states, including product state, Bell states, GHZ state, and W state. By performing quadrature demodulation scheme, we could obtain the mode status matrix of the simulating classical fields, based on which we propose a sequence permutation mechanism to reconstruct the simulated quantum states. The research on classical simulation of quantum states is important, for it not only enables potential practical applications in quantum computation, but also provides useful insights into fundamental concepts of quantum mechanics.

  8. MOST: a modified MLST typing tool based on short read sequencing

    Dallman, Timothy; Schaefer, Ulf; Sheppard, Carmen L.; Ashton, Philip; Pichon, Bruno; Ellington, Matthew; Swift, Craig; Green, Jonathan; Underwood, Anthony

    2016-01-01

    Multilocus sequence typing (MLST) is an effective method to describe bacterial populations. Conventionally, MLST involves Polymerase Chain Reaction (PCR) amplification of housekeeping genes followed by Sanger DNA sequencing. Public Health England (PHE) is in the process of replacing the conventional MLST methodology with a method based on short read sequence data derived from Whole Genome Sequencing (WGS). This paper reports the comparison of the reliability of MLST results derived from WGS data, comparing mapping and assembly-based approaches to conventional methods using 323 bacterial genomes of diverse species. The sensitivity of the two WGS based methods were further investigated with 26 mixed and 29 low coverage genomic data sets from Salmonella enteridis and Streptococcus pneumoniae. Of the 323 samples, 92.9% (n = 300), 97.5% (n = 315) and 99.7% (n = 322) full MLST profiles were derived by the conventional method, assembly- and mapping-based approaches, respectively. The concordance between samples that were typed by conventional (92.9%) and both WGS methods was 100%. From the 55 mixed and low coverage genomes, 89.1% (n = 49) and 67.3% (n = 37) full MLST profiles were derived from the mapping and assembly based approaches, respectively. In conclusion, deriving MLST from WGS data is more sensitive than the conventional method. When comparing WGS based methods, the mapping based approach was the most sensitive. In addition, the mapping based approach described here derives quality metrics, which are difficult to determine quantitatively using conventional and WGS-assembly based approaches. PMID:27602279

  9. A new unnatural base pair system between fluorophore and quencher base analogues for nucleic acid-based imaging technology.

    Kimoto, Michiko; Mitsui, Tsuneo; Yamashige, Rie; Sato, Akira; Yokoyama, Shigeyuki; Hirao, Ichiro

    2010-11-03

    In the development of orthogonal extra base pairs for expanding the genetic alphabet, we created novel, unnatural base pairs between fluorophore and quencher nucleobase analogues. We found that the nucleobase analogue, 2-nitropyrrole (denoted by Pn), and its 4-substitutions, such as 2-nitro-4-propynylpyrrole (Px) and 4-[3-(6-aminohexanamido)-1-propynyl]-2-nitropyrrole (NH(2)-hx-Px), act as fluorescence quenchers. The Pn and Px bases specifically pair with their pairing partner, 7-(2,2'-bithien-5-yl)imidazo[4,5-b]pyridine (Dss), which is strongly fluorescent. Thus, these unnatural Dss-Pn and Dss-Px base pairs function as reporter-quencher base pairs, and are complementarily incorporated into DNA by polymerase reactions as a third base pair in combination with the natural A-T and G-C pairs. Due to the static contact quenching, the Pn and Px quencher bases significantly decreased the fluorescence intensity of Dss by the unnatural base pairings in DNA duplexes. In addition, the Dss-Px pair exhibited high efficiency and selectivity in PCR amplification. Thus, this new unnatural base pair system would be suitable for detection methods of target nucleic acid sequences, and here we demonstrated the applications of the Dss-Pn and Dss-Px pairs as molecular beacons and in real-time PCR. The genetic alphabet expansion system with the replicable, unnatural fluorophore-quencher base pair will be a useful tool for sensing and diagnostic applications, as well as an imaging tool for basic research.

  10. Performance of Correspondence Algorithms in Vision-Based Driver Assistance Using an Online Image Sequence Database

    Klette, Reinhard; Krüger, Norbert; Vaudrey, Tobi;

    2011-01-01

    This paper discusses options for testing correspondence algorithms in stereo or motion analysis that are designed or considered for vision-based driver assistance. It introduces a globally available database, with a main focus on testing on video sequences of real-world data. We suggest the class...... rankings of correspondence techniques on sets of basic sequences that show different situations. It is suggested that correspondence techniques should adaptively be chosen in real time using some type of statistical situation classifiers....

  11. Amino Acid sequence analysis of the two major outer Capsid Proteins (VP7 and VP4 from human-derived canine G3P[3] Rotavirus Strain Detected in Brazil

    Adriana Luchs

    2013-12-01

    Full Text Available Introduction: A close look at the rotavirus group A (RVA genotypes in Brazil revealed the detection of a rare G3P[3] strain close related to canine strains. The aim of this study was to add to the already known genetic analysis by the description of the G3P[3] (IAL-R2638 strain amino acid characteristics. Methods: Amino acid sequence analysis and protein based trees were conducted using BioEdit and MEGA 4.0. Results: The VP7 and VP4 protein of the IAL-R2638 strain displayed the highest amino acid identity to the canine-derived human strain HCR3A (99.2%, and to the canine strain RV52/96 (96.4%, respectively. IAL-R2638 strain did not possess an extra VP7 N-linked glycosylation site at amino acid 238 recently described for some G3 strains, as well as RotaTeqTM G3 vaccine strain. The topology exhibited by phylogenetic trees in previous analysis were maintained in the present amino acid-based trees, reinforcing a stable relationship between G3P[3] strains. Conclusions: Amino acid analysis data were consistent with the previous sequence of data obtained for the IAL-R2638 strain, supporting its possible canine origin. Theoretically, RotaTeqTM vaccine could efficiently protect against G3P[3] infections based on the lack of the extra VP7 N-linked glycosylation site at amino acid 238. Phylogenetic analysis hypothesizes that all features undergo evolution independently of each other; however, unfavorable effects of nucleotide substitutions may be compensated by substitutions in other positions. The present study raises the question as to whether the amino acid-based trees could be applied as an approach to the study of RVA evolution, avoiding incorrect phylogenetic reconstructions.

  12. Sequence-selective recognition of double-stranded RNA and enhanced cellular uptake of cationic nucleobase and backbone-modified peptide nucleic acids.

    Hnedzko, Dziyana; McGee, Dennis W; Karamitas, Yannis A; Rozners, Eriks

    2017-01-01

    Sequence-selective recognition of complex RNAs in live cells could find broad applications in biology, biomedical research, and biotechnology. However, specific recognition of structured RNA is challenging, and generally applicable and effective methods are lacking. Recently, we found that peptide nucleic acids (PNAs) were unusually well-suited ligands for recognition of double-stranded RNAs. Herein, we report that 2-aminopyridine (M) modified PNAs and their conjugates with lysine and arginine tripeptides form strong (Ka = 9.4 to 17 × 10(7) M(-1)) and sequence-selective triple helices with RNA hairpins at physiological pH and salt concentration. The affinity of PNA-peptide conjugates for the matched RNA hairpins was unusually high compared to the much lower affinity for DNA hairpins of the same sequence (Ka = 0.05 to 1.1 × 10(7) M(-1)). The binding of double-stranded RNA by M-modified PNA-peptide conjugates was a relatively fast process (kon = 2.9 × 10(4) M(-1) sec(-1)) compared to the notoriously slow triple helix formation by oligodeoxynucleotides (kon ∼ 10(3) M(-1) sec(-1)). M-modified PNA-peptide conjugates were not cytotoxic and were efficiently delivered in the cytosol of HEK293 cells at 10 µM. Surprisingly, M-modified PNAs without peptide conjugation were also taken up by HEK293 cells, which, to the best of our knowledge, is the first example of heterocyclic base modification that enhances the cellular uptake of PNA. Our results suggest that M-modified PNA-peptide conjugates are promising probes for sequence-selective recognition of double-stranded RNA in live cells and other biological systems.

  13. Amino acid sequence and crystal structure of BaP1, a metalloproteinase from Bothrops asper snake venom that exerts multiple tissue-damaging activities

    Watanabe, Leandra; Shannon, John D.; Valente, Richard H.; Rucavado, Alexandra; Alape-Girón, Alberto; Kamiguti, Aura S.; Theakston, R. David G.; Fox, Jay W.; Gutiérrez, José María; Arni, Raghuvir K.

    2003-01-01

    BaP1 is a 22.7-kD P-I-type zinc-dependent metalloproteinase isolated from the venom of the snake Bothrops asper, a medically relevant species in Central America. This enzyme exerts multiple tissue-damaging activities, including hemorrhage, myonecrosis, dermonecrosis, blistering, and edema. BaP1 is a single chain of 202 amino acids that shows highest sequence identity with metalloproteinases isolated from the venoms of snakes of the subfamily Crotalinae. It has six Cys residues involved in three disulfide bridges (Cys 117–Cys 197, Cys 159–Cys 181, Cys 157–Cys 164). It has the consensus sequence H142E143XXH146XXGXXH152, as well as the sequence C164I165M166, which characterize the “metzincin” superfamily of metalloproteinases. The active-site cleft separates a major subdomain (residues 1–152), comprising four α-helices and a five-stranded β-sheet, from the minor subdomain, which is formed by a single α-helix and several loops. The catalytic zinc ion is coordinated by the Nɛ2 nitrogen atoms of His 142, His 146, and His 152, in addition to a solvent water molecule, which in turn is bound to Glu 143. Several conserved residues contribute to the formation of the hydrophobic pocket, and Met 166 serves as a hydrophobic base for the active-site groups. Sequence and structural comparisons of hemorrhagic and nonhemorrhagic P-I metalloproteinases from snake venoms revealed differences in several regions. In particular, the loop comprising residues 153 to 176 has marked structural differences between metalloproteinases with very different hemorrhagic activities. Because this region lies in close proximity to the active-site microenvironment, it may influence the interaction of these enzymes with physiologically relevant substrates in the extracellular matrix. PMID:14500885

  14. Regulated acid-base transport in the collecting duct.

    Wagner, Carsten A; Devuyst, Olivier; Bourgeois, Soline; Mohebbi, Nilufar

    2009-05-01

    The renal collecting system serves the fine-tuning of renal acid-base secretion. Acid-secretory type-A intercalated cells secrete protons via a luminally expressed V-type H(+)-ATPase and generate new bicarbonate released by basolateral chloride/bicarbonate exchangers including the AE1 anion exchanger. Efficient proton secretion depends both on the presence of titratable acids (mainly phosphate) and the concomitant secretion of ammonia being titrated to ammonium. Collecting duct ammonium excretion requires the Rhesus protein RhCG as indicated by recent KO studies. Urinary acid secretion by type-A intercalated cells is strongly regulated by various factors among them acid-base status, angiotensin II and aldosterone, and the Calcium-sensing receptor. Moreover, urinary acidification by H(+)-ATPases is modulated indirectly by the activity of the epithelial sodium channel ENaC. Bicarbonate secretion is achieved by non-type-A intercalated cells characterized by the luminal expression of the chloride/bicarbonate exchanger pendrin. Pendrin activity is driven by H(+)-ATPases and may serve both bicarbonate excretion and chloride reabsorption. The activity and expression of pendrin is regulated by different factors including acid-base status, chloride delivery, and angiotensin II and may play a role in NaCl retention and blood pressure regulation. Finally, the relative abundance of type-A and non-type-A intercalated cells may be tightly regulated. Dysregulation of intercalated cell function or abundance causes various syndromes of distal renal tubular acidosis underlining the importance of these processes for acid-base homeostasis.

  15. Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains

    Branislava Gemovic

    2013-01-01

    Full Text Available There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM, a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.

  16. Troger's base molecular scaffolds in dicarboxylic acid recognition.

    Goswami, S; Ghosh, K; Dasgupta, S

    2000-04-07

    Artificial receptors (1-5) have been designed and synthesized from simple precursors. The chain length selectivity studies of dicarboxylic acids within the cavities of new fluorescent Troger's base molecular frameworks (1-3) have been carried out with a critical examination of their role of rigidity as well as flexibility in selective binding in comparison to receptor 5. The chiral resolution of the racemic Troger's base receptors (1 and 2) by chiral recognition with (+)- camphoric acid using hydrogen-bonding interactions has been studied.

  17. A new EPR dosimeter based on sulfanilic acid

    Maghraby, A. [Radiation Dosimetry Department, National Institute for Standards (NIS), Ministry of Scientific Research, Haram, Giza, Tersa Street, P.O. Box 136 (Egypt)]. E-mail: maghrabism@yahoo.com; Tarek, E. [Physics Department, Faculty of Science, Fayoum University, Fayoum (Egypt)

    2006-02-15

    A new radiation dosimeter based on sulfanilic acid was suggested and tested. Radiation-induced radical EPR spectrum was recorded and characterized. EPR spectrum of sulfanilic acid is characterized by its simple single signal, which is located at g-factor=2.0053+/-0.00017. Its microwave absorption curve and effect of modulation amplitude on signal intensity and line width were studied. The radiation-induced radical was found to be stable after about 24h following irradiation within +/-1.2%. Energy dependence curves of sulfanilic acid compared to soft tissue and alanine were plotted. Sulfanilic acid is characterized also by its isotropic behavior and its calibration curve in {sup 137}Cs gamma radiation is plotted compared to that of alanine. Its sensitivity is about 85% of alanine, while its detection limit is about 100+/-30mGy.

  18. Carbon Dot Based Sensing of Dopamine and Ascorbic Acid

    Upama Baruah

    2014-01-01

    Full Text Available We demonstrate carbon dot based sensor of catecholamine, namely, dopamine and ascorbic acid. Carbon dots (CDs were prepared from a green source: commercially available Assam tea. The carbon dots prepared from tea had particle sizes of ∼0.8 nm and are fluorescent. Fluorescence of the carbon dots was found to be quenched in the presence of dopamine and ascorbic acid with greater sensitivity for dopamine. The minimum detectable limits were determined to be 33 μM and 98 μM for dopamine and ascorbic acid, respectively. The quenching constants determined from Stern-Volmer plot were determined to be 5 × 10−4 and 1 × 10−4 for dopamine and ascorbic acid, respectively. A probable mechanism of quenching has been discussed in the paper.

  19. Amplicon-based metagenomic analysis of mixed fungal samples using proton release amplicon sequencing.

    Daniel P Tonge

    Full Text Available Next generation sequencing technology has revolutionised microbiology by allowing concurrent analysis of whole microbial communities. Here we developed and verified similar methods for the analysis of fungal communities using a proton release sequencing platform with the ability to sequence reads of up to 400 bp in length at significant depth. This read length permits the sequencing of amplicons from commonly used fungal identification regions and thereby taxonomic classification. Using the 400 bp sequencing capability, we have sequenced amplicons from the ITS1, ITS2 and LSU fungal regions to a depth of approximately 700,000 raw reads per sample. Representative operational taxonomic units (OTUs were chosen by the USEARCH algorithm, and identified taxonomically through nucleotide blast (BLASTn. Combination of this sequencing technology with the bioinformatics pipeline allowed species recognition in two controlled fungal spore populations containing members of known identity and concentration. Each species included within the two controlled populations was found to correspond to a representative OTU, and these OTUs were found to be highly accurate representations of true biological sequences. However, the absolute number of reads attributed to each OTU differed among species. The majority of species were represented by an OTU derived from all three genomic regions although in some cases, species were only represented in two of the regions due to the absence of conserved primer binding sites or due to sequence composition. It is apparent from our data that proton release sequencing technologies can deliver a qualitative assessment of the fungal members comprising a sample. The fact that some fungi cannot be amplified by specific "conserved" primer pairs confirms our recommendation that a multi-region approach be taken for other amplicon-based metagenomic studies.

  20. [Clinical Application of Extraction and Analysis of the Key Frames Based on IVUS Sequences].

    Mao, Haiqun; Yang, Feng; Huang, Zheng; Cui, Kai; Wang, Xinxin

    2015-08-01

    In this paper, we propose an image-based key frame gating method to reduce motion artifacts in intravascular ultrasound (IVUS) longitudinal cuts. The artifacts are mainly caused by the periodic relative displacement between blood vessels and the IVUS catheter due to cardiac motion. The method is achieved in four steps as following. Firstly, we convert IVUS image sequences to polar coordinates to cut down the amount of calculation. Secondly, we extracted a one-dimensional signal cluster reflecting cardiac motion by spectral analysis and filtering techniques. Thirdly, we designed a Butterworth band-pass filter for filtering the one-dimensional signal clusters. Fourthly, we retrieved the extremes of the filtered signal clusters to seek key frames to compose key-frames gated sequences. Experimental results showed that our algorithm was fast and the average frame processing time was 17ms. Observing the longitudinal viewpictures, we found that comparing to the original ones, the gated sequences had similar trend, less saw tooth shape, and good continuity. We selected 12 groups of clinical IVUS sequences [images (876 +/- 65 frames), coronary segments length (14.61 +/- 1.08 mm)] to calculate vessel volume, lumen volume, mean plaque burden of the original and gated sequences. Statistical results showed that, on one hand, both vessel volume and lumen volume measured of the gated sequences were significantly smaller than those of the original ones, and there was no significant difference on mean plaque burden between original and gated sequences, which met the need of the clinical diagnosis and treatment. On the other hand, variances of vessel area and lumen area of the gated sequences were significantly smaller than those of the original sequences, indicating that the gated sequences would be more stable than the original ones.

  1. Genome sequence of the copper resistant and acid-tolerant Desulfosporosinus sp. BG isolated from the tailings of a molybdenum-tungsten mine in the Transbaikal area

    Olga V. Karnachuk

    2017-03-01

    Full Text Available Here, we report on the draft genome of a copper-resistant and acidophilic Desulfosporosinus sp. BG, isolated from the tailings of a molybdenum-tungsten mine in Transbaikal area. The draft genome has a size of 4.52 Mb and encodes transporters of heavy metals. The phylogenetic analysis based on concatenated ribosomal proteins revealed that strain BG clusters together with the other acidophilic copper-resistant strains Desulfosporosinus sp. OT and Desulfosporosinus sp. I2. The K+-ATPase, Na+/H+ antiporter and amino acid decarboxylases may participate in enabling growth at low pH. The draft genome sequence and annotation have been deposited at GenBank under the accession number NZ_MASS00000000.

  2. High Interlaboratory Reprocucibility of DNA Sequence-based Typing of Bacteria in a Multicenter Study

    Sousa, MA de; Boye, Kit; Lencastre, H de;

    2006-01-01

    Current DNA amplification-based typing methods for bacterial pathogens often lack interlaboratory reproducibility. In this international study, DNA sequence-based typing of the Staphylococcus aureus protein A gene (spa, 110 to 422 bp) showed 100% intra- and interlaboratory reproducibility without...

  3. Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics.

    Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

    2016-06-17

    With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy.

  4. OrchidBase: a collection of sequences of the transcriptome derived from orchids.

    Fu, Chih-Hsiung; Chen, Yun-Wen; Hsiao, Yu-Yun; Pan, Zhao-Jun; Liu, Zhong-Jian; Huang, Yueh-Min; Tsai, Wen-Chieh; Chen, Hong-Hwa

    2011-02-01

    Orchids are one of the most ecological and evolutionarily significant plants, and the Orchidaceae is one of the most abundant families of the angiosperms. Genetic databases will be useful not only for gene discovery but also for future genomic annotation. For this purpose, OrchidBase was established from 37,979,342 sequence reads collected from 11 in-house Phalaenopsis orchid cDNA libraries. Among them, 41,310 expressed sequence tags (ESTs) were obtained by using Sanger sequencing, whereas 37,908,032 reads were obtained by using next-generation sequencing (NGS) including both Roche 454 and Solexa Illumina sequencers. These reads were assembled into 8,501 contigs and 76,116 singletons, resulting in 84,617 non-redundant transcribed sequences with an average length of 459 bp. The analysis pipeline of the database is an automated system written in Perl and C#, and consists of the following components: automatic pre-processing of EST reads, assembly of raw sequences, annotation of the assembled sequences and storage of the analyzed information in SQL databases. A web application was implemented with HTML and a Microsoft .NET Framework C# program for browsing and querying the database, creating dynamic web pages on the client side, analyzing gene ontology (GO) and mapping annotated enzymes to KEGG pathways. The online resources for putative annotation can be searched either by text or by using BLAST, and the results can be explored on the website and downloaded. Consequently, the establishment of OrchidBase will provide researchers with a high-quality genetic resource for data mining and facilitate efficient experimental studies on orchid biology and biotechnology. The OrchidBase database is freely available at http://lab.fhes.tn.edu.tw/est.

  5. A next generation semiconductor based sequencing approach for the identification of meat species in DNA mixtures.

    Francesca Bertolini

    Full Text Available The identification of the species of origin of meat and meat products is an important issue to prevent and detect frauds that might have economic, ethical and health implications. In this paper we evaluated the potential of the next generation semiconductor based sequencing technology (Ion Torrent Personal Genome Machine for the identification of DNA from meat species (pig, horse, cattle, sheep, rabbit, chicken, turkey, pheasant, duck, goose and pigeon as well as from human and rat in DNA mixtures through the sequencing of PCR products obtained from different couples of universal primers that amplify 12S and 16S rRNA mitochondrial DNA genes. Six libraries were produced including PCR products obtained separately from 13 species or from DNA mixtures containing DNA from all species or only avian or only mammalian species at equimolar concentration or at 1:10 or 1:50 ratios for pig and horse DNA. Sequencing obtained a total of 33,294,511 called nucleotides of which 29,109,688 with Q20 (87.43% in a total of 215,944 reads. Different alignment algorithms were used to assign the species based on sequence data. Error rate calculated after confirmation of the obtained sequences by Sanger sequencing ranged from 0.0003 to 0.02 for the different species. Correlation about the number of reads per species between different libraries was high for mammalian species (0.97 and lower for avian species (0.70. PCR competition limited the efficiency of amplification and sequencing for avian species for some primer pairs. Detection of low level of pig and horse DNA was possible with reads obtained from different primer pairs. The sequencing of the products obtained from different universal PCR primers could be a useful strategy to overcome potential problems of amplification. Based on these results, the Ion Torrent technology can be applied for the identification of meat species in DNA mixtures.

  6. A next generation semiconductor based sequencing approach for the identification of meat species in DNA mixtures.

    Bertolini, Francesca; Ghionda, Marco Ciro; D'Alessandro, Enrico; Geraci, Claudia; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    The identification of the species of origin of meat and meat products is an important issue to prevent and detect frauds that might have economic, ethical and health implications. In this paper we evaluated the potential of the next generation semiconductor based sequencing technology (Ion Torrent Personal Genome Machine) for the identification of DNA from meat species (pig, horse, cattle, sheep, rabbit, chicken, turkey, pheasant, duck, goose and pigeon) as well as from human and rat in DNA mixtures through the sequencing of PCR products obtained from different couples of universal primers that amplify 12S and 16S rRNA mitochondrial DNA genes. Six libraries were produced including PCR products obtained separately from 13 species or from DNA mixtures containing DNA from all species or only avian or only mammalian species at equimolar concentration or at 1:10 or 1:50 ratios for pig and horse DNA. Sequencing obtained a total of 33,294,511 called nucleotides of which 29,109,688 with Q20 (87.43%) in a total of 215,944 reads. Different alignment algorithms were used to assign the species based on sequence data. Error rate calculated after confirmation of the obtained sequences by Sanger sequencing ranged from 0.0003 to 0.02 for the different species. Correlation about the number of reads per species between different libraries was high for mammalian species (0.97) and lower for avian species (0.70). PCR competition limited the efficiency of amplification and sequencing for avian species for some primer pairs. Detection of low level of pig and horse DNA was possible with reads obtained from different primer pairs. The sequencing of the products obtained from different universal PCR primers could be a useful strategy to overcome potential problems of amplification. Based on these results, the Ion Torrent technology can be applied for the identification of meat species in DNA mixtures.

  7. Complete amino acid sequence of Mytilus anterior byssus retractor paramyosin and its putative phosphorylation site.

    Watabe, S; Iwasaki, K; Funabara, D; Hirayama, Y; Nakaya, M; Kikuchi, K

    2000-01-01

    A cDNA encoding the full-length paramyosin molecule was cloned from the mussel Mytilus galloprovincialis, a species closely related to Mytilus edulis. It contained 3,497 nucleotides (nt), with 79 and 826 nt for the 5' and 3' non-coding regions, respectively. The coding region was composed of 2,592 nt for 864 amino acid residues, a size typical of paramyosin. While genomic DNA digests with either HindIII or PstI exhibited a single band when hybridized with a SacI fragment of paramyosin cDNA, the digests with either EcoRV or EcoRI showed two bands, suggesting that the mussel has at least two genes encoding paramyosin. The mRNAs encoding paramyosin were most abundant in muscle tissues from byssus retractor and adductor muscles. Only traces of paramyosin transcripts were found in the tissue of foot, gill, inner mantle, and outer mantle. The same phosphorylatable peptide previously reported for paramyosin from the bivalve Mercenaria mercenaria, Ser-Arg-Ser-Met-Ser(P)-Val-Ser-Arg (Watabe et al. 1989. Comp Biochem Physiol 94B:813-821) was found in the C-terminal non-helical part of this Mytilus paramyosin. We predict that this particular paramyosin has a coiled-coil structure composed of two alpha-helices that show the heptad repeats (a-b-c-d-e-f-g) with further 28-amino acid repeat zones, where a and d tend to be occupied by nonpolar residues.

  8. A knowledge based advisory system for acid/base titrations in non-aqueous solvents

    Bos, M.; Linden, van der W.E.

    1996-01-01

    A computer program was developed that could advice on the choice of solvent and titrant for acid/base titrations in nonaqueous media. It is shown that the feasibility of a titration in a given solvent can be calculated from solvent properties and intrinsic acid/base properties of the sample componen

  9. Using Triple Helix Forming Peptide Nucleic Acids for Sequence-selective Recognition of Double-stranded RNA

    Hnedzko, Dziyana; Cheruiyot, Samwel K.; Rozners, Eriks

    2014-01-01

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple helix forming peptide nucleic acids (PNAs) that bind in the major grove of RNA helix. The strategy developed uses chemically modified nucleobases, such as 2-aminopyridine (M) that enables strong triple helical binding at physiologically relevant conditions, and 2-pyrimidinone (P) and 3-oxo-2,3-dihydropyridazine (E) that enable recognition of isolated pyrimidines in the purine rich strand of the RNA duplex. Detailed protocols for preparation of modified PNA monomers, solid-phase synthesis and HPLC purification of PNA oligomers, and measuring dsRNA binding affinity using isothermal titration calorimetry are included. PMID:25199637

  10. Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

    Freimuth, Paul I.

    2010-04-06

    The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.

  11. Sequence-based identification of microbial contaminants in non-parenteral products

    Rajapandi Senthilraj

    Full Text Available ABSTRACT Phenotypic profiles for microbial identification are unusual for rare, slow-growing and fastidious microorganisms. In the last decade, as a result of the widespread use of PCR and DNA sequencing, 16S rRNA sequencing has played a pivotal role in the accurate identification of microorganisms and the discovery of novel isolates in microbiology laboratories. The 16S rRNA region is universally distributed among microorganisms and is species-specific. Accordingly, the aim of our study was the genotypic identification of microorganisms isolated from non-parenteral pharmaceutical formulations. DNA was separated from five isolates obtained from the formulations. The target regions of the rRNA genes were amplified by PCR and sequenced using suitable primers. The sequence data were analyzed and aligned in the order of increasing genetic distance to relevant sequences against a library database to achieve an identity match. The DNA sequences of the phylogenetic tree results confirmed the identity of the isolates as Bacillus tequilensis, B. subtilis, Staphylococcus haemolyticus and B. amyloliqueficians. It can be concluded that 16S rRNA sequence-based identification reduces the time by circumventing biochemical tests and also increases specificity and accuracy.

  12. Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins.

    Carpena, Pedro; Bernaola-Galván, Pedro A; Carretero-Campos, Concepción; Coronado, Ana V

    2016-11-01

    Symbolic sequences have been extensively investigated in the past few years within the framework of statistical physics. Paradigmatic examples of such sequences are written texts, and deoxyribonucleic acid (DNA) and protein sequences. In these examples, the spatial distribution of a given symbol (a word, a DNA motif, an amino acid) is a key property usually related to the symbol importance in the sequence: The more uneven and far from random the symbol distribution, the higher the relevance of the symbol to the sequence. Thus, many techniques of analysis measure in some way the deviation of the symbol spatial distribution with respect to the random expectation. The problem is then to know the spatial distribution corresponding to randomness, which is typically considered to be either the geometric or the exponential distribution. However, these distributions are only valid for very large symbolic sequences and for many occurrences of the analyzed symbol. Here, we obtain analytically the exact, randomly expected spatial distribution valid for any sequence length and any symbol frequency, and we study its main properties. The knowledge of the distribution allows us to define a measure able to properly quantify the deviation from randomness of the symbol distribution, especially for short sequences and low symbol frequency. We apply the measure to the problem of keyword detection in written texts and to study amino acid clustering in protein sequences. In texts, we show how the results improve with respect to previous methods when short texts are analyzed. In proteins, which are typically short, we show how the measure quantifies unambiguously the amino acid clustering and characterize its spatial distribution.

  13. Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins

    Carpena, Pedro; Bernaola-Galván, Pedro A.; Carretero-Campos, Concepción; Coronado, Ana V.

    2016-11-01

    Symbolic sequences have been extensively investigated in the past few years within the framework of statistical physics. Paradigmatic examples of such sequences are written texts, and deoxyribonucleic acid (DNA) and protein sequences. In these examples, the spatial distribution of a given symbol (a word, a DNA motif, an amino acid) is a key property usually related to the symbol importance in the sequence: The more uneven and far from random the symbol distribution, the higher the relevance of the symbol to the sequence. Thus, many techniques of analysis measure in some way the deviation of the symbol spatial distribution with respect to the random expectation. The problem is then to know the spatial distribution corresponding to randomness, which is typically considered to be either the geometric or the exponential distribution. However, these distributions are only valid for very large symbolic sequences and for many occurrences of the analyzed symbol. Here, we obtain analytically the exact, randomly expected spatial distribution valid for any sequence length and any symbol frequency, and we study its main properties. The knowledge of the distribution allows us to define a measure able to properly quantify the deviation from randomness of the symbol distribution, especially for short sequences and low symbol frequency. We apply the measure to the problem of keyword detection in written texts and to study amino acid clustering in protein sequences. In texts, we show how the results improve with respect to previous methods when short texts are analyzed. In proteins, which are typically short, we show how the measure quantifies unambiguously the amino acid clustering and characterize its spatial distribution.

  14. Acid-base transport in pancreas-new challenges

    Novak, Ivana; Haanes, Kristian Agmund; Wang, Jing

    2013-01-01

    Along the gastrointestinal tract a number of epithelia contribute with acid or basic secretions in order to aid digestive processes. The stomach and pancreas are the most extreme examples of acid (H+) and base (HCO-3) transporters, respectively. Nevertheless, they share the same challenges...... to consider in pancreas are the proton pumps (H-K-ATPases), as well as the calcium-activated K and Cl channels, such as K3.1 and TMEM16A/ANO1. Local regulators, such as purinergic signaling, fine-tune, and coordinate pancreatic secretion. Lastly, we speculate whether dys-regulation of acid-base transport...... contributes to pancreatic diseases including cystic fibrosis, pancreatitis, and cancer. © 2013 Novak, Haanes and Wang....

  15. Construction of a phylogenetic tree of photosynthetic prokaryotes based on average similarities of whole genome sequences.

    Soichirou Satoh

    Full Text Available Phylogenetic trees have been constructed for a wide range of organisms using gene sequence information, especially through the identification of orthologous genes that have been vertically inherited. The number of available complete genome sequences is rapidly increasing, and many tools for construction of genome trees based on whole genome sequences have been proposed. However, development of a reasonable method of using complete genome sequences for construction of phylogenetic trees has not been established. We have developed a method for construction of phylogenetic trees based on the average sequence similarities of whole genome sequences. We used this method to examine the phylogeny of 115 photosynthetic prokaryotes, i.e., cyanobacteria, Chlorobi, proteobacteria, Chloroflexi, Firmicutes and nonphotosynthetic organisms including Archaea. Although the bootstrap values for the branching order of phyla were low, probably due to lateral gene transfer and saturated mutation, the obtained tree was largely consistent with the previously reported phylogenetic trees, indicating that this method is a robust alternative to traditional phylogenetic methods.

  16. JiffyNet: a web-based instant protein network modeler for newly sequenced species.

    Kim, Eiru; Kim, Hanhae; Lee, Insuk

    2013-07-01

    Revolutionary DNA sequencing technology has enabled affordable genome sequencing for numerous species. Thousands of species already have completely decoded genomes, and tens of thousands more are in progress. Naturally, parallel expansion of the functional parts list library is anticipated, yet genome-level understanding of function also requires maps of functional relationships, such as functional protein networks. Such networks have been constructed for many sequenced species including common model organisms. Nevertheless, the majority of species with sequenced genomes still have no protein network models available. Moreover, biologists might want to obtain protein networks for their species of interest on completion of the genome projects. Therefore, there is high demand for accessible means to automatically construct genome-scale protein networks based on sequence information from genome projects only. Here, we present a public web server, JiffyNet, specifically designed to instantly construct genome-scale protein networks based on associalogs (functional associations transferred from a template network by orthology) for a query species with only protein sequences provided. Assessment of the networks by JiffyNet demonstrated generally high predictive ability for pathway annotations. Furthermore, JiffyNet provides network visualization and analysis pages for wide variety of molecular concepts to facilitate network-guided hypothesis generation. JiffyNet is freely accessible at http://www.jiffynet.org.

  17. Molecular mechanisms of acid-base sensing by the kidney.

    Brown, Dennis; Wagner, Carsten A

    2012-05-01

    A major function of the kidney is to collaborate with the respiratory system to maintain systemic acid-base status within limits compatible with normal cell and organ function. It achieves this by regulating the excretion and recovery of bicarbonate (mainly in the proximal tubule) and the secretion of buffered protons (mainly in the distal tubule and collecting duct). How proximal tubular cells and distal professional proton transporting (intercalated) cells sense and respond to changes in pH, bicarbonate, and CO(2) status is a question that has intrigued many generations of renal physiologists. Over the past few years, however, some candidate molecular pH sensors have been identified, including acid/alkali-sensing receptors (GPR4, InsR-RR), kinases (Pyk2, ErbB1/2), pH-sensitive ion channels (ASICs, TASK, ROMK), and the bicarbonate-stimulated adenylyl cyclase (sAC). Some acid-sensing mechanisms in other tissues, such as CAII-PDK2L1 in taste buds, might also have similar roles to play in the kidney. Finally, the function of a variety of additional membrane channels and transporters is altered by pH variations both within and outside the cell, and the expression of several metabolic enzymes are altered by acid-base status in parts of the nephron. Thus, it is possible that a master pH sensor will never be identified. Rather, the kidney seems equipped with a battery of molecules that scan the epithelial cell environment to mount a coordinated physiologic response that maintains acid-base homeostasis. This review collates current knowledge on renal acid-base sensing in the context of a whole organ sensing and response process.

  18. ParticleCall: A particle filter for base calling in next-generation sequencing systems

    Shen Xiaohu

    2012-07-01

    Full Text Available Abstract Background Next-generation sequencing systems are capable of rapid and cost-effective DNA sequencing, thus enabling routine sequencing tasks and taking us one step closer to personalized medicine. Accuracy and lengths of their reads, however, are yet to surpass those provided by the conventional Sanger sequencing method. This motivates the search for computationally efficient algorithms capable of reliable and accurate detection of the order of nucleotides in short DNA fragments from the acquired data. Results In this paper, we consider Illumina’s sequencing-by-synthesis platform which relies on reversible terminator chemistry and describe the acquired signal by reformulating its mathematical model as a Hidden Markov Model. Relying on this model and sequential Monte Carlo methods, we develop a parameter estimation and base calling scheme called ParticleCall. ParticleCall is tested on a data set obtained by sequencing phiX174 bacteriophage using Illumina’s Genome Analyzer II. The results show that the developed base calling scheme is significantly more computationally efficient than the best performing unsupervised method currently available, while achieving the same accuracy. Conclusions The proposed ParticleCall provides more accurate calls than the Illumina’s base calling algorithm, Bustard. At the same time, ParticleCall is significantly more computationally efficient than other recent schemes with similar performance, rendering it more feasible for high-throughput sequencing data analysis. Improvement of base calling accuracy will have immediate beneficial effects on the performance of downstream applications such as SNP and genotype calling. ParticleCall is freely available at https://sourceforge.net/projects/particlecall.

  19. Weather data analysis based on typical weather sequence analysis. Application: energy building simulation

    David, Mathieu; Garde, Francois; Boyer, Harry

    2014-01-01

    In building studies dealing about energy efficiency and comfort, simulation software need relevant weather files with optimal time steps. Few tools generate extreme and mean values of simultaneous hourly data including correlation between the climatic parameters. This paper presents the C++ Runeole software based on typical weather sequences analysis. It runs an analysis process of a stochastic continuous multivariable phenomenon with frequencies properties applied to a climatic database. The database analysis associates basic statistics, PCA (Principal Component Analysis) and automatic classifications. Different ways of applying these methods will be presented. All the results are stored in the Runeole internal database that allows an easy selection of weather sequences. The extreme sequences are used for system and building sizing and the mean sequences are used for the determination of the annual cooling loads as proposed by Audrier-Cros (Audrier-Cros, 1984). This weather analysis was tested with the datab...

  20. An Optimal Sorting of Pulse Amplitude Sequence Based on the Phased Array Radar Beam Tasks

    Chuan Sheng∗,Yongshun Zhang; Wenlong Lu

    2016-01-01

    The study of phased array radar ( PAR) pulse amplitude sequence characteristics is the key to understand the radar’s working state and its beam’s scanning manner. According to the principle of antenna pattern formation and the searching and tracking modes of beams, this paper analyzes the characteristics and differences of pulse amplitude sequence when the radar beams work in searching and tracking modes respectively. Then an optimal sorting model of pulse amplitude sequence is established based on least⁃squares and curve⁃fitting methods. This method is helpful for acquiring the current working state of the radar and recognizing its instantaneous beam pointing by sorting the pulse amplitude sequence without the necessity to estimate the antenna pattern.