close sequence comparisons: Topics by WorldWideScience.org

Sample records for close sequence comparisons

Close Sequence Comparisons are Sufficient to Identify Human cis-Regulatory Elements

Energy Technology Data Exchange (ETDEWEB)

Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

2005-12-01

Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.
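The abstract above says conserved elements are ranked by P-value using Karlin-Altschul statistics. As a rough illustration only, the standard Karlin-Altschul relation converts an alignment-style score S into an expected count E = K·m·n·exp(-λS) and a P-value P = 1 - exp(-E). The function name and the parameter values below are assumptions for demonstration; they are not taken from Gumby, whose scoring scheme and parameter estimation are not described here.

```python
import math


def karlin_altschul_pvalue(score, m, n, K, lam):
    """P-value of seeing at least one segment scoring >= `score` by chance.

    m, n : lengths of the two compared sequences (search-space size m * n)
    K, lam : Karlin-Altschul parameters; in practice these depend on the
             scoring system and background frequencies (hypothetical here).
    """
    # Expected number of chance segments with score >= `score`.
    expected = K * m * n * math.exp(-lam * score)
    # P = 1 - exp(-E); expm1 keeps precision when E is very small.
    return -math.expm1(-expected)


if __name__ == "__main__":
    # Illustrative numbers only: a higher score gives a smaller P-value.
    for s in (20, 40, 60):
        print(s, karlin_altschul_pvalue(s, m=10_000, n=10_000, K=0.1, lam=0.3))
```

Ranking candidate elements by this P-value rather than by raw score makes results comparable across comparisons of different size, which is one plausible reason for using such statistics.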
Sequence Comparison: Close and Open problems

NARCIS (Netherlands)

Lenzini, Gabriele; Cerrai, P.; Freguglia, P.

Comparing sequences is a very important activity both in computer science and in many other areas as well. For example, thanks to text editors, everyone knows the particular instance of a sequence comparison problem known as the ``string matching problem''. It consists in searching a given work
eShadow: A tool for comparing closely related sequences

Energy Technology Data Exchange (ETDEWEB)

Ovcharenko, Ivan; Boffelli, Dario; Loots, Gabriela G.

2004-01-15

Primate sequence comparisons are difficult to interpret due to the high degree of sequence similarity shared between such closely related species. Recently, a novel method, phylogenetic shadowing, has been pioneered for predicting functional elements in the human genome through the analysis of multiple primate sequence alignments. We have expanded this theoretical approach to create a computational tool, eShadow, for the identification of elements under selective pressure in multiple sequence alignments of closely related genomes, such as in comparisons of human to primate or mouse to rat DNA. This tool integrates two different statistical methods and allows for the dynamic visualization of the resulting conservation profile. eShadow also includes a versatile optimization module capable of training the underlying Hidden Markov Model to differentially predict functional sequences. This module grants the tool high flexibility in the analysis of multiple sequence alignments and in comparing sequences with different divergence rates. Here, we describe the eShadow comparative tool and its potential uses for analyzing both multiple nucleotide and protein alignments to predict putative functional elements. The eShadow tool is publicly available at http://eshadow.dcode.org/
M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species

Directory of Open Access Journals (Sweden)

Messeguer Xavier

2006-10-01

Abstract Background Due to recent advances in whole genome shotgun sequencing and assembly technologies, the financial cost of decoding an organism's DNA has been drastically reduced, resulting in a recent explosion of genomic sequencing projects. This increase in related genomic data will allow for in-depth studies of evolution in closely related species through multiple whole genome comparisons. Results To facilitate such comparisons, we present an interactive multiple genome comparison and alignment tool, M-GCAT, that can efficiently construct multiple genome comparison frameworks in closely related species. M-GCAT is able to compare and identify highly conserved regions in up to 20 closely related bacterial species in minutes on a standard computer, and as many as 90 (containing 75 cloned genomes from a set of 15 published enterobacterial genomes) in an hour. M-GCAT also incorporates a novel comparative genomics data visualization interface allowing the user to globally and locally examine and inspect the conserved regions and gene annotations. Conclusion M-GCAT is an interactive comparative genomics tool well suited for quickly generating multiple genome comparison frameworks and alignments among closely related species. M-GCAT is freely available for download for academic and non-commercial use at: http://alggen.lsi.upc.es/recerca/align/mgcat/intro-mgcat.html.
Genomic 3' terminal sequence comparison of three isolates of rabbit haemorrhagic disease virus.

Science.gov (United States)

Milton, I D; Vlasak, R; Nowotny, N; Rodak, L; Carter, M J

1992-05-15

Comparison of sequence data is necessary in order to investigate virus origins, identify features common to virulent strains, and characterize genomic organization within virus families. A virulent caliciviral disease of rabbits recently emerged in China. We have sequenced 1100 bases from the 3' ends of two independent European isolates of this virus, and compared these with previously determined calicivirus sequences. Rabbit caliciviruses were closely related, despite the different countries in which isolation was made. This supports the rapid spread of a new virus across Europe. The capsid protein sequences of these rabbit viruses differ markedly from those determined for feline calicivirus, but a hypothetical 3' open reading frame is relatively well conserved between the caliciviruses of these two different hosts and argues for a functional role.
Close sequence identity between ribosomal DNA episomes of the ...

Indian Academy of Sciences (India)

Unknown

The restriction map of the E. dispar rDNA circle showed close similarity to EhR1 .... for 30 cycles in a DNA Thermal cycler (MJ Research, USA). 3. .... by asterisk. The gaps show the variation between E. dispar and E. histolytica sequences.
Improving pairwise comparison of protein sequences with domain co-occurrence

Science.gov (United States)

Gascuel, Olivier

2018-01-01

Comparing and aligning protein sequences is an essential task in bioinformatics. More specifically, local alignment tools like BLAST are widely used for identifying conserved protein sub-sequences, which likely correspond to protein domains or functional motifs. However, to limit the number of false positives, these tools are used with stringent sequence-similarity thresholds and hence can miss several hits, especially for species that are phylogenetically distant from reference organisms. A solution to this problem is then to integrate additional contextual information to the procedure. Here, we propose to use domain co-occurrence to increase the sensitivity of pairwise sequence comparisons. Domain co-occurrence is a strong feature of proteins, since most protein domains tend to appear with a limited number of other domains on the same protein. We propose a method to take this information into account in a typical BLAST analysis and to construct new domain families on the basis of these results. We used Plasmodium falciparum as a case study to evaluate our method. The experimental findings showed an increase of 14% of the number of significant BLAST hits and an increase of 25% of the proteome area that can be covered with a domain. Our method identified 2240 new domains for which, in most cases, no model of the Pfam database could be linked. Moreover, our study of the quality of the new domains in terms of alignment and physicochemical properties shows that they are close to those of standard Pfam domains. Source code of the proposed approach and supplementary data are available at: https://gite.lirmm.fr/menichelli/pairwise-comparison-with-cooccurrence PMID:29293498
Sequence comparison and phylogenetic analysis of core gene of ...

African Journals Online (AJOL)

Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...
Intra-species sequence comparisons for annotating genomes

Energy Technology Data Exchange (ETDEWEB)

Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

2004-07-15

Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C. intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom, and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study have been submitted to GenBank under accession nos. AY667278-AY667407.
Complete genome sequence of the industrial bacterium Bacillus licheniformis and comparisons with closely related Bacillus species

Science.gov (United States)

Rey, Michael W; Ramaiya, Preethi; Nelson, Beth A; Brody-Karpin, Shari D; Zaretsky, Elizabeth J; Tang, Maria; de Leon, Alfredo Lopez; Xiang, Henry; Gusti, Veronica; Clausen, Ib Groth; Olsen, Peter B; Rasmussen, Michael D; Andersen, Jens T; Jørgensen, Per L; Larsen, Thomas S; Sorokin, Alexei; Bolotin, Alexander; Lapidus, Alla; Galleron, Nathalie; Ehrlich, S Dusko; Berka, Randy M

2004-01-01

Background Bacillus licheniformis is a Gram-positive, spore-forming soil bacterium that is used in the biotechnology industry to manufacture enzymes, antibiotics, biochemicals and consumer products. This species is closely related to the well studied model organism Bacillus subtilis, and produces an assortment of extracellular enzymes that may contribute to nutrient cycling in nature. Results We determined the complete nucleotide sequence of the B. licheniformis ATCC 14580 genome which comprises a circular chromosome of 4,222,336 base-pairs (bp) containing 4,208 predicted protein-coding genes with an average size of 873 bp, seven rRNA operons, and 72 tRNA genes. The B. licheniformis chromosome contains large regions that are colinear with the genomes of B. subtilis and Bacillus halodurans, and approximately 80% of the predicted B. licheniformis coding sequences have B. subtilis orthologs. Conclusions Despite the unmistakable organizational similarities between the B. licheniformis and B. subtilis genomes, there are notable differences in the numbers and locations of prophages, transposable elements and a number of extracellular enzymes and secondary metabolic pathway operons that distinguish these species. Differences include a region of more than 80 kilobases (kb) that comprises a cluster of polyketide synthase genes and a second operon of 38 kb encoding plipastatin synthase enzymes that are absent in the B. licheniformis genome. The availability of a completed genome sequence for B. licheniformis should facilitate the design and construction of improved industrial strains and allow for comparative genomics and evolutionary studies within this group of Bacillaceae. PMID:15461803
Substrate-driven mapping of the degradome by comparison of sequence logos.

Directory of Open Access Journals (Sweden)

Julian E Fuchs

Sequence logos are frequently used to illustrate substrate preferences and specificity of proteases. Here, we employed the compiled substrates of the MEROPS database to introduce a novel metric for comparison of protease substrate preferences. The constructed similarity matrix of 62 proteases can be used to intuitively visualize similarities in protease substrate readout via principal component analysis and construction of protease specificity trees. Since our new metric is solely based on substrate data, we can engraft the protease tree including proteolytic enzymes of different evolutionary origin. Thereby, our analyses confirm pronounced overlaps in substrate recognition not only between proteases closely related on sequence basis but also between proteolytic enzymes of different evolutionary origin and catalytic type. To illustrate the applicability of our approach we analyze the distribution of targets of small molecules from the ChEMBL database in our substrate-based protease specificity trees. We observe a striking clustering of annotated targets in tree branches even though these grouped targets do not necessarily share similarity on protein sequence level. This highlights the value and applicability of knowledge acquired from peptide substrates in drug design of small molecules, e.g., for the prediction of off-target effects or drug repurposing. Consequently, our similarity metric allows mapping of the degradome and its associated drug target network via comparison of known substrate peptides. The substrate-driven view of protein-protein interfaces is not limited to the field of proteases but can be applied to any target class where a sufficient amount of known substrate data is available.
Clinical evaluation of further-developed MRCP sequences in comparison with standard MRCP sequences

International Nuclear Information System (INIS)

Hundt, W.; Scheidler, J.; Reiser, M.; Petsch, R.

2002-01-01

The purpose of this study was the comparison of technically improved single-shot magnetic resonance cholangiopancreatography (MRCP) sequences with standard single-shot rapid acquisition with relaxation enhancement (RARE) and half-Fourier acquired single-shot turbo spin-echo (HASTE) sequences in evaluating the normal and abnormal biliary duct system. The bile duct system of 45 patients was prospectively investigated on a 1.5-T MRI system. The investigation was performed with RARE and HASTE MR cholangiography sequences with standard and high spatial resolutions, and with a delayed-echo half-Fourier RARE (HASTE) sequence. Findings of the improved MRCP sequences were compared with the standard MRCP sequences. The level of confidence in assessing the diagnosis was divided into five groups. The Wilcoxon signed-rank test at a level of p<0.05 was applied. In 15 patients no pathology was found. The MRCP showed stenoses of the bile duct system in 10 patients and choledocholithiasis and cholecystolithiasis in 16 patients. In 12 patients a dilatation of the bile duct system was found. Comparison of the low- and high spatial resolution sequences and the short and long TE times of the half-Fourier RARE (HASTE) sequence revealed no statistically significant differences regarding accuracy of the examination. The diagnostic confidence level in assessing normal or pathological findings for the high-resolution RARE and half-Fourier RARE (HASTE) was significantly better than for the standard sequences. For the delayed-echo half-Fourier RARE (HASTE) sequence no statistically significant difference was seen. The high-resolution RARE and half-Fourier RARE (HASTE) sequences had a higher confidence level, but there was no significant difference in diagnosis in terms of detection and assessment of pathological changes in the biliary duct system compared with standard sequences. (orig.)
A Comparison of the First Two Sequenced Chloroplast Genomes in Asteraceae: Lettuce and Sunflower

Energy Technology Data Exchange (ETDEWEB)

Timme, Ruth E.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Jansen, Robert K.

2006-01-20

Asteraceae is the second largest family of plants, with over 20,000 species. For the past few decades, numerous phylogenetic studies have contributed to our understanding of the evolutionary relationships within this family, including comparisons of the fast evolving chloroplast gene, ndhF, rbcL, as well as non-coding DNA from the trnL intron plus the trnL-trnF intergenic spacer, matK, and, with lesser resolution, psbA-trnH. This culminated in a study by Panero and Funk in 2002 that used over 13,000 bp per taxon for the largest taxonomic revision of Asteraceae in over a hundred years. Still, some uncertainties remain, and it would be very useful to have more information on the relative rates of sequence evolution among various genes and on genome structure as a potential set of phylogenetic characters to help guide future phylogenetic structures. By way of contributing to this, we report the first two complete chloroplast genome sequences from members of the Asteraceae, those of Helianthus annuus and Lactuca sativa. These plants belong to two distantly related subfamilies, Asteroideae and Cichorioideae, respectively. In addition to these, there is only one other published chloroplast genome sequence for any plant within the larger group called Euasterids II, that of Panax ginseng (Araliaceae, 156,318 bps, AY582139). Early chloroplast genome mapping studies demonstrated that H. annuus and L. sativa share a 22 kb inversion relative to members of the subfamily Barnadesioideae. By comparison to outgroups, this inversion was shown to be derived, indicating that the Asteroideae and Cichorioideae are more closely related than either is to the Barnadesioideae. A later sequencing study found that taxa that share this 22 kb inversion also contain within this region a second, smaller, 3.3 kb inversion. These sequences also enable an analysis of patterns of shared repeats in the genomes at a fine level and of RNA editing by comparison to available EST sequences. In addition, since
IDENTIFICATION OF AVIAN-SPECIFIC FECAL METAGENOMIC SEQUENCES USING GENOME FRAGMENT ENRICHMENTS

Science.gov (United States)

Sequence analysis of microbial genomes has provided biologists the opportunity to compare genetic differences between closely related microorganisms. While random sequencing has also been used to study natural microbial communities, metagenomic comparisons via sequencing analysis...
Method and apparatus for biological sequence comparison

Science.gov (United States)

Marr, T.G.; Chang, W.I.

1997-12-23

A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.
Dynamic programming algorithms for biological sequence comparison.

Science.gov (United States)

Pearson, W R; Miller, W

1992-01-01

Efficient dynamic programming algorithms are available for a broad class of protein and DNA sequence comparison problems. These algorithms require computer time proportional to the product of the lengths of the two sequences being compared [O(N^2)] but require memory space proportional only to the sum of these lengths [O(N)]. Although the requirement for O(N^2) time limits use of the algorithms to the largest computers when searching protein and DNA sequence databases, many other applications of these algorithms, such as calculation of distances for evolutionary trees and comparison of a new sequence to a library of sequence profiles, are well within the capabilities of desktop computers. In particular, the results of library searches with rapid searching programs, such as FASTA or BLAST, should be confirmed by performing a rigorous optimal alignment. Whereas rapid methods do not overlook significant sequence similarities, FASTA limits the number of gaps that can be inserted into an alignment, so that a rigorous alignment may extend the alignment substantially in some cases. BLAST does not allow gaps in the local regions that it reports; a calculation that allows gaps is very likely to extend the alignment substantially. Although a Monte Carlo evaluation of the statistical significance of a similarity score with a rigorous algorithm is much slower than the heuristic approach used by the RDF2 program, the dynamic programming approach should take less than 1 hr on a 386-based PC or desktop Unix workstation. For descriptive purposes, we have limited our discussion to methods for calculating similarity scores and distances that use gap penalties of the form g = rk. Nevertheless, programs for the more general case (g = q+rk) are readily available. Versions of these programs that run either on Unix workstations, IBM-PC class computers, or the Macintosh can be obtained from either of the authors.
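The time and memory bounds described above can be sketched with a score-only global comparison that keeps just two rows of the dynamic programming matrix. This is a minimal illustration, not the authors' programs: the function name and the match, mismatch and r values are assumptions, and the gap penalty has the simple form g = rk (a gap of length k costs r per position).

```python
def similarity_score(a, b, match=1, mismatch=-1, r=2):
    """Global similarity score of sequences a and b with gap penalty g = r*k.

    Time is proportional to len(a) * len(b); memory is proportional to
    len(b), because only the previous and current rows are stored.
    """
    # Row 0: aligning the empty prefix of a against j letters of b costs j gaps.
    prev = [-r * j for j in range(len(b) + 1)]
    for i, ca in enumerate(a, start=1):
        # Column 0: aligning i letters of a against the empty prefix of b.
        curr = [-r * i]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)  # substitution
            up = prev[j] - r        # gap in b
            left = curr[j - 1] - r  # gap in a
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(similarity_score("GATTACA", "GATTACA"))  # identical sequences
    print(similarity_score("ACGT", "AGT"))         # one deleted base
```

This version returns only the score. Recovering the alignment itself in linear space needs a divide-and-conquer step on top of the same recurrence, which is omitted here.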
Extensive sequence divergence among bovine respiratory syncytial viruses isolated during recurrent outbreaks in closed herds

DEFF Research Database (Denmark)

Larsen, Lars Erik; Tjørnehøj, Kirsten; Viuff, B.

2000-01-01

The nucleotides coding for the extracellular part of the G glycoprotein and the full SH protein of bovine respiratory syncytial virus (BRSV) were sequenced from viruses isolated from numerous outbreaks of BRSV infection. The isolates included viruses isolated from the same herd (closed dairy farms and veal calf production units) in different years and from all confirmed outbreaks in Denmark within a short period. The results showed that identical viruses were isolated within a herd during outbreaks and that viruses from recurrent infections varied by up to 11% in sequence even in closed herds. It is possible that a quasispecies variant swarm of BRSV persisted in some of the calves in each herd and that a new and different highly fit virus type (master and consensus sequence) became dominant and spread from a single animal in connection with each new outbreak. Based on the high level of diversity...
Does the sequence of data collection influence participants' responses to closed and open-ended questions? A methodological study.

Science.gov (United States)

Covell, Christine L; Sidani, Souraya; Ritchie, Judith A

2012-06-01

The sequence used for collecting quantitative and qualitative data in concurrent mixed-methods research may influence participants' responses. Empirical evidence is needed to determine if the order of data collection in concurrent mixed methods research biases participants' responses to closed and open-ended questions. To examine the influence of the quantitative-qualitative sequence on responses to closed and open-ended questions when assessing the same variables or aspects of a phenomenon simultaneously within the same study phase. A descriptive cross-sectional, concurrent mixed-methods design was used to collect quantitative (survey) and qualitative (interview) data. The setting was a large multi-site health care centre in Canada. A convenience sample of 50 registered nurses was selected and participated in the study. Participants were randomly assigned to one of two sequences for data collection, quantitative-qualitative or qualitative-quantitative. Independent t-tests were performed to compare the two groups' responses to the survey items. Directed content analysis was used to compare the participants' responses to the interview questions. The sequence of data collection did not greatly affect the participants' responses to the closed-ended questions (survey items) or the open-ended questions (interview questions). The sequencing of data collection, when using both survey and semi-structured interviews, may not bias participants' responses to closed or open-ended questions. Additional research is required to confirm these findings. Copyright © 2011 Elsevier Ltd. All rights reserved.
Detection of Weakly Conserved Ancestral Mammalian Regulatory Sequences by Primate Comparisons

Energy Technology Data Exchange (ETDEWEB)

Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng, Jan-Fang; Rubin, Edward M.; Boffelli, Dario

2006-06-01

Genomic comparisons between human and distant, non-primate mammals are commonly used to identify cis-regulatory elements based on constrained sequence evolution. However, these methods fail to detect cryptic functional elements, which are too weakly conserved among mammals to distinguish from nonfunctional DNA. To address this problem, we explored the potential of deep intra-primate sequence comparisons. We sequenced the orthologs of 558 kb of human genomic sequence, covering multiple loci involved in cholesterol homeostasis, in 6 nonhuman primates. Our analysis identified 6 noncoding DNA elements displaying significant conservation among primates, but undetectable in more distant comparisons. In vitro and in vivo tests revealed that at least three of these 6 elements have regulatory function. Notably, the mouse orthologs of these three functional human sequences had regulatory activity despite their lack of significant sequence conservation, indicating that they are cryptic ancestral cis-regulatory elements. These regulatory elements could still be detected in a smaller set of three primate species including human, rhesus and marmoset. Since the human and rhesus genome sequences are already available, and the marmoset genome is actively being sequenced, the primate-specific conservation analysis described here can be applied in the near future on a whole-genome scale, to complement the annotation provided by more distant species comparisons.
Quantitative comparison between a multiecho sequence and a single-echo sequence for susceptibility-weighted phase imaging.

Science.gov (United States)

Gilbert, Guillaume; Savard, Geneviève; Bard, Céline; Beaudoin, Gilles

2012-06-01

The aim of this study was to investigate the benefits arising from the use of a multiecho sequence for susceptibility-weighted phase imaging using a quantitative comparison with a standard single-echo acquisition. Four healthy adult volunteers were imaged on a clinical 3-T system using a protocol comprising two different three-dimensional susceptibility-weighted gradient-echo sequences: a standard single-echo sequence and a multiecho sequence. Both sequences were repeated twice in order to evaluate the local noise contribution by a subtraction of the two acquisitions. For the multiecho sequence, the phase information from each echo was independently unwrapped, and the background field contribution was removed using either homodyne filtering or the projection onto dipole fields method. The phase information from all echoes was then combined using a weighted linear regression. R2 maps were also calculated from the multiecho acquisitions. The noise standard deviation in the reconstructed phase images was evaluated for six manually segmented regions of interest (frontal white matter, posterior white matter, globus pallidus, putamen, caudate nucleus and lateral ventricle). The use of the multiecho sequence for susceptibility-weighted phase imaging led to a reduction of the noise standard deviation for all subjects and all regions of interest investigated in comparison to the reference single-echo acquisition. On average, the noise reduction ranged from 18.4% for the globus pallidus to 47.9% for the lateral ventricle. In addition, the amount of noise reduction was found to be strongly inversely correlated to the estimated R2 value (R=-0.92). In conclusion, the use of a multiecho sequence is an effective way to decrease the noise contribution in susceptibility-weighted phase images, while preserving both contrast and acquisition time. The proposed approach additionally permits the calculation of R2 maps. Copyright © 2012 Elsevier Inc. All rights reserved.
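Two steps in the abstract above lend themselves to a small numerical sketch: estimating noise from the subtraction of two repeated acquisitions, and combining per-echo phase values with a weighted linear regression. The code below is an assumption-laden illustration for a single voxel or region, not the study's pipeline: the function names, the echo times and the weights are hypothetical, and the abstract does not state which weights were used.

```python
import math
import statistics


def noise_std_from_pair(acq1, acq2):
    """Noise standard deviation estimated from two repeated acquisitions.

    If both acquisitions carry independent noise of standard deviation sigma,
    their difference has standard deviation sigma * sqrt(2).
    """
    diffs = [x - y for x, y in zip(acq1, acq2)]
    return statistics.pstdev(diffs) / math.sqrt(2)


def weighted_linear_fit(te, phase, weights):
    """Weighted least-squares fit phase ~ intercept + slope * te (closed form)."""
    sw = sum(weights)
    sx = sum(w * x for w, x in zip(weights, te))
    sy = sum(w * y for w, y in zip(weights, phase))
    sxx = sum(w * x * x for w, x in zip(weights, te))
    sxy = sum(w * x * y for w, x, y in zip(weights, te, phase))
    slope = (sw * sxy - sx * sy) / (sw * sxx - sx * sx)
    intercept = (sy - slope * sx) / sw
    return intercept, slope


if __name__ == "__main__":
    te = [5.0, 10.0, 15.0]     # echo times in ms (hypothetical)
    phase = [0.1, 0.2, 0.3]    # unwrapped phase in rad (hypothetical)
    weights = [3.0, 2.0, 1.0]  # e.g. larger weight for higher-SNR echoes (assumption)
    print(weighted_linear_fit(te, phase, weights))
```

The fitted slope plays the role of a combined phase evolution rate across echoes; pooling several echoes in this way is one route to the lower noise that the abstract reports.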

Testing statistical significance scores of sequence comparison methods with structure similarity

Directory of Open Access Journals (Sweden)

Leunissen Jack AM

2006-10-01

Abstract Background In the past years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search is not only determined by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. Results All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
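A Monte-Carlo Z-score of the kind mentioned above can be sketched as follows: score the real pair, rescore against many shuffled copies of one sequence, and express the real score in standard deviations above the shuffled mean. This is a minimal sketch under stated assumptions: the scoring values, the number of shuffles, the simple match/mismatch scheme (instead of a substitution matrix) and the function names are illustrative and are not those used by CluSTr, Protein World or SSEARCH.

```python
import random
import statistics


def local_score(a, b, match=2, mismatch=-1, gap=2):
    """Smith-Waterman local alignment score with a linear gap penalty."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            s = max(
                0,                                                # start a new alignment
                prev[j - 1] + (match if ca == cb else mismatch),  # substitution
                prev[j] - gap,                                    # gap in b
                curr[j - 1] - gap,                                # gap in a
            )
            curr.append(s)
            best = max(best, s)
        prev = curr
    return best


def monte_carlo_z_score(a, b, n_shuffles=100, seed=0):
    """Z-score of the real score against scores for shuffled copies of b."""
    rng = random.Random(seed)
    real = local_score(a, b)
    letters = list(b)
    shuffled_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)  # preserves the composition of b, destroys its order
        shuffled_scores.append(local_score(a, "".join(letters)))
    mean = statistics.mean(shuffled_scores)
    std = statistics.pstdev(shuffled_scores)
    if std == 0:
        return 0.0  # degenerate case: shuffling never changes the score
    return (real - mean) / std


if __name__ == "__main__":
    print(monte_carlo_z_score("ACDEFGHIKLMNPQRSTVWY", "ACDEFGHIKLMNPQRSTVWY"))
```

Each Z-score costs n_shuffles extra alignments, which illustrates why the abstract calls the approach compute-intensive compared with an analytically derived e-value.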
USE OF COMPETITIVE DNA HYBRIDIZATION TO IDENTIFY DIFFERENCES IN THE GENOMES OF TWO CLOSELY RELATED FECAL INDICATOR BACTERIA

Science.gov (United States)

Although recent technological advances in DNA sequencing and computational biology now allow scientists to compare entire microbial genomes, comparisons of closely related bacterial species and individual isolates by whole-genome sequencing approaches remains prohibitively expens...
The Pathogenomic Sequence Analysis of B. cereus and B. thuringiensis Isolates Closely Related to Bacillus anthracis

Energy Technology Data Exchange (ETDEWEB)

Han, Cliff S.; Xie, Gary; Challacombe, Jean F.; Altherr, Michael R.; Smriti, B.; Bruce, David; Campbell, Connie S.; Campbell, Mary L.; Chen, Jin; Chertkov, Olga; Cleland, Cathy; Dimitrijevic-Bussod, M.; Doggett, Norman A.; Fawcett, John J.; Glavina, Tijana; Goodwin, Lynne A.; Hill, Karen K.; Hitchcock, Penny; Jackson, Paul J.; Keim, Paul; Kewalramani, Avinash Ramesh; Longmire, Jon; Lucas, Susan; Malfatti, Stephanie; McMurry, Kim; Meincke, Linda J.; Misra, Monica; Moseman, Bernice L.; Mundt, Mark; Munk, A. Christine; Okinaka, Richard T.; Parson-Quintana, B.; Reilly, Lee P.; Richardson, Paul; Robinson, Donna L.; Rubin, Eddy; Saunders, Elizabeth; Tapia, Roxanne; Tesmer, Judith G.; Thayer, Nina; Thompson, Linda S.; Tice, Hope; Ticknor, Lawrence O.; Wills, Patti L.; Gilna, Payl; Brettin, Thomas S.

2005-08-18

The sequencing and analysis of two close relatives of Bacillus anthracis are reported. AFLP analysis of over 300 isolates of B. cereus, B. thuringiensis and B. anthracis identified two isolates as being very closely related to B. anthracis. One, a B. cereus, BcE33L, was isolated from a zebra carcass in Namibia; the second, a B. thuringiensis, 97-27, was isolated from a necrotic human wound. The B. cereus appears to be the closest anthracis relative sequenced to date. A core genome of over 3,900 genes was compiled for the Bacillus cereus group, including B. anthracis. Comparative analysis of these two genomes with other members of the B. cereus group provides insight into the evolutionary relationships among these organisms. Evidence is presented that differential regulation modulates virulence, rather than simple acquisition of virulence factors. These genome sequences provide insight into the molecular mechanisms contributing to the host range and virulence of this group of organisms.
Closed Genome Sequence of Phytopathogen Biocontrol Agent Bacillus velezensis Strain AGVL-005, Isolated from Soybean.

Science.gov (United States)

Pylro, Victor Satler; Dias, Armando Cavalcante Franco; Andreote, Fernando Dini; Morais, Daniel Kumazawa; Varani, Alessandro de Mello; Andreote, Cristiane Cipolla Fasanella; Bernardo, Eduardo Roberto de Almeida; Zucchi, Tiago

2018-02-15

We report here the closed and near-complete genome sequence and annotation of Bacillus velezensis strain AGVL-005, a bacterium isolated from soybean seeds in Brazil and used for phytopathogen biocontrol. Copyright © 2018 Pylro et al.
An efficient binomial model-based measure for sequence comparison and its application.

Science.gov (United States)

Liu, Xiaoqing; Dai, Qi; Li, Lihua; He, Zerong

2011-04-01

Sequence comparison is one of the major tasks in bioinformatics, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations. There are several similarity/dissimilarity measures for sequence comparison, but challenges remain. This paper presents a binomial model-based measure to analyze biological sequences. With the help of a random indicator, the occurrence of a word at any position of a sequence can be regarded as a random Bernoulli variable, and the distribution of the sum of the word occurrences is well known to be binomial. By using a recursive formula, we computed the binomial probability of the word count and proposed a binomial model-based measure based on the relative entropy. The proposed measure was tested by extensive experiments including classification of HEV genotypes and phylogenetic analysis, and further compared with alignment-based and alignment-free measures. The results demonstrate that the proposed measure based on the binomial model is more efficient.
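The building blocks mentioned above can be sketched as follows: counting a word, estimating its per-position probability, and evaluating the binomial probabilities with the standard recurrence P(k+1) = P(k) · (n−k)/(k+1) · p/(1−p). The abstract does not spell out the final relative-entropy measure, so it is not reproduced here; the i.i.d. letter model in `word_probability` and all function names are assumptions made for illustration.

```python
from collections import Counter


def count_word(word, seq):
    """Number of (possibly overlapping) occurrences of `word` in `seq`."""
    return sum(seq.startswith(word, i)
               for i in range(len(seq) - len(word) + 1))


def word_probability(word, seq):
    """Per-position probability of `word`, assuming independent letters
    with frequencies estimated from `seq` (an illustrative assumption)."""
    freqs = Counter(seq)
    p = 1.0
    for ch in word:
        p *= freqs[ch] / len(seq)
    return p


def binomial_pmf(n, p):
    """Return [P(X=0), ..., P(X=n)] for X ~ Binomial(n, p), computed with
    the recurrence P(k+1) = P(k) * (n-k)/(k+1) * p/(1-p)."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    probs = [(1.0 - p) ** n]
    odds = p / (1.0 - p)
    for k in range(n):
        probs.append(probs[-1] * (n - k) / (k + 1) * odds)
    return probs


if __name__ == "__main__":
    seq, word = "ACGTACGTAACC", "AC"
    n_positions = len(seq) - len(word) + 1
    pmf = binomial_pmf(n_positions, word_probability(word, seq))
    print(count_word(word, seq), pmf[count_word(word, seq)])
```

The recurrence avoids computing large factorials directly, which is presumably why a recursive formula is attractive for long sequences.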
The complete genome sequences of poxviruses isolated from a penguin and a pigeon in South Africa and comparison to other sequenced avipoxviruses.

Science.gov (United States)

Offerman, Kristy; Carulei, Olivia; van der Walt, Anelda Philine; Douglass, Nicola; Williamson, Anna-Lise

2014-06-12

Two novel avipoxviruses from South Africa have been sequenced, one from a Feral Pigeon (Columba livia) (FeP2) and the other from an African penguin (Spheniscus demersus) (PEPV). We present a purpose-designed bioinformatics pipeline for analysis of next generation sequence data of avian poxviruses and compare the different avipoxviruses sequenced to date with specific emphasis on their evolution and gene content. The FeP2 (282 kbp) and PEPV (306 kbp) genomes encode 271 and 284 open reading frames respectively and are more closely related to one another (94.4%) than to either fowlpox virus (FWPV) (85.3% and 84.0% respectively) or Canarypox virus (CNPV) (62.0% and 63.4% respectively). Overall, FeP2, PEPV and FWPV have syntenic gene arrangements; however, major differences exist throughout their genomes. The most striking difference between FeP2 and the FWPV-like avipoxviruses is a large deletion of ~16 kbp from the central region of the genome of FeP2 deleting a cc-chemokine-like gene, two Variola virus B22R orthologues, an N1R/p28-like gene and a V-type Ig domain family gene. FeP2 and PEPV both encode orthologues of vaccinia virus C7L and Interleukin 10. PEPV contains a 77 amino acid long orthologue of Ubiquitin sharing 97% amino acid identity to human ubiquitin. The genome sequences of FeP2 and PEPV have greatly added to the limited repository of genomic information available for the Avipoxvirus genus. In the comparison of FeP2 and PEPV to existing sequences, FWPV and CNPV, we have established insights into African avipoxvirus evolution. Our data supports the independent evolution of these South African avipoxviruses from a common ancestral virus to FWPV and CNPV.
The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry).

Science.gov (United States)

Buti, Matteo; Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro; Sargent, Daniel James

2018-04-01

The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca. The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family.
Comparison of ompP5 sequence-based typing and pulsed-field gel ...

African Journals Online (AJOL)

This study compared outer membrane protein P5 gene (ompP5) sequence-based typing with pulsed-field gel electrophoresis (PFGE) for the genotyping of Haemophilus parasuis, using the 15 serovar reference strains and 43 isolates. When comparing the two methods, 31 ompP5 sequence types ...
Sequencing intractable DNA to close microbial genomes.

Directory of Open Access Journals (Sweden)

Richard A Hurt

Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Sequencing Intractable DNA to Close Microbial Genomes

Energy Technology Data Exchange (ETDEWEB)

Hurt, Jr., Richard Ashley [ORNL; Brown, Steven D [ORNL; Podar, Mircea [ORNL; Palumbo, Anthony Vito [ORNL; Elias, Dwayne A [ORNL

2012-01-01

Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled intractable resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such difficult regions in the non-contiguous finished Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. These developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Sequence comparison alignment-free approach based on suffix tree and L-words frequency.

Science.gov (United States)

Soares, Inês; Goios, Ana; Amorim, António

2012-01-01

The vast majority of methods available for sequence comparison rely on a first sequence alignment step, which requires a number of assumptions on evolutionary history and is sometimes very difficult or impossible to perform due to the abundance of gaps (insertions/deletions). In such cases, an alternative alignment-free method would prove valuable. Our method starts by a computation of a generalized suffix tree of all sequences, which is completed in linear time. Using this tree, the frequency of all possible words with a preset length L (L-words) in each sequence is rapidly calculated. Based on the L-words frequency profile of each sequence, a pairwise standard Euclidean distance is then computed producing a symmetric genetic distance matrix, which can be used to generate a neighbor joining dendrogram or a multidimensional scaling graph. We present an improvement to word counting alignment-free approaches for sequence comparison, by determining a single optimal word length and combining suffix tree structures to the word counting tasks. Our approach is, thus, a fast and simple application that proved to be efficient and powerful when applied to mitochondrial genomes. The algorithm was implemented in Python language and is freely available on the web.
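The L-word profile and distance matrix described above can be sketched in a few lines. Note the substitution: this sketch counts words with a plain sliding window and a `Counter` instead of the generalized suffix tree the authors use, so it shows the distance computation, not their linear-time data structure. The function names and the toy sequences are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def l_word_profile(seq, L, alphabet="ACGT"):
    """Relative frequency of every possible L-word, in a fixed word order.

    Windows containing letters outside `alphabet` (e.g. N) still count
    towards the total but contribute to no entry of the profile.
    """
    n_windows = max(len(seq) - L + 1, 1)
    counts = Counter(seq[i:i + L] for i in range(len(seq) - L + 1))
    return [counts["".join(w)] / n_windows
            for w in product(alphabet, repeat=L)]


def euclidean(u, v):
    """Standard Euclidean distance between two equal-length profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_matrix(seqs, L):
    """Symmetric matrix of pairwise distances between L-word profiles."""
    profiles = [l_word_profile(s, L) for s in seqs]
    return [[euclidean(p, q) for q in profiles] for p in profiles]


if __name__ == "__main__":
    toy = ["ACGTACGTAC", "ACGTTCGTAC", "GGGGCCCCGG"]
    for row in distance_matrix(toy, L=2):
        print(["%.3f" % d for d in row])
```

The resulting matrix could then be handed to any neighbor-joining or multidimensional-scaling routine, as the abstract describes.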
Beyond Linear Sequence Comparisons: The use of genome-levelcharacters for phylogenetic reconstruction

Energy Technology Data Exchange (ETDEWEB)

Boore, Jeffrey L.

2004-11-27

Although the phylogenetic relationships of many organisms have been convincingly resolved by the comparisons of nucleotide or amino acid sequences, others have remained equivocal despite great effort. Now that large-scale genome sequencing projects are sampling many lineages, it is becoming feasible to compare large data sets of genome-level features and to develop this as a tool for phylogenetic reconstruction that has advantages over conventional sequence comparisons. Although it is unlikely that these will address a large number of evolutionary branch points across the broad tree of life due to the infeasibility of such sampling, they have great potential for convincingly resolving many critical, contested relationships for which no other data seems promising. However, it is important that we recognize potential pitfalls, establish reasonable standards for acceptance, and employ rigorous methodology to guard against a return to earlier days of scenario-driven evolutionary reconstructions.
Protein sequence comparison and protein evolution

Energy Technology Data Exchange (ETDEWEB)

Pearson, W.R. [Univ. of Virginia, Charlottesville, VA (United States). Dept. of Biochemistry

1995-12-31

This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. This tutorial examines how the information conserved during the evolution of a protein molecule can be used to reliably infer homology, and thus a shared protein fold and possibly a shared active site or function. The authors start by reviewing a geological/evolutionary time scale. Next they look at the evolution of several protein families. During the tutorial, these families will be used to demonstrate that homologous protein ancestry can be inferred with confidence. They also examine different modes of protein evolution and consider some hypotheses that have been presented to explain the very earliest events in protein evolution. The next part of the tutorial will examine the technical aspects of protein sequence comparison. Both optimal and heuristic algorithms and their associated parameters that are used to characterize protein sequence similarities are discussed. Perhaps more importantly, they survey the statistics of local similarity scores, and how these statistics can be used both to improve the selectivity of a search and to evaluate the significance of a match. They then examine distantly related members of three protein families, the serine proteases, the glutathione transferases, and the G-protein-coupled receptors (GCRs). Finally, they discuss how sequence similarity can be used to examine internal repeated or mosaic structures in proteins.
Evaluation and comparison of closed-loop wash-water system

International Nuclear Information System (INIS)

Whitney, P.M.; Greer, C.R.

1991-01-01

Effluent from vehicle and equipment cleaning is known to contain a variety of potential pollutants, the most common being hydrocarbons and suspended solids. Proper treatment and discharge of this effluent is a growing concern as environmental awareness increases. In the United States, discharge of this effluent to municipal sewage treatment systems requires a permit from local authorities, discharge to surface waters requires a federal permit and, in most cases, discharge to the ground is prohibited. Furthermore, discharge to ground and surface waters can cause soil or groundwater contamination resulting in property devaluation, adverse impact on human health, fines from regulatory agencies, expensive cleanup and negative publicity. Effluent from vehicle washing typically does not meet the minimum pollutant levels allowed by regulatory agencies for discharge to surface waters or sewage treatment plants. Because of the liability associated with discharge to ground and surface waters and the difficulty in meeting municipal sewer discharge permit requirements, closed-loop wastewater treatment is an attractive alternative to discharge. Evaluation and comparison of systems from each category constitute the basis of this paper. Factors involved in selecting a system and available water-treatment technologies are discussed. The conclusion summarizes the results of the system comparison and makes recommendations for selecting and installing closed-loop water treatment systems for vehicle and equipment cleaning.
Sequence Comparison Alignment-Free Approach Based on Suffix Tree and L-Words Frequency

Directory of Open Access Journals (Sweden)

Inês Soares

2012-01-01

The vast majority of methods available for sequence comparison rely on a first sequence alignment step, which requires a number of assumptions on evolutionary history and is sometimes very difficult or impossible to perform due to the abundance of gaps (insertions/deletions). In such cases, an alternative alignment-free method would prove valuable. Our method starts by a computation of a generalized suffix tree of all sequences, which is completed in linear time. Using this tree, the frequency of all possible words with a preset length L (L-words) in each sequence is rapidly calculated. Based on the L-words frequency profile of each sequence, a pairwise standard Euclidean distance is then computed producing a symmetric genetic distance matrix, which can be used to generate a neighbor joining dendrogram or a multidimensional scaling graph. We present an improvement to word counting alignment-free approaches for sequence comparison, by determining a single optimal word length and combining suffix tree structures to the word counting tasks. Our approach is, thus, a fast and simple application that proved to be efficient and powerful when applied to mitochondrial genomes. The algorithm was implemented in Python language and is freely available on the web.
Alignment-free Transcriptomic and Metatranscriptomic Comparison Using Sequencing Signatures with Variable Length Markov Chains.

Science.gov (United States)

Liao, Weinan; Ren, Jie; Wang, Kun; Wang, Shun; Zeng, Feng; Wang, Ying; Sun, Fengzhu

2016-11-23

The comparison between microbial sequencing data is critical to understand the dynamics of microbial communities. The alignment-based tools analyzing metagenomic datasets require reference sequences and read alignments. The available alignment-free dissimilarity approaches model the background sequences with Fixed Order Markov Chain (FOMC) yielding promising results for the comparison of microbial communities. However, in FOMC, the number of parameters grows exponentially with the increase of the order of Markov Chain (MC). Under a fixed high order of MC, the parameters might not be accurately estimated owing to the limitation of sequencing depth. In our study, we investigate an alternative to FOMC to model background sequences with the data-driven Variable Length Markov Chain (VLMC) in metatranscriptomic data. The VLMC originally designed for long sequences was extended to apply to high-throughput sequencing reads and the strategies to estimate the corresponding parameters were developed. The flexible number of parameters in VLMC avoids estimating the vast number of parameters of high-order MC under limited sequencing depth. Different from the manual selection in FOMC, VLMC determines the MC order adaptively. Several beta diversity measures based on VLMC were applied to compare the bacterial RNA-Seq and metatranscriptomic datasets. Experiments show that VLMC outperforms FOMC to model the background sequences in transcriptomic and metatranscriptomic samples. A software pipeline is available at https://d2vlmc.codeplex.com.
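The exponential growth in parameters mentioned above can be made concrete. A fixed-order Markov chain of order k over an alphabet of size A has A^k contexts, each with A − 1 free transition probabilities. The short sketch below tabulates this for DNA; the function name is hypothetical and the numbers say nothing about the VLMC models themselves, whose size depends on the data.

```python
def fomc_free_parameters(order, alphabet_size=4):
    """Free transition probabilities of a fixed-order Markov chain:
    alphabet_size**order contexts, each with (alphabet_size - 1) free values.
    """
    if order < 0:
        raise ValueError("order must be non-negative")
    return (alphabet_size - 1) * alphabet_size ** order


if __name__ == "__main__":
    for k in range(0, 11, 2):
        print(f"order {k:2d}: {fomc_free_parameters(k):>9,d} parameters")
```

With limited sequencing depth, most of the contexts of a high-order chain are seen rarely or never, which is the estimation problem a variable-length model is meant to sidestep.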
Conserved PCR primer set designing for closely-related species to complete mitochondrial genome sequencing using a sliding window-based PSO algorithm.

Directory of Open Access Journals (Sweden)

Cheng-Hong Yang

BACKGROUND: Complete mitochondrial (mt) genome sequencing is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. For long template sequencing, such as the entire mtDNA, it is essential to design primers for Polymerase Chain Reaction (PCR) amplicons which are partly overlapping each other. The presented chromosome walking strategy provides the overlapping design to solve the problem for unreliable sequencing data at the 5' end and provides the effective sequencing. However, current algorithms and tools are mostly focused on the primer design for a local region in the genomic sequence. Accordingly, it is still challenging to provide the primer sets for the entire mtDNA. METHODOLOGY/PRINCIPAL FINDINGS: The purpose of this study is to develop an integrated primer design algorithm for the entire mt genome in general, and for the common primer sets for closely-related species in particular. We introduce ClustalW to generate the multiple sequence alignment needed to find the conserved sequences in closely-related species. These conserved sequences are suitable for designing the common primers for the entire mtDNA. Using a heuristic algorithm, particle swarm optimization (PSO), all the designed primers were computationally validated to fit the common primer design constraints, such as the melting temperature, primer length and GC content, PCR product length, secondary structure, specificity, and terminal limitation. The overlap requirement for PCR amplicons in the entire mtDNA is satisfied by defining the overlapping region with the sliding window technology. Finally, primer sets were designed within the overlapping region. The primer sets for the entire mtDNA sequences were successfully demonstrated in the example of two closely-related fish species. The pseudo code for the primer design algorithm is provided. CONCLUSIONS/SIGNIFICANCE: In conclusion, it can be said that our proposed sliding window-based PSO
BLAST and FASTA similarity searching for multiple sequence alignment.

Science.gov (United States)

Pearson, William R

2014-01-01

BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry (homology). The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies targeted for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.
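To show why expectation values depend on database size while percent identity does not, the sketch below uses the standard Karlin-Altschul relations: a raw score S is converted to a bit score S' = (λS − ln K)/ln 2, and the expected number of chance hits is E = m·n·2^(−S'). The function names are hypothetical, the λ and K values in the example are placeholders chosen for illustration, and the effective-length corrections applied by real programs are deliberately left out.

```python
import math


def bit_score(raw_score, lam, k):
    """Normalise a raw alignment score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, query_len, db_len):
    """Expected number of chance hits with at least this bit score,
    E = m * n * 2**(-S'), ignoring edge-effect length corrections."""
    return query_len * db_len * 2.0 ** (-bits)


if __name__ == "__main__":
    # Placeholder statistical parameters, not taken from any specific matrix.
    bits = bit_score(raw_score=120, lam=0.27, k=0.04)
    for db_len in (10**6, 10**8, 10**10):
        print(db_len, expect_value(bits, query_len=300, db_len=db_len))
```

The same alignment becomes 100-fold less significant each time the database grows 100-fold, which is consistent with the abstract's advice to search smaller, comprehensive databases when possible.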
The sequence of camelpox virus shows it is most closely related to variola virus, the cause of smallpox.

Science.gov (United States)

Gubser, Caroline; Smith, Geoffrey L

2002-04-01

Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.
Image ranking in video sequences using pairwise image comparisons and temporal smoothing

CSIR Research Space (South Africa)

Burke, Michael

2016-12-01

The ability to predict the importance of an image is highly desirable in computer vision. This work introduces an image ranking scheme suitable for use in video or image sequences. Pairwise image comparisons are used to determine image ‘interest...

Enzyme sequence similarity improves the reaction alignment method for cross-species pathway comparison

Energy Technology Data Exchange (ETDEWEB)

Ovacik, Meric A. [Chemical and Biochemical Engineering Department, Rutgers University, Piscataway, NJ 08854 (United States); Androulakis, Ioannis P., E-mail: yannis@rci.rutgers.edu [Chemical and Biochemical Engineering Department, Rutgers University, Piscataway, NJ 08854 (United States); Biomedical Engineering Department, Rutgers University, Piscataway, NJ 08854 (United States)

2013-09-15

Pathway-based information has become an important source of information for both establishing evolutionary relationships and understanding the mode of action of a chemical or pharmaceutical among species. Cross-species comparison of pathways can address two broad questions: comparison in order to inform evolutionary relationships and to extrapolate species differences used in a number of different applications including drug and toxicity testing. Cross-species comparison of metabolic pathways is complex as there are multiple features of a pathway that can be modeled and compared. Among the various methods that have been proposed, reaction alignment has emerged as the most successful at predicting phylogenetic relationships based on NCBI taxonomy. We propose an improvement of the reaction alignment method that also accounts for enzyme sequence similarity. Using nine species, including human and some model organisms and test species, we evaluate the standard and improved comparison methods by analyzing glycolysis and citrate cycle pathways conservation. In addition, we demonstrate how organism comparison can be conducted by accounting for the cumulative information retrieved from nine pathways in central metabolism as well as a more complete study involving 36 pathways common in all nine species. Our results indicate that reaction alignment with enzyme sequence similarity results in a more accurate representation of pathway specific cross-species similarities and differences based on NCBI taxonomy.
Enzyme sequence similarity improves the reaction alignment method for cross-species pathway comparison

International Nuclear Information System (INIS)

Ovacik, Meric A.; Androulakis, Ioannis P.

2013-01-01

Pathway-based information has become an important source of information for both establishing evolutionary relationships and understanding the mode of action of a chemical or pharmaceutical among species. Cross-species comparison of pathways can address two broad questions: comparison in order to inform evolutionary relationships and to extrapolate species differences used in a number of different applications including drug and toxicity testing. Cross-species comparison of metabolic pathways is complex as there are multiple features of a pathway that can be modeled and compared. Among the various methods that have been proposed, reaction alignment has emerged as the most successful at predicting phylogenetic relationships based on NCBI taxonomy. We propose an improvement of the reaction alignment method that also accounts for enzyme sequence similarity. Using nine species, including human and some model organisms and test species, we evaluate the standard and improved comparison methods by analyzing glycolysis and citrate cycle pathways conservation. In addition, we demonstrate how organism comparison can be conducted by accounting for the cumulative information retrieved from nine pathways in central metabolism as well as a more complete study involving 36 pathways common in all nine species. Our results indicate that reaction alignment with enzyme sequence similarity results in a more accurate representation of pathway specific cross-species similarities and differences based on NCBI taxonomy.
Markov model plus k-word distributions: a synergy that produces novel statistical measures for sequence comparison.

Science.gov (United States)

Dai, Qi; Yang, Yanchun; Wang, Tianming

2008-10-15

Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information on the k-word distributions, a Markov model, or both. Motivated by adding k-word distributions to the Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers a systematic and quantitative experimental assessment of our measures. Moreover, we compared our results with those of alignment-based and alignment-free measures. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating characteristic) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences from a database and discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measures perform in phylogenetic analysis. The experimental assessment demonstrates that our similarity measures, which incorporate k-word distributions into the Markov model, are more efficient.
Cluster based on sequence comparison of homologous proteins of 95 organism species - Gclust Server | LSDB Archive [Life Science Database Archive metadata]

Lifescience Database Archive (English)

Comparison of DNA Quantification Methods for Next Generation Sequencing.

Science.gov (United States)

Robin, Jérôme D; Ludlow, Andrew T; LaRanger, Ryan; Wright, Woodring E; Shay, Jerry W

2016-04-06

Next Generation Sequencing (NGS) is a powerful tool that depends on loading a precise amount of DNA onto a flowcell. NGS strategies have expanded our ability to investigate genomic phenomena by referencing mutations in cancer and diseases through large-scale genotyping, developing methods to map rare chromatin interactions (4C; 5C and Hi-C) and identifying chromatin features associated with regulatory elements (ChIP-seq, Bis-Seq, ChiA-PET). While many methods are available for DNA library quantification, there is no unambiguous gold standard. Most techniques use PCR to amplify DNA libraries to obtain sufficient quantities for optical density measurement. However, increased PCR cycles can distort the library's heterogeneity and prevent the detection of rare variants. In this analysis, we compared new digital PCR technologies (droplet digital PCR; ddPCR, ddPCR-Tail) with standard methods for the titration of NGS libraries. DdPCR-Tail is comparable to qPCR and fluorometry (QuBit) and allows sensitive quantification by analysis of barcode repartition after sequencing of multiplexed samples. This study provides a direct comparison between quantification methods throughout a complete sequencing experiment and provides the impetus to use ddPCR-based quantification for improvement of NGS quality.
SeqVISTA: a graphical tool for sequence feature visualization and comparison

Directory of Open Access Journals (Sweden)

Niu Tianhua

2003-01-01

Background: Many readers will sympathize with the following story. You are viewing a gene sequence in Entrez, and you want to find whether it contains a particular sequence motif. You reach for the browser's "find in page" button, but those darn spaces every 10 bp get in the way. And what if the motif is on the opposite strand? Subsequently, your favorite sequence analysis software informs you that there is an interesting feature at position 13982–14013. By painstakingly counting the 10 bp blocks, you are able to examine the sequence at this location. But now you want to see what other features have been annotated close by, and this information is buried several screenfuls higher up the web page. Results: SeqVISTA presents a holistic, graphical view of features annotated on nucleotide or protein sequences. This interactive tool highlights the residues in the sequence that correspond to features chosen by the user, and allows easy searching for sequence motifs or extraction of particular subsequences. SeqVISTA is able to display results from diverse sequence analysis tools in an integrated fashion, and aims to provide much-needed unity to the bioinformatics resources scattered around the Internet. Our viewer may be launched on a GenBank record by a single click of a button installed in the web browser. Conclusion: SeqVISTA allows insights to be gained by viewing the totality of sequence annotations and predictions, which may be more revealing than the sum of their parts. SeqVISTA runs on any operating system with a Java 1.4 virtual machine. It is freely available to academic users at http://zlab.bu.edu/SeqVISTA.
Whole Genome Sequencing Shows a Low Proportion of Tuberculosis Disease Is Attributable to Known Close Contacts in Rural Malawi.

Directory of Open Access Journals (Sweden)

Judith R Glynn

The proportion of tuberculosis attributable to transmission from close contacts is not well known. Comparison of the genomes of strains from index patients and prior contacts allows transmission to be confirmed or excluded. In Karonga District, Malawi, all tuberculosis patients are asked about prior contact with others with tuberculosis. All available strains from culture-positive patients were sequenced. Up to 10 single nucleotide polymorphisms between index patients and their prior contacts were allowed for confirmation, and ≥ 100 for exclusion. The population attributable fraction was estimated from the proportion of confirmed transmissions and the proportion of patients with contacts. From 1997-2010 there were 1907 new culture-confirmed tuberculosis patients, of whom 32% reported at least one family contact and an additional 11% had at least one other contact; 60% of contacts had smear-positive disease. Among case-contact pairs with sequences available, transmission was confirmed from 38% (62/163) of smear-positive prior contacts and 0/17 smear-negative prior contacts. Confirmed transmission was more common in those related to the prior contact (42.4%, 56/132) than in non-relatives (19.4%, 6/31; p = 0.02), and in those with more intense contact, to younger index cases, and in more recent years. The proportion of tuberculosis attributable to known contacts was estimated to be 9.4% overall. In this population known contacts only explained a small proportion of tuberculosis cases. Even those with a prior family contact with smear-positive tuberculosis were more likely to have acquired their infection elsewhere.
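The confirmation and exclusion thresholds quoted in this abstract amount to a simple decision rule on the SNP distance between an index strain and a prior contact's strain. The sketch below illustrates that rule under stated assumptions: the function names are hypothetical, the SNP count is taken over pre-aligned, equal-length sequences while skipping ambiguous positions, and the label "indeterminate" for distances between the two thresholds is an assumption, since the abstract does not say how such pairs were handled. It is not the authors' pipeline.

```python
def snp_distance(a, b):
    """Count differing positions between two aligned, equal-length sequences.

    Positions where either sequence is not A, C, G or T (for example N or a
    gap) are skipped; this handling of ambiguity is an assumption.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    return sum(
        1
        for x, y in zip(a.upper(), b.upper())
        if x in valid and y in valid and x != y
    )


def classify_pair(snps, confirm_max=10, exclude_min=100):
    """Apply the thresholds from the abstract: <= 10 SNPs confirms
    transmission, >= 100 SNPs excludes it. The middle label is hypothetical."""
    if snps < 0:
        raise ValueError("SNP distance cannot be negative")
    if snps <= confirm_max:
        return "confirmed"
    if snps >= exclude_min:
        return "excluded"
    return "indeterminate"
```

For example, a pair differing at 4 positions would be classified as confirmed transmission, and a pair differing at 250 positions as excluded.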
Definition and Analysis of a System for the Automated Comparison of Curriculum Sequencing Algorithms in Adaptive Distance Learning

Science.gov (United States)

Limongelli, Carla; Sciarrone, Filippo; Temperini, Marco; Vaste, Giulia

2011-01-01

LS-Lab provides automatic support to comparison/evaluation of the Learning Object Sequences produced by different Curriculum Sequencing Algorithms. Through this framework a teacher can verify the correspondence between the behaviour of different sequencing algorithms and her pedagogical preferences. In fact the teacher can compare algorithms…
Comparison of single-shot fast spin-echo sequence and T2-weighted fast spin-echo sequence in MR imaging of the brain

International Nuclear Information System (INIS)

Cha, Sung Ho; Seo, Jeong Jin; Jeong, Gwang Woo; Kim, Jae Kyu; Kim, Yun Hyeon; Jeong, Yong Yeon; Kang, Heoung Keun; Oh, Hee Yeon; Yoon, Jong Hoon

1998-01-01

The purpose of this study was to evaluate the usefulness of the single-shot fast spin-echo (SS-FSE) sequence in comparison with the T2-weighted fast spin-echo (T2-FSE) sequence in brain MR imaging. In 41 patients aged 15-75 years with intracranial lesion, both SS-FSE and T2-FSE images were obtained using a 1.5-T MR system. Lesions included cerebral ischemia or infarcts (n=23), tumors (n=10), hemorrhages (n=3), inflammatory diseases (n=2), arachnoid cysts (n=2), and vascular disease (n=1), and the MR images were retrospectively evaluated. To calculate contrast-to-noise ratio (CNR), percentage contrast, and signal-to-noise ratio (SNR), and thus make a quantitative comparison, the mean signal intensities of lesions, normal brain tissue, and noise outside the patient were measured. For qualitative comparison, the visibility, margin, and extent of the lesions were rated using a five-grade system, and the degree of MR artifacts was also evaluated. Wilcoxon's signed ranks test was used for statistical analysis. The mean CNR of lesions was significantly higher on SS-FSE (31.3) than on T2-FSE images (27.5) (p=0.0131). Mean percentage contrast was also higher on SS-FSE (159.0) than on T2-FSE images (108.5) (p=0.0222), but mean SNR was higher on T2-FSE (80.3) than on SS-FSE images (53.5) (p=0.0000). No significant differences in lesion visibility were observed between the two imaging sequences, though margin and extent of the lesion were worse on SS-FSE images. For MR artifacts, no significant differences were demonstrated. For the evaluation of most intracranial lesions, MR imaging using the SS-FSE sequence appears to be slightly inferior to the T2-FSE sequence, but may be useful where patients are ill or uncooperative, or where children require sedation.
Wakefield excitation in plasma resonator by a sequence of relativistic electron bunches

International Nuclear Information System (INIS)

Kiselev, V.A.; Linnik, A.F.; Mirny, V.I.; Onishchenko, I.N.; Uskov, V.V.

2008-01-01

Wakefield excitation in a plasma resonator by a sequence of relativistic electron bunches is investigated experimentally, with the aim of increasing the excited field amplitude relative to the waveguide case. A sequence of short electron bunches is produced by a linear resonant accelerator. The plasma resonator is formed by the beam-plasma discharge in a rectangular metal waveguide filled with gas, closed by a metal foil at the entrance and a movable short-circuited plunger at the exit. Measurements show a considerably higher wakefield amplitude for the resonator case.
Harnessing Whole Genome Sequencing in Medical Mycology.

Science.gov (United States)

Cuomo, Christina A

2017-01-01

Comparative genome sequencing studies of human fungal pathogens enable identification of genes and variants associated with virulence and drug resistance. This review describes current approaches, resources, and advances in applying whole genome sequencing to study clinically important fungal pathogens. Genomes for some important fungal pathogens were only recently assembled, revealing gene family expansions in many species and extreme gene loss in one obligate species. The scale and scope of species sequenced is rapidly expanding, leveraging technological advances to assemble and annotate genomes with higher precision. By using iteratively improved reference assemblies or those generated de novo for new species, recent studies have compared the sequence of isolates representing populations or clinical cohorts. Whole genome approaches provide the resolution necessary for comparison of closely related isolates, for example, in the analysis of outbreaks or sampled across time within a single host. Genomic analysis of fungal pathogens has enabled both basic research and diagnostic studies. The increased scale of sequencing can be applied across populations, and new metagenomic methods allow direct analysis of complex samples.
3D representations of amino acids—applications to protein sequence comparison and classification

Directory of Open Access Journals (Sweden)

Jie Li

2014-08-01

The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the representation of such a protein sequence facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D) space derived from amino acid substitution matrices provides an adequate representation that can be used for direct comparison of protein sequences based on geometry. We measure the performance of such a representation in the context of the protein structural fold prediction problem. We compare the results of classifying different sets of proteins belonging to distinct structural folds against classifications of the same proteins obtained from sequence alone or directly from structural information. We find that sequence alone performs poorly as a structure classifier. We show in contrast that the use of the three-dimensional representation of the sequences significantly improves the classification accuracy. We conclude with a discussion of the current limitations of such a representation and with a description of potential improvements.
Reconsidering the generation time hypothesis based on nuclear ribosomal ITS sequence comparisons in annual and perennial angiosperms

Directory of Open Access Journals (Sweden)

Fiz-Palacios Omar

2008-12-01

Background: Differences in plant annual/perennial habit are hypothesized to cause a generation time effect on divergence rates. Previous studies that compared rates of divergence for internal transcribed spacer (ITS1 and ITS2) sequences of nuclear ribosomal DNA (nrDNA) in angiosperms have reached contradictory conclusions about whether differences in generation times (or other life history features) are associated with divergence rate heterogeneity. We compared annual/perennial ITS divergence rates using published sequence data, employing sampling criteria to control for possible artifacts that might obscure any actual rate variation caused by annual/perennial differences. Results: Relative rate tests employing ITS sequences from 16 phylogenetically-independent annual/perennial species pairs rejected rate homogeneity in only a few comparisons, with annuals more frequently exhibiting faster substitution rates. Treating branch length differences categorically (annual faster or perennial faster regardless of magnitude) with a sign test often indicated an excess of annuals with faster substitution rates. Annuals showed an approximately 1.6-fold rate acceleration in nucleotide substitution models for ITS. Relative rates of three nuclear loci and two chloroplast regions for the annual Arabidopsis thaliana compared with two closely related Arabidopsis perennials indicated that divergence was faster for the annual. In contrast, A. thaliana ITS divergence rates were sometimes faster and sometimes slower than the perennial. In simulations, divergence rate differences of at least 3.5-fold were required to reject rate constancy in > 80% of replicates using a nucleotide substitution model observed for the combination of ITS1 and ITS2. Simulations also showed that categorical treatment of branch length differences detected rate heterogeneity > 80% of the time with a 1.5-fold or greater rate difference. Conclusion: Although rate homogeneity was not rejected
Complete cDNA sequence of human complement C1s and close physical linkage of the homologous genes C1s and C1r

International Nuclear Information System (INIS)

Tosi, M.; Duponchel, C.; Meo, T.; Julier, C.

1987-01-01

Overlapping molecular clones encoding the complement subcomponent C1s were isolated from a human liver cDNA library. The nucleotide sequence reconstructed from these clones spans about 85% of the length of the liver C1s messenger RNAs, which occur in three distinct size classes around 3 kilobases in length. Comparisons with the sequence of C1r, the other enzymatic subcomponent of C1, reveal 40% amino acid identity and conservation of all the cysteine residues. Besides the serine protease domain, the following sequence motifs, previously described in C1r, were also found in C1s: (a) two repeats of the type found in the Ba fragment of complement factor B and in several other complement but also noncomplement proteins, (b) a cysteine-rich segment homologous to the repeats of epidermal growth factor precursor, and (c) a duplicated segment found only in C1r and C1s. Differences in each of these structural motifs provide significant clues for the interpretation of the functional divergence of these interacting serine protease zymogens. Hybridizations of C1r and C1s probes to restriction endonuclease fragments of genomic DNA demonstrate close physical linkage of the corresponding genes. The implications of this finding are discussed with respect to the evolution of C1r and C1s after their origin by tandem gene duplication and to the previously observed combined hereditary deficiencies of C1r and C1s.
Whole-genome in-silico subtractive hybridization (WISH) - using massive sequencing for the identification of unique and repetitive sex-specific sequences: the example of Schistosoma mansoni

Directory of Open Access Journals (Sweden)

Parrinello Hugues

2010-06-01

Background: Emerging methods of massive sequencing that allow for rapid re-sequencing of entire genomes at comparably low cost are changing the way biological questions are addressed in many domains. Here we propose a novel method to compare two genomes (genome-to-genome comparison). We used this method to identify sex-specific sequences of the human blood fluke Schistosoma mansoni. Results: Genomic DNA was extracted from male and female (heterogametic) S. mansoni adults and sequenced with a Genome Analyzer (Illumina). Sequences are available at the NCBI sequence read archive http://www.ncbi.nlm.nih.gov/Traces/sra/ under study accession number SRA012151.6. Sequencing reads were aligned to the genome and to a pseudogenome composed of known repeats. Straightforward comparative bioinformatics analysis was performed to compare male and female schistosome genomes and identify female-specific sequences. We found that the S. mansoni female W chromosome contains only a few specific unique sequences (950 Kb, i.e. about 0.2% of the genome). The majority of W-specific sequences are repeats (10.5 Mb, i.e. about 2.5% of the genome). Arbitrarily selected W-specific sequences were confirmed by PCR. Primers designed for unique and repetitive sequences allowed reliable identification of the sex of both larval and adult stages of the parasite. Conclusion: Our genome-to-genome comparison method, which we call "whole-genome in-silico subtractive hybridization" (WISH), allows for rapid identification of sequences that are specific for a certain genotype (e.g. the heterogametic sex). It can in principle be used for the detection of any sequence differences between isolates (e.g. strains, pathovars) or even closely related species.
Comparative Sequence Analysis of Multidrug-Resistant IncA/C Plasmids from Salmonella enterica.

Science.gov (United States)

Hoffmann, Maria; Pettengill, James B; Gonzalez-Escalona, Narjol; Miller, John; Ayers, Sherry L; Zhao, Shaohua; Allard, Marc W; McDermott, Patrick F; Brown, Eric W; Monday, Steven R

2017-01-01

Determinants of multidrug resistance (MDR) are often encoded on mobile elements, such as plasmids, transposons, and integrons, which have the potential to transfer among foodborne pathogens, as well as to other virulent pathogens, increasing the threats these traits pose to human and veterinary health. Our understanding of MDR among Salmonella has been limited by the lack of closed plasmid genomes for comparisons across resistance phenotypes, due to difficulties in effectively separating the DNA of these high-molecular weight, low-copy-number plasmids from chromosomal DNA. To resolve this problem, we demonstrate an efficient protocol for isolating, sequencing and closing IncA/C plasmids from Salmonella sp. using single molecule real-time sequencing on a Pacific Biosciences (Pacbio) RS II Sequencer. We obtained six Salmonella enterica isolates from poultry, representing six different serovars, each exhibiting the MDR-Ampc resistance profile. Salmonella plasmids were obtained using a modified mini preparation and transformed with Escherichia coli DH10Br. A Qiagen Large-Construct kit™ was used to recover highly concentrated and purified plasmid DNA that was sequenced using PacBio technology. These six closed IncA/C plasmids ranged in size from 104 to 191 kb and shared a stable, conserved backbone containing 98 core genes, with only six differences among those core genes. The plasmids encoded a number of antimicrobial resistance genes, including those for quaternary ammonium compounds and mercury. We then compared our six IncA/C plasmid sequences: first with 14 IncA/C plasmids derived from S. enterica available at the National Center for Biotechnology Information (NCBI), and then with an additional 38 IncA/C plasmids derived from different taxa. These comparisons allowed us to build an evolutionary picture of how antimicrobial resistance may be mediated by this common plasmid backbone. Our project provides detailed genetic information about resistance genes in
Comparative Sequence Analysis of Multidrug-Resistant IncA/C Plasmids from Salmonella enterica

Directory of Open Access Journals (Sweden)

Maria Hoffmann

2017-08-01

Determinants of multidrug resistance (MDR) are often encoded on mobile elements, such as plasmids, transposons, and integrons, which have the potential to transfer among foodborne pathogens, as well as to other virulent pathogens, increasing the threats these traits pose to human and veterinary health. Our understanding of MDR among Salmonella has been limited by the lack of closed plasmid genomes for comparisons across resistance phenotypes, due to difficulties in effectively separating the DNA of these high-molecular weight, low-copy-number plasmids from chromosomal DNA. To resolve this problem, we demonstrate an efficient protocol for isolating, sequencing and closing IncA/C plasmids from Salmonella sp. using single molecule real-time sequencing on a Pacific Biosciences (Pacbio) RS II Sequencer. We obtained six Salmonella enterica isolates from poultry, representing six different serovars, each exhibiting the MDR-Ampc resistance profile. Salmonella plasmids were obtained using a modified mini preparation and transformed with Escherichia coli DH10Br. A Qiagen Large-Construct kit™ was used to recover highly concentrated and purified plasmid DNA that was sequenced using PacBio technology. These six closed IncA/C plasmids ranged in size from 104 to 191 kb and shared a stable, conserved backbone containing 98 core genes, with only six differences among those core genes. The plasmids encoded a number of antimicrobial resistance genes, including those for quaternary ammonium compounds and mercury. We then compared our six IncA/C plasmid sequences: first with 14 IncA/C plasmids derived from S. enterica available at the National Center for Biotechnology Information (NCBI), and then with an additional 38 IncA/C plasmids derived from different taxa. These comparisons allowed us to build an evolutionary picture of how antimicrobial resistance may be mediated by this common plasmid backbone. Our project provides detailed genetic information about
cDNA cloning and nucleotide sequence comparison of Chinese hamster metallothionein I and II mRNAs

Energy Technology Data Exchange (ETDEWEB)

Griffith, B B; Walters, R A; Enger, M D; Hildebrand, C E; Griffith, J K

1983-01-01

Polyadenylated RNA was extracted from a cadmium resistant Chinese hamster (CHO) cell line, enriched for metal-induced, abundant RNA sequences and cloned as double-stranded cDNA in the plasmid pBR322. Two cDNA clones, pCHMT1 and pCHMT2, encoding two Chinese hamster isometallothioneins were identified, and the nucleotide sequence of each insert was determined. The two Chinese hamster metallothioneins show nucleotide sequence homologies of 80% in the protein coding region and approximately 35% in both the 5' and 3' untranslated regions. Interestingly, an 8 nucleotide sequence (TGTAAATA) has been conserved in sequence and position in the 3' untranslated regions of each metallothionein mRNA sequenced thus far. Estimated nucleotide substitution rates derived from interspecies comparisons were used to calculate a metallothionein gene duplication time of 45 to 120 million years ago. 39 references, 1 figure, 1 table.
Comparison of Enzymes / Non-Enzymes Proteins Classification Models Based on 3D, Composition, Sequences and Topological Indices

OpenAIRE

Munteanu, Cristian Robert

2014-01-01

Comparison of Enzymes / Non-Enzymes Proteins Classification Models Based on 3D, Composition, Sequences and Topological Indices, German Conference on Bioinformatics (GCB), Potsdam, Germany (September, 2007)
Close relationship of Plasmodium sequences detected from South American pampas deer (Ozotoceros bezoarticus) to Plasmodium spp. in North American white-tailed deer

Directory of Open Access Journals (Sweden)

Masahito Asada

2018-04-01

We report, for the first time, the presence of ungulate malaria parasites in South America. We conducted PCR-based surveys of blood samples of multiple deer species and water buffalo from Brazil and detected Plasmodium sequences from pampas deer (Ozotoceros bezoarticus) samples. Phylogenetic analysis revealed that the obtained sequences are closely related to the Plasmodium odocoilei clade 2 sequence from North American white-tailed deer (Odocoileus virginianus). Nucleotide differences suggest that malaria parasites in South American pampas deer and North American P. odocoilei clade 2 branched more recently than the Great American Interchange. Keywords: Malaria, Pampas deer, South America, Plasmodium odocoilei, Brazil

Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution

OpenAIRE

Filée, Jonathan

2015-01-01

Genome gigantism has so far been observed in Phycodnaviridae and Mimiviridae (order Megavirales). The origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, the availability of a collection of closely related GV genomes enabling genomic comparisons offers the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignments for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no tr...
Statistical method to compare massive parallel sequencing pipelines.

Science.gov (United States)

Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P

2017-03-01

Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
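The sensitivity and specificity figures reported against the gold standard follow from a per-position confusion table. The sketch below shows that arithmetic only, under stated assumptions: variants are represented as hashable position identifiers, the set of assessed positions is known, and all function names are hypothetical. It does not reproduce the log-linear or mixed models described in the abstract.

```python
def confusion_counts(called, truth, positions):
    """Return (tp, fp, fn, tn) for a pipeline's calls against a gold standard.

    called, truth: sets of variant positions reported by the pipeline and by
    the gold standard; both are assumed to be subsets of positions.
    positions: set of all positions that were assessed.
    """
    tp = len(called & truth)              # variant in both
    fp = len(called - truth)              # called, not in gold standard
    fn = len(truth - called)              # in gold standard, missed
    tn = len(positions - called - truth)  # variant in neither
    return tp, fp, fn, tn


def sensitivity(tp, fn):
    """Fraction of gold-standard variants that the pipeline recovered."""
    if tp + fn == 0:
        raise ValueError("no gold-standard variants")
    return tp / (tp + fn)


def specificity(tn, fp):
    """Fraction of non-variant positions that the pipeline left uncalled."""
    if tn + fp == 0:
        raise ValueError("no non-variant positions")
    return tn / (tn + fp)
```

With ten assessed positions, a gold standard of {1, 2, 3} and pipeline calls of {2, 3, 4}, the counts are 2 true positives, 1 false positive, 1 false negative and 6 true negatives, giving a sensitivity of 2/3 and a specificity of 6/7.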
Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment.

Science.gov (United States)

Baichoo, Shakuntala; Ouzounis, Christos A

A multitude of algorithms for sequence comparison, short-read assembly and whole-genome alignment have been developed in the general context of molecular biology, to support technology development for high-throughput sequencing, numerous applications in genome biology and fundamental research on comparative genomics. The computational complexity of these algorithms has been previously reported in original research papers, yet this often neglected property has not been reviewed previously in a systematic manner and for a wider audience. We provide a review of space and time complexity of key sequence analysis algorithms and highlight their properties in a comprehensive manner, in order to identify potential opportunities for further research in algorithm or data structure optimization. The complexity aspect is poised to become pivotal as we will be facing challenges related to the continuous increase of genomic data on unprecedented scales and complexity in the foreseeable future, when robust biological simulation at the cell level and above becomes a reality. Copyright © 2017 Elsevier B.V. All rights reserved.
In Silico Genome Comparison and Distribution Analysis of Simple Sequences Repeats in Cassava

Directory of Open Access Journals (Sweden)

Andrea Vásquez

2014-01-01

We conducted an SSR density analysis in different cassava genomic regions. The information obtained was useful to establish comparisons between cassava’s SSR genomic distribution and those of poplar, flax, and Jatropha. In general, cassava has a low SSR density (~50 SSRs/Mbp) and a high proportion of pentanucleotides (24.2 SSRs/Mbp). It was found that coding sequences have 15.5 SSRs/Mbp, introns have 82.3 SSRs/Mbp, 5′ UTRs have 196.1 SSRs/Mbp, and 3′ UTRs have 50.5 SSRs/Mbp. Through motif analysis of cassava’s genome SSRs, the most abundant motif was AT/AT, while in intron sequences and UTR regions it was AG/CT. In addition, in coding sequences the motif AAG/CTT was also found to occur most frequently; in fact, it is the third most used codon in cassava. Sequences containing SSRs were classified according to their functional annotation of Gene Ontology categories. The SSRs identified here may be a valuable addition for genetic mapping and future studies in phylogenetic analyses and genomic evolution.
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons

Directory of Open Access Journals (Sweden)

Beatson Scott A

2011-08-01

Background: Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. Results: BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons.

Science.gov (United States)

Alikhan, Nabil-Fareed; Petty, Nicola K; Ben Zakour, Nouri L; Beatson, Scott A

2011-08-08

Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons automatically. There is a clear need for a user
The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea.

Science.gov (United States)

Zhao, Fumei; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Kim, Sang-Min; Kwak, Do Yeon; Kim, Sun Lim; Lee, Bong Choon; Moon, Jae Sun

2016-07-01

The complete nucleotide sequence of a new barley polerovirus, tentatively named barley virus G (BVG), which was isolated in Gimje, South Korea, has been determined using an RNA sequencing technique combined with polymerase chain reaction methods. The viral genomic RNA of BVG is 5,620 nucleotides long and contains six typical open reading frames commonly observed in other poleroviruses. Sequence comparisons revealed that BVG is most closely related to maize yellow dwarf virus-RMV, with the highest amino acid identities being less than 90 % for all of the corresponding proteins. These results suggested that BVG is a member of a new species in the genus Polerovirus.
Comparative sequence analysis of Sordaria macrospora and Neurospora crassa as a means to improve genome annotation.

Science.gov (United States)

Nowrousian, Minou; Würtz, Christian; Pöggeler, Stefanie; Kück, Ulrich

2004-03-01

One of the most challenging parts of large scale sequencing projects is the identification of functional elements encoded in a genome. Recently, studies of genomes of up to six different Saccharomyces species have demonstrated that a comparative analysis of genome sequences from closely related species is a powerful approach to identify open reading frames and other functional regions within genomes [Science 301 (2003) 71, Nature 423 (2003) 241]. Here, we present a comparison of selected sequences from Sordaria macrospora to their corresponding Neurospora crassa orthologous regions. Our analysis indicates that due to the high degree of sequence similarity and conservation of overall genomic organization, S. macrospora sequence information can be used to simplify the annotation of the N. crassa genome.
Deep-sequencing protocols influence the results obtained in small-RNA sequencing.

Directory of Open Access Journals (Sweden)

Joern Toedling

Second-generation sequencing is a powerful method for identifying and quantifying small-RNA components of cells. However, little attention has been paid to the effects of the choice of sequencing platform and library preparation protocol on the results obtained. We present a thorough comparison of small-RNA sequencing libraries generated from the same embryonic stem cell lines, using different sequencing platforms, which represent the three major second-generation sequencing technologies, and protocols. We have analysed and compared the expression of microRNAs, as well as populations of small RNAs derived from repetitive elements. Despite the fact that different libraries display a good correlation between sequencing platforms, qualitative and quantitative variations in the results were found, depending on the protocol used. Thus, when comparing libraries from different biological samples, it is strongly recommended to use the same sequencing platform and protocol in order to ensure the biological relevance of the comparisons.
Sequencing and De Novo Transcriptome Assembly of Brachypodium sylvaticum (Poaceae)

Directory of Open Access Journals (Sweden)

Samuel E. Fox

2013-03-01

Premise of the study: We report the de novo assembly and characterization of the transcriptomes of Brachypodium sylvaticum (slender false-brome) accessions from native populations of Spain and Greece, and an invasive population west of Corvallis, Oregon, USA. Methods and Results: More than 350 million sequence reads from the mRNA libraries prepared from three B. sylvaticum genotypes were assembled into 120,091 (Corvallis), 104,950 (Spain), and 177,682 (Greece) transcript contigs. In comparison with the B. distachyon Bd21 reference genome and GenBank protein sequences, we estimate >90% exome coverage for B. sylvaticum. The transcripts were assigned Gene Ontology and InterPro annotations. Brachypodium sylvaticum sequence reads aligned against the Bd21 genome revealed 394,654 single-nucleotide polymorphisms (SNPs) and >20,000 simple sequence repeat (SSR) DNA sites. Conclusions: To our knowledge, this is the first report of transcriptome sequencing of an invasive plant species with a closely related sequenced reference genome. The sequences and identified SNP variant and SSR sites will provide tools for developing novel genetic markers for use in genotyping and characterization of invasive behavior of B. sylvaticum.
Genome Sequence of Bivens Arm Virus, a Tibrovirus Belonging to the Species Tibrogargan virus (Mononegavirales: Rhabdoviridae).

Science.gov (United States)

Lauck, Michael; Yú, Shu Qìng; Caì, Yíngyún; Hensley, Lisa E; Chiu, Charles Y; O'Connor, David H; Kuhn, Jens H

2015-03-19

The new rhabdoviral genus Tibrovirus currently has two members, Coastal Plains virus and Tibrogargan virus. Here, we report the coding-complete genome sequence of a putative member of this genus, Bivens Arm virus. A genomic comparison reveals Bivens Arm virus to be closely related to, but distinct from, Tibrogargan virus. Copyright © 2015 Lauck et al.
Boosting antibody developability through rational sequence optimization.

Science.gov (United States)

Seeliger, Daniel; Schulz, Patrick; Litzenburger, Tobias; Spitz, Julia; Hoerer, Stefan; Blech, Michaela; Enenkel, Barbara; Studts, Joey M; Garidel, Patrick; Karow, Anne R

2015-01-01

The application of monoclonal antibodies as commercial therapeutics poses substantial demands on stability and properties of an antibody. Therapeutic molecules that exhibit favorable properties increase the success rate in development. However, it is not yet fully understood how the protein sequences of an antibody translates into favorable in vitro molecule properties. In this work, computational design strategies based on heuristic sequence analysis were used to systematically modify an antibody that exhibited a tendency to precipitation in vitro. The resulting series of closely related antibodies showed improved stability as assessed by biophysical methods and long-term stability experiments. As a notable observation, expression levels also improved in comparison with the wild-type candidate. The methods employed to optimize the protein sequences, as well as the biophysical data used to determine the effect on stability under conditions commonly used in the formulation of therapeutic proteins, are described. Together, the experimental and computational data led to consistent conclusions regarding the effect of the introduced mutations. Our approach exemplifies how computational methods can be used to guide antibody optimization for increased stability.
Yeast genome sequencing:

DEFF Research Database (Denmark)

Piskur, Jure; Langkjær, Rikke Breinhold

2004-01-01

For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...
Comparison of modern 3D and 2D MR imaging sequences of the wrist at 3 Tesla

International Nuclear Information System (INIS)

Rehnitz, C.; Klaan, B.; Amarteifio, E.; Kauczor, H.U.; Weber, M.A.; Stillfried, F. von; Burkholder, I.

2016-01-01

To compare the image quality of modern 3D and 2D sequences for dedicated wrist imaging at 3 Tesla (T) MRI. At 3 T MRI, 18 patients (mean age: 36.2 years) with wrist pain and 16 healthy volunteers (mean age: 26.4 years) were examined using 2D proton density-weighted fat-saturated (PDfs), isotropic 3D TrueFISP, 3D MEDIC, and 3D PDfs SPACE sequences. Image quality was rated on a five-point scale (0 - 4) including overall image quality (OIQ), visibility of important structures (cartilage, ligaments, TFCC) and degree of artifacts. Signal-to-noise ratios (SNR) and contrast-to-noise ratios (CNR) of cartilage/bone/muscle/fluid as well as the mean overall SNR/CNR were calculated using region-of-interest analysis. ANOVA, paired t-, and Wilcoxon signed-rank tests were applied. The image quality of all tested sequences was superior to 3D PDfs SPACE (p < 0.01). 3D TrueFISP had the highest combined cartilage score (mean: 3.4) and performed better in cartilage comparisons against 3D PDfs SPACE in both groups and 2D PDfs in volunteers (p < 0.05). 3D MEDIC performed better in 7 of 8 comparisons (p < 0.05) regarding ligaments and TFCC. 2D PDfs provided consistently high scores. The mean overall SNR/CNR for 2D PDfs, 3D PDfs SPACE, 3D TrueFISP, and 3D MEDIC were 68/65, 32/27, 45/47, and 57/45, respectively. 2D PDfs performed best in most SNR/CNR comparisons (p < 0.05) and 3D MEDIC performed best within the 3D sequences (p < 0.05). Except for 3D PDfs SPACE, all tested 3D and 2D sequences provided high image quality. 3D TrueFISP was best for cartilage imaging, 3D MEDIC for ligaments and TFCC, and 2D PDfs for general wrist imaging.
Sequence assembly

DEFF Research Database (Denmark)

Scheibye-Alsing, Karsten; Hoffmann, S.; Frankel, Annett Maria

2009-01-01

Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data remain unresolved. Computational assembly is crucial in large genome projects as well as for the evolving high-throughput technologies and...... in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html....
Foundations of Sequence-to-Sequence Modeling for Time Series

OpenAIRE

Kuznetsov, Vitaly; Mariet, Zelda

2018-01-01

The availability of large amounts of time series data, paired with the performance of deep-learning algorithms on a broad class of problems, has recently led to significant interest in the use of sequence-to-sequence models for time series forecasting. We provide the first theoretical analysis of this time series forecasting framework. We include a comparison of sequence-to-sequence modeling to classical time series models, and as such our theory can serve as a quantitative guide for practiti...
Guided tooth eruption: Comparison of open and closed eruption techniques in labially impacted maxillary canines

Directory of Open Access Journals (Sweden)

S M Londhe

2014-01-01

Background: After third molars, the maxillary canines are the most commonly impacted permanent teeth and one-third of these are labial impactions. Impacted canines often require orthodontic guidance during eruption. This study was conducted to assess the posttreatment results of surgically exposed and orthodontically aligned labially impacted maxillary canines, comparing two different surgical techniques. Materials and Methods: The study was conducted in two phases, a surgical phase and an orthodontic phase. In the surgical phase, events during surgical exposure and recovery of 31 patients with labially impacted maxillary canines were recorded. Patients were managed with the open or closed eruption technique. The assessment included comparison of the two techniques of surgical exposure, postoperative pain, mobility, vitality, periodontal health, level of impaction, and duration of orthodontic treatment. Results: The postoperative recovery was longer after the open eruption technique than after the closed eruption technique (P = 0.000). Postoperative pain experienced by patients was similar, but regression of pain was faster with the closed eruption technique. The mean surgical time for the open eruption technique was shorter than for the closed eruption technique (P = 0.000). The total duration of orthodontic treatment was directly dependent upon the level of impaction, with deeper levels of impaction requiring longer orthodontic treatment. The mobility and vitality of the guided canine were similar with both techniques. Conclusion: The closed eruption technique was a longer surgical procedure, but the postoperative pain regression was faster. The duration of orthodontic treatment was longer with deeper levels of impaction. The closed eruption surgical technique provides better periodontal tissues around the guided erupted teeth.
Comparison of static model and dynamic model for the evaluation of station blackout sequences

International Nuclear Information System (INIS)

Lee, Kwang-Nam; Kang, Sun-Koo; Hong, Sung-Yull.

1992-01-01

Station blackout is one of the major contributors to the core damage frequency (CDF) in many PSA studies. Since the station blackout sequence exhibits dynamic features, accurate calculation of CDF for the station blackout sequence is not possible with the event tree/fault tree (ET/FT) method. Although the integral method can determine an accurate CDF, it is time consuming and makes it difficult to evaluate various alternative AC source configurations and sensitivities. In this study, a comparison is made between the static model and the dynamic model, and a new methodology which combines the static model and the dynamic model is provided for the accurate quantification of CDF and evaluation of improvement alternatives. Results of several case studies show that accurate calculation of CDF is possible by introducing an equivalent mission time. (author)
Phylogeny of the genus Haemophilus as determined by comparison of partial infB sequences

DEFF Research Database (Denmark)

Hedegaard, J; Okkels, H; Bruun, B

2001-01-01

A 453 bp fragment of infB, the gene encoding translation initiation factor 2, was sequenced and compared from 66 clinical isolates and type strains of Haemophilus species and related bacteria. Analysis of the partial infB sequences obtained suggested that the human isolates dependent on X and V...... factor, H. influenzae, H. haemolyticus, H. aegyptius and some cryptic genospecies of H. influenzae, were closely related to each other. H. parainfluenzae constituted a heterogeneous group within the boundaries of the genus, whereas H. aphrophilus/paraphrophilus and Actinobacillus actinomycetemcomitans...... were only remotely related to the type species of the genus Haemophilus. H. parahaemolyticus and H. paraphrohaemolyticus took up an intermediary position and may not belong in the genus Haemophilus sensu stricto. Ambiguous results were obtained with seven isolates tentatively identified as H. segnis...
Direct chloroplast sequencing: comparison of sequencing platforms and analysis tools for whole chloroplast barcoding.

Directory of Open Access Journals (Sweden)

Marta Brozynska

Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technology) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.

Comparison of Boolean analysis and standard phylogenetic methods using artificially evolved and natural mt-tRNA sequences from great apes.

Science.gov (United States)

Ari, Eszter; Ittzés, Péter; Podani, János; Thi, Quynh Chi Le; Jakó, Eena

2012-04-01

Boolean analysis (or BOOL-AN; Jakó et al., 2009. BOOL-AN: A method for comparative sequence analysis and phylogenetic reconstruction. Mol. Phylogenet. Evol. 52, 887-97.), a recently developed method for sequence comparison, uses the Iterative Canonical Form of Boolean functions. It considers sequence information in a way entirely different from standard phylogenetic methods (i.e. Maximum Parsimony, Maximum-Likelihood, Neighbor-Joining, and Bayesian analysis). The performance and reliability of Boolean analysis were tested and compared with the standard phylogenetic methods, using artificially evolved - simulated - nucleotide sequences and the 22 mitochondrial tRNA genes of the great apes. At the outset, we assumed that the phylogeny of Hominidae is generally well established, and the guide tree of artificial sequence evolution can also be used as a benchmark. These offer a possibility to compare and test the performance of different phylogenetic methods. Trees were reconstructed by each method from 2500 simulated sequences and 22 mitochondrial tRNA sequences. We also introduced a special re-sampling method for Boolean analysis on permuted sequence sites, the P-BOOL-AN procedure. Considering the reliability values (branch support values of consensus trees and Robinson-Foulds distances) we used for simulated sequence trees produced by different phylogenetic methods, BOOL-AN appeared as the most reliable method. Although the mitochondrial tRNA sequences of great apes are relatively short (59-75 bases long) and the ratio of their constant characters is about 75%, BOOL-AN, P-BOOL-AN and the Bayesian approach produced the same tree-topology as the established phylogeny, while the outcomes of Maximum Parsimony, Maximum-Likelihood and Neighbor-Joining methods were equivocal. We conclude that Boolean analysis is a promising alternative to existing methods of sequence comparison for phylogenetic reconstruction and congruence analysis. Copyright © 2012 Elsevier Inc. All
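The abstract above uses Robinson-Foulds distances as one reliability measure but does not say how they were computed. As a rough illustration only, the sketch below computes a rooted-tree variant of the distance: the number of clades found in one tree but not the other. The nested-tuple tree representation and the names `clades` and `rf_distance` are assumptions made for this example, not part of BOOL-AN or of any phylogenetics package.

```python
def clades(tree):
    """Return the non-trivial clades (as frozensets of leaf names) of a rooted tree.

    Assumption for this sketch: a tree is a nested tuple whose leaves are strings.
    """
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset([node])
        leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    root = walk(tree)
    found.discard(root)  # the clade holding every leaf is shared by all trees
    return found


def rf_distance(tree_a, tree_b):
    """Rooted Robinson-Foulds distance: size of the symmetric difference of clade sets."""
    return len(clades(tree_a) ^ clades(tree_b))


# Toy great-ape topologies (illustrative only, not data from the study).
reference = ((("human", "chimp"), "gorilla"), "orangutan")
alternative = ((("human", "gorilla"), "chimp"), "orangutan")
print(rf_distance(reference, reference))
print(rf_distance(reference, alternative))
```

A distance of zero means the two topologies contain exactly the same clades; each differing grouping adds to the count. Unrooted comparisons use bipartitions instead of clades, which this sketch does not attempt.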
Cloning, characterization and sequence comparison of the gene coding for IMP dehydrogenase from Pyrococcus furiosus.

Science.gov (United States)

Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

1996-10-03

We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophilic archaeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.
Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable evolution among core genes with therapeutic potential

Directory of Open Access Journals (Sweden)

Siragusa Gregory R

2011-06-01

Background: Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results: Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome were an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions: Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products.
RAPD and Internal Transcribed Spacer Sequence Analyses Reveal Zea nicaraguensis as a Section Luxuriantes Species Close to Zea luxurians

Science.gov (United States)

Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin

2011-01-01

Genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% bands were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than the other teosintes and cultivated maize, which should be regarded as a section Luxuriantes species. PMID:21525982
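The summary statistics quoted for the ITS region (average GC content, and variable sites counted separately from insertions/deletions) are simple to compute from an alignment. The following is a minimal sketch under stated assumptions: the input is a list of equal-length aligned strings with "-" for gaps, and the function names `gc_content` and `variable_sites` are hypothetical. It is not the procedure used in the study, which the abstract does not describe.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous A/C/G/T bases of a sequence."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


def variable_sites(alignment: list[str]) -> list[int]:
    """0-based alignment columns with more than one base, skipping gapped columns."""
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("aligned sequences must all have the same length")
    sites = []
    for col in range(length):
        column = {row[col].upper() for row in alignment}
        if "-" in column:
            continue  # indel columns are tallied separately from substitutions
        if len(column) > 1:
            sites.append(col)
    return sites


# Toy alignment (made up for illustration, not Zea ITS data).
toy = ["ACGT-A", "ACGA-A", "GCGTTA"]
print(variable_sites(toy))
print(gc_content("ATGC"))
```

Skipping gapped columns mirrors the abstract's wording, which reports variable sites "in addition to" the insertions/deletions; other conventions for handling gaps are equally possible.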
Genomic sequence of a ranavirus (family Iridoviridae) associated with salamander mortalities in North America

Energy Technology Data Exchange (ETDEWEB)

Jancovich, James K; Jinghe, Mao; Chinchar, V Gregory; Wyatt, Christopher; Case, Steven T; Kumar, Sudhir; Valente, Graziela; Subramanian, Sankar; Davidson, Elizabeth W; Collins, James P; Jacobs, Bertram L

2003-11-10

Disease is among the suspected causes of amphibian population declines, and an iridovirus and a chytrid fungus are the primary pathogens associated with amphibian mortalities. Ambystoma tigrinum virus (ATV) and a closely related strain, Regina ranavirus (RRV), are implicated in salamander die-offs in Arizona and Canada, respectively. We report the complete sequence of the ATV genome and partial sequence of the RRV genome. Sequence analysis of the ATV/RRV genomes showed marked similarity to other ranaviruses, including tiger frog virus (TFV) and frog virus 3 (FV3), the type virus of the genus Ranavirus (family Iridoviridae), as well as more distant relationships to lymphocystis disease virus, Chilo iridescent virus, and infectious spleen and kidney necrosis virus. Putative open reading frames (ORFs) in the ATV sequence identified 24 genes that appear to control virus replication and block antiviral responses. In addition, >50 other putative genes, homologous to ORFs in other iridoviral genomes but of unknown function, were also identified. Sequence comparison performed by dot plot analysis between ATV and itself revealed a conserved 14-bp palindromic repeat within most intragenic regions. Dot plot analysis of ATV vs RRV sequences identified several polymorphisms between the two isolates. Finally, a comparison of ATV and TFV genomic sequences identified genomic rearrangements consistent with the high recombination frequency of iridoviruses. Given the adverse effects that ranavirus infections have on amphibian and fish populations, ATV/RRV sequence information will allow the design of better diagnostic probes for identifying ranavirus infections and extend our understanding of molecular events in ranavirus-infected cells.
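Dot plot self-comparison, as mentioned in the abstract above, marks every pair of positions where a short word occurs in both sequences; off-diagonal dots in a self-comparison indicate repeats. The sketch below shows the idea on a toy string and adds a check for words that equal their own reverse complement (palindromes in the DNA sense). The function names and the toy sequence are assumptions for illustration; the tool and settings used for the ATV analysis are not stated in the abstract.

```python
def dot_plot(a: str, b: str, k: int) -> set[tuple[int, int]]:
    """Return (i, j) pairs where the k-mer starting at a[i] equals the k-mer at b[j]."""
    index: dict[str, list[int]] = {}
    for j in range(len(b) - k + 1):
        index.setdefault(b[j:j + k], []).append(j)
    hits = set()
    for i in range(len(a) - k + 1):
        for j in index.get(a[i:i + k], []):
            hits.add((i, j))
    return hits


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def palindromic_words(seq: str, k: int) -> list[int]:
    """Start positions of k-mers that equal their own reverse complement."""
    return [i for i in range(len(seq) - k + 1)
            if seq[i:i + k] == reverse_complement(seq[i:i + k])]


# Toy sequence containing the palindrome GAATTC twice (not ATV data).
toy = "TTGAATTCAAGAATTCTT"
repeats = {(i, j) for i, j in dot_plot(toy, toy, 6) if i != j}
print(sorted(repeats))
print(palindromic_words(toy, 6))
```

In a real genome the word length would be chosen to match the repeat of interest (14 bp in the abstract), and the quadratic number of hits for low-complexity sequence would need filtering.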
Genomic sequence of a ranavirus (family Iridoviridae) associated with salamander mortalities in North America

International Nuclear Information System (INIS)

Jancovich, James K.; Mao Jinghe; Chinchar, V. Gregory; Wyatt, Christopher; Case, Steven T.; Kumar, Sudhir; Valente, Graziela; Subramanian, Sankar; Davidson, Elizabeth W.; Collins, James P.; Jacobs, Bertram L.

2003-01-01

Disease is among the suspected causes of amphibian population declines, and an iridovirus and a chytrid fungus are the primary pathogens associated with amphibian mortalities. Ambystoma tigrinum virus (ATV) and a closely related strain, Regina ranavirus (RRV), are implicated in salamander die-offs in Arizona and Canada, respectively. We report the complete sequence of the ATV genome and partial sequence of the RRV genome. Sequence analysis of the ATV/RRV genomes showed marked similarity to other ranaviruses, including tiger frog virus (TFV) and frog virus 3 (FV3), the type virus of the genus Ranavirus (family Iridoviridae), as well as more distant relationships to lymphocystis disease virus, Chilo iridescent virus, and infectious spleen and kidney necrosis virus. Putative open reading frames (ORFs) in the ATV sequence identified 24 genes that appear to control virus replication and block antiviral responses. In addition, >50 other putative genes, homologous to ORFs in other iridoviral genomes but of unknown function, were also identified. Sequence comparison performed by dot plot analysis between ATV and itself revealed a conserved 14-bp palindromic repeat within most intragenic regions. Dot plot analysis of ATV vs RRV sequences identified several polymorphisms between the two isolates. Finally, a comparison of ATV and TFV genomic sequences identified genomic rearrangements consistent with the high recombination frequency of iridoviruses. Given the adverse effects that ranavirus infections have on amphibian and fish populations, ATV/RRV sequence information will allow the design of better diagnostic probes for identifying ranavirus infections and extend our understanding of molecular events in ranavirus-infected cells
Openings and Closings in Telephone Conversations between Native Spanish Speakers.

Science.gov (United States)

Coronel-Molina, Serafin M.

1998-01-01

A study analyzed the opening and closing sequences of 11 dyads of native Spanish-speakers in natural telephone conversations conducted in Spanish. The objective was to determine how closely Hispanic cultural patterns of conduct for telephone conversations follow the sequences outlined in previous research. It is concluded that Spanish…
Ribosomal DNA sequence analysis of different geographically distributed Aloe Vera plants: Comparison with clonally regenerated plants

International Nuclear Information System (INIS)

Yagi, A.; Sato, Y.; Miwa, Y.; Kabbash, A.; Moustafa, S.; Shimomura, K.; El-Bassuony, A.

2006-01-01

A comparison of the sequences in the internal transcribed spacer (ITS) 1 region of rDNA between clonally regenerated A. vera and the same species in Japan, USA and Egypt revealed the presence of two types of nucleotide sequences, 252 and 254 bps. Based on the findings in the ITS 1 region, A. vera having 252 and 254 bps clearly showed a stable sequence similarity, suggesting high conservation of the base peak sequence in the ITS 1 region. However, frequent base substitutions in the 252 bps leaf samples that came from callus tissue and micropropagated plants were observed around the regions of nucleotide positions 66, 99 and 199-201. The minor deviation in clonally regenerated A. vera may be due to the stage of regeneration and cell specification in the case of the callus tissue. In the present study, the base peak sequence of the ITS 1 region of rDNA was adopted as a molecular marker for differentiating A. vera plants from geographically distributed and clonally regenerated A. vera plants, and it was suggested that the base peak substitutions in the ITS 1 region may arise from the different nutritional and environmental factors in cultivation and plant growth stages. (author)
The phylogeny of Mediterranean tortoises and their close relativesbased on complete mitochondrial genome sequences from museumspecimens

Energy Technology Data Exchange (ETDEWEB)

Parham, James F.; Macey, J. Robert; Papenfuss, Theodore J.; Feldman, Chris R.; Turkozan, Oguz; Polymeni, Rosa; Boore, Jeffrey

2005-04-29

As part of an ongoing project to generate a mitochondrial database for terrestrial tortoises based on museum specimens, the complete mitochondrial genome sequences of 10 species and a ~14 kb sequence from an eleventh species are reported. The sampling of the present study emphasizes Mediterranean tortoises (genus Testudo and their close relatives). Our new sequences are aligned, along with those of two testudinoid turtles from GenBank, Chrysemys picta and Mauremys reevesii, yielding an alignment of 14,858 positions, of which 3,238 are parsimony informative. We develop a phylogenetic taxonomy for Testudo and related species based on well-supported, diagnosable clades. Several well-supported nodes are recovered, including the monophyly of a restricted Testudo, T. kleinmanni + T. marginata (the Chersus clade), and the placement of the enigmatic African pancake tortoise (Malacochersus tornieri) within the predominantly Palearctic greater Testudo group (Testudona tax. nov.). Despite the large amount of sequence reported, there is low statistical support for some nodes within Testudona and so we do not propose names for those groups. A preliminary and conservative estimation of divergence times implies a late Miocene diversification for the testudonan clade (6-12 million years ago), matching their first appearance in the fossil record. The multi-continental distribution of testudonan turtles can be explained by the establishment of permanent connections between Europe, Africa, and Asia at this time. The arrival of testudonan turtles to Africa occurred after one or more initial tortoise invasions gave rise to the diverse (>25 species) 'Geochelone complex.' Two unusual genomic features are reported for the mtDNA of one tortoise, M. tornieri: (1) nad4 has a shift of reading frame that we suggest is resolved by translational frameshifting of the mRNA on the ribosome during protein synthesis and (2) there are two copies of the control region and trnF, with the
Comparison of base composition analysis and Sanger sequencing of mitochondrial DNA for four U.S. population groups.

Science.gov (United States)

Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M

2014-01-01

A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
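As a rough illustration of the metric named in this record, discrimination capacity is commonly computed in forensic genetics as the number of distinct profiles divided by the number of samples typed. The sketch below assumes that definition; the abstract does not spell out its formula, and the function name and toy profiles are hypothetical.

```python
def discrimination_capacity(profiles):
    """Fraction of samples carrying a distinct profile:
    (number of different profiles) / (number of samples).
    Assumed definition; see the lead-in."""
    if not profiles:
        raise ValueError("at least one profile is required")
    return len(set(profiles)) / len(profiles)


# Hypothetical profiles for four samples; two samples share profile "H1".
toy_profiles = ["H1", "H1", "H2", "H3"]
print(discrimination_capacity(toy_profiles))
```

Under this definition an assay that covers more nucleotides can only split shared profiles further, which fits the trend the abstract reports.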
A measure of the denseness of a phylogenetic network. [by sequenced proteins from extant species]

Science.gov (United States)

Holmquist, R.

1978-01-01

An objective measure of phylogenetic denseness is developed to examine various phylogenetic criteria: alpha- and beta-hemoglobin, myoglobin, cytochrome c, and the parvalbumin family. Attention is given to the number of nucleotide replacements separating homologous sequences, and to the topology of the network (in other words, to the qualitative nature of the network as defined by how closely the studied species are related). Applications include quantitative comparisons of species origin, relation, and rates of evolution.
Comparison of next generation sequencing technologies for transcriptome characterization

Directory of Open Access Journals (Sweden)

Soltis Douglas E

2009-08-01

Full Text Available Abstract Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc (http://fgp.huck.psu.edu/NG_Sims/ngsim.pl), an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR) index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC) clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences) regions in uncharacterised genomic sequences. The restriction that a particular
Classification of domains of closed operators

International Nuclear Information System (INIS)

Lassner, G.; Timmermann, W.

1975-01-01

The structure of domains of determining closed operators in the Hilbert space by means of sequence spaces is investigated. The final classification provides three classes of these domains. Necessary and sufficient conditions of equivalence of these domains are obtained in the form of equivalency of corresponding sequences of natural numbers. Connection with the perturbation theory is mentioned [ru
Genome sequence of the endosymbiont Rickettsia peacockii and comparison with virulent Rickettsia rickettsii: identification of virulence factors.

Directory of Open Access Journals (Sweden)

Roderick F Felsheim

2009-12-01

Full Text Available Rickettsia peacockii, also known as the East Side Agent, is a non-pathogenic obligate intracellular bacterium found as an endosymbiont in Dermacentor andersoni ticks in the western USA and Canada. Its presence in ticks is correlated with reduced prevalence of Rickettsia rickettsii, the agent of Rocky Mountain Spotted Fever. It has been proposed that a virulent SFG rickettsia underwent changes to become the East Side Agent. We determined the genome sequence of R. peacockii and provide a comparison to a closely related virulent R. rickettsii. The presence of 42 chromosomal copies of the ISRpe1 transposon in the genome of R. peacockii is associated with a lack of synteny with the genome of R. rickettsii and numerous deletions via recombination between transposon copies. The plasmid contains a number of genes from distantly related organisms, such as part of the glycosylation island of Pseudomonas aeruginosa. Genes deleted or mutated in R. peacockii which may relate to loss of virulence include those coding for an ankyrin repeat containing protein, DsbA, RickA, protease II, OmpA, ScaI, and a putative phosphoethanolamine transferase. The gene coding for the ankyrin repeat containing protein is especially implicated as it is mutated in R. rickettsii strain Iowa, which has attenuated virulence. Presence of numerous copies of the ISRpe1 transposon, likely acquired by lateral transfer from a Cardinium species, are associated with extensive genomic reorganization and deletions. The deletion and mutation of genes possibly involved in loss of virulence have been identified by this genomic comparison. It also illustrates that the introduction of a transposon into the genome can have varied effects; either correlating with an increase in pathogenicity as in Francisella tularensis or a loss of pathogenicity as in R. peacockii and the recombination enabled by multiple transposon copies can cause significant deletions in some genomes while not in others.
Gene discovery and transcript analyses in the corn smut pathogen Ustilago maydis: expressed sequence tag and genome sequence comparison

Directory of Open Access Journals (Sweden)

Saville Barry J

2007-09-01

Full Text Available Abstract Background Ustilago maydis is the basidiomycete fungus responsible for common smut of corn and is a model organism for the study of fungal phytopathogenesis. To aid in the annotation of the genome sequence of this organism, several expressed sequence tag (EST) libraries were generated from a variety of U. maydis cell types. In addition to utility in the context of gene identification and structure annotation, the ESTs were analyzed to identify differentially abundant transcripts and to detect evidence of alternative splicing and anti-sense transcription. Results Four cDNA libraries were constructed using RNA isolated from U. maydis diploid teliospores (U. maydis strains 518 × 521) and haploid cells of strain 521 grown under nutrient rich, carbon starved, and nitrogen starved conditions. Using the genome sequence as a scaffold, the 15,901 ESTs were assembled into 6,101 contiguous expressed sequences (contigs); among these, 5,482 corresponded to predicted genes in the MUMDB (MIPS Ustilago maydis database), while 619 aligned to regions of the genome not yet designated as genes in MUMDB. A comparison of EST abundance identified numerous genes that may be regulated in a cell type or starvation-specific manner. The transcriptional response to nitrogen starvation was assessed using RT-qPCR. The results of this suggest that there may be cross-talk between the nitrogen and carbon signalling pathways in U. maydis. Bioinformatic analysis identified numerous examples of alternative splicing and anti-sense transcription. While intron retention was the predominant form of alternative splicing in U. maydis, other varieties were also evident (e.g. exon skipping). Selected instances of both alternative splicing and anti-sense transcription were independently confirmed using RT-PCR. Conclusion Through this work: 1) substantial sequence information has been provided for U. maydis genome annotation; 2) new genes were identified through the discovery of 619
Complete mitochondrial genome sequence of Indian medium carp, Labeo gonius (Hamilton, 1822) and its comparison with other related carp species.

Science.gov (United States)

Behera, Bijay Kumar; Kumari, Kavita; Baisvar, Vishwamitra Singh; Rout, Ajaya Kumar; Pakrashi, Sudip; Paria, Prasenjet; Jena, J K

2017-01-01

In the present study, the complete mitochondrial genome sequence of Labeo gonius is reported using PGM sequencer (Ion Torrent). The complete mitogenome of L. gonius, 16 614 bp in length, was obtained by de novo assembly of genomic reads using the Torrent Mapping Alignment Program (TMAP). The mitogenome of L. gonius comprises 13 protein-coding genes, 22 tRNAs, 2 rRNA genes, and a D-loop control region, with gene order and organization similar to most other fish mitogenomes in the NCBI databases. The mitogenome in the present study has 99% similarity to the complete mitogenome sequence of Labeo fimbriatus, as reported earlier. The phylogenetic analysis of Cypriniformes depicted that their mitogenomes are closely related to each other. The complete mitogenome sequence of L. gonius would be helpful in understanding the population genetics, phylogenetics, and evolution of Indian Carps.
HIV drug resistance testing among patients failing second line antiretroviral therapy. Comparison of in-house and commercial sequencing.

Science.gov (United States)

Chimukangara, Benjamin; Varyani, Bhavini; Shamu, Tinei; Mutsvangwa, Junior; Manasa, Justen; White, Elizabeth; Chimbetete, Cleophas; Luethy, Ruedi; Katzenstein, David

2017-05-01

HIV genotyping is often unavailable in low and middle-income countries due to infrastructure requirements and cost. We compared genotype resistance testing in patients with virologic failure, by amplification of HIV pol gene, followed by "in-house" sequencing and commercial sequencing. Remnant plasma samples from adults and children failing second-line ART were amplified and sequenced using in-house and commercial di-deoxysequencing, and analyzed in Harare, Zimbabwe and at Stanford, U.S.A, respectively. HIV drug resistance mutations were determined using the Stanford HIV drug resistance database. Twenty-six of 28 samples were amplified and 25 were successfully genotyped. Comparison of average percent nucleotide and amino acid identities between 23 pairs sequenced in both laboratories were 99.51 (±0.56) and 99.11 (±0.95), respectively. All pairs clustered together in phylogenetic analysis. Sequencing analysis identified 6/23 pairs with mutation discordances resulting in differences in phenotype, but these did not impact future regimens. The results demonstrate our ability to produce good quality drug resistance data in-house. Despite discordant mutations in some sequence pairs, the phenotypic predictions were not clinically significant. Copyright © 2016 Elsevier B.V. All rights reserved.
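The percent identity figures reported for the paired sequences can be illustrated with a minimal sketch. It assumes the two sequences are already aligned to the same length and that columns containing a gap are skipped; both choices, the function name, and the toy sequences are assumptions, since the abstract does not describe how identities were computed.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of compared alignment columns with identical residues.
    Columns where either sequence has a gap are ignored (an assumption)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Toy pair (hypothetical, not study data): one mismatch in ten columns.
print(percent_identity("ACGTACGTAC", "ACGTACGTAT"))  # prints 90.0
```

The same function works for amino acid sequences; averaging its output over all sequence pairs would give a mean identity of the kind quoted in the abstract.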
Comparison of Pre-Analytical FFPE Sample Preparation Methods and Their Impact on Massively Parallel Sequencing in Routine Diagnostics

Science.gov (United States)

Heydt, Carina; Fassunke, Jana; Künstlinger, Helen; Ihle, Michaela Angelika; König, Katharina; Heukamp, Lukas Carl; Schildhaus, Hans-Ulrich; Odenthal, Margarete; Büttner, Reinhard; Merkelbach-Bruse, Sabine

2014-01-01

Over the last years, massively parallel sequencing has rapidly evolved and has now transitioned into molecular pathology routine laboratories. It is an attractive platform for analysing multiple genes at the same time with very little input material. Therefore, the need for high quality DNA obtained from automated DNA extraction systems has increased, especially to those laboratories which are dealing with formalin-fixed paraffin-embedded (FFPE) material and high sample throughput. This study evaluated five automated FFPE DNA extraction systems as well as five DNA quantification systems using the three most common techniques, UV spectrophotometry, fluorescent dye-based quantification and quantitative PCR, on 26 FFPE tissue samples. Additionally, the effects on downstream applications were analysed to find the most suitable pre-analytical methods for massively parallel sequencing in routine diagnostics. The results revealed that the Maxwell 16 from Promega (Mannheim, Germany) seems to be the superior system for DNA extraction from FFPE material. The extracts had a 1.3–24.6-fold higher DNA concentration in comparison to the other extraction systems, a higher quality and were most suitable for downstream applications. The comparison of the five quantification methods showed intermethod variations but all methods could be used to estimate the right amount for PCR amplification and for massively parallel sequencing. Interestingly, the best results in massively parallel sequencing were obtained with a DNA input of 15 ng determined by the NanoDrop 2000c spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). No difference could be detected in mutation analysis based on the results of the quantification methods. These findings emphasise, that it is particularly important to choose the most reliable and constant DNA extraction system, especially when using small biopsies and low elution volumes, and that all common DNA quantification techniques can be used for
Comparison of pre-analytical FFPE sample preparation methods and their impact on massively parallel sequencing in routine diagnostics.

Directory of Open Access Journals (Sweden)

Carina Heydt

Full Text Available Over the last years, massively parallel sequencing has rapidly evolved and has now transitioned into molecular pathology routine laboratories. It is an attractive platform for analysing multiple genes at the same time with very little input material. Therefore, the need for high quality DNA obtained from automated DNA extraction systems has increased, especially to those laboratories which are dealing with formalin-fixed paraffin-embedded (FFPE) material and high sample throughput. This study evaluated five automated FFPE DNA extraction systems as well as five DNA quantification systems using the three most common techniques, UV spectrophotometry, fluorescent dye-based quantification and quantitative PCR, on 26 FFPE tissue samples. Additionally, the effects on downstream applications were analysed to find the most suitable pre-analytical methods for massively parallel sequencing in routine diagnostics. The results revealed that the Maxwell 16 from Promega (Mannheim, Germany) seems to be the superior system for DNA extraction from FFPE material. The extracts had a 1.3-24.6-fold higher DNA concentration in comparison to the other extraction systems, a higher quality and were most suitable for downstream applications. The comparison of the five quantification methods showed intermethod variations but all methods could be used to estimate the right amount for PCR amplification and for massively parallel sequencing. Interestingly, the best results in massively parallel sequencing were obtained with a DNA input of 15 ng determined by the NanoDrop 2000c spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). No difference could be detected in mutation analysis based on the results of the quantification methods. These findings emphasise, that it is particularly important to choose the most reliable and constant DNA extraction system, especially when using small biopsies and low elution volumes, and that all common DNA quantification techniques can

Stress concentration during pellet cladding interaction: Comparison of closed-form solutions with 2D(r,θ) finite element simulations

International Nuclear Information System (INIS)

Sercombe, Jérôme; Masson, Renaud; Helfer, Thomas

2013-01-01

Highlights: • This paper presents closed-form solutions concerning pellet cladding interaction. • First, the opening of a radial crack in a pellet fragment is estimated. • Second, the stresses in the cladding in front of the pellet crack are calculated. • The closed-form solutions are found in good agreement with 2D FE simulations. • They are then used in the fuel code ALCYONE to model PCI during power ramps. -- Abstract: This paper presents two closed-form solutions that can be used to enrich the mechanical description of fuel pellets and cladding behavior in standard one-dimensional based fuel performance codes. The first one is concerned with the estimation of the opening of a radial crack in a pellet fragment induced by the radial thermal gradient in the pellet and limited by the pellet-clad contact pressure. The second one describes the stress distribution in a cladding bore in front of an opening pellet crack. A linear angular variation of the pellet-clad contact pressure and a constant prescribed radial displacement are considered. The closed-form solutions are checked by comparison to independent finite element models of the pellet fragment and of the cladding. Their ability to describe non-axisymmetric displacement and stress fields during loading histories representative of base irradiation and power ramps is then demonstrated by cross-comparison with the 2D pellet fragment-cladding model of the multi-dimensional fuel performance code ALCYONE. The calculated radial crack opening profiles at different times and the hoop stress concentration in the cladding at the top of the ramp are found in good agreement with ALCYONE
Comparison of pause predictions of two sequence-dependent transcription models

International Nuclear Information System (INIS)

Bai, Lu; Wang, Michelle D

2010-01-01

Two recent theoretical models, Bai et al (2004, 2007) and Tadigotla et al (2006), formulated thermodynamic explanations of sequence-dependent transcription pausing by RNA polymerase (RNAP). The two models differ in some basic assumptions and therefore make different yet overlapping predictions for pause locations, and different predictions on pause kinetics and mechanisms. Here we present a comprehensive comparison of the two models. We show that while they have comparable predictive power of pause locations at low NTP concentrations, the Bai et al model is more accurate than Tadigotla et al at higher NTP concentrations. The pausing kinetics predicted by Bai et al is also consistent with time-course transcription reactions, while Tadigotla et al is unsuited for this type of kinetic prediction. More importantly, the two models in general predict different pausing mechanisms even for the same pausing sites, and the Bai et al model provides an explanation more consistent with recent single molecule observations
Comparison of sequencing based CNV discovery methods using monozygotic twin quartets.

Directory of Open Access Journals (Sweden)

Marc-André Legault

Full Text Available The advent of high throughput sequencing methods breeds an important amount of technical challenges. Among those is the one raised by the discovery of copy-number variations (CNVs) using whole-genome sequencing data. CNVs are genomic structural variations defined as a variation in the number of copies of a large genomic fragment, usually more than one kilobase. Here, we aim to compare different CNV calling methods in order to assess their ability to consistently identify CNVs by comparison of the calls in 9 quartets of identical twin pairs. The use of monozygotic twins provides a means of estimating the error rate of each algorithm by observing CNVs that are inconsistently called when considering the rules of Mendelian inheritance and the assumption of an identical genome between twins. The similarity between the calls from the different tools and the advantage of combining call sets were also considered. ERDS and CNVnator obtained the best performance when considering the inherited CNV rate with a mean of 0.74 and 0.70, respectively. Venn diagrams were generated to show the agreement between the different algorithms, before and after filtering out familial inconsistencies. This filtering revealed a high number of false positives for CNVer and Breakdancer. A low overall agreement between the methods suggested a high complementarity of the different tools when calling CNVs. The breakpoint sensitivity analysis indicated that CNVnator and ERDS achieved better resolution of CNV borders than the other tools. The highest inherited CNV rate was achieved through the intersection of these two tools (81%). This study showed that ERDS and CNVnator provide good performance on whole genome sequencing data with respect to CNV consistency across families, CNV breakpoint resolution and CNV call specificity. The intersection of the calls from the two tools would be valuable for CNV genotyping pipelines.
Filling gaps in biodiversity knowledge for macrofungi: contributions and assessment of an herbarium collection DNA barcode sequencing project.

Science.gov (United States)

Osmundson, Todd W; Robert, Vincent A; Schoch, Conrad L; Baker, Lydia J; Smith, Amy; Robich, Giovanni; Mizzan, Luca; Garbelotto, Matteo M

2013-01-01

Despite recent advances spearheaded by molecular approaches and novel technologies, species description and DNA sequence information are significantly lagging for fungi compared to many other groups of organisms. Large scale sequencing of vouchered herbarium material can aid in closing this gap. Here, we describe an effort to obtain broad ITS sequence coverage of the approximately 6000 macrofungal-species-rich herbarium of the Museum of Natural History in Venice, Italy. Our goals were to investigate issues related to large sequencing projects, develop heuristic methods for assessing the overall performance of such a project, and evaluate the prospects of such efforts to reduce the current gap in fungal biodiversity knowledge. The effort generated 1107 sequences submitted to GenBank, including 416 previously unrepresented taxa and 398 sequences exhibiting a best BLAST match to an unidentified environmental sequence. Specimen age and taxon affected sequencing success, and subsequent work on failed specimens showed that an ITS1 mini-barcode greatly increased sequencing success without greatly reducing the discriminating power of the barcode. Similarity comparisons and nonmetric multidimensional scaling ordinations based on pairwise distance matrices proved to be useful heuristic tools for validating the overall accuracy of specimen identifications, flagging potential misidentifications, and identifying taxa in need of additional species-level revision. Comparison of within- and among-species nucleotide variation showed a strong increase in species discriminating power at 1-2% dissimilarity, and identified potential barcoding issues (same sequence for different species and vice-versa). All sequences are linked to a vouchered specimen, and results from this study have already prompted revisions of species-sequence assignments in several taxa.
Whole-genome sequence analysis of the Mycobacterium avium complex and proposal of the transfer of Mycobacterium yongonense to Mycobacterium intracellulare subsp. yongonense subsp. nov.

Science.gov (United States)

Castejon, Maria; Menéndez, Maria Carmen; Comas, Iñaki; Vicente, Ana; Garcia, Maria J

2018-06-01

Bacterial whole-genome sequences contain informative features of their evolutionary pathways. Comparison of whole-genome sequences has become the method of choice for classification of prokaryotes, thus allowing the identification of bacteria from an evolutionary perspective, and providing data to resolve some current controversies. Currently, controversy exists about the assignment of members of the Mycobacterium avium complex, as is the case for Mycobacterium yongonense and 'Mycobacterium indicus pranii'. These two mycobacteria, closely related to Mycobacterium intracellulare on the basis of standard phenotypic and single gene-sequence comparisons, were not considered members of that species on the basis of some particular differences displayed by a single strain. Whole-genome sequence comparison procedures, namely the average nucleotide identity and the genome distance, showed that those two mycobacteria should be considered members of the species M. intracellulare. The results were confirmed with other whole-genome comparison supplementary methods. According to the data provided, Mycobacterium yongonense and 'Mycobacterium indicus pranii' should be renamed and included as members of M. intracellulare. This study highlights the problems caused when a novel species is accepted on the basis of a single strain, as was the case for M. yongonense. Based mainly on whole-genome sequence analysis, we conclude that M. yongonense should be reclassified as a subspecies of Mycobacterium intracellulare as Mycobacterium intracellulare subsp. yongonense and 'Mycobacterium indicus pranii' classified in the same subspecies as the type strain of Mycobacterium intracellulare and classified as Mycobacterium intracellulare subsp. intracellulare.
Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

Science.gov (United States)

Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

2015-03-01

The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
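The performance figures in this record rest on the standard definitions of sensitivity, specificity, and accuracy relative to a reference method. A minimal sketch of those definitions follows; the function name and the counts are hypothetical and are not taken from the study.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Standard test-performance measures against a reference method.
    tp/fn: reference-positive samples called positive/negative.
    tn/fp: reference-negative samples called negative/positive."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative reference case")
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=45, fn=5, tn=40, fp=10)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Because accuracy pools both classes, a method can reach a high accuracy while still missing mutations, which is why the abstract reports all three values per method.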
Complementary DNA and derived amino acid sequence of the β subunit of human complement protein C8: identification of a close structural and ancestral relationship to the α subunit and C9

International Nuclear Information System (INIS)

Howard, O.M.Z.; Rao, A.G.; Sodetz, J.M.

1987-01-01

A cDNA clone encoding the β subunit (M_r 64,000) of the eighth component of complement (C8) has been isolated from a human liver cDNA library. This clone has a cDNA insert of 1.95 kilobases (kb) and contains the entire β sequence [1608 base pairs (bp)]. Analysis of total cellular RNA isolated from the hepatoma cell line HepG2 revealed the mRNA for β to be ∼ 2.5 kb. This is similar to the message size for the α subunit of C8 and confirms the existence of different mRNAs for α and β. This finding supports genetic evidence that α and β are encoded at different loci. Analysis of the derived amino acid sequence revealed several membrane surface seeking segments that may facilitate β interaction with target membranes during complement-mediated cytolysis. Determination of the carbohydrate composition indicated 1 or 2 asparagine-linked but no O-linked oligosaccharide chains. Comparison of the β sequence to that reported earlier and to that of human C9 revealed a striking homology between all three proteins. For β and α, the overall homology is 33% on the basis of identity and 53% when conserved substitutions are allowed. For β and C9, the values are 26% and 47%, respectively. All three have a large internal domain that is nearly cysteine free and N- and C-termini that are cysteine-rich and homologous to the low-density lipoprotein receptor repeat and epidermal growth factor type sequences, respectively. The overall homology and similarities in size and structural organization are indicative of a close ancestral relationship. It is concluded that α, β and C9 are members of a family of structurally related proteins that are capable of interacting to produce a hydrophilic to amphiphilic transition and membrane association.
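The two homology figures reported above (identity, and identity plus conserved substitutions) can be computed from a pairwise alignment roughly as follows. This is only a sketch: the residue groupings in CONSERVED_GROUPS are an illustrative assumption, not the substitution scheme used by the authors, and the sequences are made up.

```python
# Hypothetical grouping of amino acids treated as conservative substitutions.
CONSERVED_GROUPS = ["AVLIM", "FYW", "KRH", "DE", "STNQ"]


def identity_and_similarity(aligned_a, aligned_b):
    """Percent identity and percent similarity over gap-free alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where either sequence has a gap.
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no gap-free columns to compare")
    identical = sum(x == y for x, y in pairs)
    similar = sum(
        x == y or any(x in group and y in group for group in CONSERVED_GROUPS)
        for x, y in pairs
    )
    return 100 * identical / len(pairs), 100 * similar / len(pairs)


# Made-up aligned fragments, for illustration only.
identity, similarity = identity_and_similarity("MKVLDE-W", "MRILEQAW")
print(f"identity={identity:.1f}% similarity={similarity:.1f}%")
```

Similarity is always at least as large as identity, since every identical column also counts as similar.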
A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.

Directory of Open Access Journals (Sweden)

Wenyu Zhang

The advent of next-generation sequencing technologies has been accompanied by the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Because little is known about the applicability and performance of these software tools, choosing a suitable assembler is a difficult task. Here, we provide information on the adaptivity of each program and, above all, compare the performance of eight distinct tools against eight groups of simulated datasets from the Solexa sequencing platform. Considering the computational time, maximum random access memory (RAM) occupancy, assembly accuracy and integrity, our study indicates that string-based assemblers and overlap-layout-consensus (OLC) assemblers are well suited for very short reads and longer reads of small genomes, respectively. For large datasets of more than a hundred million short reads, De Bruijn graph-based assemblers would be more appropriate. In terms of software implementation, string-based assemblers are superior to graph-based ones, of which SOAPdenovo is complex with respect to the creation of its configuration file. Our comparison study will assist researchers in selecting a well-suited assembler and offers essential information for the improvement of existing assemblers or the development of novel assemblers.
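For readers unfamiliar with the De Bruijn graph approach mentioned above, the following minimal sketch shows how such a graph can be built from short reads. It is a simplification under stated assumptions: error-free reads, a single strand, and no edge multiplicities, all of which real assemblers handle differently. The function name and reads are hypothetical.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer contributes one edge: its prefix -> its suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return dict(graph)


# Two made-up overlapping reads.
graph = de_bruijn_graph(["ACGTC", "CGTCA"], k=3)
for node in sorted(graph):
    print(node, "->", sorted(graph[node]))
```

Overlapping reads share k-mers and therefore share edges, which is why the graph size grows with genome complexity rather than with the number of reads.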
Whole-genome comparison of two Campylobacter jejuni isolates of the same sequence type reveals multiple loci of different ancestral lineage.

Directory of Open Access Journals (Sweden)

Patrick J Biggs

Campylobacter jejuni ST-474 is the most important human enteric pathogen in New Zealand, and yet this genotype is rarely found elsewhere in the world. Insight into the evolution of this organism was gained by a whole genome comparison of two ST-474, flaA SVR-14 isolates and other available C. jejuni isolates and genomes. The two isolates were collected from different sources, human (H22082) and retail poultry (P110b), at the same time and from the same geographical location. Solexa sequencing of each isolate resulted in ~1.659 Mb (H22082) and ~1.656 Mb (P110b) of assembled sequences within 28 (H22082) and 29 (P110b) contigs. We analysed 1502 genes for which we had sequences within both ST-474 isolates and within at least one of 11 C. jejuni reference genomes. Although 94.5% of genes were identical between the two ST-474 isolates, we identified 83 genes that differed by at least one nucleotide, including 55 genes with non-synonymous substitutions. These covered 101 kb and contained 672 point differences. We inferred that 22 (3.3%) of these differences were due to mutation and 650 (96.7%) were imported via recombination. Our analysis estimated 38 recombinant breakpoints within these 83 genes, which correspond to recombination events affecting at least 19 loci regions and give a tract length estimate of ~2 kb. This includes a ~12 kb region displaying non-homologous recombination in one of the ST-474 genomes, with the insertion of two genes, including ykgC, a putative oxidoreductase, and a conserved hypothetical protein of unknown function. Furthermore, our analysis indicates that the source of this recombined DNA is more likely to have come from C. jejuni strains that are more closely related to ST-474. This suggests that the rates of recombination and mutation are similar in order of magnitude, but that recombination has been much more important for generating divergence between the two ST-474 isolates.
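The percentages attributed to mutation and recombination follow directly from the counts given in the abstract (22 and 650 of 672 point differences). A short check of that arithmetic, using a hypothetical helper name:

```python
def share_percent(part, total):
    """Percentage of `total` represented by `part`, rounded to one decimal."""
    return round(100 * part / total, 1)


# Counts as reported in the abstract.
point_differences = 672
by_mutation = 22
by_recombination = 650

assert by_mutation + by_recombination == point_differences
print(share_percent(by_mutation, point_differences))       # share due to mutation
print(share_percent(by_recombination, point_differences))  # share due to recombination
```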
Sequence comparison and phylogenetic analysis of core gene of ...

African Journals Online (AJOL)

STORAGESEVER

2010-07-19

Jul 19, 2010 ... and antisense primers, a single band of 573 base pairs .... Amino acid sequence alignment of Cluster I and Cluster II of phylogenetic tree. First ten sequences ... sequence weighting, position-specific gap penalties and weight.
Small-target leak detection for a closed vessel via infrared image sequences

Science.gov (United States)

Zhao, Ling; Yang, Hongjiu

2017-03-01

This paper focuses on a leak diagnosis and localization method based on infrared image sequences. Problems of a high probability of false warnings and the negative effect of marginal information are addressed by the leak detection method. An experimental model is established for leak diagnosis and localization on infrared image sequences. Differential background prediction based on a kernel regression method is presented to eliminate the negative effect of marginal information on the test vessel. A pipeline filter based on layered voting is designed to reduce the probability of false leak-point warnings. A combined leak diagnosis and localization algorithm is proposed based on infrared image sequences. The effectiveness and potential of the developed techniques are shown through experimental results.
[Complete genome sequencing of polymalic acid-producing strain Aureobasidium pullulans CCTCC M2012223].

Science.gov (United States)

Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang

2017-01-04

To explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide genetic background for metabolic engineering. The complete genome of A. pullulans CCTCC M2012223 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Then, fragment assembly, gene prediction, functional annotation, and GO/COG clusters were analyzed in comparison with those of five other A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30756831 bp with an average GC content of 47.49%, and 9452 genes were successfully predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the largest genome assembly size. Protein sequences involved in the pullulan and polymalic acid pathways were highly conserved in all six A. pullulans varieties. Although A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum have a close affinity, some point mutations and insertions occurred in protein sequences involved in melanin biosynthesis. Genome information of A. pullulans CCTCC M2012223 was annotated and genes involved in the melanin, pullulan and polymalic acid pathways were compared, which would provide a theoretical basis for genetic modification of metabolic pathways in A. pullulans.
A multithreaded parallel implementation of a dynamic programming algorithm for sequence comparison.

Science.gov (United States)

Martins, W S; Del Cuvillo, J B; Useche, F J; Theobald, K B; Gao, G R

2001-01-01

This paper discusses the issues involved in implementing a dynamic programming algorithm for biological sequence comparison on a general-purpose parallel computing platform based on a fine-grain event-driven multithreaded program execution model. Fine-grain multithreading permits efficient parallelism exploitation in this application both by taking advantage of asynchronous point-to-point synchronizations and communication with low overheads and by effectively tolerating latency through the overlapping of computation and communication. We have implemented our scheme on EARTH, a fine-grain event-driven multithreaded execution and architecture model which has been ported to a number of parallel machines with off-the-shelf processors. Our experimental results show that the dynamic programming algorithm can be efficiently implemented on EARTH systems with high performance (e.g., speedup of 90 on 120 nodes), good programmability and reasonable cost.
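The abstract does not say which dynamic programming recurrence was parallelized, so the sketch below assumes a Needleman-Wunsch style global alignment score with hypothetical match, mismatch and gap values. It illustrates why such algorithms suit fine-grain parallelism: every cell on one anti-diagonal depends only on cells from the two preceding anti-diagonals, so the cells of a diagonal could be computed concurrently. Here they are simply visited in a sequential loop.

```python
def wavefront_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment score, filling the DP matrix one anti-diagonal at a time."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    # Boundary cells: aligning a prefix against the empty string costs only gaps.
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    # d = i + j identifies an anti-diagonal; its cells are mutually independent.
    for d in range(2, n + m + 1):
        for i in range(max(1, d - m), min(n, d - 1) + 1):
            j = d - i
            substitution = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + substitution,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,               # gap in b
                score[i][j - 1] + gap,               # gap in a
            )
    return score[n][m]


print(wavefront_alignment_score("GATTACA", "GATTACA"))
```

In a parallel setting the inner loop is the unit of concurrency, and the synchronization cost comes from handing finished cells to their neighbours on the next diagonal.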
Genomic comparison of the endophyte Herbaspirillum seropedicae SmR1 and the phytopathogen Herbaspirillum rubrisubalbicans M1 by suppressive subtractive hybridization and partial genome sequencing.

Science.gov (United States)

Monteiro, Rose A; Balsanelli, Eduardo; Tuleski, Thalita; Faoro, Helison; Cruz, Leonardo M; Wassem, Roseli; de Baura, Valter A; Tadra-Sfeir, Michelle Z; Weiss, Vinícius; DaRocha, Wanderson D; Muller-Santos, Marcelo; Chubatsu, Leda S; Huergo, Luciano F; Pedrosa, Fábio O; de Souza, Emanuel M

2012-05-01

Herbaspirillum rubrisubalbicans M1 causes the mottled stripe disease in sugarcane cv. B-4362. Inoculation of this cultivar with Herbaspirillum seropedicae SmR1 does not produce disease symptoms. A comparison of the genomic sequences of these closely related species may permit a better understanding of contrasting phenotypes such as endophytic association and pathogenic lifestyle. To achieve this goal, we constructed suppressive subtractive hybridization (SSH) libraries to identify DNA fragments present in one species and absent in the other. In a parallel approach, partial genomic sequence from H. rubrisubalbicans M1 was directly compared in silico with the H. seropedicae SmR1 genome. The genomic differences between the two organisms revealed by SSH suggested that lipopolysaccharide and adhesins are potential molecular factors involved in the different phenotypic behavior. The wss cluster, probably involved in cellulose biosynthesis, was found in H. rubrisubalbicans M1. Expression of this gene cluster was increased in H. rubrisubalbicans M1 cells attached to the surface of maize root, and knockout of the wssD gene led to a decrease in maize root surface attachment and endophytic colonization. The production of cellulose could be responsible for the maize attachment pattern of H. rubrisubalbicans M1 that is capable of outcompeting H. seropedicae SmR1. © 2012 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Tools for integrated sequence-structure analysis with UCSF Chimera

Directory of Open Access Journals (Sweden)

Huang Conrad C

2006-07-01

Background: Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a) provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b) facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit); (c) can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d) interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals. Results: The molecular graphics program UCSF Chimera includes a suite of tools for interactive analyses of sequences and structures. Structures automatically associate with sequences in imported alignments, allowing many kinds of crosstalk. A novel method is provided to superimpose structures in the absence of a pre-existing sequence alignment. The method uses both sequence and secondary structure, and can match even structures with very low sequence identity. Another tool constructs structure-based sequence alignments from superpositions of two or more proteins. Chimera is designed to be extensible, and mechanisms for incorporating user-specific data without Chimera code development are also provided. Conclusion: The tools described here apply to many problems involving comparison and analysis of protein structures and their sequences. Chimera includes complete documentation and is intended for use by a wide range of scientists, not just those in the computational disciplines. UCSF Chimera is free for non-commercial use and is
Comparison of variable region 3 sequences of human immunodeficiency virus type 1 from infected children with the RNA and DNA sequences of the virus populations of their mothers.

Science.gov (United States)

Scarlatti, G; Leitner, T; Halapi, E; Wahlberg, J; Marchisio, P; Clerici-Schoeller, M A; Wigzell, H; Fenyö, E M; Albert, J; Uhlén, M

1993-01-01

We have compared the variable region 3 sequences from 10 human immunodeficiency virus type 1 (HIV-1)-infected infants to virus sequences from the corresponding mothers. The sequences were derived from DNA of uncultured peripheral blood mononuclear cells (PBMC), DNA of cultured PBMC, and RNA from serum collected at or shortly after delivery. The infected infants, in contrast to the mothers, harbored homogeneous virus populations. Comparison of sequences from the children and clones derived from DNA of the corresponding mothers showed that the transmitted virus represented either a minor or a major virus population of the mother. In contrast to an earlier study, we found no evidence of selection of minor virus variants during transmission. Furthermore, the transmitted virus variant did not show any characteristic molecular features. In some cases the transmitted virus was more related to the virus RNA population of the mother and in other cases it was more related to the virus DNA population. This suggests that either cell-free or cell-associated virus may be transmitted. These data will help AIDS researchers to understand the mechanism of transmission and to plan strategies for prevention of transmission. PMID:8446584
Transcriptome-based differentiation of closely-related Miscanthus lines.

Directory of Open Access Journals (Sweden)

Philippe Chouvarine

BACKGROUND: Distinguishing between individuals is critical to those conducting animal/plant breeding, food safety/quality research, diagnostic and clinical testing, and evolutionary biology studies. Classical genetic identification studies are based on marker polymorphisms, but polymorphism-based techniques are time and labor intensive and often cannot distinguish between closely related individuals. Illumina sequencing technologies provide the detailed sequence data required for rapid and efficient differentiation of related species, lines/cultivars, and individuals in a cost-effective manner. Here we describe the use of Illumina high-throughput exome sequencing, coupled with SNP mapping, as a rapid means of distinguishing between related cultivars of the lignocellulosic bioenergy crop giant miscanthus (Miscanthus × giganteus). We provide the first exome sequence database for Miscanthus species complete with Gene Ontology (GO) functional annotations. RESULTS: A SNP comparative analysis of rhizome-derived cDNA sequences was successfully utilized to distinguish three Miscanthus × giganteus cultivars from each other and from other Miscanthus species. Moreover, the resulting phylogenetic tree generated from SNP frequency data parallels the known breeding history of the plants examined. Some of the giant miscanthus plants exhibit considerable sequence divergence. CONCLUSIONS: Here we describe an analysis of Miscanthus in which high-throughput exome sequencing was utilized to differentiate between closely related genotypes despite the current lack of a reference genome sequence. We functionally annotated the exome sequences and provide resources to support Miscanthus systems biology. In addition, we demonstrate the use of commercial high-performance cloud computing for computational GO annotation.
Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

DEFF Research Database (Denmark)

Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk

2013-01-01

Next Generation Sequencing (NGS) is becoming more adopted into viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Ion Torrent PGM platforms... to the consensus sequence. Additionally, we got an average sequence depth for the genome of 4000 for the Ion Torrent PGM and 400 for the FLX platform, making the mapping suitable for single nucleotide variant (SNV) detection. The analysis revealed a single non-silent SNV A10665G leading to the amino acid change D...
Comparison of the nucleotide sequence of wild-type hepatitis A virus and its attenuated candidate vaccine derivative

International Nuclear Information System (INIS)

Cohen, J.I.; Rosenblum, B.; Ticehurst, J.R.; Daemer, R.; Feinstone, S.; Purcell, R.H.

1987-01-01

Development of attenuated mutants for use as vaccines is in progress for other viruses, including influenza, rotavirus, varicella-zoster, cytomegalovirus, and hepatitis-A virus (HAV). Attenuated viruses may be derived from naturally occurring mutants that infect human or nonhuman hosts. Alternatively, attenuated mutants may be generated by passage of wild-type virus in cell culture. Production of attenuated viruses in cell culture is a laborious and empiric process. Despite previous empiric successes, understanding the molecular basis for attenuation of vaccine viruses could facilitate future development and use of live-virus vaccines. Comparison of the complete nucleotide sequences of wild-type (virulent) and vaccine (attenuated) viruses has been reported for polioviruses and yellow fever virus. Here, the authors compare the nucleotide sequence of wild-type HAV HM-175 with that of a candidate vaccine derivative
Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

Directory of Open Access Journals (Sweden)

Pietro Liò

A major goal of bioinformatics is the characterization of transcription factors and the transcriptional programs they regulate. Given the speed of genome sequencing, we would like to quickly annotate regulatory sequences in newly sequenced genomes. In such cases, it would be helpful to predict sequence motifs by using experimental data from a closely related model organism. Here we present a general algorithm that identifies transcription factor binding sites in a newly sequenced species by performing Bayesian regression on the annotated species. First we set out the rationale of our method by applying it within the same species, then we extend it to use data available in closely related species. Finally, we generalise the method to handle the case when a certain number of experiments, from several species close to the species on which to make inference, are available. In order to show the performance of the method, we analyse three functionally related networks in the Ascomycota. Two gene network case studies are related to the G2/M phase of the Ascomycota cell cycle; the third is related to morphogenesis. We also compare the method with MatrixReduce and discuss other types of validation and tests. The first network is well known and provides a biological validation test of the method. The two cell cycle case studies, where the gene network size is conserved, demonstrate the utility of annotating new species sequences using all the available replicas from model species. The third case, where the gene network size varies among species, shows that the combination of information is less powerful but is still informative. Our methodology is quite general and could be extended to integrate other high-throughput data from model organisms.

Delineation of the species Haemophilus influenzae by phenotype, multilocus sequence phylogeny, and detection of marker genes

DEFF Research Database (Denmark)

Nørskov-Lauritsen, Niels; Overballe, MD; Kilian, Mogens

2009-01-01

To obtain more information on the much-debated definition of prokaryotic species, we investigated the borders of Haemophilus influenzae by comparative analysis of H. influenzae reference strains with closely related bacteria including strains assigned to Haemophilus haemolyticus, cryptic genospecies biotype IV, and the never formally validated species "Haemophilus intermedius". Multilocus sequence phylogeny based on six housekeeping genes separated a cluster encompassing the type and the reference strains of H. influenzae from 31 more distantly related strains. Comparison of 16S rRNA gene...
Cyprinus carpio Genome sequencing and assembly

NARCIS (Netherlands)

Kolder, I.C.R.M.; Plas-Duivesteijn, van der Suzanne J.; Tan, G.; Wiegertjes, G.; Forlenza, M.; Guler, A.T.; Travin, D.Y.; Nakao, M.; Moritomo, T.; Irnazarow, I.; Jansen, H.J.

2013-01-01

Sequencing of the common carp (Cyprinus carpio carpio Linnaeus, 1758) genome, with the objective of establishing carp as a model organism to supplement the closely related zebrafish (Danio rerio). The sequenced individual is a homozygous female (by gynogenesis) of R3 x R8 carp, the heterozygous
Recent speciation in three closely related sympatric specialists: inferences using multi-locus sequence, post-mating isolation and endosymbiont data.

Directory of Open Access Journals (Sweden)

Huai-Jun Xue

Shifting between unrelated host plants is relatively rare for phytophagous insects, and distinct host specificity may play crucial roles in reproductive isolation. However, the isolation status and the relationship between parental divergence and post-mating isolation among closely related sympatric specialists are still poorly understood. Here, multi-locus sequences were used to estimate the relationships among three closely related, host plant-specific flea beetles, Altica cirsicola, A. fragariae and A. viridicyanea (abbreviated as AC, AF and AV, respectively). The tree topologies were inconsistent when different genes or different combinations of gene fragments were used. The relationship AF+(AC+AV) was supported, however, by both the gene tree and the species tree based on concatenated data. Post-mating reproductive data on the results of crossing these three species are best interpreted in the light of a well-established phylogeny. Nuclear-induced, but not Wolbachia-induced, unidirectional cytoplasmic incompatibility, which was detected in AC-AF and AF-AV but not in AC-AV, may also suggest a closer genetic affinity between AC and AV. The prevalence of Wolbachia in these three beetles, and the fact that the endosymbiont in most individuals of AV and AC shares the same wsp haplotype, may provide further evidence for AF+(AC+AV). Our study also suggested that these three flea beetles diverged in a relatively short time (0.94 My), which may be the result of shifting between unrelated host plants and distinct host specificity. Incomplete post-mating isolation combined with almost complete lineage sorting indicated that effective pre-mating isolation among these three species should have evolved.
Rapid high resolution genotyping of Francisella tularensis by whole genome sequence comparison of annotated genes ("MLST+").

Directory of Open Access Journals (Sweden)

Markus H Antwerpen

The zoonotic disease tularemia is caused by the bacterium Francisella tularensis. This pathogen is considered a category A select agent with potential to be misused in bioterrorism. Molecular typing based on DNA sequence, such as canSNP typing or MLVA, has become the accepted standard for this organism. Due to the organism's highly clonal nature, the current typing methods have reached their limit of discrimination for classifying closely related subpopulations within the subspecies F. tularensis ssp. holarctica. We introduce a new gene-by-gene approach, MLST+, based on whole genome data of 15 sequenced F. tularensis ssp. holarctica strains and apply this approach to investigate an epidemic of lethal tularemia among non-human primates in two animal facilities in Germany. Due to the high resolution of MLST+ we are able to demonstrate that three independent clones of this highly infectious pathogen were responsible for these spatially and temporally restricted outbreaks.
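A gene-by-gene comparison of the kind described above can be pictured as comparing allele profiles, where each isolate is assigned an allele number per gene. The sketch below is a generic illustration under that assumption, not the MLST+ implementation; the gene names, isolate names and allele numbers are hypothetical.

```python
def allele_distance(profile_a, profile_b):
    """Count genes present in both profiles whose allele numbers differ."""
    shared_genes = profile_a.keys() & profile_b.keys()
    return sum(profile_a[gene] != profile_b[gene] for gene in shared_genes)


# Hypothetical allele profiles: gene name -> allele number.
profiles = {
    "isolate_1": {"gene_a": 1, "gene_b": 4, "gene_c": 2},
    "isolate_2": {"gene_a": 1, "gene_b": 4, "gene_c": 2},
    "isolate_3": {"gene_a": 1, "gene_b": 5, "gene_c": 3},
}

print(allele_distance(profiles["isolate_1"], profiles["isolate_2"]))
print(allele_distance(profiles["isolate_1"], profiles["isolate_3"]))
```

Isolates with a distance of zero across many genes would be candidates for the same clone, while larger distances point to independent introductions; the more genes compared, the finer the resolution.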
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

Science.gov (United States)

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

Directory of Open Access Journals (Sweden)

Nathan D. Olson

2015-03-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies.
RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

Science.gov (United States)

Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

1990-07-01

The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.
Identification of Clinical Coryneform Bacterial Isolates: Comparison of Biochemical Methods and Sequence Analysis of 16S rRNA and rpoB Genes

Science.gov (United States)

Adderson, Elisabeth E.; Boudreaux, Jan W.; Cummings, Jessica R.; Pounds, Stanley; Wilson, Deborah A.; Procop, Gary W.; Hayden, Randall T.

2008-01-01

We compared the relative levels of effectiveness of three commercial identification kits and three nucleic acid amplification tests for the identification of coryneform bacteria by testing 50 diverse isolates, including 12 well-characterized control strains and 38 organisms obtained from pediatric oncology patients at our institution. Between 33.3 and 75.0% of control strains were correctly identified to the species level by phenotypic systems or nucleic acid amplification assays. The most sensitive tests were the API Coryne system and amplification and sequencing of the 16S rRNA gene using primers optimized for coryneform bacteria, which correctly identified 9 of 12 control isolates to the species level, and all strains with a high-confidence call were correctly identified. Organisms not correctly identified were species not included in the test kit databases or not producing a pattern of reactions included in kit databases or which could not be differentiated among several genospecies based on reaction patterns. Nucleic acid amplification assays had limited abilities to identify some bacteria to the species level, and comparison of sequence homologies was complicated by the inclusion of allele sequences obtained from uncultivated and uncharacterized strains in databases. The utility of rpoB genotyping was limited by the small number of representative gene sequences that are currently available for comparison. The correlation between identifications produced by different classification systems was poor, particularly for clinical isolates. PMID:18160450
Centroid based clustering of high throughput sequencing reads based on n-mer counts.

Science.gov (United States)

Solovyov, Alexander; Lipkin, W Ian

2013-09-08

Many problems in computational biology require alignment-free sequence comparisons. One of the common tasks involving sequence comparison is sequence clustering. Here we apply methods of alignment-free comparison (in particular, comparison using sequence composition) to the challenge of sequence clustering. We study several centroid-based algorithms for clustering sequences based on word counts. Their performance shows that the k-means algorithm, with or without data whitening, is computationally efficient. A higher clustering accuracy can be achieved using the soft expectation maximization method, whereby each sequence is attributed to each cluster with a specific probability. We implement an open source tool for alignment-free clustering. It is publicly available from GitHub: https://github.com/luscinius/afcluster. We show the utility of alignment-free sequence clustering for high throughput sequencing analysis despite its limitations. In particular, it allows one to perform assembly with reduced resources and a minimal loss of quality. The major factor affecting performance of alignment-free read clustering is the length of the read.
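The idea of clustering reads by word counts with k-means can be illustrated with a minimal sketch. It is not the afcluster implementation: the function names, the word length k=2, the farthest-first seeding and the toy reads are all assumptions chosen to keep the example small and deterministic.

```python
from itertools import product


def kmer_profile(seq, k=2):
    """Normalized k-mer (word) frequencies of seq over the ACGT alphabet."""
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(words, 0)
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:  # skip words containing ambiguous bases
            counts[word] += 1
    total = sum(counts.values()) or 1
    return [counts[w] / total for w in words]


def sq_dist(a, b):
    """Squared Euclidean distance between two frequency vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, n_clusters, n_iter=20):
    """Plain (hard-assignment) k-means on a list of vectors."""
    # Assumption: farthest-first seeding, used here only for determinism.
    centroids = [points[0]]
    while len(centroids) < n_clusters:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(n_clusters), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(n_clusters):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical toy reads: two AT-rich and two GC-rich.
reads = ["ATATATATAT", "TATATATATA", "GCGCGCGCGC", "CGCGCGCGCG"]
labels = kmeans([kmer_profile(r) for r in reads], n_clusters=2)
print(labels)
```

The soft expectation maximization variant mentioned in the abstract would replace the hard assignment step with per-cluster membership probabilities; that is not shown here.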
Sequence embedding for fast construction of guide trees for multiple sequence alignment

LENUS (Irish Health Repository)

Blackshields, Gordon

2010-05-14

Background: The most widely used multiple sequence alignment methods require sequences to be clustered as an initial step. Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires memory and time proportional to $N^2$ for $N$ sequences. When $N$ grows larger than 10,000 or so, this becomes increasingly prohibitive and can form a significant barrier to carrying out very large multiple alignments. Results: In this paper, we have tested variations on a class of embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. Conclusions: We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignment. Source code is available for download from http://www.clustal.org/mbed.tgz.
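The embedding idea can be sketched as follows: each sequence is represented by its distances to a small set of reference sequences, so only N × t distances are computed for t references instead of all N × N pairs. The sketch is illustrative only and is not the mBed code; the Jaccard distance on k-mer sets, the function names and the choice of references are assumptions standing in for whatever distance and seed selection the paper uses.

```python
def kmer_set(seq, k=3):
    """Set of all k-mers occurring in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_distance(a, b, k=3):
    """Jaccard distance between k-mer sets (assumed stand-in for a
    more expensive alignment-based distance)."""
    sa, sb = kmer_set(a, k), kmer_set(b, k)
    union = sa | sb
    if not union:
        return 0.0
    return 1.0 - len(sa & sb) / len(union)


def embed(sequences, references, k=3):
    """Map each sequence to a vector of distances to the references.

    Cost is len(sequences) * len(references) distance evaluations.
    """
    return [[kmer_distance(s, ref, k) for ref in references] for s in sequences]


# Hypothetical toy input: two similar sequences and one unrelated one.
seqs = ["ACGTACGTAC", "ACGTACGTAA", "TTTTGGGGCC"]
refs = [seqs[0], seqs[2]]
vectors = embed(seqs, refs)
print(vectors)
```

The resulting vectors can then be clustered with any vector-space method (for example k-means) to produce a guide tree without ever filling the full distance matrix.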
Analysis of high-depth sequence data for studying viral diversity: a comparison of next generation sequencing platforms using Segminator II

Directory of Open Access Journals (Sweden)

Archer John

2012-03-01

Background: Next generation sequencing provides detailed insight into the variation present within viral populations, introducing the possibility of treatment strategies that are both reactive and predictive. Current software tools, however, need to be scaled up to accommodate high-depth viral data sets, which are often temporally or spatially linked. In addition, due to the development of novel sequencing platforms and chemistries, each with implicit strengths and weaknesses, it will be helpful for researchers to be able to routinely compare and combine data sets from different platforms/chemistries. In particular, error associated with a specific sequencing process must be quantified so that true biological variation may be identified. Results: Segminator II was developed to allow for the efficient comparison of data sets derived from different sources. We demonstrate its usage by comparing large data sets from 12 influenza H1N1 samples sequenced on both the 454 Life Sciences and Illumina platforms, permitting quantification of platform error. For mismatches, median error rates of 0.10 and 0.12%, respectively, suggested that both platforms performed similarly. For insertions and deletions, median error rates within the 454 data (0.3 and 0.2%, respectively) were significantly higher than those within the Illumina data (0.004 and 0.006%, respectively). In agreement with previous observations, these higher rates were strongly associated with homopolymeric stretches on the 454 platform. Outside of such regions both platforms had similar indel error profiles. Additionally, we apply our software to the identification of low frequency variants. Conclusion: We have demonstrated, using Segminator II, that it is possible to distinguish platform specific error from biological variation using data derived from two different platforms. We have used this approach to quantify the amount of error present within the 454 and Illumina platforms in
The nucleotide sequences of two leghemoglobin genes from soybean

DEFF Research Database (Denmark)

Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O

1982-01-01

We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences in identical positions. Comparison of the coding sequences with known amino-acid sequences of soybean leghemoglobins suggest that the two genes...
Comparison of methods for genomic localization of gene trap sequences

Directory of Open Access Journals (Sweden)

Ferrin Thomas E

2006-09-01

Background: Gene knockouts in a model organism such as mouse provide a valuable resource for the study of basic biology and human disease. Determining which gene has been inactivated by an untargeted gene trapping event poses a challenging annotation problem because gene trap sequence tags, which represent sequence near the vector insertion site of a trapped gene, are typically short and often contain unresolved residues. To understand better the localization of these sequences on the mouse genome, we compared stand-alone versions of the alignment programs BLAT, SSAHA, and MegaBLAST. A set of 3,369 sequence tags was aligned to build 34 of the mouse genome using default parameters for each algorithm. Known genome coordinates for the cognate set of full-length genes (1,659 sequences) were used to evaluate localization results. Results: In general, all three programs performed well in terms of localizing sequences to a general region of the genome, with only relatively subtle errors identified for a small proportion of the sequence tags. However, large differences in performance were noted with regard to correctly identifying exon boundaries. BLAT correctly identified the vast majority of exon boundaries, while SSAHA and MegaBLAST missed the majority of exon boundaries. SSAHA consistently reported the fewest false positives and is the fastest algorithm. MegaBLAST was comparable to BLAT in speed, but was the most susceptible to localizing sequence tags incorrectly to pseudogenes. Conclusion: The differences in performance for sequence tags and full-length reference sequences were surprisingly small. Characteristic variations in localization results for each program were noted that affect the localization of sequence at exon boundaries, in particular.
The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae)

Directory of Open Access Journals (Sweden)

Dong-Keun Yi

2016-06-01

The plant chloroplast (cp) genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp) in length, including a pair of short inverted repeat regions (IRa and IRb) of 139 bp each, separated by a small single copy (SSC) region of 54,323 bp and a large single copy (LSC) region of 66,735 bp. It contains 114 genes: 68 protein-coding genes, 35 tRNA genes, four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSRs) have been detected in the A. nephrolepis cp genome. Large IR sequences are located at the 42-kb inversion points (1186 bp). The A. nephrolepis cp genome is identical to that of Abies koreana, a closely related taxon. Pairwise comparison between the two cp genomes revealed 140 polymorphic sites in each. The complete cp genome sequence of A. nephrolepis has significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for the development of DNA markers for easy identification and classification.
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

Science.gov (United States)

Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

1991-02-15

The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
Computational Software to Fit Seismic Data Using Epidemic-Type Aftershock Sequence Models and Modeling Performance Comparisons

Science.gov (United States)

Chu, A.

2016-12-01

Modern earthquake catalogs are often analyzed using spatial-temporal point process models such as the epidemic-type aftershock sequence (ETAS) models of Ogata (1998). My work implements three of the homogeneous ETAS models described in Ogata (1998). With a model's log-likelihood function, my software finds the Maximum-Likelihood Estimates (MLEs) of the model's parameters to estimate the homogeneous background rate and the temporal and spatial parameters that govern triggering effects. The EM algorithm is employed for its stability and robustness (Veen and Schoenberg, 2008). My work also presents comparisons among the three models in robustness, convergence speed, and implementation from theory to computing practice. Up-to-date regional seismic data from seismically active areas such as Southern California and Japan are used to demonstrate the comparisons. Data analysis has been done using the computer languages Java and R. Java has the advantages of being strongly typed and of giving easy control over memory resources, while R has the advantage of offering numerous functions for statistical computing. Comparisons are also made between the two programming languages in convergence and stability, computational speed, and ease of implementation. Issues that may affect convergence, such as spatial shapes, are discussed.
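To make the structure of an ETAS model concrete, the sketch below evaluates the conditional intensity of the widely used purely temporal form, in which the rate at time t is a constant background rate plus a magnitude-weighted, power-law-decaying contribution from every earlier event. This is a simplification for illustration: the abstract concerns spatial-temporal models, the software it describes is written in Java and R and is not reproduced here, and the function and parameter names (mu, K, alpha, c, p, m0) follow common convention rather than anything stated in the abstract.

```python
import math


def etas_intensity(t, events, mu, K, alpha, c, p, m0):
    """Temporal ETAS conditional intensity at time t.

    events: iterable of (time, magnitude) pairs.
    mu:     background rate.
    K, alpha, c, p: triggering parameters (productivity, magnitude
            scaling, and the two Omori-type decay parameters).
    m0:     reference (cutoff) magnitude.
    """
    rate = mu
    for t_i, m_i in events:
        if t_i < t:  # only past events can trigger
            rate += K * math.exp(alpha * (m_i - m0)) / (t - t_i + c) ** p
    return rate


# Hypothetical catalog: one event at the cutoff magnitude, one later event.
catalog = [(0.0, 3.0), (5.0, 4.0)]
rate_at_1 = etas_intensity(1.0, catalog, mu=0.5, K=1.0, alpha=1.0, c=1.0, p=1.0, m0=3.0)
print(rate_at_1)
```

Maximum-likelihood fitting would sum the log of this intensity over observed events and subtract its integral over the observation window; an EM scheme, as cited in the abstract, alternates between assigning triggering probabilities and updating the parameters.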
The Comparison of Biochemical and Sequencing 16S rDNA Gene Methods to Identify Nontuberculous Mycobacteria

Directory of Open Access Journals (Sweden)

Shafipour, M.

2014-11-01

The identification of Mycobacteria at the species level is of great medical importance. Biochemical tests are laborious and time-consuming, so new techniques could be used to identify the species. This research aimed to compare biochemical methods and 16S rDNA gene sequencing for identifying nontuberculous Mycobacteria in patients suspected of having tuberculosis in Golestan province, the region with the highest prevalence of tuberculosis in Iran. Among 3336 patients suspected of having tuberculosis who were referred to hospitals and health care centres in Golestan province during 2010-2011, 319 (9.56%) culture-positive cases were collected. Species were identified using biochemical tests. For the samples recognized as nontuberculous Mycobacteria, DNA was extracted by boiling, 16S rDNA PCR was performed, and the sequences were identified using NCBI BLAST. Of the 319 positive samples in Golestan Province, 300 cases were M. tuberculosis and 19 cases (5.01%) were identified as nontuberculous Mycobacteria by biochemical tests. For 15 of the 19 nontuberculous Mycobacteria, PCR and sequencing gave the same identification as the biochemical methods (similarity rate: 78.9%). However, after PCR, 1 case identified as M. simiae by biochemical tests was identified as M. lentiflavum and 3 other cases were identified as Nocardia. Biochemical methods thus corresponded to 16S rDNA PCR and sequencing in 78.9% of cases. However, for the identification of M. lentiflavum and Nocardia sp., the molecular method is better than biochemical methods.
Genomic sequence of 'Candidatus Liberibacter solanacearum' haplotype C and its comparison with haplotype A and B genomes.

Directory of Open Access Journals (Sweden)

Jinhui Wang

Haplotypes A and B of 'Candidatus Liberibacter solanacearum' (CLso) are associated with diseases of solanaceous plants, especially Zebra chip disease of potato, and haplotypes C, D and E are associated with symptoms on apiaceous plants. To date, one complete genome of haplotype B and two high quality draft genomes of haplotype A have been obtained for these unculturable bacteria using metagenomics from the psyllid vector Bactericera cockerelli. Here, we present the first genomic sequences obtained for the carrot-associated CLso. These two genomic sequences of haplotype C, FIN114 (1.24 Mbp) and FIN111 (1.20 Mbp), were obtained from carrot psyllids (Trioza apicalis) harboring CLso. Genomic comparisons between the haplotypes A, B and C revealed that the genome organization differs between these haplotypes, due to large inversions and other recombinations. Comparison of protein-coding genes indicated that the core genome of CLso consists of 885 ortholog groups, with the pan-genome consisting of 1327 ortholog groups. Twenty-seven ortholog groups are unique to CLso haplotype C, whilst 11 ortholog groups shared by the haplotypes A and B are not found in the haplotype C. Some of these ortholog groups that are not part of the core genome may encode functions related to interactions with the different host plant and psyllid species.
Modelling estimation and analysis of dynamic processes from image sequences using temporal random closed sets and point processes with application to the cell exocytosis and endocytosis

OpenAIRE

Díaz Fernández, Ester

2010-01-01

In this thesis, new models and methodologies are introduced for the analysis of dynamic processes characterized by image sequences with spatial temporal overlapping. The spatial temporal overlapping exists in many natural phenomena and should be addressed properly in several Science disciplines such as Microscopy, Material Sciences, Biology, Geostatistics or Communication Networks. This work is related to the Point Process and Random Closed Set theories, within Stochastic Ge...
Transcriptome analysis and comparison reveal divergence between two invasive whitefly cryptic species

Directory of Open Access Journals (Sweden)

Xia Jun

2011-09-01

Background: Invasive species are valuable model systems for examining the evolutionary processes and molecular mechanisms associated with their specific characteristics by comparison with closely related species. Over the past 20 years, two species of the whitefly Bemisia tabaci species complex, Middle East-Asia Minor 1 (MEAM1) and Mediterranean (MED), have both spread from their origin in the Middle East/Mediterranean to many countries despite their apparent differences in many life history parameters. Previously, we have sequenced the transcriptome of MED. In this study, we sequenced the transcriptome of MEAM1 and took a comparative genomic approach to investigate the transcriptome evolution and the genetic factors underlying the differences between MEAM1 and MED. Results: Using Illumina sequencing technology, we generated 17 million sequencing reads for MEAM1. These reads were assembled into 57,741 unique sequences and 15,922 sequences were annotated with an E-value above $10^{-5}$. Compared with the MED transcriptome, we identified 3,585 pairs of high quality orthologous genes and inferred their sequence divergences. The average differences in coding, 5' untranslated and 3' untranslated region were 0.83%, 1.66% and 1.43%, respectively. The level of sequence divergence provides additional support to the proposition that MEAM1 and MED are two species. Based on the ratio of nonsynonymous and synonymous substitutions, we identified 24 sequences that have evolved in response to positive selection. Many of those genes are predicted to be involved in metabolism and insecticide resistance which might contribute to the divergence of the two whitefly species. Conclusions: Our data present a comprehensive sequence comparison between the two invasive whitefly species. This study will provide a road map for future investigations on the molecular mechanisms underlying their biological differences.

SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data

Directory of Open Access Journals (Sweden)

Yuxiang Tan

2015-01-01

The performance evaluation of fusion detection algorithms from high-throughput sequencing data crucially relies on the availability of data with known positive and negative cases of gene rearrangements. The use of simulated data circumvents some shortcomings of real data by generation of an unlimited number of true and false positive events, and the consequent robust estimation of accuracy measures, such as precision and recall. Although a few simulated fusion datasets from RNA Sequencing (RNA-Seq) are available, they are of limited sample size. This makes it difficult to systematically evaluate the performance of RNA-Seq based fusion-detection algorithms. Here, we present SimFuse to address this problem. SimFuse utilizes real sequencing data as the fusions’ background to closely approximate the distribution of reads from a real sequencing library and uses a reference genome as the template from which to simulate fusions’ supporting reads. To assess the supporting read-specific performance, SimFuse generates multiple datasets with various numbers of fusion supporting reads. Compared to an extant simulated dataset, SimFuse gives users control over the supporting read features and the sample size of the simulated library, based on which the performance metrics needed for the validation and comparison of alternative fusion-detection algorithms can be rigorously estimated.
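With simulated data the set of true fusions is known, so precision and recall of a detector follow directly from set overlaps. The sketch below shows that calculation only; it is not part of SimFuse, and the function name and the gene-pair labels are hypothetical.

```python
def precision_recall(true_fusions, called_fusions):
    """Precision and recall of called fusions against a known truth set.

    Both arguments are iterables of hashable fusion identifiers,
    here (5' gene, 3' gene) tuples.
    """
    truth = set(true_fusions)
    called = set(called_fusions)
    true_positives = len(truth & called)
    precision = true_positives / len(called) if called else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical simulated truth set and detector output.
simulated = [("GENE_A", "GENE_B"), ("GENE_C", "GENE_D")]
detected = [("GENE_A", "GENE_B"), ("GENE_E", "GENE_F")]
precision, recall = precision_recall(simulated, detected)
print(precision, recall)
```

Repeating this over datasets simulated with different numbers of supporting reads gives the supporting-read-specific performance curve the abstract refers to.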
[Comparison of rDNA internal transcribed spacer sequences in asparagus].

Science.gov (United States)

Ou, Li-Jun; Ye, Wei; Zeng, Gui-Ping; Jiang, Xiang-Hui; She, Chao-Wen; Xu, Dong; Yang, Jia-Qiang

2010-10-01

ITS sequences of nine species were used to identify counterfeit medicinal material and to analyse the phylogeny of Asparagus. The ITS sequences were analysed by amplification, cloning, sequencing and alignment. The ITS sequences of the nine species ranged from 711 to 748 bp in length, and the G + C content was about 60%. The phylogenetic tree constructed from the ITS sequences divided the nine species into two branches: Asparagus cochinchinensis, Asparagus officinalis, Asparagus densiflorus, Asparagus densiflorus cv. Myers and Asparagus densiflorus cv. Sprengeri formed one branch and the others formed the second. In the first branch, Asparagus densiflorus and Asparagus densiflorus cv. Myers, both from Africa, clustered first and then clustered with Asparagus densiflorus cv. Sprengeri, a variant of Asparagus densiflorus. In the other branch, Asparagus setaceus was relatively distantly related to the other three materials. The ITS sequences could distinguish species of Asparagus and so detect counterfeits. The positions of some species in the phylogenetic tree were debatable, and the ITS sequences were combined with other analytical tools to analyse the true phylogeny.
Apophysomyces variabilis: draft genome sequence and comparison of predictive virulence determinants with other medically important Mucorales.

Science.gov (United States)

Prakash, Hariprasath; Rudramurthy, Shivaprakash Mandya; Gandham, Prasad S; Ghosh, Anup Kumar; Kumar, Milner M; Badapanda, Chandan; Chakrabarti, Arunaloke

2017-09-18

Apophysomyces species are prevalent in tropical countries and A. variabilis is the second most frequent agent causing mucormycosis in India. Among Apophysomyces species, A. elegans, A. trapeziformis and A. variabilis are commonly incriminated in human infections. The genome sequences of A. elegans and A. trapeziformis are available in public databases, but that of A. variabilis is not. We, therefore, performed whole genome sequencing of A. variabilis to explore its genomic structure and possible genes determining the virulence of the organism. The whole genome of A. variabilis NCCPF 102052 was sequenced and the genomic structure of A. variabilis was compared with already available genome structures of A. elegans, A. trapeziformis and other medically important Mucorales. The total size of the genome assembly of A. variabilis was 39.38 Mb with 12,764 protein-coding genes. Transposable elements (TEs) were few in the Apophysomyces genome and the retrotransposon Ty3-gypsy was the common TE. Phylogenetically, Apophysomyces species were grouped closely with Phycomyces blakesleeanus. OrthoMCL analysis revealed 3025 orthologous proteins, which were common to the three pathogenic Apophysomyces species. Expansion of multiple gene families/duplication was observed in Apophysomyces genomes. Approximately 6% of Apophysomyces genes were predicted to be associated with virulence on PHIbase analysis. The virulence determinants included the protein families of CotH proteins (invasins), proteases, iron utilisation pathways, siderophores and signal transduction pathways. Serine proteases were the major group of proteases found in all Apophysomyces genomes. The carbohydrate active enzymes (CAZymes) constitute the majority of the secretory proteins. The present study is the maiden attempt to sequence and analyze the genomic structure of A. variabilis. Together with available genome sequence of A. elegans and A. trapeziformis, the study helped to indicate the possible virulence determinants of
Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4.

Science.gov (United States)

Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

2013-07-01

Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of the VP1 gene, an unusual strain of HBoV (CMH-S011-11) was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was determined and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of the complete genome sequence revealed that the coding sequence from NS1 through NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of the NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted the initial genotyping result for the partial VP1 region in the previous study. In addition, comparison of the NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, the nucleotide sequence of the VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that CMH-S011-11 was grouped within the HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2.
Evolutionary relationships within the human rhinovirus genus: comparison of serotypes 89, 2, and 14

International Nuclear Information System (INIS)

Duechler, M.; Skern, T.; Sommergruber, W.; Neubauer, C.; Gruendler, P.; Fogy, I.; Blaas, D.; Kuechler, E.

1987-01-01

The complete nucleotide sequence of the genome of human rhinovirus type 89 was determined from the cDNA that had been cloned into Escherichia coli. The genome is 7152 nucleotides long and contains a single large open reading frame of 2164 codons. Translation commences at position 619 and ends 42 nucleotides before the poly(A) tract. The positions of three proteolytic cleavage sites in the polyprotein were determined by N-terminal amino acid sequencing of the capsid proteins; the remainder were predicted from comparisons with other picornaviruses. Extensive similarity between the derived amino acid sequences of human rhinovirus types 89 and 2 was found, whereas the similarity between human rhinovirus types 89 and 14 was considerably less. It is apparent that human rhinoviruses may be more closely related than has been previously thought.
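The coordinates given in this abstract are mutually consistent, as a short calculation shows. The sketch assumes 1-based nucleotide positions and that the 7152-nucleotide genome length excludes the poly(A) tract; both are assumptions, not statements from the abstract.

```python
# Values quoted in the abstract.
GENOME_NT = 7152        # genome length (assumed to exclude the poly(A) tract)
ORF_START = 619         # first nucleotide of the open reading frame (1-based)
ORF_CODONS = 2164       # codons in the open reading frame
GAP_BEFORE_POLYA = 42   # nucleotides between the end of translation and poly(A)

# Last nucleotide covered by 2164 codons starting at position 619.
orf_end = ORF_START + 3 * ORF_CODONS - 1

# Nucleotides remaining between the end of the ORF and the end of the genome.
trailing_nt = GENOME_NT - orf_end

print(orf_end, trailing_nt)
```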
COMPARISON BETWEEN MIXED INTEGER PROGRAMMING WITH HEURISTIC METHOD FOR JOB SHOP SCHEDULING WITH SEPARABLE SEQUENCE-DEPENDENT SETUPS

Directory of Open Access Journals (Sweden)

I Gede Agus Widyadana

2001-01-01

Choosing an appropriate tool for solving an industrial problem should take into account not only whether the tool reaches the optimal solution but also its computation time. One industrial problem for which it is still difficult to satisfy both criteria is scheduling. This paper compares mixed integer programming, which yields the optimal solution, with a heuristic method for solving the job shop scheduling problem with separable sequence-dependent setups. Test problems were generated, and the results show that the heuristic method still cannot attain the optimal solution.
Revision of Begomovirus taxonomy based on pairwise sequence comparisons

KAUST Repository

Brown, Judith K.; Zerbini, F. Murilo; Navas-Castillo, Jesús; Moriones, Enrique; Ramos-Sobrinho, Roberto; Silva, José C. F.; Fiallo-Olivé, Elvira; Briddon, Rob W.; Hernández-Zepeda, Cecilia; Idris, Ali; Malathi, V. G.; Martin, Darren P.; Rivera-Bustamante, Rafael; Ueda, Shigenori; Varsani, Arvind

2015-01-01

Viruses of the genus Begomovirus (family Geminiviridae) are emergent pathogens of crops throughout the tropical and subtropical regions of the world. By virtue of having a small DNA genome that is easily cloned, and due to the recent innovations in cloning and low-cost sequencing, there has been a dramatic increase in the number of available begomovirus genome sequences. Even so, most of the available sequences have been obtained from cultivated plants and are likely a small and phylogenetically unrepresentative sample of begomovirus diversity, a factor constraining taxonomic decisions such as the establishment of operationally useful species demarcation criteria. In addition, problems in assigning new viruses to established species have highlighted shortcomings in the previously recommended mechanism of species demarcation. Based on the analysis of 3,123 full-length begomovirus genome (or DNA-A component) sequences available in public databases as of December 2012, a set of revised guidelines for the classification and nomenclature of begomoviruses are proposed. The guidelines primarily consider a) genus-level biological characteristics and b) results obtained using a standardized classification tool, Sequence Demarcation Tool, which performs pairwise sequence alignments and identity calculations. These guidelines are consistent with the recently published recommendations for the genera Mastrevirus and Curtovirus of the family Geminiviridae. Genome-wide pairwise identities of 91 % and 94 % are proposed as the demarcation threshold for begomoviruses belonging to different species and strains, respectively. Procedures and guidelines are outlined for resolving conflicts that may arise when assigning species and strains to categories wherever the pairwise identity falls on or very near the demarcation threshold value.
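The proposed demarcation thresholds translate into a simple decision rule on genome-wide pairwise identity. The sketch below is illustrative and is not the Sequence Demarcation Tool: it assumes the two sequences are already aligned to equal length, computes identity over columns without gaps, and treats a value exactly on a threshold as belonging to the higher category. The abstract notes that borderline cases need separate conflict-resolution guidelines, which are not modelled here.

```python
SPECIES_THRESHOLD = 91.0  # percent identity, from the abstract
STRAIN_THRESHOLD = 94.0   # percent identity, from the abstract


def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def classify_pair(identity):
    """Relationship implied by a pairwise identity value.

    Assumption: values equal to a threshold fall in the higher category.
    """
    if identity < SPECIES_THRESHOLD:
        return "different species"
    if identity < STRAIN_THRESHOLD:
        return "same species, different strains"
    return "same strain"


# Hypothetical identity values.
for value in (88.0, 92.5, 97.0):
    print(value, classify_pair(value))
```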
Revision of Begomovirus taxonomy based on pairwise sequence comparisons

KAUST Repository

Brown, Judith K.

2015-04-18

Viruses of the genus Begomovirus (family Geminiviridae) are emergent pathogens of crops throughout the tropical and subtropical regions of the world. By virtue of having a small DNA genome that is easily cloned, and due to the recent innovations in cloning and low-cost sequencing, there has been a dramatic increase in the number of available begomovirus genome sequences. Even so, most of the available sequences have been obtained from cultivated plants and are likely a small and phylogenetically unrepresentative sample of begomovirus diversity, a factor constraining taxonomic decisions such as the establishment of operationally useful species demarcation criteria. In addition, problems in assigning new viruses to established species have highlighted shortcomings in the previously recommended mechanism of species demarcation. Based on the analysis of 3,123 full-length begomovirus genome (or DNA-A component) sequences available in public databases as of December 2012, a set of revised guidelines for the classification and nomenclature of begomoviruses are proposed. The guidelines primarily consider a) genus-level biological characteristics and b) results obtained using a standardized classification tool, Sequence Demarcation Tool, which performs pairwise sequence alignments and identity calculations. These guidelines are consistent with the recently published recommendations for the genera Mastrevirus and Curtovirus of the family Geminiviridae. Genome-wide pairwise identities of 91 % and 94 % are proposed as the demarcation threshold for begomoviruses belonging to different species and strains, respectively. Procedures and guidelines are outlined for resolving conflicts that may arise when assigning species and strains to categories wherever the pairwise identity falls on or very near the demarcation threshold value.
[Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

Science.gov (United States)

Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

2013-07-04

Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR amplification of the dnaA, murC and pyrG gene sequences, which were determined to assess their suitability as phylogenetic markers. We then estimated the genetic distances and constructed phylogenetic trees for the 16S rRNA gene and the three housekeeping genes, combining our data with the corresponding published sequences. The topologies of the three housekeeping-gene trees were consistent with that of the 16S rRNA gene tree. The homology among closely related Leu. citreum species differed between the dnaA, murC, pyrG and 16S rRNA gene sequences, ranging from 75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% to 100%, respectively. The phylogenetic relationships inferred from the three housekeeping genes were highly consistent with the results for the 16S rRNA gene, while the genetic distances of the housekeeping genes were much higher than those of the 16S rRNA gene. Consequently, the dnaA, murC and pyrG genes are suitable for the classification and identification of closely related Leu. citreum species.
AK SCO, FIRST DETECTION OF A HIGHLY DISTURBED ATMOSPHERE IN A PRE-MAIN-SEQUENCE CLOSE BINARY

International Nuclear Information System (INIS)

Gomez de Castro, Ana I.

2009-01-01

AK Sco is a unique source: a ∼10 Myr old pre-main-sequence (PMS) spectroscopic binary composed of two nearly equal F5 stars that at periastron are separated by barely 11 stellar radii, so the stellar magnetospheres fill the Roche lobe at periastron. The orbit is not yet circularized (e = 0.47) and very strong tides are expected. This makes AK Sco the ideal laboratory to study the effect of gravitational tides in the stellar magnetic field building up during PMS evolution. In this Letter, the detection of a highly disturbed (σ ≅ 100 km s^-1) and very dense atmosphere (n_e = 1.6 × 10^10 cm^-3) is reported. Significant line broadening blurs any signs of ion belts or bow shocks in the spectrum of the atmospheric plasma. The radiative losses cannot be accounted for solely by the dissipation of energy from the tidal wave propagating in the stellar atmosphere or by the accreting material. The release of internal energy from the star seems to be the most likely source of the plasma heating. This is the first clear indication of a highly disturbed atmosphere surrounding a PMS close binary.
AK Sco, First Detection of a Highly Disturbed Atmosphere in a Pre-Main-Sequence Close Binary

Science.gov (United States)

Gómez de Castro, Ana I.

2009-06-01

AK Sco is a unique source: a ~10 Myr old pre-main-sequence (PMS) spectroscopic binary composed of two nearly equal F5 stars that at periastron are separated by barely 11 stellar radii, so the stellar magnetospheres fill the Roche lobe at periastron. The orbit is not yet circularized (e = 0.47) and very strong tides are expected. This makes AK Sco the ideal laboratory to study the effect of gravitational tides in the stellar magnetic field building up during PMS evolution. In this Letter, the detection of a highly disturbed (σ ≃ 100 km s^-1) and very dense atmosphere (n_e = 1.6 × 10^10 cm^-3) is reported. Significant line broadening blurs any signs of ion belts or bow shocks in the spectrum of the atmospheric plasma. The radiative losses cannot be accounted for solely by the dissipation of energy from the tidal wave propagating in the stellar atmosphere or by the accreting material. The release of internal energy from the star seems to be the most likely source of the plasma heating. This is the first clear indication of a highly disturbed atmosphere surrounding a PMS close binary.
Management of High-Throughput DNA Sequencing Projects: Alpheus.

Science.gov (United States)

Miller, Neil A; Kingsmore, Stephen F; Farmer, Andrew; Langley, Raymond J; Mudge, Joann; Crow, John A; Gonzalez, Alvaro J; Schilkey, Faye D; Kim, Ryan J; van Velkinburgh, Jennifer; May, Gregory D; Black, C Forrest; Myers, M Kathy; Utsey, John P; Frost, Nicholas S; Sugarbaker, David J; Bueno, Raphael; Gullans, Stephen R; Baxter, Susan M; Day, Steve W; Retzel, Ernest F

2008-12-26

High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multi-gigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.
Phylogenetic characterization of Canine Parvovirus VP2 partial sequences from symptomatic dogs samples.

Science.gov (United States)

Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A

2016-01-01

The aim of the present study was to detect canine parvovirus (CPV) in faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR), followed by partial sequencing of the VP2 gene and molecular characterization of the strains circulating in Lithuania. Eleven dog faecal samples that were clinically and antigen-test positive, collected during 2014-2015, were investigated using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of the investigated Lithuanian CPV VP2 sequences were closely associated with the CPV-2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. Four of the eleven investigated Lithuanian dogs with CPV infection symptoms had been vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distant from the VP2 sequences of CPV vaccine strains (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences showed monophyletic relationships with geographically close samples, but five of them were rather divergent (1.0% less sequence similarity). One Lithuanian CPV VP2 sequence was closely related to the CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conserved and phylogenetically only distantly associated with the most commonly used CPV vaccine strains.
First report of a complete genome sequence for a begomovirus infecting Jatropha gossypifolia in the Americas.

Science.gov (United States)

Simmonds-Gordon, R N; Collins-Fairclough, A M; Stewart, C S; Roye, M E

2014-10-01

Jatropha gossypifolia is a weed that is commonly found with yellow mosaic symptoms growing along the roadside and in close proximity to cultivated crops in many farming communities in Jamaica. For the first time, the complete genome sequence of a new begomovirus, designated jatropha mosaic virus-[Jamaica:Spanish Town:2004] (JMV-[JM:ST:04]), was determined from field-infected J. gossypifolia in the western hemisphere. DNA-A nucleotide sequence comparisons showed closest identity (84 %) to two tobacco-infecting viruses from Cuba, tobacco mottle leaf curl virus-[Cuba:Sancti Spiritus:03] (TbMoLCV-[CU:SS:03]) and tobacco leaf curl Cuba virus-[Cuba:Taguasco:2005] (TbLCuCUV-[CU:Tag:05]), and two weed-infecting viruses from Cuba and Jamaica, Rhynchosia rugose golden mosaic virus-[Cuba:Camaguey:171:2009] (RhRGMV- [CU:Cam:171:09]) and Wissadula golden mosaic St. Thomas virus-[Jamaica:Albion:2005] (WGMSTV-[JM:Alb:05]). Phylogenetic analysis revealed that JMV-[JM:ST:04] is most closely related to tobacco and tomato viruses from Cuba and WGMSTV-[JM:Alb:05], a common malvaceous-weed-infecting virus from eastern Jamaica, and that it is distinct from begomoviruses infecting Jatropha species in India and Nigeria.
Closed-time path formalism of quantum scattering

International Nuclear Information System (INIS)

Manoukian, E.B.

1988-01-01

The closed-time path formalism of quantum mechanics, first introduced by Schwinger, is developed starting from a second-quantized formalism by using a functional calculus. An exact functional expression for the closed-time amplitude for a particle state (not just of the vacuum state) is derived, from which time-dependent expectation values of observables may be written in closed functional form. In particular, this leads directly to the expression for transition probabilities for scattering theory without first computing the corresponding amplitudes. Finally, a comparison is made with the standard approach.
Hardware Accelerated Sequence Alignment with Traceback

Directory of Open Access Journals (Sweden)

Scott Lloyd

2009-01-01

in a timely manner. Known methods to accelerate alignment on reconfigurable hardware only address sequence comparison, limit the sequence length, or exhibit memory and I/O bottlenecks. A space-efficient, global sequence alignment algorithm and architecture is presented that accelerates the forward scan and traceback in hardware without memory and I/O limitations. With 256 processing elements in FPGA technology, a performance gain over 300 times that of a desktop computer is demonstrated on sequence lengths of 16000. For greater performance, the architecture is scalable to more processing elements.
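For readers unfamiliar with the forward scan and traceback mentioned in this abstract, the following is a minimal software sketch of global alignment in the Needleman-Wunsch style. It is not the hardware architecture or the space-efficient algorithm of the paper: it stores the full score matrix, and the scoring values (match, mismatch, gap) and the function name are assumptions chosen for illustration.

```python
# Illustrative global alignment with traceback (full O(n*m) matrix, linear gaps).
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -2):
    n, m = len(a), len(b)

    def sub(i: int, j: int) -> int:
        # Substitution score for a[i-1] against b[j-1]
        return match if a[i - 1] == b[j - 1] else mismatch

    # Forward scan: fill the score matrix row by row.
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Traceback: walk from the bottom-right cell back to the origin.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(global_align("ACGT", "AGT"))
```

Storing the whole matrix is what makes the traceback simple here; it is also the memory cost that space-efficient designs aim to avoid.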
Microbial analysis of bite marks by sequence comparison of streptococcal DNA.

Directory of Open Access Journals (Sweden)

Darnell M Kennedy

Bite mark injuries often feature in violent crimes. Conventional morphometric methods for the forensic analysis of bite marks involve elements of subjective interpretation that threaten the credibility of this field. Human DNA recovered from bite marks has the highest evidentiary value; however, recovery can be compromised by salivary components. This study assessed the feasibility of matching bacterial DNA sequences amplified from experimental bite marks to those obtained from the teeth responsible, with the aim of evaluating the capability of three genomic regions of streptococcal DNA to discriminate between participant samples. Bite mark and teeth swabs were collected from 16 participants. Bacterial DNA was extracted to provide the template for PCR primers specific for streptococcal 16S ribosomal RNA (16S rRNA) gene, 16S-23S intergenic spacer (ITS) and RNA polymerase beta subunit (rpoB). High throughput sequencing (GS FLX 454), followed by stringent quality filtering, generated reads from bite marks for comparison to those generated from teeth samples. For all three regions, the greatest overlaps of identical reads were between bite mark samples and the corresponding teeth samples. The average proportions of reads identical between bite mark and corresponding teeth samples were 0.31, 0.41 and 0.31, and for non-corresponding samples were 0.11, 0.20 and 0.016, for 16S rRNA, ITS and rpoB, respectively. The probabilities of correctly distinguishing matching and non-matching teeth samples were 0.92 for ITS, 0.99 for 16S rRNA and 1.0 for rpoB. These findings strongly support the tenet that bacterial DNA amplified from bite marks and teeth can provide corroborating information in the identification of assailants.
Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis

Directory of Open Access Journals (Sweden)

Posch Stefan

2010-03-01

Background: One of the challenges of bioinformatics remains the recognition of short signal sequences in genomic DNA such as donor or acceptor splice sites, splicing enhancers or silencers, translation initiation sites, transcription start sites, transcription factor binding sites, nucleosome binding sites, miRNA binding sites, or insulator binding sites. During the last decade, a wealth of algorithms for the recognition of such DNA sequences has been developed and compared with the goal of improving their performance and deepening our understanding of the underlying cellular processes. Most of these algorithms are based on statistical models belonging to the family of Markov random fields such as position weight matrix models, weight array matrix models, Markov models of higher order, or moral Bayesian networks. While in many comparative studies different learning principles or different statistical models have been compared, the influence of choosing different prior distributions for the model parameters when using different learning principles has been overlooked, possibly leading to questionable conclusions. Results: With the goal of allowing direct comparisons of different learning principles for models from the family of Markov random fields based on the same a-priori information, we derive a generalization of the commonly-used product-Dirichlet prior. We find that the derived prior behaves like a Gaussian prior close to the maximum and like a Laplace prior in the far tails. In two case studies, we illustrate the utility of the derived prior for a direct comparison of different learning principles with different models for the recognition of binding sites of the transcription factor Sp1 and human donor splice sites. Conclusions: We find that comparisons of different learning principles using the same a-priori information can lead to conclusions different from those of previous studies in which the effect resulting from different
Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods

DEFF Research Database (Denmark)

Bohlin, Jon; Snipen, Lars; Cloeckaert, Axel

2010-01-01

BACKGROUND: Classification of bacteria within the genus Brucella has been difficult due in part to considerable genomic homogeneity between the different species and biovars, in spite of clear differences in phenotypes. Therefore, many different methods have been used to assess Brucella taxonomy....... In the current work, we examine 32 sequenced genomes from genus Brucella representing the six classical species, as well as more recently described species, using bioinformatical methods. Comparisons were made at the level of genomic DNA using oligonucleotide based methods (Markov chain based genomic signatures...... between the oligonucleotide based methods used. Whilst the Markov chain based genomic signatures grouped the different species in genus Brucella according to host preference, the codon and amino acid frequencies based methods reflected small differences between the Brucella species. Only minor differences...
The complete nucleotide sequence of the barley yellow dwarf GPV isolate from China shows that it is a new member of the genus Polerovirus.

Science.gov (United States)

Zhang, Wenwei; Cheng, Zhuomin; Xu, Lei; Wu, Maosen; Waterhouse, Peter; Zhou, Guanghe; Li, Shifang

2009-01-01

The complete nucleotide sequence of the ssRNA genome of a Chinese GPV isolate of barley yellow dwarf virus (BYDV) was determined. It comprised 5673 nucleotides, and the deduced genome organization resembled that of members of the genus Polerovirus. It was most closely related to cereal yellow dwarf virus-RPV (77% nt identity over the entire genome; coat protein amino acid identity 79%). The GPV isolate also differs in vector specificity from other BYDV strains. Biological properties, phylogenetic analyses and detailed sequence comparisons suggest that GPV should be considered a member of a new species within the genus, and the name Wheat yellow dwarf virus-GPV is proposed.

Triangular fibrocartilage lesions: comparison STIR sequence versus arthroscopy findings

International Nuclear Information System (INIS)

Wang Zhi; Meng Xianghong; Wang Linsen; Suo Yongmei

2013-01-01

Objective: To explore the diagnostic value of short TI inversion recovery (STIR) sequence in evaluating triangular fibrocartilage (TFC) lesions, and to compare the findings with the arthroscopy findings. Materials and Methods: Wrist joint MR examination using STIR sequence and arthroscopy were performed in 56 patients with TFC lesions. The parameters of STIR sequence were: TR: 1164 ms, TE: 16 ms, and TI: 90 ms. The sensitivity, specificity, positive predictive value, negative predictive value, and accuracy in the diagnosis of TFC lesions with STIR sequence were calculated, using arthroscopy as the standard. Results: (1) STIR manifested 10 patients with normal TFC; 6 with small edema or mucous degeneration in the body portion but not involving joint surface edge; 6 with horizontal avulsion in the body portion, but not involving joint surface edge; 6 with avulsion involving joint surface edge; 11 with perforation in central portion; 6 with avulsion in radial attached end; 5 with avulsion in ulnar attached end; 3 with avulsion in both radial and ulnar attached ends; 3 with irregular shape and thin on the whole TFC. (2) Arthroscopy manifested 21 patients with normal TFC; 8 with avulsion involving joint surface edge; 10 with perforation in central portion; 7 with avulsion in radial attached end; 5 with avulsion in ulnar attached end; 2 with avulsion in both radial and ulnar attached ends; 3 with irregular shape on the whole TFC. Using STIR sequence, the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy were 85.7%, 23.8%, 65.2%, 50%, and 62.5%, respectively, in detection of TFC lesions, with arthroscopy as the standard. Conclusion: STIR sequence has high diagnostic value in detection of TFC lesions. (authors)
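The five reported figures follow from a 2x2 table of STIR result against arthroscopy result. The abstract does not state the cell counts; the values used below (30 true positives, 16 false positives, 5 false negatives, 5 true negatives) are an inference from the stated totals (56 patients, 21 normal on arthroscopy, 10 normal on STIR) and the reported percentages, and should be read as an assumption. The function name is hypothetical.

```python
# Diagnostic accuracy measures from a 2x2 table (reference standard: arthroscopy).
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),  # lesions on arthroscopy that STIR detected
        "specificity": tn / (tn + fp),  # normal on arthroscopy that STIR called normal
        "ppv": tp / (tp + fp),          # STIR-positive cases confirmed by arthroscopy
        "npv": tn / (tn + fn),          # STIR-negative cases confirmed by arthroscopy
        "accuracy": (tp + tn) / total,
    }


if __name__ == "__main__":
    # Inferred counts (assumption): 35 lesions and 21 normal on arthroscopy,
    # 46 abnormal and 10 normal on STIR.
    metrics = diagnostic_metrics(tp=30, fp=16, fn=5, tn=5)
    for name, value in metrics.items():
        print(f"{name}: {100 * value:.1f}%")
```

Under these inferred counts, 16 of the 21 patients with a normal TFC on arthroscopy were read as abnormal on STIR, which is what the low specificity reflects.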
Accuracy of microbial community diversity estimated by closed- and open-reference OTUs

Directory of Open Access Journals (Sweden)

Robert C. Edgar

2017-10-01

Next-generation sequencing of 16S ribosomal RNA is widely used to survey microbial communities. Sequences are typically assigned to Operational Taxonomic Units (OTUs). Closed- and open-reference OTU assignment matches reads to a reference database at 97% identity (closed), then clusters unmatched reads using a de novo method (open). Implementations of these methods in the QIIME package were tested on several mock community datasets with 20 strains using different sequencing technologies and primers. Richness (number of reported OTUs) was often greatly exaggerated, with hundreds or thousands of OTUs generated on Illumina datasets. Between-sample diversity was also found to be highly exaggerated in many cases, with weighted Jaccard distances between identical mock samples often close to one, indicating very low similarity. Non-overlapping hyper-variable regions in 70% of species were assigned to different OTUs. On mock communities with Illumina V4 reads, 56% to 88% of predicted genus names were false positives. Biological inferences obtained using these methods are therefore not reliable.
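To make the between-sample result concrete, here is a small sketch of one common definition of the weighted Jaccard distance between two OTU abundance tables: one minus the ratio of summed minima to summed maxima. Whether the study used exactly this formula is an assumption, and the OTU labels and counts are invented for illustration.

```python
# Weighted Jaccard distance between two samples given as {OTU id: read count}.
def weighted_jaccard_distance(x: dict, y: dict) -> float:
    otus = set(x) | set(y)
    shared = sum(min(x.get(o, 0), y.get(o, 0)) for o in otus)
    union = sum(max(x.get(o, 0), y.get(o, 0)) for o in otus)
    if union == 0:
        return 0.0  # two empty samples are treated as identical (assumed convention)
    return 1.0 - shared / union


if __name__ == "__main__":
    # Hypothetical replicates of one mock community with consistent OTU calls.
    rep1 = {"otu_a": 50, "otu_b": 50}
    rep2 = {"otu_a": 50, "otu_b": 50}
    print(weighted_jaccard_distance(rep1, rep2))

    # Same community, but reads of each strain land in different spurious OTUs.
    rep1_split = {"otu_a1": 50, "otu_b1": 50}
    rep2_split = {"otu_a2": 50, "otu_b2": 50}
    print(weighted_jaccard_distance(rep1_split, rep2_split))
```

The second pair shows how inconsistent OTU assignment alone can push the distance between identical samples toward one.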
Dibenzotetraaza[14]annulene-adenine conjugate recognizes complementary poly dT among ss-DNA/ss-RNA sequences.

Science.gov (United States)

Radić Stojković, Marijana; Škugor, Marko; Tomić, Sanja; Grabar, Marina; Smrečki, Vilko; Dudek, Łukasz; Grolik, Jarosław; Eilmes, Julita; Piantanida, Ivo

2013-06-28

Among three novel DBTAA derivatives only the DBTAA-propyl-adenine conjugate showed recognition of the consecutive oligo dT sequence by increased affinity and specific induced chirooptical response in comparison to other single stranded RNA and DNA; whereby of particular importance is the up until now unique efficient differentiation between dT and rU. At variance, its close analogue DBTAA-hexyl-adenine did not reveal any selectivity between ss-DNA/RNA pointing out the important role of steric factors (linker length); moreover non-selectivity of the reference compound (, lacking adenine) stressed the importance of adenine interactions in the selectivity.
Complete Genome Sequences of Isolates of Enterococcus faecium Sequence Type 117, a Globally Disseminated Multidrug-Resistant Clone

Science.gov (United States)

Tedim, Ana P.; Lanza, Val F.; Manrique, Marina; Pareja, Eduardo; Ruiz-Garbajosa, Patricia; Cantón, Rafael; Baquero, Fernando; Tobes, Raquel

2017-01-01

ABSTRACT The emergence of nosocomial infections by multidrug-resistant sequence type 117 (ST117) Enterococcus faecium has been reported in several European countries. ST117 has been detected in Spanish hospitals as one of the main causes of bloodstream infections. We analyzed genome variations of ST117 strains isolated in Madrid and describe the first ST117 closed genome sequences. PMID:28360174
Equid herpesvirus 8: Complete genome sequence and association with abortion in mares

Science.gov (United States)

Garvey, Marie; Suárez, Nicolás M.; Kerr, Karen; Hector, Ralph; Moloney-Quinn, Laura; Arkins, Sean; Davison, Andrew J.

2018-01-01

Equid herpesvirus 8 (EHV-8), formerly known as asinine herpesvirus 3, is an alphaherpesvirus that is closely related to equid herpesviruses 1 and 9 (EHV-1 and EHV-9). The pathogenesis of EHV-8 is relatively little studied and to date has only been associated with respiratory disease in donkeys in Australia and horses in China. A single EHV-8 genome sequence has been generated for strain Wh in China, but is apparently incomplete and contains frameshifts in two genes. In this study, the complete genome sequences of four EHV-8 strains isolated in Ireland between 2003 and 2015 were determined by Illumina sequencing. Two of these strains were isolated from cases of abortion in horses, and were misdiagnosed initially as EHV-1, and two were isolated from donkeys, one with neurological disease. The four genome sequences are very similar to each other, exhibiting greater than 98.4% nucleotide identity, and their phylogenetic clustering together demonstrated that genomic diversity is not dependent on the host. Comparative genomic analysis revealed 24 of the 76 predicted protein sequences are completely conserved among the Irish EHV-8 strains. Evolutionary comparisons indicate that EHV-8 is phylogenetically closer to EHV-9 than it is to EHV-1. In summary, the first complete genome sequences of EHV-8 isolates from two host species over a twelve year period are reported. The current study suggests that EHV-8 can cause abortion in horses. The potential threat of EHV-8 to the horse industry and the possibility that donkeys may act as reservoirs of infection warrant further investigation. PMID:29414990
Molecular phylogeny of Toxoplasmatinae: comparison between inferences based on mitochondrial and apicoplast genetic sequences

Directory of Open Access Journals (Sweden)

Michelle Klein Sercundes

2016-03-01

Phylogenies within Toxoplasmatinae have been widely investigated with different molecular markers. Here, we studied molecular phylogenies of the Toxoplasmatinae subfamily based on apicoplast and mitochondrial genes. Partial sequences of apicoplast genes coding for caseinolytic protease (clpC) and beta subunit of RNA polymerase (rpoB), and mitochondrial gene coding for cytochrome B (cytB) were analyzed. Laboratory-adapted strains of the closely related parasites Sarcocystis falcatula and Sarcocystis neurona were investigated, along with Neospora caninum, Neospora hughesi, Toxoplasma gondii (strains RH, CTG and PTG), Besnoitia akodoni, Hammondia hammondi and two genetically divergent lineages of Hammondia heydorni. The molecular analysis based on organellar genes did not clearly differentiate between N. caninum and N. hughesi, but the two lineages of H. heydorni were confirmed. Slight differences between the strains of S. falcatula and S. neurona were encountered in all markers. In conclusion, congruent phylogenies were inferred from the three different genes and they might be used for screening undescribed sarcocystid parasites in order to ascertain their phylogenetic relationships with organisms of the family Sarcocystidae. The evolutionary studies based on organellar genes confirm that the genus Hammondia is paraphyletic. The primers used for amplification of clpC and rpoB were able to amplify genetic sequences of organisms of the genus Sarcocystis and organisms of the subfamily Toxoplasmatinae as well.
OTU analysis using metagenomic shotgun sequencing data.

Directory of Open Access Journals (Sweden)

Xiaolin Hao

Because of technological limitations, the primer and amplification biases in targeted sequencing of 16S rRNA genes have veiled the true microbial diversity underlying environmental samples. However, the protocol of metagenomic shotgun sequencing provides 16S rRNA gene fragment data with natural immunity against the biases raised during priming and thus the potential of uncovering the true structure of microbial community by giving more accurate predictions of operational taxonomic units (OTUs). Nonetheless, the lack of statistically rigorous comparison between 16S rRNA gene fragments and other data types makes it difficult to interpret previously reported results using 16S rRNA gene fragments. Therefore, in the present work, we established a standard analysis pipeline that would help confirm if the differences in the data are true or are just due to potential technical bias. This pipeline is built by using simulated data to find optimal mapping and OTU prediction methods. The comparison between simulated datasets revealed that a 16S rRNA gene fragment longer than 150 bp provides the same accuracy as a full-length 16S rRNA sequence when using our proposed pipeline, which could serve as a good starting point for experimental design and make the comparison between 16S rRNA gene fragment-based and targeted 16S rRNA sequencing-based surveys possible.
Pornography use and closeness with others in women

Directory of Open Access Journals (Sweden)

Popović Miodrag

2011-01-01

Introduction. Closeness/intimacy and pornography are sometimes linked and frequently presented as competing with each other. They have been the subject of some research but many issues in the area remain controversial and indeterminate. Objective. The aim of this pilot study was to establish whether female pornography users and non-users’ ratings in terms of socio-emotional closeness differed, i.e. to examine the association between pornography use and aspects of socio-emotional closeness in a non-clinical sample of females. Methods. Sixty-six females participated in the study. Their actual and ideal socio-emotional closeness was measured by the Perceived Interpersonal Closeness Scale (PICS), while their pornography use was examined by the Background and Pornography Use Information Questionnaire. Potential links between the two variables and comparisons with the relevant results obtained by males are presented. Results. The results showed that there were no significant differences between self-reported female pornography users and non-users in terms of total closeness numbers and scores and also in specific socio-emotional closeness with the most significant adults in their lives (i.e., partners, closest friends, mothers and fathers). Conclusion. The results confirmed that there were differences between females and males’ approaches to pornography and closeness; females had lower interest in pornography and their use of it had not been associated with higher total closeness numbers and scores. Due to the limited size of the participant group (N), this sample was used for preliminary investigations that would enable some elementary insight into females’ relevant behaviours. Further investigations of pornography’s complex links with socio-emotional and sexual closeness on larger samples may allow more reliable comparisons between gender and pornography user groups.
Information decomposition method to analyze symbolical sequences

International Nuclear Information System (INIS)

Korotkov, E.V.; Korotkova, M.A.; Kudryashov, N.A.

2003-01-01

The information decomposition (ID) method to analyze symbolical sequences is presented. This method allows us to reveal a latent periodicity of any symbolical sequence. The ID method is shown to have advantages in comparison with application of the Fourier transformation, the wavelet transform and the dynamic programming method to look for latent periodicity. Examples of the latent periods for poetic texts, DNA sequences and amino acids are presented. The possible origin of latent periodicity in different symbolical sequences is discussed.
Estimates of statistical significance for comparison of individual positions in multiple sequence alignments

Directory of Open Access Journals (Sweden)

Sadreyev Ruslan I

2004-08-01

Background: Profile-based analysis of multiple sequence alignments (MSA) allows for accurate comparison of protein families. Here, we address the problems of detecting statistically confident dissimilarities between (1) an MSA position and a set of predicted residue frequencies, and (2) two MSA positions. These problems are important for (i) evaluation and optimization of methods predicting residue occurrence at protein positions; (ii) detection of potentially misaligned regions in automatically produced alignments and their further refinement; and (iii) detection of sites that determine functional or structural specificity in two related families. Results: For problems (1) and (2), we propose analytical estimates of P-value and apply them to the detection of significant positional dissimilarities in various experimental situations. (a) We compare structure-based predictions of residue propensities at a protein position to the actual residue frequencies in the MSA of homologs. (b) We evaluate our method by the ability to detect erroneous position matches produced by an automatic sequence aligner. (c) We compare MSA positions that correspond to residues aligned by automatic structure aligners. (d) We compare MSA positions that are aligned by high-quality manual superposition of structures. Detected dissimilarities reveal shortcomings of the automatic methods for residue frequency prediction and alignment construction. For the high-quality structural alignments, the dissimilarities suggest sites of potential functional or structural importance. Conclusion: The proposed computational method is of significant potential value for the analysis of protein families.
Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

Science.gov (United States)

Kimura, M; Kimura, J; Hatakeyama, T

1988-11-21

The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
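The percentages of identical residues quoted in the abstract above come from pairwise comparisons of aligned sequences. A minimal sketch of such a calculation follows. The function name, the toy sequences, and the choice to count only columns where neither sequence has a gap are assumptions made for illustration; the abstract does not say which denominator the authors used.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over columns where neither sequence has a gap.

    Assumption: gapped columns are excluded from the denominator. Other
    conventions (e.g. dividing by the shorter sequence length) give
    different numbers for the same alignment.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns containing a gap
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Toy, made-up aligned fragments (not real S11 or S19 sequences).
toy_a = "MKT-AYIAKQR"
toy_b = "MKSLAYLAK-R"
print(round(percent_identity(toy_a, toy_b), 1))
```

The toy pair has nine ungapped columns, seven of them identical, so the denominator convention alone decides whether a borderline pair lands just above or below a reported threshold.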
Reconstruction of putative DNA virus from endogenous rice tungro bacilliform virus-like sequences in the rice genome: implications for integration and evolution

Directory of Open Access Journals (Sweden)

Kishima Yuji

2004-10-01

Background: Plant genomes contain various kinds of repetitive sequences such as transposable elements, microsatellites, tandem repeats and virus-like sequences. Most of them, with the exception of virus-like sequences, do not allow us to trace their origins or to follow the process of their integration into the host genome. Recent discoveries of virus-like sequences in plant genomes led us to set the objective of elucidating the origin of the repetitive sequences. Endogenous rice tungro bacilliform virus (RTBV)-like sequences (ERTBVs) have been found throughout the rice genome. Here, we reconstructed putative virus structures from RTBV-like sequences in the rice genome and characterized them to understand their evolutionary implications, their manner of integration, and the involvement of endogenous virus segments in the corresponding disease response. Results: We have collected ERTBVs from the rice genomes. They contain rearranged structures and no intact ORFs. The identified ERTBV segments were shown to be phylogenetically divided into three clusters. For each phylogenetic cluster, we were able to make a consensus alignment for a circular virus-like structure carrying two complete ORFs. Comparisons of DNA and amino acid sequences suggested a close relationship between ERTBV and RTBV. The Oryza AA-genome species vary in ERTBV copy number. Species carrying a low copy number of ERTBV segments have been reported to be extremely susceptible to RTBV. The DNA methylation state of the ERTBV sequences was correlated with their copy number in the genome. Conclusions: These ERTBV segments are unlikely to have functional potential as a virus. However, these sequences made it possible to reconstruct a putative virus, which provided information on virus integration and on the evolutionary relationship with the existing virus. Comparison of ERTBVs among the Oryza AA-genome species allowed us to speculate on a possible role of endogenous virus segments against the related disease.
Comparison of 61 Sequenced Escherichia coli Genomes

DEFF Research Database (Denmark)

Lukjancenko, Oksana; Wassenaar, T. M.; Ussery, David

2010-01-01

Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics...
The SWISS-PROT protein sequence data bank: current status.

OpenAIRE

Bairoch, A; Boeckmann, B

1994-01-01

SWISS-PROT is an annotated protein sequence database established in 1986 and maintained collaboratively, since 1988, by the Department of Medical Biochemistry of the University of Geneva and the EMBL Data Library. The SWISS-PROT protein sequence data bank consists of sequence entries. Sequence entries are composed of different line types, each with its own format. For standardization purposes the format of SWISS-PROT follows as closely as possible that of the EMBL Nucleotide Sequence Databa...
Gold nanoparticle-assisted primer walking for closing the human chromosomal gap

DEFF Research Database (Denmark)

Li, H; Shi, B; Li, X

2013-01-01

The finished sequence of the human genome still contains 260 euchromatic gaps. All the PCR-based genome walking techniques used to close gaps have common limitations, such as low efficiency and low specificity. We herein describe a strategy to solve this problem by employing gold nanoparticles (AuNPs) to improve the efficiency in primer walking amplification. We used this strategy to close a gap in human chromosome 5 containing a DNA stretch composed of the 12SAT repeat. The obtained gap sequence is highly conserved among several mammalian genomes. The demonstrated AuNP-assisted primer walking strategy...
The optimal design of stepped wedge trials with equal allocation to sequences and a comparison to other trial designs.

Science.gov (United States)

Thompson, Jennifer A; Fielding, Katherine; Hargreaves, James; Copas, Andrew

2017-12-01

Background/Aims We sought to optimise the design of stepped wedge trials with an equal allocation of clusters to sequences and explored sample size comparisons with alternative trial designs. Methods We developed a new expression for the design effect for a stepped wedge trial, assuming that observations are equally correlated within clusters and an equal number of observations in each period between sequences switching to the intervention. We minimised the design effect with respect to (1) the fraction of observations before the first and after the final sequence switches (the periods with all clusters in the control or intervention condition, respectively) and (2) the number of sequences. We compared the design effect of this optimised stepped wedge trial to the design effects of a parallel cluster-randomised trial, a cluster-randomised trial with baseline observations, and a hybrid trial design (a mixture of cluster-randomised trial and stepped wedge trial) with the same total cluster size for all designs. Results We found that a stepped wedge trial with an equal allocation to sequences is optimised by obtaining all observations after the first sequence switches and before the final sequence switches to the intervention; this means that the first sequence remains in the control condition and the last sequence remains in the intervention condition for the duration of the trial. With this design, the optimal number of sequences is [Formula: see text], where [Formula: see text] is the cluster-mean correlation, [Formula: see text] is the intracluster correlation coefficient, and m is the total cluster size. The optimal number of sequences is small when the intracluster correlation coefficient and cluster size are small and large when the intracluster correlation coefficient or cluster size is large. A cluster-randomised trial remains more efficient than the optimised stepped wedge trial when the intracluster correlation coefficient or cluster size is small. A
AlignMe—a membrane protein sequence alignment web server

Science.gov (United States)

Stamm, Marcus; Staritzbichler, René; Khafizov, Kamil; Forrest, Lucy R.

2014-01-01

We present a web server for pair-wise alignment of membrane protein sequences, using the program AlignMe. The server makes available two operational modes of AlignMe: (i) sequence to sequence alignment, taking two sequences in fasta format as input, combining information about each sequence from multiple sources and producing a pair-wise alignment (PW mode); and (ii) alignment of two multiple sequence alignments to create family-averaged hydropathy profile alignments (HP mode). For the PW sequence alignment mode, four different optimized parameter sets are provided, each suited to pairs of sequences with a specific similarity level. These settings utilize different types of inputs: (position-specific) substitution matrices, secondary structure predictions and transmembrane propensities from transmembrane predictions or hydrophobicity scales. In the second (HP) mode, each input multiple sequence alignment is converted into a hydrophobicity profile averaged over the provided set of sequence homologs; the two profiles are then aligned. The HP mode enables qualitative comparison of transmembrane topologies (and therefore potentially of 3D folds) of two membrane proteins, which can be useful if the proteins have low sequence similarity. In summary, the AlignMe web server provides user-friendly access to a set of tools for analysis and comparison of membrane protein sequences. Access is available at http://www.bioinfo.mpg.de/AlignMe PMID:24753425
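The HP mode described in the abstract above converts each multiple sequence alignment into a hydrophobicity profile averaged over the homologs. A minimal sketch of that averaging step follows. It assumes the Kyte-Doolittle hydropathy scale, a simple sliding-window smoother, and hypothetical function names; the scale, window and gap handling actually used by AlignMe are not stated in the abstract.

```python
# Kyte-Doolittle hydropathy values (assumed scale; AlignMe may use another).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def column_hydropathy(msa, scale=KYTE_DOOLITTLE, gap="-"):
    """Average hydropathy of each alignment column, ignoring gaps and unknown residues."""
    length = len(msa[0])
    if any(len(seq) != length for seq in msa):
        raise ValueError("all aligned sequences must have equal length")
    profile = []
    for col in range(length):
        values = [scale[seq[col]] for seq in msa
                  if seq[col] != gap and seq[col] in scale]
        # Columns with no scorable residue get a neutral value (an assumption).
        profile.append(sum(values) / len(values) if values else 0.0)
    return profile


def smooth(profile, window=3):
    """Sliding-window mean; the window is truncated at the sequence ends."""
    half = window // 2
    smoothed = []
    for i in range(len(profile)):
        lo = max(0, i - half)
        hi = min(len(profile), i + half + 1)
        smoothed.append(sum(profile[lo:hi]) / (hi - lo))
    return smoothed


# Toy alignment of two made-up fragments.
toy_msa = ["AL-", "AIV"]
print(smooth(column_hydropathy(toy_msa)))
```

Two such family-averaged profiles could then be aligned against each other, which is the step the abstract describes as enabling comparison of transmembrane topologies when sequence similarity is low.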
Quantitative diffusion characteristics of the human brain depend on MRI sequence parameters

International Nuclear Information System (INIS)

Wilson, M.; Blumhardt, L.D.; Morgan, P.S.

2002-01-01

Quantitative diffusion-weighted MRI has been applied to the study of neurological diseases, including multiple sclerosis, where the molecular self-diffusion coefficient D has been measured in both lesions and normal-appearing white matter. Histograms of D have been used as a novel measure of the "lesion load", with potential applications that include the monitoring of efficacy in new treatment trials. However, different ways of measuring D may affect its value, making comparison between different centres and research groups impossible. We aimed to assess the effect, if any, of using two different MRI sequences on the value of D. We studied 13 healthy volunteers, using two different quantitative diffusion sequences (including different b_max values and gradient applications). Maps of D were analysed using both regions of interest (ROI) in white matter and "whole brain" histograms, and compared between the two sequences. In addition, we studied three standardised test liquids (with known values of D) using both sequences. Histograms from the two sequences had different distributions, with a greater spread and higher peak position from the sequence with lower b_max. This greater spread of D was also evident in the white matter and test liquid ROI. "Limits of agreement" analysis demonstrated that the differences could be clinically relevant, despite significant correlations between the sequences obtained using simple rank methods. We conclude that different quantitative diffusion sequences are unlikely to produce directly comparable values of D, particularly if different b_max values are used. In addition, the use of inappropriate statistical tests may give false impressions of close agreement. Standardisation of methods for the measurement of D is required if these techniques are to become useful tools, for example in monitoring changes in the disease burden of multiple sclerosis. (orig.)
Quantitative diffusion characteristics of the human brain depend on MRI sequence parameters

Energy Technology Data Exchange (ETDEWEB)

Wilson, M.; Blumhardt, L.D. [University of Nottingham, Department of Neurology, Royal Preston Hospital, Preston (United Kingdom); Morgan, P.S. [Division of Academic Radiology, Queens Medical Centre, Nottingham (United Kingdom)

2002-07-01

Quantitative diffusion-weighted MRI has been applied to the study of neurological diseases, including multiple sclerosis, where the molecular self-diffusion coefficient D has been measured in both lesions and normal-appearing white matter. Histograms of D have been used as a novel measure of the "lesion load", with potential applications that include the monitoring of efficacy in new treatment trials. However, different ways of measuring D may affect its value, making comparison between different centres and research groups impossible. We aimed to assess the effect, if any, of using two different MRI sequences on the value of D. We studied 13 healthy volunteers, using two different quantitative diffusion sequences (including different b_max values and gradient applications). Maps of D were analysed using both regions of interest (ROI) in white matter and "whole brain" histograms, and compared between the two sequences. In addition, we studied three standardised test liquids (with known values of D) using both sequences. Histograms from the two sequences had different distributions, with a greater spread and higher peak position from the sequence with lower b_max. This greater spread of D was also evident in the white matter and test liquid ROI. "Limits of agreement" analysis demonstrated that the differences could be clinically relevant, despite significant correlations between the sequences obtained using simple rank methods. We conclude that different quantitative diffusion sequences are unlikely to produce directly comparable values of D, particularly if different b_max values are used. In addition, the use of inappropriate statistical tests may give false impressions of close agreement. Standardisation of methods for the measurement of D is required if these techniques are to become useful tools, for example in monitoring changes in the disease burden of multiple sclerosis. (orig.)
A comparison of 454 sequencing and clonal sequencing for the characterization of hepatitis C virus NS3 variants

NARCIS (Netherlands)

Ho, Cynthia K. Y.; Welkers, Matthijs R. A.; Thomas, Xiomara V.; Sullivan, James C.; Kieffer, Tara L.; Reesink, Henk W.; Rebers, Sjoerd P. H.; de Jong, Menno D.; Schinkel, Janke; Molenkamp, Richard

2015-01-01

We compared 454 amplicon sequencing with clonal sequencing for the characterization of intra-host hepatitis C virus (HCV) NS3 variants. Clonal and 454 sequences were obtained from 12 patients enrolled in a clinical phase I study for telaprevir, an NS3-4a protease inhibitor. Thirty-nine datasets were

Short read sequence typing (SRST): multi-locus sequence types from short reads

Directory of Open Access Journals (Sweden)

Inouye Michael

2012-07-01

Background: Multi-locus sequence typing (MLST) has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven) to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However, given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. Results: We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. Conclusions: SRST is a novel
MR-based attenuation correction in brain PET based on UTE sequences

Energy Technology Data Exchange (ETDEWEB)

Cabello, Jorge; Nekolla, Stephan G; Ziegler, Sibylle I [Department of Nuclear Medicine, Klinikum rechts der Isar, Technische Universität München (Germany)

2014-07-29

Attenuation correction (AC) in brain PET/MR has recently emerged as one of the challenging tasks in the PET/MR field. It has been shown that ignoring the attenuation produced by bone can lead to errors ranging from 5-30% in regions close to bone structures. Since the information provided by the MR signal is not directly related to tissue attenuation, alternative methods have to be developed. Signal from bone tissue is difficult to measure given its short transverse relaxation time (T2). Ultrashort-echo time (UTE) pulse sequences were developed to measure signal from tissues with short T2. A combination of two consecutive UTE echoes has been used in several works to measure signal from bone tissue. The first echo is able to measure signal from bone tissue in addition to soft tissue, while the second echo contains most of the soft tissue signal contained in the first echo but not bone. In this work we extract the attenuation information from the difference between the logarithms of two images obtained after applying two consecutive UTE pulse sequences using the mMR scanner (Siemens Healthcare). Subsequently, image processing techniques are applied to reduce the noise and extract air cavities within the head. The resulting image is converted to linear attenuation coefficients, generating what is known as a µ-map, to be used during reconstruction. For comparison purposes PET/CT scans of the same patients were acquired prior to the PET/MR scan. Additional µ-maps obtained for comparison were extracted from a Dixon sequence (used in clinical routine) and an additional µ-map calculated by the scanner based on UTE pulse sequences. Preliminary quantitative results measured in the cerebellum, using the value obtained with CT-based AC as reference, show differences of 34% without AC, 13% using the Dixon-based and UTE-based µ-maps provided by the scanner, and 0.8% with the AC strategy presented here.
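The abstract above derives attenuation information from the difference between the logarithms of two consecutive UTE echo images, then converts the result to linear attenuation coefficients. A minimal voxel-wise sketch of that idea follows. The thresholds, the three-class assignment (air, soft tissue, bone) and the coefficient values are assumptions for illustration; the noise reduction, air-cavity extraction and conversion actually used by the authors are not described in the abstract.

```python
import math

# Assumed linear attenuation coefficients at 511 keV, in cm^-1.
MU_AIR = 0.0
MU_SOFT_TISSUE = 0.096   # approximately that of water
MU_BONE = 0.151          # illustrative value; published choices vary


def log_difference(echo1: float, echo2: float, eps: float = 1e-6) -> float:
    """log(echo1) - log(echo2); large where signal decays quickly (short T2)."""
    return math.log(max(echo1, eps)) - math.log(max(echo2, eps))


def voxel_mu(echo1: float, echo2: float,
             air_threshold: float = 50.0,
             bone_threshold: float = 0.5) -> float:
    """Assign one attenuation coefficient to a voxel.

    Both thresholds are hypothetical: a low first-echo signal is treated as
    air, and a large log-difference between the echoes is treated as bone.
    """
    if echo1 < air_threshold:
        return MU_AIR
    if log_difference(echo1, echo2) > bone_threshold:
        return MU_BONE
    return MU_SOFT_TISSUE


def mu_map(first_echo, second_echo):
    """Build a 2D mu-map from two equally sized 2D echo images (lists of rows)."""
    return [[voxel_mu(a, b) for a, b in zip(row1, row2)]
            for row1, row2 in zip(first_echo, second_echo)]


# Toy 2x2 images with made-up intensities.
toy_echo1 = [[10.0, 400.0], [300.0, 20.0]]
toy_echo2 = [[8.0, 380.0], [100.0, 15.0]]
print(mu_map(toy_echo1, toy_echo2))
```

A hard three-class segmentation is the simplest possible mapping; a continuous mapping from the log-difference to bone density would be another option, but nothing in the abstract indicates which the authors chose.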
Closed N-shell alkali spectra

International Nuclear Information System (INIS)

Ellis, D.G.; Curtis, L.J.

1982-01-01

Term values and ionization potentials have been calculated for several ions in the promethium (N = 61) isoelectronic sequence. As the nuclear charge is increased, the ground configuration changes from 4f^13 5s^2 to 4f^14 5s, giving the upper portion of the sequence an alkali-like character. According to our most recent Hartree-Fock calculations with first-order relativistic corrections, the ground term is 5s ^2S for Z > 77 (Ir XVII) and the first excited term is 5p ^2P^o for Z > 84 (Po XXIV). Comparisons are made with calculations of Cowan in W XIV. The prospects for observation of these spectra in fast ion beams are discussed. (orig.)
Comparison of Nucleotide Sequence of P2C Region in Diabetogenic and Non-Diabetogenic Coxsackie Virus B5 Isolates

Directory of Open Access Journals (Sweden)

Cheng-Chong Chou

2004-11-01

Enteroviruses are environmental triggers in the pathogenesis of type 1 diabetes mellitus (DM). A sequence of six identical amino acids (PEVKEK) is shared by the 2C protein of Coxsackie virus B and the glutamic acid decarboxylase (GAD) molecules. Between 1995 and 2002, we investigated 22 Coxsackie virus B5 (CVB5) isolates from southern Taiwan. Four of these isolates were obtained from four new-onset type 1 DM patients with diabetic ketoacidosis. We compared a 300 nucleotide sequence in the 2C protein gene (p2C) in 24 CVB5 isolates (4 diabetogenic, 18 non-diabetogenic and 2 prototype). We found 0.3-10% nucleotide differences. In the four isolates from type 1 DM patients, there was only 2.4-3.4% nucleotide difference, and there was only 1.7-7.1% nucleotide difference between type 1 DM isolates and non-diabetogenic isolates. Comparison of the nucleotide sequence between prototype virus and 22 CVB5 isolates revealed 18.4-24.1% difference. Twenty-one CVB5 isolates from type 1 DM and non-type 1 DM patients contained the PEVKEK sequence, as shown by the p2C nucleotide sequence. Our data showed that the viral p2C sequence with homology with GAD is highly conserved in CVB5 isolates. There was no difference between diabetogenic and non-diabetogenic CVB5 isolates. All four type 1 DM patients had at least one of the genetic susceptibility alleles HLA-DR, DQA1, DQB1. Other genetic and autoimmune factors such as HLA genetic susceptibility and GAD may also play important roles in the pathogenesis in type 1 DM.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

Science.gov (United States)

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
A direct comparison of MELCOR 1.8.3 and MAAP4 results for several PWR & BWR accident sequences

International Nuclear Information System (INIS)

Leonard, M.T.; Ashbaugh, S.G.; Cole, R.K.; Bergeron, K.D.; Nagashima, K.

1996-01-01

This paper presents a comparison of calculations of severe accident progression for several postulated accident sequences for representative Pressurized Water Reactors (PWR) and Boiling Water Reactors (BWR) nuclear power plants performed with the MELCOR 1.8.3 and the MAAP4 computer codes. The PWR system examined in this study is a 1100 MWe system similar in design to a Westinghouse 3-loop plant with a large dry containment; the BWR is a 1100 MWe system similar in design to General Electric BWR/4 with a Mark I containment. A total of nine accident sequences were studied with both codes. Results of these calculations are compared to identify major differences in the timing of key events in the calculated accident progression or other important aspects of severe accident behavior, and to identify specific sources of the observed differences
Cloning, sequencing, and transgenic expression of Podospora curvicolla and Sordaria macrospora eEF1A genes: relationship between cytosolic translation and longevity in filamentous fungi.

Science.gov (United States)

Gagny, B; Rossignol, M; Silar, P

1997-12-01

We have cloned and sequenced the gene encoding the translation elongation factor eEF1A from two filamentous fungi, Podospora curvicolla and Sordaria macrospora. These fungi are close relatives of Podospora anserina and also show senescence syndromes. Comparison of the sequences of the deduced proteins with that of P. anserina reveals that the three proteins differ in several positions. Replacement of the P. anserina gene by either of the two exogenous genes does not entail any modification in P. anserina physiology; the longevity of the fungus is not affected. No alteration of in vivo translational accuracy was detected; however, the exogenous proteins nonetheless promoted a modification of the resistance to the aminoglycoside antibiotic paromomycin. These data suggest that optimization of life span between these closely related fungi has likely not been performed during evolution through modifications of eEF1A activity, despite the fact that mutations in this factor can drastically affect longevity. Copyright 1997 Academic Press.
Genome sequence of Lactobacillus rhamnosus ATCC 8530.

Science.gov (United States)

Pittet, Vanessa; Ewen, Emily; Bushell, Barry R; Ziola, Barry

2012-02-01

Lactobacillus rhamnosus is found in the human gastrointestinal tract and is important for probiotics. We became interested in L. rhamnosus isolate ATCC 8530 in relation to beer spoilage and hops resistance. We report here the genome sequence of this isolate, along with a brief comparison to other available L. rhamnosus genome sequences.
Genome Sequence of Lactobacillus rhamnosus ATCC 8530

OpenAIRE

Pittet, Vanessa; Ewen, Emily; Bushell, Barry R.; Ziola, Barry

2012-01-01

Lactobacillus rhamnosus is found in the human gastrointestinal tract and is important for probiotics. We became interested in L. rhamnosus isolate ATCC 8530 in relation to beer spoilage and hops resistance. We report here the genome sequence of this isolate, along with a brief comparison to other available L. rhamnosus genome sequences.
Design and control of an ideal heat-integrated distillation column (ideal HIDiC) system separating a close-boiling ternary mixture

International Nuclear Information System (INIS)

Huang Kejin; Shan Lan; Zhu Qunxiong; Qian Jixin

2007-01-01

Despite the fact that a stand-alone ideal heat-integrated distillation column (ideal HIDiC) can be thermodynamically efficient and operationally stable, the application of an ideal HIDiC system to separate a close-boiling multi-component mixture is still a challenging problem because of the possibility of strong interactions within/between the ideal HIDiCs involved. In this work, employment of two ideal HIDiCs to separate a close-boiling ternary mixture is studied in terms of static and dynamic performance. It is found that the ideal HIDiC system can be a competitive alternative with a substantial energy saving and comparable dynamic performance in comparison with its conventional counterpart. The direct sequence appears to be superior to the indirect sequence due to the relatively small vapor flow rates to the compressors. Controlling the bottom composition of the first ideal HIDiC with the pressure elevation from the stripping section to the rectifying section helps to suppress the disturbances from the feed to the second ideal HIDiC. Special caution should, however, be taken when the latent heat of the distillates is to be recovered within/between the ideal HIDiCs involved, because a positive feedback mechanism may be formed and give rise to additional difficulties in process operation
Genome-wide identification of aquaporin encoding genes in Brassica oleracea and their phylogenetic sequence comparison to Brassica crops and Arabidopsis

Science.gov (United States)

Diehn, Till A.; Pommerrenig, Benjamin; Bernhardt, Nadine; Hartmann, Anja; Bienert, Gerd P.

2015-01-01

Aquaporins (AQPs) are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia, and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea) and other Brassica species. The recent releases of the genome sequences of B. oleracea and Brassica rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins. In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of Arabidopsis thaliana, B. oleracea, B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re-) name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of
The need for high-quality whole-genome sequence databases in microbial forensics.

Science.gov (United States)

Sjödin, Andreas; Broman, Tina; Melefors, Öjar; Andersson, Gunnar; Rasmusson, Birgitta; Knutsson, Rickard; Forsman, Mats

2013-09-01

Microbial forensics is an important part of a strengthened capability to respond to biocrime and bioterrorism incidents to aid in the complex task of distinguishing between natural outbreaks and deliberate acts. The goal of a microbial forensic investigation is to identify and criminally prosecute those responsible for a biological attack, and it involves a detailed analysis of the weapon--that is, the pathogen. The recent development of next-generation sequencing (NGS) technologies has greatly increased the resolution that can be achieved in microbial forensic analyses. It is now possible to identify, quickly and in an unbiased manner, previously undetectable genome differences between closely related isolates. This development is particularly relevant for the most deadly bacterial diseases that are caused by bacterial lineages with extremely low levels of genetic diversity. Whole-genome analysis of pathogens is envisaged to be increasingly essential for this purpose. In a microbial forensic context, whole-genome sequence analysis is the ultimate method for strain comparisons as it is informative during identification, characterization, and attribution--all 3 major stages of the investigation--and at all levels of microbial strain identity resolution (ie, it resolves the full spectrum from family to isolate). Given these capabilities, one bottleneck in microbial forensics investigations is the availability of high-quality reference databases of bacterial whole-genome sequences. To be of high quality, databases need to be curated and accurate in terms of sequences, metadata, and genetic diversity coverage. The development of whole-genome sequence databases will be instrumental in successfully tracing pathogens in the future.
Comparison of closely related, uncultivated Coxiella tick endosymbiont population genomes reveals clues about the mechanisms of symbiosis.

Science.gov (United States)

Tsementzi, Despina; Castro Gordillo, Juan; Mahagna, Mustafa; Gottlieb, Yuval; Konstantinidis, Konstantinos T

2018-05-01

Understanding the symbiotic interaction between Coxiella-like endosymbionts (CLE) and their tick hosts is challenging due to lack of isolates and difficulties in tick functional assays. Here we sequenced the metagenome of a CLE population from wild Rhipicephalus sanguineus ticks (CRs) and compared it to the previously published genome of its close relative, CLE of R. turanicus (CRt). The tick hosts are closely related sympatric species, and their two endosymbiont genomes are highly similar with only minor differences in gene content. Both genomes encode numerous pseudogenes, consistent with an ongoing genome reduction process. In silico flux balance metabolic analysis (FBA) revealed the excess production of L-proline for both genomes, indicating a possible proline transport from Coxiella to the tick. Additionally, both CR genomes encode multiple copies of the proline/betaine transporter gene, proP. Modelling additional Coxiellaceae members, including other tick CLE, did not identify proline as an excreted metabolite. Although both CRs and CRt genomes encode intact B vitamin synthesis pathway genes, which are presumed to underlie the mechanism of CLE-tick symbiosis, the FBA analysis indicated no changes for their products. Therefore, this study provides new testable hypotheses for the symbiosis mechanism and a better understanding of CLE genome evolution and diversity. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.
Delineation of the genus Actinobacillus by comparison of partial infB sequences

DEFF Research Database (Denmark)

Nørskov-Lauritsen, Niels; Christensen, H; Okkels, H.

2004-01-01

A 426 bp fragment of infB, a housekeeping gene that encodes translation initiation factor 2, was sequenced from 59 clinical isolates and type strains of Actinobacillus species and sequences were compared. Partial sequences of 16S rRNA genes were also obtained. By comparing infB sequences, Actinob...
Combining real-time PCR and next-generation DNA sequencing to provide quantitative comparisons of fungal aerosol populations

Science.gov (United States)

Dannemiller, Karen C.; Lang-Yona, Naama; Yamamoto, Naomichi; Rudich, Yinon; Peccia, Jordan

2014-02-01

We examined fungal communities associated with the PM10 mass of Rehovot, Israel outdoor air samples collected in the spring and fall seasons. Fungal communities were described by 454 pyrosequencing of the internal transcribed spacer (ITS) region of the fungal ribosomal RNA encoding gene. To allow for a more quantitative comparison of fungal exposure in humans, the relative abundance values of specific taxa were transformed to absolute concentrations through multiplying these values by the sample's total fungal spore concentration (derived from universal fungal qPCR). Next, the sequencing-based absolute concentrations for Alternaria alternata, Cladosporium cladosporioides, Epicoccum nigrum, and Penicillium/Aspergillus spp. were compared to taxon-specific qPCR concentrations for A. alternata, C. cladosporioides, E. nigrum, and Penicillium/Aspergillus spp. derived from the same spring and fall aerosol samples. Results of these comparisons showed that the absolute concentration values generated from pyrosequencing were strongly associated with the concentration values derived from taxon-specific qPCR (for all four species, p 0.70). The correlation coefficients were greater for species present in higher concentrations. Our microbial aerosol population analyses demonstrated that fungal diversity (number of fungal operational taxonomic units) was higher in the spring compared to the fall (p = 0.02), and principal coordinate analysis showed distinct seasonal differences in taxa distribution (ANOSIM p = 0.004). Among genera containing allergenic and/or pathogenic species, the absolute concentrations of Alternaria, Aspergillus, Fusarium, and Cladosporium were greater in the fall, while Cryptococcus, Penicillium, and Ulocladium concentrations were greater in the spring. The transformation of pyrosequencing fungal population relative abundance data to absolute concentrations can improve next-generation DNA sequencing-based quantitative aerosol exposure assessment.
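The relative-to-absolute transformation described above amounts to multiplying each taxon's relative abundance by the sample's total fungal concentration. A minimal Python sketch follows; the function name, the taxa and the numbers are hypothetical illustrations, not values from the study.

```python
def absolute_concentrations(relative_abundance, total_concentration):
    """Scale per-taxon relative abundances by the sample's total concentration.

    relative_abundance: dict of taxon -> fraction of sequence reads (hypothetical input)
    total_concentration: total fungal concentration for the sample, e.g. from universal qPCR
    """
    total = sum(relative_abundance.values())
    if total <= 0:
        raise ValueError("relative abundances must sum to a positive value")
    # Normalise first so that the fractions sum to 1 even if the input is in percent.
    return {
        taxon: (fraction / total) * total_concentration
        for taxon, fraction in relative_abundance.items()
    }


# Illustrative values only (not data from the abstract).
sample = {"Alternaria": 0.25, "Cladosporium": 0.50, "other": 0.25}
result = absolute_concentrations(sample, 1000.0)
print(result)
```

The resulting per-taxon values can then be compared directly against taxon-specific qPCR measurements, which is the comparison the abstract reports.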
[Neuronal activity of monkey dorso-lateral premotor cortex during tasks of figure recognition guided motor sequence vs memorized spatial motor sequence].

Science.gov (United States)

Chen, Y C; Huang, F D; Chen, N H; Shou, J Y; Wu, L

1998-04-01

In the last 2-3 decades the role of the monkey premotor cortex (PM) in memorized spatial sequential (MSS) movements has been amply investigated. However, it is not yet known whether PM participates in movement sequence behaviour guided by recognition of visual figures (i.e. the figure-recognition sequence, FRS). In the present work three monkeys were trained to perform both FRS and MSS tasks. Postmortem examination showed that 202 cells were in the dorso-lateral premotor cortex. Among 111 cells recorded during the two tasks, more than 50% changed their activity during the cue periods in either task. During the response period, the ratios of cells with changes of firing rate in FRS and MSS were both high and roughly equal to each other, while during the image period, the proportion in the FRS (83.7%) was significantly higher than that in the MSS (66.7%). Comparison of neuronal activities during the same motor sequence in the two different tasks showed that during the image periods PM neuronal activities were more closely related to the FRS task, while during the cue periods no difference could be found. Analysis of cell responses showed that neurons with longer latency were far more numerous in MSS than in FRS in both the cue and image periods. The present results indicate that the premotor cortex participates in the FRS motor sequence as well as in MSS and suggest that the dorso-lateral PM represents another functional subarea shared by both FRS and MSS tasks. However, in view of the differences in PM neuronal responses in the cue or image periods of FRS and MSS tasks, it seems likely that the neural networks involved in FRS and MSS tasks are different.
Functional dissection of the alphavirus capsid protease: sequence requirements for activity.

Science.gov (United States)

Thomas, Saijo; Rai, Jagdish; John, Lijo; Günther, Stephan; Drosten, Christian; Pützer, Brigitte M; Schaefer, Stephan

2010-11-18

The alphavirus capsid is multifunctional and plays a key role in the viral life cycle. The nucleocapsid domain is released by the self-cleavage activity of the serine protease domain within the capsid. All alphaviruses analyzed to date show this autocatalytic cleavage. Here we have analyzed the sequence requirements for the cleavage activity of the capsid protease of Chikungunya virus (genus Alphavirus). Amongst alphaviruses, the C-terminal amino acid tryptophan (W261) is conserved and was found to be important for the cleavage. Mutating tryptophan to alanine (W261A) completely inactivated the protease. Other amino acids near W261 had no effect on the activity of this protease. However, the serine protease inhibitor AEBSF did not inhibit the activity. Through error-prone PCR we found that isoleucine 227 is important for effective activity. The loss of activity was analyzed further by molecular modelling and comparison of WT and mutant structures. It was found that lysine introduced at position 227 is spatially very close to the catalytic triad and may disrupt electrostatic interactions in the catalytic site and thus inactivate the enzyme. We are also examining other sequence requirements for this protease activity. We analyzed various amino acid sequence requirements for the activity of ChikV capsid protease and found that amino acids outside the catalytic triad are important for the activity.
Value of fat-suppressed PD-weighted TSE-sequences for detection of anterior and posterior cruciate ligament lesions-Comparison to arthroscopy

International Nuclear Information System (INIS)

Schaefer, Fritz K.W.; Schaefer, Philipp J.; Brossmann, Joachim; Frahm, Christian; Muhle, Claus; Hilgert, Ralf Erik; Heller, Martin; Jahnke, Thomas

2006-01-01

Objective: To evaluate fat-suppressed (FS) proton-density-weighted (PDw) turbo spin-echo (TSE) magnetic resonance imaging for the detection of anterior and posterior cruciate ligament lesions in comparison to arthroscopy. Materials and methods: In a prospective study, 31 knee joints were imaged on a 1.5 T MR scanner (Vision®, Siemens, Erlangen) prior to arthroscopy using the following sequences: (a) sagittal FS-PDw/T2w TSE (TR/TE: 4009/15/105 ms); (b) sagittal PDw/T2w TSE (TR/TE: 3800/15/105 ms). Further imaging parameters: slice thickness 3 mm, FOV 160 mm, matrix 256 x 256. A total of 62 anterior and posterior cruciate ligaments (ACL/PCL) were evaluated; the standard of reference was arthroscopy. Sensitivity, specificity, positive (ppv) and negative predictive value (npv) and accuracy were calculated. Results: Twenty-one cruciate ligament ruptures were detected in arthroscopy, 19 ACL- and 2 PCL-ruptures (on MRI 34/124, 25/62 ACL, 9/62 PCL lesions). For all four sequences in the 31 patients with arthroscopic correlation, sensitivity, specificity, ppv, npv and accuracy were 86%, 98%, 95%, 93% and 94% for detection of tears, and 84%, 100%, 100%, 80% and 90% for ACL-ruptures, respectively. The two PCL-ruptures were true positive in all sequences; one intact PCL was diagnosed as torn (false positive). Conclusions: Fat-suppressed PDw/T2w TSE-MR sequences are comparable to PDw TSE sequences for the detection of ACL/PCL-lesions.
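The five diagnostic measures named above all derive from a 2 x 2 table of imaging result against the arthroscopic reference. The Python sketch below shows the standard definitions. The counts passed in are an assumption chosen for illustration: they are consistent with 62 ligaments and 21 ruptures and happen to round to the percentages quoted for detection of tears, but the abstract does not report the underlying table.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from a 2 x 2 confusion table."""
    return {
        "sensitivity": tp / (tp + fn),   # ruptures correctly called torn
        "specificity": tn / (tn + fp),   # intact ligaments correctly called intact
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Hypothetical counts (assumption, not taken from the paper): 21 ruptures, 41 intact.
metrics = diagnostic_metrics(tp=18, fp=1, fn=3, tn=40)
rounded = {name: round(100 * value) for name, value in metrics.items()}
print(rounded)
```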
Comparison of Intravenous Morphine with Sublingual Buprenorphine in Management of Postoperative Pain after Closed Reduction Orthopedic Surgery.

Science.gov (United States)

Soltani, Ghasem; Khorsand, Mahmood; Shamloo, Alireza Sepehri; Jarahi, Lida; Zirak, Nahid

2015-10-01

Postoperative pain is a common side effect following surgery that can significantly reduce surgical quality and patient's satisfaction. Treatment options are morphine and buprenorphine. We aimed to compare the efficacy of a single dose of intravenous morphine with sublingual buprenorphine in postoperative pain control following closed reduction surgery. This triple blind clinical trial was conducted on 90 patients referred for closed reduction orthopedic surgery. They were older than 18 years and in classes I and II of the American Society of Anesthesiologists (ASA) with an operation time of 30-90 minutes. Patients were divided into two groups of buprenorphine (4.5µg/kg sublingually) and morphine (0.2mg/kg intravenously). Baseline characteristics, vital signs, pain score, level of sedation and pharmacological side effects were recorded in the recovery room (at 0 and 30 minutes), and in the ward (at 3, 6 and 12 hours). SPSS version 19 software was used for data analysis and the significance level was set at P<0.05. Ninety patients were studied, 60 males and 30 females with a mean age of 37.7±16.2 years. There was no significant difference between the two groups in terms of baseline characteristics. Pain score in the morphine group was significantly higher than the buprenorphine group with an average score of 2.5 (P<0.001). Postoperative mean heart rate in the buprenorphine group was four beats lower than the morphine group (P<0.001). Also, in the buprenorphine 48.6% and in the morphine group 86.7% of cases were conscious in recovery (P=0.001) with a higher rate of pruritus in the latter group (P=0.001). Sublingual buprenorphine administration before anesthesia induction in closed reduction surgery can lead to better postoperative pain control in comparison to intravenous morphine. Due to simple usage and longer postoperative sedation, sublingual buprenorphine is recommended as a suitable drug in closed reduction surgery.
The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.

Science.gov (United States)

Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo

2018-02-01

The plant chloroplast (cp) genome is a highly conserved structure which is beneficial for evolution and systematic research. Currently, numerous complete cp genome sequences have been reported due to high throughput sequencing technology. However, there is no complete chloroplast genome of genus Dodonaea that has been reported before. To better understand the molecular basis of Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) 87,204 bp, and small single copy (SSC) 17,972 bp. The annotation analysis revealed a total of 115 unique genes of which 81 were protein coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of cp genome reported here provides a valuable genetic resource for comprehensive further studies in genetic variation, taxonomy and phylogenetic evolution of Sapindaceae family. In addition, SSR markers detected will be used in further phylogeographic and population structure studies of the species in this genus.
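As a worked check of the quadripartite structure described above, the total length of a chloroplast genome is the large single copy region plus the small single copy region plus two copies of the inverted repeat. The short Python sketch below uses only the figures quoted in the abstract; the variable names are illustrative. The parts as quoted sum to within 1 bp of the stated total, and the abstract does not indicate which figure accounts for the difference.

```python
# Region lengths in bp, as quoted in the abstract.
lsc, ssc, ir = 87_204, 17_972, 27_099
reported_total = 159_375

# Quadripartite structure: LSC + SSC + two inverted-repeat copies.
total_from_parts = lsc + ssc + 2 * ir
difference = reported_total - total_from_parts

# Gene tally: protein-coding + tRNA + rRNA genes.
genes = 81 + 30 + 4

print(total_from_parts, difference, genes)
```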

Abdominal MR imaging using a HASTE sequence : image comparison on the different echo times

International Nuclear Information System (INIS)

Park, Kwang Bo; Lee, Moon Gyu; Lim, Tae Hwan; Jeong, Yoong Ki; Ha, Hyun Kwon; Kim, Pyo Nyun; Auh, Yong Ho

1999-01-01

To determine the optimal parameters of abdominal HASTE imaging by means of a comparison of intermediate and long TE (echo time), we evaluated 30 consecutive patients who had undergone liver MR during a three-month period. Twelve patients were diagnosed as normal, four as having liver cirrhosis, and 14 were found to be suffering from hepatic hemangioma. On the basis of measured signal intensity of the liver, spleen, pancreas and gallbladder, and of fat, muscle, hemangioma, and background, we calculated the ratios of signal to noise (S/N), signal difference to noise (SD/N), and signal intensity (SI). Image quality was compared using these three ratios, and images from two HASTE sequences with TEs of 90 msec and 134 msec were qualitatively evaluated. S/N ratio of the liver was higher when TE was 90 msec (p<.05), though S/N, SD/N and SI ratios of the spleen, gallbladder, and pancreas, and of hemangioma, were higher when TE was 134 msec (p<.05). However, in muscle, all three ratios were higher at a TE of 90 msec. SD/N ratio and SI of fat were higher at a TE of 134 msec. Overall image quality was better at a TE of 134 msec than at one of 90 msec. A HASTE sequence with a TE of 134 msec showed greater tissue contrast and stronger T2-weighted images than one with a TE of 90 msec.
Zseq: An Approach for Preprocessing Next-Generation Sequencing Data.

Science.gov (United States)

Alkhateeb, Abedalrhman; Rueda, Luis

2017-08-01

Next-generation sequencing technology generates a huge number of reads (short sequences), which contain a vast amount of genomic data. The sequencing process, however, comes with artifacts. Preprocessing of sequences is mandatory for further downstream analysis. We present Zseq, a linear method that identifies the most informative genomic sequences and reduces the number of biased sequences, sequence duplications, and ambiguous nucleotides. Zseq finds the complexity of the sequences by counting the number of unique k-mers in each sequence as its corresponding score and also takes into account other factors such as ambiguous nucleotides or high GC-content percentage in k-mers. Based on a z-score threshold, Zseq sweeps through the sequences again and filters those with a z-score less than the user-defined threshold. The Zseq algorithm is able to provide a better mapping rate; it reduces the number of ambiguous bases significantly in comparison with other methods. Evaluation of the filtered reads has been conducted by aligning the reads and assembling the transcripts using the reference genome as well as de novo assembly. The assembled transcripts show a better discriminative ability to separate cancer and normal samples in comparison with another state-of-the-art method. Moreover, de novo assembled transcripts from the reads filtered by Zseq have longer genomic sequences than other tested methods. Estimating the threshold of the cutoff point is introduced using labeling rules with optimistic results.
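The scoring-and-filtering idea described above can be sketched in a few lines of Python. This is not the Zseq implementation: the function names, the default k and threshold, and the choice to simply skip k-mers containing N are assumptions made for illustration, and the GC-content factor mentioned in the abstract is left out.

```python
from statistics import mean, pstdev


def unique_kmer_count(seq, k):
    """Number of distinct k-mers in seq, ignoring k-mers with an ambiguous base (N)."""
    kmers = {seq[i:i + k] for i in range(len(seq) - k + 1)}
    return sum(1 for kmer in kmers if "N" not in kmer)


def filter_by_zscore(reads, k=3, threshold=-1.0):
    """Keep reads whose complexity z-score is at least `threshold`.

    First pass: score every read by its unique k-mer count.
    Second pass: convert scores to z-scores and drop low-complexity reads.
    """
    if not reads:
        return []
    scores = [unique_kmer_count(read, k) for read in reads]
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        # All reads are equally complex, so there is nothing to filter.
        return list(reads)
    return [read for read, s in zip(reads, scores) if (s - mu) / sigma >= threshold]


# Illustrative reads (made up): the homopolymer has a single unique 3-mer.
reads = ["ACGTACGGTCA", "AAAAAAAAAAA", "ACGTTGCAAGC"]
kept = filter_by_zscore(reads, k=3, threshold=-1.0)
print(kept)
```

Both passes are linear in the total number of bases, which matches the "linear method" description, although the real tool may organise the computation differently.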
Advancing analytical algorithms and pipelines for billions of microbial sequences.

Science.gov (United States)

Gonzalez, Antonio; Knight, Rob

2012-02-01

The vast number of microbial sequences resulting from sequencing efforts using new technologies require us to re-assess currently available analysis methodologies and tools. Here we describe trends in the development and distribution of software for analyzing microbial sequence data. We then focus on one widely used set of methods, dimensionality reduction techniques, which allow users to summarize and compare these vast datasets. We conclude by emphasizing the utility of formal software engineering methods for the development of computational biology tools, and the need for new algorithms for comparing microbial communities. Such large-scale comparisons will allow us to fulfill the dream of rapid integration and comparison of microbial sequence data sets, in a replicable analytical environment, in order to describe the microbial world we inhabit. Copyright © 2011 Elsevier Ltd. All rights reserved.
Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species

Directory of Open Access Journals (Sweden)

Wang Xiao-Wei

2012-10-01

Background: Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whitefly species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED). Results: More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10^-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species, suggesting species-specific adaptations during whitefly evolution. On the other hand, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identity data into gene functional groups and found that the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for
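A percent-divergence figure for an orthologous gene pair, like those quoted above, can be illustrated with an uncorrected pairwise distance over an alignment. The abstract does not say how the authors computed divergence, so the Python sketch below is an assumption: it counts mismatches over aligned, gap-free columns, and the function name and example sequences are made up.

```python
def percent_divergence(aligned_a, aligned_b, gap="-"):
    """Uncorrected divergence (%) between two aligned sequences.

    Columns where either sequence has a gap are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    mismatches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * mismatches / compared


# Illustrative sequences only: one substitution in ten compared columns.
ungapped = percent_divergence("ATGGCCAAGT", "ATGGCTAAGT")
gapped = percent_divergence("ATG-CC", "ATGACT")
print(ungapped, gapped)
```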
[Comparison research on two-stage sequencing batch MBR and one-stage MBR].

Science.gov (United States)

Yuan, Xin-Yan; Shen, Heng-Gen; Sun, Lei; Wang, Lin; Li, Shi-Feng

2011-01-01

To address problems in MBR operation such as low nitrogen and phosphorus removal efficiency and severe membrane fouling, a two-stage sequencing batch MBR (TSBMBR) was compared with a one-stage aerobic MBR. The results indicated that TSBMBR retained the advantages of SBR in removing nitrogen and phosphorus, which could make up for the deficiency of the traditional one-stage aerobic MBR in nitrogen and phosphorus removal. During the steady operation period, the average effluent NH4(+)-N, TN and TP concentrations were 2.83, 12.20 and 0.42 mg/L, respectively, which could meet the requirements for domestic scenic environment use. From the point of view of membrane fouling control, TSBMBR had lower SMP in the supernatant, a lower specific trans-membrane flux reduction rate and lower membrane fouling resistance than the one-stage aerobic MBR. The sedimentation layer and gel layer resistances of TSBMBR were only 6.5% and 33.12% of those of the one-stage aerobic MBR. Besides its high efficiency in removing nitrogen and phosphorus, TSBMBR could effectively reduce sedimentation and gel layer pollution on the membrane surface. Compared with the one-stage MBR, TSBMBR could operate with higher trans-membrane flux, a lower membrane fouling rate and better pollutant removal.
Association of poly-purine/poly-pyrimidine sequences with meiotic recombination hot spots

Directory of Open Access Journals (Sweden)

Pitt Joel PW

2006-07-01

Background: Meiotic recombination events have been found to concentrate in 1–2.5 kilobase regions, but these recombination hot spots do not share a consensus sequence and why they occur at specific sites is not fully understood. Some previous evidence suggests that poly-purine/poly-pyrimidine (poly-pu/py) tracts (PPTs), a class of sequence with distinctive biochemical properties, could be involved in recombination, but no general association of PPTs with meiotic recombination hot spots has previously been reported. Results: We used computational methods to investigate in detail the relationship between PPTs and hot spots. We show statistical associations of PPT frequency with hot spots of meiotic recombination initiating lesions, double-strand breaks, in the genome of the yeast S. cerevisiae and with experimentally well characterized human meiotic recombination hot spots. Supporting a possible role of poly-pu/py-rich sequences in hot spot recombination, we also found that all three single nucleotide polymorphisms previously shown to be associated with human hot spot activity changes occur within sequence contexts of 14 bp or longer that are 85% or more poly-pu/py and at least 70% G/C. These polymorphisms are all close to the hot spot mid points. Comparing the sequences of experimentally characterized human hot spots with the orthologous regions of the chimpanzee genome previously shown not to contain hot spots, we found that in all five cases in which comparisons for the hot spot central regions are possible with publicly available sequence data, there are differences near the human hot spot mid points within sequences 14 bp or longer consisting of more than 80% poly-pu/py and at least 50% G/C. Conclusion: Our results, along with previous evidence for the unique biochemical properties and recombination-stimulating potential of poly-pu/py-rich sequences, suggest that the possible functional involvement of this type of sequence in meiotic
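The sequence-context criterion quoted in the Results (14 bp or longer, 85% or more poly-pu/py, at least 70% G/C) can be illustrated with a sliding-window scan. The Python sketch below is a guess at one reasonable reading, not the authors' method: it treats the poly-pu/py fraction as the share of the dominant base class (purine or pyrimidine) on the given strand, uses a fixed window, and assumes the input contains only A, C, G and T. All names are hypothetical.

```python
PURINES = set("AG")


def pu_py_fraction(seq):
    """Fraction of the dominant base class (purine or pyrimidine) on this strand."""
    purines = sum(1 for base in seq if base in PURINES)
    return max(purines, len(seq) - purines) / len(seq)


def gc_fraction(seq):
    """Fraction of G or C bases."""
    return sum(1 for base in seq if base in "GC") / len(seq)


def ppt_windows(seq, window=14, min_pu_py=0.85, min_gc=0.70):
    """Start positions of windows meeting both the pu/py and the G/C thresholds."""
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - window + 1):
        sub = seq[start:start + window]
        if pu_py_fraction(sub) >= min_pu_py and gc_fraction(sub) >= min_gc:
            hits.append(start)
    return hits


# Made-up 14 bp examples: a G-rich purine tract, its pyrimidine counterpart, and a mixed repeat.
purine_hits = ppt_windows("GGAGGGAGGGGAGG")
pyrimidine_hits = ppt_windows("CCTCCCTCCCCTCC")
mixed_hits = ppt_windows("ACGTACGTACGTAC")
print(purine_hits, pyrimidine_hits, mixed_hits)
```

Overlapping hits would need to be merged to report tracts longer than the window; that step is omitted here.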
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

Science.gov (United States)

Meinicke, Peter

2009-09-02

Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences

Directory of Open Access Journals (Sweden)

Meinicke Peter

2009-09-01

Background: Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Description: Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. Conclusion: For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
Comparative genome sequence analysis of Choristoneura occidentalis Freeman and C. rosaceana Harris (Lepidoptera: Tortricidae) alphabaculoviruses.

Directory of Open Access Journals (Sweden)

David K Thumbi

The complete genome sequences of Choristoneura occidentalis and C. rosaceana nucleopolyhedroviruses (ChocNPV and ChroNPV, respectively) (Baculoviridae: Alphabaculovirus) were determined and compared with each other and with those of other baculoviruses, including the genome of the closely related C. fumiferana NPV (CfMNPV). The ChocNPV genome was 128,446 bp in length (1147 bp smaller than that of CfMNPV), had a G+C content of 50.1%, and contained 148 open reading frames (ORFs). In comparison, the ChroNPV genome was 129,052 bp in length, had a G+C content of 48.6% and contained 149 ORFs. ChocNPV and ChroNPV shared 144 ORFs in common, and had a 77% sequence identity with each other and 96.5% and 77.8% sequence identity, respectively, with CfMNPV. Five homologous regions (hrs), with sequence similarities to those of CfMNPV, were identified in ChocNPV, whereas the ChroNPV genome contained three hrs featuring up to 14 repeats. Both genomes encoded three inhibitors of apoptosis (IAP-1, IAP-2, and IAP-3), as reported for CfMNPV, and the ChocNPV IAP-3 gene represented the most divergent functional region of this genome relative to CfMNPV. Two ORFs were unique to ChocNPV, and four were unique to ChroNPV. ChroNPV ORF chronpv38 is a eukaryotic initiation factor 5 (eIF-5) homolog that has also been identified in the C. occidentalis granulovirus (ChocGV) and is believed to be the product of horizontal gene transfer from the host. Based on levels of sequence identity and phylogenetic analysis, both ChocNPV and ChroNPV fall within group I alphabaculoviruses, where ChocNPV appears to be more closely related to CfMNPV than does ChroNPV. Our analyses suggest that it may be appropriate to consider ChocNPV and CfMNPV as variants of the same virus species.
Transcriptome sequencing of Crucihimalaya himalaica (Brassicaceae) reveals how Arabidopsis close relative adapt to the Qinghai-Tibet Plateau

Science.gov (United States)

Qiao, Qin; Wang, Qia; Han, Xi; Guan, Yanlong; Sun, Hang; Zhong, Yang; Huang, Jinling; Zhang, Ticao

2016-02-01

The extreme environment of the Qinghai-Tibet Plateau (QTP) provides an ideal natural laboratory for studies on adaptive evolution. Few genome/transcriptome based studies have been conducted on how plants adapt to the environments of QTP compared to numerous studies on vertebrates. Crucihimalaya himalaica is a close relative of Arabidopsis with a typical QTP distribution, and is hoped to be a new model system to study speciation and ecological adaptation in extreme environments. In this study, we generated a de novo transcriptome sequence of C. himalaica, with a total of 49,438 unigenes. Compared to five relatives, 10,487 orthogroups were shared by all six species, and 4,286 orthogroups contained putative single-copy genes. Further analysis identified 487 extremely significantly positively selected genes (PSGs) in the C. himalaica transcriptome. These PSGs were enriched in functions related to specific adaptation traits, such as response to radiation, DNA repair, nitrogen metabolism, and stabilization of membrane. These functions are responsible for the adaptation of C. himalaica to the high radiation, soil depletion and low temperature environments on QTP. Our findings indicate that C. himalaica has evolved complex strategies for adapting to the extreme environments on QTP and provide novel insights into genetic mechanisms of highland adaptation in plants.
Very high resolution single pass HLA genotyping using amplicon sequencing on the 454 next generation DNA sequencers: Comparison with Sanger sequencing.

Science.gov (United States)

Yamamoto, F; Höglund, B; Fernandez-Vina, M; Tyan, D; Rastrou, M; Williams, T; Moonsamy, P; Goodridge, D; Anderson, M; Erlich, H A; Holcomb, C L

2015-12-01

Compared to Sanger sequencing, next-generation sequencing offers advantages for high resolution HLA genotyping including increased throughput, lower cost, and reduced genotype ambiguity. Here we describe an enhancement of the Roche 454 GS GType HLA genotyping assay to provide very high resolution (VHR) typing, by the addition of 8 primer pairs to the original 14, to genotype 11 HLA loci. These additional amplicons help resolve common and well-documented alleles and exclude commonly found null alleles in genotype ambiguity strings. Simplification of workflow to reduce the initial preparation effort using early pooling of amplicons or the Fluidigm Access Array™ is also described. Performance of the VHR assay was evaluated on 28 well characterized cell lines using Conexio Assign MPS software which uses genomic, rather than cDNA, reference sequence. Concordance was 98.4%; 1.6% had no genotype assignment. Of concordant calls, 53% were unambiguous. To further assess the assay, 59 clinical samples were genotyped and results compared to unambiguous allele assignments obtained by prior sequence-based typing supplemented with SSO and/or SSP. Concordance was 98.7% with 58.2% as unambiguous calls; 1.3% could not be assigned. Our results show that the amplicon-based VHR assay is robust and can replace current Sanger methodology. Together with software enhancements, it has the potential to provide even higher resolution HLA typing. Copyright © 2015. Published by Elsevier Inc.
Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

Directory of Open Access Journals (Sweden)

Martin Mascher

The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL) population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data: the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, a paucity in the number of SNPs that were jointly discovered by the different pipelines, indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new
Complete nucleotide sequence of a novel Hibiscus-infecting Cilevirus from Florida and its relationship with closely associated Cileviruses

Science.gov (United States)

The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat- protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...
Complete Genome Sequence of Paenibacillus strain Y4.12MC10, a Novel Paenibacillus lautus strain Isolated from Obsidian Hot Spring in Yellowstone National Park.

Science.gov (United States)

Mead, David A; Lucas, Susan; Copeland, Alex; Lapidus, Alla; Cheng, Jan-Feng; Bruce, David C; Goodwin, Lynne A; Pitluck, Sam; Chertkov, Olga; Zhang, Xiaojing; Detter, John C; Han, Cliff S; Tapia, Roxanne; Land, Miriam; Hauser, Loren J; Chang, Yun-Juan; Kyrpides, Nikos C; Ivanova, Natalia N; Ovchinnikova, Galina; Woyke, Tanja; Brumm, Catherine; Hochstein, Rebecca; Schoenfeld, Thomas; Brumm, Phillip

2012-07-30

Paenibacillus sp. Y412MC10 was one of a number of organisms isolated from Obsidian Hot Spring, Yellowstone National Park, Montana, USA under permit from the National Park Service. The isolate was initially classified as a Geobacillus sp. Y412MC10 based on its isolation conditions and similarity to other organisms isolated from hot springs at Yellowstone National Park. Comparison of 16S rRNA sequences within the Bacillales indicated that Geobacillus sp. Y412MC10 clustered with Paenibacillus species, and the organism was most closely related to Paenibacillus lautus. Lucigen Corp. prepared genomic DNA and the genome was sequenced, assembled, and annotated by the DOE Joint Genome Institute. The genome sequence was deposited at the NCBI in October 2009 (NC_013406). The genome of Paenibacillus sp. Y412MC10 consists of one circular chromosome of 7,121,665 bp with an average G+C content of 51.2%. Comparison to other Paenibacillus species shows the organism lacks nitrogen fixation, antibiotic production and social interaction genes reported in other paenibacilli. The Y412MC10 genome shows a high level of synteny and homology to the draft sequence of Paenibacillus sp. HGF5, an organism from the Human Microbiome Project (HMP) Reference Genomes. This, combined with genomic CAZyme analysis, suggests an intestinal, rather than environmental, origin for Y412MC10.
Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

Directory of Open Access Journals (Sweden)

Jonathan Filée

2015-06-01

Full Text Available Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, the availability of a collection of closely related GV genomes enabling genomic comparisons offers the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for 5 groups of viruses belonging to the Mimiviridae and Phycodnaviridae families shows that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over time, with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobile genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties through lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.
Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

Science.gov (United States)

Filée, Jonathan

2015-01-01

Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, the availability of a collection of closely related GV genomes enabling genomic comparisons offers the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families shows that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over time, with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobile genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties through lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.
Cloning and sequence analysis of benzo-a-pyrene-inducible ...

African Journals Online (AJOL)

The phylogenetic tree based on the amino acid sequences clearly shows tilapia CYP1A and killifish CYP1A to be more closely related to each other than to the other CYP1A subfamilies. Sequence analysis of 3727 bp of genomic DNA showed that the clone obtained was the structural gene of CYP1A which consists of ...
GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

Science.gov (United States)

Lu, Bingxin; Leong, Hon Wai

2016-02-01

Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaptation and enable antibiotic resistance. Many methods have been proposed to predict GIs, but most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GIs based solely on the sequence of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on a one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. In addition, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GIs in newly sequenced genomes.
Chained learning architectures in a simple closed-loop behavioural context

DEFF Research Database (Denmark)

Kulvicius, Tomas; Porr, Bernd; Wörgötter, Florentin

2007-01-01

are very simple and consist of single learning unit. The current study is trying to solve this problem focusing on chained learning architectures in a simple closed-loop behavioural context. METHODS: We applied temporal sequence learning (Porr B and Wörgötter F 2006) in a closed-loop behavioural system...... where a driving robot learns to follow a line. Here for the first time we introduced two types of chained learning architectures named linear chain and honeycomb chain. We analyzed such architectures in an open and closed-loop context and compared them to the simple learning unit. CONCLUSIONS...
Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

Directory of Open Access Journals (Sweden)

Wang Jintu

2011-04-01

Full Text Available Abstract Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on the zebrafish genome, which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3

Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

Science.gov (United States)

2011-01-01

Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on the zebrafish genome, which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3,100 microsyntenies, covering over 50% of
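The headline figures in this record can be cross-checked with simple arithmetic. The sketch below only uses numbers quoted in the abstract; the implied genome size is an inference that assumes the quoted 2.5% is exact, and it is not a value stated in the record.

```python
# Figures quoted in the abstract above.
total_bp = 42_522_168        # total length of clean BES, in bp
n_bes = 65_720               # number of clean BAC end sequences
fraction_of_genome = 0.025   # "2.5% of common carp genome"

# Mean read length implied by the totals.
mean_read_length = total_bp / n_bes

# Genome size implied by the coverage figure (assumption: 2.5% is exact).
implied_genome_size = total_bp / fraction_of_genome

print(f"mean read length: {mean_read_length:.1f} bp")
print(f"implied genome size: {implied_genome_size / 1e9:.2f} Gb")
```

The mean comes out at about 647 bp, matching the quoted average read length, and the coverage figure implies a genome of roughly 1.7 Gb.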
A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers

Directory of Open Access Journals (Sweden)

Quail Michael A

2012-07-01

Full Text Available Abstract Background Next generation sequencing (NGS) technology has revolutionized genomic and genetic research. The pace of change in this area is rapid with three major new sequencing platforms having been released in 2011: Ion Torrent’s PGM, Pacific Biosciences’ RS and the Illumina MiSeq. Here we compare the results obtained with those platforms to the performance of the Illumina HiSeq, the current market leader. In order to compare these platforms, and get sufficient coverage depth to allow meaningful analysis, we have sequenced a set of 4 microbial genomes with mean GC content ranging from 19.3 to 67.7%. Together, these represent a comprehensive range of genome content. Here we report our analysis of that sequence data in terms of coverage distribution, bias, GC distribution, variant detection and accuracy. Results Sequence generated by Ion Torrent, MiSeq and Pacific Biosciences technologies displays near perfect coverage behaviour on GC-rich, neutral and moderately AT-rich genomes, but a profound bias was observed upon sequencing the extremely AT-rich genome of Plasmodium falciparum on the PGM, resulting in no coverage for approximately 30% of the genome. We analysed the ability to call variants from each platform and found that we could call slightly more variants from Ion Torrent data compared to MiSeq data, but at the expense of a higher false positive rate. Variant calling from Pacific Biosciences data was possible but higher coverage depth was required. Context specific errors were observed in both PGM and MiSeq data, but not in that from the Pacific Biosciences platform. Conclusions All three fast turnaround sequencers evaluated here were able to generate usable sequence. However, there are key differences between the quality of that data and the applications it will support.
Side-by-side comparison of an open-chamber (TM 300) and a closed-chamber (Vapometer™) transepidermal water loss meter.

Science.gov (United States)

Steiner, Markus; Aikman-Green, Sylvie; Prescott, Gordon J; Dick, Finlay D

2011-08-01

The measurement of transepidermal water loss (TEWL) is used to monitor changes in the stratum corneum's permeability to water vapor. This measurement is widely used in the cosmetics industry and in dermatology research. However, only limited work has been undertaken to assess the comparability of results from different TEWL meters over an extended range of measurements. This study compared the results of TEWL measurements between two commonly used open-chamber and closed-chamber TEWL devices. Five hundred and forty measurements were taken in 17 participants on the dorsum and palm of both hands on two different days and the order of the devices was randomized. The results showed that the open TEWL meter's capacity for measuring high values of TEWL was restricted, and that the closed-chamber TEWL meter was less sensitive to differences in the lower range of measurements. Both devices have their strengths for different applications, but their results cannot be directly compared. We were unable to find a statistical model that would allow us to transform the measurements made on one device for a comparison with the results generated by the other device. © 2011 John Wiley & Sons A/S.
On Closed Form Calculation of Line Spectral Frequencies (LSF)

DEFF Research Database (Denmark)

Dalsgaard, Paul; Andersen, Ove

2014-01-01

of characteristic polynomial zeros. The theoretical analysis is based on decomposition of sequences into symmetric and anti-symmetric polynomials defined as a series expansion of reduced Chebyshev polynomials of the first kind. Two variants of closed form functions are presented — each characterised by using...
PSP: rapid identification of orthologous coding genes under positive selection across multiple closely related prokaryotic genomes.

Science.gov (United States)

Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping

2013-12-27

With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaptation to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, designated PSP (Positive Selection analysis for Prokaryotic genomes), for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and to identify the genes undergoing positive selection, which may play key roles in host-pathogen interactions and/or environmental adaptation.
Scalable Kernel Methods and Algorithms for General Sequence Analysis

Science.gov (United States)

Kuksa, Pavel

2011-01-01

Analysis of large-scale sequential data has become an important task in machine learning and pattern recognition, inspired in part by numerous scientific and technological applications such as the document and text classification or the analysis of biological sequences. However, current computational methods for sequence comparison still lack…
3D reconstruction software comparison for short sequences

Science.gov (United States)

Strupczewski, Adam; Czupryński, Błażej

2014-11-01

Large scale multiview reconstruction has recently become a very popular area of research. There are many open source tools that can be downloaded and run on a personal computer. However, there are few, if any, comparisons of the available software in terms of accuracy on small datasets that a single user can create. The typical datasets for testing the software are archeological sites or cities, comprising thousands of images. This paper presents a comparison of currently available open source multiview reconstruction software for small datasets. It also compares the open source solutions with a simple structure from motion pipeline developed by the authors from scratch with the use of the OpenCV and Eigen libraries.
K2 and K2*: efficient alignment-free sequence similarity measurement based on Kendall statistics.

Science.gov (United States)

Lin, Jie; Adjeroh, Donald A; Jiang, Bing-Hua; Jiang, Yue

2018-05-15

Alignment-free sequence comparison methods can compute the pairwise similarity between a huge number of sequences much faster than sequence-alignment based methods. We propose a new non-parametric alignment-free sequence comparison method, called K2, based on the Kendall statistics. Compared with other state-of-the-art alignment-free comparison methods, K2 demonstrates competitive performance in generating the phylogenetic tree, in evaluating functionally related regulatory sequences, and in computing the edit distance (similarity/dissimilarity) between sequences. Furthermore, the K2 approach is much faster than the other methods. An improved method, K2*, is also proposed, which is able to determine the appropriate algorithmic parameter (length) automatically, without first considering different values. Comparative analysis with the state-of-the-art alignment-free sequence similarity methods demonstrates the superiority of the proposed approaches, especially with increasing sequence length or increasing dataset sizes. The K2 and K2* approaches are implemented in the R language as a package that is freely available for open access (http://community.wvu.edu/daadjeroh/projects/K2/K2_1.0.tar.gz). Contact: yueljiang@163.com. Supplementary data are available at Bioinformatics online.
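The general idea of comparing sequences through rank statistics on k-mer profiles can be sketched as follows. This is a minimal illustration, not the K2 or K2* statistic itself (the abstract does not define it): it simply computes Kendall's tau-b between the k-mer count vectors of two sequences. The function names, the choice of k = 2 and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_counts(seq, k=2, alphabet="ACGT"):
    """Count every possible k-mer over the alphabet, in a fixed order."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product(alphabet, repeat=k)]


def kendall_tau_b(x, y):
    """Kendall's tau-b between two equal-length vectors (simple O(n^2) loop)."""
    concordant = discordant = ties_x = ties_y = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx, dy = x[i] - x[j], y[i] - y[j]
            if dx == 0:
                ties_x += 1          # pair tied in x (possibly in y too)
            if dy == 0:
                ties_y += 1          # pair tied in y (possibly in x too)
            if dx * dy > 0:
                concordant += 1
            elif dx * dy < 0:
                discordant += 1
    n_pairs = n * (n - 1) // 2
    denom = sqrt((n_pairs - ties_x) * (n_pairs - ties_y))
    return (concordant - discordant) / denom if denom else 0.0


# Toy sequences (hypothetical, for illustration only).
seq_a = "ACGTACGTTGCA"
seq_b = "ACGTACGATGCA"
tau = kendall_tau_b(kmer_counts(seq_a), kmer_counts(seq_b))
print(f"tau-b between 2-mer profiles: {tau:.3f}")
```

A value near 1 means the two sequences rank their k-mers by abundance in nearly the same order; no alignment is computed at any point, which is where the speed advantage of such methods comes from.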
Poecilia picta, a Close Relative to the Guppy, Exhibits Red Male Coloration Polymorphism: A System for Phylogenetic Comparisons.

Directory of Open Access Journals (Sweden)

Anna K Lindholm

Full Text Available Studies on the evolution of female preference and male color polymorphism frequently focus on single species since traits and preferences are thought to co-evolve. The guppy, Poecilia reticulata, has long been a premier model for such studies because female preferences and orange coloration are well known to covary, especially in upstream/downstream pairs of populations. However, focused single species studies lack the explanatory power of the comparative method, which requires detailed knowledge of multiple species with known evolutionary relationships. Here we describe a red color polymorphism in Poecilia picta, a close relative to guppies. We show that this polymorphism is restricted to males and is maintained in natural populations of mainland South America. Using tests of female preference we show female P. picta are not more attracted to red males, despite preferences for red/orange in closely related species, such as P. reticulata and P. parae. Male color patterns in these closely related species are different from P. picta in that they occur in discrete patches and are frequently Y chromosome-linked. P. reticulata have an almost infinite number of male patterns, while P. parae males occur in discrete morphs. We show the red male polymorphism in P. picta extends continuously throughout the body and is not a Y-linked trait despite the theoretical prediction that sexually-selected characters should often be linked to the heterogametic sex chromosome. The presence/absence of red male coloration of P. picta described here makes this an ideal system for phylogenetic comparisons that could reveal the evolutionary forces maintaining mate choice and color polymorphisms in this speciose group.
Comparison of sequences of hypervariable region (HVR) subunit S-1 gene of field isolate I-37 infectious bronchitis virus with Connecticut serotype

Directory of Open Access Journals (Sweden)

N.L.P Indi Dharmayanti

2003-06-01

Full Text Available Infectious bronchitis is a contagious, acute respiratory disease of chickens caused by infectious bronchitis virus (IBV). Antigenic differences in IBV are associated with changes in the sequence of the spike glycoprotein (S). Within the S-1 subunit, which shows more sequence variability than S-2, hypervariable regions (HVR-1 and 2) have been identified. Several IBV field isolates, including I-37, have been identified in Indonesia by the serum neutralization method. However, gene sequence variation in the HVR of subunit S-1 had not yet been identified. Isolate I-37 was close to the serotype Connecticut 46 (Conn 46). The aim of this study was to identify sequence variation in the HVR of the subunit S-1 gene of isolate I-37 using Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) and sequencing. Several procedures were carried out, including virus titration, propagation, and concentration of virus from allantoic fluid infected with IBV. RNA was then extracted for RT-PCR. The product was then sequenced, and its homology with IBV reference sequences from GenBank was compared using GenMac version 8.0. The results showed that isolate I-37 produced a 515 bp amplification product. Isolate I-37 and Conn 46 are the same serotype, yet their HVR subunit S-1 nucleotide and amino acid (protein) sequences differ by 6.9% and 15.6%, respectively. It may be concluded that isolate I-37 is a variant of Conn 46.
Phylogeny and Taxonomy of Archaea: A Comparison of the Whole-Genome-Based CVTree Approach with 16S rRNA Sequence Analysis

Directory of Open Access Journals (Sweden)

Guanghong Zuo

2015-03-01

Full Text Available A tripartite comparison of Archaea phylogeny and taxonomy at and above the rank order is reported: (1) the whole-genome-based and alignment-free CVTree using 179 genomes; (2) the 16S rRNA analysis exemplified by the All-Species Living Tree with 366 archaeal sequences; and (3) the Second Edition of Bergey’s Manual of Systematic Bacteriology complemented by some current literature. A high degree of agreement is reached at these ranks. From the newly proposed archaeal phyla, Korarchaeota, Thaumarchaeota, Nanoarchaeota and Aigarchaeota, to the recent suggestion to divide the class Halobacteria into three orders, all gain substantial support from CVTree. In addition, the CVTree helped to determine the taxonomic position of some newly sequenced genomes without proper lineage information. A few discrepancies between the CVTree and the 16S rRNA approaches call for further investigation.
Synaptotagmin gene content of the sequenced genomes

Directory of Open Access Journals (Sweden)

Craxton Molly

2004-07-01

Full Text Available Abstract Background Synaptotagmins exist as a large gene family in mammals. There is much interest in the function of certain family members which act crucially in the regulated synaptic vesicle exocytosis required for efficient neurotransmission. Knowledge of the functions of other family members is relatively poor and the presence of Synaptotagmin genes in plants indicates a role for the family as a whole which is wider than neurotransmission. Identification of the Synaptotagmin genes within completely sequenced genomes can provide the entire Synaptotagmin gene complement of each sequenced organism. Defining the detailed structures of all the Synaptotagmin genes and their encoded products can provide a useful resource for functional studies and a deeper understanding of the evolution of the gene family. The current rapid increase in the number of sequenced genomes from different branches of the tree of life, together with the public deposition of evolutionarily diverse transcript sequences make such studies worthwhile. Results I have compiled a detailed list of the Synaptotagmin genes of Caenorhabditis, Anopheles, Drosophila, Ciona, Danio, Fugu, Mus, Homo, Arabidopsis and Oryza by examining genomic and transcript sequences from public sequence databases together with some transcript sequences obtained by cDNA library screening and RT-PCR. I have compared all of the genes and investigated the relationship between plant Synaptotagmins and their non-Synaptotagmin counterparts. Conclusions I have identified and compared 98 Synaptotagmin genes from 10 sequenced genomes. Detailed comparison of transcript sequences reveals abundant and complex variation in Synaptotagmin gene expression and indicates the presence of Synaptotagmin genes in all animals and land plants. Amino acid sequence comparisons indicate patterns of conservation and diversity in function. Phylogenetic analysis shows the origin of Synaptotagmins in multicellular eukaryotes and their
Identification of Escherichia coli and Shigella Species from Whole-Genome Sequences.

Science.gov (United States)

Chattaway, Marie A; Schaefer, Ulf; Tewolde, Rediat; Dallman, Timothy J; Jenkins, Claire

2017-02-01

Escherichia coli and Shigella species are closely related and genetically constitute the same species. Differentiating between these two pathogens and accurately identifying the four species of Shigella are therefore challenging. The organism-specific bioinformatics whole-genome sequencing (WGS) typing pipelines at Public Health England are dependent on the initial identification of the bacterial species by use of a kmer-based approach. Of the 1,982 Escherichia coli and Shigella sp. isolates analyzed in this study, 1,957 (98.4%) had concordant results by both traditional biochemistry and serology (TB&S) and the kmer identification (ID) derived from the WGS data. Of the 25 mismatches identified, 10 were enteroinvasive E. coli isolates that were misidentified as Shigella flexneri or S. boydii by the kmer ID, and 8 were S. flexneri isolates misidentified by TB&S as S. boydii due to nonfunctional S. flexneri O antigen biosynthesis genes. Analysis of the population structure based on multilocus sequence typing (MLST) data derived from the WGS data showed that the remaining discrepant results belonged to clonal complex 288 (CC288), comprising both S. boydii and S. dysenteriae strains. Mismatches between the TB&S and kmer ID results were explained by the close phylogenetic relationship between the two species and were resolved with reference to the MLST data. Shigella can be differentiated from E. coli and accurately identified to the species level by use of kmer comparisons and MLST. Analysis of the WGS data provided explanations for the discordant results between TB&S and WGS data, revealed the true phylogenetic relationships between different species of Shigella, and identified emerging pathoadapted lineages. © Crown copyright 2017.
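The record above does not describe how the kmer-based identification works internally, so the following is only a generic sketch of one common approach: break the query and each reference into k-mer sets and report the reference with the highest Jaccard similarity. The function names, the value k = 5, and the toy reference sequences `ref_A` and `ref_B` are hypothetical and are not taken from the Public Health England pipeline.

```python
def kmer_set(seq, k):
    """Set of all overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def closest_reference(query, references, k=5):
    """Return the best-matching reference name and all similarity scores."""
    q = kmer_set(query, k)
    scores = {name: jaccard(q, kmer_set(seq, k)) for name, seq in references.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Toy data (hypothetical, far shorter than real genomes).
references = {
    "ref_A": "ACGTACGTAGCTAGCTAGGATC",
    "ref_B": "TTTTGGGGCCCCAAAATTTTGG",
}
query = "GTACGTAGCTAGCTAG"  # a fragment drawn from ref_A

best, scores = closest_reference(query, references)
print(best, scores)
```

With closely related organisms such as E. coli and Shigella, the top two scores can be very close, which is consistent with the abstract's point that some mismatches had to be resolved using MLST data.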
Logic verification system for power plant sequence diagrams

International Nuclear Information System (INIS)

Fukuda, Mitsuko; Yamada, Naoyuki; Teshima, Toshiaki; Kan, Ken-ichi; Utsunomiya, Mitsugu.

1994-01-01

A logic verification system for sequence diagrams of power plants has been developed. The system's main function is to verify the correctness of the logic realized by sequence diagrams for power plant control systems. The verification is based on a symbolic comparison of the logic of the sequence diagrams with the logic of the corresponding IBDs (Interlock Block Diagrams), in combination with reference to design knowledge. The developed system points out the sub-circuit that is responsible for any existing mismatches between the IBD logic and the logic realized by the sequence diagrams. Applications to the verification of actual sequence diagrams of power plants confirmed that the developed system is practical and effective. (author)
Predictive uncertainty in auditory sequence processing

DEFF Research Database (Denmark)

Hansen, Niels Chr.; Pearce, Marcus T

2014-01-01

in a melodic sequence (inferred uncertainty). Finally, we simulate listeners' perception of expectedness and uncertainty using computational models of auditory expectation. A detailed model comparison indicates which model parameters maximize fit to the data and how they compare to existing models...
Accelerated Evolution of Conserved Noncoding Sequences in the Human Genome

Energy Technology Data Exchange (ETDEWEB)

Prabhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, Edward M.

2006-07-06

Genomic comparisons between human and distant, non-primate mammals are commonly used to identify cis-regulatory elements based on constrained sequence evolution. However, these methods fail to detect "cryptic" functional elements, which are too weakly conserved among mammals to distinguish from nonfunctional DNA. To address this problem, we explored the potential of deep intra-primate sequence comparisons. We sequenced the orthologs of 558 kb of human genomic sequence, covering multiple loci involved in cholesterol homeostasis, in 6 nonhuman primates. Our analysis identified 6 noncoding DNA elements displaying significant conservation among primates, but undetectable in more distant comparisons. In vitro and in vivo tests revealed that at least three of these 6 elements have regulatory function. Notably, the mouse orthologs of these three functional human sequences had regulatory activity despite their lack of significant sequence conservation, indicating that they are cryptic ancestral cis-regulatory elements. These regulatory elements could still be detected in a smaller set of three primate species including human, rhesus and marmoset. Since the human and rhesus genome sequences are already available, and the marmoset genome is actively being sequenced, the primate-specific conservation analysis described here can be applied in the near future on a whole-genome scale, to complement the annotation provided by more distant species comparisons.
Dispersion correction through movement of the closed orbit

International Nuclear Information System (INIS)

Parzen, G.

1980-01-01

The closed orbit correction system can be used to correct the vertical dispersion by displacing the orbit at the quadrupoles and sextupoles. The accuracy of the results has been verified by comparison with exact calculations. Results for correcting the horizontal dispersion are also given.
Lacunary ideal convergence of multiple sequences

Directory of Open Access Journals (Sweden)

Bipan Hazarika

2016-01-01

An ideal I is a family of subsets of N×N which is closed under taking finite unions and subsets of its elements. In this article, the concept of lacunary ideal convergence of double sequences is introduced. The relation between lacunary ideal convergent and lacunary Cauchy double sequences is also established. Furthermore, the notions of lacunary ideal limit point and lacunary ideal cluster point are introduced, and the relation between these two notions is determined. Finally, properties such as solidity and monotonicity are studied.
Leaf sequencing algorithms for segmented multileaf collimation

International Nuclear Information System (INIS)

Kamath, Srijit; Sahni, Sartaj; Li, Jonathan; Palta, Jatinder; Ranka, Sanjay

2003-01-01

The delivery of intensity-modulated radiation therapy (IMRT) with a multileaf collimator (MLC) requires the conversion of a radiation fluence map into a leaf sequence file that controls the movement of the MLC during radiation delivery. It is imperative that the fluence map delivered using the leaf sequence file is as close as possible to the fluence map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf sequencing algorithms for segmental multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under most common leaf movement constraints that include minimum leaf separation constraint and leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bidirectional movement of the MLC leaves
Leaf sequencing algorithms for segmented multileaf collimation

Energy Technology Data Exchange (ETDEWEB)

Kamath, Srijit [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States); Sahni, Sartaj [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States); Li, Jonathan [Department of Radiation Oncology, University of Florida, Gainesville, FL (United States); Palta, Jatinder [Department of Radiation Oncology, University of Florida, Gainesville, FL (United States); Ranka, Sanjay [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States)

2003-02-07

The delivery of intensity-modulated radiation therapy (IMRT) with a multileaf collimator (MLC) requires the conversion of a radiation fluence map into a leaf sequence file that controls the movement of the MLC during radiation delivery. It is imperative that the fluence map delivered using the leaf sequence file is as close as possible to the fluence map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf sequencing algorithms for segmental multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under most common leaf movement constraints that include minimum leaf separation constraint and leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bidirectional movement of the MLC leaves.

The complete genome sequence of the plant growth-promoting bacterium Pseudomonas sp. UW4.

Directory of Open Access Journals (Sweden)

Jin Duan

The plant growth-promoting bacterium (PGPB) Pseudomonas sp. UW4, previously isolated from the rhizosphere of common reeds growing on the campus of the University of Waterloo, promotes plant growth in the presence of different environmental stresses, such as flooding, high concentrations of salt, cold, heavy metals, drought and phytopathogens. In this work, the genome sequence of UW4 was obtained by pyrosequencing and the gaps between the contigs were closed by directed PCR. The P. sp. UW4 genome contains a single circular chromosome that is 6,183,388 bp with a 60.05% G+C content. The bacterial genome contains 5,423 predicted protein-coding sequences that occupy 87.2% of the genome. Nineteen genomic islands (GIs) were predicted and thirty-one complete putative insertion sequences were identified. Genes potentially involved in plant growth promotion such as indole-3-acetic acid (IAA) biosynthesis, trehalose production, siderophore production, acetoin synthesis, and phosphate solubilization were determined. Moreover, genes that contribute to the environmental fitness of UW4 were also observed including genes responsible for heavy metal resistance such as nickel, copper, cadmium, zinc, molybdate, cobalt, arsenate, and chromate. Whole-genome comparison with other completely sequenced Pseudomonas strains and phylogeny of four concatenated "housekeeping" genes (16S rRNA, gyrB, rpoB and rpoD) of 128 Pseudomonas strains revealed that UW4 belongs to the fluorescens group, jessenii subgroup.
The Complete Genome Sequence of the Plant Growth-Promoting Bacterium Pseudomonas sp. UW4

Science.gov (United States)

Duan, Jin; Jiang, Wei; Cheng, Zhenyu; Heikkila, John J.; Glick, Bernard R.

2013-01-01

The plant growth-promoting bacterium (PGPB) Pseudomonas sp. UW4, previously isolated from the rhizosphere of common reeds growing on the campus of the University of Waterloo, promotes plant growth in the presence of different environmental stresses, such as flooding, high concentrations of salt, cold, heavy metals, drought and phytopathogens. In this work, the genome sequence of UW4 was obtained by pyrosequencing and the gaps between the contigs were closed by directed PCR. The P. sp. UW4 genome contains a single circular chromosome that is 6,183,388 bp with a 60.05% G+C content. The bacterial genome contains 5,423 predicted protein-coding sequences that occupy 87.2% of the genome. Nineteen genomic islands (GIs) were predicted and thirty one complete putative insertion sequences were identified. Genes potentially involved in plant growth promotion such as indole-3-acetic acid (IAA) biosynthesis, trehalose production, siderophore production, acetoin synthesis, and phosphate solubilization were determined. Moreover, genes that contribute to the environmental fitness of UW4 were also observed including genes responsible for heavy metal resistance such as nickel, copper, cadmium, zinc, molybdate, cobalt, arsenate, and chromate. Whole-genome comparison with other completely sequenced Pseudomonas strains and phylogeny of four concatenated “housekeeping” genes (16S rRNA, gyrB, rpoB and rpoD) of 128 Pseudomonas strains revealed that UW4 belongs to the fluorescens group, jessenii subgroup. PMID:23516524
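The G+C content reported in this record (60.05%) is the share of G and C bases among all called bases of the chromosome. A minimal Python sketch of that arithmetic follows; the function name `gc_content` is hypothetical, and the choice to skip ambiguous symbols such as N is an assumption, not a detail taken from the authors' pipeline.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a DNA sequence.

    Only unambiguous bases (A, C, G, T) are counted; symbols such as
    N or gap characters are ignored (an assumption made for this sketch).
    """
    seq = seq.upper()
    # Number of unambiguous bases in the sequence.
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / called


if __name__ == "__main__":
    # Toy sequence for illustration only, not UW4 data.
    print(round(gc_content("ATGCGCGGNNAT"), 2))
```

Applied to a full chromosome read from a FASTA file, the same ratio would be expected to reproduce a figure such as the one quoted in the abstract, provided ambiguous bases are handled the same way.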
Molecular comparison of topotypic specimens confirms Anopheles (Nyssorhynchus) dunhami Causey (Diptera: Culicidae) in the Colombian Amazon

Directory of Open Access Journals (Sweden)

Freddy Ruiz

2010-11-01

The presence of Anopheles (Nyssorhynchus) dunhami Causey in Colombia (Department of Amazonas) is confirmed for the first time through direct comparison of mtDNA cytochrome c oxidase I (COI) barcodes and nuclear rDNA second internal transcribed spacer (ITS2) sequences with topotypic specimens of An. dunhami from Tefé, Brazil. An. dunhami was identified through retrospective correlation of DNA sequences following misidentification as Anopheles nuneztovari s.l. using available morphological keys for Colombian mosquitoes. That An. dunhami occurs in Colombia, and possibly throughout the Amazon Basin, is of importance to vector control programs, as this non-vector species is morphologically similar to known malaria vectors including An. nuneztovari, Anopheles oswaldoi and Anopheles trinkae. Species identification of An. dunhami and differentiation from these closely related species are highly robust using either DNA ITS2 sequences or the COI DNA barcode. DNA methods are advocated for future differentiation of these often sympatric taxa in South America.
Analysis and comparison of fragrant gene sequence in some rice cultivars

Directory of Open Access Journals (Sweden)

Karami Noushafarin

2016-01-01

It is known that the fragrant trait in rice (Oryza sativa L.) is largely controlled by the fgr gene on chromosome 8, and it has been shown that an 8 bp deletion and three single nucleotide polymorphisms (SNPs) in exon 7 affect this trait. In this study, sequence alignment analysis of fgr exon 7 on chromosome 8 for 11 different fragrant and non-fragrant cultivars revealed that 5 aromatic rice cultivars carried the 3 SNPs and the 8 bp deletion in exon 7, which terminates prematurely at a TAA stop codon. However, 5 of the non-aromatic cultivars showed a sequence identical to the published sequence of Nipponbare, a non-fragrant Japonica variety. An exception among them was Bejar, which had the 8 bp deletion and the 3 SNPs but was non-aromatic. Sequencing can determine the nucleotide alignment of a gene and give useful information about gene function. In silico prediction showed that the protein sequence alignments of the fgr gene for the Khazar and Domsiah genotypes were different. The complete betaine aldehyde dehydrogenase enzyme belongs to the non-fragrant Khazar genotype and has the full length of 503 amino acids, while the non-functional BADH2 enzyme of the fragrant Domsiah genotype has 251 amino acids, which results in accumulation of 2-acetyl-1-pyrroline (2AP) and produces aroma in fragrant genotypes.
Illuminating choices for library prep: a comparison of library preparation methods for whole genome sequencing of Cryptococcus neoformans using Illumina HiSeq.

Directory of Open Access Journals (Sweden)

Johanna Rhodes

The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungi Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use.
Illuminating choices for library prep: a comparison of library preparation methods for whole genome sequencing of Cryptococcus neoformans using Illumina HiSeq.

Science.gov (United States)

Rhodes, Johanna; Beale, Mathew A; Fisher, Matthew C

2014-01-01

The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungi Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use.
Genomic sequencing of Pleistocene cave bears

Energy Technology Data Exchange (ETDEWEB)

Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, James R.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of ~1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
Commissioning of closed loop controls at CPP, HWP, Manuguru (Paper No. 3.5)

International Nuclear Information System (INIS)

Basu, Sukumar; Narasimham, P.L.

1992-01-01

The captive power plant (CPP) for Heavy Water Plant, Manuguru is equipped with 3x265 T/hr steam capacity boilers. The control system is built around ASEA master hardware for sequence interlocks, closed loop control, and data acquisition functions. This paper describes the configuration of the system hardware, the steps carried out during commissioning of closed loop controls in distributed digital control systems and also the problems faced during the commissioning of closed loops. (author). 3 figs
Comparative performance of double-digest RAD sequencing across divergent arachnid lineages.

Science.gov (United States)

Burns, Mercedes; Starrett, James; Derkarabetian, Shahan; Richart, Casey H; Cabrero, Allan; Hedin, Marshal

2017-05-01

Next-generation sequencing technologies now allow researchers of non-model systems to perform genome-based studies without the requirement of a (often unavailable) closely related genomic reference. We evaluated the role of restriction endonuclease (RE) selection in double-digest restriction-site-associated DNA sequencing (ddRADseq) by generating reduced representation genome-wide data using four different RE combinations. Our expectation was that RE selections targeting longer, more complex restriction sites would recover fewer loci than RE with shorter, less complex sites. We sequenced a diverse sample of non-model arachnids, including five congeneric pairs of harvestmen (Opiliones) and four pairs of spiders (Araneae). Sample pairs consisted of either conspecifics or closely related congeneric taxa, and in total 26 sample pair analyses were tested. Sequence demultiplexing, read clustering and variant calling were performed in the pyRAD program. The 6-base pair cutter EcoRI combined with methylated site-specific 4-base pair cutter MspI produced, on average, the greatest numbers of intra-individual loci and shared loci per sample pair. As expected, the number of shared loci recovered for a sample pair covaried with the degree of genetic divergence, estimated with cytochrome oxidase I sequences, although this relationship was non-linear. Our comparative results will prove useful in guiding protocol selection for ddRADseq experiments on many arachnid taxa where reference genomes, even from closely related species, are unavailable. © 2016 John Wiley & Sons Ltd.
Effects of Early Musical Experience on Auditory Sequence Memory

Directory of Open Access Journals (Sweden)

Adam T. Tierney

2008-12-01

The present study investigated a possible link between musical training and immediate memory span by testing experienced musicians and three groups of musically inexperienced subjects (gymnasts, Psychology 101 students, and video game players) on sequence memory and word familiarity tasks. By including skilled gymnasts who began studying their craft by age six, video game players, and Psychology 101 students as comparison groups, we attempted to control for some of the ways skilled musicians may differ from participants drawn from the general population in terms of gross motor skills and intensive experience in a highly skilled domain from an early age. We found that musicians displayed longer immediate memory spans than the comparison groups on auditory presentation conditions of the sequence reproductive span task. No differences were observed between the four groups on the visual conditions of the sequence memory task. These results provide additional converging support to recent findings showing that early musical experience and activity-dependent learning may selectively affect verbal rehearsal processes and the allocation of attention in sequence memory tasks.
Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana.

Directory of Open Access Journals (Sweden)

Yanan Liu

The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information for both Marmota models strongly limited the breadth and depth of their application. To address this major obstacle, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to determine the coding DNA sequences (CDS), and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two Marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC.
Parallel computation for biological sequence comparison: comparing a portable model to the native model for the Intel Hypercube.

Science.gov (United States)

Nadkarni, P M; Miller, P L

1991-01-01

A parallel program for inter-database sequence comparison was developed on the Intel Hypercube using two models of parallel programming. One version was built using machine-specific Hypercube parallel programming commands. The other version was built using Linda, a machine-independent parallel programming language. The two versions of the program provide a case study comparing these two approaches to parallelization in an important biological application area. Benchmark tests with both programs gave comparable results with a small number of processors. As the number of processors was increased, the Linda version was somewhat less efficient. The Linda version was also run without change on Network Linda, a virtual parallel machine running on a network of desktop workstations.
Comparison of double-locus sequence typing (DLST) and multilocus sequence typing (MLST) for the investigation of Pseudomonas aeruginosa populations.

Science.gov (United States)

Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S

2015-08-01

Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.
Assessment of the environmental footprint of nuclear energy systems. Comparison between closed and open fuel cycles

International Nuclear Information System (INIS)

Poinssot, Ch.; Bourg, S.; Ouvrier, N.; Combernoux, N.; Rostaing, C.; Vargas-Gonzalez, M.; Bruno, J.

2014-01-01

Energy perspectives for the current century are dominated by the anticipated significant increase of energy needs. In particular, electricity consumption is anticipated to increase by a factor higher than two before 2050. Energy choices are considered structuring political choices that imply a long-standing and stable policy based on objective criteria. LCA (life cycle analysis) is a structured basis for deriving relevant indicators which allow the comparison of a wide range of impacts of different energy sources. Within the energy mix, nuclear power is anticipated to have very low GHG emissions. However, its viability has been severely questioned by public opinion since the Fukushima accident. Therefore, a global LCA of the French nuclear fuel cycle was performed as a reference model. Results were compared in terms of impact with other energy sources. This showed that French nuclear energy is one of the least impacting energy sources, comparable with renewable energy. In a second part, the French scenario was compared with an equivalent open fuel cycle scenario. This demonstrated that an open fuel cycle would require about 16% more natural uranium, would have a bigger environmental footprint on the "non radioactive indicators" and would produce a higher volume of high level radioactive waste. - Highlights: • A life cycle analysis of the French closed nuclear fuel cycle is performed. • French nuclear energy is one of the least environmentally impacting energy sources. • The French closed fuel cycle is compared to an equivalent open fuel cycle. • An open fuel cycle would have a bigger environmental impact than the French fuel cycle. • Spent nuclear fuel recycling has a positive impact on the environmental footprint.
Fractal MapReduce decomposition of sequence alignment

Directory of Open Access Journals (Sweden)

Almeida Jonas S

2012-05-01

Background: The dramatic fall in the cost of genomic sequencing, and the increasing convenience of distributed cloud computing resources, positions the MapReduce coding pattern as a cornerstone of scalable bioinformatics algorithm development. In some cases an algorithm will find a natural distribution via use of map functions to process vectorized components, followed by a reduce of aggregate intermediate results. However, for some data analysis procedures such as sequence analysis, a more fundamental reformulation may be required. Results: In this report we describe a solution to sequence comparison that can be thoroughly decomposed into multiple rounds of map and reduce operations. The route taken makes use of iterated maps, a fractal analysis technique that has been found to provide an "alignment-free" solution to sequence analysis and comparison, that is, a solution that does not require dynamic programming, relying instead on a numeric Chaos Game Representation (CGR) data structure. This claim is demonstrated in this report by calculating the length of the longest similar segment by inspecting only the USM coordinates of two analogous units, with no resort to dynamic programming. Conclusions: The procedure described is an attempt at extreme decomposition and parallelization of sequence alignment in anticipation of a volume of genomic sequence data that cannot be met by current algorithmic frameworks. The solution found is delivered with a browser-based application (webApp), highlighting the browser's emergence as an environment for high performance distributed computing. Availability: Public distribution of accompanying software library with open source and version control at http://usm.github.com. Also available as a webApp through Google Chrome's WebStore http://chrome.google.com/webstore: search with "usm".
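The Chaos Game Representation mentioned in this record maps a DNA sequence to points in the unit square by moving, at each base, halfway from the current point toward the corner assigned to that base. A minimal Python sketch of this iterated map is given below. The corner assignment (A, C, G, T) and the starting point (0.5, 0.5) follow a commonly used convention and are assumptions here; the names `CORNERS` and `cgr_coordinates` are hypothetical and are not taken from the USM library.

```python
# Corner of the unit square assigned to each base (an assumed convention).
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_coordinates(seq, start=(0.5, 0.5)):
    """Return the list of CGR points visited while reading seq left to right."""
    x, y = start
    points = []
    for base in seq.upper():
        cx, cy = CORNERS[base]  # raises KeyError on non-ACGT symbols
        # Iterated map: move halfway toward the corner of the current base.
        x = (x + cx) / 2.0
        y = (y + cy) / 2.0
        points.append((x, y))
    return points


if __name__ == "__main__":
    a = cgr_coordinates("ACGTTGCA")[-1]
    b = cgr_coordinates("TTTTTGCA")[-1]
    # The two sequences share their last 5 bases, so each coordinate of
    # their final points differs by at most 2**-5.
    print(abs(a[0] - b[0]) <= 2 ** -5, abs(a[1] - b[1]) <= 2 ** -5)
```

Because every step halves the distance to a corner, two sequences that end with the same k bases finish within 2^-k of each other in each coordinate. This is the property that lets shared segments be inferred from coordinates alone, without dynamic programming.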
Comparison of MR sequences in early cerebral infarction at 0.5 T

International Nuclear Information System (INIS)

Saatci, I.; Baskan, O.; Cekirge, H.S.; Besim, A.

2000-01-01

To compare the diagnostic values of fluid-attenuated inversion recovery (FLAIR) and gradient spin-echo (GRASE) with those of conventional spin-echo (SE) and fast SE T2-weighted sequences in the evaluation of acute cerebrovascular lesions at 0.5 T. Material and Methods: Twenty-two consecutive patients with the clinical diagnosis of acute cerebrovascular accident were examined by MR imaging within the first 48 h of ictus. MR examination included 5-mm axial conventional SE and turbo SE (TSE) T2-weighted, dual-echo GRASE and FLAIR sequences. The patients also had pre- and postcontrast T1-weighted axial images. Two examiners evaluated the images and scored the conspicuity of the acute lesions. Results: Regardless of location, FLAIR provided the best lesion conspicuity in the detection of acute infarcts, followed by the GRASE sequence. In the posterior fossa, TSE and SE demonstrated the lesions better than GRASE and FLAIR techniques. In the detection of hemorrhagic elements within the ischemic region, TSE demonstrated statistically significant superiority over other sequences. Conclusion: In the detection of acute ischemic lesions in locations other than the posterior fossa, FLAIR provided the best lesion conspicuity among four T2-weighted sequences, including SE, TSE, GRASE and FLAIR. However, for the posterior fossa examination, preference of SE or TSE T2-weighted sequences is suggested
Comparison of MR sequences in early cerebral infarction at 0.5 T

Energy Technology Data Exchange (ETDEWEB)

Saatci, I.; Baskan, O.; Cekirge, H.S.; Besim, A. [Hacettepe Univ. Hospital, Ankara (Turkey). Radiology Dept.

2000-11-01

To compare the diagnostic values of fluid-attenuated inversion recovery (FLAIR) and gradient spin-echo (GRASE) with those of conventional spin-echo (SE) and fast SE T2-weighted sequences in the evaluation of acute cerebrovascular lesions at 0.5 T. Material and Methods: Twenty-two consecutive patients with the clinical diagnosis of acute cerebrovascular accident were examined by MR imaging within the first 48 h of ictus. MR examination included 5-mm axial conventional SE and turbo SE (TSE) T2-weighted, dual-echo GRASE and FLAIR sequences. The patients also had pre- and postcontrast T1-weighted axial images. Two examiners evaluated the images and scored the conspicuity of the acute lesions. Results: Regardless of location, FLAIR provided the best lesion conspicuity in the detection of acute infarcts, followed by the GRASE sequence. In the posterior fossa, TSE and SE demonstrated the lesions better than GRASE and FLAIR techniques. In the detection of hemorrhagic elements within the ischemic region, TSE demonstrated statistically significant superiority over other sequences. Conclusion: In the detection of acute ischemic lesions in locations other than the posterior fossa, FLAIR provided the best lesion conspicuity among four T2-weighted sequences, including SE, TSE, GRASE and FLAIR. However, for the posterior fossa examination, preference of SE or TSE T2-weighted sequences is suggested.
Phylogeny and evolution of the auks (subfamily Alcinae) based on mitochondrial DNA sequences

Science.gov (United States)

Moum, Truls; Johansen, Steinar; Erikstad, Kjell Einar; Piatt, John F.

1994-01-01

The genetic divergence and phylogeny of the auks was assessed by mitochondrial DNA sequence comparisons in a study using 19 of the 22 auk species and two outgroup representatives. We compared more than 500 nucleotides from each of two mitochondrial genes encoding 12S rRNA and the NADH dehydrogenase subunit 6. Divergence times were estimated from transversional substitutions. The dovekie (Alle alle) is related to the razorbill (Alca torda) and the murres (Uria spp). Furthermore, the Xantus's murrelet (Synthliboramphus hypoleucus) and the ancient (Synthliboramphus antiquus) and Japanese murrelets (Synthliboramphus wumizusume) are genetically distinct members of the same main lineage, whereas brachyramphine and synthliboramphine murrelets are not closely related. An early adaptive radiation of six main species groups of auks seems to trace back to Middle Miocene. Later speciation probably involved ecological differentiations and geographical isolations.
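Divergence times in this record were estimated from transversional substitutions, that is, differences in which a purine (A, G) is replaced by a pyrimidine (C, T) or the reverse. A minimal Python sketch of how transitions and transversions can be tallied between two pre-aligned sequences follows. The function name `count_substitutions` and the rule of skipping gaps and ambiguous symbols are assumptions made for illustration and do not describe the authors' actual procedure.

```python
PURINES = {"A", "G"}
BASES = {"A", "C", "G", "T"}


def count_substitutions(seq_a, seq_b):
    """Return (transitions, transversions) for two aligned DNA sequences.

    Positions where either sequence has a gap or an ambiguous symbol are
    skipped (an assumption made for this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = 0
    transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b or a not in BASES or b not in BASES:
            continue
        if (a in PURINES) == (b in PURINES):
            transitions += 1    # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    return transitions, transversions


if __name__ == "__main__":
    # Toy aligned fragments for illustration only, not auk sequences.
    print(count_substitutions("ACGT-A", "GCTTAC"))
```

Transversions accumulate more slowly than transitions and saturate later, which is one reason they are often preferred for dating older splits.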
Authentication of Closely Related Fish and Derived Fish Products Using Tandem Mass Spectrometry and Spectral Library Matching.

Science.gov (United States)

Nessen, Merel A; van der Zwaan, Dennis J; Grevers, Sander; Dalebout, Hans; Staats, Martijn; Kok, Esther; Palmblad, Magnus

2016-05-11

Proteomics methodology has seen increased application in food authentication, including tandem mass spectrometry of targeted species-specific peptides in raw, processed, or mixed food products. We have previously described an alternative principle that uses untargeted data acquisition and spectral library matching, essentially spectral counting, to compare and identify samples without the need for genomic sequence information in food species populations. Here, we present an interlaboratory comparison demonstrating how a method based on this principle performs in a realistic context. We also increasingly challenge the method by using data from different types of mass spectrometers, by trying to distinguish closely related and commercially important flatfish, and by analyzing heavily contaminated samples. The method was found to be robust in different laboratories, and 94-97% of the analyzed samples were correctly identified, including all processed and contaminated samples.
Comparison of 3D turbo spin-echo SPACE sequences with conventional 2D MRI sequences to assess the shoulder joint

Energy Technology Data Exchange (ETDEWEB)

Kloth, Jost Karsten, E-mail: jost.kloth@med.uni-heidelberg.de [Diagnostic and Interventional Radiology, University Hospital Heidelberg, Im Neuenheimer Feld 110, D-69120 Heidelberg (Germany)]; Winterstein, Marianne, E-mail: marianne.winterstein@med.uni-heidelberg.de [Diagnostic and Interventional Radiology, University Hospital Heidelberg, Im Neuenheimer Feld 110, D-69120 Heidelberg (Germany)]; Akbar, Michael, E-mail: michael.akbar@med.uni-heidelberg.de [Orthopedic and Trauma Surgery, University Hospital Heidelberg, Schlierbacher Landstraße 200a, D-69118 Heidelberg (Germany)]; Meyer, Esther, E-mail: esther.meyer@siemens.com [Siemens Healthcare, Erlangen (Germany)]; Paul, Dominik, E-mail: dominik.paul@siemens.com [Siemens Healthcare, Erlangen (Germany)]; Kauczor, Hans-Ulrich, E-mail: hans-ulrich.kauczor@med.uni-heidelberg.de [Diagnostic and Interventional Radiology, University Hospital Heidelberg, Im Neuenheimer Feld 110, D-69120 Heidelberg (Germany)]; Weber, Marc-André, E-mail: marcandre.weber@med.uni-heidelberg.de [Diagnostic and Interventional Radiology, University Hospital Heidelberg, Im Neuenheimer Feld 110, D-69120 Heidelberg (Germany)]

2014-10-15

Highlights: • 3D SPACE and conventional 2D TSE MRI for assessment of the shoulder joint were compared. • Concordance for most pathologies was substantial to almost perfect. • Examination time could be reduced up to 8 min (27%). • Regarding rotator cuff injuries an additional sagittal T2w TSE sequence in 3D protocol is recommended. - Abstract: Purpose: To determine the accuracy and reliability of three-dimensional (3D) T1- and proton density (PD)-weighted turbo spin-echo (TSE) sampling perfection with application-optimized contrasts using different flip-angle evolution (SPACE) compared with conventional 2D sequences in assessment of the shoulder-joint. Materials and methods: Ninety-three subjects were examined on a 3-T MRI system with both conventional 2D-TSE sequences in T1-, T2- and PD-weighting and 3D SPACE sequences in T1- and PD-weighting. All examinations were assessed independently by two reviewers for common pathologies of the shoulder-joint. Agreement between 2D- and 3D-sequences and inter-observer-agreement was evaluated using kappa-statistics. Results: Using conventional 2D TSE sequences as standard of reference, sensitivity, specificity, and accuracy values of 3D SPACE were 81.8%, 95.1%, and 93.5% for injuries of the supraspinatus-tendon (SSP), 81.3%, 93.5%, and 91.4% for the cartilage layer and 82.4%, 98.5%, and 97.5% for the long biceps tendon. Concordance between 2D and 3D was almost perfect for tendinopathies of the SSP (κ = 0.85), osteoarthritis (κ = 1), luxation of the biceps tendon (κ = 1) and adjacent bone marrow (κ = 0.92). Inter-observer-agreement was generally higher for conventional 2D TSE sequences (κ, 0.23–1.0), when compared to 3D SPACE sequences (κ, −0.33 to 1.0) except for disorders of the long biceps tendon and supraspinatus tendon rupture. Conclusion: Because of substantial and almost perfect concordance with conventional 2D TSE sequences for common shoulder pathologies, MRI examination-time can be reduced by nearly 40

The diploid genome sequence of an Asian individual

DEFF Research Database (Denmark)

Wang, Jun; Wang, Wei; Li, Ruiqiang

2008-01-01

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we...... used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP...... identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J...
Resampling nucleotide sequences with closest-neighbor trimming and its comparison to other methods.

Directory of Open Access Journals (Sweden)

Kouki Yonezawa

A large number of nucleotide sequences of various pathogens are available in public databases. The growth of the datasets has resulted in an enormous increase in computational costs. Moreover, due to differences in surveillance activities, the number of sequences found in databases varies from one country to another and from year to year. Therefore, it is important to study resampling methods to reduce the sampling bias. A novel algorithm, called the closest-neighbor trimming method, that resamples a given number of sequences from a large nucleotide sequence dataset was proposed. The performance of the proposed algorithm was compared with other algorithms by using the nucleotide sequences of human H3N2 influenza viruses. We compared the closest-neighbor trimming method with the naive hierarchical clustering algorithm and the k-medoids clustering algorithm. Genetic information accumulated in public databases contains sampling bias. The closest-neighbor trimming method can thin out densely sampled sequences from a given dataset. Since nucleotide sequences are among the most widely used materials for life sciences, we anticipate that applying our algorithm to various datasets will reduce sampling bias.
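The abstract says the method thins out densely sampled sequences until a given number remain, but it does not spell out the procedure. The sketch below is therefore one plausible reading, not the published algorithm: it assumes equal-length aligned sequences, uses Hamming distance, and repeatedly drops one member of the currently closest pair. The names `hamming` and `trim_closest_neighbors`, and the rule of dropping the later member, are hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of differing positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def trim_closest_neighbors(seqs, target):
    """Thin `seqs` down to `target` items by removing near-duplicates.

    Assumed rule (illustrative only): find the closest pair and delete the
    later member, then repeat. This brute-force version re-scans all pairs at
    every step, so it is only suitable for small inputs.
    """
    if target < 1:
        raise ValueError("target must be at least 1")
    kept = list(seqs)
    while len(kept) > target:
        i, j = min(
            combinations(range(len(kept)), 2),
            key=lambda pair: hamming(kept[pair[0]], kept[pair[1]]),
        )
        del kept[j]  # j > i, so the earlier member of the pair survives
    return kept


print(trim_closest_neighbors(["AAAA", "AAAT", "CCCC", "GGGG"], 3))
```

In this toy input the two nearly identical sequences form the densest region, so one of them is removed first while the isolated sequences are retained.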
Computer simulation of replacement sequences in copper

International Nuclear Information System (INIS)

Schiffgens, J.O.; Schwartz, D.W.; Ariyasu, R.G.; Cascadden, S.E.

1978-01-01

Results of computer simulations of , , and replacement sequences in copper are presented, including displacement thresholds, focusing energies, energy losses per replacement, and replacement sequence lengths. These parameters are tabulated for six interatomic potentials and shown to vary in a systematic way with potential stiffness and range. Comparisons of results from calculations made with ADDES, a quasi-dynamical code, and COMENT, a dynamical code, show excellent agreement, demonstrating that the former can be calibrated and used satisfactorily in the analysis of low energy displacement cascades. Upper limits on , , and replacement sequences were found to be approximately 10, approximately 30, and approximately 14 replacements, respectively. (author)
Multi-species sequence comparison reveals dynamic evolution of the elastin gene that has involved purifying selection and lineage-specific insertions/deletions

Directory of Open Access Journals (Sweden)

Green Eric D

2004-05-01

Background: The elastin gene (ELN) is implicated as a factor in both supravalvular aortic stenosis (SVAS) and Williams Beuren Syndrome (WBS), two diseases involving pronounced complications in mental or physical development. Although the complete spectrum of functional roles of the processed gene product remains to be established, these roles are inferred to be analogous in human and mouse. This view is supported by genomic sequence comparison, in which there are no large-scale differences in the ~1.8 Mb sequence block encompassing the common region deleted in WBS, with the exception of an overall reversed physical orientation between human and mouse. Results: Conserved synteny around ELN does not translate to a high level of conservation in the gene itself. In fact, ELN orthologs in mammals show more sequence divergence than expected for a gene with a critical role in development. The pattern of divergence is non-conventional due to an unusually high ratio of gaps to substitutions. Specifically, multi-sequence alignments of eight mammalian sequences reveal numerous non-aligning regions caused by species-specific insertions and deletions, in spite of the fact that the vast majority of aligning sites appear to be conserved and undergoing purifying selection. Conclusions: The pattern of lineage-specific, in-frame insertions/deletions in the coding exons of ELN orthologous genes is unusual and has led to unique features of the gene in each lineage. These differences may indicate that the gene has a slightly different functional mechanism in mammalian lineages, or that the corresponding regions are functionally inert. Identified regions that undergo purifying selection reflect a functional importance associated with evolutionary pressure to retain those features.
Abundance, composition and distribution of simple sequence ...

Indian Academy of Sciences (India)

δ∗(W-29, W-70) = 1.25; δ∗(W-93, W-70) = 0.75) even though they originate from different geographical regions. We can, therefore, infer that the WSSV sequences are closely related by ancestry. Table 3. Dinucleotide relative abundance in the ...
Biomolecule Sequencer: Next-Generation DNA Sequencing Technology for In-Flight Environmental Monitoring, Research, and Beyond

Science.gov (United States)

Smith, David J.; Burton, Aaron; Castro-Wallace, Sarah; John, Kristen; Stahl, Sarah E.; Dworkin, Jason Peter; Lupisella, Mark L.

2016-01-01

On the International Space Station (ISS), technologies capable of rapid microbial identification and disease diagnostics are not currently available. NASA still relies upon sample return for comprehensive, molecular-based sample characterization. Next-generation DNA sequencing is a powerful approach for identifying microorganisms in air, water, and surfaces onboard spacecraft. The Biomolecule Sequencer payload, manifested to SpaceX-9 and scheduled on the Increment 4748 research plan (June 2016), will assess the functionality of a commercially-available next-generation DNA sequencer in the microgravity environment of ISS. The MinION device from Oxford Nanopore Technologies (Oxford, UK) measures picoamp changes in electrical current dependent on nucleotide sequences of the DNA strand migrating through nanopores in the system. The hardware is exceptionally small (9.5 x 3.2 x 1.6 cm), lightweight (120 grams), and powered only by a USB connection. For the ISS technology demonstration, the Biomolecule Sequencer will be powered by a Microsoft Surface Pro3. Ground-prepared samples containing lambda bacteriophage, Escherichia coli, and mouse genomic DNA, will be launched and stored frozen on the ISS until experiment initiation. Immediately prior to sequencing, a crew member will collect and thaw frozen DNA samples, connect the sequencer to the Surface Pro3, inject thawed samples into a MinION flow cell, and initiate sequencing. At the completion of the sequencing run, data will be downlinked for ground analysis. Identical, synchronous ground controls will be used for data comparisons to determine sequencer functionality, run-time sequence, current dynamics, and overall accuracy. We will present our latest results from the ISS flight experiment, the first time DNA has ever been sequenced in space, and discuss the many potential applications of the Biomolecule Sequencer for environmental monitoring, medical diagnostics, higher fidelity and more adaptable Space Biology Human
MR pulse sequences for selective relaxation time measurements: a phantom study

DEFF Research Database (Denmark)

Thomsen, C; Jensen, K E; Jensen, M

1990-01-01

a Siemens Magnetom wholebody magnetic resonance scanner operating at 1.5 Tesla was used. For comparison six imaging pulse sequences for relaxation time measurements were tested on the same phantom. The spectroscopic pulse sequences all had an accuracy better than 10% of the reference values....
Characterizing the D2 statistic: word matches in biological sequences.

Science.gov (United States)

Forêt, Sylvain; Wilson, Susan R; Burden, Conrad J

2009-01-01

Word matches are often used in sequence comparison methods, either as a measure of sequence similarity or in the first search steps of algorithms such as BLAST or BLAT. The D2 statistic is the number of matches of words of k letters between two sequences. Recent advances have been made in the characterization of this statistic and in the approximation of its distribution. Here, these results are extended to the case of approximate word matches. We compute the exact value of the variance of the D2 statistic for the case of a uniform letter distribution, and introduce a method to provide accurate approximations of the variance in the remaining cases. This enables the distribution of D2 to be approximated for typical situations arising in biological research. We apply these results to the identification of cis-regulatory modules, and show that this method detects such sequences with a high accuracy. The ability to approximate the distribution of D2 for both exact and approximate word matches will enable the use of this statistic in a more precise manner for sequence comparison, database searches, and identification of transcription factor binding sites.
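The definition given above (D2 is the number of matches of words of k letters between two sequences) can be illustrated with a short Python sketch. It covers only the exact-match case. The approximate-match extension and the variance results described in the abstract are not reproduced, and the function names are illustrative.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping word of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(seq_a, seq_b, k):
    """Exact-match D2: number of (position in A, position in B) pairs
    whose k-letter words are identical."""
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # A word occurring m times in A and n times in B contributes m * n matches.
    return sum(n * counts_b[word] for word, n in counts_a.items())


print(d2("ACGT", "CGTA", 2))
```

Counting words once per sequence and multiplying the counts avoids comparing every pair of positions directly.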
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

OpenAIRE

Haggarty, N W; Dunbar, B; Fothergill, L A

1983-01-01

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important...
The International Nucleotide Sequence Database Collaboration.

Science.gov (United States)

Cochrane, Guy; Karsch-Mizrachi, Ilene; Nakamura, Yasukazu

2011-01-01

Under the International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org), globally comprehensive public domain nucleotide sequence is captured, preserved and presented. The partners of this long-standing collaboration work closely together to provide data formats and conventions that enable consistent data submission to their databases and support regular data exchange around the globe. Clearly defined policy and governance in relation to free access to data and relationships with journal publishers have positioned INSDC databases as a key provider of the scientific record and a core foundation for the global bioinformatics data infrastructure. While growth in sequence data volumes comes no longer as a surprise to INSDC partners, the uptake of next-generation sequencing technology by mainstream science that we have witnessed in recent years brings a step-change to growth, necessarily making a clear mark on INSDC strategy. In this article, we introduce the INSDC, outline data growth patterns and comment on the challenges of increased growth.
Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes

Science.gov (United States)

Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu

2009-01-01

Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
A third-generation microsatellite-based linkage map of the honey bee, Apis mellifera, and its comparison with the sequence-based physical map.

Science.gov (United States)

Solignac, Michel; Mougel, Florence; Vautrin, Dominique; Monnerot, Monique; Cornuet, Jean-Marie

2007-01-01

The honey bee is a key model for social behavior and this feature led to the selection of the species for genome sequencing. A genetic map is a necessary companion to the sequence. In addition, because there was originally no physical map for the honey bee genome project, a meiotic map was the only resource for organizing the sequence assembly on the chromosomes. We present the genetic (meiotic) map here and describe the main features that emerged from comparison with the sequence-based physical map. The genetic map of the honey bee is saturated and the chromosomes are oriented from the centromeric to the telomeric regions. The map is based on 2,008 markers and is about 40 Morgans (M) long, resulting in a marker density of one every 2.05 centiMorgans (cM). For the 186 megabases (Mb) of the genome mapped and assembled, this corresponds to a very high average recombination rate of 22.04 cM/Mb. Honey bee meiosis shows a relatively homogeneous recombination rate along and across chromosomes, as well as within and between individuals. Interference is higher than inferred from the Kosambi function of distance. In addition, numerous recombination hotspots are dispersed over the genome. The very large genetic length of the honey bee genome, its small physical size and an almost complete genome sequence with a relatively low number of genes suggest a very promising future for association mapping in the honey bee, particularly as the existence of haploid males allows easy bulk segregant analysis.
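The figures quoted in the abstract can be cross-checked with simple arithmetic. The Python sketch below uses only the rounded numbers stated above, so it reproduces the reported values approximately. The variable names are illustrative, and the small gap from the reported 2.05 cM spacing is presumably due to rounding in the abstract.

```python
# Values as quoted in the abstract (rounded).
physical_size_mb = 186         # mapped and assembled genome, in Mb
recombination_cm_per_mb = 22.04
marker_count = 2008

# Genetic length implied by physical size and recombination rate.
genetic_length_cm = physical_size_mb * recombination_cm_per_mb
genetic_length_morgans = genetic_length_cm / 100

# Average spacing between adjacent markers.
marker_spacing_cm = genetic_length_cm / marker_count

print(f"genetic length: {genetic_length_cm:.0f} cM ({genetic_length_morgans:.0f} M)")
print(f"marker spacing: {marker_spacing_cm:.2f} cM")
```

The implied length of roughly 41 Morgans agrees with the "about 40 Morgans" stated in the abstract.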
Expression profiling and comparative sequence derived insights into lipid metabolism

Energy Technology Data Exchange (ETDEWEB)

Callow, Matthew J.; Rubin, Edward M.

2001-12-19

Expression profiling and genomic DNA sequence comparisons are increasingly being applied to the identification and analysis of the genes involved in lipid metabolism. Not only has genome-wide expression profiling aided in the identification of novel genes involved in important processes in lipid metabolism such as sterol efflux, but the utilization of information from these studies has added to our understanding of the regulation of pathways participating in the process. Coupled with these gene expression studies, cross species comparison, searching for sequences conserved through evolution, has proven to be a powerful tool to identify important non-coding regulatory sequences as well as the discovery of novel genes relevant to lipid biology. An example of the value of this approach was the recent chance discovery of a new apolipoprotein gene (apo AV) that has dramatic effects upon triglyceride metabolism in mice and humans.
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.

Science.gov (United States)

Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai

2015-12-01

The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
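For orientation, the Ramanujan Fourier transform projects a numeric signal onto Ramanujan sums c_q(n) rather than complex exponentials. The Python sketch below computes c_q(n) directly from its definition and a finite-length estimate of the q-th coefficient. It is a naive illustration, not the fast algorithm of the paper. The step that maps amino acids to numbers under the Resonant Recognition Model is left out, so the input is assumed to be an already numeric signal.

```python
from math import cos, gcd, pi


def ramanujan_sum(q, n):
    """c_q(n): sum of cos(2*pi*k*n/q) over 1 <= k <= q with gcd(k, q) == 1."""
    total = sum(cos(2 * pi * k * n / q) for k in range(1, q + 1) if gcd(k, q) == 1)
    return round(total)  # Ramanujan sums are always integers


def rft_coefficient(signal, q):
    """Finite-N estimate of the q-th RFT coefficient:
    (1 / phi(q)) * (1 / N) * sum over n = 1..N of x(n) * c_q(n)."""
    phi = sum(1 for k in range(1, q + 1) if gcd(k, q) == 1)  # Euler's totient
    n_samples = len(signal)
    acc = sum(x * ramanujan_sum(q, n) for n, x in enumerate(signal, start=1))
    return acc / (phi * n_samples)


print([ramanujan_sum(4, n) for n in range(1, 5)])
print(rft_coefficient([1, 1, 1, 1, 1, 1], 1))
```

Because c_q(n) takes only integer values, the projection needs no complex arithmetic, which is one reason the transform is attractive for detecting low-frequency periodicities.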
Subsampled open-reference clustering creates consistent, comprehensive OTU definitions and scales to billions of sequences.

Science.gov (United States)

Rideout, Jai Ram; He, Yan; Navas-Molina, Jose A; Walters, William A; Ursell, Luke K; Gibbons, Sean M; Chase, John; McDonald, Daniel; Gonzalez, Antonio; Robbins-Pianka, Adam; Clemente, Jose C; Gilbert, Jack A; Huse, Susan M; Zhou, Hong-Wei; Knight, Rob; Caporaso, J Gregory

2014-01-01

We present a performance-optimized algorithm, subsampled open-reference OTU picking, for assigning marker gene (e.g., 16S rRNA) sequences generated on next-generation sequencing platforms to operational taxonomic units (OTUs) for microbial community analysis. This algorithm provides benefits over de novo OTU picking (clustering can be performed largely in parallel, reducing runtime) and closed-reference OTU picking (all reads are clustered, not only those that match a reference database sequence with high similarity). Because more of our algorithm can be run in parallel relative to "classic" open-reference OTU picking, it makes open-reference OTU picking tractable on massive amplicon sequence data sets (though on smaller data sets, "classic" open-reference OTU clustering is often faster). We illustrate that here by applying it to the first 15,000 samples sequenced for the Earth Microbiome Project (1.3 billion V4 16S rRNA amplicons). To the best of our knowledge, this is the largest OTU picking run ever performed, and we estimate that our new algorithm runs in less than 1/5 the time than would be required of "classic" open reference OTU picking. We show that subsampled open-reference OTU picking yields results that are highly correlated with those generated by "classic" open-reference OTU picking through comparisons on three well-studied datasets. An implementation of this algorithm is provided in the popular QIIME software package, which uses uclust for read clustering. All analyses were performed using QIIME's uclust wrappers, though we provide details (aided by the open-source code in our GitHub repository) that will allow implementation of subsampled open-reference OTU picking independently of QIIME (e.g., in a compiled programming language, where runtimes should be further reduced). Our analyses should generalize to other implementations of these OTU picking algorithms. Finally, we present a comparison of parameter settings in QIIME's OTU picking workflows and
Subsampled open-reference clustering creates consistent, comprehensive OTU definitions and scales to billions of sequences

Directory of Open Access Journals (Sweden)

Jai Ram Rideout

2014-08-01

We present a performance-optimized algorithm, subsampled open-reference OTU picking, for assigning marker gene (e.g., 16S rRNA) sequences generated on next-generation sequencing platforms to operational taxonomic units (OTUs) for microbial community analysis. This algorithm provides benefits over de novo OTU picking (clustering can be performed largely in parallel, reducing runtime) and closed-reference OTU picking (all reads are clustered, not only those that match a reference database sequence with high similarity). Because more of our algorithm can be run in parallel relative to “classic” open-reference OTU picking, it makes open-reference OTU picking tractable on massive amplicon sequence data sets (though on smaller data sets, “classic” open-reference OTU clustering is often faster). We illustrate that here by applying it to the first 15,000 samples sequenced for the Earth Microbiome Project (1.3 billion V4 16S rRNA amplicons). To the best of our knowledge, this is the largest OTU picking run ever performed, and we estimate that our new algorithm runs in less than 1/5 the time than would be required of “classic” open reference OTU picking. We show that subsampled open-reference OTU picking yields results that are highly correlated with those generated by “classic” open-reference OTU picking through comparisons on three well-studied datasets. An implementation of this algorithm is provided in the popular QIIME software package, which uses uclust for read clustering. All analyses were performed using QIIME’s uclust wrappers, though we provide details (aided by the open-source code in our GitHub repository) that will allow implementation of subsampled open-reference OTU picking independently of QIIME (e.g., in a compiled programming language, where runtimes should be further reduced). Our analyses should generalize to other implementations of these OTU picking algorithms. Finally, we present a comparison of parameter settings in
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.

Science.gov (United States)

Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing

2015-08-05

To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-HiSeq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by the two sequencing platforms ranged from 68.0 to 75.3% across the four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than that of the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific variants (89.6%) were higher than that of TargetSeq-Proton-specific variants (15.8%). In the sequencing of exonic regions, a combination of the two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the
Calculation of ionization within the close-coupling formalism

International Nuclear Information System (INIS)

Bray, I.; Fursa, D.V.

1996-05-01

A method for calculation of differential ionization cross sections from theories that use the close-coupling expansions for the total wave functions is presented. It is shown how from a single such calculation elastic, excitation, and ionization cross sections may be extracted using solely the T-matrix elements arising from solution of the coupled equations. To demonstrate the applicability of this formalism, the convergent close-coupling (CCC) theory is systematically applied at incident energies of 150-600 eV to the calculation of e-He ionization. Comparison with available measurements is generally very good. 50 refs., 17 figs
Comparison of the phenomenology of SBO sequences with and without seal LOCA in Westinghouse PWRs

International Nuclear Information System (INIS)

Mena Rosell, L.; Queral, C.; Jimenez Varas, G.

2013-01-01

Station blackout (SBO) sequences have gained notoriety since the accident at Fukushima. Within this type of sequence, whether or not a reactor coolant pump (RCP) seal LOCA occurs determines the evolution of the accident. In this work, the integrated safety analysis (ISA) methodology developed by the CSN has been applied to SBO sequences. The objective is to compare the evolution of SBO sequences over a wide spectrum of conditions and recovery times for loss of AC and DC power. The simulations were performed with the SCAIS tool coupled to MAAP. The set of simulations carried out, on the order of 2,000 sequences, clearly shows the differences in the evolution of sequences with and without seal LOCA. This type of analysis makes it possible to verify the most appropriate management of a sequence depending on whether or not a seal LOCA occurs.
A rare case of Moebius sequence

Directory of Open Access Journals (Sweden)

Abhishek Kulkarni

2012-01-01

We report a case of an 18-year-old male who presented with watering and inability to close the left eye completely for 6 months and inability to move both eyes outward and to close the mouth since childhood. Ocular, facial, and systemic examination revealed that the patient had bilateral complete lateral rectus and bilateral incomplete medial rectus palsy, left-sided facial nerve paralysis, thickening of lower lip and inability to close the mouth, along with other common musculoskeletal abnormalities. This is a typical presentation of Moebius syndrome, which is a very rare congenital neurological disorder characterized by bilateral facial and abducens nerve paralysis. This patient had bilateral incomplete medial rectus palsy, which is suggestive of the presence of horizontal gaze palsy or oculomotor nerve involvement as a component of Moebius sequence.

Identification and partial sequencing of a crocodile poxvirus associated with deeply penetrating skin lesions in farmed Nile crocodiles, Crocodylus niloticus.

Science.gov (United States)

Huchzermeyer, F W; Wallace, D B; Putterill, J F; Gerdes, G H

2009-09-01

When large numbers of crocodile skins were downgraded because of the presence of small pin prick-like holes, collapsed epidermal cysts were found deep in the dermis of juvenile crocodiles while forming cysts were observed in hatchlings. Histopathology of these forming cysts showed the presence of intracytoplasmic inclusions in proliferating and ballooning epidermal cells. Pox virions were seen in electron microscope preparations made from the scabs of such early lesions. The partial sequencing of virus material from scrapings of these lesions and comparison of it with the published sequence of crocodile poxvirus showed the virus associated with the deep lesions to be closely related, but different. To differentiate between the two forms of crocodile pox infection it is suggested that the previously known form should be called "classical crocodile pox" and the newly discovered form "atypical crocodile pox". The application of strict hygiene measures brought about a decline in the percentage of downgraded skins.
Identification and partial sequencing of a crocodile poxvirus associated with deeply penetrating skin lesions in farmed Nile crocodiles, Crocodylus niloticus

Directory of Open Access Journals (Sweden)

F.W. Huchzermeyer

2009-09-01

Full Text Available When large numbers of crocodile skins were downgraded because of the presence of small pin prick-like holes, collapsed epidermal cysts were found deep in the dermis of juvenile crocodiles while forming cysts were observed in hatchlings. Histopathology of these forming cysts showed the presence of intracytoplasmic inclusions in proliferating and ballooning epidermal cells. Pox virions were seen in electron microscope preparations made from the scabs of such early lesions. The partial sequencing of virus material from scrapings of these lesions and comparison of it with the published sequence of crocodile poxvirus showed the virus associated with the deep lesions to be closely related, but different. To differentiate between the two forms of crocodile pox infection it is suggested that the previously known form should be called "classical crocodile pox" and the newly discovered form "atypical crocodile pox". The application of strict hygiene measures brought about a decline in the percentage of downgraded skins.
Reranking candidate gene models with cross-species comparison for improved gene prediction

Directory of Open Access Journals (Sweden)

Pereira Fernando CN

2008-10-01

Full Text Available Abstract Background: Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc.). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results: We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion: Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.
Comparison of illumina and 454 deep sequencing in participants failing raltegravir-based antiretroviral therapy.

Directory of Open Access Journals (Sweden)

Jonathan Z Li

Full Text Available The impact of raltegravir-resistant HIV-1 minority variants (MVs) on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs. A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegravir resistance at the time of virologic failure. A control library was created by pooling integrase clones at predefined proportions. Multiplexed sequencing was performed with Illumina and 454 platforms at comparable costs. Illumina sequence analysis was performed with the novel snp-assess tool and 454 sequencing was analyzed with V-Phaser. Illumina sequencing resulted in significantly higher sequence coverage and a 0.095% limit of detection. Illumina accurately detected all MVs in the control library at ≥0.5% and 7/10 MVs expected at 0.1%. 454 sequencing failed to detect any MVs at 0.1% with 5 false positive calls. For MVs detected in the patient samples by both 454 and Illumina, the correlation in the detected variant frequencies was high (R2 = 0.92, P < 0.001). Illumina sequencing detected 2.4-fold greater nucleotide MVs and 2.9-fold greater amino acid MVs compared to 454. The only raltegravir-resistant MV detected was an E138K mutation in one participant by Illumina sequencing, but not by 454. In participants of A5262 with raltegravir resistance at virologic failure, baseline raltegravir-resistant MVs were rarely detected. At comparable costs to 454 sequencing, Illumina demonstrated greater depth of coverage, increased sensitivity for detecting HIV MVs, and fewer false positive variant calls.
The diploid genome sequence of an individual human.

Directory of Open Access Journals (Sweden)

Samuel Levy

2007-09-01

Full Text Available Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels) (1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, these events involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
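The 22% figure for non-SNP variation can be checked against the itemized counts in the abstract. The short sketch below does that arithmetic. It assumes that only the five enumerated variant classes contribute to the event total, since the abstract gives no counts for segmental duplications or copy number variation regions; the dictionary keys and function name are illustrative choices, not terms from the paper.

```python
# Event counts as listed in the abstract. Segmental duplications and copy
# number variation regions are omitted because no counts are given for them
# (an assumption of this sketch).
variant_counts = {
    "SNP": 3_213_401,
    "block substitution": 53_823,
    "heterozygous indel": 292_102,
    "homozygous indel": 559_473,
    "inversion": 90,
}


def non_snp_fraction(counts):
    """Return the share of variant events that are not SNPs."""
    total = sum(counts.values())
    non_snp = total - counts["SNP"]
    return non_snp / total


total_events = sum(variant_counts.values())
fraction = non_snp_fraction(variant_counts)
print(f"{total_events:,} events, {fraction:.1%} non-SNP")
```

Under that assumption the itemized classes sum to just over 4.1 million events, in line with the "more than 4.1 million DNA variants" stated in the abstract.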
Multi-species sequence comparison reveals conservation of ghrelin gene-derived splice variants encoding a truncated ghrelin peptide.

Science.gov (United States)

Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K

2016-06-01

The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

Science.gov (United States)

Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

2012-10-01

One application of next-generation sequencing (NGS) is the targeted resequencing of genes of interest, which has not previously been used in viral integration site analysis for gene therapy applications. Here, we combined a targeted sequence capture array with next-generation sequencing to address whole-genome profiling of viral integration sites. Human 293T and K562 cells were transduced with an HIV-1-derived vector. A custom-made DNA probe set targeting the pLVTHM vector was used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using the GS FLX platform. A total of 7,484 human genome sequences flanking the long terminal repeats (LTR) of pLVTHM fragment sequences matched with an identity of at least 98% over a minimum of 50 bp in both cell lines. In total, 203 unique integration sites were identified. The integrations in both cell lines were distant from CpG islands and from transcription start sites and were preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.
Entropic fluctuations in DNA sequences

Science.gov (United States)

Thanos, Dimitrios; Li, Wentian; Provata, Astero

2018-03-01

The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows to extract meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
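A minimal sketch of a block-wise entropy profile may make the idea concrete. It assumes single-base frequencies and non-overlapping blocks; the abstract does not state the authors' exact definition (which may be based on longer words or a different blocking scheme), and the function names are hypothetical.

```python
import math
from collections import Counter


def block_entropy(block):
    """Shannon entropy (in bits) of the symbol frequencies in one block."""
    counts = Counter(block)
    n = len(block)
    # H = sum over symbols of p * log2(1 / p)
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def local_shannon_entropy(sequence, block_size):
    """Entropy of each consecutive, non-overlapping block of the sequence.

    A trailing fragment shorter than block_size is ignored.
    """
    last_start = len(sequence) - block_size
    return [
        block_entropy(sequence[start:start + block_size])
        for start in range(0, last_start + 1, block_size)
    ]


# A block with all four bases equally represented reaches the 2-bit maximum;
# a homopolymer block has zero entropy.
profile = local_shannon_entropy("ACGTAAAA", block_size=4)
print(profile)
```

In a profile like this, long runs of near-constant values would correspond to the low-fluctuation regions that the abstract associates with tandem repeats.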
Molecular characterization of Fasciola gigantica from Mauritania based on mitochondrial and nuclear ribosomal DNA sequences.

Science.gov (United States)

Amor, Nabil; Farjallah, Sarra; Salem, Mohamed; Lamine, Dia Mamadou; Merella, Paolo; Said, Khaled; Ben Slimane, Badreddine

2011-10-01

Fasciolosis caused by Fasciola hepatica and Fasciola gigantica (Platyhelminthes: Trematoda: Digenea) is considered the most important helminth infection of ruminants in tropical countries, causing considerable socioeconomic problems. In Africa, F. gigantica has previously been characterized from Burkina Faso, Senegal, Kenya, Zambia and Mali, while F. hepatica has been reported from Morocco and Tunisia. Both species have been observed in Ethiopia and Egypt on the basis of morphometric differences, although molecular markers are necessary to distinguish exactly between the species. Samples identified morphologically as F. gigantica (n=60) from sheep and cattle from different geographical localities of Mauritania were genetically characterized by sequences of the first (ITS-1), the 5.8S, and second (ITS-2) Internal Transcribed Spacers (ITS) of nuclear ribosomal DNA (rDNA) genes and the mitochondrial Cytochrome c Oxidase I (COI) gene. Comparison of the sequences of the Mauritanian samples with sequences of Fasciola spp. from GenBank confirmed that all samples belong to the species F. gigantica. The nucleotide sequencing of ITS rDNA of F. gigantica showed no nucleotide variation in the ITS-1, 5.8S, and ITS-2 rDNA sequences among all samples examined and those from Burkina Faso, Kenya, Egypt and Iran. The phylogenetic trees based on the ITS-1 and ITS-2 sequences showed a close relationship of the Mauritanian samples with isolates of F. gigantica from different localities of Africa and Asia. The COI genotypes of the Mauritanian specimens of F. gigantica had a high level of diversity, and they belonged to the phylogenetically distinguishable F. gigantica clade. The present study is the first molecular characterization of F. gigantica in sheep and cattle from Mauritania, allowing a reliable approach for the genetic differentiation of Fasciola spp. and providing a basis for further studies on liver flukes in African countries. Copyright © 2011 Elsevier Inc. All
Murine mammary tumor virus pol-related sequences in human DNA: characterization and sequence comparison with the complete murine mammary tumor virus pol gene

International Nuclear Information System (INIS)

Deen, K.C.; Sweet, R.W.

1986-01-01

Sequences in the human genome with homology to the murine mammary tumor virus (MMTV) pol gene were isolated from a human phage library. Ten clones with extensive pol homology were shown to define five separate loci. These loci share common sequences immediately adjacent to the pol-like segments and, in addition, contain a related repeat element which bounds this region. This organization is suggestive of a proviral structure. The authors estimate that the human genome contains 30 to 40 copies of these pol-related sequences. The pol region of one of the cloned segments (HM16) and the complete MMTV pol gene were sequenced and compared. The nucleotide homology between these pol sequences is 52% and is concentrated in the terminal regions. The MMTV pol gene contains a single long open reading frame encoding 899 amino acids and is demarcated from the partially overlapping putative gag gene by termination codons and a shift in translational reading frame. The pol sequence of HM16 is multiply terminated but does contain open reading frames which encode 370, 105, and 112 amino acid residues in separate reading frames. The authors deduced a composite pol protein sequence for HM16 by aligning it to the MMTV pol gene and then compared these sequences with other retroviral pol protein sequences. Conserved sequences occur in both the amino and carboxyl regions, which lie within the polymerase and endonuclease domains of pol, respectively.
SIS: a program to generate draft genome sequence scaffolds for prokaryotes

Directory of Open Access Journals (Sweden)

Dias Zanoni

2012-05-01

Full Text Available Abstract Background Decreasing costs of DNA sequencing have made prokaryotic draft genome sequences increasingly common. A contig scaffold is an ordering of contigs in the correct orientation. A scaffold can help genome comparisons and guide gap closure efforts. One popular technique for obtaining contig scaffolds is to map contigs onto a reference genome. However, rearrangements that may exist between the query and reference genomes may result in incorrect scaffolds, if these rearrangements are not taken into account. Large-scale inversions are common rearrangement events in prokaryotic genomes. Even in draft genomes it is possible to detect the presence of inversions given sufficient sequencing coverage and a sufficiently close reference genome. Results We present a linear-time algorithm that can generate a set of contig scaffolds for a draft genome sequence represented in contigs given a reference genome. The algorithm is aimed at prokaryotic genomes and relies on the presence of matching sequence patterns between the query and reference genomes that can be interpreted as the result of large-scale inversions; we call these patterns inversion signatures. Our algorithm is capable of correctly generating a scaffold if at least one member of every inversion signature pair is present in contigs and no inversion signatures have been overwritten in evolution. The algorithm is also capable of generating scaffolds in the presence of any kind of inversion, even though in this general case there is no guarantee that all scaffolds in the scaffold set will be correct. We compare the performance of sis, the program that implements the algorithm, to seven other scaffold-generating programs. The results of our tests show that sis has overall better performance. Conclusions sis is a new easy-to-use tool to generate contig scaffolds, available both as stand-alone and as a web server. The good performance of sis in our tests adds evidence that large
First complete genome sequence of canine bocavirus 2 in mainland China

Directory of Open Access Journals (Sweden)

S.-L. Zhai

2017-07-01

Full Text Available We obtained the first full-length genome sequence of canine bocavirus 2 (CBoV2) from the faeces of a healthy dog in Guangzhou city, Guangdong province, mainland China. The genome of GZHD15 consisted of 5059 nucleotides. Sequence analysis suggested that GZHD15 was close to a previously circulated Hong Kong isolate.
Comparison of microbial DNA enrichment tools for metagenomic whole genome sequencing.

Science.gov (United States)

Thoendel, Matthew; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Yao, Janet Z; Chia, Nicholas; Hanssen, Arlen D; Abdel, Matthew P; Patel, Robin

2016-08-01

Metagenomic whole genome sequencing for detection of pathogens in clinical samples is an exciting new area for discovery and clinical testing. A major barrier to this approach is the overwhelming ratio of human to pathogen DNA in samples with low pathogen abundance, which is typical of most clinical specimens. Microbial DNA enrichment methods offer the potential to relieve this limitation by improving this ratio. Two commercially available enrichment kits, the NEBNext Microbiome DNA Enrichment Kit and the Molzym MolYsis Basic kit, were tested for their ability to enrich for microbial DNA from resected arthroplasty component sonicate fluids from prosthetic joint infections or uninfected sonicate fluids spiked with Staphylococcus aureus. Using spiked uninfected sonicate fluid there was a 6-fold enrichment of bacterial DNA with the NEBNext kit and 76-fold enrichment with the MolYsis kit. Metagenomic whole genome sequencing of sonicate fluid revealed 13- to 85-fold enrichment of bacterial DNA using the NEBNext enrichment kit. The MolYsis approach achieved 481- to 9580-fold enrichment, resulting in 7 to 59% of sequencing reads being from the pathogens known to be present in the samples. These results demonstrate the usefulness of these tools when testing clinical samples with low microbial burden using next generation sequencing. Copyright © 2016 Elsevier B.V. All rights reserved.
Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

Directory of Open Access Journals (Sweden)

Stephan Pabinger

Full Text Available Targeted sequencing of PCR amplicons generated from bisulfite-deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform-specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage
CompariMotif: quick and easy comparisons of sequence motifs.

Science.gov (United States)

Edwards, Richard J; Davey, Norman E; Shields, Denis C

2008-05-15

CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/
Bisulfite sequencing reveals that Aspergillus flavus holds a hollow in DNA methylation.

Directory of Open Access Journals (Sweden)

Si-Yang Liu

Full Text Available Aspergillus flavus first gained scientific attention for its production of aflatoxin. The underlying regulation of aflatoxin biosynthesis has served as a theoretical model for the biosynthesis of other microbial secondary metabolites. Nevertheless, for several decades, the DNA methylation status of A. flavus, one of the important epigenomic modifications involved in gene regulation, has remained controversial. Here, we applied bisulfite sequencing in conjunction with a biological replicate strategy to investigate the DNA methylation profile of the A. flavus genome. Both the bisulfite sequencing data and the methylome comparisons with other fungi confirm that the DNA methylation level of this fungus is negligible. Further investigation into the DNA methyltransferase of Aspergillus uncovers its close relationship with RID-like enzymes as well as its divergence from the methyltransferases of species with validated DNA methylation. The lack of repeat content in the A. flavus genome and the high RIP-index of the small amount of remnant repeats potentially support our speculation that DNA methylation may be absent in A. flavus or that it may possess de novo DNA methylation which occurs very transiently during the obscure sexual stage of this fungal species. This work contributes to our understanding of the DNA methylation status of A. flavus, and reinforces our views on DNA methylation in fungal species. In addition, our strategy of applying bisulfite sequencing to DNA methylation detection in species with low DNA methylation may serve as a reference for later scientific investigations in other hypomethylated species.
The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human

Directory of Open Access Journals (Sweden)

Yuan Qiaoping

2012-06-01

Full Text Available Abstract Background: As a model organism in biomedicine, the rhesus macaque (Macaca mulatta) is the most widely used nonhuman primate. Although a draft genome sequence was completed in 2007, there has been no systematic genome-wide comparison of genetic variation of this species to humans. Comparative analysis of functional and nonfunctional diversity in this highly abundant and adaptable non-human primate could inform its use as a model for human biology, and could reveal how variation in population history and size alters patterns and levels of sequence variation in primates. Results: We sequenced the mRNA transcriptome and H3K4me3-marked DNA regions in hippocampus from 14 humans and 14 rhesus macaques. Using equivalent methodology and sampling spaces, we identified 462,802 macaque SNPs, most of which were novel and disproportionately located in the functionally important genomic regions we had targeted in the sequencing. At least one SNP was identified in each of 16,797 annotated macaque genes. Accuracy of macaque SNP identification was conservatively estimated to be >90%. Comparative analyses using SNPs equivalently identified in the two species revealed that rhesus macaque has approximately three times higher SNP density and average nucleotide diversity as compared to the human. Based on this level of diversity, the effective population size of the rhesus macaque is approximately 80,000 which contrasts with an effective population size of less than 10,000 for humans. Across five categories of genomic regions, intergenic regions had the highest SNP density and average nucleotide diversity and CDS (coding sequences) the lowest, in both humans and macaques. Although there are more coding SNPs (cSNPs) per individual in macaques than in humans, the ratio of dN/dS is significantly lower in the macaque. Furthermore, the number of damaging nonsynonymous cSNPs (have damaging effects on protein functions from PolyPhen-2 prediction) in the macaque is more
Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

Science.gov (United States)

2012-01-01

Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence
Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™.

Science.gov (United States)

Eduardoff, M; Gross, T E; Santos, C; de la Puente, M; Ballard, D; Strobl, C; Børsting, C; Morling, N; Fusco, L; Hussing, C; Egyed, B; Souto, L; Uacyisrael, J; Syndercombe Court, D; Carracedo, Á; Lareu, M V; Schneider, P M; Parson, W; Phillips, C

2016-07-01

The EUROFORGEN Global ancestry-informative SNP (AIM-SNPs) panel is a forensic multiplex of 128 markers designed to differentiate an individual's ancestry from amongst the five continental population groups of Africa, Europe, East Asia, Native America, and Oceania. A custom multiplex of AmpliSeq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures, and the ancestry differentiation power of the final panel design, which required substitution of three original ancestry-informative SNPs with alternatives. Fourteen populations that had not been previously analyzed were genotyped using the custom multiplex and these studies allowed assessment of genotyping performance by comparison of data across five laboratories. Results indicate a low level of genotyping error can still occur from sequence misalignment caused by homopolymeric tracts close to the target SNP, despite careful scrutiny of candidate SNPs at the design stage. Such sequence misalignment required the exclusion of component SNP rs2080161 from the Global AIM-SNPs panel. However, the overall genotyping precision and sensitivity of this custom multiplex indicates the Ion PGM™ assay for the Global AIM-SNPs is highly suitable for forensic ancestry analysis with massively parallel sequencing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Genetic Relationships among Reptilian and Mammalian Campylobacter fetus Strains Determined by Multilocus Sequence Typing

NARCIS (Netherlands)

Dingle, K.E.; Blaser, M.J.; Tu, Z.C.; Pruckler, J.; Fitzgerald, C.; Bergen, van M.A.P.; Lawson, A.J.; Owen, R.J.; Wagenaar, J.A.

2010-01-01

Reptile Campylobacter fetus isolates and closely related strains causing human disease were characterized by multilocus sequence typing. They shared ~90% nucleotide sequence identity with classical mammalian C. fetus, and there was evidence of recombination among members of these two

Genotyping of major histocompatibility complex Class II DRB gene in Rohilkhandi goats by polymerase chain reaction-restriction fragment length polymorphism and DNA sequencing

Directory of Open Access Journals (Sweden)

Kush Shrivastava

2015-10-01

Aim: To study the major histocompatibility complex (MHC) Class II DRB1 gene polymorphism in Rohilkhandi goats using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and nucleotide sequencing techniques. Materials and Methods: DNA was isolated from 127 Rohilkhandi goats maintained at the sheep and goat farm, Indian Veterinary Research Institute, Izatnagar, Bareilly. A 284 bp fragment of exon 2 of the DRB1 gene was amplified and digested using BsaI and TaqI restriction enzymes. Population genetic parameters were calculated using Popgene v 1.32 and SAS 9.0. The genotypes were then sequenced using the Sanger dideoxy chain termination method and were compared with related breeds/species using MEGA 6.0 and Megalign (DNASTAR) software. Results: The TaqI locus showed three genotypes and the BsaI locus showed two. Both loci were found to be in Hardy–Weinberg equilibrium (HWE); however, population genetic parameters suggest that heterozygosity is still maintained in the population at both loci. Percent diversity and divergence matrix, as well as phylogenetic analysis, revealed that the MHC Class II DRB1 gene of Rohilkhandi goats clustered closely with Garole and Scottish Blackface sheep breeds rather than with the other goat breeds included in the sequence comparison. Conclusion: The PCR-RFLP patterns showed the population to be in HWE, with one genotype absent at one locus (BsaI). Both loci showed an excess of one or the other homozygote genotype; however, the effective number of alleles showed that allelic diversity is present in the population. Sequence comparison of the DRB1 gene of Rohilkhandi goat with other sheep and goat breeds placed the Rohilkhandi goat as divergent from Jamanupari and Angora goats.
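The Hardy–Weinberg check mentioned in this abstract can be illustrated with a short Python sketch. It is not the authors' workflow (they used Popgene and SAS); the function names and the genotype counts are hypothetical, and only a biallelic locus is assumed.

```python
def allele_frequency(n_aa, n_ab, n_bb):
    """Frequency of allele A from genotype counts at a biallelic locus."""
    n = n_aa + n_ab + n_bb
    return (2 * n_aa + n_ab) / (2 * n)


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic comparing observed genotype counts with
    Hardy-Weinberg expectations (1 degree of freedom for two alleles)."""
    n = n_aa + n_ab + n_bb
    p = allele_frequency(n_aa, n_ab, n_bb)
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation to avoid division by zero.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


def effective_number_of_alleles(n_aa, n_ab, n_bb):
    """Effective number of alleles, 1 / (p^2 + q^2)."""
    p = allele_frequency(n_aa, n_ab, n_bb)
    q = 1.0 - p
    return 1.0 / (p * p + q * q)


# Hypothetical genotype counts for 127 animals (not data from the study).
stat = hwe_chi_square(40, 60, 27)
ne = effective_number_of_alleles(40, 60, 27)
print(f"chi-square = {stat:.3f}, effective alleles = {ne:.3f}")
```

A statistic below the 5% critical value for one degree of freedom (about 3.84) would be read as no significant departure from equilibrium.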
Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter.

Science.gov (United States)

Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C

2017-06-20

Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

Science.gov (United States)

Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

1993-02-01

A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

Science.gov (United States)

Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A; Larsen, Martin Jakob

2016-01-01

Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.
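One simple way to quantify the "agreement" between callers described above is a pairwise Jaccard index over their call sets. The Python sketch below is only an illustration under that assumption; the abstract does not state which agreement measure was used, and the caller names and variants here are made up.

```python
from itertools import combinations


def jaccard(calls_a, calls_b):
    """Jaccard index: shared calls divided by all calls made by either caller."""
    a, b = set(calls_a), set(calls_b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def pairwise_agreement(calls_by_caller):
    """Jaccard index for every pair of callers, keyed by (caller, caller)."""
    names = sorted(calls_by_caller)
    return {
        (x, y): jaccard(calls_by_caller[x], calls_by_caller[y])
        for x, y in combinations(names, 2)
    }


# Hypothetical call sets; each variant is (chromosome, position, ref, alt).
calls = {
    "caller_1": {("chr1", 100, "A", "T"), ("chr1", 200, "G", "C")},
    "caller_2": {("chr1", 100, "A", "T")},
    "caller_3": {("chr2", 300, "C", "G")},
}
for pair, score in pairwise_agreement(calls).items():
    print(pair, round(score, 2))
```

Representing each variant as a tuple keeps the comparison exact; a real comparison would also need to normalize indel representations before matching.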
Algorithms for optimal sequencing of dynamic multileaf collimators

Energy Technology Data Exchange (ETDEWEB)

Kamath, Srijit [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States)]; Sahni, Sartaj [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States)]; Palta, Jatinder [Department of Radiation Oncology, University of Florida, Gainesville, FL (United States)]; Ranka, Sanjay [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States)]

2004-01-07

Dynamic multileaf collimator (DMLC) intensity modulated radiation therapy (IMRT) is used to deliver intensity modulated beams using a multileaf collimator (MLC), with the leaves in motion. DMLC-IMRT requires the conversion of a radiation intensity map into a leaf sequence file that controls the movement of the MLC while the beam is on. It is imperative that the intensity map delivered using the leaf sequence file be as close as possible to the intensity map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf-sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf-sequencing algorithms for dynamic multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under the most common leaf movement constraints that include leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bi-directional movement of the MLC leaves.
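The idea of unidirectional leaf sequencing can be sketched for a single leaf pair as follows. This Python fragment is a textbook-style illustration, not the authors' published algorithm: it assumes a non-negative 1-D intensity profile sampled at discrete points and ignores leaf-speed and interdigitation constraints. The function name is hypothetical.

```python
def unidirectional_leaf_sequence(profile):
    """Split a 1-D intensity profile into cumulative left- and right-leaf profiles.

    left[i] and right[i] are the cumulative monitor units (MU) at which the
    left and right leaf pass sample point i while sweeping in one direction.
    The intensity delivered at point i is left[i] - right[i].
    """
    left, right = [], []
    cum_left = cum_right = 0
    previous = 0
    for value in profile:
        if value >= previous:
            # Rising intensity: the increase is assigned to the left leaf.
            cum_left += value - previous
        else:
            # Falling intensity: the decrease is assigned to the right leaf.
            cum_right += previous - value
        left.append(cum_left)
        right.append(cum_right)
        previous = value
    return left, right


# Hypothetical intensity profile in MU.
profile = [2, 3, 1, 4, 2]
left, right = unidirectional_leaf_sequence(profile)
delivered = [l - r for l, r in zip(left, right)]
total_mu = left[-1]  # equals the sum of the positive increments of the profile
print(left, right, delivered, total_mu)
```

In this sketch the total beam-on time equals the sum of the rising steps of the profile, which is the quantity that MU-efficiency arguments compare across sequencing strategies.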
Algorithms for optimal sequencing of dynamic multileaf collimators

International Nuclear Information System (INIS)

Kamath, Srijit; Sahni, Sartaj; Palta, Jatinder; Ranka, Sanjay

2004-01-01

Dynamic multileaf collimator (DMLC) intensity modulated radiation therapy (IMRT) is used to deliver intensity modulated beams using a multileaf collimator (MLC), with the leaves in motion. DMLC-IMRT requires the conversion of a radiation intensity map into a leaf sequence file that controls the movement of the MLC while the beam is on. It is imperative that the intensity map delivered using the leaf sequence file be as close as possible to the intensity map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf-sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf-sequencing algorithms for dynamic multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under the most common leaf movement constraints that include leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bi-directional movement of the MLC leaves.
Whole Genome Sequencing of Enterovirus species C Isolates by High-throughput Sequencing: Development of Generic Primers

Directory of Open Access Journals (Sweden)

Maël Bessaud

2016-08-01

Enteroviruses are among the most common viruses infecting humans and can cause diverse clinical syndromes ranging from minor febrile illness to severe and potentially fatal diseases. Enterovirus species C (EV-C) consists of more than 20 types, among which are the 3 serotypes of polioviruses, the etiological agents of poliomyelitis. Biodiversity and evolution of EV-C genomes are shaped by frequent recombination events. Therefore, identification and characterization of circulating EV-C strains require the sequencing of different genomic regions. A simple method was developed to quickly sequence the entire genome of EV-C isolates. Four overlapping fragments were produced separately by RT-PCR performed with generic primers. The four amplicons were then pooled and purified prior to sequencing by a high-throughput technique. The method was assessed on a panel of EV-Cs belonging to a wide range of types. It can be used to determine full-length genome sequences through de novo assembly of thousands of reads. It was also able to discriminate reads from closely related viruses in mixtures. By decreasing the workload compared to classical Sanger-based techniques, this method will serve as a valuable tool for sequencing large panels of EV-Cs isolated in cell cultures during environmental surveillance or from patients, including vaccine-derived polioviruses.
Next-Generation Sequencing Reveals the Impact of Repetitive DNA Across Phylogenetically Closely Related Genomes of Orobanchaceae

Science.gov (United States)

Piednoël, Mathieu; Aberer, Andre J.; Schneeweiss, Gerald M.; Macas, Jiri; Novak, Petr; Gundlach, Heidrun; Temsch, Eva M.; Renner, Susanne S.

2013-01-01

We used next-generation sequencing to characterize the genomes of nine species of Orobanchaceae of known phylogenetic relationships, different life forms, and including a polyploid species. The study species are the autotrophic, nonparasitic Lindenbergia philippensis, the hemiparasitic Schwalbea americana, and seven nonphotosynthetic parasitic species of Orobanche (Orobanche crenata, Orobanche cumana, Orobanche gracilis (tetraploid), and Orobanche pancicii) and Phelipanche (Phelipanche lavandulacea, Phelipanche purpurea, and Phelipanche ramosa). Ty3/Gypsy elements comprise 1.93%–28.34% of the nine genomes and Ty1/Copia elements comprise 8.09%–22.83%. When compared with L. philippensis and S. americana, the nonphotosynthetic species contain higher proportions of repetitive DNA sequences, perhaps reflecting relaxed selection on genome size in parasitic organisms. Among the parasitic species, those in the genus Orobanche have smaller genomes but higher proportions of repetitive DNA than those in Phelipanche, mostly due to a diversification of repeats and an accumulation of Ty3/Gypsy elements. Genome downsizing in the tetraploid O. gracilis probably led to sequence loss across most repeat types. PMID:22723303
Multilocus Sequence Analysis and rpoB Sequencing of Mycobacterium abscessus (Sensu Lato) Strains

Science.gov (United States)

Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

2011-01-01

Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536T, M. massiliense CIP 108297T, and M. bolletii CIP 108541T) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering
Multilocus sequence analysis and rpoB sequencing of Mycobacterium abscessus (sensu lato) strains.

Science.gov (United States)

Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

2011-02-01

Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the
Intersampler field comparison of Respicon®, IOM, and closed-face 25-mm personal aerosol samplers during primary production of aluminium.

Science.gov (United States)

Skaugset, Nils Petter; Ellingsen, Dag G; Notø, Hilde; Jordbekken, Lars; Thomassen, Yngvar

2013-10-01

Intersampler field comparison of Respicon®, 25-mm closed-face 'total dust' cassette (CFC), and IOM inhalable aerosol sampler was conducted in pot rooms at seven aluminium smelters. The aerosol mass and water-soluble fluoride were selected as airborne contaminants for the comparisons. The aerosol masses of 889 sample pairs of IOM and Respicon® inhalable aerosol sub-fraction, 165 of IOM and 25-mm CFC, and 194 of CFC and Respicon® thoracic aerosol sub-fraction were compared. The number of sample pairs for the comparison of water-soluble fluoride was 906, 170, and 195, respectively. The geometric mean aerosol mass collected with the inhalable Respicon® was 2.91 mg m⁻³ compared with 3.38 mg m⁻³ with the IOM. The overall ratio between IOM and Respicon® inhalable sub-fraction was 1.16 [95% confidence interval (CI) = 1.11-1.21] for aerosol mass and 1.13 (95% CI = 1.08-1.18) for water-soluble fluoride. The results indicate that Respicon® undersampled the aerosol mass and water-soluble fluoride in the inhalable sub-fraction compared with the IOM. The results indicated undersampling of the Respicon® at mass concentrations higher than 1.35 mg m⁻³ and oversampling at lower mass concentrations. The overall ratio between aerosol mass collected with IOM and CFC was 4.19 (95% CI = 3.79-4.64) and 1.61 (95% CI = 1.51-1.72) for water-soluble fluoride. Thus, for this industry, a correction factor of 4.2 is suggested for the conversion of CFC to inhalable aerosol masses and a conversion factor of 1.6 for water-soluble fluoride if wall deposits in the CFC are included. CFC and thoracic Respicon® collected similar aerosol masses (ratio = 1.04; 95% CI = 0.97-1.12), whereas the ratio was 1.19 (95% CI = 1.11-1.28) for water-soluble fluoride. The variability of the exposure is substantial; thus, large data sets are required in sampler performance field comparisons.
Microbial culturomics to isolate halophilic bacteria from table salt: genome sequence and description of the moderately halophilic bacterium Bacillus salis sp. nov.

Science.gov (United States)

Seck, E H; Diop, A; Armstrong, N; Delerce, J; Fournier, P-E; Raoult, D; Khelaifia, S

2018-05-01

Bacillus salis strain ES3 T (= CSUR P1478 = DSM 100598) is the type strain of B. salis sp. nov. It is an aerobic, Gram-positive, moderately halophilic, motile and spore-forming bacterium. It was isolated from commercial table salt as part of a broad culturomics study aiming to maximize the culture conditions for the in-depth exploration of halophilic bacteria in salty food. Here we describe the phenotypic characteristics of this isolate, its complete genome sequence and annotation, together with a comparison with closely related bacteria. Phylogenetic analysis based on 16S rRNA gene sequences indicated 97.5% similarity with Bacillus aquimaris, the closest species. The 8 329 771 bp long genome (one chromosome, no plasmids) exhibits a G+C content of 39.19%. It is composed of 18 scaffolds with 29 contigs. Of the 8303 predicted genes, 8109 were protein-coding genes and 194 were RNAs. A total of 5778 genes (71.25%) were assigned a putative function.
MRI Sequences in Head & Neck Radiology - State of the Art.

Science.gov (United States)

Widmann, Gerlig; Henninger, Benjamin; Kremser, Christian; Jaschke, Werner

2017-05-01

Background: Magnetic resonance imaging (MRI) has become an essential imaging modality for the evaluation of head & neck pathologies. However, the diagnostic power of MRI is strongly related to the appropriate selection and interpretation of imaging protocols and sequences. The aim of this article is to review state-of-the-art sequences for the clinical routine in head & neck MRI and to describe the evidence for which medical question these sequences and techniques are useful. Method: Literature review of state-of-the-art sequences in head & neck MRI. Results and Conclusion: Basic sequences (T1w, T2w, T1wC+) and fat suppression techniques (TIRM/STIR, Dixon, Spectral Fat sat) are important tools in the diagnostic workup of inflammation, congenital lesions and tumors including staging. Additional sequences (SSFP (CISS, FIESTA), SPACE, VISTA, 3D-FLAIR) are used for pathologies of the cranial nerves, labyrinth and evaluation of endolymphatic hydrops in Menière's disease. Vessel and perfusion sequences (3D-TOF, TWIST/TRICKS angiography, DCE) are used in vascular contact syndromes, vascular malformations and analysis of microvascular parameters of tissue perfusion. Diffusion-weighted imaging (EPI-DWI, non-EPI-DWI, RESOLVE) is helpful in cholesteatoma imaging, estimation of malignancy, and evaluation of treatment response and posttreatment recurrence in head & neck cancer. Understanding of MRI sequences and close collaboration with referring physicians improve the diagnostic confidence of MRI in the daily routine and drive further research in this fascinating imaging modality. Key Points: · Understanding of MRI sequences is essential for the correct and reliable interpretation of MRI findings. · MRI protocols have to be carefully selected based on relevant clinical information. · Close collaboration with referring physicians improves the output obtained from the diagnostic possibilities of MRI.
A 28,000 Years Old Cro-Magnon mtDNA Sequence Differs from All Potentially Contaminating Modern Sequences

Science.gov (United States)

Caramelli, David; Milani, Lucio; Vai, Stefania; Modi, Alessandra; Pecchioli, Elena; Girardi, Matteo; Pilli, Elena; Lari, Martina; Lippi, Barbara; Ronchitelli, Annamaria; Mallegni, Francesco; Casoli, Antonella; Bertorelle, Giorgio; Barbujani, Guido

2008-01-01

Background: DNA sequences from ancient specimens may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal) and early modern (Cro-Magnoid) Europeans. Methodology/Principal Findings: We typed the mitochondrial DNA (mtDNA) hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23) and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. Conclusions/Significance: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans. PMID:18628960
A 28,000 years old Cro-Magnon mtDNA sequence differs from all potentially contaminating modern sequences.

Directory of Open Access Journals (Sweden)

David Caramelli

BACKGROUND: DNA sequences from ancient specimens may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal) and early modern (Cro-Magnoid) Europeans. METHODOLOGY/PRINCIPAL FINDINGS: We typed the mitochondrial DNA (mtDNA) hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23) and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. CONCLUSIONS/SIGNIFICANCE: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans.
Mesoscopic modeling of DNA denaturation rates: Sequence dependence and experimental comparison

Energy Technology Data Exchange (ETDEWEB)

Dahlen, Oda, E-mail: oda.dahlen@ntnu.no; Erp, Titus S. van, E-mail: titus.van.erp@ntnu.no [Department of Chemistry, Norwegian University of Science and Technology (NTNU), Høgskoleringen 5, Realfagbygget D3-117 7491 Trondheim (Norway)

2015-06-21

Using rare event simulation techniques, we calculated DNA denaturation rate constants for a range of sequences and temperatures for the Peyrard-Bishop-Dauxois (PBD) model with two different parameter sets. We studied a larger variety of sequences compared to previous studies that only consider DNA homopolymers and DNA sequences containing an equal amount of weak AT- and strong GC-base pairs. Our results show that, contrary to previous findings, an even distribution of the strong GC-base pairs does not always result in the fastest possible denaturation. In addition, we applied an adaptation of the PBD model to study hairpin denaturation for which experimental data are available. This is the first quantitative study in which dynamical results from the mesoscopic PBD model have been compared with experiments. Our results show that present parameterized models, although giving good results regarding thermodynamic properties, overestimate denaturation rates by orders of magnitude. We believe that our dynamical approach is, therefore, an important tool for verifying DNA models and for developing next generation models that have higher predictive power than present ones.
Complete genome sequence of Beijerinckia indica subsp. indica.

Science.gov (United States)

Tamas, Ivica; Dedysh, Svetlana N; Liesack, Werner; Stott, Matthew B; Alam, Maqsudul; Murrell, J Colin; Dunfield, Peter F

2010-09-01

Beijerinckia indica subsp. indica is an aerobic, acidophilic, exopolysaccharide-producing, N(2)-fixing soil bacterium. It is a generalist chemoorganotroph that is phylogenetically closely related to facultative and obligate methanotrophs of the genera Methylocella and Methylocapsa. Here we report the full genome sequence of this bacterium.
A Comparison of Morphological Taxonomy and Next Generation DNA Sequencing for the Assessment of Zooplankton Diversity

Science.gov (United States)

Harvey, J.; Fisher, J. L.; Johnson, S.; Morgan, S.; Peterson, W. T.; Satterthwaite, E. V.; Vrijenhoek, R. C.

2016-02-01

Our ability to accurately characterize the diversity of planktonic organisms is affected by both the methods we use to collect water samples and our approaches to assessing sample contents. Plankton nets collect organisms from high volumes of water, but integrate sample contents along the net's path. In contrast, plankton pumps collect water from discrete depths. Autonomous underwater vehicles (AUVs) can collect water samples with pinpoint accuracy from physical features such as upwelling fronts or biological features such as phytoplankton blooms, but sample volumes are necessarily much smaller than those possible with nets. Characterization of plankton diversity and abundances in water samples may also vary with the assessment method we apply. Morphological taxonomy provides visual identification and enumeration of organisms via microscopy, but is labor intensive. Next generation DNA sequencing (NGS) shows great promise for assessing plankton diversity in water samples, but accurate assessment of relative abundances may not be possible in all cases. Comparison of morphological taxonomy to molecular approaches is necessary to identify areas of overlap and also areas of disagreement between these methods. We have compared morphological taxonomic assessments to mitochondrial COI and nuclear 28S ribosomal RNA NGS results for plankton net samples collected in Monterey Bay, California. We have made a similar comparison for plankton pump samples, and have also applied our NGS methods to targeted, small volume water samples collected by an AUV. Our goal is to communicate current results and lessons learned regarding application of traditional taxonomy and novel molecular approaches to the study of plankton diversity in spatially and temporally variable, coastal marine environments.
A neurocomputational model of automatic sequence production.

Science.gov (United States)

Helie, Sebastien; Roeder, Jessica L; Vucovich, Lauren; Rünger, Dennis; Ashby, F Gregory

2015-07-01

Most behaviors unfold in time and include a sequence of submovements or cognitive activities. In addition, most behaviors are automatic and repeated daily throughout life. Yet, relatively little is known about the neurobiology of automatic sequence production. Past research suggests a gradual transfer from the associative striatum to the sensorimotor striatum, but a number of more recent studies challenge this role of the basal ganglia (BG) in automatic sequence production. In this article, we propose a new neurocomputational model of automatic sequence production in which the main role of the BG is to train cortical-cortical connections within the premotor areas that are responsible for automatic sequence production. The new model is used to simulate four different data sets from human and nonhuman animals, including (1) behavioral data (e.g., RTs), (2) electrophysiology data (e.g., single-neuron recordings), (3) macrostructure data (e.g., TMS), and (4) neurological circuit data (e.g., inactivation studies). We conclude with a comparison of the new model with existing models of automatic sequence production and discuss a possible new role for the BG in automaticity and its implication for Parkinson's disease.
Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta

Energy Technology Data Exchange (ETDEWEB)

Old, S.E.; Mohrenweiser, H.W. (Univ. of Michigan, Ann Arbor (USA))

1988-09-26

The triosephosphate isomerase gene from a rhesus monkey, Macaca mulatta, charon 34 library was sequenced. The human and chimpanzee enzymes differ from the rhesus enzyme at ASN 20 and GLU 198. The nucleotide sequence identity between rhesus and human is 97% in the coding region and >94% in the flanking regions. Comparison of the rhesus and chimp genes, including the intron and flanking sequences, does not suggest a mechanism for generating the two TPI peptides of proliferating cells from hominoids and a single peptide from the rhesus gene.

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

Science.gov (United States)

Haggarty, N W; Dunbar, B; Fothergill, L A

1983-01-01

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356
The genome sequence of four isolates from the family Lichtheimiaceae.

Science.gov (United States)

Chibucos, Marcus C; Etienne, Kizee A; Orvis, Joshua; Lee, Hongkyu; Daugherty, Sean; Lockhart, Shawn R; Ibrahim, Ashraf S; Bruno, Vincent M

2015-07-01

This study reports the release of draft genome sequences of two isolates of Lichtheimia corymbifera and two isolates of L. ramosa. Phylogenetic analyses indicate that the two L. corymbifera strains (CDC-B2541 and 008-049) are closely related to the previously sequenced L. corymbifera isolate (FSU 9682) while our two L. ramosa strains CDC-B5399 and CDC-B5792 cluster apart from them. These genome sequences will further the understanding of intraspecies and interspecies genetic variation within the Mucoraceae family of pathogenic fungi.
Evolution of massive close binaries and formation of neutron stars and black holes

International Nuclear Information System (INIS)

Massevitch, A.G.; Tutukov, A.V.; Yungelson, L.R.

1976-01-01

The main results of computations of the evolution of massive close binaries (10 M(Sun)+9.4 M(Sun), 16 M(Sun)+15 M(Sun), 32 M(Sun)+30 M(Sun), 64 M(Sun)+60 M(Sun)) up to oxygen exhaustion in the core are described. Mass exchange starting in the core hydrogen, shell hydrogen and core helium burning stages was studied. Computations were performed assuming both the Ledoux and Schwarzschild stability criteria for semiconvection. The influence of UFI-neutrino emission on the evolution of close binaries was investigated. The results obtained allow us to outline the following evolutionary chain: two detached Main-Sequence stars - mass exchange - Wolf-Rayet star or blue supergiant plus Main-Sequence star - explosion of the initially more massive star appearing as a supernova event - collapsed or neutron star plus Main-Sequence star, which may be observed as a 'runaway star' - mass exchange leading to X-ray emission - collapsed or neutron star plus WR-star or blue supergiant - second supernova explosion that preferentially disrupts the system and gives birth to two single pulsars with high spatial velocities. Numerical estimates concerning the number and properties of WR-stars, pulsars and X-ray sources are presented. The results favour the existence of the UFI-neutrino and the Ledoux criterion for describing semiconvection. Properties of several well-known X-ray sources and the binary pulsar are discussed on the basis of the evolutionary chain of close binaries. (Auth.)
Development of highly polymorphic simple sequence repeat markers using genome-wide microsatellite variant analysis in Foxtail millet [Setaria italica (L.) P. Beauv].

Science.gov (United States)

Zhang, Shuo; Tang, Chanjuan; Zhao, Qiang; Li, Jing; Yang, Lifang; Qie, Lufeng; Fan, Xingke; Li, Lin; Zhang, Ning; Zhao, Meicheng; Liu, Xiaotong; Chai, Yang; Zhang, Xue; Wang, Hailong; Li, Yingtao; Li, Wen; Zhi, Hui; Jia, Guanqing; Diao, Xianmin

2014-01-28

Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1' by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei's genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species.
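The record above reports an average polymorphism information content (PIC) of 0.67 but does not say how PIC was computed. The sketch below assumes the commonly used Botstein-style definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, where p_i is the frequency of allele i at one marker. The function name and the toy allele counts are illustrative and are not taken from the study.

```python
from itertools import combinations


def polymorphism_information_content(allele_counts):
    """Return the PIC of one marker from its allele counts (or frequencies).

    Assumes the Botstein-style definition; the abstract does not state
    which formula the authors actually used.
    """
    total = sum(allele_counts)
    if total <= 0:
        raise ValueError("allele counts must sum to a positive number")
    # Normalise counts to frequencies so either form of input works.
    freqs = [count / total for count in allele_counts]
    homozygosity = sum(p * p for p in freqs)
    # Correction term for matings between identical heterozygotes.
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - correction


if __name__ == "__main__":
    # Hypothetical marker with four alleles scored across 28 genotypes.
    print(round(polymorphism_information_content([10, 8, 6, 4]), 3))
```

PIC rises with the number of alleles and with how evenly they are distributed, which is why multi-allelic SSR markers tend to score higher than biallelic ones.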
Collagen Sequence Analysis of the Extinct Giant Ground Sloths Lestodon and Megatherium.

Directory of Open Access Journals (Sweden)

Michael Buckley

For over 200 years, fossils of bizarre extinct creatures have been described from the Americas that have ranged from giant ground sloths to the 'native' South American ungulates, groups of mammals that evolved in relative isolation on South America. Ground sloths belong to the South American xenarthrans, a group with modern although morphologically and ecologically very different representatives (anteaters, armadillos and sloths), which has been proposed to be one of the four main eutherian clades. Proteomics analyses of bone collagen have recently been used to yield a molecular phylogeny for a range of mammals including the unusual 'Malagasy aardvark', shown to be most closely related to the afrotherian tenrecs, and the South American ungulates, supporting their morphological association with condylarths. However, proteomics results generate partial sequence information that could impact upon the phylogenetic placement, and this has not been appropriately tested. For comparison, this paper examines the phylogenetic potential of proteomics-based sequencing through the analysis of collagen extracted from two extinct giant ground sloths, Lestodon and Megatherium. The ground sloths were placed as sister taxa to extant sloths, but with a closer relationship between Lestodon and the extant sloths than the basal Megatherium. These results highlight that proteomics methods could yield plausible phylogenies that share similarities with other methods, but have the potential to be more useful in fossils beyond the limits of ancient DNA survival.
Complete Genome Sequence of Beijerinckia indica subsp. indica

Science.gov (United States)

Tamas, Ivica; Dedysh, Svetlana N.; Liesack, Werner; Stott, Matthew B.; Alam, Maqsudul; Murrell, J. Colin; Dunfield, Peter F.

2010-01-01

Beijerinckia indica subsp. indica is an aerobic, acidophilic, exopolysaccharide-producing, N2-fixing soil bacterium. It is a generalist chemoorganotroph that is phylogenetically closely related to facultative and obligate methanotrophs of the genera Methylocella and Methylocapsa. Here we report the full genome sequence of this bacterium. PMID:20601475
Fifty years of coiled-coils and alpha-helical bundles: a close relationship between sequence and structure.

Science.gov (United States)

Parry, David A D; Fraser, R D Bruce; Squire, John M

2008-09-01

alpha-Helical coiled coils are remarkable for the diversity of related conformations that they adopt in both fibrous and globular proteins, and for the range of functions that they exhibit. The coiled coils are based on a heptad (7-residue), hendecad (11-residue) or a related quasi-repeat of apolar residues in the sequences of the alpha-helical regions involved. Most of these, however, display one or more sequence discontinuities known as stutters or stammers. The resulting coiled coils vary in length, in the number of chains participating, in the relative polarity of the contributing alpha-helical regions (parallel or antiparallel), and in the pitch length and handedness of the supercoil (left- or right-handed). Functionally, the concept that a coiled coil can act only as a static rod is no longer valid, and the range of roles that these structures have now been shown to exhibit has expanded rapidly in recent years. An important development has been the recognition that the delightful simplicity that exists between sequence and structure, and between structure and function, allows coiled coils with specialized features to be designed de novo.
The cDNA sequence of a neutral horseradish peroxidase.

Science.gov (United States)

Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

1991-02-16

A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids, which includes a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines, of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence, three fewer than in the previously described HRP-C. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight several thousand less than that of the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group, and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, previously undescribed horseradish peroxidase group.
Genome Sequence of Gordonia Phage BetterKatz

Science.gov (United States)

Berryman, Emily N.; Forrest, Kaitlyn M.; McHale, Lilliana; Wertz, Anthony T.; Zhuang, Zenas; Kasturiarachi, Naomi S.; Pressimone, Catherine A.; Schiebel, Johnathon G.; Furbee, Emily C.; Grubb, Sarah R.; Warner, Marcie H.; Montgomery, Matthew T.; Garlena, Rebecca A.; Russell, Daniel A.; Jacobs-Sera, Deborah; Hatfull, Graham F.

2016-01-01

BetterKatz is a bacteriophage isolated from a soil sample collected in Pittsburgh, Pennsylvania using the host Gordonia terrae 3612. BetterKatz’s genome is 50,636 bp long and contains 75 predicted protein-coding genes, 35 of which have been assigned putative functions. BetterKatz is not closely related to other sequenced Gordonia phages. PMID:27516497
Identification of sequence changes in live attenuated goose parvovirus vaccine strains developed in Asia and Europe.

Science.gov (United States)

Shien, J-H; Wang, Y-S; Chen, C-H; Shieh, H K; Hu, C-C; Chang, P-C

2008-10-01

Live attenuated vaccines have been used for control of the disease caused by goose parvovirus (GPV), but the mechanism involved in attenuation of GPV remains elusive. This report presents the complete nucleotide sequences of two live attenuated strains of GPV (82-0321V and VG32/1) that were independently developed in Taiwan and Europe, together with the parental strain of 82-0321V and a field strain isolated in Taiwan in 2006. Sequence comparisons showed that 82-0321V and VG32/1 had multiple deletions and substitutions in the inverted terminal repeats region when compared with their parental strain or the field virus, but these changes did not affect the formation of the hairpin structure essential for viral replication. Moreover, 82-0321V and VG32/1 had five amino acid changes in the non-structural protein, but these changes were located at positions distant from known functional motifs in the non-structural protein. In contrast, 82-0321V had nine changes and VG32/1 had 11 changes in their capsid proteins (VP1), and the majority of these changes occurred at positions close to the putative receptor binding sites of VP1, as predicted using the structure of adeno-associated virus 2 as the model system. Taken together, the results suggest that changes in sequence near the receptor binding sites of VP1 might be responsible for attenuation of GPV. This is the first report of complete nucleotide sequences of GPV other than the virulent B strain, and suggests a possible mechanism for attenuation of GPV.
Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

Directory of Open Access Journals (Sweden)

Yunyun Liang

2015-01-01

Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and that it mainly uses the protein primary sequence, the predicted secondary structure sequence, and the position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on the 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves favorable and competitive performance. This will offer an important complement to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.
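The jackknife cross-validation mentioned in the record above is a leave-one-out procedure: each sample is held out in turn, the model is fitted on the rest, and the held-out sample is predicted. The sketch below illustrates only that loop. The abstract does not name a classifier here, so a 1-nearest-neighbour rule stands in as an assumption, and the two-dimensional toy vectors and class labels are invented for illustration (the study itself uses 224D features).

```python
import math


def nearest_neighbor_label(training, query):
    """Return the label of the training vector closest to `query`."""
    closest = min(training, key=lambda item: math.dist(item[0], query))
    return closest[1]


def jackknife_accuracy(samples):
    """Leave-one-out accuracy over a list of (feature_vector, label) pairs."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    correct = 0
    for index, (vector, label) in enumerate(samples):
        # Hold out one sample and predict it from all the others.
        rest = samples[:index] + samples[index + 1:]
        if nearest_neighbor_label(rest, vector) == label:
            correct += 1
    return correct / len(samples)


# Hypothetical toy data: two well-separated structural classes.
TOY_SAMPLES = [
    ((0.0, 0.1), "all-alpha"), ((0.2, 0.0), "all-alpha"), ((0.1, 0.2), "all-alpha"),
    ((1.0, 1.1), "all-beta"), ((1.2, 0.9), "all-beta"), ((0.9, 1.0), "all-beta"),
]

if __name__ == "__main__":
    print(jackknife_accuracy(TOY_SAMPLES))
```

Because every sample is tested exactly once and never appears in its own training set, the result is deterministic for a given dataset, which is one reason the jackknife is popular for benchmarking on small protein datasets.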
Metagenomics profiling for assessing microbial diversity in both active and closed landfills.

Science.gov (United States)

Zainun, Mohamad Yusof; Simarani, Khanom

2018-03-01

The municipal landfill is an example of a human-made environment that harbours a complex diversity of microbial communities. To evaluate this complexity, the structures of bacterial communities in active (operational) and closed (non-operational) landfills in Malaysia were analysed with culture-independent metagenomics approaches. Soil samples were collected at several points from 0 to 20 cm depth and subjected to physicochemical tests, such as temperature, pH, and moisture content. In addition, heavy metal contamination was determined using ICPMS. Bacterial enumeration was performed on nutrient agar (NA) plates incubated aerobically at 30°C. The soil DNA was extracted, purified and amplified prior to sequencing the 16S rRNA gene for statistical and bioinformatics analyses. The average bacterial count for the closed landfill was higher than that for the active landfill, at 9.16×10^7 and 1.50×10^7, respectively. A higher number of bacterial OTUs was also recorded in the closed landfill than in the active landfill, i.e. 6625 and 4552 OTUs, respectively. The data from both landfills showed that the predominant phylum was Proteobacteria (55.7%). On average, Bacteroidetes was the second most abundant phylum, followed by Firmicutes, in the active landfill, while the communities in the closed landfill were dominated by Acidobacteria and Actinobacteria. Euryarchaeota (Archaea) was a minor phylum detected in the active landfill but almost completely absent in the closed landfill. The composition of the bacterial communities thus suggests some variance between active and closed landfills, and this study offers new clues pertaining to bacterial diversity patterns between the types of landfills studied.
Efficiency comparison of 3 kinds of arterial puncture closing devices

International Nuclear Information System (INIS)

Feng Xiaodi; Jin Xian; Chen Yueguang; Xiao Hongbing; Yu Qiang; Chen Chengjun; Zhang Dadong

2007-01-01

Objective: To evaluate the efficiencies of arterial puncture closing devices (APCDs), including Angioseal, Perclose and Boomerang, in patients undergoing coronary angiography or percutaneous vascular interventions. Methods: 1497 patients who underwent cardiac catheterization procedures were divided into a manual compression group (639 cases) and an APCD closure group (576 cases with Angioseal, 151 cases with Perclose and 11.3 cases of Boomerang). The times of maneuver, hemorrhage complications and other rare complications were assessed, recorded and compared. Results: The times for maneuver of the standard manual compression group, Angioseal group, Perclose group and Boomerang group were (21.4±2.7) h, (3.5±2.3) h, (3.7±2.6) h and (3.9±2.8) h, respectively. The APCDs could obviously reduce bed rest time compared with manual compression. The rates of failure of the operations were 2.7%, 1.4%, 8.6% and 3.5% (P =0.006, P<0.001); and the rates of hemorrhage were 9.2%, 5.8%, 12.6% and 8.0%, respectively, for the four groups (P=0.005). Excluding the failed operations, the incidence of hemorrhage complications showed no significant differences among the groups. Conclusion: Application of APCDs to close the puncture site can significantly reduce the bed rest time, but not the incidence of hemorrhage complications. (authors)
Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.

Science.gov (United States)

Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S K; Mammel, Mark K; Tarr, Phillip I; Eppinger, Mark

2016-01-01

Multi-isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North American outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulsed-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and
COMPARISON OF TWO STRUCTURE AND MOTION STRATEGIES

Directory of Open Access Journals (Sweden)

R. Roncella

2012-09-01

Automatic orientation of image sequences in close range photogrammetry is becoming more and more important, not least to maintain a degree of competitiveness with other survey techniques, such as laser scanning. The objective of this paper is to compare two Structure from Motion (SFM) strategies. The previous strategy has been used at our Department for some years already in a wide range of projects and is based on the Harris operator and the fundamental matrix plus the trifocal tensor estimation to filter out the outliers. While it has in most cases performed satisfactorily, the percentage of accepted matches is generally smaller than expected; sometimes this leads to failure to estimate the trifocal tensor. The second one has only recently been implemented and is still under testing; it is based on the SURF operator and the 5-point relative orientation algorithm. The paper will show a comparison between the two strategies on a series of test cases.
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

Directory of Open Access Journals (Sweden)

Anne Bruun Krøigård

Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had a markedly diverse impact on individual callers; for some callers, increased sequencing depth greatly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.
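The "highly variable agreement" between callers reported above can be quantified in several ways, and the abstract does not say which measure the authors used. As one plausible illustration, the sketch below computes the pairwise Jaccard index (shared calls divided by all calls made by either caller) over sets of variant keys. The caller names and variants are made-up placeholders, not output from any of the nine tools.

```python
from itertools import combinations


def pairwise_jaccard(calls_by_caller):
    """Map each caller pair to |A & B| / |A | B| over their variant sets.

    Variants are assumed to be hashable keys such as
    (chromosome, position, ref_allele, alt_allele).
    """
    agreement = {}
    for first, second in combinations(sorted(calls_by_caller), 2):
        calls_a = calls_by_caller[first]
        calls_b = calls_by_caller[second]
        union = calls_a | calls_b
        # Two callers that both report nothing are treated as fully agreeing.
        agreement[(first, second)] = len(calls_a & calls_b) / len(union) if union else 1.0
    return agreement


# Hypothetical call sets for two unnamed callers on one tumor-normal pair.
TOY_CALLS = {
    "caller_x": {("chr1", 100, "A", "T"), ("chr2", 200, "G", "C"), ("chr3", 300, "C", "T")},
    "caller_y": {("chr2", 200, "G", "C"), ("chr3", 300, "C", "T"), ("chr4", 400, "T", "G")},
}

if __name__ == "__main__":
    for pair, score in pairwise_jaccard(TOY_CALLS).items():
        print(pair, score)
```

In practice, indels would need normalisation (for example left-alignment) before their keys are compared; otherwise the same event can be counted as two different calls.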
Molecular diagnosis of lyssaviruses and sequence comparison of Australian bat lyssavirus samples.

Science.gov (United States)

Foord, A J; Heine, H G; Pritchard, L I; Lunt, R A; Newberry, K M; Rootes, C L; Boyle, D B

2006-07-01

To evaluate and implement molecular diagnostic tests for the detection of lyssaviruses in Australia. A published hemi-nested reverse transcriptase polymerase chain reaction (RT-PCR) for the detection of all lyssavirus genotypes was modified to a fully nested RT-PCR format and compared with the original assay. TaqMan assays for the detection of Australian bat lyssavirus (ABLV) were compared with both the nested and hemi-nested RT-PCR assays. The sequences of RT-PCR products were determined to assess sequence variations of the target region (nucleocapsid gene) in samples of ABLV originating from different regions. The nested RT-PCR assay was highly analytically specific, and at least as analytically sensitive as the hemi-nested assay. The TaqMan assays were highly analytically specific and more analytically sensitive than either RT-PCR assay, with a detection level of approximately 10 genome equivalents per µl. Sequence of the first 544 nucleotides of the nucleocapsid protein coding sequence was obtained from all samples of ABLV received at Australian Animal Health Laboratory during the study period. The nested RT-PCR provided a means for molecular diagnosis of all tested genotypes of lyssavirus including classical rabies virus and Australian bat lyssavirus. The published TaqMan assay proved to be superior to the RT-PCR assays for the detection of ABLV in terms of analytical sensitivity. The TaqMan assay would also be faster and cross contamination is less likely. Nucleotide sequence analyses of samples of ABLV from a wide geographical range in Australia demonstrated the conserved nature of this region of the genome and therefore the suitability of this region for molecular diagnosis.
Sequence determination and analysis of the NSs genes of two tospoviruses.

Science.gov (United States)

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequences of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequences of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
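Pairwise identity figures such as those in the record above are computed from an alignment, but conventions for the denominator differ between tools and the abstract does not state which one was used. The sketch below assumes the two sequences are already aligned to equal length and counts only columns where neither sequence has a gap. The function name and the short peptide strings are illustrative, not real NSs sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over ungapped columns of two pre-aligned sequences.

    Assumption: gap-containing columns are excluded from the denominator;
    other tools divide by alignment length or by the shorter sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments: 4 matches over 5 ungapped columns.
    print(percent_identity("MKT-LV", "MKSALV"))
```

Because the choice of denominator can shift the result by several percentage points for gappy alignments, identity values from different studies are only comparable when the convention is known.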
Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms.

Science.gov (United States)

Yamamoto, Toshio; Nagasaki, Hideki; Yonemaru, Jun-ichi; Ebana, Kaworu; Nakajima, Maiko; Shibaya, Taeko; Yano, Masahiro

2010-04-27

To create useful gene combinations in crop breeding, it is necessary to clarify the dynamics of the genome composition created by breeding practices. A large quantity of single-nucleotide polymorphism (SNP) data is required to permit discrimination of chromosome segments among modern cultivars, which are genetically related. Here, we used a high-throughput sequencer to conduct whole-genome sequencing of an elite Japanese rice cultivar, Koshihikari, which is closely related to Nipponbare, whose genome sequencing has been completed. Then we designed a high-throughput typing array based on the SNP information by comparison of the two sequences. Finally, we applied this array to analyze historical representative rice cultivars to understand the dynamics of their genome composition. The total 5.89-Gb sequence for Koshihikari, equivalent to 15.7× the entire rice genome, was mapped using the Pseudomolecules 4.0 database for Nipponbare. The resultant Koshihikari genome sequence corresponded to 80.1% of the Nipponbare sequence and led to the identification of 67,051 SNPs. A high-throughput typing array consisting of 1917 SNP sites distributed throughout the genome was designed to genotype 151 representative Japanese cultivars that have been grown during the past 150 years. We could identify the ancestral origin of the pedigree haplotypes in 60.9% of the Koshihikari genome and 18 consensus haplotype blocks which are inherited from traditional landraces to current improved varieties. Moreover, it was predicted that modern breeding practices have generally decreased genetic diversity. Detection of genome-wide SNPs by both high-throughput sequencer and typing array made it possible to evaluate genomic composition of genetically related rice varieties. With the aid of their pedigree information, we clarified the dynamics of chromosome recombination during the historical rice breeding process. We also found several genomic regions decreasing genetic diversity which might be
Next generation sequencing reveals the hidden diversity of zooplankton assemblages.

Directory of Open Access Journals (Sweden)

Penelope K Lindeque

BACKGROUND: Zooplankton play an important role in our oceans, in biogeochemical cycling and providing a food source for commercially important fish larvae. However, difficulties in correctly identifying zooplankton hinder our understanding of their roles in marine ecosystem functioning, and can prevent detection of long term changes in their community structure. The advent of massively parallel next generation sequencing technology allows DNA sequence data to be recovered directly from whole community samples. Here we assess the ability of such sequencing to quantify richness and diversity of a mixed zooplankton assemblage from a productive time series site in the Western English Channel. METHODOLOGY/PRINCIPAL FINDINGS: Plankton net hauls (200 µm) were taken at the Western Channel Observatory station L4 in September 2010 and January 2011. These samples were analysed by microscopy and metagenetic analysis of the 18S nuclear small subunit ribosomal RNA gene using the 454 pyrosequencing platform. Following quality control a total of 419,041 sequences were obtained for all samples. The sequences clustered into 205 operational taxonomic units using a 97% similarity cut-off. Allocation of taxonomy by comparison with the National Centre for Biotechnology Information database identified 135 OTUs to species level, 11 to genus level and 1 to order; <2.5% of sequences were classified as unknowns. By comparison a skilled microscopic analyst was able to routinely enumerate only 58 taxonomic groups. CONCLUSIONS: Metagenetics reveals a previously hidden taxonomic richness, especially for Copepoda and hard-to-identify meroplankton such as Bivalvia, Gastropoda and Polychaeta. It also reveals rare species and parasites. We conclude that Next Generation Sequencing of 18S amplicons is a powerful tool for elucidating the true diversity and species richness of zooplankton communities. While this approach allows for broad diversity assessments of plankton it may

Cardiac MR tagging: optimization of sequence parameters and comparison at 1.5 T and 3.0 T in a volunteer study

International Nuclear Information System (INIS)

Kramer, U.; Fenchel, M.; Klumpp, B.; Claussen, C.D.; Miller, S.; Deshpande, V.; Laub, G.; Finn, J.P.

2006-01-01

Purpose: The aim of this study was the optimization of a gradient echo (GRE) MR tagging sequence at 3.0 T in comparison to 1.5 T in order to obtain the best image contrast between the myocardium, tag lines and blood signal. Theoretically expected improvements of signal-to-noise (SNR) and contrast-to-noise ratios (CNR) were also calculated. Materials and methods: 14 healthy volunteers (8 male, 6 female; mean age 43.4±10.3 years) were scanned using a 3.0 T as well as a 1.5 T whole-body system. A GRE FLASH 2D tagging sequence was evaluated (midventricular short axis view) by varying the flip angle (8-16°), slice thickness (4-8 mm; fixed flip angle 1.5/3.0 T: 12°/8°, tag size 8 mm) and tag size (4-8 mm; fixed flip angle 1.5/3.0 T: 12°/8°, slice thickness 6 mm). The field of view, acquisition time and temporal resolution (45 ms) were kept constant. Qualitative and quantitative image analysis was performed by calculating the SNR, CNR tag as well as the relative contrast between the myocardium and tag lines (RCMT). Results: Based on individual comparison, the best imaging protocol was found at a slice thickness of 6 mm, tag size of 8 mm, and optimized flip angle of 8° (3.0 T) and 12° (1.5 T), respectively. Compared to 1.5 T, a significantly higher overall image score was determined (mean±sd; 3.2±0.2 vs 2.7±0.4) and a strong correlation between the CNR tag and RCMT for flip angle α and the slice thickness was found. A higher field strength resulted in an 80% increase in the CNR tag compared to 1.5 T (mean 10.7/6.1). Furthermore, the SNR was improved by 35% (mean 20.6/15.3) and the RCMT by 35% (mean 0.47/0.35). Conclusion: Myocardial tagging at 3.0 T has shown superior image quality in comparison to 1.5 T due to a higher baseline SNR and an improved CNR as well as RCMT. The suppressed fading of the tags enables the accessibility to the diastolic phase of the cardiac cycle. (orig.)
Optimal choice of word length when comparing two Markov sequences using a χ²-statistic.

Science.gov (United States)

Bai, Xin; Tang, Kujin; Ren, Jie; Waterman, Michael; Sun, Fengzhu

2017-10-03

Alignment-free sequence comparison using counts of word patterns (grams, k-tuples) has become an active research topic due to the large amount of sequence data from the new sequencing technologies. Genome sequences are frequently modelled by Markov chains and the likelihood ratio test or the corresponding approximate χ²-statistic has been suggested to compare two sequences. However, it is not known how to best choose the word length k in such studies. We develop an optimal strategy to choose k by maximizing the statistical power of detecting differences between two sequences. Let the orders of the Markov chains for the two sequences be r₁ and r₂, respectively. We show through both simulations and theoretical studies that the optimal k = max(r₁, r₂) + 1 for both long sequences and next generation sequencing (NGS) read data. The orders of the Markov chains may be unknown and several methods have been developed to estimate the orders of Markov chains based on both long sequences and NGS reads. We study the power loss of the statistics when the estimated orders are used. It is shown that the power loss is minimal for some of the estimators of the orders of Markov chains. Our studies provide guidelines on choosing the optimal word length for the comparison of Markov sequences.
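To make the word-count idea concrete, the following is a minimal Python sketch, not the statistic used in the paper. It counts overlapping words of length k = max(r1, r2) + 1 in two sequences and compares the two count vectors with an ordinary Pearson homogeneity chi-square. The function names and the choice of this particular chi-square form are assumptions made for illustration; the Markov-model likelihood ratio test described in the abstract is not reproduced.

```python
from collections import Counter

def optimal_word_length(r1, r2):
    """Word length stated in the abstract: k = max(r1, r2) + 1."""
    return max(r1, r2) + 1

def word_counts(seq, k):
    """Count overlapping words (k-tuples) of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def chi2_homogeneity(counts_a, counts_b):
    """Pearson chi-square comparing two word-count vectors.

    Illustrative stand-in only (assumption): expected counts come from
    pooling both sequences, not from fitted Markov chains.
    """
    n_a = sum(counts_a.values())
    n_b = sum(counts_b.values())
    total = n_a + n_b
    stat = 0.0
    for word in set(counts_a) | set(counts_b):
        pooled = counts_a[word] + counts_b[word]
        exp_a = pooled * n_a / total
        exp_b = pooled * n_b / total
        stat += (counts_a[word] - exp_a) ** 2 / exp_a
        stat += (counts_b[word] - exp_b) ** 2 / exp_b
    return stat

# Hypothetical usage: Markov orders 1 and 2 give words of length 3.
k = optimal_word_length(1, 2)
stat = chi2_homogeneity(word_counts("ACGTACGTTGCA", k),
                        word_counts("ACGTACGTACGT", k))
```

A larger value of the statistic indicates that the two word-count profiles differ more than pooling would suggest; how it should be calibrated against a null distribution is outside the scope of this sketch.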
Comparison of whole genome amplification techniques for human single cell exome sequencing.

Science.gov (United States)

Borgström, Erik; Paterlini, Marta; Mold, Jeff E; Frisen, Jonas; Lundeberg, Joakim

2017-01-01

Whole genome amplification (WGA) is currently a prerequisite for single cell whole genome or exome sequencing. Depending on the method used the rate of artifact formation, allelic dropout and sequence coverage over the genome may differ significantly. The largest difference between the evaluated protocols was observed when analyzing the target coverage and read depth distribution. These differences also had impact on the downstream variant calling. Conclusively, the products from the AMPLI1 and MALBAC kits were shown to be most similar to the bulk samples and are therefore recommended for WGA of single cells. In this study four commercial kits for WGA (AMPLI1, MALBAC, Repli-G and PicoPlex) were used to amplify human single cells. The WGA products were exome sequenced together with non-amplified bulk samples from the same source. The resulting data was evaluated in terms of genomic coverage, allelic dropout and SNP calling.
Order and correlations in genomic DNA sequences. The spectral approach

International Nuclear Information System (INIS)

Lobzin, Vasilii V; Chechetkin, Vladimir R

2000-01-01

The structural analysis of genomic DNA sequences is discussed in the framework of the spectral approach, which is sufficiently universal due to the reciprocal correspondence and mutual complementarity of Fourier transform length scales. The spectral characteristics of random sequences of the same nucleotide composition possess the property of self-averaging for relatively short sequences of length M≥100-300. Comparison with the characteristics of random sequences determines the statistical significance of the structural features observed. Apart from traditional applications to the search for hidden periodicities, spectral methods are also efficient in studying mutual correlations in DNA sequences. By combining spectra for structure factors and correlation functions, not only integral correlations can be estimated but also their origin identified. Using the structural spectral entropy approach, the regularity of a sequence can be quantitatively assessed. A brief introduction to the problem is also presented and other major methods of DNA sequence analysis described. (reviews of topical problems)
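As a rough illustration of the spectral approach, the Python sketch below converts a DNA sequence into a 0/1 indicator signal for one nucleotide, computes a naive discrete Fourier power spectrum, and reports the period of the strongest harmonic. The normalization (squared amplitude divided by sequence length M) and all function names are assumptions chosen for this example; the review's exact definitions of structure factors and spectral entropy are not reproduced here.

```python
import cmath

def indicator(seq, base):
    """1.0 where seq has the given nucleotide, 0.0 elsewhere."""
    return [1.0 if ch == base else 0.0 for ch in seq]

def power_spectrum(signal):
    """Squared DFT amplitude divided by M, for harmonics n = 1 .. M // 2.

    Naive O(M^2) transform, adequate for short example sequences.
    """
    m = len(signal)
    spectrum = []
    for n in range(1, m // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * cmath.pi * n * pos / m)
                    for pos, x in enumerate(signal))
        spectrum.append(abs(coeff) ** 2 / m)
    return spectrum

def dominant_period(seq, base):
    """Period (in bases) of the strongest harmonic for one nucleotide."""
    spec = power_spectrum(indicator(seq, base))
    n_best = max(range(len(spec)), key=spec.__getitem__) + 1
    return len(seq) / n_best

# Hypothetical usage: a perfect 3-base repeat has its peak at period 3.
period = dominant_period("ACG" * 10, "A")
```

In practice the peak heights would be compared with those expected for random sequences of the same nucleotide composition, which is the significance test the abstract refers to.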
Dynamic Tides and the Evolution of Stars in Close Binaries

OpenAIRE

Willems, B.; Claret, A.

2004-01-01

In this talk, we review some recent advances in the theory of dynamic tides in close binaries. We particularly focus on the effects of resonances of dynamic tides with free oscillation modes and on the role of dynamic tides in the comparison of theoretically predicted and observationally inferred apsidal-motion rates.
Genetic diversity in Breonadia salicina based on intra-species sequence variation of chloroplast DNA spacer sequence

International Nuclear Information System (INIS)

Qurainy, F.A.; Gaafar, A.R.Z.

2014-01-01

Assessment and knowledge of the genetic diversity and variation within and between populations of rare and endangered plants is very important for effective conservation. Sequence variation in the psbA-trnH intergenic spacer of the chloroplast genome was assessed within Breonadia salicina (Rubiaceae), a critically endangered plant species endemic to the southwestern part of the Kingdom of Saudi Arabia. Sequence data from 19 individuals in three populations revealed nine haplotypes; the aligned sequences from all Saudi accessions extended to 355 bp. A high level of haplotype diversity (Hd = 0.842) and a low level of nucleotide diversity (Pi = 0.0058) were detected. Consistently, both hierarchical analysis of molecular variance (AMOVA) and the constructed neighbor-joining tree indicated null genetic differentiation among populations. This level of differentiation between populations or between regions in psbA-trnH sequences may be due to the effects of abundant sharing of ancestral haplotypes and the presence of private haplotypes fixed for each population. Furthermore, the results revealed almost the same level of genetic diversity as in Yemeni accessions, with Saudi accessions sharing three of the four haplotypes found in Yemeni accessions. (author)
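The two summary statistics quoted above can be sketched in Python as follows. This assumes the commonly used Nei estimators (haplotype diversity Hd = n/(n-1) · (1 − Σ pᵢ²) and nucleotide diversity as the mean number of pairwise differences per site); the abstract does not state which estimator or software the authors used, and the input data in the usage lines are invented placeholders, not the study's sequences.

```python
from collections import Counter
from itertools import combinations

def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of haplotype labels."""
    n = len(haplotypes)
    freqs = [count / n for count in Counter(haplotypes).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))

def nucleotide_diversity(aligned):
    """Mean pairwise differences per site over equal-length aligned sequences."""
    length = len(aligned[0])
    pairs = list(combinations(aligned, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)

# Hypothetical toy data (placeholders, not from the study).
hd = haplotype_diversity(["H1", "H1", "H2", "H3"])
pi = nucleotide_diversity(["ACGT", "ACGA", "ACGT"])
```

A high Hd combined with a low Pi, as reported in the abstract, typically means many distinct haplotypes that differ from one another at only a few sites.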
Genome Analysis of Listeria monocytogenes Sequence Type 8 Strains Persisting in Salmon and Poultry Processing Environments and Comparison with Related Strains

Science.gov (United States)

Fagerlund, Annette; Langsrud, Solveig; Schirmer, Bjørn C. T.; Møretrø, Trond; Heir, Even

2016-01-01

Listeria monocytogenes is an important foodborne pathogen responsible for the disease listeriosis, and can be found throughout the environment, in many foods and in food processing facilities. The main cause of listeriosis is consumption of food contaminated from sources in food processing environments. Persistence in food processing facilities has previously been shown for the L. monocytogenes sequence type (ST) 8 subtype. In the current study, five ST8 strains were subjected to whole-genome sequencing and compared with five additionally available ST8 genomes, allowing comparison of strains from salmon, poultry and cheese industry, in addition to a human clinical isolate. Genome-wide analysis of single-nucleotide polymorphisms (SNPs) confirmed that almost identical strains were detected in a Danish salmon processing plant in 1996 and in a Norwegian salmon processing plant in 2001 and 2011. Furthermore, we show that L. monocytogenes ST8 was likely to have been transferred between two poultry processing plants as a result of relocation of processing equipment. The SNP data were used to infer the phylogeny of the ST8 strains, separating them into two main genetic groups. Within each group, the plasmid and prophage content was almost entirely conserved, but between groups, these sequences showed strong divergence. The accessory genome of the ST8 strains harbored genetic elements which could be involved in rendering the ST8 strains resilient to incoming mobile genetic elements. These included two restriction-modification loci, one of which was predicted to show phase variable recognition sequence specificity through site-specific domain shuffling. Analysis indicated that the ST8 strains harbor all important known L. monocytogenes virulence factors, and ST8 strains are commonly identified as the causative agents of invasive listeriosis. Therefore, the persistence of this L. monocytogenes subtype in food processing facilities poses a significant concern for food safety
Sequence comparisons of odorant receptors among tortricid moths reveal different rates of molecular evolution among family members.

Directory of Open Access Journals (Sweden)

Colm Carraher

In insects, odorant receptors detect volatile cues involved in behaviours such as mate recognition, food location and oviposition. We have investigated the evolution of three odorant receptors from five species within the moth genera Ctenopseustis and Planotortrix, family Tortricidae, which fall into distinct clades within the odorant receptor multigene family. One receptor is the orthologue of the co-receptor Or83b, now known as Orco (OR2), and encodes the obligate ion channel subunit of the receptor complex. In comparison, the other two receptors, OR1 and OR3, are ligand-binding receptor subunits, activated by volatile compounds produced by plants (methyl salicylate and citral, respectively). Rates of sequence evolution at non-synonymous sites were significantly higher in OR1 compared with OR2 and OR3. Within the dataset OR1 contains 109 variable amino acid positions that are distributed evenly across the entire protein including transmembrane helices, loop regions and termini, while OR2 and OR3 contain 18 and 16 variable sites, respectively. OR2 shows a high level of amino acid conservation as expected due to its essential role in odour detection; however, we found unexpected differences in the rate of evolution between two ligand-binding odorant receptors, OR1 and OR3. OR3 shows high sequence conservation suggestive of a conserved role in odour reception, whereas the higher rate of evolution observed in OR1, particularly at non-synonymous sites, may be suggestive of relaxed constraint, perhaps associated with the loss of an ancestral role in sex pheromone reception.
Simultaneous identification of long similar substrings in large sets of sequences

Directory of Open Access Journals (Sweden)

Wittig Burghardt

2007-05-01

Background: Sequence comparison faces new challenges today, with many complete genomes and large libraries of transcripts known. Gene annotation pipelines match these sequences in order to identify genes and their alternative splice forms. However, the software currently available cannot simultaneously compare sets of sequences as large as necessary, especially if errors must be considered. Results: We therefore present a new algorithm for the identification of almost perfectly matching substrings in very large sets of sequences. Its implementation, called ClustDB, is considerably faster and can handle 16 times more data than VMATCH, the most memory efficient exact program known today. ClustDB simultaneously generates large sets of exactly matching substrings of a given minimum length as seeds for a novel method of match extension with errors. It generates alignments of maximum length with a considered maximum number of errors within each overlapping window of a given size. Such alignments are not optimal in the usual sense but faster to calculate and often more appropriate than traditional alignments for genomic sequence comparisons, EST and full-length cDNA matching, and genomic sequence assembly. The method is used to check the overlaps and to reveal possible assembly errors for 1377 Medicago truncatula BAC-size sequences published at http://www.medicago.org/genome/assembly_table.php?chr=1. Conclusion: The program ClustDB proves that window alignment is an efficient way to find long sequence sections of homogeneous alignment quality, as expected in case of random errors, and to detect systematic errors resulting from sequence contaminations. Such inserts are systematically overlooked in long alignments controlled by only tuning penalties for mismatches and gaps. ClustDB is freely available for academic use.
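The window criterion described above can be illustrated with a deliberately simplified Python sketch. It extends an ungapped match to the right and stops as soon as the most recent `window` aligned columns would contain more than `max_errors` mismatches. This is an assumption-laden toy: it handles substitutions only (no gaps), extends in one direction, and its function name and parameters are hypothetical. It is not the ClustDB algorithm or its interface.

```python
from collections import deque

def extend_right(a, b, i, j, window, max_errors):
    """Length of an ungapped extension starting at a[i] and b[j].

    The extension grows column by column and stops before the first
    column that would push the mismatch count within the last `window`
    columns above `max_errors`.
    """
    recent = deque()  # mismatch flags for the most recent columns
    errors = 0        # mismatches currently inside the sliding window
    length = 0
    while i + length < len(a) and j + length < len(b):
        mismatch = a[i + length] != b[j + length]
        recent.append(mismatch)
        errors += mismatch
        if len(recent) > window:
            errors -= recent.popleft()
        if errors > max_errors:
            break
        length += 1
    return length

# Hypothetical usage: one isolated mismatch is tolerated,
# two adjacent mismatches end the extension.
full = extend_right("AAAAAAAAAA", "AAAATAAAAA", 0, 0, window=5, max_errors=1)
cut = extend_right("AAAAAAAA", "AAATTAAA", 0, 0, window=4, max_errors=1)
```

Bounding errors per window rather than over the whole alignment is what keeps alignment quality locally uniform, which is the property the abstract credits with exposing contaminating inserts.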
Dispersed repetitive sequences in eukaryotic genomes and their possible biological significance

International Nuclear Information System (INIS)

Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.

1983-01-01

This paper describes the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized a ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B+ RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transposition or whether they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table
A symbolic dynamics approach for the complexity analysis of chaotic pseudo-random sequences

International Nuclear Information System (INIS)

Xiao Fanghong

2004-01-01

By considering a chaotic pseudo-random sequence as a symbolic sequence, the authors present a symbolic dynamics approach for the complexity analysis of chaotic pseudo-random sequences. The method is applied to the logistic map and a one-way coupled map lattice to demonstrate how it works, and it is compared with the approximate entropy method. The results show that the method can distinguish the complexities of different chaotic pseudo-random sequences, and that it is superior to the approximate entropy method.
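The idea of treating a chaotic orbit as a symbolic sequence can be sketched in Python as follows. The sketch iterates the logistic map x → μx(1 − x), writes '1' when the value is at least 0.5 and '0' otherwise, and counts how many distinct words of a given length occur. The threshold partition at 0.5, the distinct-word count as a complexity proxy, and all names and parameter values are assumptions for illustration; the abstract does not specify the authors' complexity measure.

```python
def logistic_symbols(mu, x0, n, burn_in=1000):
    """Binary symbolic sequence of the logistic map after a transient."""
    x = x0
    for _ in range(burn_in):      # discard the transient
        x = mu * x * (1.0 - x)
    symbols = []
    for _ in range(n):
        x = mu * x * (1.0 - x)
        symbols.append("1" if x >= 0.5 else "0")
    return "".join(symbols)

def distinct_words(symbols, length):
    """Number of different words of the given length in the sequence."""
    return len({symbols[i:i + length]
                for i in range(len(symbols) - length + 1)})

# Hypothetical usage: a periodic regime versus a chaotic one.
periodic = logistic_symbols(3.2, 0.3, 2000)
chaotic = logistic_symbols(3.9, 0.3, 2000)
richer = distinct_words(chaotic, 4) > distinct_words(periodic, 4)
```

A periodic orbit revisits the same few words indefinitely, whereas a chaotic orbit produces many more, so the word count gives a crude ordering of sequence complexity.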
Triassic Sequence Geological Development of the Arctic with focus on Svalbard and the Barents Shelf

Energy Technology Data Exchange (ETDEWEB)

Moerk, Atle

1998-12-31

Triassic rocks are of great interest for exploration in Arctic areas as they have proved to include both good hydrocarbon source rocks and potential hydrocarbon reservoir rocks. In this thesis, the stratigraphy and sedimentology of the Arctic Triassic successions are studied within a sequence stratigraphical framework. Inter-regional comparisons throughout the Arctic are based on comparisons of transgressive-regressive sequences. Improved dating of the studied sequences, and the recognition and correlation of sequence boundaries of second and third order, facilitate interpretation of facies distribution and the geological development both within and between the studied areas. Main emphasis is given to the Triassic succession of Svalbard and the Barents Shelf, which through this study is integrated within a circum-Arctic sequence stratigraphical framework. Good correspondence of the Triassic sequence boundaries between the different Arctic areas indicates that they are mainly controlled by eustasy, while decreasing correspondence of the sequence boundaries in the Jurassic and Cretaceous periods indicates that local and large scale tectonism becomes progressively more dominant in the circum-Arctic Realm through the Mesozoic Era. These hypotheses are further discussed. 701 refs., 110 figs., 12 tabs.
A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.

Science.gov (United States)

Wickland, Daniel P; Battu, Gopal; Hudson, Karen A; Diers, Brian W; Hudson, Matthew E

2017-12-28

Genotyping-by-sequencing (GBS), a method to identify genetic variants and quickly genotype samples, reduces genome complexity by using restriction enzymes to divide the genome into fragments whose ends are sequenced on short-read sequencing platforms. While cost-effective, this method produces extensive missing data and requires complex bioinformatics analysis. GBS is most commonly used on crop plant genomes, and because crop plants have highly variable ploidy and repeat content, the performance of GBS analysis software can vary by target organism. Here we focus our analysis on soybean, a polyploid crop with a highly duplicated genome, relatively little public GBS data and few dedicated tools. We compared the performance of five GBS pipelines using low-coverage Illumina sequence data from three soybean populations. To address issues identified with existing methods, we developed GB-eaSy, a GBS bioinformatics workflow that incorporates widely used genomics tools, parallelization and automation to increase the accuracy and accessibility of GBS data analysis. Compared to other GBS pipelines, GB-eaSy rapidly and accurately identified the greatest number of SNPs, with SNP calls closely concordant with whole-genome sequencing of selected lines. Across all five GBS analysis platforms, SNP calls showed unexpectedly low convergence but generally high accuracy, indicating that the workflows arrived at largely complementary sets of valid SNP calls on the low-coverage data analyzed. We show that GB-eaSy is approximately as good as, or better than, other leading software solutions in the accuracy, yield and missing data fraction of variant calling, as tested on low-coverage genomic data from soybean. It also performs well relative to other solutions in terms of the run time and disk space required. In addition, GB-eaSy is built from existing open-source, modular software packages that are regularly updated and commonly used, making it straightforward to install and maintain
The Biomolecule Sequencer Project: Nanopore Sequencing as a Dual-Use Tool for Crew Health and Astrobiology Investigations

Science.gov (United States)

John, K. K.; Botkin, D. S.; Burton, A. S.; Castro-Wallace, S. L.; Chaput, J. D.; Dworkin, J. P.; Lehman, N.; Lupisella, M. L.; Mason, C. E.; Smith, D. J.;

2016-01-01

Human missions to Mars will fundamentally transform how the planet is explored, enabling new scientific discoveries through more sophisticated sample acquisition and processing than can currently be implemented in robotic exploration. The presence of humans also poses new challenges, including ensuring astronaut safety and health and monitoring contamination. Because the capability to transfer materials to Earth will be extremely limited, there is a strong need for in situ diagnostic capabilities. Nucleotide sequencing is a particularly powerful tool because it can be used to: (1) mitigate microbial risks to crew by allowing identification of microbes in water, in air, and on surfaces; (2) identify optimal treatment strategies for infections that arise in crew members; and (3) track how crew members, microbes, and mission-relevant organisms (e.g., farmed plants) respond to conditions on Mars through transcriptomic and genomic changes. Sequencing would also offer benefits for science investigations occurring on the surface of Mars by permitting identification of Earth-derived contamination in samples. If Mars contains indigenous life, and that life is based on nucleic acids or other closely related molecules, sequencing would serve as a critical tool for the characterization of those molecules. Therefore, spaceflight-compatible nucleic acid sequencing would be an important capability for both crew health and astrobiology exploration. Advances in sequencing technology on Earth have been driven largely by needs for higher throughput and read accuracy. Although some reduction in size has been achieved, nearly all commercially available sequencers are not compatible with spaceflight due to size, power, and operational requirements. Exceptions are nanopore-based sequencers that measure changes in current caused by DNA passing through pores; these devices are inherently much smaller and require significantly less power than sequencers using other detection methods

Complete plastid genome sequence of Primula sinensis (Primulaceae): structure comparison, sequence variation and evidence for accD transfer to nucleus

Directory of Open Access Journals (Sweden)

Tong-Jian Liu

2016-06-01

The species-rich genus Primula L. is a typical plant group for studying genetic variation between species at different levels of relatedness. Chloroplast genome sequences serve as the information resource for quantifying this variation and reconstructing evolutionary history. In this study, we report the complete chloroplast genome sequence of Primula sinensis and compare it with those of related species. The chloroplast genome showed a typical circular quadripartite structure, 150,859 bp in length with a GC content of 37.2%. Two inverted repeat (IR) regions (25,535 bp each) were separated by a large single-copy region (82,064 bp) and a small single-copy region (17,725 bp). The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. The accD and infA genes, which lack intact open reading frames (ORFs), were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome of Primula sinensis, comparing it with another available plastome, that of P. poissonii. The four most variable regions, rpl36–rps8, rps16–trnQ, trnH–psbA and ndhC–trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences gave identical results that were consistent with previous studies. A transcript found in the P. sinensis transcriptome showed high similarity to the plastid accD functional region and carried a putative plastid transit peptide at its N-terminal region. The result strongly suggests that plastid accD has been functionally transferred to the nucleus in P. sinensis.
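The reported region lengths and gene counts can be cross-checked with simple arithmetic. The short Python sketch below uses only figures quoted in the abstract and assumes that the 25,535 bp figure applies to each of the two inverted repeat copies, which is the reading under which the parts sum to the stated total. The function name is a hypothetical helper introduced for this check.

```python
# Figures quoted in the abstract (base pairs).
LSC = 82_064      # large single-copy region
SSC = 17_725      # small single-copy region
IR = 25_535       # one inverted repeat copy (assumed to apply to each copy)
TOTAL = 150_859   # reported plastome length

def quadripartite_length(lsc, ssc, ir):
    """Plastome length implied by one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir

# Gene categories quoted in the abstract.
PROTEIN_CODING, TRNA, RRNA = 78, 30, 4

length_matches = quadripartite_length(LSC, SSC, IR) == TOTAL
genes_match = PROTEIN_CODING + TRNA + RRNA == 112
```

Both checks hold for the quoted numbers: 82,064 + 17,725 + 2 × 25,535 = 150,859 bp, and 78 + 30 + 4 = 112 genes.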
Study and realisation of a programmable generator of pulse sequences, for nuclear magnetic resonance

International Nuclear Information System (INIS)

Lambert, Daniel

1974-01-01

After recalling the operation of pulse-based nuclear magnetic resonance and the use of pulse sequences in NMR-based measurements, and outlining the need for a pulse sequence generator, the author reports the design and realisation of such a device. He describes its general organisation with its base sequence, base clock, sequence start, duration, displays, data transfers, data processing, and signal distribution. He presents the chosen technology (ECL logic), the sequence base set, time bases, multiplexers, comparison sets, the distribution set, the sequence programming, and the sampling and output set. He reports tests and the use of the resulting generator.
Harmonic spectral components in time sequences of Markov correlated events

Science.gov (United States)

Mazzetti, Piero; Carbone, Anna

2017-07-01

The paper analyses the conditions under which time sequences of Markov correlated events give rise to a line power spectrum of relevant physical interest. It is found that by specializing the Markov matrix in order to represent closed loop sequences of events with arbitrary distribution, generated in a steady physical condition, a large set of line spectra, covering all possible frequency values, is obtained. The amplitude of the spectral lines is given by a matrix equation based on a generalized Markov matrix involving the Fourier transform of the distribution functions representing the time intervals between successive events of the sequence. The paper complements a previous work in which a general expression for the continuous power spectrum was given. In that case the Markov matrix was left in a more general form, which prevented line spectra of physical interest from being found. The present extension is also motivated by the interest in explaining the emergence of a broad set of waves found in electro- and magnetoencephalograms, whose frequency ranges from 0.5 to about 40 Hz, in terms of the effects produced by chains of firing neurons within the complex neural network of the brain. An original model based on synchronized closed loop sequences of firing neurons is proposed, and a few numerical simulations are reported as an application of the above cited equation.
Time-resolved echo-shared parallel MRA of the lung: observer preference study of image quality in comparison with non-echo-shared sequences

International Nuclear Information System (INIS)

Fink, C.; Puderbach, M.; Zaporozhan, J.; Plathow, C.; Kauczor, H.-U.; Ley, S.

2005-01-01

The aim of this study was to evaluate the image quality of time-resolved echo-shared parallel MRA of the lung. The pulmonary vasculature of nine patients (seven females, two males; median age: 44 years) with pulmonary disease was examined using a time-resolved MRA sequence combining echo sharing with parallel imaging (time-resolved echo-shared angiography technique, or TREAT). The sharpness of the vessel borders, conspicuousness of peripheral lung vessels, artifact level, and overall image quality of TREAT was assessed independently by four readers in a side-by-side comparison with non-echo-shared time-resolved parallel MRA data (pMRA) previously acquired in the same patients. Furthermore, the SNR of pulmonary arteries (PA) and veins (PV) achieved with both pulse sequences was compared. The mean voxel size of TREAT MRA was decreased by 24% compared with the non-echo-shared MRA. Regarding the sharpness of the vessel borders, conspicuousness of peripheral lung vessels, and overall image quality the TREAT sequence was rated superior in 75-76% of all cases. If the TREAT images were preferred over the pMRA images, the advantage was rated as major in 61-71% of all cases. The level of artifacts was not increased with the TREAT sequence. The mean interobserver agreement for all categories ranged between fair (artifact level) and good (overall image quality). The maximum SNR of TREAT did not differ from non-echo-shared parallel MRA (PA: TREAT: 273±45; pMRA: 280±71; PV: TREAT: 273±33; pMRA: 258±62). TREAT achieves a higher spatial resolution than non-echo-shared parallel MRA which is also perceived as an improved image quality. (orig.)
Reference genome sequence of the model plant Setaria.

Science.gov (United States)

Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M

2012-05-13

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Reference genome sequence of the model plant Setaria

Energy Technology Data Exchange (ETDEWEB)

Bennetzen, Jeffrey L [ORNL; Schmutz, Jeremy [Hudson Alpha Institute of Biotechnology; Wang, Hao [University of Georgia, Athens, GA; Percifield, Ryan [University of Georgia, Athens, GA; Hawkins, Jennifer [University of Georgia, Athens, GA; Pontaroli, Ana C. [University of Georgia, Athens, GA; Estep, Matt [University of Georgia, Athens, GA; Feng, Liang [University of Georgia, Athens, GA; Vaughn, Justin N [ORNL; Grimwood, Jane [Hudson Alpha Institute of Biotechnology; Jenkins, Jerry [Hudson Alpha Institute of Biotechnology; Barry, Kerrie [U.S. Department of Energy, Joint Genome Institute; Lindquist, Erika [U.S. Department of Energy, Joint Genome Institute; Hellsten, Uffe [U.S. Department of Energy, Joint Genome Institute; Deshpande, Shweta [U.S. Department of Energy, Joint Genome Institute; Wang, Xuewen [University of Georgia, Athens, GA; Wu, Xiaomei [University of Georgia, Athens, GA; Mitros, Therese [University of California, Berkeley; Triplett, Jimmy [University of Missouri, St. Louis; Yang, Xiaohan [ORNL; Ye, Chuyu [ORNL; Mauro-Herrera, Margarita [Oklahoma State University; Wang, Lin [Cornell University; Li, Pinghua [Cornell University; Sharma, Manoj [University of California, Davis; Sharma, Rita [University of California, Davis; Ronald, Pamela [University of California, Davis; Panaud, Olivier [Universite de Perpignan, Perpignan, France; Kellogg, Elizabeth A. [University of Missouri, St. Louis; Brutnell, Thomas P. [Cornell University; Doust, Andrew N. [Oklahoma State University; Tuskan, Gerald A [ORNL; Rokhsar, Daniel [U.S. Department of Energy, Joint Genome Institute; Devos, Katrien M [ORNL

2012-01-01

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).

Reference genome sequence of the model plant Setaria

Energy Technology Data Exchange (ETDEWEB)

Bennetzen, Jeffrey L [ORNL]; Yang, Xiaohan [ORNL]; Ye, Chuyu [ORNL]; Tuskan, Gerald A [ORNL]

2012-01-01

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Ancient Human Genome Sequence of an Extinct Palaeo-Eskimo

DEFF Research Database (Denmark)

Rasmussen, Morten; Li, Yingrui; Lindgreen, Stinus

2010-01-01

We report here the genome sequence of an ancient human. Obtained from approximately 4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20x, we recover 79% of the diploid genome...... possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence...
Sequence comparison of the rDNA introns from six different species of Tetrahymena

DEFF Research Database (Denmark)

Nielsen, Henrik; Engberg, J

1985-01-01

model for the intron RNA of Cech et al. (Proc. Natl. Acad. Sci. U.S.A. 80, 3903 (83)). Most of the sequence variation in the four new sequences reported here is found in single stranded loops in the model. However, in four cases we found nucleotide substitutions in duplex stem regions, two of them...
A prospective comparison of the efficacy and safety of fully closed ...

African Journals Online (AJOL)

We conducted a within-group comparison of three modes of ventilation, ASV, Intellivent-ASV and SIMV, using a Hamilton S1 ventilator (Hamilton Medical, Switzerland). Subjects were ventilated for 2 hours on each mode, and at the end of each 2-hour period, parameters of ventilation and haemodynamics were measured.
Finished Genome Sequence of Collimonas arenae Cal35

NARCIS (Netherlands)

Wu, Je-Jia; de Jager, Victor; Deng, Wen-ling; Leveau, Johan

2015-01-01

We announce the finished genome sequence of soil forest isolate Collimonas arenae Cal35, which comprises a 5.6-Mbp chromosome and 41-kb plasmid. The Cal35 genome is the second one published for the bacterial genus Collimonas and represents the first opportunity for high-resolution comparison of
Chloroplast Genome Sequence of pigeonpea (Cajanus cajan (L.) Millspaugh) and Cajanus scarabaeoides: Genome organization and Comparison with other legumes

Directory of Open Access Journals (Sweden)

Tanvi Kaila

2016-12-01

Pigeonpea (Cajanus cajan (L.) Millspaugh), a diploid (2n = 22) legume crop with a genome size of 852 Mbp, serves as an important source of human dietary protein, especially in South East Asian and African regions. In this study, the draft chloroplast genomes of Cajanus cajan and Cajanus scarabaeoides were sequenced. Cajanus scarabaeoides is an important species of the Cajanus gene pool and has also been used by different groups to develop a promising CMS system. A male sterile genotype harbouring the Cajanus scarabaeoides cytoplasm was used for sequencing the plastid genome. The cp genome of Cajanus cajan is 152,242 bp long, having a quadripartite structure with LSC of 83,455 bp and SSC of 17,871 bp separated by IRs of 25,398 bp. Similarly, the cp genome of Cajanus scarabaeoides is 152,201 bp long, having a quadripartite structure in which IRs of 25,402 bp length separate 83,423 bp of LSC and 17,854 bp of SSC. The pigeonpea cp genome contains 116 unique genes, including 30 tRNA, 4 rRNA, 78 predicted protein coding genes and 5 pseudogenes. A 50 kb inversion was observed in the LSC region of the pigeonpea cp genome, consistent with other legumes. Comparison of the cp genome with other legumes revealed the contraction of IR boundaries due to the absence of the rps19 gene in the IR region. Chloroplast SSRs were mined and a total of 280 and 292 cpSSRs were identified in Cajanus scarabaeoides and Cajanus cajan respectively. RNA editing was observed at 37 sites in both Cajanus scarabaeoides and Cajanus cajan, with maximum occurrence in the ndh genes. The pigeonpea cp genome sequence would be beneficial in providing informative molecular markers which can be utilized for genetic diversity analysis and aid in understanding plant systematics among major grain legumes.
A comparison of rumen microbial profiles in dairy cows as retrieved by 454 Roche and Ion Torrent (PGM) sequencing platforms

Directory of Open Access Journals (Sweden)

Nagaraju Indugu

2016-02-01

Next generation sequencing (NGS) technology is a widely accepted tool used by microbial ecologists to explore complex microbial communities in different ecosystems. As new NGS platforms continue to become available, it becomes imperative to compare data obtained from different platforms and analyze their effect on microbial community structure. In the present study, we compared sequencing data from both the 454 and Ion Torrent (PGM) platforms on the same DNA samples obtained from the rumen of dairy cows during their transition period. Despite the substantial difference in the number of reads, error rate and length of reads between the two platforms, we identified similar community composition between the two data sets. Procrustes analysis revealed similar correlations (M2 = 0.319; P = 0.001) in the microbial community composition between the two platforms. Both platforms revealed the abundance of the same bacterial phyla, Bacteroidetes and Firmicutes; however, PGM recovered an additional four phyla. Comparisons made at the genus level by each platform revealed differences in only a few genera, such as Prevotella, Ruminococcus, Succiniclasticum and Treponema (p < 0.05; chi square test). Collectively, we conclude that the output generated from PGM and 454 yielded concurrent results, provided stringent bioinformatics pipelines are employed.
Comparison of ELISA, nested PCR and sequencing and a novel qPCR for detection of Giardia isolates from Jordan.

Science.gov (United States)

Hijjawi, Nawal; Yang, Rongchang; Hatmal, Ma'mon; Yassin, Yasmeen; Mharib, Taghrid; Mukbel, Rami; Mahmoud, Sameer Alhaj; Al-Shudifat, Abdel-Ellah; Ryan, Una

2018-02-01

Little is known about the prevalence of Giardia duodenalis in human patients in Jordan and all previous studies have used direct microscopy, which lacks sensitivity. The present study developed a novel quantitative PCR (qPCR) assay at the β-giardin (bg) locus and evaluated its use as a frontline test for the diagnosis of giardiasis in comparison with a commercially available ELISA using nested PCR and sequencing of the glutamate dehydrogenase (gdh) locus (gdh nPCR) as the gold standard. A total of 96 human faecal samples were collected from 96 patients suffering from diarrhoea from 5 regions of Jordan and were screened using the ELISA and qPCR. The analytical specificity of the bg qPCR assay revealed no cross-reactions with other genera and detected all the Giardia isolates tested. Analytical sensitivity was 1 Giardia cyst per μl of DNA extract. The overall prevalence of Giardia was 64.6%. The clinical sensitivity and specificity of the bg qPCR was 89.9% and 82.9% respectively compared to 76.5 and 68.0% for the ELISA. This study is the first to compare three different methods (ELISA, bg qPCR, nested PCR and sequencing at the gdh locus) to diagnose Jordanian patients suffering from giardiasis and to analyze their demographic data. Copyright © 2018 Elsevier Inc. All rights reserved.
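The clinical sensitivity and specificity quoted above are ratios over a confusion matrix built against the gold standard (here, gdh nPCR and sequencing). A minimal Python sketch of that arithmetic follows; the function name and the counts are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp/fn: gold-standard positives that the test called positive/negative.
    tn/fp: gold-standard negatives that the test called negative/positive.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one gold-standard positive and one negative")
    sensitivity = tp / (tp + fn)  # fraction of true cases detected
    specificity = tn / (tn + fp)  # fraction of non-cases correctly cleared
    return sensitivity, specificity


# Hypothetical counts for illustration only (not the study's data).
sens, spec = sensitivity_specificity(tp=45, fn=5, tn=40, fp=10)
print(f"sensitivity={sens:.1%}, specificity={spec:.1%}")
```

Both figures depend on the choice of gold standard, which is why the abstract names it explicitly.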
Sequence comparison for non-enhanced MRA of the lower extremity arteries at 7 Tesla.

Directory of Open Access Journals (Sweden)

Sören Johst

In this study three sequences for non-contrast-enhanced MRA of the lower extremity arteries at 7T were compared. Cardiac triggering was used with the aim to reduce signal variations in the arteries. Two fast single-shot 2D sequences, a modified Ultrafast Spoiled Gradient Echo (UGRE) sequence and a variant of the Quiescent-Interval Single-Shot (QISS) sequence, were triggered via phonocardiogram and compared in volunteer examinations to a non-triggered 2D gradient echo (GRE) sequence. For image acquisition, a 16-channel transmit/receive coil and a manually positionable AngioSURF table were used. To tackle B1 inhomogeneities at 7T, Time-Interleaved Acquisition of Modes (TIAMO) was integrated in GRE and UGRE. To compare the three sequences quantitatively, a vessel-to-background ratio (VBR) was measured in all volunteers and stations. In conclusion, cardiac triggering was able to suppress flow artifacts satisfactorily. The modified UGRE showed only moderate image artifacts. Averaged over all volunteers and stations, GRE reached a VBR of 4.18±0.05, UGRE 5.20±0.06, and QISS 2.72±0.03. Using cardiac triggering and the TIAMO imaging technique was essential to perform non-enhanced MRA of the lower extremity vessels at 7T. The modified UGRE performed best, as observed artifacts were only moderate and the highest average VBR was reached.
Comparison of 3 T and 7 T MRI clinical sequences for ankle imaging

Energy Technology Data Exchange (ETDEWEB)

Juras, Vladimir, E-mail: vladimir.juras@meduniwien.ac.at [Medical University of Vienna, Department of Radiology, Vienna General Hospital, Waeringer Guertel 18-20, A-1090 Vienna (Austria); Slovak Academy of Sciences, Institute of Measurement Science, Dubravska cesta 9, 84104 Bratislava (Slovakia); Welsch, Goetz, E-mail: welsch@bwh.harvard.edu [Medical University of Vienna, Department of Radiology, Vienna General Hospital, Waeringer Guertel 18-20, A-1090 Vienna (Austria); Baer, Peter, E-mail: baerpeter@siemens.com [Siemens Healthcare, Richard-Strauss-Strasse 76, D81679 Munich (Germany); Kronnerwetter, Claudia, E-mail: claudia.kronnerwetter@meduniwien.ac.at [Medical University of Vienna, Department of Radiology, Vienna General Hospital, Waeringer Guertel 18-20, A-1090 Vienna (Austria); Fujita, Hiroyuki, E-mail: hiroyuki.fujita@qualedyn.com [Quality Electrodynamics, LCC, 777 Beta Dr, Cleveland, OH 44143-2336 (United States); Trattnig, Siegfried, E-mail: siegfried.trattnig@meduniwien.ac.at [Medical University of Vienna, Department of Radiology, Vienna General Hospital, Waeringer Guertel 18-20, A-1090 Vienna (Austria)

2012-08-15

The purpose of this study was to compare 3 T and 7 T signal-to-noise and contrast-to-noise ratios of clinical sequences for imaging of the ankles with optimized sequences and dedicated coils. Ten healthy volunteers were examined consecutively on both systems with three clinical sequences: (1) 3D gradient-echo, T1-weighted; (2) 2D fast spin-echo, PD-weighted; and (3) 2D spin-echo, T1-weighted. SNR was calculated for six regions: cartilage; bone; muscle; synovial fluid; Achilles tendon; and Kager's fat-pad. CNR was obtained for cartilage/bone, cartilage/fluid, cartilage/muscle, and muscle/fat-pad, and compared by a one-way ANOVA test for repeated measures. Mean SNR increased significantly at 7 T compared to 3 T for 3D GRE and 2D TSE, by 60.9% and 86.7%, respectively. In contrast, an average SNR decrease of almost 25% was observed in the 2D SE sequence. A CNR increase was observed in 2D TSE images, and in most 3D GRE images. There was a substantial benefit from ultra-high-field MR imaging of ankles with routine clinical sequences at 7 T compared to 3 T. Higher SNR and CNR at ultra-high-field MR scanners may be useful in clinical practice for ankle imaging. However, carefully optimized protocols and dedicated extremity coils are necessary to obtain optimal results.
Complete sequencing of IncI1 sequence type 2 plasmid pJIE512b indicates mobilization of blaCMY-2 from an IncA/C plasmid.

Science.gov (United States)

Tagg, Kaitlin A; Iredell, Jonathan R; Partridge, Sally R

2014-08-01

Sequencing of pJIE512b, a 92.3-kb IncI1 sequence type 2 (ST2) plasmid carrying bla(CMY-2), revealed a bla(CMY-2) context that appeared to have been mobilized from an IncA/C plasmid by the insertion sequence IS1294. A comparison with published plasmids suggests that bla(CMY-2) has been mobilized from IncA/C to IncI1 plasmids more than once by IS1294-like elements. Alignment of pJIE512b with the only other available IncI1 ST2 plasmid revealed differences across the backbones, indicating variability within this sequence type. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Resolution of the African hominoid trichotomy by use of a mitochondrial gene sequence

Energy Technology Data Exchange (ETDEWEB)

Ruvolo, M.; Disotell, T.R.; Allard, M.W. (Harvard Univ., Cambridge, MA (United States)); Brown, W.M. (Univ. of Michigan, Ann Arbor (United States)); Honeycutt, R.L. (Texas A and M Univ., College Station (United States))

1991-02-15

Mitochondrial DNA sequences encoding the cytochrome oxidase subunit II gene have been determined for five primate species, siamang (Hylobates syndactylus), lowland gorilla (Gorilla gorilla), pygmy chimpanzee (Pan paniscus), crab-eating macaque (Macaca fascicularis), and green monkey (Cercopithecus aethiops), and compared with published sequences of other primate and nonprimate species. Comparisons of cytochrome oxidase subunit II gene sequences provide clear-cut evidence from the mitochondrial genome for the separation of the African ape trichotomy into two evolutionary lineages, one leading to gorillas and the other to humans and chimpanzees. Several different tree-building methods support this same phylogenetic tree topology. The comparisons also yield trees in which a substantial length separates the divergence point of gorillas from that of humans and chimpanzees, suggesting that the lineage most immediately ancestral to humans and chimpanzees may have been in existence for a relatively long time.
Resolution of the African hominoid trichotomy by use of a mitochondrial gene sequence

International Nuclear Information System (INIS)

Ruvolo, M.; Disotell, T.R.; Allard, M.W.; Brown, W.M.; Honeycutt, R.L.

1991-01-01

Mitochondrial DNA sequences encoding the cytochrome oxidase subunit II gene have been determined for five primate species, siamang (Hylobates syndactylus), lowland gorilla (Gorilla gorilla), pygmy chimpanzee (Pan paniscus), crab-eating macaque (Macaca fascicularis), and green monkey (Cercopithecus aethiops), and compared with published sequences of other primate and nonprimate species. Comparisons of cytochrome oxidase subunit II gene sequences provide clear-cut evidence from the mitochondrial genome for the separation of the African ape trichotomy into two evolutionary lineages, one leading to gorillas and the other to humans and chimpanzees. Several different tree-building methods support this same phylogenetic tree topology. The comparisons also yield trees in which a substantial length separates the divergence point of gorillas from that of humans and chimpanzees, suggesting that the lineage most immediately ancestral to humans and chimpanzees may have been in existence for a relatively long time
A birth-death process suggested by a chain sequence

NARCIS (Netherlands)

Lenin, R.B.; Parthasarathy, P.R.

2000-01-01

We consider a birth-death process whose birth and death rates are suggested by a chain sequence. We use an elegant transformation to find the transition probabilities in a simple closed form. We also find an explicit expression for time-dependent mean. We find parallel results in discrete time.
Comparative photoluminescence study of close-packed and colloidal InP/ZnS quantum dots

Science.gov (United States)

Thuy, Ung Thi Dieu; Thuy, Pham Thi; Liem, Nguyen Quang; Li, Liang; Reiss, Peter

2010-02-01

This letter reports on the comparative photoluminescence study of InP/ZnS quantum dots in the close-packed solid state and in colloidal solution. The steady-state photoluminescence spectrum of the close-packed InP/ZnS quantum dots peaks at a longer wavelength than that of the colloidal ones. Time-resolved photoluminescence shows that the close-packed quantum dots possess a shorter luminescence decay time and strongly increased spectral shift with the time delayed from the excitation moment in comparison with the colloidal ones. The observed behavior is discussed on the basis of energy transfer enabled by the short interparticle distance between the close-packed quantum dots.
Retrospective comparison of three-dimensional imaging sequences in the visualization of posterior fossa cranial nerves.

Science.gov (United States)

Ors, Suna; Inci, Ercan; Turkay, Rustu; Kokurcan, Atilla; Hocaoglu, Elif

2017-12-01

To compare the efficacy of three-dimensional SPACE (sampling perfection with application-optimized contrasts using different flip-angle evolutions) and CISS (constructive interference in steady state) sequences in the imaging of the cisternal segments of cranial nerves V-XII. Temporal MRI scans from 50 patients (F:M ratio, 27:23; mean age, 44.5±15.9 years) admitted to our hospital with vertigo, tinnitus, and hearing loss were retrospectively analyzed. All patients had both CISS and SPACE sequences. Quantitative analysis of SPACE and CISS sequences was performed by measuring the ventricle-to-parenchyma contrast-to-noise ratio (CNR). Qualitative analysis of differences in visualization capability, image quality, and severity of artifacts was also conducted. A score ranging from 'no artifact' to 'severe artifacts and unreadable' was used for the assessment of artifacts, and from 'not visualized' to 'completely visualized' for the assessment of image quality. The distribution of variables was checked with the Kolmogorov-Smirnov test. Samples t-test and McNemar's test were used to determine statistical significance. Rates of visualization of posterior fossa cranial nerves in cases of complete visualization were as follows: nerve V (100% for both sequences), nerve VI (94% in SPACE, 86% in CISS sequences), nerves VII-VIII (100% for both sequences), IX-XI nerve complex (96%, 88%); nerve XII (58%, 46%) (p<0.05). SPACE sequences showed fewer artifacts than CISS sequences (p<0.002). Copyright © 2017 Elsevier B.V. All rights reserved.
Draft Genome Sequences of Three β-Lactam-Catabolizing Soil Proteobacteria

DEFF Research Database (Denmark)

Crofts, Terence S.; Wang, Bin; Spivak, Aaron

2017-01-01

Most antibiotics are derived from the soil, but their catabolism there, which is necessary to close the antibiotic carbon cycle, remains uncharacterized. We report the first draft genome sequences of soil Proteobacteria identified for subsisting solely on β-lactams as their carbon sources...
Reducing assembly complexity of microbial genomes with single-molecule sequencing

Science.gov (United States)

Genome assembly algorithms cannot fully reconstruct microbial chromosomes from the DNA reads output by first or second-generation sequencing instruments. Therefore, most genomes are left unfinished due to the significant resources required to manually close gaps left in the draft assemblies. Single-...
Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

Directory of Open Access Journals (Sweden)

Maggi Giorgio P

2008-06-01

Background: The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent) on the availability of annotated proteins. Results: In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion: Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence and is thus suitable for the analysis of new or poorly annotated genomes.
MetaGaAP: A Novel Pipeline to Estimate Community Composition and Abundance from Non-Model Sequence Data

Directory of Open Access Journals (Sweden)

Christopher Noune

2017-02-01

Next generation sequencing and bioinformatic approaches are increasingly used to quantify microorganisms within populations by analysis of ‘meta-barcode’ data. This approach relies on comparison of amplicon sequences of ‘barcode’ regions from a population with public-domain databases of reference sequences. However, for many organisms relevant ‘barcode’ regions may not have been identified and large databases of reference sequences may not be available. A workflow and software pipeline, ‘MetaGaAP,’ was developed to identify and quantify genotypes through four steps: shotgun sequencing and identification of polymorphisms in a metapopulation to identify custom ‘barcode’ regions of less than 30 polymorphisms within the span of a single ‘read’, amplification and sequencing of the ‘barcode’, generation of a custom database of polymorphisms, and quantitation of the relative abundance of genotypes. The pipeline and workflow were validated in a ‘wild type’ Alphabaculovirus isolate, Helicoverpa armigera single nucleopolyhedrovirus (HaSNPV-AC53), and a tissue-culture derived strain (HaSNPV-AC53-T2). The approach was validated by comparison of polymorphisms in amplicons and shotgun data, and by comparison of predicted dominant and co-dominant genotypes with Sanger sequences. The computational power required to generate and search the database effectively limits the number of polymorphisms that can be included in a barcode to 30 or less. The approach can be used in quantitative analysis of the ecology and pathology of non-model organisms.

Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000.

Energy Technology Data Exchange (ETDEWEB)

Feil, Helene; Feil, William S.; Chain, Patrick; Larimer, Frank; DiBartolo, Genevieve; Copeland, Alex; Lykidis, Athanasios; Trong, Stephen; Nolan, Matt; Goltsman, Eugene; Thiel, James; Malfatti, Stephanie; Loper, Joyce E.; Lapidus, Alla; Detter, John C.; Land, Miriam; Richardson, Paul M.; Kyrpides, Nikos C.; Ivanova, Natalia; Lindow, Steven E.

2005-04-01

The complete genomic sequence of Pseudomonas syringae pathovar syringae B728a (Pss B728a) has been determined and is compared with that of Pseudomonas syringae pv. tomato DC3000 (Pst DC3000). These two pathovars of this economically important species of plant pathogenic bacteria differ in host range and apparent patterns of interaction with plants, with Pss having a more pronounced epiphytic stage of growth and higher abiotic stress tolerance and Pst DC3000 having a more pronounced apoplastic growth habitat. The Pss B728a genome (6.1 megabases) contains a circular chromosome and no plasmid, whereas the Pst DC3000 genome is 6.5 Mbp in size, composed of a circular chromosome and two plasmids. While a high degree of similarity exists between the two sequenced Pseudomonads, 976 protein-encoding genes are unique to Pss B728a when compared to Pst DC3000, including large genomic islands likely to contribute to virulence and host specificity. Over 375 repetitive extragenic palindromic sequences (REPs) unique to Pss B728a when compared to Pst DC3000 are widely distributed throughout the chromosome except in 14 genomic islands, which generally had lower GC content than the genome as a whole. Content of the genomic islands varies, with one containing a prophage and another the plasmid pKLC102 of P. aeruginosa PAO1. Among the 976 genes of Pss B728a with no counterpart in Pst DC3000 are those encoding for syringopeptin (SP), syringomycin (SR), indole acetic acid biosynthesis, arginine degradation, and production of ice nuclei. The genomic comparison suggests that several unique genes for Pss B728a such as ectoine synthase, DNA repair, and antibiotic production may contribute to epiphytic fitness and stress tolerance of this organism. Pseudomonas syringae, a member of the gamma subgroup of the Proteobacteria, is a widespread bacterial pathogen of many plant species. The species P. syringae is subdivided into approximately 50 pathovars based on pathogenicity and host range. P. syringae is capable of
Car sequencing is NP-hard: a short proof

OpenAIRE

B Estellon; F Gardi

2013-01-01

In this note, a new proof is given that the car sequencing (CS) problem is NP-hard. Established from the Hamiltonian Path problem, the reduction is direct while closing some gaps remaining in the previous NP-hardness results. Since CS is studied in many operational research courses, this result and its proof are particularly interesting for teaching purposes.
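The abstract does not restate the problem, so the following Python sketch assumes the formulation usually taught: each option has a ratio p/q, meaning any window of q consecutive cars may contain at most p cars requiring that option. Checking a given sequence is easy (finding a feasible one is the NP-hard part). The function name and the example options are hypothetical.

```python
def count_violations(sequence, ratios):
    """Count sliding windows that break a p/q option constraint.

    sequence: list of sets, the options required by each car in line order.
    ratios:   dict mapping option -> (p, q), at most p cars per q-window.
    """
    violations = 0
    for option, (p, q) in ratios.items():
        needs = [1 if option in car else 0 for car in sequence]
        # Slide a window of length q along the line.
        for start in range(len(needs) - q + 1):
            if sum(needs[start:start + q]) > p:
                violations += 1
    return violations


# Hypothetical instance: sunroof at most 1 in 2, air conditioning at most 2 in 3.
ratios = {"sunroof": (1, 2), "ac": (2, 3)}
good = [{"sunroof"}, {"ac"}, {"sunroof", "ac"}, set(), {"ac"}]
bad = [{"sunroof"}, {"sunroof"}, {"ac"}, {"ac"}, {"ac"}]
print(count_violations(good, ratios), count_violations(bad, ratios))
```

Verification runs in time polynomial in the sequence length, which is what places the decision version in NP.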
Human Coronaviruses 229E and NL63: Close Yet Still So Far

NARCIS (Netherlands)

Dijkman, Ronald; van der Hoek, Lia

2009-01-01

HCoV-NL63 and HCoV-229E are two of the four human coronaviruses that circulate worldwide. These two viruses are unique in their relationship towards each other. Phylogenetically, the viruses are more closely related to each other than to any other human coronavirus, yet they only share 65% sequence
The family of DOF transcription factors in Brachypodium distachyon: phylogenetic comparison with rice and barley DOFs and expression profiling

Directory of Open Access Journals (Sweden)

Hernando-Amado Sara

2012-11-01

Full Text Available Abstract Background Transcription factors (TFs are proteins that have played a central role both in evolution and in domestication, and are major regulators of development in living organisms. Plant genome sequences reveal that approximately 7% of all genes encode putative TFs. The DOF (DNA binding with One Finger TF family has been associated with vital processes exclusive to higher plants and to their close ancestors (algae, mosses and ferns. These are seed maturation and germination, light-mediated regulation, phytohormone and plant responses to biotic and abiotic stresses, etc. In Hordeum vulgare and Oryza sativa, 26 and 30 different Dof genes, respectively, have been annotated. Brachypodium distachyon has been the first Pooideae grass to be sequenced and, due to its genomic, morphological and physiological characteristics, has emerged as the model system for temperate cereals, such as wheat and barley. Results Through searches in the B. distachyon genome, 27 Dof genes have been identified and a phylogenetic comparison with the Oryza sativa and the Hordeum vulgare DOFs has been performed. To explore the evolutionary relationship among these DOF proteins, a combined phylogenetic tree has been constructed with the Brachypodium DOFs and those from rice and barley. This phylogenetic analysis has classified the DOF proteins into four Major Cluster of Orthologous Groups (MCOGs. Using RT-qPCR analysis the expression profiles of the annotated BdDof genes across four organs (leaves, roots, spikes and seeds has been investigated. These results have led to a classification of the BdDof genes into two groups, according to their expression levels. The genes highly or preferentially expressed in seeds have been subjected to a more detailed expression analysis (maturation, dry stage and germination. Conclusions Comparison of the expression profiles of the Brachypodium Dof genes with the published functions of closely related DOF sequences from the cereal
A time warping approach to multiple sequence alignment.

Science.gov (United States)

Arribas-Gil, Ana; Matias, Catherine

2017-04-25

We propose an approach for multiple sequence alignment (MSA) derived from the dynamic time warping viewpoint and recent techniques of curve synchronization developed in the context of functional data analysis. Starting from pairwise alignments of all the sequences (viewed as paths in a certain space), we construct a median path that represents the MSA we are looking for. We establish a proof of concept that our method could be an interesting ingredient to include in refined MSA techniques. We present a simple synthetic experiment as well as the study of a benchmark dataset, together with comparisons with two widely used MSA software packages.
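For readers unfamiliar with the dynamic time warping viewpoint mentioned above, here is a minimal Python sketch of the classic pairwise DTW recurrence on numeric series. It is not the authors' median-path MSA method; the function name and the absolute-difference cost are assumptions made for illustration.

```python
def dtw_distance(a, b, cost=lambda x, y: abs(x - y)):
    """Classic dynamic time warping distance between two sequences.

    d[i][j] holds the cheapest cumulative cost of aligning a[:i] with b[:j],
    where each step may advance in a, in b, or in both.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = cost(a[i - 1], b[j - 1])
            d[i][j] = step + min(d[i - 1][j],      # advance in a only
                                 d[i][j - 1],      # advance in b only
                                 d[i - 1][j - 1])  # advance in both
    return d[n][m]


# The second series repeats one value; warping absorbs the repeat at no cost.
print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

The warping path through the table plays the role that a pairwise alignment path plays for sequences, which is the analogy the abstract builds on.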
Investigating Correlation between Protein Sequence Similarity and Semantic Similarity Using Gene Ontology Annotations.

Science.gov (United States)

Ikram, Najmul; Qadir, Muhammad Abdul; Afzal, Muhammad Tanvir

2018-01-01

Sequence similarity is a commonly used measure to compare proteins. With the increasing use of ontologies, semantic (function) similarity is gaining importance. The correlation between these measures has been applied in the evaluation of new semantic similarity methods, and in protein function prediction. In this research, we investigate the relationship between the two similarity methods. The results suggest the absence of a strong correlation between sequence and semantic similarities. There are a large number of proteins with low sequence similarity and high semantic similarity. We observe that Pearson's correlation coefficient is not sufficient to explain the nature of this relationship. Interestingly, the term semantic similarity values above 0 and below 1 do not seem to play a role in improving the correlation. That is, the correlation coefficient depends only on the number of common GO terms in proteins under comparison, and the semantic similarity measurement method does not influence it. Semantic similarity and sequence similarity behave distinctly. These findings are significant for future work on protein comparison, and will help in better understanding the semantic similarity between proteins.
On generalized regular sequences and the finiteness for associated primes of local cohomology modules

International Nuclear Information System (INIS)

Le Thanh Nhan

2003-08-01

Let $(R,\mathfrak{m})$ be a Noetherian local ring and $M$ a finitely generated $R$-module. The two notions of generalized regular sequence and generalized depth are introduced as extensions of the known notions of regular sequence and depth respectively. Some properties of generalized regular sequence and generalized depth, which are closely related to that of regular sequence and depth, are given. If $x_1,\ldots,x_r$ is a generalized regular sequence of $M$, then $\bigcup_{n_1,\ldots,n_r} \operatorname{Ass} M/(x_1^{n_1},\ldots,x_r^{n_r})M$ is a finite set. Some finiteness properties for associated primes of local cohomology modules are presented. (author)
Hypercapnic normalization of BOLD fMRI: comparison across field strengths and pulse sequences

DEFF Research Database (Denmark)

Cohen, Eric R.; Rostrup, Egill; Sidaros, Karam

2004-01-01

to be more accurately localized and quantified based on changes in venous blood oxygenation alone. The normalized BOLD signal induced by the motor task was consistent across different magnetic fields and pulse sequences, and corresponded well with cerebral blood flow measurements. Our data suggest...... size, as well as experimental, such as pulse sequence and static magnetic field strength (B(0)). Thus, it is difficult to compare task-induced fMRI signals across subjects, field strengths, and pulse sequences. This problem can be overcome by normalizing the neural activity-induced BOLD fMRI response...... for global stimulation, subjects breathed a 5% CO(2) gas mixture. Under all conditions, voxels containing primarily large veins and those containing primarily active tissue (i.e., capillaries and small veins) showed distinguishable behavior after hypercapnic normalization. This allowed functional activity...
StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase

Energy Technology Data Exchange (ETDEWEB)

Zemla, A; Lang, D; Kostova, T; Andino, R; Zhou, C

2010-11-29

Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory; still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitate the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that share structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected
Genetic Characterization of Fasciola Isolates from West Azerbaijan Province Iran Based on ITS1 and ITS2 Sequence of Ribosomal DNA

Science.gov (United States)

GALAVANI, Hossein; GHOLIZADEH, Saber; HAZRATI TAPPEH, Khosrow

2016-01-01

Background: Fascioliasis, caused by Fasciola hepatica and F. gigantica, has medical and economic importance worldwide. Compared with traditional methods, molecular approaches to the identification and characterization of Fasciola spp. are precise and reliable. The aims of the current study were the molecular characterization of Fasciola spp. in West Azerbaijan Province, Iran, and their comparative analysis against GenBank sequences. Methods: A total of 580 isolates were collected in 2014 from different hosts, 90 slaughtered cattle (n=50) and sheep (n=40), in five cities of West Azerbaijan Province. After morphological identification and DNA extraction, specifically designed primers were used to amplify the ITS1, 5.8S and ITS2 regions, and 50 randomly selected samples were sequenced. Results: Using morphometric characters, 99.14% and 0.86% of isolates were identified as F. hepatica and F. gigantica, respectively. PCR amplification of a 1081 bp fragment and the sequencing results showed 100% similarity with F. hepatica in the ITS1 (428 bp), 5.8S (158 bp), and ITS2 (366 bp) regions. Sequence comparison between the current study sequences and GenBank data showed 98% identity with 11 nucleotide mismatches. However, in the phylogenetic tree, F. hepatica sequences from West Azerbaijan Province, Iran, were closely related to Iranian, Asian, and African isolates. Conclusions: Only F. hepatica is distributed among sheep and cattle in West Azerbaijan Province, Iran. However, the 5 and 6 bp variation in the ITS1 and ITS2 regions, respectively, is not enough to separate Fasciola spp. Therefore, more studies are essential to design new molecular markers for correct species identification. PMID:27095969
Integrated sequence analysis. Final report

International Nuclear Information System (INIS)

Andersson, K.; Pyy, P.

1998-02-01

The NKS/RAK subproject 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA, HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally, the project is discussed and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply them in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as
Integrated sequence analysis. Final report

Energy Technology Data Exchange (ETDEWEB)

Andersson, K.; Pyy, P

1998-02-01

The NKS/RAK subproject 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA, HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally, the project is discussed and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply them in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as
SWPhylo - A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees.

Science.gov (United States)

Yu, Xiaoyu; Reva, Oleg N

2018-01-01

Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, the unsuitability of standard phylogenetic tools for processing genome-wide data, and the lack of reliable substitution models that correspond to alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment, not to mention the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be improved by providing the program with alignments of marker protein sequences, e.g., GyrA.
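The alignment-free comparison described in this abstract can be illustrated with a minimal sketch: count every overlapping tetranucleotide in each genome, normalize the counts to frequencies, and compare the resulting 256-dimensional profiles. The function names and the use of a plain Euclidean distance are assumptions made for illustration; the abstract does not specify how SWPhylo normalizes or compares OUPs, so this is not the tool's actual method.

```python
from itertools import product
from math import sqrt

ALPHABET = "ACGT"
# All 4**4 = 256 possible tetranucleotides, in a fixed order.
TETRAMERS = ["".join(p) for p in product(ALPHABET, repeat=4)]


def tetranucleotide_profile(seq):
    """Return relative frequencies of all 256 tetranucleotides in seq."""
    seq = seq.upper()
    counts = dict.fromkeys(TETRAMERS, 0)
    for i in range(len(seq) - 3):
        word = seq[i:i + 4]
        if word in counts:  # skip windows containing N or other symbols
            counts[word] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence has no valid tetranucleotides")
    return [counts[w] / total for w in TETRAMERS]


def profile_distance(p, q):
    """Euclidean distance between two profiles (an assumed metric)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def distance_matrix(genomes):
    """Pairwise distances for a {name: sequence} mapping."""
    names = list(genomes)
    profiles = {n: tetranucleotide_profile(genomes[n]) for n in names}
    matrix = [[profile_distance(profiles[a], profiles[b]) for b in names]
              for a in names]
    return names, matrix
```

A distance matrix of this kind could then be passed to any standard distance-based tree-building method, such as neighbor joining.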
Gain and loss of phototrophic genes revealed by comparison of two Citromicrobium bacterial genomes.

Directory of Open Access Journals (Sweden)

Qiang Zheng

Proteobacteria are thought to have diverged from a phototrophic ancestor, according to the scattered distribution of phototrophy throughout the proteobacterial clade, and so the occurrence of numerous closely related phototrophic and chemotrophic microorganisms may be the result of the loss of genes for phototrophy. A widespread form of bacterial phototrophy is based on the photochemical reaction center, encoded by puf and puh operons that typically are in a 'photosynthesis gene cluster' (abbreviated as the PGC) with pigment biosynthesis genes. Comparison of two closely related Citromicrobial genomes (98.1% sequence identity of complete 16S rRNA genes), Citromicrobium sp. JL354, which contains two copies of reaction center genes, and Citromicrobium strain JLT1363, which is chemotrophic, revealed evidence for the loss of phototrophic genes. However, evidence of horizontal gene transfer was found in these two bacterial genomes. An incomplete PGC (pufLMC-puhCBA) in strain JL354 was located within an integrating conjugative element, which indicates a potential mechanism for the horizontal transfer of genes for phototrophy.
Correlated mutations in protein sequences: Phylogenetic and structural effects

Energy Technology Data Exchange (ETDEWEB)

Lapedes, A.S. [Los Alamos National Lab., NM (United States). Theoretical Div.]|[Santa Fe Inst., NM (United States); Giraud, B.G. [C.E.N. Saclay, Gif/Yvette (France). Service Physique Theorique; Liu, L.C. [Los Alamos National Lab., NM (United States). Theoretical Div.; Stormo, G.D. [Univ. of Colorado, Boulder, CO (United States). Dept. of Molecular, Cellular and Developmental Biology

1998-12-01

Covariation analysis of sets of aligned sequences for RNA molecules is relatively successful in elucidating RNA secondary structure, as well as some aspects of tertiary structure. Covariation analysis of sets of aligned sequences for protein molecules is successful in certain instances in elucidating certain structural and functional links, but in general, pairs of sites displaying highly covarying mutations in protein sequences do not necessarily correspond to sites that are spatially close in the protein structure. In this paper the authors identify two reasons why naive use of covariation analysis for protein sequences fails to reliably indicate sequence positions that are spatially proximate. The first reason involves the bias introduced in calculation of covariation measures due to the fact that biological sequences are generally related by a non-trivial phylogenetic tree. The authors present a null-model approach to solve this problem. The second reason involves linked chains of covariation which can result in pairs of sites displaying significant covariation even though they are not spatially proximate. They present a maximum entropy solution to this classic problem of causation versus correlation. The methodologies are validated in simulation.
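As a concrete illustration of the "naive" covariation measure that the abstract cautions about, the sketch below computes the mutual information between two columns of a toy alignment. Choosing mutual information as the measure is an assumption; the abstract does not name the statistic used. The function names are hypothetical, and the sketch applies neither the phylogenetic null model nor the maximum entropy correction described by the authors.

```python
from collections import Counter
from math import log2


def column(alignment, i):
    """Extract column i from a list of equal-length aligned sequences."""
    return [seq[i] for seq in alignment]


def mutual_information(alignment, i, j):
    """Mutual information (in bits) between alignment columns i and j."""
    n = len(alignment)
    col_i, col_j = column(alignment, i), column(alignment, j)
    count_i, count_j = Counter(col_i), Counter(col_j)
    count_ij = Counter(zip(col_i, col_j))
    mi = 0.0
    for (a, b), count in count_ij.items():
        p_ab = count / n
        p_a = count_i[a] / n
        p_b = count_j[b] / n
        mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi
```

For example, in the toy alignment `["AC", "AC", "GT", "GT"]` the two columns covary perfectly and the measure is 1 bit, while in `["AC", "AT", "GC", "GT"]` they are independent and the measure is 0. In real alignments, as the abstract notes, shared ancestry and chains of indirect coupling can produce high values for site pairs that are not in spatial contact.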
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.

Science.gov (United States)

Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A

2017-07-01

Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
AMS 4.0: consensus prediction of post-translational modifications in protein sequences.

Science.gov (United States)

Plewczynski, Dariusz; Basu, Subhadip; Saha, Indrajit

2012-08-01

We present here the 2011 update of the AutoMotif Service (AMS 4.0) that predicts a wide selection of 88 different types of single amino acid post-translational modifications (PTM) in protein sequences. The selection of experimentally confirmed modifications is acquired from the latest UniProt and Phospho.ELM databases for training. The sequence vicinity of each modified residue is represented using amino acid physico-chemical features encoded with high quality indices (HQI) obtained by automatic clustering of known indices extracted from the AAindex database. For each type of numerical representation, the method builds an ensemble of Multi-Layer Perceptron (MLP) pattern classifiers, each optimising a different objective during training (for example the recall, precision or area under the ROC curve (AUC)). The consensus is built using brainstorming technology, which combines multi-objective instances of a machine learning algorithm with data fusion of different training object representations, in order to boost the overall prediction accuracy of conserved short sequence motifs. The performance of AMS 4.0 is compared with the accuracy of previous versions, which were constructed using single machine learning methods (artificial neural networks, support vector machine). Our software improves the average AUC score of the earlier version by close to 7% as calculated on the test datasets of all 88 PTM types. Moreover, for selected most-difficult sequence motif types it is able to improve the prediction performance by almost 32% when compared with previously used single machine learning methods. In summary, the brainstorming consensus meta-learning methodology on average boosts the AUC score to around 89%, averaged over all 88 PTM types. Detailed results for single machine learning methods and the consensus methodology are also provided, together with the comparison to previously published methods and state-of-the-art software tools. The
Evaluation of multiple approaches to identify genome-wide polymorphisms in closely related genotypes of sweet cherry (Prunus avium L.)

Directory of Open Access Journals (Sweden)

Seanna Hewitt

Identification of genetic polymorphisms and subsequent development of molecular markers is important for marker assisted breeding of superior cultivars of economically important species. Sweet cherry (Prunus avium L.) is an economically important non-climacteric tree fruit crop in the Rosaceae family and has undergone a genetic bottleneck due to breeding, resulting in limited genetic diversity in the germplasm that is utilized for breeding new cultivars. Therefore, it is critical to recognize the best platforms for identifying genome-wide polymorphisms that can help identify, and consequently preserve, the diversity in a genetically constrained species. Four approaches were evaluated for the identification of genome-wide polymorphisms in five closely related genotypes of sweet cherry: a gel-based approach (TRAP), reduced representation sequencing (TRAPseq), a 6k cherry SNParray, and whole genome sequencing (WGS). All platforms facilitated detection of polymorphisms among the genotypes with variable efficiency. In assessing multiple SNP detection platforms, this study has demonstrated that a combination of appropriate approaches is necessary for efficient polymorphism identification, especially between closely related cultivars of a species. The information generated in this study provides a valuable resource for future genetic and genomic studies in sweet cherry, and the insights gained from the evaluation of multiple approaches can be utilized for other closely related species with limited genetic diversity in the breeding germplasm. Keywords: Polymorphisms, Prunus avium, Next-generation sequencing, Target region amplification polymorphism (TRAP), Genetic diversity, SNParray, Reduced representation sequencing, Whole genome sequencing (WGS)
Microbial culturomics to isolate halophilic bacteria from table salt: genome sequence and description of the moderately halophilic bacterium Bacillus salis sp. nov.

Directory of Open Access Journals (Sweden)

E.H. Seck

2018-05-01

Bacillus salis strain ES3T (= CSUR P1478 = DSM 100598) is the type strain of B. salis sp. nov. It is an aerobic, Gram-positive, moderately halophilic, motile and spore-forming bacterium. It was isolated from commercial table salt as part of a broad culturomics study aiming to maximize the culture conditions for the in-depth exploration of halophilic bacteria in salty food. Here we describe the phenotypic characteristics of this isolate, its complete genome sequence and annotation, together with a comparison with closely related bacteria. Phylogenetic analysis based on 16S rRNA gene sequences indicated 97.5% similarity with Bacillus aquimaris, the closest species. The 8 329 771 bp long genome (one chromosome, no plasmids) exhibits a G+C content of 39.19%. It is composed of 18 scaffolds with 29 contigs. Of the 8303 predicted genes, 8109 were protein-coding genes and 194 were RNAs. A total of 5778 genes (71.25%) were assigned a putative function. Keywords: Bacillus salis, culturomics, genome, halophilic bacteria, human gut, taxonogenomics
A New Images Hiding Scheme Based on Chaotic Sequences

Institute of Scientific and Technical Information of China (English)

LIU Nian-sheng; GUO Dong-hui; WU Bo-xi; Parr G

2005-01-01

We propose a data hiding technique for still images. The technique is based on chaotic sequences in the transform domain of the cover image. We use different chaotic random sequences, each multiplied by one of multiple sensitive images, to spread the spectrum of the sensitive images. Multiple sensitive images are hidden in a cover image in the form of noise. The results of theoretical analysis and computer simulation show that the new hiding technique has better properties, with high security, imperceptibility and capacity for hidden information, in comparison with conventional schemes such as LSB (Least Significant Bit).
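The spreading step mentioned in this abstract can be sketched as follows, under loudly stated assumptions: the chaotic generator is taken to be the logistic map (the abstract does not name the map), the key is the initial value `x0`, and the transform-domain embedding into the cover image is omitted entirely. All function names and parameter values are hypothetical.

```python
def logistic_sequence(x0, n, r=3.99, burn_in=100):
    """Return n values of the logistic map x -> r*x*(1-x) after a burn-in.

    The map and the parameter r are assumptions chosen for illustration.
    """
    x = x0
    for _ in range(burn_in):
        x = r * x * (1.0 - x)
    values = []
    for _ in range(n):
        x = r * x * (1.0 - x)
        values.append(x)
    return values


def chaotic_signs(x0, n):
    """Binarize the chaotic orbit into a +1/-1 spreading sequence."""
    return [1 if x >= 0.5 else -1 for x in logistic_sequence(x0, n)]


def spread(values, x0):
    """Multiply each value by the key-dependent +1/-1 sequence.

    Because every sign squares to 1, applying spread twice with the same
    key x0 returns the original values.
    """
    signs = chaotic_signs(x0, len(values))
    return [v * s for v, s in zip(values, signs)]
```

In this sketch, a receiver who knows `x0` regenerates the same sign sequence and undoes the spreading by calling `spread` again; without the key the spread values look noise-like.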

ReAS: Recovery of ancestral sequences for transposable elements from the unassembled reads of a whole genome shotgun

DEFF Research Database (Denmark)

Li, Ruiqiang; Ye, Jia; Li, Songgang

2005-01-01

in comparison to their ancestral sequences. Tested on the japonica rice genome, ReAS was able to reconstruct all of the high copy sequences in the Repbase repository of known TEs, and increase the effectiveness of RepeatMasker in identifying TEs from genome sequences. Publication date: 2005-Sep...
Origin of the Y genome in Elymus and its relationship to other genomes in Triticeae based on evidence from elongation factor G (EF-G) gene sequences.

Science.gov (United States)

Sun, Genlou; Komatsuda, Takao

2010-08-01

It is well known that Elymus arose through hybridization between representatives of different genera. Cytogenetic analyses show that all its members include the St genome in combination with one or more of four other genomes, the H, Y, P, and W genomes. The origins of the H, P, and W genomes are known, but that of the Y genome is not. We analyzed the single copy nuclear gene coding for elongation factor G (EF-G) from 28 accessions of polyploid Elymus species and 45 accessions of diploid Triticeae species in order to investigate the origin of the Y genome and its relationship to other genomes in the tribe Triticeae. Sequence comparisons among the St, H, Y, P, W, and E genomes detected genome-specific polymorphisms at 66 nucleotide positions. The St and Y genomes are relatively dissimilar. The phylogeny of the Y genome sequences was investigated for the first time. They were most similar to the W genome sequences. The Y genome sequences were placed in two different groups. These two groups were included in an unresolved clade that included the W and E sequences as well as sequences from many annual species. The H genome sequences were in a clade with the F, P, and Ns genome sequences as sister groups. These two clades were more closely related to each other and to the L and Xp genomes than they were to the St genome sequences. These data support the hypothesis that the Y genome evolved in a diploid species and has a different origin from the St genome. Copyright 2010 Elsevier Inc. All rights reserved.
Inaudible functional MRI using a truly mute gradient echo sequence

International Nuclear Information System (INIS)

Marcar, V.L.; Girard, F.; Rinkel, Y.; Schneider, J.F.; Martin, E.

2002-01-01

We performed functional MRI experiments using a mute version of a gradient echo sequence on adult volunteers using either a simple visual stimulus (flicker goggles: 4 subjects) or an auditory stimulus (music: 4 subjects). Because the mute sequence delivers fewer images per unit time than a fast echo planar imaging (EPI) sequence, we explored our data using a parametric ANOVA test and a non-parametric Wilcoxon-Mann-Whitney test in addition to performing a cross-correlation analysis. All three methods were in close agreement regarding the location of the BOLD contrast signal change. We demonstrated that, using appropriate statistical analysis, functional MRI using an MR sequence that is acoustically inaudible to the subject is feasible. Furthermore, compared with the 'silent' event-related procedures involving an EPI protocol, our mGE protocol compares favourably with respect to experiment time and the BOLD signal. (orig.)
Generation of sequence signatures from DNA amplification fingerprints with mini-hairpin and microsatellite primers.

Science.gov (United States)

Caetano-Anollés, G; Gresshoff, P M

1996-06-01

DNA amplification fingerprinting (DAF) with mini-hairpins harboring arbitrary "core" sequences at their 3' termini was used to fingerprint a variety of templates, including PCR products and whole genomes, to establish genetic relationships between plant taxa at the interspecific and intraspecific levels, and to identify closely related fungal isolates and plant accessions. No correlation was observed between the sequence of the arbitrary core, the stability of the mini-hairpin structure and DAF efficiency. Mini-hairpin primers with short arbitrary cores and primers complementary to simple sequence repeats present in microsatellites were also used to generate arbitrary signatures from amplification profiles (ASAP). The ASAP strategy is a dual-step amplification procedure that uses at least one primer in each fingerprinting stage. ASAP was able to reproducibly amplify DAF products (representing about 10-15 kb of sequence) following careful optimization of amplification parameters such as primer and template concentration. Avoidance of primer sequences partially complementary to DAF product termini was necessary in order to produce distinct fingerprints. This allowed the combinatorial use of oligomers in nucleic acid screening, with numerous ASAP fingerprinting reactions based on a limited number of primer sequences. Mini-hairpin primers and ASAP analysis significantly increased detection of polymorphic DNA, separating closely related bermudagrass (Cynodon) cultivars and detecting putatively linked markers in bulked segregant analysis of the soybean (Glycine max) supernodulation (nitrate-tolerant symbiosis) locus.
Fast comparison of IS radar code sequences for lag profile inversion

Directory of Open Access Journals (Sweden)

M. S. Lehtinen

2008-08-01

A fast method for theoretically comparing the posteriori variances produced by different phase code sequences in incoherent scatter radar (ISR) experiments is introduced. Alternating codes of types 1 and 2 are known to be optimal for selected range resolutions, but the code sets are inconveniently long for many purposes like ground clutter estimation and in cases where coherent echoes from lower ionospheric layers are to be analyzed in addition to standard F-layer spectra.

The method is used in practice for searching binary code quads that have estimation accuracy almost equal to that of much longer alternating code sets. Though the code sequences can consist of as few as four different transmission envelopes, the lag profile estimation variances are near to the theoretical minimum. Thus the short code sequence is as good as a full cycle of alternating codes with the same pulse length and bit length. The short code groups cannot be directly decoded, but the decoding is done in connection with more computationally expensive lag profile inversion in data analysis.

The actual code searches as well as the analysis and real data results from the found short code searches are explained in other papers sent to the same issue of this journal. We also discuss interesting subtle differences found between the different alternating codes by this method. We assume that thermal noise dominates the incoherent scatter signal.
Evaluation of ddRADseq for reduced representation metagenome sequencing

Directory of Open Access Journals (Sweden)

Michael Y. Liu

2017-09-01

Background: Profiling of microbial communities via metagenomic shotgun sequencing has enabled researchers to gain unprecedented insight into microbial community structure and the functional roles of community members. This study describes a method and basic analysis for a metagenomic adaptation of the double digest restriction site associated DNA sequencing (ddRADseq) protocol for reduced representation metagenome profiling. Methods: This technique takes advantage of the sequence specificity of restriction endonucleases to construct an Illumina-compatible sequencing library containing DNA fragments that are between a pair of restriction sites located within close proximity. This results in a reduced sequencing library with coverage breadth that can be tuned by size selection. We assessed the performance of the metagenomic ddRADseq approach by applying the full method to human stool samples and generating sequence data. Results: The ddRADseq data yield a similar estimate of community taxonomic profile as obtained from shotgun metagenome sequencing of the same human stool samples. No obvious bias with respect to genomic G + C content and the estimated relative species abundance was detected. Discussion: Although ddRADseq does introduce some bias in taxonomic representation, the bias is likely to be small relative to DNA extraction bias. ddRADseq appears feasible and could have value as a tool for metagenome-wide association studies.
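The fragment selection described in the Methods can be sketched as an in silico double digest: locate the recognition sites of two enzymes, take fragments bounded by adjacent cuts from different enzymes, and keep those inside a size window. This is a simplified illustration under stated assumptions: the recognition sites passed in are hypothetical examples (the abstract does not name the enzymes used), only one strand is scanned, and each cut is placed at the start of its recognition site rather than at the true cleavage offset.

```python
import re


def cut_positions(seq, site):
    """Start indices of every (possibly overlapping) occurrence of site."""
    pattern = f"(?={re.escape(site)})"
    return [m.start() for m in re.finditer(pattern, seq)]


def double_digest(seq, site_a, site_b, min_len, max_len):
    """Return (start, end) fragments flanked by one cut from each enzyme.

    Only fragments between adjacent cuts are considered, as after a
    complete digest, and only those whose length lies in
    [min_len, max_len] are kept (the size-selection step).
    """
    cuts = sorted(
        [(p, "A") for p in cut_positions(seq, site_a)]
        + [(p, "B") for p in cut_positions(seq, site_b)]
    )
    fragments = []
    for (p1, e1), (p2, e2) in zip(cuts, cuts[1:]):
        if e1 != e2 and min_len <= p2 - p1 <= max_len:
            fragments.append((p1, p2))
    return fragments
```

Narrowing or widening the `min_len` and `max_len` window changes how many fragments are retained, which mirrors how size selection tunes the coverage breadth of the reduced library.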
W-curve alignments for HIV-1 genomic comparisons.

Directory of Open Access Journals (Sweden)

Douglas J Cork

2010-06-01

The W-curve was originally developed as a graphical visualization technique for viewing DNA and RNA sequences. Its ability to render features of DNA also makes it suitable for computational studies. Its main advantage in this area is utilizing a single-pass algorithm for comparing the sequences. Avoiding recursion during sequence alignments offers advantages for speed and in-process resources. The graphical technique also allows for multiple models of comparison to be used depending on the nucleotide patterns embedded in similar whole genomic sequences. The W-curve approach allows us to compare large numbers of samples quickly. We are currently tuning the algorithm to accommodate quirks specific to HIV-1 genomic sequences so that it can be used to aid in diagnostic and vaccine efforts. Tracking the molecular evolution of the virus has been greatly hampered by gap-associated problems predominantly embedded within the envelope gene of the virus. Gaps and hypermutation of the virus slow conventional string-based alignments of the whole genome. This paper describes the W-curve algorithm itself, and how we have adapted it for comparison of similar HIV-1 genomes. A treebuilding method is developed with the W-curve that utilizes a novel Cylindrical Coordinate distance method and gap analysis method. HIV-1 C2-V5 env sequence regions from a Mother/Infant cohort study are used in the comparison. The output distance matrix and neighbor results produced by the W-curve are functionally equivalent to those from Clustal for C2-V5 sequences in the mother/infant pairs infected with CRF01_AE. Significant potential exists for utilizing this method in place of conventional string-based alignment of HIV-1 genomes, such as Clustal X. With W-curve heuristic alignment, it may be possible to obtain clinically useful results in a short time, short enough to affect clinical choices for acute treatment. A description of the W-curve generation process, including a comparison
W-curve alignments for HIV-1 genomic comparisons.

Science.gov (United States)

Cork, Douglas J; Lembark, Steven; Tovanabutra, Sodsai; Robb, Merlin L; Kim, Jerome H

2010-06-01

The W-curve was originally developed as a graphical visualization technique for viewing DNA and RNA sequences. Its ability to render features of DNA also makes it suitable for computational studies. Its main advantage in this area is utilizing a single-pass algorithm for comparing the sequences. Avoiding recursion during sequence alignments offers advantages for speed and in-process resources. The graphical technique also allows for multiple models of comparison to be used depending on the nucleotide patterns embedded in similar whole genomic sequences. The W-curve approach allows us to compare large numbers of samples quickly. We are currently tuning the algorithm to accommodate quirks specific to HIV-1 genomic sequences so that it can be used to aid in diagnostic and vaccine efforts. Tracking the molecular evolution of the virus has been greatly hampered by gap-associated problems predominantly embedded within the envelope gene of the virus. Gaps and hypermutation of the virus slow conventional string-based alignments of the whole genome. This paper describes the W-curve algorithm itself, and how we have adapted it for comparison of similar HIV-1 genomes. A treebuilding method is developed with the W-curve that utilizes a novel Cylindrical Coordinate distance method and gap analysis method. HIV-1 C2-V5 env sequence regions from a Mother/Infant cohort study are used in the comparison. The output distance matrix and neighbor results produced by the W-curve are functionally equivalent to those from Clustal for C2-V5 sequences in the mother/infant pairs infected with CRF01_AE. Significant potential exists for utilizing this method in place of conventional string-based alignment of HIV-1 genomes, such as Clustal X. With W-curve heuristic alignment, it may be possible to obtain clinically useful results in a short time, short enough to affect clinical choices for acute treatment. A description of the W-curve generation process, including a comparison technique of
Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

Science.gov (United States)

van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

2017-10-01

Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially (P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE: Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is
Third-Generation Sequencing and Analysis of Four Complete Pig Liver Esterase Gene Sequences in Clones Identified by Screening BAC Library.

Science.gov (United States)

Zhou, Qiongqiong; Sun, Wenjuan; Liu, Xiyan; Wang, Xiliang; Xiao, Yuncai; Bi, Dingren; Yin, Jingdong; Shi, Deshi

2016-01-01

Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analyse complete gene sequences of the PLE family by screening a Rongchang pig BAC library and applying third-generation PacBio sequencing. After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome was used as a template for polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library, and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis. Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found not only that the PLE exon sequences of the four genes showed a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720-bp reverse-complement sequences that may have an important function in the regulation of PLE gene expression. This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for
ANISOTROPIC WINDS FROM CLOSE-IN EXTRASOLAR PLANETS

International Nuclear Information System (INIS)

Stone, James M.; Proga, Daniel

2009-01-01

We present two-dimensional hydrodynamic models of thermally driven winds from highly irradiated, close-in extrasolar planets. We adopt a very simple treatment of the radiative heating processes at the base of the wind, and instead focus on the differences between the properties of outflows in multidimensions in comparison to spherically symmetric models computed with the same methods. For hot (T ≳ 2 × 10^4 K) or highly ionized gas, we find that strong (supersonic) polar flows are formed above the planet surface which produce weak shocks and outflow on the night side. In comparison to a spherically symmetric wind with the same parameters, the sonic surface on the day side is much closer to the planet surface in multidimensions, and the total mass-loss rate is reduced by almost a factor of 4. We also compute the steady-state structure of interacting planetary and stellar winds. Both winds end in a termination shock, with a parabolic contact discontinuity which is draped over the planet separating the two shocked winds. The planetary wind termination shock and the sonic surface in the wind are well separated, so that the mass-loss rate from the planet is essentially unaffected. However, the confinement of the planetary wind to the small volume bounded by the contact discontinuity greatly enhances the column density close to the planet, which might be important for the interpretation of observations of absorption lines formed by gas surrounding transiting planets.
Draft genome sequence of the Coccolithovirus Emiliania huxleyi virus 203.

Science.gov (United States)

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2011-12-01

The Coccolithoviridae are a recently discovered group of viruses that infect the marine coccolithophorid Emiliania huxleyi. Emiliania huxleyi virus 203 (EhV-203) has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 400 kbp, consisting of 464 coding sequences (CDSs). Here we describe the genomic features of EhV-203 together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.
Value of Fat-Suppressed Proton-Density-Weighted Turbo Spin-Echo Sequences in Detecting Meniscal Lesions: Comparison with Arthroscopy

International Nuclear Information System (INIS)

Schaefer, F.K.W.; Schaefer, P.J.; Brossmann, J.; Frahm, C.; Hilgert, R.E.; Heller, M.; Jahnke, T.

2006-01-01

Purpose: To evaluate fat-suppressed (FS) proton-density-weighted (PDw) turbo spin-echo (TSE) magnetic resonance imaging (MRI) compared to arthroscopy in the detection of meniscal lesions. Material and Methods: In a prospective study, 31 knee joints were imaged on a 1.5T MR scanner before arthroscopy using the following sequences: (a) coronal and sagittal FS-PDw TSE (TR/TE: 4009/15 ms); (b) coronal T1w SE (TR/TE: 722/20 ms), and sagittal PDw TSE (TR/TE: 3800/15 ms). Other imaging parameters were: slice thickness 3 mm, FOV 160 mm, matrix 256x256. A total of 186 meniscal regions (62 menisci; anterior horn, body, posterior horn) were evaluated. Standard of reference was arthroscopy. Sensitivity, specificity, negative predictive value (npv), positive predictive value (ppv), and accuracy were calculated. Results: Arthroscopically, meniscal lesions were detected in 55/186 segments (35 medial and 20 lateral meniscal lesions). Sensitivity, specificity, npv, ppv, and accuracy for combination of coronal and sagittal FS PDw TSE were 91.4%, 98.3%, 95%, 97%, and 93.5% for the medial meniscus, and 90%, 98.6%, 97.3%, 94.7%, and 96.8% for the lateral. The results were comparable to the combination of coronal T1w SE and sagittal PDw TSE for the medial (88.6%, 98.3%, 93.4%, 96.9%, 91.4%) and the lateral (90%, 95.9%, 97.2%, 85.7%, 92.5%) meniscus. Conclusion: FS PDw TSE-MR sequences are an excellent alternative for the detection of meniscal lesions in comparison with diagnostic arthroscopy
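The five diagnostic measures reported in the abstract above all follow from a 2x2 table of MRI calls against the arthroscopic reference standard. The sketch below shows those standard definitions; the `ConfusionCounts` type, the function name, and the example counts are illustrative assumptions and are not the study's data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """MRI calls tallied against the arthroscopic reference standard."""
    tp: int  # lesion on MRI and at arthroscopy
    fp: int  # lesion on MRI, none at arthroscopy
    fn: int  # no lesion on MRI, lesion at arthroscopy
    tn: int  # no lesion on MRI and none at arthroscopy


def diagnostic_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Return the standard measures as fractions between 0 and 1."""
    total = c.tp + c.fp + c.fn + c.tn
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
        "accuracy": (c.tp + c.tn) / total,
    }


# Hypothetical counts chosen for illustration only (not taken from the study).
example = ConfusionCounts(tp=18, fp=2, fn=2, tn=78)
metrics = diagnostic_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

Because predictive values depend on how common lesions are in the sample, ppv and npv computed this way would not transfer directly to a population with a different lesion prevalence.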
Next-Generation Mitogenomics: A Comparison of Approaches Applied to Caecilian Amphibian Phylogeny

OpenAIRE

Maddock, Simon T.; Briscoe, Andrew G.; Wilkinson, Mark; Waeschenbach, Andrea; San Mauro, Diego; Day, Julia J.; Littlewood, D. Tim J.; Foster, Peter G.; Nussbaum, Ronald A.; Gower, David J.

2016-01-01

Mitochondrial genome (mitogenome) sequences are being generated with increasing speed due to the advances of next-generation sequencing (NGS) technology and associated analytical tools. However, detailed comparisons to explore the utility of alternative NGS approaches applied to the same taxa have not been undertaken. We compared a ‘traditional’ Sanger sequencing method with two NGS approaches (shotgun sequencing and non-indexed, multiplex amplicon sequencing) on four different sequencing pla...
Depression, anxiety and quality of life in suicide survivors: a comparison of close and distant relationships.

Science.gov (United States)

Mitchell, Ann M; Sakraida, Teresa J; Kim, Yookyung; Bullian, Leann; Chiappetta, Laurel

2009-02-01

The study's purpose was to describe and compare depression, anxiety, and quality of life, by degree of relationship, between closely related and distantly related survivors (persons close to the suicide victim, or "suicide survivors"; N = 60) during the acute phase of bereavement (within 1 month of the death). The close relationship category included spouses, parents, children, and siblings, whereas the distant relationship category included in-laws, aunts/uncles, and nieces/nephews. Analysis of covariance examined differences between the two groups on the symptom measures. Results indicate that, after controlling for age and gender effects, closely related survivors had significantly higher mean levels of depression and anxiety and had lower levels of mental health quality of life. There were no statistically significant differences on the physical health quality of life subscale.
Harnessing NGS and Big Data Optimally: Comparison of miRNA Prediction from Assembled versus Non-assembled Sequencing Data--The Case of the Grass Aegilops tauschii Complex Genome.

Science.gov (United States)

Budak, Hikmet; Kantar, Melda

2015-07-01

MicroRNAs (miRNAs) are small, endogenous, non-coding RNA molecules that regulate gene expression at the post-transcriptional level. As high-throughput next generation sequencing (NGS) and Big Data rapidly accumulate for various species, efforts for in silico identification of miRNAs intensify. Surprisingly, the effect of the input genomics sequence on the robustness of miRNA prediction was not evaluated in detail to date. In the present study, we performed a homology-based miRNA and isomiRNA prediction of the 5D chromosome of bread wheat progenitor, Aegilops tauschii, using two distinct sequence data sets as input: (1) raw sequence reads obtained from 454-GS FLX Titanium sequencing platform and (2) an assembly constructed from these reads. We also compared this method with a number of available plant sequence datasets. We report here the identification of 62 and 22 miRNAs from raw reads and the assembly, respectively, of which 16 were predicted with high confidence from both datasets. While raw reads promoted sensitivity with the high number of miRNAs predicted, 55% (12 out of 22) of the assembly-based predictions were supported by previous observations, bringing specificity forward compared to the read-based predictions, of which only 37% were supported. Importantly, raw reads could identify several repeat-related miRNAs that could not be detected with the assembly. However, raw reads could not capture 6 miRNAs, for which the stem-loops could only be covered by the relatively longer sequences from the assembly. In summary, the comparison of miRNA datasets obtained by these two strategies revealed that utilization of raw reads, as well as assemblies for in silico prediction, have distinct advantages and disadvantages. Consideration of these important nuances can benefit future miRNA identification efforts in the current age of NGS and Big Data driven life sciences innovation.
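The counts quoted in the abstract above can be cross-checked with simple set arithmetic. The sketch below uses only the figures stated there (62, 22, 16, and 12 of 22); the variable names are illustrative, and treating the 16 shared predictions as the full overlap between the two sets is an assumption.

```python
# Figures quoted in the abstract.
raw_read_predictions = 62   # miRNAs predicted from raw reads
assembly_predictions = 22   # miRNAs predicted from the assembly
shared_predictions = 16     # predicted from both datasets
supported_assembly = 12     # assembly predictions supported by previous observations

# Derived quantities (assuming the 16 shared predictions are the whole overlap).
assembly_only = assembly_predictions - shared_predictions
raw_only = raw_read_predictions - shared_predictions
distinct_total = raw_read_predictions + assembly_predictions - shared_predictions
supported_fraction = supported_assembly / assembly_predictions

print(f"assembly only: {assembly_only}")
print(f"raw reads only: {raw_only}")
print(f"distinct miRNAs overall: {distinct_total}")
print(f"supported assembly predictions: {supported_fraction:.0%}")
```

Under that assumption, the 6 assembly-only predictions match the 6 miRNAs that the abstract says raw reads could not capture.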
Coverage and Rate of Downlink Sequence Transmissions with Reliability Guarantees

DEFF Research Database (Denmark)

Park, Jihong; Popovski, Petar

2017-01-01

Real-time distributed control is a promising application of 5G in which communication links should satisfy certain reliability guarantees. In this letter, we derive closed-form maximum average rate when a device (e.g. industrial machine) downloads a sequence of n operational commands through cell...
Insights from Human/Mouse genome comparisons

Energy Technology Data Exchange (ETDEWEB)

Pennacchio, Len A.

2003-03-30

Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.
The Bergshamra earthquake sequence of December 23, 1979

International Nuclear Information System (INIS)

Kulhanek, O.; John, N.; Meyer, K.; Eck, T. van; Wahlstroem, R.

1980-08-01

On December 23, 1979, an earthquake sequence occurred near Bergshamra-Roslagen, Sweden, about 50 km northeast of Stockholm. The main shock, which has been assigned a magnitude M_L = 3.2, was followed after a 3-minute delay by a shock of magnitude M_L = 2.6 and, after an additional 21-minute delay, by a third shock of magnitude M_L = 2.0. Whereas the main shock was recorded by almost all Finnish, Norwegian and Swedish permanent stations, the whole sequence was observed only at UPP (Δ = 68 km). A six-week field survey in the epicentral area revealed a number of small aftershocks located close to the main shock. The Bergshamra sequence took place in a zone of very low seismicity in eastern central Sweden and at a depth that is unusually shallow for Swedish earthquakes. Since the epicentre lies less than 100 km from a nuclear power plant in Forsmark, the sequence received publicity out of proportion to the size of the shock. On this occasion, some rather strange explanations of the shock emerged. (Auth.)
MRI in neuro-Behcet's syndrome: comparison of conventional spin-echo and FLAIR pulse sequences

International Nuclear Information System (INIS)

Jaeger, H.R.; Albrecht, T.; Curati-Alasonatti, W.L.; Williams, E.J.; Haskard, D.O.

1999-01-01

We compared the sensitivity of a fluid-attenuated inversion-recovery (FLAIR) sequence with that of a conventional dual-echo spin-echo (SE) sequence to brain lesions in 20 patients with Behcet's syndrome. They underwent 25 MRI examinations. The images were independently analysed for the number, type and anatomical location of lesions shown. There were 18 abnormal studies (13 initial and 5 follow-up). The FLAIR sequence detected significantly more lesions than the SE TE 80 (P < 0.05) and SE TE 20 (P < 0.01) sequences. It was particularly useful for demonstrating lesions in the juxtacortical white matter, which accounted for over half the lesions detected on the FLAIR images. Of patients presenting with nonspecific symptoms such as headache, seven had normal and five had abnormal studies. All patients presenting with focal neurological signs had abnormal imaging. We found supratentorial and, in particular, juxtacortical lesions to be more frequent than previously described. (orig.)

Meniscal tear evaluation. Comparison of a conventional spin-echo proton density sequence with a fast spin-echo sequence utilizing a 512x358 matrix size

International Nuclear Information System (INIS)

Hopper, M.A.; Robinson, P.; Grainger, A.J.

2011-01-01

Aim: To determine the sensitivities, specificities, and receiver-operating characteristics (ROCs) for sagittal conventional spin-echo proton density (SE-PD) and fast spin-echo proton density (FSE-PD) sequences in the diagnosis of meniscal tears when compared to arthroscopic findings utilizing increased FSE matrix acquisition size. Method and materials: Magnetic resonance imaging (MRI) studies of 97 knees (194 menisci) were independently and prospectively interpreted by two experienced musculoskeletal radiologists over four separate readings at least 3 weeks apart. Readings 1 and 2 included images in all three planes in accordance with the standard protocol with either a SE or FSE sagittal PD; at readings 3 and 4, only the SE or FSE sagittal PD sequences were reported. The FSE sequence was acquired with an increased matrix size, compared to the SE sequence, to provide increased resolution. Menisci were graded for the presence of a tear, and sensitivity and specificity were calculated using arthroscopy as the reference standard. ROC analysis for the diagnosis of meniscal tears on the SE and FSE sagittal sequences was also evaluated. Reader concordance for the SE and FSE sequences was calculated. Results: Sixty-seven tears were noted at arthroscopy; 60 were detected on SE and 56 on FSE. The sensitivity and specificity were 90% and 90% for SE, and 84% and 94% for FSE, respectively, with no significant difference. ROC analysis showed no significant difference between the two sequences and kappa values demonstrated a higher level of reader agreement for the FSE than for the SE reading. Conclusion: Use of a FSE sagittal PD sequence with an increased matrix size provides comparable performance to conventional SE sagittal PD when evaluating meniscal disease with a modern system. The present study indicates an increased level of concordance between readers for the FSE sagittal sequence compared to the conventional SE.
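The abstract above reports reader concordance as kappa values without naming the variant. Assuming Cohen's kappa for two readers, a minimal sketch of the calculation is given below; the function name and the reader calls are hypothetical and are not data from the study.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(reader_a: Sequence[Hashable], reader_b: Sequence[Hashable]) -> float:
    """Chance-corrected agreement between two readers rating the same items."""
    if len(reader_a) != len(reader_b) or not reader_a:
        raise ValueError("both readers must rate the same non-empty set of items")
    n = len(reader_a)
    # Observed proportion of items on which the readers agree.
    observed = sum(a == b for a, b in zip(reader_a, reader_b)) / n
    # Agreement expected by chance from each reader's marginal frequencies.
    counts_a, counts_b = Counter(reader_a), Counter(reader_b)
    labels = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[k] * counts_b[k] for k in labels) / n**2
    if expected == 1.0:
        return 1.0  # both readers used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical tear (1) / no tear (0) calls for ten menisci.
calls_a = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
calls_b = [1, 1, 0, 0, 0, 0, 0, 0, 1, 1]
kappa = cohens_kappa(calls_a, calls_b)
print(f"kappa = {kappa:.3f}")
```

In this toy example the readers agree on 8 of 10 menisci, but half of that agreement is expected by chance, which is why kappa is lower than the raw agreement rate.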
Meniscal tear evaluation. Comparison of a conventional spin-echo proton density sequence with a fast spin-echo sequence utilizing a 512x358 matrix size

Energy Technology Data Exchange (ETDEWEB)

Hopper, M.A.; Robinson, P. [Leeds Teaching Hospitals NHS Trust, Leeds (United Kingdom); Grainger, A.J., E-mail: andrew.grainger@leedsth.nhs.u [Leeds Teaching Hospitals NHS Trust, Leeds (United Kingdom)

2011-04-15

Aim: To determine the sensitivities, specificities, and receiver-operating characteristics (ROCs) for sagittal conventional spin-echo proton density (SE-PD) and fast spin-echo proton density (FSE-PD) sequences in the diagnosis of meniscal tears when compared to arthroscopic findings utilizing increased FSE matrix acquisition size. Method and materials: Magnetic resonance imaging (MRI) studies of 97 knees (194 menisci) were independently and prospectively interpreted by two experienced musculoskeletal radiologists over four separate readings at least 3 weeks apart. Readings 1 and 2 included images in all three planes in accordance with the standard protocol with either a SE or FSE sagittal PD; at readings 3 and 4, only the SE or FSE sagittal PD sequences were reported. The FSE sequence was acquired with an increased matrix size, compared to the SE sequence, to provide increased resolution. Menisci were graded for the presence of a tear, and sensitivity and specificity were calculated using arthroscopy as the reference standard. ROC analysis for the diagnosis of meniscal tears on the SE and FSE sagittal sequences was also evaluated. Reader concordance for the SE and FSE sequences was calculated. Results: Sixty-seven tears were noted at arthroscopy; 60 were detected on SE and 56 on FSE. The sensitivity and specificity were 90% and 90% for SE, and 84% and 94% for FSE, respectively, with no significant difference. ROC analysis showed no significant difference between the two sequences and kappa values demonstrated a higher level of reader agreement for the FSE than for the SE reading. Conclusion: Use of a FSE sagittal PD sequence with an increased matrix size provides comparable performance to conventional SE sagittal PD when evaluating meniscal disease with a modern system. The present study indicates an increased level of concordance between readers for the FSE sagittal sequence compared to the conventional SE.
Searching sequences of resonant orbits between a spacecraft and Jupiter

International Nuclear Information System (INIS)

Formiga, J K S; Prado, A F B A

2013-01-01

This research shows a study of the dynamical behavior of a spacecraft that performs a series of close approaches with the planet Jupiter. The main idea is to find a sequence of resonant orbits that allows the spacecraft to stay in the region of the space near the orbit of Jupiter around the Sun gaining energy from each passage by the planet. The dynamical model considers the existence of only two massive bodies in the systems, which are the Sun and Jupiter. They are assumed to be in circular orbits around their center of mass. Analytical equations are used to obtain the values of the parameters required to get this sequence of close approaches. Those equations are useful, because they show which orbits are physically possible when taking into account that the periapsis distances have to be above the surface of the Sun and that the closest approach distances during the passage by Jupiter have to be above its surface
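The abstract above does not give the authors' analytical equations. As a rough illustration of the underlying idea only, the sketch below uses Kepler's third law in a two-body (Sun plus spacecraft) approximation to find the semi-major axis of an orbit in a given resonance with Jupiter, then checks that the periapsis lies above the solar surface and that the orbit can reach Jupiter's circular orbit. The function names and the rounded constants are assumptions, and the second constraint from the abstract (closest approach above Jupiter's surface) is not modelled.

```python
A_JUPITER_AU = 5.2    # approximate radius of Jupiter's orbit, assumed circular
R_SUN_AU = 0.00465    # approximate solar radius expressed in AU


def resonant_semi_major_axis(spacecraft_revs: int, jupiter_revs: int,
                             a_jupiter_au: float = A_JUPITER_AU) -> float:
    """Semi-major axis (AU) of an orbit that completes `spacecraft_revs`
    revolutions while Jupiter completes `jupiter_revs`."""
    period_ratio = jupiter_revs / spacecraft_revs  # T_spacecraft / T_Jupiter
    return a_jupiter_au * period_ratio ** (2.0 / 3.0)  # Kepler: T^2 ~ a^3


def can_encounter_jupiter(a_au: float, periapsis_au: float,
                          a_jupiter_au: float = A_JUPITER_AU) -> bool:
    """True if the periapsis is above the Sun's surface and the orbit spans
    Jupiter's orbital radius, so that a close approach is geometrically possible."""
    apoapsis_au = 2.0 * a_au - periapsis_au
    return R_SUN_AU < periapsis_au <= a_jupiter_au <= apoapsis_au


# Example: the spacecraft completes two revolutions per Jupiter revolution.
a_2_to_1 = resonant_semi_major_axis(spacecraft_revs=2, jupiter_revs=1)
print(f"semi-major axis: {a_2_to_1:.3f} AU")
print("periapsis 1.0 AU feasible:", can_encounter_jupiter(a_2_to_1, 1.0))
print("periapsis 2.0 AU feasible:", can_encounter_jupiter(a_2_to_1, 2.0))
```

For an orbit smaller than Jupiter's, the apoapsis condition is the binding one: a larger periapsis at fixed semi-major axis lowers the apoapsis until the orbit no longer reaches Jupiter.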
Complete genome sequence of switchgrass mosaic virus, a member of a proposed new species in the genus Marafivirus.

Science.gov (United States)

Agindotan, Bright O; Gray, Michael E; Hammond, Rosemarie W; Bradley, Carl A

2012-09-01

The complete genome sequence of a virus recently detected in switchgrass (Panicum virgatum) was determined and found to be closely related to that of maize rayado fino virus (MRFV), genus Marafivirus, family Tymoviridae. The genomic RNA is 6408 nucleotides long. It contains three predicted open reading frames (ORFs 1-3), encoding proteins of 227 kDa, 43.9 kDa, and 31.5 kDa, compared to two ORFs (1 and 2) for MRFV. The complete genome shares 76 % sequence identity with MRFV. The nucleotide sequence of ORF2 of this virus and the amino acid sequence of its encoded protein are 49 % and 77 % identical, respectively, to those of MRFV. The virus-encoded polyprotein and capsid protein aa sequences are 83 % and 74-80 % identical, respectively, to those of MRFV. Although closely related to MRFV, the amino acid sequence of its capsid protein (CP) forms a clade that is separate from that of MRFV. Based on the International Committee on Taxonomy of Viruses (ICTV) sequence-related criteria for delineation of species within the genus Marafivirus, the virus qualifies as a member of a new species, and the name Switchgrass mosaic virus (SwMV) is proposed.
In Silico Characterization of Pectate Lyase Protein Sequences from Different Source Organisms

Directory of Open Access Journals (Sweden)

Amit Kumar Dubey

2010-01-01

A total of 121 protein sequences of pectate lyases were subjected to homology search, multiple sequence alignment, phylogenetic tree construction, and motif analysis. The phylogenetic tree constructed revealed different clusters based on different source organisms representing bacterial, fungal, plant, and nematode pectate lyases. The multiple accessions of bacterial, fungal, nematode, and plant pectate lyase protein sequences were placed closely revealing a sequence level similarity. The multiple sequence alignment of these pectate lyase protein sequences from different source organisms showed conserved regions at different stretches with maximum homology from amino acid residues 439–467, 715–816, and 829–910 which could be used for designing degenerate primers or probes specific for pectate lyases. The motif analysis revealed a conserved Pec_Lyase_C domain uniformly observed in all pectate lyases irrespective of variable sources suggesting its possible role in structural and enzymatic functions.
SWPhylo – A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees

Science.gov (United States)

Yu, Xiaoyu; Reva, Oleg N

2018-01-01

Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment, not to mention the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, e.g., GyrA. PMID: 29511354
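To make the idea of alignment-free comparison by tetranucleotide frequencies concrete, the sketch below counts overlapping 4-mers in two sequences and compares the resulting frequency vectors. It is not the SWPhylo implementation: the function names are hypothetical, and the plain Euclidean distance is a stand-in for whatever OUP distance the tool actually uses.

```python
import math
from collections import Counter
from itertools import product

ALPHABET = "ACGT"
# All 256 possible tetranucleotides in a fixed order.
TETRAMERS = ["".join(p) for p in product(ALPHABET, repeat=4)]


def tetranucleotide_frequencies(sequence: str) -> list[float]:
    """Relative frequency of each tetranucleotide over overlapping windows.
    Windows containing characters other than A, C, G, T are ignored."""
    seq = sequence.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    valid = [counts.get(t, 0) for t in TETRAMERS]
    total = sum(valid)
    if total == 0:
        raise ValueError("sequence contains no unambiguous tetranucleotides")
    return [c / total for c in valid]


def euclidean_distance(p: list[float], q: list[float]) -> float:
    """Euclidean distance between two frequency vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# Toy sequences for illustration; real inputs would be whole genomes.
freq_a = tetranucleotide_frequencies("ACGTACGT")
freq_b = tetranucleotide_frequencies("AAAAAAAA")
print(f"distance = {euclidean_distance(freq_a, freq_b):.3f}")
```

A pairwise distance matrix built this way could be passed to any distance-based tree method; no gene annotation or alignment step is needed, which is the property the abstract emphasises.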
Molecular characterization of Fasciola spp. from the endemic area of northern Iran based on nuclear ribosomal DNA sequences.

Science.gov (United States)

Amor, Nabil; Halajian, Ali; Farjallah, Sarra; Merella, Paolo; Said, Khaled; Ben Slimane, Badreddine

2011-07-01

Fasciolosis caused by Fasciola spp. (Platyhelminthes: Trematoda: Digenea) is considered as the most important helminth infection of ruminants in tropical countries, causing considerable socioeconomic problems. In the endemic regions of the North of Iran, Fasciola hepatica and Fasciola gigantica have been previously characterized on the basis of morphometric differences, but the use of molecular markers is necessary to distinguish exactly between species and intermediate forms. Samples from buffaloes and goats from different localities of northern Iran were identified morphologically and then genetically characterized by sequences of the first (ITS-1) and second (ITS-2) Internal Transcribed Spacers (ITS) of nuclear ribosomal DNA (rDNA). Comparison of the ITS of the northern Iranian samples with sequences of Fasciola spp. from GenBank showed that the examined specimens had sequences identical to those of the most frequent haplotypes of F. hepatica (n=25, 48.1%) and F. gigantica (n=20, 38.45%), which differed from each other in different variable nucleotide positions of ITS region sequences, and their intermediate forms (n=7, 13.45%), which had nucleotides overlapped between the two Fasciola species in all the positions. The ITS sequences from populations of Fasciola isolates in buffaloes and goats had experienced introgression/hybridization as previously reported in isolates from other ruminants and humans. Based on ITS-1 and ITS-2 sequences, flukes are scattered in pure F. hepatica, F. gigantica and intermediate Fasciola clades, revealing that multiple genotypes of Fasciola are able to infect goats and buffaloes in North of Iran. Furthermore, the phylogenetic trees based upon the ITS-1 and ITS-2 sequences showed a close relationship of the Iranian samples with isolates of F. hepatica and F. gigantica from different localities of Africa and Asia. In the present study, the intergenic transcribed spacers ITS-1 and ITS-2 showed to be reliable approaches for the genetic
Multilocus sequence typing of Xylella fastidiosa isolated from olive affected by “olive quick decline syndrome” in Italy

Directory of Open Access Journals (Sweden)

Toufic ELBEAINO

2015-01-01

The recent finding of Xylella fastidiosa (Xf) in olive trees in southern Italy, the scanty molecular information on this bacterium, and its association with the olive quick decline syndrome (OQDS) prompted the necessity to isolate and acquire more genetic data on the type of strain present in that region. For the first time, the bacterium was isolated from infected olive on culture media. Genetic information was obtained through genomic comparison with other subspecies or strains. The sequences of thirteen genes from its genome, comprising seven housekeeping genes (leuA, petC, lacF, cysG, holC, nuoL and gltT) usually used in multilocus sequence typing (MLST) systems, and six genes involved in different biochemical functions (RNA Pol sigma-70 factor, hypothetical protein HL, 16S rRNA, rfbD, nuoN, and pilU), were analyzed. The sequences of the biochemical function genes were explored individually to study the genetic structure of this bacterium, while the MLST genes were linked together into one concatemeric sequence (4161 bp long) to increase the resolution of the phylogenetic analysis when compared with Xf strains previously reported. Sequence analyses of single genes showed that the Xf olive strain is distinct from the four previously defined taxa (Xf subsp. fastidiosa, Xf subsp. multiplex, Xf subsp. sandyi and Xf subsp. pauca), with a dissimilarity rate that reached 4%. In particular, Xf from olive shared the greatest identity with the strain “9a5c” (subsp. pauca), but was nevertheless distinct from it. Similarly, the MLST based on concatemeric sequences confirmed the genetic variance of Xf from olive by generating a novel sequence type profile (ST53). Phylogenetic tree analyses showed that Xf from olive clustered in one clade close to subspecies pauca (strains “9a5c” and “CVC0018”), but was nevertheless distinct from them. These results indicate molecular divergence of this olive bacterium from all other strains yet reported.
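The concatenation step described in the abstract above can be sketched as follows: the seven MLST loci are joined in a fixed order and two strains are compared by the proportion of differing sites. The locus names are those listed in the abstract; the function names and the toy allele sequences are hypothetical, and the sketch assumes the per-locus sequences are already aligned to equal length.

```python
# Locus order as listed in the abstract.
MLST_LOCI = ("leuA", "petC", "lacF", "cysG", "holC", "nuoL", "gltT")


def concatenate_loci(alleles: dict[str, str]) -> str:
    """Join per-locus sequences in the fixed MLST order."""
    missing = [locus for locus in MLST_LOCI if locus not in alleles]
    if missing:
        raise KeyError(f"missing loci: {', '.join(missing)}")
    return "".join(alleles[locus] for locus in MLST_LOCI)


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of sites that differ between two aligned sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned, equal-length and non-empty")
    return sum(x != y for x, y in zip(seq_a, seq_b)) / len(seq_a)


# Toy 4-bp alleles for illustration only; real loci are hundreds of bp long.
strain_a = {locus: "ACGT" for locus in MLST_LOCI}
strain_b = dict(strain_a, leuA="ACGA")  # one substitution in leuA

concat_a = concatenate_loci(strain_a)
concat_b = concatenate_loci(strain_b)
dissimilarity = p_distance(concat_a, concat_b)
print(f"concatenated length: {len(concat_a)} bp, dissimilarity: {dissimilarity:.2%}")
```

Keeping the locus order fixed matters: concatenated sequences are only comparable across strains if every strain contributes the same loci in the same positions.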
16S rRNA gene sequencing in routine identification of anaerobic bacteria isolated from blood cultures

DEFF Research Database (Denmark)

Justesen, Ulrik Stenz; Skov, Marianne Nielsine; Knudsen, Elisa

2010-01-01

A comparison between conventional identification and 16S rRNA gene sequencing of anaerobic bacteria isolated from blood cultures in a routine setting was performed (n = 127). With sequencing, 89% were identified to the species level, versus 52% with conventional identification. The times...
Comparison of MRI pulse sequences in defining prostate volume after permanent implantation

International Nuclear Information System (INIS)

McLaughlin, P.W.; Narayana, V.; Drake, D.G.; Miller, B.M.; Marsh, L.; Chan, J.; Gonda, R.; Winfield, R.J.; Roberson, P.L.

2002-01-01

Purpose: To determine the relative value of three MRI pulse sequences in defining the prostate volume after permanent implantation. Methods and Materials: A total of 45 patients who received a permanent 125I implant were studied. Two weeks after implantation, an axial CT scan (2 mm thickness) and T1-weighted, T1-weighted fat saturation, and T2-weighted axial MRI (3-mm) studies were obtained. The prostate volumes were compared with the initial ultrasound planning volumes, and subsequently the CT, T1-weighted, and T1-weighted fat saturation MRI volumes were compared with the T2-weighted volumes. Discrepancies in volume were evaluated by visual inspection of the registered axial images and the registration of axial volumes on the sagittal T2-weighted volumes. In a limited set of patients, pre- and postimplant CT and T2-weighted MRI studies were available for comparison to determine whether prostate volume changes after implant were dependent on the imaging modality. Results: T1-weighted and T1-weighted fat saturation MRI and CT prostate volumes were consistently larger than the T2-weighted MRI prostate volumes, with a volume on average 1.33 (SD 0.24) times the T2-weighted volume. This discrepancy was due to the superiority of T2-weighted MRI for prostate definition at the following critical interfaces: membranous urethra, apex, and anterior base-bladder and posterior base-seminal vesicle interfaces. The differences in prostate definition in the anterior base region suggest that the commonly reported underdose may be due to overestimation of the prostate in this region by CT. The consistent difference in volumes suggests that the degree of swelling observed after implantation is in part a function of the imaging modality. In patients with pre- and postimplant CT and T2-weighted MRI images, swelling on the T2-weighted images was 1.1 times baseline and on CT was 1.3 times baseline, confirming the imaging modality dependence of prostate
MELCOR 1.8.2 calculations of selected sequences for the ABWR

International Nuclear Information System (INIS)

Kmetyk, L.N.

1994-07-01

This report summarizes the results from MELCOR calculations of severe accident sequences in the ABWR and presents comparisons with MAAP calculations for the same sequences. MELCOR was run for two low-pressure and three high-pressure sequences to identify the materials which enter containment and are available for release to the environment (source terms), to study the potential effects of core-concrete interaction, and to obtain event timings during each sequence; the source terms include fission products and other materials such as those generated by core-concrete interactions. Sensitivity studies were done on the impact of assuming limestone rather than basaltic concrete and on the effect of quenching core debris in the cavity compared to having hot, unquenched debris present
First occurrence of close-to-ideal Kirkiite at Vulcano (Aeolian Islands, Italy)

DEFF Research Database (Denmark)

Pinto, Daniela; Balic-Zunic, Tonci; Garavelli, Anna

2006-01-01

Samples of kirkiite from the high temperature fumaroles of La Fossa crater of Vulcano (Aeolian islands, Italy) were chemically and structurally investigated in this work. Associated minerals are vurroite, bismuthinite, galenobismutite, cannizzarite, lillianite, heyrovskýite, galena, and other less...... of the close-to-ideal kirkiite from Vulcano has been compared with the structure of the type specimen. The comparison reveals a variation in As-Bi substitution, with samples from Vulcano probably being close to the maximum possible Bi and the minimum As content for this structure type. This is reflected...
Progenitor-derivative relationships of Hordeum polyploids (Poaceae, Triticeae) inferred from sequences of TOPO6, a nuclear low-copy gene region.

Directory of Open Access Journals (Sweden)

Jonathan Brassac

Full Text Available Polyploidization is a major mechanism of speciation in plants. Within the barley genus Hordeum, approximately half of the taxa are polyploids. While for diploid species a good hypothesis of phylogenetic relationships exists, there is little information available for the polyploids (4×, 6×) of Hordeum. Relationships among all 33 diploid and polyploid Hordeum species were analyzed with the low-copy nuclear marker region TOPO6 for 341 Hordeum individuals and eight outgroup species. PCR products were either directly sequenced or cloned and on average 12 clones per individual were included in phylogenetic analyses. In most diploid Hordeum species TOPO6 is probably a single-copy locus. Most sequences found in polyploid individuals phylogenetically cluster together with sequences derived from diploid species and thus allow the identification of parental taxa of polyploids. Four groups of sequences occurring only in polyploid taxa are interpreted as footprints of extinct diploid taxa, which contributed to allopolyploid evolution. Our analysis identifies three key species involved in the evolution of the American polyploids of the genus. (i) All but one of the American tetraploids have a TOPO6 copy originating from the Central Asian diploid H. roshevitzii, the second copy clustering with different American diploid species. (ii) All hexaploid species from the New World have a copy of an extinct close relative of H. californicum and (iii) possess the TOPO6 sequence pattern of tetraploid H. jubatum, each with an additional copy derived from different American diploids. Tetraploid H. bulbosum is an autopolyploid, while the assumed autopolyploid H. brevisubulatum (4×, 6×) was identified as allopolyploid throughout most of its distribution area. The use of a proof-reading DNA polymerase in PCR reduced the proportion of chimerical sequences in polyploids in comparison to Taq polymerase.
Near-complete genome sequencing of swine vesicular disease virus using the Roche GS FLX sequencing platform

DEFF Research Database (Denmark)

Nielsen, Sandra Cathrine Abel; Bruhn, Christian Anders Wathne; Samaniego Castruita, Jose Alfredo

2014-01-01

Swine vesicular disease virus (SVDV) is an enterovirus that is both genetically and antigenically closely related to human coxsackievirus B5 within the Picornaviridae family. SVDV is the causative agent of a highly contagious (though rarely fatal) vesicular disease in pigs. We report a rapid method...... with significant genetic distances within the same species of viruses. All reference mappings used an iterative method to avoid bias. Further verification was achieved through phylogenetic analysis against published SVDV genomes and additional Enterovirus B sequences. This approach allows high confidence...
Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

International Nuclear Information System (INIS)

Feild, M.J.; Armstrong, F.B.

1987-01-01

E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilvGEDAY gene cluster of S. typhimurium. TmB, gene product of ilvE, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and [3H]-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealed limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region
Comparing whole-genome sequencing with Sanger sequencing for spa typing of methicillin-resistant Staphylococcus aureus.

Science.gov (United States)

Bartels, Mette Damkjær; Petersen, Andreas; Worning, Peder; Nielsen, Jesper Boye; Larner-Svensson, Hanna; Johansen, Helle Krogh; Andersen, Leif Percival; Jarløv, Jens Otto; Boye, Kit; Larsen, Anders Rhod; Westh, Henrik

2014-12-01

spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and an in-house analysis pipeline determines the spa types. Due to national surveillance, all MRSA isolates are sent to Statens Serum Institut, where the spa type is determined by PCR and Sanger sequencing. The purpose of this study was to evaluate the reliability of the spa types obtained by 150-bp paired-end Illumina WGS. MRSA isolates from new MRSA patients in 2013 (n = 699) in the capital region of Denmark were included. We found a 97% agreement between spa types obtained by the two methods. All isolates achieved a spa type by both methods. Nineteen isolates differed in spa types by the two methods, in most cases due to the lack of 24-bp repeats in the whole-genome-sequenced isolates. These related but incorrect spa types should have no consequence in outbreak investigations, since all epidemiologically linked isolates, regardless of spa type, will be included in the single nucleotide polymorphism (SNP) analysis. This will reveal the close relatedness of the spa types. In conclusion, our data show that WGS is a reliable method to determine the spa type of MRSA. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

Science.gov (United States)

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
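Two of the gap properties listed in this abstract, GC content and gap length, are simple to compute. The following is a minimal sketch with invented gap sequences; the helper name and the data are hypothetical and do not reproduce the authors' pipeline.

```python
def gc_content(seq):
    """Fraction of G or C among unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence has no unambiguous bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Toy gap sequences keyed by an invented gap identifier.
gaps = {
    "gap_1": "GGCCAATT",
    "gap_2": "GCNNAT",  # N marks an ambiguous base and is ignored
}

summary = {name: (len(seq), gc_content(seq)) for name, seq in gaps.items()}
for name, (length, gc) in summary.items():
    print(f"{name}: length={length} bp, GC={gc:.0%}")
```

Read coverage and secondary-structure propensity, the other properties mentioned, need alignment files and folding software and are outside the scope of this sketch.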
[Identification and analysis of Corydalis boweri, Meconopsis horridula and their close related species of the same genus by using ITS2 DNA barcode].

Science.gov (United States)

Dou, Rong-kun; Bi, Zhen-fei; Bai, Rui-xue; Ren, Yao-yao; Tan, Rui; Song, Liang-ke; Li, Di-qiang; Mao, Can-quan

2015-04-01

This study aimed to ensure the quality and safety of medicinal plants by using ITS2 DNA barcode technology to identify Corydalis boweri, Meconopsis horridula and their closely related species. DNA was extracted from 13 herb samples, including C. boweri and M. horridula, from Lhasa, Tibet, and the ITS region was amplified by PCR and sequenced. The 5.8S and 28S regions were removed from 71 ITS2 sequences, both assembled and downloaded from the web. Multiple sequence alignment was completed and the intraspecific and interspecific genetic distances were calculated with MEGA 5.0, and neighbor-joining phylogenetic trees were constructed. We also predicted the ITS2 secondary structure of C. boweri, M. horridula and their closely related species. The results showed that ITS2 as a DNA barcode was able to identify C. boweri, M. horridula and their closely related species effectively. The established ITS2 barcode-based method provides a regular and safe detection technology for identification of C. boweri, M. horridula and their closely related species, adulterants and counterfeits, in order to ensure their quality control, safe medication, reasonable development and utilization.
Spin tune dependence on closed orbit in RHIC

International Nuclear Information System (INIS)

Ptitsyn, V.; Bai, M.; Roser, T.

2010-01-01

Polarized proton beams are accelerated in RHIC to 250 GeV energy with the help of Siberian Snakes. The pair of Siberian Snakes in each RHIC ring holds the design spin tune at 1/2 to avoid polarization loss during acceleration. However, in the presence of closed orbit errors, the actual spin tune can be shifted away from the exact 1/2 value. This leads to a corresponding shift of the locations of higher-order ('snake') resonances and limits the available betatron tune space. The largest closed orbit effect on the spin tune comes from the horizontal orbit angle between the two snakes. During the RHIC run in 2009, dedicated measurements with polarized proton beams were taken to verify the dependence of the spin tune on the local orbits at the Snakes. The experimental results are presented along with a comparison with analytical predictions.
Draft genome sequence of the coccolithovirus Emiliania huxleyi virus 202.

Science.gov (United States)

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2012-02-01

Emiliania huxleyi virus 202 (EhV-202) is a member of the Coccolithoviridae, a group of viruses that infect the marine coccolithophorid Emiliania huxleyi. EhV-202 has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 407 kbp, consisting of 485 coding sequences (CDSs). Here we describe the genomic features of EhV-202, together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.

Genomic sequence and organization of two members of a human lectin gene family

International Nuclear Information System (INIS)

Gitt, M.A.; Barondes, S.H.

1991-01-01

The authors have isolated and sequenced the genomic DNA encoding a human dimeric soluble lactose-binding lectin. The gene has four exons, and its upstream region contains sequences that suggest control by glucocorticoids, heat (environmental) shock, metals, and other factors. They have also isolated and sequenced three exons of the gene encoding another human putative lectin, the existence of which was first indicated by isolation of its cDNA. Comparisons suggest a general pattern of genomic organization of members of this lectin gene family
Comparison of spin echo T1-weighted sequences versus fast spin-echo proton density-weighted sequences for evaluation of meniscal tears at 1.5 T

International Nuclear Information System (INIS)

Wolff, Andrew B.; Pesce, Lorenzo L.; Wu, Jim S.; Smart, L.R.; Medvecky, Michael J.; Haims, Andrew H.

2009-01-01

At our institution, fast spin-echo (FSE) proton density (PD) imaging is used to evaluate articular cartilage, while conventional spin-echo (CSE) T1-weighted sequences have been traditionally used to characterize meniscal pathology. We sought to determine if FSE PD-weighted sequences are equivalent to CSE T1-weighted sequences in the detection of meniscal tears, obviating the need to perform both sequences. We retrospectively reviewed the records of knee arthroscopies performed by two arthroscopy-focused surgeons from an academic medical center over a 2-year period. The preoperative MRI images were interpreted independently by two fellowship-trained musculoskeletal radiologists who graded the sagittal CSE T1 and FSE PD sequences at different sittings with grades 1-5, where 1 = normal meniscus, 2 = probable normal meniscus, 3 = indeterminate, 4 = probable torn meniscus, and 5 = torn meniscus. Each meniscus was divided into an anterior and posterior half, and these halves were graded separately. Operative findings provided the gold standard. Receiver operating characteristic (ROC) analysis was performed to compare the two sequences. There were 131 tears in 504 meniscal halves. Using ROC analysis, the reader 1 area under curve for FSE PD was significantly better than CSE T1 (0.939 vs. 0.902, >95% confidence). For reader 2, the difference met good criteria for statistical non-inferiority but not superiority (0.913 for FSE PD and 0.908 for CSE T1; >95% non-inferiority for difference at most of -0.027). FSE PD-weighted sequences, using our institutional protocol, are not inferior to CSE T1-weighted sequences for the detection of meniscal tears and may be superior. (orig.)
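For ordinal grades such as the 1-5 scale used here, the empirical area under the ROC curve equals the probability that a randomly chosen torn meniscal half receives a higher grade than a randomly chosen intact one, with ties counted as one half. A minimal sketch follows; the grades are invented for illustration and the function name is hypothetical. The abstract does not state which ROC software or fitting method the authors used.

```python
def roc_auc(scores_positive, scores_negative):
    """Empirical AUC via pairwise comparison (Mann-Whitney formulation)."""
    if not scores_positive or not scores_negative:
        raise ValueError("both groups need at least one score")
    wins = 0
    ties = 0
    for p in scores_positive:
        for n in scores_negative:
            if p > n:
                wins += 1
            elif p == n:
                ties += 1
    return (wins + 0.5 * ties) / (len(scores_positive) * len(scores_negative))


# Invented reader grades (1 = normal ... 5 = torn), split by operative finding.
grades_torn = [5, 4, 4, 3]
grades_intact = [1, 2, 3, 1]

auc = roc_auc(grades_torn, grades_intact)
print(f"AUC = {auc:.5f}")
```

With these toy grades, 15 of the 16 torn/intact pairs are ranked correctly and one is tied. Comparing two AUCs measured on the same cases, as in the study, additionally needs a paired statistical test, which is not shown.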
[Phylogenetic relationships among the genera of Taxodiaceae and Cupressaceae from 28S rDNA sequences].

Science.gov (United States)

Li, Chun-Xiang; Yang, Qun

2003-03-01

DNA sequences from 28S rDNA were used to assess relationships between and within traditional Taxodiaceae and Cupressaceae s.s. The MP tree and NJ tree generally are similar to one another. The results show that Taxodiaceae and Cupressaceae s.s. form a monophyletic conifer lineage excluding Sciadopitys. In the Taxodiaceae-Cupressaceae s.s. monophyletic group, the Taxodiaceae is paraphyletic. Taxodium, Glyptostrobus and Cryptomeria form a clade (Taxodioideae), in which Glyptostrobus and Taxodium are closely related and sister to Cryptomeria; Sequoia, Sequoiadendron and Metasequoia are closely related to each other, forming another clade (Sequoioideae), in which Sequoia and Sequoiadendron are closely related and sister to Metasequoia; the seven genera of Cupressaceae s.s. are found to be closely related and form a monophyletic lineage (Cupressoideae). These results are basically similar to analyses from chloroplast gene data. But the relationships among Taiwania, Sequoioideae, Taxodioideae, and Cupressoideae remain unclear because of the slow evolution rate of 28S rDNA, a question which might best be answered by sequencing more rapidly evolving nuclear genes.
On the Roche constants for main-sequence binaries

International Nuclear Information System (INIS)

Giannuzzi, M.A.

1979-01-01

The ratios C1/C2 of the constants defining the equipotential surfaces which describe the external forms of the components of a close binary system have been calculated on the basis of evolutionary models. Theoretical systems have been considered allowing for a wide range of input parameters (masses and separation) and taking into account the evolutionary effects on the radii of the stars during their Main-Sequence lifetime. The systems have not undergone any transfer of matter and are representative of detached binaries with Main-Sequence components. The ratios of the constants are confined to limited intervals and, for the highest values of the mass-ratios, they are clustered around unity. (Auth.)
Complete genome sequence of the European sheatfish virus.

Science.gov (United States)

Mavian, Carla; López-Bueno, Alberto; Fernández Somalo, María Pilar; Alcamí, Antonio; Alejo, Alí

2012-06-01

Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome sequence shows that ESV belongs to the amphibian-like ranaviruses and is closely related to the epizootic hematopoietic necrosis virus (EHNV), a disease agent geographically confined to the Australian continent and notifiable to the World Organization for Animal Health.
Inaudible functional MRI using a truly mute gradient echo sequence

Energy Technology Data Exchange (ETDEWEB)

Marcar, V.L. [University of Zurich, Department of Psychology, Neuropsychology, Treichlerstrasse 10, 8032 Zurich (Switzerland); Girard, F. [GE Medical Systems SA, 283, rue de la Miniere B.P. 34, 78533 Buc Cedex (France); Rinkel, Y.; Schneider, J.F.; Martin, E. [University Children's Hospital, Neuroradiology and Magnetic Resonance, Department of Diagnostic Imaging, Steinwiesstrasse 75, 8032 Zurich (Switzerland)

2002-11-01

We performed functional MRI experiments using a mute version of a gradient echo sequence on adult volunteers using either a simple visual stimulus (flicker goggles: 4 subjects) or an auditory stimulus (music: 4 subjects). Because the mute sequence delivers fewer images per unit time than a fast echo planar imaging (EPI) sequence, we explored our data using a parametric ANOVA test and a non-parametric Wilcoxon-Mann-Whitney test in addition to performing a cross-correlation analysis. All three methods were in close agreement regarding the location of the BOLD contrast signal change. We demonstrated that, using appropriate statistical analysis, functional MRI using an MR sequence that is acoustically inaudible to the subject is feasible. Furthermore, compared with the "silent" event-related procedures involving an EPI protocol, our mGE protocol compares favourably with respect to experiment time and the BOLD signal. (orig.)
Improved taxonomic assignment of human intestinal 16S rRNA sequences by a dedicated reference database

NARCIS (Netherlands)

Ritari, Jarmo; Salojärvi, Jarkko; Lahti, Leo; Vos, de Willem M.

2015-01-01

Background: Current sequencing technology enables taxonomic profiling of microbial ecosystems at high resolution and depth by using the 16S rRNA gene as a phylogenetic marker. Taxonomic assignation of newly acquired data is based on sequence comparisons with comprehensive reference databases to
Comparison of MRI pulse sequences for investigation of lesions of the cervical spinal cord

International Nuclear Information System (INIS)

Campi, A.; Pontesilli, S.; Gerevini, S.; Scotti, G.

2000-01-01

Small spinal cord lesions, even if clinically significant, can be overlooked on MRI due to the low sensitivity of some pulse sequences. We compared T2-weighted fast (FSE) and conventional (CSE) spin-echo and short-tau inversion-recovery (STIR)-FSE sequences to evaluate their sensitivity to and specificity for lesions of different types. We compared the three sequences in MRI of 57 patients with cervical spinal symptoms. The image sets were assessed by two of us individually for final diagnosis, lesion detectability and image quality. Both readers arrived at the same final diagnoses with all sequences, differentiating four groups of patients. Group 1 (30 patients, 53 %), with a final diagnosis of multiple sclerosis (MS). Demyelinating lesions were better seen on STIR-FSE images, on which the number of lesions was significantly higher than on FSE, while the FSE and CSE images showed approximately equal numbers of lesions; additional lesions were found in 9 patients. The contrast-to-noise ratio (CNR) of 17 demyelinating lesions was significantly higher on STIR-FSE images than with the other sequences. Group 2, 19 patients (33 %) with cervical pain, 15 of whom had disc protrusion or herniation: herniated discs were equally well delineated with all sequences, with better myelographic effect on FSE. In five patients with intrinsic spinal cord abnormalities, the conspicuity and demarcation of the lesions were similar with STIR-FSE and FSE. Group 3, 4 patients (7 %) with acute myelopathy of unknown aetiology. In two patients, STIR-FSE gave better demarcation of lesions and in one a questionable additional lesion. Group 4, 4 patients (7 %) with miscellaneous final diagnoses. STIR-FSE had high sensitivity to demyelinating lesions, can be considered quite specific and should be included in spinal MRI for assessment of suspected demyelinating disease. (orig.)
A comparison of complete mitochondrial genomes of silver carp hypophthalmichthys molitrix and bighead carp hypophthalmichthys nobilis: Implications for their taxonomic relationship and phylogeny

Science.gov (United States)

Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.

2009-01-01

Based upon morphological characters, silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNA) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98.8%), lowest in the control region (89.4%) and intermediate in the protein-coding genes (94.2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. © 2009 The Fisheries Society of the British Isles.
Next-Generation Sequencing Reveals the Impact of Repetitive DNA Across Phylogenetically Closely Related Genomes of Orobanchaceae

Czech Academy of Sciences Publication Activity Database

Piednoël, M.; Aberer, A.J.; Schneeweiss, G. M.; Macas, Jiří; Novák, Petr; Gundlach, H.; Temsch, E.M.; Renner, S.S.

2012-01-01

Vol. 29, No. 11 (2012), pp. 3601-3611 ISSN 0737-4038 Institutional research plan: CEZ:AV0Z50510513 Institutional support: RVO:60077344 Keywords: next-generation sequencing * polyploidy * genome size * Ty3/Gypsy * transposable elements Subject RIV: EB - Genetics; Molecular Biology Impact factor: 10.353, year: 2012
Digital PCR provides sensitive and absolute calibration for high throughput sequencing

Directory of Open Access Journals (Sweden)

Fan H Christina

2009-03-01

Full Text Available Background: Next-generation DNA sequencing on the 454, Solexa, and SOLiD platforms requires absolute calibration of the number of molecules to be sequenced. This requirement has two unfavorable consequences. First, large amounts of sample, typically micrograms, are needed for library preparation, thereby limiting the scope of samples which can be sequenced. For many applications, including metagenomics and the sequencing of ancient, forensic, and clinical samples, the quantity of input DNA can be critically limiting. Second, each library requires a titration sequencing run, thereby increasing the cost and lowering the throughput of sequencing. Results: We demonstrate the use of digital PCR to accurately quantify 454 and Solexa sequencing libraries, enabling the preparation of sequencing libraries from nanogram quantities of input material while eliminating costly and time-consuming titration runs of the sequencer. We successfully sequenced low-nanogram scale bacterial and mammalian DNA samples on the 454 FLX and Solexa DNA sequencing platforms. This study is the first to definitively demonstrate the successful sequencing of picogram quantities of input DNA on the 454 platform, reducing the sample requirement more than 1000-fold without pre-amplification and the associated bias and reduction in library depth. Conclusion: The digital PCR assay allows absolute quantification of sequencing libraries, eliminates uncertainties associated with the construction and application of standard curves to PCR-based quantification, and with a coefficient of variation close to 10%, is sufficiently precise to enable direct sequencing without titration runs.
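The abstract does not spell out how digital PCR counts are converted to molecule numbers. The sketch below uses the standard Poisson correction for digital PCR in general, which is an assumption about the method rather than a description of the authors' protocol. If k of n partitions are positive, the mean number of template copies per partition is estimated as -ln(1 - k/n). The function names and the partition counts are hypothetical.

```python
import math


def copies_per_partition(positive, total):
    """Poisson estimate of mean template copies per partition."""
    if total <= 0 or not 0 <= positive < total:
        raise ValueError("need 0 <= positive < total")
    return -math.log(1 - positive / total)


def total_copies(positive, total):
    """Estimated number of template molecules across all partitions."""
    return copies_per_partition(positive, total) * total


# Toy numbers: half of 1000 partitions are positive.
lam = copies_per_partition(500, 1000)
print(f"copies per partition: {lam:.4f}")
print(f"estimated molecules loaded: {total_copies(500, 1000):.0f}")
```

The correction matters because a positive partition may hold more than one molecule, so simply counting positive partitions underestimates the input. No standard curve is involved, which is consistent with the absolute quantification the abstract describes.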
Potentially Stressful Life Events and Emotional Closeness between Grandparents and Adult Grandchildren

Science.gov (United States)

Wood, Suzanne; Liossis, Poppy

2007-01-01

The purpose of this study is to explore the variation in emotional closeness in the adult grandchild and grandparent relationship in relation to the occurrence of potentially stressful life events in childhood. A sample of university students (N = 119) completed a questionnaire measuring elements of intergenerational solidarity. Comparisons were…
Phylogenetic relationships of Salmonella based on rRNA sequences

DEFF Research Database (Denmark)

Christensen, H.; Nordentoft, Steen; Olsen, J.E.

1998-01-01

separated by 16S rRNA analysis and found to be closely related to the Escherichia coli and Shigella complex by both 16S and 23S rRNA analyses. The diphasic serotypes S. enterica subspp. I and VI were separated from the monophasic serotypes subspp. IIIa and IV, including S. bongori, by 23S rRNA sequence...
NeSSM: a Next-generation Sequencing Simulator for Metagenomics.

Directory of Open Access Journals (Sweden)

Ben Jia

Full Text Available BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics). Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and paired-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing unit) version of NeSSM has also been developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it is freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.
Next-generation sequencing for genetic testing of familial colorectal cancer syndromes.

Science.gov (United States)

Simbolo, Michele; Mafficini, Andrea; Agostini, Marco; Pedrazzani, Corrado; Bedin, Chiara; Urso, Emanuele D; Nitti, Donato; Turri, Giona; Scardoni, Maria; Fassan, Matteo; Scarpa, Aldo

2015-01-01

Genetic screening in families with high risk to develop colorectal cancer (CRC) prevents incurable disease and permits personalized therapeutic and follow-up strategies. The advancement of next-generation sequencing (NGS) technologies has revolutionized the throughput of DNA sequencing. A series of 16 probands for either familial adenomatous polyposis (FAP; 8 cases) or hereditary nonpolyposis colorectal cancer (HNPCC; 8 cases) were investigated for intragenic mutations in five CRC familial syndromes-associated genes (APC, MUTYH, MLH1, MSH2, MSH6) applying both a custom multigene Ion AmpliSeq NGS panel and conventional Sanger sequencing. Fourteen pathogenic variants were detected in 13/16 FAP/HNPCC probands (81.3 %); one FAP proband presented two co-existing pathogenic variants, one in APC and one in MUTYH. Thirteen of these 14 pathogenic variants were detected by both NGS and Sanger, while one MSH2 mutation (L280FfsX3) was identified only by Sanger sequencing. This is due to a limitation of the NGS approach in resolving sequences close or within homopolymeric stretches of DNA. To evaluate the performance of our NGS custom panel we assessed its capability to resolve the DNA sequences corresponding to 2225 pathogenic variants reported in the COSMIC database for APC, MUTYH, MLH1, MSH2, MSH6. Our NGS custom panel resolves the sequences where 2108 (94.7 %) of these variants occur. The remaining 117 mutations reside inside or in close proximity to homopolymer stretches; of these 27 (1.2 %) are imprecisely identified by the software but can be resolved by visual inspection of the region, while the remaining 90 variants (4.0 %) are blind spots. In summary, our custom panel would miss 4 % (90/2225) of pathogenic variants that would need a small set of Sanger sequencing reactions to be solved. The multiplex NGS approach has the advantage of analyzing multiple genes in multiple samples simultaneously, requiring only a reduced number of Sanger sequences to resolve
Swallow Event Sequencing: Comparing Healthy Older and Younger Adults.

Science.gov (United States)

Herzberg, Erica G; Lazarus, Cathy L; Steele, Catriona M; Molfenter, Sonja M

2018-04-23

Previous research has established that a great deal of variation exists in the temporal sequence of swallowing events for healthy adults. Yet, the impact of aging on swallow event sequence is not well understood. Kendall et al. (Dysphagia 18(2):85-91, 2003) suggested there are 4 obligatory paired-event sequences in swallowing. We directly compared adherence to these sequences, as well as event latencies, and quantified the percentage of unique sequences in two samples of healthy adults: young ( 65). The 8 swallowing events that contribute to the sequences were reliably identified from videofluoroscopy in a sample of 23 healthy seniors (10 male, mean age 74.7) and 20 healthy young adults (10 male, mean age 31.5) with no evidence of penetration-aspiration or post-swallow residue. Chi-square analyses compared the proportions of obligatory pairs and unique sequences by age group. Compared to the older subjects, younger subjects had significantly lower adherence to two obligatory sequences: Upper Esophageal Sphincter (UES) opening occurs before (or simultaneous with) the bolus arriving at the UES and UES maximum distention occurs before maximum pharyngeal constriction. The associated latencies were significantly different between age groups as well. Further, significantly fewer unique swallow sequences were observed in the older group (61%) compared with the young (82%) (χ² = 31.8; p < 0.001). Our findings suggest that paired swallow event sequences may not be robust across the age continuum and that variation in swallow sequences appears to decrease with aging. These findings provide normative references for comparisons to older individuals with dysphagia.
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

Science.gov (United States)

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from the Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. A genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using the unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the other taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from the Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
Magnetic resonance imaging of anterior cruciate ligament of the knee: a comparison of four sequences

International Nuclear Information System (INIS)

Casillas, C.; Marti-Bonmati, L.; Molla, E.; Ferrer, P.; Dosda, R.

1999-01-01

To compare the diagnostic efficacy of the four magnetic resonance imaging (MRI) sequences that compose the standard protocol for the study of the knee in our center when employed in the examination of anterior cruciate ligament (ACL). A prospective study was carried out based on MRI findings in the knees of 326 consecutive patients. Sagittal [proton density (PD)-weighted turbo-spin-echo and T2*-weighted gradient echo], coronal (PD-weighted turbo-spin-echo with fat suppression) and transverse (T2*-weighted gradient echo with magnetization transfer) images were evaluated. Each sequence was analyzed independently by two radiologists, while another two assessed all the sequences together with the clinical findings. Four categories were established: normal ACL, partially torn, completely torn and synovialized. The sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) with respect to the definitive diagnosis were calculated for each sequence. The statistical analysis of the findings for each category was done using the chi-squared test and the Kappa test was employed to assess the degree of agreement. According to the final diagnosis, 263 ACL were normal, 29 were partially torn, 33 were completely torn and there was 1 case of synovialization associated with a completely torn ACL. The relationship between the analysis of the ACL according to each sequence and the definitive diagnosis was very significant (p<0.001) and the agreement was excellent. All the sequences presented similar levels of diagnostic precision. The coronal sequence had the fewest diagnostic errors (2.1%). The combinations of imaging techniques that resulted in the lowest error rate with respect to the definitive diagnosis were coronal PD-weighted turbo-spin-echo with fat suppression and sagittal PD-weighted turbo-spin-echo. Coronal images are highly precise in the evaluation of ACL. Sagittal sequences are the most valid for diagnosis of torn ACL. Transverse
Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

Science.gov (United States)

Wen, Chiu-Ming

2017-08-01

An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.
PHARMACOGENETIC TESTING OPPORTUNITIES IN CARDIOLOGY BASED ON EXOME SEQUENCING

Directory of Open Access Journals (Sweden)

N. V. Shcherbakova

2014-01-01

Aim. To study which cardiac drugs currently carry annotations on biomarkers and what information can be obtained by pharmacogenetic testing using exome sequencing data in patients with cardiac diseases. Material and methods. Exome sequencing in a random participant of the ATEROGEN IVANOVO study and bioinformatics analysis of the data were performed. Point mutations were annotated using the ANNOVAR program, and a comparison with a number of specialized databases was done on the basis of user protocols. Results. 11 cardiac drugs and 7 genes whose variants can influence cardiac drug metabolism were analyzed. According to exome sequencing of the participant, we did not reveal allelic variants that require dose regime correction and careful efficacy control. Conclusion. The application of exome sequencing is the next step toward a wide range of personalized therapy. Future opportunities for improvement of the risk-benefit ratio in each patient are the main purpose of the collection and analysis of pharmacogenetic data.

Genome Sequence of the Freshwater Yangtze Finless Porpoise.

Science.gov (United States)

Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jingsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang; Li, Songhai

2018-04-16

The Yangtze finless porpoise ( Neophocaena asiaeorientalis ssp. asiaeorientalis ) is a subspecies of the narrow-ridged finless porpoise ( N. asiaeorientalis ). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603.
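The contig and scaffold N50 values quoted above follow the usual assembly-statistics definition: the length L such that sequences of length L or longer contain at least half of all assembled bases. A minimal sketch of that calculation is given here, assuming the input is simply a list of sequence lengths; the function name `n50` and the toy lengths are illustrative and are not taken from the porpoise assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of the total.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy lengths (hypothetical, arbitrary units), not real assembly data.
toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_contigs))
```

Because the statistic depends only on the length distribution, the same function can be applied to contigs and to scaffolds separately, which is how the two figures in the abstract would typically be obtained.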
Validation of rice genome sequence by optical mapping

Directory of Open Access Journals (Sweden)

Pape Louise

2007-08-01

Abstract. Background: Rice feeds much of the world, and possesses the simplest genome analyzed to date within the grass family, making it an economically relevant model system for other cereal crops. Although the rice genome is sequenced, validation and gap closing efforts require purely independent means for accurate finishing of sequence build data. Results: To facilitate ongoing sequencing finishing and validation efforts, we have constructed a whole-genome SwaI optical restriction map of the rice genome. The physical map consists of 14 contigs, covering 12 chromosomes, with a total genome size of 382.17 Mb; this value is about 11% smaller than original estimates. 9 of the 14 optical map contigs are without gaps, covering chromosomes 1, 2, 3, 4, 5, 7, 8, 10, and 12 in their entirety – including centromeres and telomeres. Alignments between optical and in silico restriction maps constructed from IRGSP (International Rice Genome Sequencing Project) and TIGR (The Institute for Genomic Research) genome sequence sources are comprehensive and informative, evidenced by map coverage across virtually all published gaps, discovery of new ones, and characterization of sequence misassemblies; all totalling ~14 Mb. Furthermore, since optical maps are ordered restriction maps, identified discordances are pinpointed on a reliable physical scaffold providing an independent resource for closure of gaps and rectification of misassemblies. Conclusion: Analysis of sequence and optical mapping data effectively validates genome sequence assemblies constructed from large, repeat-rich genomes. Given this conclusion we envision new applications of such single molecule analysis that will merge advantages offered by high-resolution optical maps with inexpensive, but short sequence reads generated by emerging sequencing platforms. Lastly, map construction techniques presented here point the way to new types of comparative genome analysis that would focus on discernment of
Evolutionary growth process of highly conserved sequences in vertebrate genomes.

Science.gov (United States)

Ishibashi, Minaka; Noda, Akiko Ogura; Sakate, Ryuichi; Imanishi, Tadashi

2012-08-01

Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
Skeleton-based human action recognition using multiple sequence alignment

Science.gov (United States)

Ding, Wenwen; Liu, Kai; Cheng, Fei; Zhang, Jin; Li, YunSong

2015-05-01

Human action recognition and analysis has been an active research topic in computer vision for many years. This paper presents a method to represent human actions based on trajectories consisting of 3D joint positions. The method first decomposes an action into a sequence of meaningful atomic actions (actionlets), and then labels the actionlets with letters of the English alphabet according to the Davies-Bouldin index value. An action can therefore be represented as a sequence of actionlet symbols, which preserves the temporal order of occurrence of the actionlets. Finally, we employ sequence comparison to classify multiple actions using a string matching algorithm (Needleman-Wunsch). The effectiveness of the proposed method is evaluated on datasets captured by commodity depth cameras. Experiments with the proposed method on three challenging 3D action datasets show promising results.
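The final classification step described above, comparing actionlet strings with Needleman-Wunsch, can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the scoring values (match +1, mismatch -1, gap -1), the nearest-template rule in `classify`, and the actionlet strings are all hypothetical.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment score of strings a and b (Needleman-Wunsch)."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against the empty string costs one gap per symbol.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = score[i - 1][j] + gap      # gap in b
            left = score[i][j - 1] + gap    # gap in a
            score[i][j] = max(diag, up, left)
    return score[-1][-1]


def classify(query, templates):
    """Return the label whose template string aligns best with the query."""
    return max(templates, key=lambda label: needleman_wunsch_score(query, templates[label]))


# Hypothetical actionlet strings; each letter stands for one actionlet cluster.
templates = {"wave": "ABAB", "sit": "CDD"}
print(classify("ABB", templates))
```

Global alignment suits this setting because each string describes a complete action from start to end, so unmatched actionlets at either end are penalised as gaps instead of being ignored.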
Detecting Horizontal Gene Transfer between Closely Related Taxa.

Directory of Open Access Journals (Sweden)

Orit Adato

2015-10-01

Horizontal gene transfer (HGT), the transfer of genetic material between organisms, is crucial for genetic innovation and the evolution of genome architecture. Existing HGT detection algorithms rely on a strong phylogenetic signal distinguishing the transferred sequence from ancestral (vertically derived) genes in its recipient genome. Detecting HGT between closely related species or strains is challenging, as the phylogenetic signal is usually weak and the nucleotide composition is normally nearly identical. Nevertheless, it is of great importance to detect HGT between congeneric species or strains, especially in clinical microbiology, where understanding the emergence of new virulent and drug-resistant strains is crucial, and often time-sensitive. We developed a novel, self-contained technique named Near HGT, based on the synteny index, to measure the divergence of a gene from its native genomic environment and used it to identify candidate HGT events between closely related strains. The method confirms candidate transferred genes based on the constant relative mutability (CRM). Using CRM, the algorithm assigns a confidence score based on "unusual" sequence divergence. A gene exhibiting exceptional deviations according to both synteny and mutability criteria is considered a validated HGT product. We first applied the technique to a set of three E. coli strains and detected several highly probable horizontally acquired genes. We then compared the method to existing HGT detection tools using a larger strain data set. When combined with additional approaches, our new algorithm provides a richer picture and brings us closer to the goal of detecting all newly acquired genes in a particular strain.
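The abstract does not define the synteny index, so the sketch below rests on an assumed, commonly used formulation: for a gene present in two genomes, count the genes shared between its k-gene neighbourhoods on either side in each genome. The circular-genome treatment, the function names and the toy gene orders are all assumptions made for illustration; this is not the Near HGT implementation.

```python
def neighborhood(genome, gene, k):
    """Set of up to 2k genes flanking `gene` in a circular gene order."""
    pos = genome.index(gene)
    n = len(genome)
    flank = {genome[(pos + off) % n] for off in range(-k, k + 1) if off != 0}
    flank.discard(gene)  # guard against wrap-around on very small genomes
    return flank


def synteny_index(genome_a, genome_b, gene, k):
    """Number of neighbouring genes that `gene` shares between two genomes."""
    return len(neighborhood(genome_a, gene, k) & neighborhood(genome_b, gene, k))


# Toy gene orders (hypothetical). Gene "D" keeps only part of its context.
strain_a = ["A", "B", "C", "D", "E", "F", "G", "H"]
strain_b = ["E", "F", "D", "G", "H", "A", "B", "C"]
print(synteny_index(strain_a, strain_a, "D", k=2))
print(synteny_index(strain_a, strain_b, "D", k=2))
```

Under this formulation a low count for one gene, relative to the counts of the other genes shared by the two strains, marks it as a candidate that sits outside its native genomic context.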
MR colonography with fecal tagging: comparison between 2D turbo FLASH and 3D FLASH sequences

International Nuclear Information System (INIS)

Papanikolaou, Nickolas; Grammatikakis, John; Maris, Thomas; Prassopoulos, Panos; Gourtsoyiannis, Nicholas; Lauenstein, Thomas

2003-01-01

The objective of this study was to compare inversion recovery turbo 2D fast low-angle shot (FLASH) and 3D FLASH sequences for fecal-tagged MR colonography studies. Fifteen consecutive patients with indications for colonoscopy underwent MR colonography with fecal tagging. An inversion recovery turbo-FLASH sequence was applied and compared in terms of artifacts presence, efficiency for masking residual stool, and colonic wall conspicuity with a fat-saturated 3D FLASH sequence. Both sequences were acquired following administration of paramagnetic contrast agent. Contrast-to-noise ratio and relative contrast between colonic wall and lumen were calculated and compared for both sequences. Turbo 2D FLASH provided fewer artifacts, higher efficiency for masking the residual stool, and colonic wall conspicuity equivalent to 3D FLASH. An inversion time of 10 ms provided homogeneously low signal intensity of the colonic lumen. Contrast to noise between colonic wall and lumen was significantly higher in the 3D FLASH images, whereas differences in relative contrast were not statistically significant. An optimized inversion-recovery 2D turbo-FLASH sequence provides better fecal tagging results and should be added to the 3D FLASH sequence when designing dark-lumen MR colonography examination protocols. (orig.)
Evidence for a close phylogenetic relationship between Melissococcus pluton, the causative agent of European foulbrood disease, and the genus Enterococcus.

Science.gov (United States)

Cai, J; Collins, M D

1994-04-01

The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.
Recognition of hypoxyloid and xylarioid Entonaema species and allied Xylaria species from a comparison of holomorphic morphology, HPLC profiles, and ribosomal DNA sequences

DEFF Research Database (Denmark)

Stadler, M.; Fournier, J.; Læssøe, Thomas

2008-01-01

pallidum is thus regarded as a later synonym of E. mesentericum. Therefore, the latter name is transferred to Xylaria. A key to entonaemoid Xylariaceae is provided. Colour reactions (NH3, KOH) of the ectostroma were applied to a limited number of Xylaria spp., but metabolite profiles of cultures appear......The genus Entonaema comprises Xylariaceae with hollow, gelatinous stromata that accumulate liquid. Some of its species, including the type species, appear related to Daldinia from a polyphasic approach, comprising morphological studies, comparisons of ribosomal DNA sequences, and high performance...
Evaluation of TSE- and T1-3D-GRE-sequences for focal cartilage lesions in vitro in comparison to ultrahigh resolution multi-slice CT

International Nuclear Information System (INIS)

Stork, A.; Schulze, D.; Koops, A.; Kemper, J.; Adam, G.

2002-01-01

Purpose: Evaluation of TSE- and T1-3D-GRE-sequences for focal cartilage lesions in vitro in comparison to ultrahigh resolution multi-slice CT. Materials and methods: Forty artificial cartilage lesions in ten bovine patellae were immersed in a solution of iodinated contrast medium and assessed with ultrahigh resolution multi-slice CT. Fat-suppressed TSE images with intermediate- and T2-weighting at a slice thickness of 2, 3 and 4 mm as well as fat-suppressed T1-weighted 3D-FLASH images with an effective slice thickness of 1, 2 and 3 mm were acquired at 1.5 T. After adding Gd-DTPA to the saline solution containing the patellae, the T1-weighted 3D-FLASH imaging was repeated. Results: All cartilage lesions were visualised and graded with ultrahigh resolution multi-slice CT. The TSE images had a higher sensitivity and a higher inter- and intraobserver kappa compared to the FLASH-sequences (TSE: 70-95%; 0.82-0.83; 0.85-0.9; FLASH: 57.5-85%; 0.53-0.72; 0.73-0.82, respectively). An increase in slice thickness decreased the sensitivity, whereby deep lesions were even reliably depicted on TSE images at a slice thickness of 3 and 4 mm. Adding Gd-DTPA to the saline solution increased the sensitivity by 10% with no detectable advantage over the T2-weighted TSE images. Conclusion: TSE sequences and application of Gd-DTPA seemed to be superior to T1-weighted 3D-FLASH sequences without Gd-DTPA in the detection of focal cartilage lesions. The ultrahigh resolution multi-slice CT can serve as in vitro reference standard for focal cartilage lesions. (orig.) [de
FLAIR MR sequence in the diagnosis and follow-up of low-grade astrocytomas

Directory of Open Access Journals (Sweden)

Stošić-Opinćal Tatjana

2005-01-01

Aim. To evaluate the sensitivity of the fluid-attenuated inversion recovery (FLAIR) sequence in the diagnosis and follow-up of patients with low-grade astrocytomas compared with the T2-weighted (T2W) sequence. Methods. Twenty-four patients with biopsy-confirmed low-grade astrocytoma (age range, 15-66 years) underwent T1-weighted (T1W), T2W and FLAIR imaging with a superconducting 1.0 T unit. FLAIR images were qualitatively evaluated by comparison with T2W images by three experienced neuroradiologists. To evaluate the diagnostic value of FLAIR, the neuroradiologists individually assessed the possibilities of the detection of lesions, as well as the possibilities of the differentiation of tumor from the surrounding edema on FLAIR vs. T2W images. Every examiner ranked the FLAIR sequence vs. T2W in three degrees: worse, equal and better. Results. The comparison of FLAIR with T2W spin-echo (SE) images with regard to the detection of the lesions showed that 82.8% of FLAIR studies were superior, 17.2% were of similar diagnostic value, and none was inferior to the T2W images. The comparison of images with regard to the differentiation of tumor boundaries vs. surrounding edema showed that 92.5% of FLAIR studies were superior, 7.5% were of similar diagnostic value, and none was inferior to the T2W images. Conclusion. Our results were similar to the previous studies' results concerning the advantages of the FLAIR sequence over the T2W sequence in the diagnosis of low-grade astrocytomas. FLAIR was better at showing different tumor components, and at distinguishing CSF from the cystic component and the postoperative cavity, compared with T2W images. Our conclusion was that FLAIR could be routinely used in the evaluation and follow-up of low-grade astrocytomas.
Google matrix analysis of DNA sequences.

Science.gov (United States)

Kandiah, Vivek; Shepelyansky, Dima L

2013-01-01

For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.
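To make the construction above concrete, the following sketch builds a small Google matrix from word-to-word transitions in a DNA string and obtains its PageRank vector by power iteration. Several details are assumptions and are not taken from the paper: the sequence is cut into consecutive non-overlapping words of `word_len` letters, columns without outgoing transitions are replaced by uniform columns, the damping factor is set to the commonly used value 0.85, and all function names and the toy sequence are illustrative.

```python
from itertools import product


def google_matrix(sequence, word_len=1, alpha=0.85):
    """Column-stochastic Google matrix of transitions between DNA words."""
    words = ["".join(p) for p in product("ACGT", repeat=word_len)]
    index = {w: i for i, w in enumerate(words)}
    n = len(words)
    # counts[to][from]: how often word `from` is followed by word `to`.
    counts = [[0.0] * n for _ in range(n)]
    chunks = [sequence[i:i + word_len]
              for i in range(0, len(sequence) - word_len + 1, word_len)]
    for src, dst in zip(chunks, chunks[1:]):
        if src in index and dst in index:  # skip words with non-ACGT letters
            counts[index[dst]][index[src]] += 1
    matrix = [[0.0] * n for _ in range(n)]
    for j in range(n):
        col_sum = sum(counts[i][j] for i in range(n))
        for i in range(n):
            # Dangling columns (no observed transitions) become uniform.
            s = counts[i][j] / col_sum if col_sum else 1.0 / n
            matrix[i][j] = alpha * s + (1 - alpha) / n
    return words, matrix


def pagerank(matrix, iterations=200):
    """Stationary vector of the Google matrix via power iteration."""
    n = len(matrix)
    p = [1.0 / n] * n
    for _ in range(iterations):
        p = [sum(matrix[i][j] * p[j] for j in range(n)) for i in range(n)]
    return p


words, G = google_matrix("ACGTACGGTTAACCGATTACA", word_len=1)
for word, prob in sorted(zip(words, pagerank(G)), key=lambda t: -t[1]):
    print(word, round(prob, 3))
```

Each column of the matrix sums to one, so the PageRank vector stays normalised throughout the iteration; comparing such vectors between species is the kind of input a proximity measure like the one mentioned in the abstract would work from.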
Google matrix analysis of DNA sequences.

Directory of Open Access Journals (Sweden)

Vivek Kandiah

For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.
Analysis of 16S rRNA amplicon sequencing options on the Roche/454 next-generation titanium sequencing platform.

Directory of Open Access Journals (Sweden)

Hideyuki Tamaki

BACKGROUND: The 16S rRNA gene pyrosequencing approach has revolutionized studies in microbial ecology. While primer selection and short read length can affect the resulting microbial community profile, little is known about the influence of pyrosequencing methods on the sequencing throughput and the outcome of microbial community analyses. The aim of this study is to compare differences in output, ease, and cost among three different amplicon pyrosequencing methods for the Roche/454 Titanium platform. METHODOLOGY/PRINCIPAL FINDINGS: The following three pyrosequencing methods for 16S rRNA genes were selected in this study: Method-1 (standard method) is the recommended method for bi-directional sequencing using the LIB-A kit; Method-2 is a new option designed in this study for unidirectional sequencing with the LIB-A kit; and Method-3 uses the LIB-L kit for unidirectional sequencing. In our comparison among these three methods using 10 different environmental samples, Method-2 and Method-3 produced 1.5-1.6 times more useable reads than the standard method (Method-1) after quality-based trimming, and did not compromise the outcome of microbial community analyses. Specifically, Method-3 is the most cost-effective unidirectional amplicon sequencing method as it provided the most reads and required the least effort in consumables management. CONCLUSIONS: Our findings clearly demonstrated that alternative pyrosequencing methods for 16S rRNA genes could drastically affect sequencing output (e.g. number of reads before and after trimming) but have little effect on the outcomes of microbial community analysis. This finding is important for both researchers and sequencing facilities utilizing 16S rRNA gene pyrosequencing for microbial ecological studies.
Endosymbiosis In Statu Nascendi: Close Phylogenetic Relationship Between Obligately Endosymbiotic and Obligately Free-Living Polynucleobacter Strains (Betaproteobacteria)

Energy Technology Data Exchange (ETDEWEB)

Vannini, Claudia; Pockl, Matthias; Petroni, Giulio; Wu, Qinglong; Lang, Elke; Stackebrandt, Erko; Schrallhammer, Martina; Richardson, PaulM.; Hahn, Martin W.

2006-07-21

Bacterial strains affiliated to the phylogenetically shallow subcluster C (PnecC) of the Polynucleobacter cluster, which is characterized by a minimal 16S rRNA gene sequence similarity of approx. 98.5 percent, have been reported to occur as obligate endosymbionts of ciliates (Euplotes spp.), as well as to occur as free-living cells in the pelagic zone of freshwater habitats. We investigated if these two groups of closely related bacteria represent strains fundamentally differing in lifestyle, or if they simply represent different stages of a facultative endosymbiotic lifestyle. The phylogenetic analysis of 16S rRNA gene and 16S-23S ITS sequences of five endosymbiont strains from two different Euplotes species and 40 pure culture strains demonstrated host-species-specific clustering of the endosymbiont sequences within the PnecC subcluster. The sequences of the endosymbionts showed characteristics indicating an obligate endosymbiotic lifestyle. Cultivation experiments revealed fundamental differences in physiological adaptations, and determination of the genome sizes indicated a slight size reduction in endosymbiotic strains. We conclude that the two groups of PnecC bacteria represent obligately free-living and obligately endosymbiotic strains, respectively, and do not represent different stages of the same complex lifecycle. These closely related strains occupy completely separated ecological niches. To our best knowledge, this is the closest phylogenetic relationship between obligate endosymbionts and obligately free-living bacteria ever revealed.
Applying Agrep to r-NSA to solve multiple sequences approximate matching.

Science.gov (United States)

Ni, Bing; Wong, Man-Hon; Lam, Chi-Fai David; Leung, Kwong-Sak

2014-01-01

This paper addresses the approximate matching problem in a database consisting of multiple DNA sequences, where the proposed approach applies Agrep to a new truncated suffix array, r-NSA. The construction time of the structure is linear in the database size, and indexing a substring in the structure takes constant time. The number of characters processed in applying Agrep is analysed theoretically, and the theoretical upper bound closely approximates the empirical number of characters, which is obtained by enumerating the characters in the actual structure built. Experiments are carried out using (synthetic) random DNA sequences, as well as (real) genome sequences including Hepatitis-B Virus and X-chromosome. Experimental results show that, compared to the straightforward approach that applies Agrep to multiple sequences individually, the proposed approach solves the matching problem in much shorter time. The speed-up of our approach depends on the sequence patterns, and for highly similar homologous genome sequences, which are the common cases in real-life genomes, it can be up to several orders of magnitude.
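To make the matching problem concrete, here is a minimal Python sketch of the baseline the abstract compares against: searching each sequence in a database individually for a pattern with at most k edits. It is not Agrep's bit-parallel algorithm and it does not build the r-NSA structure; it uses a plain dynamic-programming (Sellers-style) edit-distance scan instead. The function names `approx_match_ends` and `search_database` are hypothetical.

```python
def approx_match_ends(pattern, text, k):
    """Return 1-based end positions in `text` where `pattern` matches
    some substring with at most `k` edits (substitution, insertion, deletion).

    Plain dynamic programming, used here as a stand-in for Agrep.
    """
    m = len(pattern)
    # prev[i]: fewest edits aligning pattern[:i] with a substring that
    # ends at the previous text position.
    prev = list(range(m + 1))
    ends = []
    for j, ch in enumerate(text, start=1):
        cur = [0] * (m + 1)  # cur[0] = 0 lets a match start anywhere in the text
        for i in range(1, m + 1):
            substitute = prev[i - 1] + (pattern[i - 1] != ch)
            skip_pattern_char = cur[i - 1] + 1
            skip_text_char = prev[i] + 1
            cur[i] = min(substitute, skip_pattern_char, skip_text_char)
        if cur[m] <= k:
            ends.append(j)
        prev = cur
    return ends


def search_database(pattern, database, k):
    """Scan every sequence individually (the straightforward approach)."""
    return {name: approx_match_ends(pattern, seq, k)
            for name, seq in database.items()}
```

Each call to `search_database` costs time proportional to pattern length times total database size, and repeated substrings shared by similar sequences are rescanned every time; that redundancy is what an index over the whole database is meant to avoid.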
Closed and Open Design Projects in the Education of Engineers

DEFF Research Database (Denmark)

Franksen, Ole Immanuel

1965-01-01

The two aspects of engineering education are the teaching of science and the teaching of design. By "design" is meant the procedure of selecting and combining distinct elements to create complete systems which will perform useful functions. In this paper, the author describes the application of...... of this concept of design teaching at The Technical University of Denmark, after a procedure which includes a sequence of closed and open design projects in both computational and experimental laboratories...
Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

Science.gov (United States)

Pelsy, F.; Merdinoglu, D.

2002-09-01

A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.
Zucchini yellow mosaic virus: biological properties, detection procedures and comparison of coat protein gene sequences.

Science.gov (United States)

Coutts, B A; Kehoe, M A; Webster, C G; Wylie, S J; Jones, R A C

2011-12-01

Between 2006 and 2010, 5324 samples from at least 34 weed, two cultivated legume and 11 native species were collected from three cucurbit-growing areas in tropical or subtropical Western Australia. Two new alternative hosts of zucchini yellow mosaic virus (ZYMV) were identified, the Australian native cucurbit Cucumis maderaspatanus, and the naturalised legume species Rhyncosia minima. Low-level (0.7%) seed transmission of ZYMV was found in seedlings grown from seed collected from zucchini (Cucurbita pepo) fruit infected with isolate Cvn-1. Seed transmission was absent in >9500 pumpkin (C. maxima and C. moschata) seedlings from fruit infected with isolate Knx-1. Leaf samples from symptomatic cucurbit plants collected from fields in five cucurbit-growing areas in four Australian states were tested for the presence of ZYMV. When 42 complete coat protein (CP) nucleotide (nt) sequences from the new ZYMV isolates obtained were compared to those of 101 complete CP nt sequences from five other continents, phylogenetic analysis of the 143 ZYMV sequences revealed three distinct groups (A, B and C), with four subgroups in A (I-IV) and two in B (I-II). The new Australian sequences grouped according to collection location, fitting within A-I, A-II and B-II. The 16 new sequences from one isolated location in tropical northern Western Australia all grouped into subgroup B-II, which contained no other isolates. In contrast, the three sequences from the Northern Territory fitted into A-II with 94.6-99.0% nt identities with isolates from the United States, Iran, China and Japan. The 23 new sequences from the central west coast and two east coast locations all fitted into A-I, with 95.9-98.9% nt identities to sequences from Europe and Japan. These findings suggest that (i) there have been at least three separate ZYMV introductions into Australia and (ii) there are few changes to local isolate CP sequences following their establishment in remote growing areas. Isolates from A-I and B
Time Separation Between Events in a Sequence: a Regional Property?

Science.gov (United States)

Muirwood, R.; Fitzenz, D. D.

2013-12-01

Earthquake sequences are loosely defined as events occurring too closely in time and space to appear unrelated. Depending on the declustering method, several, all, or no event(s) after the first large event might be recognized as independent mainshocks. It can therefore be argued that a probabilistic seismic hazard assessment (PSHA, traditionally dealing with mainshocks only) might already include the ground shaking effects of such sequences. Alternatively all but the largest event could be classified as an 'aftershock' and removed from the earthquake catalog. While in PSHA the question is only whether to keep or remove the events from the catalog, for Risk Management purposes, the community response to the earthquakes, as well as insurance risk transfer mechanisms, can be profoundly affected by the actual timing of events in such a sequence. In particular the repetition of damaging earthquakes over a period of weeks to months can lead to businesses closing and families evacuating from the region (as happened in Christchurch, New Zealand in 2011). Buildings that are damaged in the first earthquake may go on to be damaged again, even while they are being repaired. Insurance also functions around a set of critical timeframes - including the definition of a single 'event loss' for reinsurance recoveries within the 192 hour 'hours clause', the 6-18 month pace at which insurance claims are settled, and the annual renewal of insurance and reinsurance contracts. We show how temporal aspects of earthquake sequences need to be taken into account within models for Risk Management, and what time separation between events are most sensitive, both in terms of the modeled disruptions to lifelines and business activity as well as in the losses to different parties (such as insureds, insurers and reinsurers). We also explore the time separation between all events and between loss causing events for a collection of sequences from across the world and we point to the need to
A Snapshot of the Emerging Tomato Genome Sequence

Directory of Open Access Journals (Sweden)

Lukas A. Mueller

2009-03-01

The genome of tomato (Solanum lycopersicum L.) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States) as part of the larger “International Solanaceae Genome Project (SOL): Systems Approach to Diversity and Adaptation” initiative. The tomato genome sequencing project uses an ordered bacterial artificial chromosome (BAC) approach to generate a high-quality tomato euchromatic genome sequence for use as a reference genome for the Solanaceae and euasterids. Sequence is deposited at GenBank and at the SOL Genomics Network (SGN). Currently, there are around 1000 BACs finished or in progress, representing more than a third of the projected euchromatic portion of the genome. An annotation effort is also underway by the International Tomato Annotation Group. The expected number of genes in the euchromatin is ∼40,000, based on an estimate from a preliminary annotation of 11% of finished sequence. Here, we present this first snapshot of the emerging tomato genome and its annotation, a short comparison with potato (Solanum tuberosum L.) sequence data, and the tools available for researchers to exploit this new resource. In the future, whole-genome shotgun techniques will be combined with the BAC-by-BAC approach to cover the entire tomato genome. The high-quality reference euchromatic tomato sequence is expected to be near completion by 2010.

Comparison of two approaches for the classification of 16S rRNA gene sequences.

Science.gov (United States)

Chatellier, Sonia; Mugnier, Nathalie; Allard, Françoise; Bonnaud, Bertrand; Collin, Valérie; van Belkum, Alex; Veyrieras, Jean-Baptiste; Emler, Stefan

2014-10-01

The use of 16S rRNA gene sequences for microbial identification in clinical microbiology is accepted widely, and requires databases and algorithms. We compared a new research database containing curated 16S rRNA gene sequences in combination with the lca (lowest common ancestor) algorithm (RDB-LCA) to a commercially available 16S rDNA Centroid approach. We used 1025 bacterial isolates characterized by biochemistry, matrix-assisted laser desorption/ionization time-of-flight MS and 16S rDNA sequencing. Nearly 80 % of isolates were identified unambiguously at the species level by both classification platforms used. The remaining isolates were mostly identified correctly at the genus level due to the limited resolution of 16S rDNA sequencing. Discrepancies between both 16S rDNA platforms were due to differences in database content and the algorithm used, and could amount to up to 10.5 %. Up to 1.4 % of the analyses were found to be inconclusive. It is important to realize that despite the overall good performance of the pipelines for analysis, some inconclusive results remain that require additional in-depth analysis performed using supplementary methods. © 2014 The Authors.
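The lowest-common-ancestor idea mentioned above can be sketched in a few lines of Python. This is only an illustration of the general technique, not the RDB-LCA pipeline: it assumes each best-matching reference sequence comes with a root-to-species lineage, and it reports the deepest prefix shared by all of them. The function name and the example lineages are hypothetical.

```python
def lowest_common_ancestor(lineages):
    """Return the longest lineage prefix shared by every input lineage.

    Each lineage is a list of taxon names ordered from root to species.
    """
    if not lineages:
        return []
    common = []
    # zip stops at the shortest lineage, so ragged inputs are handled.
    for taxa_at_rank in zip(*lineages):
        if len(set(taxa_at_rank)) != 1:
            break  # the hits disagree from this rank downward
        common.append(taxa_at_rank[0])
    return common


# Illustrative lineages for two equally good hits (assumed, not from the study).
hits = [
    ["Bacteria", "Enterobacteriaceae", "Escherichia", "Escherichia coli"],
    ["Bacteria", "Enterobacteriaceae", "Escherichia", "Escherichia fergusonii"],
]
assignment = lowest_common_ancestor(hits)
```

When equally good hits belong to different species of one genus, the assignment stops at the genus, which mirrors the abstract's observation that some isolates can only be resolved to genus level by 16S rDNA sequencing.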
On H-closed and U-closed functions | Cammaroto | Quaestiones ...

African Journals Online (AJOL)

In this article, we extend the work on H-closed functions started by Cammaroto, Fedorchuk and Porter in 1998. Also, U-closed functions are introduced and characterized in terms of filters and adherence. The hereditary and productivity properties are examined and developed for both H-closed and U-closed functions.
Numerical modelling of closed-cell aluminium foam under dynamic loading

Science.gov (United States)

Hazell, Paul; Kader, M. A.; Islam, M. A.; Escobedo, J. P.; Saadatfar, M.

2015-06-01

Closed-cell aluminium foams are extensively used in aerospace and automobile industries. The understanding of their behaviour under impact loading conditions is extremely important since impact problems are directly related to design of these engineering structures. This research investigates the response of a closed-cell aluminium foam (CYMAT) subjected to dynamic loading using the finite element software ABAQUS/Explicit. The aim of this research is to numerically investigate the material and structural properties of closed-cell aluminium foam under impact loading conditions with interest in shock propagation and its effects on cell wall deformation. A μ-CT based 3D foam geometry is developed to simulate the local cell collapse behaviours. A number of numerical techniques are applied for modelling the crush behaviour of aluminium foam to obtain more accurate results. The simulation results are compared with experimental data. Comparison of the results shows a good correlation between the experimental results and numerical predictions.
Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity.

Science.gov (United States)

King, Brian R; Aburdene, Maurice; Thompson, Alex; Warres, Zach

2014-01-01

Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
Calculation of T2 relaxation time from ultrafast single shot sequences for differentiation of liver tumors. Comparison of echo-planar, HASTE, and spin-echo sequences

International Nuclear Information System (INIS)

Abe, Yasuko; Yamashita, Yasuyuki; Tang, Yi; Namimoto, Tomohiro; Takahashi, Mutsumasa

2000-01-01

The purpose of this study was to evaluate the accuracy of T2 calculation from single shot imaging sequences such as echo-planar imaging (EPI) and half-Fourier single shot turbo spin-echo (HASTE) imaging. For the phantom study, we prepared vials containing different concentrations of agarose, copper sulfate, and nickel chloride. The temperature of the phantom was kept at 22 deg C. MR images were obtained with a 1.5-Tesla superconductive magnet. Spin-echo (SE)-type EPI and HASTE sequences with different TEs were obtained for T2 calculation, and the T2 values were compared with those obtained from the Carr-Purcell-Meiboom-Gill (CPMG) sequence. The clinical study group consisted of 30 consecutive patients referred for MR imaging to characterize focal liver lesions. A total of 40 focal liver lesions were evaluated, including 25 primary or metastatic solid masses and 15 non-solid lesions. Single shot SE-type EPI and HASTE were both performed with TEs of 64 and 90 msec. In the phantom study, the T2 values obtained from both single shot sequences showed significant correlations with those from the CPMG sequence (T2 on EPI vs. T2 on CPMG: r=0.98, p<0.01; T2 on HASTE vs. T2 on CPMG: r=0.99, p<0.01). In the clinical study, mean T2 values for liver calculated from EPI (42 msec) were significantly shorter than those calculated from the HASTE sequence (58 msec) (p<0.001). Mean T2 values for solid tumors were 95 msec with HASTE and 72 msec with EPI, and mean T2 values for non-solid lesions were 128 msec with HASTE and 159 msec with EPI. Although mean T2 values between solid and non-solid lesions were significantly different for both EPI and HASTE sequences (p=0.01 for HASTE, p<0.001 for EPI), the overlap of solid and non-solid lesions was less frequent in EPI than in HASTE. With single shot sequences, it is possible to obtain the T2 values that show excellent correlation with the CPMG sequence. Although both HASTE and EPI are useful to calculate T2 values, EPI appears to be more
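The abstract does not say how T2 was derived from the two echo times, so the following Python sketch rests on an assumption: mono-exponential decay, S(TE) = S0 * exp(-TE / T2), which gives the standard two-point estimate T2 = (TE2 - TE1) / ln(S1 / S2). The function name `t2_two_point` is hypothetical; the default echo times of 64 and 90 msec are the ones reported above.

```python
import math


def t2_two_point(signal_te1, signal_te2, te1_ms=64.0, te2_ms=90.0):
    """Estimate T2 (msec) from signal intensities at two echo times.

    Assumes mono-exponential decay; this is an illustrative model,
    not necessarily the calculation used in the study.
    """
    if signal_te1 <= 0 or signal_te2 <= 0:
        raise ValueError("signal intensities must be positive")
    if signal_te1 <= signal_te2:
        raise ValueError("signal must decay between the two echoes")
    return (te2_ms - te1_ms) / math.log(signal_te1 / signal_te2)


# Synthetic check: signals generated with T2 = 58 msec should return 58.
true_t2 = 58.0
s1 = 1000.0 * math.exp(-64.0 / true_t2)
s2 = 1000.0 * math.exp(-90.0 / true_t2)
estimate = t2_two_point(s1, s2)
```

With only two echoes the estimate is sensitive to noise in either measurement, which is one reason multi-echo CPMG values serve as the reference in the phantom comparison.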
The close relationships of Lesbians and gay men.

Science.gov (United States)

Peplau, Letitia Anne; Fingerhut, Adam W

2007-01-01

This article reviews empirical studies of same-sex couples in the United States, highlighting consistent findings, drawing comparisons to heterosexual couples, and noting gaps in available research. U.S. Census data indicate that there were more than 600,000 same-sex couples living together in 2000. Research about relationship formation, the division of household labor, power, satisfaction, sexuality, conflict, commitment, and relationship stability is presented. Next, we highlight three recent research topics: the legalization of same-sex relationships through civil unions and same-sex marriage, the experiences of same-sex couples raising children, and the impact of societal prejudice and discrimination on same-sex partners. We conclude with comments about the contributions of empirical research to debunking negative stereotypes of same-sex couples, testing the generalizability of theories about close relationships, informing our understanding of gender and close relationships, and providing a scientific basis for public policy.
Armillaria phylogeny based on tef-1α sequences suggests ongoing divergent speciation within the boreal floristic kingdom

Science.gov (United States)

Ned B. Klopfenstein; John W. Hanna; Amy L. Ross-Davis; Jane E. Stewart; Yuko Ota; Rosario Medel-Ortiz; Miguel Armando Lopez-Ramirez; Ruben Damian Elias-Roman; Dionicio Alvarado-Rosales; Mee-Sook Kim

2013-01-01

Armillaria plays diverse ecological roles in forests worldwide, which has inspired interest in understanding phylogenetic relationships within and among species of this genus. Previous rDNA sequence-based phylogenetic analyses of Armillaria have shown general relationships among widely divergent taxa, but rDNA sequences were not reliable for separating closely related...
Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.

Science.gov (United States)

Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R

2017-02-05

Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151 bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to eight other species of Poaceae. Although generally conserved with respect to Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.
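The region sizes reported above can be checked against the total plastome length with simple arithmetic, since a quadripartite plastome consists of the LSC, the SSC and two copies of the inverted repeat. The short Python sketch below uses only figures from the abstract; the function name `quadripartite_length` is hypothetical.

```python
def quadripartite_length(lsc, ssc, inverted_repeat):
    """Total plastome length: LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * inverted_repeat


# Values reported for the goosegrass plastome, in bases.
total = quadripartite_length(lsc=80_667, ssc=12_646, inverted_repeat=20_919)

# Gene counts reported in the abstract: protein-coding, tRNA, rRNA.
unique_genes = 76 + 28 + 4
```

Here `total` comes to 135,151 and `unique_genes` to 108, both in agreement with the figures stated in the abstract.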
An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

Science.gov (United States)

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051
In silico Coding Sequence Analysis of Walnut GAI and PIP2 Genes and Comparison with Different Plant Species

Directory of Open Access Journals (Sweden)

Mahdi Mohseniazar

2017-02-01

done with MEGA from aligned sequences. The motifs of protein sequences were found using the T-COFFEE program at the website (http://www.ebi.ac.uk/Tools/msa/tcoffee/). The Neighbor-Joining (NJ) method was used to construct the phylogenetic tree. Exons and introns in the mRNA sequences were predicted with the http://genes.mit.edu/GENSCAN.html website. The secondary structure of proteins was predicted with PSIPRED online at http://bioinf.cs.ucl.ac.uk/psipred/. Prediction of the 3D model of the proteins was performed using 3D alignment of protein structure by BLASTp with the PDB database as source. Also, targeting prediction of proteins was done online by TargetP at the website (http://www.cbs.dtu.dk/services/TargetP/). Results and discussion: In the phylogenetic investigation among 17 different species, walnut grouped with dicotyledonous and woody plants in both GAI and PIP2 gene and protein sequence clustering. Multiple alignments and examination of the conserved sequences of these genes in plants revealed that, despite differences in cDNA length, there were strong similarities in the conserved regions and in secondary and tertiary structure. Protein analysis in the GAI gene family showed that the domains DELLA, TVHYNP, VHIID, RKVATYFGEALARR, AVNSVFELH, RVER, and SAW were conserved in these proteins. In the secondary structure, β-sheets and α-helices were identified by the PSIPRED software for both GAI and PIP2 proteins. The GAI protein had 9 β-sheets and 15 α-helices in its structure, and the PIP2 protein had 2 β-sheets (at 180-188 and 248-253) and 8 α-helices. In the comparison of 3D structure, walnut PIP2 protein was very similar to chain A of the PIP2 protein of spinach (Spinacia oleracea), and the GAI protein of walnut was similar to the B-subunit of the Arabidopsis GAI protein with 48% similarity. The length of the GAI protein varied among species from 636 aa in Malus baccata var. xiaojinensis to 336 aa in Physcomitrella patens. In walnut, the length of GAI and PIP2 protein was 613 aa and
In silico Coding Sequence Analysis of Walnut GAI and PIP2 Genes and Comparison with Different Plant Species

Directory of Open Access Journals (Sweden)

Mahdi Mohseniazar

2017-09-01

done with MEGA from aligned sequences. The motifs of protein sequences were found using the T-COFFEE program at the website (http://www.ebi.ac.uk/Tools/msa/tcoffee/). The Neighbor-Joining (NJ) method was used to construct the phylogenetic tree. Exons and introns in the mRNA sequences were predicted with the http://genes.mit.edu/GENSCAN.html website. The secondary structure of proteins was predicted with PSIPRED online at http://bioinf.cs.ucl.ac.uk/psipred/. Prediction of the 3D model of the proteins was performed using 3D alignment of protein structure by BLASTp with the PDB database as source. Also, targeting prediction of proteins was done online by TargetP at the website (http://www.cbs.dtu.dk/services/TargetP/). Results and discussion: In the phylogenetic investigation among 17 different species, walnut grouped with dicotyledonous and woody plants in both GAI and PIP2 gene and protein sequence clustering. Multiple alignments and examination of the conserved sequences of these genes in plants revealed that, despite differences in cDNA length, there were strong similarities in the conserved regions and in secondary and tertiary structure. Protein analysis in the GAI gene family showed that the domains DELLA, TVHYNP, VHIID, RKVATYFGEALARR, AVNSVFELH, RVER, and SAW were conserved in these proteins. In the secondary structure, β-sheets and α-helices were identified by the PSIPRED software for both GAI and PIP2 proteins. The GAI protein had 9 β-sheets and 15 α-helices in its structure, and the PIP2 protein had 2 β-sheets (at 180-188 and 248-253) and 8 α-helices. In the comparison of 3D structure, walnut PIP2 protein was very similar to chain A of the PIP2 protein of spinach (Spinacia oleracea), and the GAI protein of walnut was similar to the B-subunit of the Arabidopsis GAI protein with 48% similarity. The length of the GAI protein varied among species from 636 aa in Malus baccata var. xiaojinensis to 336 aa in Physcomitrella patens. In walnut, the length of GAI and PIP2 protein was 613 aa and
A Murine Herpesvirus Closely Related to Ubiquitous Human Herpesviruses Causes T-Cell Depletion.

Science.gov (United States)

Patel, Swapneel J; Zhao, Guoyan; Penna, Vinay R; Park, Eugene; Lauron, Elvin J; Harvey, Ian B; Beatty, Wandy L; Plougastel-Douglas, Beatrice; Poursine-Laurent, Jennifer; Fremont, Daved H; Wang, David; Yokoyama, Wayne M

2017-05-01

The human roseoloviruses human herpesvirus 6A (HHV-6A), HHV-6B, and HHV-7 comprise the Roseolovirus genus of the human Betaherpesvirinae subfamily. Infections with these viruses have been implicated in many diseases; however, it has been challenging to establish infections with roseoloviruses as direct drivers of pathology, because they are nearly ubiquitous and display species-specific tropism. Furthermore, controlled study of infection has been hampered by the lack of experimental models, and until now, a mouse roseolovirus has not been identified. Herein we describe a virus that causes severe thymic necrosis in neonatal mice, characterized by a loss of CD4+ T cells. These phenotypes resemble those caused by the previously described mouse thymic virus (MTV), a putative herpesvirus that has not been molecularly characterized. By next-generation sequencing of infected tissue homogenates, we assembled a contiguous 174-kb genome sequence containing 128 unique predicted open reading frames (ORFs), many of which were most closely related to herpesvirus genes. Moreover, the structure of the virus genome and phylogenetic analysis of multiple genes strongly suggested that this virus is a betaherpesvirus more closely related to the roseoloviruses, HHV-6A, HHV-6B, and HHV-7, than to another murine betaherpesvirus, mouse cytomegalovirus (MCMV). As such, we have named this virus murine roseolovirus (MRV) because these data strongly suggest that MRV is a mouse homolog of HHV-6A, HHV-6B, and HHV-7. IMPORTANCE Herein we describe the complete genome sequence of a novel murine herpesvirus. By sequence and phylogenetic analyses, we show that it is a betaherpesvirus most closely related to the roseoloviruses, human herpesviruses 6A, 6B, and 7. These data combined with physiological similarities with human roseoloviruses collectively suggest that this virus is a murine roseolovirus (MRV), the first definitively described rodent roseolovirus, to our knowledge. Many biological and
SWI or T2*: which MRI sequence to use in the detection of cerebral microbleeds? The Karolinska Imaging Dementia Study.

Science.gov (United States)

Shams, S; Martola, J; Cavallin, L; Granberg, T; Shams, M; Aspelin, P; Wahlund, L O; Kristoffersen-Wiberg, M

2015-06-01

Cerebral microbleeds are thought to have potentially important clinical implications in dementia and stroke. However, the use of both T2* and SWI MR imaging sequences for microbleed detection has complicated the cross-comparison of study results. We aimed to determine the impact of microbleed sequences on microbleed detection and associated clinical parameters. Patients from our memory clinic (n = 246; 53% female; mean age, 62) prospectively underwent 3T MR imaging, with conventional thick-section T2*, thick-section SWI, and conventional thin-section SWI. Microbleeds were assessed separately on thick-section SWI, thin-section SWI, and T2* by 3 raters, with varying neuroradiologic experience. Clinical and radiologic parameters from the dementia investigation were analyzed in association with the number of microbleeds in negative binomial regression analyses. Prevalence and number of microbleeds were higher on thick-/thin-section SWI (20/21%) compared with T2*(17%). There was no difference in microbleed prevalence/number between thick- and thin-section SWI. Interrater agreement was excellent for all raters and sequences. Univariate comparisons of clinical parameters between patients with and without microbleeds yielded no difference across sequences. In the regression analysis, only minor differences in clinical associations with the number of microbleeds were noted across sequences. Due to the increased detection of microbleeds, we recommend SWI as the sequence of choice in microbleed detection. Microbleeds and their association with clinical parameters are robust to the effects of varying MR imaging sequences, suggesting that comparison of results across studies is possible, despite differing microbleed sequences. © 2015 by American Journal of Neuroradiology.
A comparison of Candle Auctions and Hard Close Auctions with Common Values

OpenAIRE

Sascha Füllbrunn

2009-01-01

With this study, we contribute to the literature of auction design by presenting a new auction format: the Candle auction, a popular auction in the Middle Ages. Considering a common value framework, we theoretically and experimentally point out that the Candle auction, where bidding is allowed until a stochastic deadline, yields a better outcome to the seller than the Hard Close auction, the popular eBay online auction format.
Trajectories of Childbearing among HIV Infected Indian Women : A Sequence Analysis Approach

NARCIS (Netherlands)

Darak, Shrinivas; Mills, Melinda; Kulkarni, Vinay; Kulkarni, Sanjeevani; Hutter, Inge; Janssen, Fanny

2015-01-01

Background HIV infection closely relates to and deeply affects the reproductive career of those infected. However, little is known about the reproductive career trajectories, specifically the interaction of the timing of HIV diagnosis with the timing and sequencing of reproductive events among HIV
Trajectories of childbearing among HIV infected Indian women: A sequence analysis approach

NARCIS (Netherlands)

Darak, S.; Mills, M.; Kulkarni, V.; Kulkarni, S.; Hutter, I.; Janssen, F.

2015-01-01

Background HIV infection closely relates to and deeply affects the reproductive career of those infected. However, little is known about the reproductive career trajectories, specifically the interaction of the timing of HIV diagnosis with the timing and sequencing of reproductive events among HIV
Trajectories of childbearing among HIV infected Indian women : A sequence analysis approach

NARCIS (Netherlands)

S. Darak (Shrinivas); M. Mills (Melinda); V. Kulkarni (Vinay); S. Kulkarni (Sanjeevani); I. Hutter (Inge); F. Janssen (Fanny)

2015-01-01

HIV infection closely relates to and deeply affects the reproductive career of those infected. However, little is known about the reproductive career trajectories, specifically the interaction of the timing of HIV diagnosis with the timing and sequencing of reproductive events among HIV
Molecular comparisons of full length metapneumovirus (MPV) genomes, including newly determined French AMPV-C and -D isolates, further supports possible subclassification within the MPV Genus.

Directory of Open Access Journals (Sweden)

Paul A Brown

Four avian metapneumovirus (AMPV) subgroups (A-D) have been reported previously based on genetic and antigenic differences. However, until now full length sequences of the only known isolates of European subgroup C and subgroup D viruses (duck and turkey origin, respectively) have been unavailable. These full length sequences were determined and compared with other full length AMPV and human metapneumovirus (HMPV) sequences reported previously, using phylogenetics, comparisons of nucleic and amino acid sequences and study of codon usage bias. Results confirmed that subgroup C viruses were more closely related to HMPV than they were to the other AMPV subgroups in the study. This was consistent with previous findings using partial genome sequences. Closer relationships between AMPV-A, B and D were also evident throughout the majority of results. Three metapneumovirus "clusters" (HMPV, AMPV-C, and AMPV-A, B and D) were further supported by codon bias and phylogenetics. The data presented here, together with those of previous studies describing antigenic relationships between AMPV-A, B and D and between AMPV-C and HMPV, may call for a subclassification of metapneumoviruses similar to that used for avian paramyxoviruses, grouping AMPV-A, B and D as type I metapneumoviruses and AMPV-C and HMPV as type II.
Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

DEFF Research Database (Denmark)

Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten

1985-01-01

DNA clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......, subcloned into M13 mp8, and sequenced at random by the dideoxy technique, thereby generating a contiguous sequence of 1703 base pairs. This clone contained coding sequence for the C-terminal 262 amino acid residues of the beta-chain, the entire C5a fragment, and the N-terminal 98 residues of the alpha......'-chain. The 3' end of the clone had a polyadenylated tail preceded by a polyadenylation recognition site, a 3'-untranslated region, and base pairs homologous to the human Alu consensus sequence. Comparison of the derived partial human C5 protein sequence with that previously determined for murine C3 and human...
Interference-free acquisition of overlapping sequences in explicit spatial memory.

Science.gov (United States)

Eggert, Thomas; Drever, Johannes; Straube, Andreas

2014-04-01

Some types of human sequential memory, e.g. the acquisition of a new composition by a trained musician, seem to be very efficient in extending the length of a memorized sequence and in flexible reuse of known subsequences in a newly acquired sequential context. This implies that interference between known and newly acquired subsequences can be avoided even when learning a sequence which is a partial mutation of a known sequence. It is known that established motor sequences do not have such flexibility. Using learning of deferred imitation, the current study investigates the flexibility of explicit spatial memory by quantifying the interferences between successively acquired, partially overlapping sequences. After learning a spatial sequence on day 1, this sequence was progressively modified on day 2. On day 3, a retention test was performed with both the initial and the modified sequence. The results show that subjects performed very well on day 1 and day 2. No spatial interference between changed and unchanged targets was observed during the stepwise progressive modification of the reproduced sequence. Surprisingly, subjects performed well on both sequences on day 3. Comparison with a control experiment without intermediate mutation training showed that the initial training on day 1 did not proactively interfere with the retention of the modified sequence on day 3. Vice versa, the mutation training on day 2 did not interfere retroactively with the retention of the original sequence as tested on day 3. The results underline the flexibility in acquiring explicit spatial memory. Copyright © 2014 Elsevier B.V. All rights reserved.
